News

AIEEV Blog

Explore Our Latest Updates

All Posts
Newsroom
Product
Inside AIEEV
Customer Stories
Engineering

How to Turn GPU Resources into an Inference API

The Distributed GPU Cloud Story and Why Ray Is at the Center of It 💡 Core Message "A GPU that never serves a request has no business value. " Air Cloud connects everything from the runtime layer to the platform layer, so hardware actually reaches users as a real service. Introduction When people talk about AI infrastructure today, the conversation usually starts with GPU scarcity. How many H100s did you lock in? Is B200 supply going to loosen up? Does your data center have e

Engineering

May 29

Two Technologies That Reduce AI Model Deployment Costs: Quantization and Prefix Caching

Hi, I'm Jinbeom Kim, a Software Developer on the AIEEV Dev Team. I studied computer science through both undergrad and graduate school, and I've been with AIEEV since the early days of the company — working on how we can operate more distributed GPU resources efficiently within Air Cloud 😊 In this post, I want to walk through two techniques we regularly evaluate when thinking about how to deploy AI models more efficiently. The first is Quantization — a method for reducing me

Engineering

May 7

One Command, Done: Integrating Air API with a ClawHub Plugin

Hi,I’m CY Lee from the DevOps/SRE team. With the launch of Air API, we’ve been building out our internal infrastructure monitoring system. Along the way, we developed an OpenClaw plugin—and in this post, I’d like to walk you through what we built and why it matters. 🙂 Before We Start If you’ve used OpenClaw for a while, you’ve probably experienced something like this at least once. The moment you try to connect an external model provider, you find yourself going through the

Engineering

Apr 16

AIEEV 36 posts
AIRCLOUD 32 posts
AICOMPUTING 12 posts
gpu cloud pricing 9 posts
gpucloud 8 posts
AIcloud 7 posts
aieev 7 posts
Air API 7 posts
Air Container 6 posts
AITrend 5 posts
CloudComputing 5 posts
AI 4 posts
GPUCLOUD 4 posts
distributedaicloud 4 posts
AIRCLOUD+ 4 posts
Air Cloud 3 posts
aircloud 3 posts
tech 3 posts
Pricing 3 posts
AISUMMIT 2 posts
DecentralizedInfrastructure 2 posts
plugandplay 2 posts
sksummit 2 posts
에이아이브 2 posts
microdips 1 post
Jira 1 post
shift 1 post
lguplus 1 post
Government-Funded Projects 1 post
samsungclab 1 post
B2B 1 post
smartcity 1 post
PreA 1 post
경남일보 1 post
AI 통합 바우처 1 post
클라우드가격비교 1 post
BRANDGUIDE 1 post
Promotion 1 post
Revenue Management 1 post
BRANDMOOD 1 post
Updates 1 post
BX 1 post
agent 1 post
2026 클라우드 바우처 1 post
BXGUIDE 1 post
aiinference 1 post
aisummit 1 post
brand 1 post
brandguide 1 post
DXWORKS 1 post
branding 1 post
c-lab 1 post
AXPROJECT 1 post
Dashboard 1 post
claude 1 post
cloud 1 post
customer story 1 post
Discount 1 post
cxguide 1 post
kodit 1 post
Google 1 post
littlepenguin 1 post
Automation 1 post
Invoices 1 post
fix 1 post
Management 1 post
Event 1 post
fix2025 1 post
B2BBRANDING 1 post
gmep 1 post
FinOps 1 post
gpu 1 post
#AIInfrastructure #CloudComputing 1 post
FundingAnnouncement 1 post
iso 1 post