Hi,๐Ÿ‘‹ we have updated the app and fixed multiple bugs. We are lacking funds, request to free user not to use Adblock. Ads are non intrusive. ๐Ÿ˜Š

@SemiAnalysis_: People think AI inference marg...

@SemiAnalysis_
13 views Mar 27, 2026
Advertisement
1
People think AI inference margins are a race to the bottom. Anthropic's gross margins were -94% in 2024. MiniMax was -25%. The narrative made sense (1/5)๐Ÿงต
Media image
2
Then something changed. Zhipu raised prices 30% in February 2026, the first hike in China's AI market. It sold out instantly. ARR went 25x in 10 months. (2/5)
3
The secret is interactivity: tokens per second per user. It's the dial labs slide between margin and user happiness. Customer requirements depend on the workload, and throughput and costs depend on the hardware. At SemiAnalysis, we think Inference Provider Gross Margins should blend to ~60%. The chart below shows how outcomes vary significantly across hardware. (3/5)
Media image
4
We know interactivity matters. Moonshot tried aggressive batching to cut costs. Users left. They added a premium tier. DeepSeek lost share serving their own model the same way. (4/5)
5
AI inference isn't a commodity. It's a managed experience. Labs that understand the interactivity lever operate at 60%+ margins. The rest race to zero. (5/5)
Media image
Actions
Visual Editor Carousel Maker NEW
Update Thread
What You Can Do
  • Download as PDF
  • Save to Notion
  • Export as Markdown
  • Visual Editor
  • LinkedIn & Instagram Carousel Maker
Create Free Account

Includes 7-day Premium trial

Advertisement