China’s AI ecosystem landed a genuine frontier model this week. Alibaba’s Qwen 3.7 Max, the first Chinese model to compete head-to-head with GPT-5.5 and Claude Opus on complex coding and reasoning benchmarks, launched at roughly half the price of comparable Western tiers. DeepSeek made its 75% price cut permanent, resetting the global floor for inference pricing. Moonshot AI raised $2B at a $20B+ valuation and began unwinding its offshore structure for a Hong Kong IPO. And China confirmed that its first comprehensive AI law is on the 2026 legislative track. Here is this week’s China AI Weekly.
Qwen 3.7 Max: China’s First Frontier Model Goes Live
Alibaba formally launched Qwen 3.7 Max at the Alibaba Cloud Summit in Hangzhou on May 20, 2026, with API access live on Alibaba Cloud Model Studio since May 19. Preview variants had already been quietly climbing the LMSys Chatbot Arena leaderboard, ranking 13th globally for text and 16th in vision, the highest positions any Chinese model has achieved to date. (Source: SCMP)
Benchmark Positioning
Qwen 3.7 Max is the first Chinese model that the industry is describing as a genuine “frontier” contender. Early independent benchmarks show it beating Claude Opus 4.6 on Terminal-Bench 2.0 and SWE-Bench, with particular strength in agentic coding and long-horizon autonomous tasks: it can reportedly operate continuously for roughly 35 hours without performance degradation. (Source: Digital Applied)
Pricing That Matters
| Item | Rate or limit |
|---|---|
| Input tokens | $2.50 per million tokens |
| Output tokens | $7.50 per million tokens |
| Cached input | $0.25 per million tokens |
| Context window | Up to 1 million tokens |
These rates are roughly half those of Anthropic’s and OpenAI’s top frontier tiers for comparable coding workloads. For enterprise teams already running cost-benefit analyses against Western models, the gap is significant, particularly at scale.
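As a rough illustration of how the listed rates combine, the sketch below estimates a bill from token counts. The helper name `estimate_cost_usd`, the workload figures, and the assumption that cached input tokens are counted separately and billed at the cached rate are all illustrative; none of them comes from Alibaba’s documentation, and actual billing rules may differ.

```python
# Rates from the pricing table, in USD per million tokens.
INPUT_PER_M = 2.50
OUTPUT_PER_M = 7.50
CACHED_INPUT_PER_M = 0.25


def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      cached_input_tokens: int = 0) -> float:
    """Estimate a bill in USD (hypothetical helper, not a vendor API).

    Assumes cached input tokens are counted separately from
    `input_tokens` and billed at the cached rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
        + cached_input_tokens * CACHED_INPUT_PER_M
    )
    return total / 1_000_000


# Hypothetical monthly workload: 400M fresh input, 100M output,
# and 600M cached input tokens.
monthly = estimate_cost_usd(400_000_000, 100_000_000, 600_000_000)
print(f"Estimated monthly cost: ${monthly:,.2f}")  # $1,900.00
```

In this made-up workload, output tokens account for $750 of the $1,900 total even though they are the smallest share of traffic, which is why the output rate tends to dominate comparisons for generation-heavy coding workloads.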
Ecosystem Depth
Alibaba also unveiled new custom AI chips and infrastructure at the summit, positioning the cloud platform as an “AI factory” for enterprise deployments. Qwen’s user base already surged from 7 million to 58 million DAU during a coupon-driven campaign tied to the earlier Qwen 3.5 launch, demonstrating Alibaba’s willingness to subsidize adoption aggressively. (Source: TIKR)
The strategic commitment is clear: Alibaba has formed a dedicated foundation model support group aimed at AGI, making Qwen a core long-term investment. For context on how Chinese AI ecosystem dynamics affect enterprise strategy, see our China AI ecosystem overview.
DeepSeek Permanently Slashes Prices 75%, Resets Global Inference Floor
DeepSeek made its approximately 75% price cut on V4 Pro permanent, effective May 25, 2026, a decisive move the financial press described as resetting “the floor for frontier inference pricing.” (Source: Bloomberg via YouTube)
Current Pricing
| Model | Input Tokens | Output Tokens |
|---|---|---|
| V4 Flash | $0.14/M | $0.28/M |
| V4 Pro | not listed | $0.87/M (~34× cheaper than GPT-5.5) |
The 1.6T-parameter MoE model (49B active per token) runs on Huawei Ascend hardware and carries an MIT License, making it freely deployable, fine-tunable, and redistributable for enterprise teams. Context window sits at 1,000,000 tokens.
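To put the mixture-of-experts figures in perspective, the sketch below computes the share of parameters active per token and a rough raw weight footprint. The parameter counts come from the text; the bytes-per-parameter values are assumptions for illustration only, since the released weight format is not stated here, and the helper name `weight_storage_tb` is hypothetical.

```python
TOTAL_PARAMS = 1.6e12   # 1.6T total parameters (from the text)
ACTIVE_PARAMS = 49e9    # 49B active per token (from the text)

# Fraction of the model that participates in each forward pass.
active_share = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"Active share per token: {active_share:.1%}")


def weight_storage_tb(params: float, bytes_per_param: float) -> float:
    """Raw weight storage in terabytes (10**12 bytes).

    Ignores KV cache, activations, and any runtime overhead.
    """
    return params * bytes_per_param / 1e12


# Assumed precisions, for illustration only.
for label, width in (("16-bit", 2), ("8-bit", 1)):
    size = weight_storage_tb(TOTAL_PARAMS, width)
    print(f"{label} weights: ~{size:.1f} TB")
```

The point of the comparison is that per-token compute scales with the active parameters, while storage scales with the total, so a freely deployable license does not by itself make self-hosting cheap.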
Benchmark Snapshot
| Benchmark | DeepSeek V4 Pro | GPT-5.5 |
|---|---|---|
| SWE-Bench Verified | 80.6% | 88.7% |
| SWE-Bench Pro | 55.4% | ~58.6% |
| GPQA Diamond | 90.1% | ~83% |
V4 Pro leads GPT-5.5 on GPQA Diamond (graduate-level QA) while trailing on both SWE-Bench variants, and does so at roughly 3% of the output cost. The combination of open-weight access, Ascend compatibility, and sub-dollar-per-million-token output pricing makes this the strongest cost-value proposition among frontier-class models. For a deeper dive on evaluating DeepSeek against Western alternatives, see our DeepSeek V4 CIO decision framework.
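The cost and benchmark comparison can be checked with a few lines of arithmetic. This sketch uses only figures quoted in this section; the approximate GPT-5.5 scores (marked `~` in the table) are treated as point values, and the variable names are illustrative.

```python
# Figures quoted in this section; "~" values are treated as point estimates.
COST_MULTIPLE = 34  # V4 Pro output described as ~34x cheaper than GPT-5.5

scores = {
    # benchmark: (DeepSeek V4 Pro, GPT-5.5), in percent
    "SWE-Bench Verified": (80.6, 88.7),
    "SWE-Bench Pro": (55.4, 58.6),
    "GPQA Diamond": (90.1, 83.0),
}

# "34x cheaper" means V4 Pro costs 1/34 of the GPT-5.5 output price.
cost_share = 1 / COST_MULTIPLE
print(f"V4 Pro output cost as a share of GPT-5.5: {cost_share:.1%}")

# Gap in percentage points; a positive value means V4 Pro leads.
gaps = {name: round(v4 - gpt, 1) for name, (v4, gpt) in scores.items()}
for name, gap in gaps.items():
    print(f"{name}: {gap:+.1f} points")
```

On these numbers the gaps are -8.1 points on SWE-Bench Verified, -3.2 on SWE-Bench Pro, and +7.1 on GPQA Diamond, at about 2.9% of the output price.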
Funding Trajectory
DeepSeek is expected to initiate its first external funding round, led by China’s Big Fund (the national semiconductor guidance fund), with rumored valuations ranging from $10B to $45B. (Source: Cryptopolitan) The company’s rapid ascent was also featured in a CNN Fareed Zakaria GPS segment on May 24, framed as “the most powerful open-source platform” challenging OpenAI and Anthropic.
Moonshot AI Raises $2B at $20B+ Valuation, Eyes Hong Kong IPO
Moonshot AI, creator of the Kimi model family, closed a $2B funding round led by Meituan at a valuation exceeding $20B, making it the most heavily funded Chinese LLM startup of the past six months, at roughly $3.9B total raised. (Source: Entrepreneur Loop)
The round is accompanied by a structural shift: under a proposal circulated to shareholders, Moonshot is unwinding its offshore variable interest entity (VIE) structure to pave the way for an anticipated Hong Kong IPO. (Source: SCMP)
Why the Valuation Holds
- Kimi K2.6 (1T-parameter MoE, 32B active, 262K context, Agent Swarm with 300 domain agents capable of ~4,000-step autonomous operations)
- ARR surged from $100M in March to $200M+ in April, the single strongest growth inflection in the Chinese LLM market
- Consistent top-3 ranking on OpenRouter token usage, often #2
The VIE unwind signals an IPO timeline likely within 12 to 18 months. For enterprise teams evaluating Kimi alongside Qwen and DeepSeek, Moonshot’s trajectory demonstrates that the market for Chinese LLM APIs is maturing fast enough to support public-market valuations, and that capital consolidation around national champions is accelerating.
Open Source & Community
Four Chinese labs released open-weight coding models within a 12-day window: Z.ai’s GLM-5.1, MiniMax M2.7, Moonshot’s Kimi K2.6, and DeepSeek V4 Pro, all landing at roughly the same capability ceiling on agentic engineering while being significantly cheaper to run than Western counterparts. (Source: Air Street Press)
Chinese models now account for >45% of OpenRouter traffic as of April 2026, and Chinese model families represent 41% of Hugging Face downloads over the past year, surpassing the US in monthly downloads. (Source: Global Times)
ScienceOne 100, an “AI scientist” research system from the Chinese Academy of Sciences, opened to researchers globally, the latest signal that China’s open-weight ecosystem extends beyond foundation models into domain-specific scientific tools.
What to Watch
Qwen 3.7 Max independent benchmarks. The preview already hit #13 on LMSys Arena, but definitive third-party comparisons against GPT-5.5, Claude Opus 4.6, and DeepSeek V4 Pro will define its enterprise positioning. Watch for reasoning and coding-specific leaderboard updates.
DeepSeek’s first external round. Rumored at a $10B to $45B valuation with Big Fund participation, this could be the largest single fundraising event in Chinese AI to date. The terms will signal how the government views DeepSeek’s strategic role in the domestic semiconductor ecosystem.
Moonshot’s Hong Kong IPO timeline. The VIE unwind process typically takes 6 to 12 months. Track regulatory filings and any pre-IPO secondary transactions for valuation benchmarks.
China’s comprehensive AI law. The State Council confirmed that consolidation of deep synthesis, generative AI, and algorithmic recommendation rules into a unified framework is on the 2026 legislative track, the first binding signal that China is moving toward structural AI regulation rather than piecemeal guidance. The National Agentic AI Framework released in May targets ~70% agent penetration in intelligent terminals and the public sector in the near term, rising to 90% by 2030.
Huawei Ascend order execution. ByteDance’s $5.6B commitment and combined >500,000 unit orders from ByteDance, Alibaba, and Tencent will test Huawei’s ability to deliver 750,000 Ascend 910B/PR units in 2026. Delivery delays would create gaps for competitors like Moore Threads ($720M at $4.1B valuation) and other domestic GPU designers.
Three stories anchored this week: Alibaba delivered China’s first credible frontier model at a compelling price point, DeepSeek cemented its position as the cost leader in global inference, and Moonshot showed that capital consolidation around national champions is accelerating. The common thread is that Chinese AI has moved beyond catching up: it is now setting pricing floors, releasing open-weight models at unprecedented rates, and building the regulatory and infrastructure scaffolding for mass enterprise adoption.
Evaluating Chinese AI models for your enterprise stack? Big Hat Group delivers enterprise AI consulting with vendor-neutral orchestration, Azure-native deployments, and documented governance. Whether you are benchmarking Qwen 3.7 Max, integrating DeepSeek V4 Pro, or building a multi-model strategy that navigates shifting chip policy and emerging Chinese AI regulation, book a discovery call. Thirty minutes. No pitch deck. Just your use case and our bench.
Sources: scmp.com (Qwen 3.7 Max, Moonshot VIE), digitalapplied.com (Qwen benchmarks), tikr.com (Qwen DAU), bloomberg.com via YouTube (DeepSeek pricing), cryptopolitan.com (DeepSeek funding), entrepreneurloop.com (Moonshot funding), airstreet.com/open-source (four-lab release cluster), globaltimes.cn (HuggingFace downloads), tech-insider.org (Huawei Ascend)