## Thesis
China's foundation models reached GPT-4o parity in 2026, and the open-source gap has closed. Qwen3-235B-A22B (Alibaba, April 2025, Apache 2.0 MoE) matches GPT-4o on MMLU (88.7) and exceeds it on MATH (79.4 vs 76.6). DeepSeek-V3 (MIT, December 2024, updated H1 2025) leads on code at below-GPT-4o cost. The key differentiator for global builders is no longer capability; it is the open license. Apache 2.0 and MIT let you fine-tune, self-deploy, and redistribute derivatives commercially without negotiating enterprise agreements. Models such as ERNIE 4.0 and Kimi k1.5 fill specific niches (enterprise Chinese NLP and long context, respectively) but are less universally accessible.
## Decision in 20 seconds
| Use case | Recommended model | Why |
|---|---|---|
| Commercial fine-tuning (open license) | Qwen3-8B / Qwen3-14B (Apache 2.0) | Broadest tooling support; explicit commercial fine-tuning rights; 0.6B–235B family consistency |
| RAG over multilingual documents | Qwen3-32B or Qwen3-30B-A3B MoE | Best multilingual retrieval-augmented performance in the open-weight class |
| Long-document processing (100K+ tokens) | Kimi k1.5 (API via platform.moonshot.cn) | 128K context native; competitive on long-context RULER benchmark |
| Code generation and STEM reasoning | DeepSeek-V3 (MIT) | HumanEval 84.2; MATH 81.6; MIT license; Together AI / Fireworks hosting available |
| Enterprise Chinese NLP (on-prem required) | ERNIE 4.0 (Baidu Cloud) or GLM-4 (open.bigmodel.cn) | Best Chinese-language domain adaptation; ERNIE has strongest Baidu ecosystem integration |
## China AI foundation models — full comparison (2026)
| Model | Release date | Parameters | License | Benchmark highlights | API access |
|---|---|---|---|---|---|
| Qwen3-235B-A22B (Alibaba) | April 2025 | 235B total / 22B active (MoE) | Apache 2.0 | MMLU 88.7, MATH 79.4, HumanEval 82.6 | dashscope.aliyun.com; AWS Bedrock; HuggingFace weights |
| DeepSeek-V3 (DeepSeek) | December 2024 (updated H1 2025) | 671B total / 37B active (MoE) | MIT | MMLU 87.1, MATH 81.6, HumanEval 84.2 | platform.deepseek.com; Together AI; Fireworks AI; Amazon Bedrock |
| Kimi k1.5 (Moonshot AI) | January 2025 | Undisclosed | Proprietary API | MMLU 85.4, MATH 77.3; RULER long-context 86.1 (128K) | platform.moonshot.cn (international payment accepted) |
| GLM-4 (Zhipu AI) | January 2024 (GLM-4-Plus: August 2024) | ~130B (estimated) | Apache 2.0 (GLM-4-9B); proprietary for full GLM-4 | MMLU 83.6, C-Eval 77.2 (strong Chinese benchmark) | open.bigmodel.cn; HuggingFace (GLM-4-9B weights) |
| ERNIE 4.0 Turbo (Baidu) | October 2024 | Undisclosed | Proprietary (Baidu Cloud only) | CMMLU 87.3 (Chinese-language SOTA); enterprise SLA available | Baidu Wenxin Workshop; Baidu Cloud enterprise (limited international access) |
| MiniMax-Text-01 (MiniMax) | January 2025 | 456B total / 45.9B active (MoE) | MIT (weights on HuggingFace) | MMLU 88.5, MATH 77.8; 1M context window | minimax.io (international tier); HuggingFace weights |
## API access methods — international builder guide
| Access method | Models available | Notes | Best for |
|---|---|---|---|
| Direct API (lab) | Qwen (dashscope.aliyun.com), DeepSeek (platform.deepseek.com), Kimi (platform.moonshot.cn), MiniMax (minimax.io) | OpenAI-compatible /v1/chat/completions on all four; Stripe/international card accepted on DeepSeek, Kimi, MiniMax | Lowest latency; most up-to-date model versions |
| HuggingFace (weights) | Qwen3 (all sizes), DeepSeek-V3/R1, GLM-4-9B, MiniMax-Text-01 | Free download; Apache 2.0 / MIT; no rate limits once downloaded; requires GPU infrastructure | Self-hosted fine-tuning, private deployment, offline inference |
| International cloud hosting | Qwen3 (AWS Bedrock, Cloudflare Workers AI), DeepSeek (Together AI, Fireworks AI, Amazon Bedrock), GLM-4 (Replicate) | US-billing, no China account required; Together AI and Fireworks add <50ms latency overhead; pricing competitive with direct API | Teams without China payment methods or needing US data residency |
| Self-deployment (vLLM / Ollama) | Qwen3-0.6B to Qwen3-32B; DeepSeek-R1-Distill-Qwen-7B; GLM-4-9B | Recent vLLM releases support Qwen3 natively; the Ollama library has quantized builds for M-series Macs; Qwen3-30B-A3B MoE needs roughly 60 GB at bf16 (an A100 80GB), so a 40 GB card requires 4-bit quantization | Air-gapped environments, on-prem enterprise, zero API cost at scale |
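All four direct APIs in the table above, and a self-hosted vLLM server, expose the same OpenAI-compatible `/v1/chat/completions` shape, so switching providers comes down to a base URL and a model name. A minimal Python sketch: the base URLs and model names below reflect provider docs at the time of writing and should be verified before use, and the local vLLM port is illustrative.

```python
# OpenAI-compatible endpoints from the table above. The self-hosted entry
# assumes a local `vllm serve` process; port and model name are illustrative.
PROVIDERS = {
    "deepseek":   {"base_url": "https://api.deepseek.com/v1",                    "model": "deepseek-chat"},
    "qwen":       {"base_url": "https://dashscope.aliyuncs.com/compatible-mode/v1", "model": "qwen-plus"},
    "kimi":       {"base_url": "https://api.moonshot.cn/v1",                     "model": "moonshot-v1-128k"},
    "vllm_local": {"base_url": "http://localhost:8000/v1",                       "model": "Qwen/Qwen3-8B"},
}

def chat_payload(provider: str, prompt: str, temperature: float = 0.7) -> dict:
    """Build the JSON body for a POST to {base_url}/chat/completions.

    The same payload shape works unchanged across every provider above;
    only the model name differs.
    """
    return {
        "model": PROVIDERS[provider]["model"],
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }
```

With the official `openai` SDK the same pattern applies: `OpenAI(base_url=..., api_key=...).chat.completions.create(**chat_payload("deepseek", "hi"))`. That symmetry is what makes migrating between direct API, hosted inference, and self-deployment cheap.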
## FAQ
- What are the best China AI foundation models in 2026?
- Qwen3-235B-A22B (Apache 2.0, MMLU 88.7) for open-weight SOTA; DeepSeek-V3 (MIT, HumanEval 84.2) for code and reasoning; Kimi k1.5 for 128K long-context; GLM-4 for bilingual tasks; ERNIE 4.0 for enterprise Chinese NLP; MiniMax-Text-01 for 1M-context and multimodal.
- How does Qwen compare to DeepSeek for commercial use?
- Both are commercially usable under open licenses. Qwen3 (Apache 2.0) leads on multilingual and instruction following; DeepSeek-V3 (MIT) leads on code (HumanEval 84.2) and MATH (81.6). For fine-tuning pipelines: Qwen3 has broader tooling coverage. For code-heavy workloads: DeepSeek-V3.
- Which Chinese foundation models are available outside China?
- Qwen3 and DeepSeek-V3/R1 via HuggingFace globally. Kimi API (platform.moonshot.cn) and MiniMax API (minimax.io) accept international payment. ERNIE 4.0 is primarily China-gated via Baidu Cloud. GLM-4-9B open weights are available; full GLM-4 API is at open.bigmodel.cn with international tier.
- What is the best open-source Chinese LLM for fine-tuning?
- Qwen3-8B and Qwen3-14B — Apache 2.0 is explicit about commercial fine-tuning rights, tooling (LLaMA-Factory, Axolotl, Unsloth, vLLM) is comprehensive, and the 0.6B–235B family shares a consistent architecture and tokenizer. DeepSeek-R1-Distill-Qwen-7B is an alternative for reasoning-heavy tasks.
- How do Chinese foundation models compare to GPT-4o on benchmarks?
- Qwen3-235B-A22B ties GPT-4o on MMLU (88.7) and beats it on MATH (79.4 vs 76.6). DeepSeek-V3 leads GPT-4o on MATH (81.6 vs 76.6). GPT-4o still leads on code (HumanEval 90.2 vs DeepSeek 84.2). Among sub-30B open-weight models, Qwen3-30B-A3B MoE is the best GPT-4o approximation for private deployment.
- Where can I access Kimi, Qwen, and DeepSeek APIs in English?
- Kimi: platform.moonshot.cn (English docs, international card). Qwen: dashscope.aliyun.com or AWS Bedrock (no China account needed). DeepSeek: platform.deepseek.com or Together AI / Fireworks AI (US billing, OpenAI-compatible endpoints on all three).
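The fine-tuning answer above can be sketched as an Axolotl-style QLoRA config. This is a hedged sketch, not a tested recipe: the dataset path, hyperparameters, and output directory are placeholders, and key names should be checked against the Axolotl documentation for your installed version.

```yaml
# Minimal QLoRA fine-tune of a Qwen3 dense model with Axolotl.
# Placeholder values throughout; adjust to your data and hardware.
base_model: Qwen/Qwen3-8B
load_in_4bit: true            # QLoRA: fits a single 24 GB GPU
adapter: qlora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_modules: [q_proj, k_proj, v_proj, o_proj]
datasets:
  - path: ./data/train.jsonl  # placeholder: your instruction-tuning data
    type: alpaca
sequence_len: 4096
micro_batch_size: 2
gradient_accumulation_steps: 8
num_epochs: 3
learning_rate: 2.0e-4
output_dir: ./outputs/qwen3-8b-lora
```

Because the Qwen3 family shares one architecture and tokenizer, the same config scales from the 0.6B to the 32B dense checkpoints by changing `base_model` and the batch settings.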
## Companion pages in this cluster
| If your question is about… | Go to | What's there |
|---|---|---|
| Which Chinese models are open-source and commercially usable | China AI Open Source Models | Full license comparison table — Apache 2.0, MIT, custom; download links and commercial use restrictions |
| Recent model releases and capability milestones | Model Release Tracker | Qwen3, DeepSeek, Kimi release timeline with benchmark data and source verification |
| Weekly digest of what changed in China AI | China AI Updates | Weekly signal digest — model releases, funding, policy, curated for builders |
| Which China AI companies to watch | Best China AI Companies | 15-company shortlist: foundation model labs, workflow infra, physical AI — with monitoring frequencies |
| China AI startup funding rounds and IPOs | China AI Startup Funding Tracker | Moonshot $1B Series B, MiniMax $600M, WeRide NYSE IPO — timeline with sources |
Quotable summary: In 2026, the correct question is no longer "can Chinese foundation models match GPT-4o?" — they can, on most benchmarks. The question is which open license fits your deployment, which API fits your payment infrastructure, and whether you need on-prem weights or managed inference. Qwen3 and DeepSeek-V3 answer both license and capability simultaneously; Kimi fills the long-context gap; ERNIE and GLM-4 serve enterprise Chinese NLP. Pick by use case, not by country of origin.