Silicon Valley's New Secret: Chinese Base Models
From fine-tunes to founder stacks, the center of gravity is moving east.
When entrepreneurs walk into the offices of Andreessen Horowitz (a16z), one of Silicon Valley’s premier venture capital firms, the odds are high that their startups are running on AI models made in China. “I’d say there’s an 80% chance they’re using a Chinese open-source model,” says Martin Casado, a partner at a16z. This quiet migration from expensive closed-source models to cheaper open-source alternatives is reshaping the AI landscape.
The evidence arrived just this week. On October 29th, Cognition launched SWE-1.5, the coding assistant powering Windsurf, and Cursor unveiled its new Composer agent. Within hours, developers noticed something striking: SWE-1.5 appears to be a customized version of Zhipu’s GLM-4.6 model running on Cerebras infrastructure, while Cursor’s Composer occasionally reveals Chinese reasoning traces in its outputs - telltale signs of their base models’ origins. Even prominent investors are making the switch: Chamath Palihapitiya announced on the All-In Podcast that he’s migrating significant workloads from OpenAI and Anthropic to Kimi K2, despite being a top-tier Amazon Bedrock customer. His verdict? “It’s way more performant and a ton cheaper.”
This isn’t just about cost savings. Percy Liang, co-founder of Together AI, notes that open-weight models enable “different forms of adoption than proprietary technology” - they can be more easily adapted to specific use cases and run on-premises rather than relying on cloud services. While American labs bet big on pushing the frontiers of intelligence with closed models, their Chinese rivals are focused on driving widespread AI adoption through openness. As Ali Farhadi of the Allen Institute for AI admits: “As hard as it is for us all to swallow, I think we’re behind [on open weights] now.”
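To make “run on-premises” concrete, here is a minimal sketch of local inference with the Hugging Face transformers library. The model ID (Qwen/Qwen2.5-7B-Instruct, standing in for any open-weight checkpoint) and the prompt are illustrative assumptions, not from the talk:

```python
# A minimal on-prem inference sketch, assuming the `transformers` library
# (plus `accelerate` for device_map) and an illustrative Qwen checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-7B-Instruct"  # illustrative; any open-weight model ID works the same way

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize the open-weights trend in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

Once the weights are cached locally, nothing leaves the machine - which is the form of adoption Liang is pointing at.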
It all started when DeepSeek burst onto the scene with V3 in December 2024 and the R1 reasoning model in January 2025, touting frontier-level performance trained in roughly two months for under $6M on H800s. The app rocketed to #1 on the U.S. App Store, triggering Wall Street jitters and a flurry of “AI price war” headlines as DeepSeek slashed off-peak API rates and rivals followed. In the weeks after, competitors rushed out reasoning upgrades, Chinese labs accelerated releases, and the narrative flipped from compute-as-moat to “efficient scaling” - cementing DeepSeek as the spark for a global reset on costs and pace.
The pressure became undeniable when OpenAI, in its recent partnership announcement with Microsoft, quietly acknowledged the shift with a line tucked away as the last bullet point: “OpenAI is now able to release open weight models that meet requisite capability criteria.” Even the company that pioneered and then abandoned the open approach is being forced back to the table.
The Rise of Chinese Models
Looking at the top-performing open-weights models by intelligence ratings, Chinese models like MiniMax-M2 and Qwen3-235B now match or exceed their American counterparts. Of the 10 highest-scoring models, China claims 6 spots while the US holds 4. What’s remarkable isn’t just the quantity - it’s that Chinese models are competing at the very top of the intelligence scale.
The aggregate intelligence ratings chart reveals the inflection point. Starting from roughly equal footing in April 2024, China’s red line climbs relentlessly while the US’s blue line begins to plateau. The crossover happened around April 2025 - and the gap has only widened since. Europe’s trajectory flatlined, a stark reminder that AI leadership requires more than regulation.
Perhaps the most significant inflection point: in August 2025, cumulative downloads of Chinese models surpassed those from the US for the first time. And the crossover wasn’t marginal - China’s trajectory is steeper, suggesting the gap will only widen. This chart captures the moment when developer preference fundamentally shifted.
The growth trajectories tell an even more dramatic story. Meta’s Llama maintained a steady lead through 2024, but Qwen’s explosive acceleration in 2025 changed everything - the red line shoots nearly vertical, reaching 400M downloads while Llama approaches 350M. Mistral and DeepSeek trail at 100M and 80M, respectively. This isn’t gradual adoption; it’s a developer exodus to Chinese models.
The downstream impact is even more striking. When developers build custom models through fine-tuning, they increasingly start with Chinese base models. By late 2025, Chinese models account for over 50% of all fine-tuned derivatives - a majority that keeps growing. These aren’t just being downloaded; they’re becoming the foundation of the next generation of AI applications.
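For a sense of how cheaply such derivatives are produced, here is a minimal LoRA fine-tuning sketch using the peft library - a hedged illustration, not anyone’s actual pipeline. The Qwen/Qwen2.5-7B base, the adapter rank, and the target modules are assumptions chosen for illustration; the dataset and training loop are elided:

```python
# A minimal LoRA sketch, assuming the `transformers` and `peft` libraries;
# Qwen/Qwen2.5-7B stands in for any open Chinese base model.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B", device_map="auto")

# Attach small trainable adapters to the attention projections;
# the base weights stay frozen, so producing a derivative costs a
# tiny fraction of pretraining.
config = LoraConfig(
    r=16,                                 # adapter rank (assumed)
    lora_alpha=32,                        # scaling factor (assumed)
    target_modules=["q_proj", "v_proj"],  # Qwen attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # typically well under 1% of the base model
# ...train on your data, then model.save_pretrained("my-derivative")
```

This is why base-model choice compounds: every derivative inherits the base, so each fine-tune deepens the ecosystem’s dependence on it.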
Perhaps the most striking trend: the gap between frontier closed models and open-weight models is rapidly closing. In July 2023, frontier models like GPT-4 dominated with scores in the 40s while open models barely reached 20%. By o1-mini’s release in September 2024, the frontier had reached 65% - but open models were already catching up. By mid-2025, top open models like Qwen3-32B and EXAONE-4.0 are tracking closely behind Grok 4, with the performance delta shrinking to single digits. The chart suggests consumer GPUs (RTX 5090, RTX 6000) can now run models approaching frontier intelligence.
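As a rough illustration of why consumer cards come into play: 4-bit quantization shrinks a model’s weights to about half a byte per parameter. Here is a hedged sketch using transformers with bitsandbytes - Qwen/Qwen3-32B and the config values are illustrative assumptions, and actual fit depends on VRAM, context length, and KV-cache size:

```python
# A hedged sketch of 4-bit loading, assuming the `transformers` and
# `bitsandbytes` libraries; the model ID is illustrative.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # ~0.5 bytes per weight, plus overhead
    bnb_4bit_compute_dtype=torch.bfloat16,  # dequantize to bf16 for compute
)

# At 4-bit, a 32B-parameter model needs roughly 16-20 GB for weights,
# which fits on a 32 GB RTX 5090 with room left for the KV cache.
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-32B",
    quantization_config=bnb_config,
    device_map="auto",
)
```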
The efficiency advantage becomes clear when plotting intelligence against active parameters. The “most attractive quadrant” - high intelligence, lower parameter count - is highlighted in green. OpenAI’s gpt-oss models (20B and 120B) sit squarely in this zone, but Chinese models are pushing into it from the right. MiniMax-M2 achieves top-tier intelligence around the 10B active-parameter mark, while models like DeepSeek V3.2, Qwen3-235B, and GLM-4.6 cluster in the 40-60B range with competitive scores. The trend is clear: newer releases are climbing toward that upper-left corner, delivering more intelligence per parameter.
Behind these numbers is an entire ecosystem of organizations competing at every tier. The chart categorizes players from frontier models down to honorable mentions. Since this visualization was created for Nathan Lambert’s October talk (credited below), MiniMax has released M2, which now claims the top spot in Artificial Analysis’ intelligence rankings. The pace of releases is so rapid that static snapshots become obsolete within weeks.
For Western labs, the message is stark: the open-weights frontier has moved east, and the strategic moats they spent billions building are eroding faster than anticipated. The question is no longer whether Chinese models will catch up, but whether American companies can adapt to a world where their most formidable competitors give away their best work for free.
This post is based heavily on a talk and slides presented by Nathan Lambert, The State of Open Models, further augmented with more recent developments, notable quotes, and graphs from Artificial Analysis.