New Open Source AI Model from China Boasts Twice the Capacity of ChatGPT

The Yi series model takes a giant leap over its American competitors, at least by some metrics.

Nov 15, 2023

3 min read

Image: Ivan Marc/Shutterstock

An artificial intelligence (AI) model developed in China is making waves on a number of fronts, including its open-source nature and its ability to handle up to 200,000 tokens of context, exceeding other popular models like Anthropic's Claude (100,000 tokens) and OpenAI's GPT-4 Turbo (128,000 tokens).

Dubbed the Yi series, the generative model was created by Beijing Lingyi Wanwu Information Technology Company in its AI lab, 01.AI. The large language model (LLM) comes in two versions: the lightweight Yi-6B-200K and the more robust Yi-34B-200K, both able to retain immense conversational context and to understand English and Mandarin.

Just hours after its release, the Yi model rocketed up the charts to become the second most popular open-source model on Hugging Face, a key repository for AI models.

Hugging Face AI models ranking. Image: Hugging Face

Even though the Yi models handle huge context prompts, they are also very efficient and accurate, beating other LLMs in several synthetic benchmarks.

"Yi-34B outperforms much larger models like LLaMA2-70B and Falcon-180B; also Yi-34B's size can support applications cost-effectively, thereby enabling developers to build fantastic projects," explains 01.AI on its website. According to a scoreboard shared by the developers, the most powerful Yi model showed strong performance in reading comprehension, common-sense reasoning, and common AI tests like Gaokao and C-Eval.

LLMs like the Yi series operate by analyzing and generating language-based outputs. They work by processing “tokens,” or units of text, which can be as small as a word or a part of a word.
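To make the idea of tokens concrete, here is a minimal sketch of a toy tokenizer. It is not the tokenizer Yi or any other real model uses; the function name `toy_tokenize` and the fixed four-character piece length are assumptions chosen purely for illustration. It only shows how a single word can end up as several tokens.

```python
import re


def toy_tokenize(text: str, max_piece_len: int = 4) -> list[str]:
    """Illustrative only: split text into words and punctuation marks,
    then chop each word into pieces of at most max_piece_len characters.
    Real LLM tokenizers learn their sub-word pieces from data instead."""
    tokens = []
    # \w+ matches a run of word characters; [^\w\s] matches one punctuation mark.
    for chunk in re.findall(r"\w+|[^\w\s]", text):
        for start in range(0, len(chunk), max_piece_len):
            tokens.append(chunk[start:start + max_piece_len])
    return tokens


if __name__ == "__main__":
    pieces = toy_tokenize("Tokenization matters!")
    print(pieces)       # ['Toke', 'niza', 'tion', 'matt', 'ers', '!']
    print(len(pieces))  # 6
```

In this toy example, two words and one punctuation mark become six tokens, which is why token counts usually run higher than word counts.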

Saying a model has “200K tokens of context” means it can take in and respond to significantly longer prompts, which previously would have overwhelmed even the most advanced LLMs. The Yi series can handle extensive prompts that include more complex and detailed information without exceeding its limit.

A recent third-party analysis, however, points out a limitation in this area. When a prompt occupies more than 65% of the Yi model's capacity, the model can struggle to retrieve accurate information. Still, if the prompt is kept well below this threshold, the Yi series performs admirably, even in scenarios that cause degradation in models like Claude and ChatGPT.
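The arithmetic behind that threshold is easy to sketch. The snippet below is a rough illustration, not a tool from 01.AI or from the analysis: the context sizes and the 65% figure come from this article, while the function names and the idea of a hard cutoff are assumptions made for the example.

```python
# Context window sizes (in tokens) as quoted in this article.
CONTEXT_WINDOWS = {
    "Yi-34B-200K": 200_000,
    "GPT-4 Turbo": 128_000,
    "Claude": 100_000,
}

# Fill level above which Yi reportedly struggles with retrieval.
# Treating it as a hard cutoff is a simplification for illustration.
RECALL_THRESHOLD_PERCENT = 65


def max_safe_prompt_tokens(context_window: int,
                           threshold_percent: int = RECALL_THRESHOLD_PERCENT) -> int:
    """Largest prompt size (in tokens) that stays at or under the threshold."""
    return context_window * threshold_percent // 100


def within_recall_budget(prompt_tokens: int, context_window: int,
                         threshold_percent: int = RECALL_THRESHOLD_PERCENT) -> bool:
    """True if the prompt fills no more than threshold_percent of the window."""
    return prompt_tokens * 100 <= context_window * threshold_percent


if __name__ == "__main__":
    window = CONTEXT_WINDOWS["Yi-34B-200K"]
    print(max_safe_prompt_tokens(window))         # 130000
    print(within_recall_budget(120_000, window))  # True
    print(within_recall_budget(150_000, window))  # False
```

Under these assumptions, a 200,000-token window leaves roughly 130,000 tokens of prompt before the reported degradation begins, which is about the same as GPT-4 Turbo's full 128,000-token window.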

Pressure Testing GPT-4-128K With Long Context Recall

128K tokens of context is awesome - but what's performance like?

I wanted to find out so I did a “needle in a haystack” analysis

Some expected (and unexpected) results

Here's what I found:

Findings:
* GPT-4’s recall… pic.twitter.com/nHMokmfhW5

— Greg Kamradt (@GregKamradt) November 8, 2023

A key differentiator for Yi is that it is fully open source, allowing users to run it locally on their own systems. This gives them greater control, lets them modify the model architecture, and avoids reliance on external servers.

"We predict that AI 2.0 will create a platform opportunity ten times larger than the mobile internet, rewriting all software and user interfaces," 01.AI states. "This trend will give rise to the next wave of AI-first applications and AI-empowered business models, fostering AI 2.0 innovations over time."

By open-sourcing such a capable model, 01.AI empowers developers worldwide to build the next generation of AI. With immense context handling in a customizable package, we can expect a torrent of innovative applications utilizing Yi.

The potential is sky-high for open-source models like Yi-6B-200K and Yi-34B-200K. As AI permeates our lives, locally run systems promise greater transparency, security, and customizability compared to closed alternatives dependent on the cloud.

While Claude and GPT-4 Turbo grab headlines, this new open-source alternative may soon build AI's next stage right on users' devices. Just when it seemed like there were no remaining ways to upgrade our hardware, it might be time to shop for a more capable device before you find your local AI outclassed by a more "context-aware" competitor.
