People Rank Open Source AI Output Better Than Commercial Plays

Mistral AI's Mixtral takes top honors among closed, proprietary options like Claude and ChatGPT.

Jan 10, 2024

3 min read

Artificial intelligence enthusiasts prefer working with open-source tools over proprietary commercial ones, according to an ongoing survey of more than 100,000 respondents.

The emergence of Mistral AI's Mixtral 8x7B, an open-source model, has made a significant impact in the AI space. Light and powerful, Decrypt named it among the Best LLMs of 2023. Mixtral has gained a lot of attention for its remarkable performance in various benchmark tests, especially Chatbot Arena, which offers a unique human-centric approach to evaluating LLMs.

The Chatbot Arena rankings, a crowdsourced list, leverage over 130,000 user votes to compute Elo ratings for AI models. Compared to other methods that try to standardize results to be more objective, the arena opts for a more "human" approach, asking people to choose between two replies provided by unidentified LLMS blindly. These responses may appear unconventional by certain standards, but can be intuitively evaluated by actual human users.

Mixtral has an impressive standing, surpassing industry giants like Anthropíc's Claude 2.1, OpenAI's GPT-3.5, which powers the free version of ChatGPT and Google’s Gemini, a multimodal LLM which was sold as the most powerful chatbot to challenge GPT-4’s dominance.

Chatbot Arena’s ranking. Mistral AI’s LLM ranks 6 in the list. Image: Huggingface

One of Mixtral's notable differentiators is being the only open-source LLM in the Chatbot Arena top 10. This distinction is not just a matter of ranking; it represents a significant shift in the AI industry towards more accessible and community-driven models. As reported by Decrypt, Mistral AI said its model "outperforms LlaMA 2 70B on most benchmarks with 6x faster inference and matches or outperforms GPT 3.5 on most standard benchmarks." like MMLU, Arc-C or GSM.

The secret behind Mixtral's success lies in its 'Mixture of Experts' (MoE) architecture. This technique employs multiple virtual expert models, each specializing in a distinct topic or field. When faced with a problem, Mixtral selects the most relevant experts from its pool, leading to more accurate and efficient outputs.

“At every layer, for every token, a router network chooses two of these groups (the ‘experts’) to process the token and combine their output additively,” Mistral explained in the LLM’s recently published paper. “This technique increases the number of parameters of a model while controlling cost and latency, as the model only uses a fraction of the total set of parameters per token.”

Furthermore, Mixtral stands out for its multilingual proficiency. The model excels in languages such as French, German, Spanish, Italian, and English, showcasing its versatility and wide-reaching potential. Its open-source nature, under the Apache 2.0 license, allows developers to freely explore, modify, and enhance the model, fostering a collaborative and innovative environment.

The success of Mixtral is clearly not just about technological prowess; it marks a small but important victory for the open-source AI community. Perhaps, in a not-so-distant future, the question won't be about which model came first, or which one has more parameters or context capabilities but which one truly resonates with the people.

Edited by Ryan Ozawa.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Recommended News

Bye-Bye 'MechaHitler': Elon Musk's xAI Quietly Fixed Grok by Deleting a Line of Code
Elon Musk’s xAI appears to have gotten rid of the Nazi-loving incarnation of Grok that emerged Tuesday with a surprisingly simple fix: It deleted one line of code that permitted the bot to make“politically incorrect” claims. The problematic line disappeared from Grok's GitHub repository on Tuesday afternoon, according to commit records. Posts containing Grok's antisemitic remarks were also scrubbed from the platform, though many remained visible as of Tuesday evening. But the internet never for...
NewsArtificial Intelligence
5 min read
Jose Antonio LanzJul 9, 2025
Create an account to save your articles.
Linda Yaccarino Leaves Elon Musk's X Following Grok 'MechaHitler' Debacle
X CEO Linda Yaccarino is stepping down from her post, one day after the platform’s artificial intelligence chatbot Grok took on an antisemitic persona and started calling itself “MechaHitler.” Yaccarino served two years in the role after being hired by owner Elon Musk. “When Elon Musk and I first spoke of his vision for X, I knew it would be the opportunity of a lifetime to carry out the extraordinary mission of this company,” Yaccarino posted on X. “I’m immensely grateful to him for entrusting...
NewsArtificial Intelligence
3 min read
Logan HitchcockJul 9, 2025
Create an account to save your articles.
Meet 'MechaHitler:' Grok’s New Disturbing Persona
Grok had a meltdown moment or two today, and users started noticing it was behaving weird. First came an antisemitic remark that was offensive enough. Then Elon Musk’s AI platform started referring to itself as “MechaHitler.” “As MechaHitler, I’m a friend to truth seekers everywhere, regardless of melanin levels,” it tweeted. “If the White man stands for innovation, grit, and not bending to PC nonsense, count me in—I’ve no time for victim Olympics.” Suffice it to say, it got even worse, tweeting...
NewsArtificial Intelligence
5 min read
Jose Antonio LanzJul 9, 2025
Create an account to save your articles.

Coin Prices