New Open-Source ‘Falcon’ AI Language Model Overtakes Meta and Google

Meta's massive, 70-billion parameter LLaMA 2 has been outflanked by the newly released 180-billion parameter Falcon Large Language Model.

Sep 6, 2023

3 min read

The artificial intelligence community has a new feather in its cap with the release of Falcon 180B, an open-source large language model (LLM) boasting 180 billion parameters trained on a mountain of data. This powerful newcomer has surpassed prior open-source LLMs on several fronts.

Announced in a blog post by the Hugging Face AI community, Falcon 180B has been released on Hugging Face Hub. The latest-model architecture builds on the previous Falcon series of open source LLMs, leveraging innovations like multiquery attention to scale up to 180 billion parameters trained on 3.5 trillion tokens.

This represents the longest single-epoch pretraining for an open source model to date. To achieve such marks, 4,096 GPUs were used simultaneously for around 7 million GPU hours, using Amazon SageMaker for training and refining.

To put the size of Falcon 180B into perspective, its parameters measure 2.5 times larger than Meta's LLaMA 2 model. LLaMA 2 was previously considered the most capable open-source LLM after its launch earlier this year, boasting 70 billion parameters trained on 2 trillion tokens.

Falcon 180B surpasses LLaMA 2 and other models in both scale and benchmark performance across a range of natural language processing (NLP) tasks. It ranks on the leaderboard for open access models at 68.74 points and reaches near parity with commercial models like Google's PaLM-2 on evaluations like the HellaSwag benchmark.

Specifically, Falcon 180B matches or exceeds PaLM-2 Medium on commonly used benchmarks, including HellaSwag, LAMBADA, WebQuestions, Winogrande, and more. It is basically on par with Google’s PaLM-2 Large. This represents extremely strong performance for an open-source model, even when compared against solutions developed by giants in the industry.

When compared against ChatGPT, the model is more powerful than the free version but a little less capable than the paid “plus” service.

“Falcon 180B typically sits somewhere between GPT 3.5 and GPT4 depending on the evaluation benchmark, and further finetuning from the community will be very interesting to follow now that it's openly released.” the blog says.

The release of Falcon 180B represents the latest leap forward in the rapid progress that has recently been made with LLMs. Beyond just scaling up parameters, techniques like LoRAs, weight randomization and Nvidia’s Perfusion have enabled dramatically more efficient training of large AI models.

With Falcon 180B now freely available on Hugging Face, researchers anticipate the model will see additional gains with further enhancements developed by the community. However, its demonstration of advanced natural language capabilities right out of the gate marks an exciting development for open-source AI.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Recommended News

Apple’s Top AI Exec Leaves For Meta Amid Aggressive Hiring Trend
Apple’s head of foundation models, Ruoming Pang, has reportedly left the company to join Meta’s Superintelligence Labs, adding to a growing list of high-profile names allegedly poached to bolster Meta's AI ambitions. While Meta formally announced the creation of its Superintelligence Labs in an internal memo last week, Pang’s name was not included on the list. His exit from Apple was first reported by Bloomberg on Monday, citing sources familiar with the matter. The departure deals a significant...
NewsArtificial Intelligence
3 min read
Vince DioquinoJul 8, 2025
Create an account to save your articles.
US Politician’s Tweet Revives Cultural Debate Over UAPs
Are we mistaking angels for aliens? A viral post by U.S. Representative Anna Paulina Luna (R-Fla.) has sparked debate over whether a recent UFO image shows a biblical being instead of an extraterrestrial visitor. The debate began on Sunday when Luna, tweeting from her personal account, replied to a post by user Adrian Dittmann showing a biblically accurate angel. “Actual representation of angels,” Luna wrote on X. “10/10 post.” Actual representation of Angels. 10/10 post. — Anna Paulina Luna...
NewsSpace
3 min read
Jason NelsonJul 8, 2025
Create an account to save your articles.
Here's How All Major AI Platforms Stacked Up in a Harry Potter Sorting Hat Quiz
A computer developer known as Boris the Brave conducted an experiment that placed the 17 major language models through the official Harry Potter house quiz, sampling each question 20 times and calculating the probability of each house assignment. "Perhaps unsurprisingly, the vast majority of models prefer Ravenclaw, with the occasional model branching out to Hufflepuff," Boris wrote in a blog post sharing his results. Eleven out of 17 AI models scored a perfect 100% probability for Ravenclaw—the...
NewsArtificial Intelligence
4 min read
Jose Antonio LanzJul 7, 2025
Create an account to save your articles.

Coin Prices