The artificial intelligence community has a new feather in its cap with the release of Falcon 180B, an open-source large language model (LLM) boasting 180 billion parameters trained on a mountain of data. This powerful newcomer has surpassed prior open-source LLMs on several fronts.
Announced in a blog post by the Hugging Face AI community, Falcon 180B has been released on Hugging Face Hub. The model's architecture builds on the previous Falcon series of open-source LLMs, leveraging innovations like multi-query attention to scale up to 180 billion parameters trained on 3.5 trillion tokens.
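For readers curious about what multi-query attention means in practice, here is a minimal, illustrative PyTorch sketch of the general idea: many query heads share a single key/value head, which shrinks the key/value cache and makes serving very large models cheaper. This is not Falcon's actual implementation, and the class and parameter names are illustrative only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiQueryAttention(nn.Module):
    """Minimal sketch of multi-query attention: many query heads, one shared K/V head."""

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.n_heads = n_heads
        self.head_dim = d_model // n_heads
        # Standard multi-head attention projects keys and values per head;
        # multi-query attention projects a single key/value pair shared by all heads.
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, self.head_dim)
        self.v_proj = nn.Linear(d_model, self.head_dim)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        # Queries: one set per head -> (b, n_heads, t, head_dim)
        q = self.q_proj(x).view(b, t, self.n_heads, self.head_dim).transpose(1, 2)
        # Keys/values: a single head broadcast across all query heads -> (b, 1, t, head_dim)
        k = self.k_proj(x).unsqueeze(1)
        v = self.v_proj(x).unsqueeze(1)
        scores = q @ k.transpose(-2, -1) / self.head_dim ** 0.5
        attn = F.softmax(scores, dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(b, t, -1)
        return self.out_proj(out)
```

The sketch omits details a real decoder would need (causal masking, rotary embeddings, a key/value cache), but it shows why the technique scales: only one key/value projection is stored per token regardless of the number of query heads.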
This represents the longest single-epoch pretraining for an open-source model to date. To reach that mark, 4,096 GPUs were run simultaneously for roughly 7 million GPU hours, with Amazon SageMaker used for training and refinement.
To put the size of Falcon 180B into perspective, it has 2.5 times as many parameters as Meta's LLaMA 2. LLaMA 2 was previously considered the most capable open-source LLM after its launch earlier this year, with 70 billion parameters trained on 2 trillion tokens.
Falcon 180B surpasses LLaMA 2 and other models in both scale and benchmark performance across a range of natural language processing (NLP) tasks. It scores 68.74 points on the Hugging Face leaderboard for open-access models and reaches near parity with commercial models like Google's PaLM-2 on evaluations such as the HellaSwag benchmark.
Image: Hugging Face
Specifically, Falcon 180B matches or exceeds PaLM-2 Medium on commonly used benchmarks, including HellaSwag, LAMBADA, WebQuestions, and Winogrande. It is roughly on par with Google's PaLM-2 Large. This represents extremely strong performance for an open-source model, even when compared against solutions developed by giants in the industry.
When compared against ChatGPT, the model is more powerful than the free version but a little less capable than the paid “plus” service.
“Falcon 180B typically sits somewhere between GPT 3.5 and GPT4 depending on the evaluation benchmark, and further finetuning from the community will be very interesting to follow now that it's openly released,” the blog says.
The release of Falcon 180B represents the latest leap forward in the rapid progress that has recently been made with LLMs. Beyond just scaling up parameters, techniques like LoRA (low-rank adaptation), weight randomization, and Nvidia's Perfusion have enabled dramatically more efficient training of large AI models.
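As a rough illustration of the LoRA idea mentioned above, the sketch below freezes a pretrained linear layer and trains only two small low-rank matrices on top of it, which is why fine-tuning this way is so much cheaper than updating every weight. It is a simplified, assumed sketch rather than any specific library's implementation, and the names (`LoRALinear`, `rank`, `alpha`) are hypothetical.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen pretrained linear layer plus a trainable low-rank update (the LoRA idea)."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # the pretrained weights stay frozen
        # Only these two small matrices are trained: output = W x + (alpha/rank) * B A x
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)
```

Because the adapter starts at zero (`lora_b` is initialized to zeros), the wrapped layer behaves exactly like the original model before fine-tuning begins, and only a tiny fraction of the parameters ever receive gradients.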
With Falcon 180B now freely available on Hugging Face, researchers anticipate the model will see additional gains with further enhancements developed by the community. However, its demonstration of advanced natural language capabilities right out of the gate marks an exciting development for open-source AI.
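For those who want to try the openly released weights, the sketch below shows how such a model would typically be loaded with the Hugging Face transformers library. The repository id `tiiuae/falcon-180B` is assumed from the Falcon project's Hugging Face organization, and the snippet is illustrative: the full checkpoint requires far more GPU memory than a single consumer card provides.

```python
# Illustrative sketch: loading Falcon 180B via the transformers library.
# The repo id "tiiuae/falcon-180B" is assumed; the full model needs hundreds of
# gigabytes of GPU memory, so this is a sketch of the workflow, not a recipe.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tiiuae/falcon-180B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # reduce the memory footprint
    device_map="auto",           # shard the weights across available GPUs
)

inputs = tokenizer("Falcon 180B is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```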