AI Researchers Claim They Can Double the Efficiency of Chatbots

Abacus AI claims to have found a way to fine-tune LLMs, making them capable of processing 200% their original context token capacity.

Aug 4, 2023

3 min read

Image created by Decrypt using AI

Have you ever noticed that your AI chatbot get lost in the middle of a conversation, or it simply says it cannot handle prompts that are too long? Well, that is because each model has a limitation in its processing capabilities, and starts to suffer once it goes over that limit —pretty much like they suffered from some kind of a digital attention deficit disorder. But this could soon change thanks to a new method for supercharging LLM capabilities.

Current LLMs have limited context capacities. For example, ChatGPT taps just 8,000 tokens of context, while Claude handles 100,000. Tokens are the basic units of text or code used by an LLM AI to process and generate language This restricts how much background information they can harness when formulating replies. Abacus AI has developed a method that allegedly doubles the usable context length for open-source LLMs like Meta’s Llama without compromising the model's accuracy in practical application.

Their technique involves "scaling" the position embeddings that track word locations in input texts. According to their Github page, Abacus AI claims that its scaling method drastically increases the number of tokens that a model can handle.

The researchers evaluated two scaled LlaMA variants on tasks like substring location and open-book QA. The scale 16 model maintained accuracy on real-world examples up to 16,000-word contexts, versus only 2,000 words in baseline Llama. It even showed some coherence at 20,000+ words, something that was not possible to achieve with just fine-tuning techniques.

The significance of context extension cannot be overstated. A narrow context window makes the model accurate but not really usable in complex tasks that require some background. Conversely, with an expanded context, LLMs can process and generate better responses but either take more time to do so or return sup-par results. Handling longer contexts efficiently could enable LLMs to absorb whole documents or multiple documents as background when generating text. This may lead to outputs that are more knowledge-grounded and consistent across long conversations.

However, the gains are not perfectly proportional to the scale factors.

It’s still necessary to fine tune strategies because scaling alone doesn’t guarantee high quality outputs. The Abacus team is also exploring advanced position encoding schemes from recent papers to further extend context capacity.

Their work suggests that scaling up existing LLMs is a viable path to expanding usable context length. This could democratize access to Large Language Models capable of handling lots of context at once.

Abacus AI has opened the doors of their repository “for research purposes only,” sharing code specific to their fine-tuning projects. This makes it possible to further iterate on its development and apply the fine tuning methods on virtually any open source Large Language Model.

With applications from personalized chatbots to creative writing aids, more memory-empowered LLMs could soon enable next-generation AI assistants that are conversant across diverse topics. For now, researchers are progressing rapidly to overcome technical constraints in pursuit of artificial general intelligence —meaning, generalized human cognitive abilities in an AI model. Maybe someday our digital friends will handle as many tabs as we humans can, but without the headache!

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Recommended News

Elon Musk’s xAI Launches ‘Remarkable, Terrifying’ Grok 4 Model
Elon Musk’s xAI has officially launched Grok 4, the latest iteration of its artificial intelligence model. The release arrives as a slew of public controversies have rocked Musk’s companies. After much nail-biting, the livestream started an hour late from its original schedule for Wednesday night. The new model's release was led by Musk, who opened the show with comments on how their work on AI has progressed so far. "In some ways it's a little terrifying, but the growth of intelligence here is...
NewsArtificial Intelligence
4 min read
Vince DioquinoJul 10, 2025
Create an account to save your articles.
AI Ghostwriting Is Creeping Into Science—Is That a Bad Thing?
Which words give AI away? A new study of more than 15 million biomedical abstracts on PubMed found that at least 13.5% of scientific papers published in 2024 show signs of AI-assisted writing tools, most notably OpenAI’s ChatGPT. The study by researchers from Northwestern University and the Hertie Institute for AI in Brain Health at the University of Tübingen found a sharp rise in 2024 in word patterns associated with AI-generated writing. These included both uncommon terms—such as “delves,” “un...
NewsArtificial Intelligence
4 min read
Jason NelsonJul 9, 2025
Create an account to save your articles.
Bye-Bye 'MechaHitler': Elon Musk's xAI Quietly Fixed Grok by Deleting a Line of Code
Elon Musk’s xAI appears to have gotten rid of the Nazi-loving incarnation of Grok that emerged Tuesday with a surprisingly simple fix: It deleted one line of code that permitted the bot to make“politically incorrect” claims. The problematic line disappeared from Grok's GitHub repository on Tuesday afternoon, according to commit records. Posts containing Grok's antisemitic remarks were also scrubbed from the platform, though many remained visible as of Tuesday evening. But the internet never for...
NewsArtificial Intelligence
5 min read
Jose Antonio LanzJul 9, 2025
Create an account to save your articles.

Coin Prices