The generative AI landscape has morphed into a high-stakes battleground in 2024, with an army of upstarts storming the castle once ruled by OpenAI.
Everyone and their tech-savvy grandma seems to be vying for a piece of the AI pie, cooking up language models, agentic AIs, image generators, and even an AI meme coin shiller or two.
The benchmarks are changing faster than our human ability to keep up. Barely a week goes by without some shiny new toy hitting the market—an updated LLM here, a turbocharged image generator there, or a next-gen AI flexing some exotic training technique.
But here at Decrypt, we've rolled up our sleeves and tried them all.
We've kicked the tires, pushed the buttons, and gotten deep inside the inner workings and the outputs provided by the most popular AI models—and some that are not so well-known.
Now that it's clear that OpenAI isn't the only sheriff in town, we've compiled a list of the cream of the crop—the generative AI models that have wowed us, befuddled us, and occasionally made us spit out our coffee.
Chatbots
A chatbot is a computer program designed to simulate conversation with human users. It uses natural language processing and artificial intelligence to understand user inputs and generate appropriate responses. People often confuse chatbots with large language models (LLMs), but the LLM is only the engine that powers the chatbot.
Today, chatbots are a bit more complex, with capabilities that extend beyond text generation. They can now browse the web, generate and understand images, talk to the user, etc.
Here is our list of the best chatbots you should try:
Gold medal: OpenAI's ChatGPT
ChatGPT offers a wide array of features at $20/month, including custom agent creation with natural language, a clean interface, web search, and multiple models (reasoning, writing, vision, voice, and image generation).
Silver medal: Anthropic's Claude
A superior LLM with an intuitive UI featuring split-screen artifacts for reasoning and code generation, Claude supports million-token context and custom agents. However, it lacks web search and image generation and often faces capacity issues, forcing users to switch to a weaker model or generate “concise” shorter answers. Because of this, it cannot be the best just yet.
Bronze medal: Mistral AI’s Le Chat
This free platform is powered by Mistral Large, featuring top-tier Flux image generation and superior web search that is the best we tested, in our opinion, even beating SearchGPT. It supports document/image understanding and open-source AI agents. However, the Mistral Large LLM isn’t as strong as its competitors, making Le Chat ideal for power users willing to trade text quality for features.
Honorable mentions: Meta AI, Gemini (from Google’s AI Studio, not the main site), Hugging Chat, Reka, Grok-2
Large language models
A large language model, or LLM, is an artificial intelligence system trained on vast amounts of text data to understand and generate human-like language. You can see it as a glorified autocomplete: It is designed to predict the most likely next token (think of tokens as words, though that’s an inaccurate comparison) given the text that came before it.
The result is natural text that feels human because, well, it resembles what humans would do.
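To make the "glorified autocomplete" idea concrete, here is a minimal sketch of next-token prediction. It is a toy, not how any of the models below actually work: real LLMs use neural networks and subword tokens, while this example simply counts which word follows which in a tiny made-up corpus. The function names and the corpus are our own illustrative assumptions.

```python
from collections import Counter, defaultdict

def train_bigram(text):
    """Count which token follows which in a whitespace-tokenized corpus."""
    tokens = text.lower().split()
    follows = defaultdict(Counter)
    for current, nxt in zip(tokens, tokens[1:]):
        follows[current][nxt] += 1
    return follows

def predict_next(follows, token):
    """Return the most frequent follower of `token`, or None if it was never seen."""
    candidates = follows.get(token.lower())
    if not candidates:
        return None
    return candidates.most_common(1)[0][0]

def autocomplete(follows, prompt, max_new_tokens=5):
    """Extend the prompt one token at a time, always picking the likeliest follower."""
    tokens = prompt.lower().split()
    if not tokens:
        return ""
    for _ in range(max_new_tokens):
        nxt = predict_next(follows, tokens[-1])
        if nxt is None:
            break
        tokens.append(nxt)
    return " ".join(tokens)

# Hypothetical toy corpus, for illustration only.
corpus = "the cat sat on the mat . the cat ate the fish ."
model = train_bigram(corpus)
print(predict_next(model, "the"))
print(autocomplete(model, "sat", max_new_tokens=2))
```

In this corpus "cat" follows "the" more often than any other word, so that is what the toy model predicts. An LLM does the same kind of guessing, only with billions of learned parameters instead of a lookup table.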
Here is our list of the best LLMs to date:
Best generalist: OpenAI's GPT-4o
Balances creative writing, coding, and reasoning with a customizable "Canvas" feature, though its style can feel predictable. The latest version (from November 20) has also achieved the top spot in the LLM Arena with an Elo score of 1,366, beating an experimental version of Google Gemini released on November 21.
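For a sense of what a gap in Elo scores means, here is a rough sketch using the classic Elo expected-score formula. We are assuming the leaderboard's numbers behave like standard Elo ratings on a 400-point scale, which may not match the arena's exact methodology, and the rival rating below is a hypothetical placeholder rather than a reported figure.

```python
def elo_expected_score(rating_a, rating_b):
    """Classic Elo estimate of how often A is preferred over B (between 0 and 1)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

gpt4o_rating = 1366   # score cited above
rival_rating = 1340   # hypothetical value, for illustration only

win_rate = elo_expected_score(gpt4o_rating, rival_rating)
print(f"Expected head-to-head preference: {win_rate:.1%}")
```

Under this formula, two models with equal ratings split votes evenly, and a gap of a few dozen points translates into only a modest edge in head-to-head comparisons.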
Best for writing: Anthropic's Claude 3.5 Sonnet
Matches or exceeds GPT-4o in many areas with more creative, human-like output, though it's prone to hallucination.
Best for storytelling: Longwriter
Generates 10,000+ word stories within minutes. Do we need to say more?
Most versatile: Meta's Llama-3.1
The leading open-source model with extensive customization, LoRA creation, and fine-tuning options, available in sizes from 8 billion to 405 billion parameters so users can run it on their local machines or cloud servers depending on their needs. Nvidia developed a customized version named "Nemotron," which made some waves in the community and is worth checking out.
Biggest letdown: Reflection Llama-3.1 70B
Announced with high expectations, the model claimed to beat GPT-4o thanks to its embedded Chain of Thought. It ended up being a major fiasco with fake benchmarks, hidden API calls to Claude AI, and a major controversy.
Image generators
An image generator is essentially a model that gets a text input and provides an output associated with that text input. So, for example, you say, “Green horse with a dragon face,” and the model will generate a photo of a green horse with a dragon face. You can also input something like “busty waifu,” but that is not what they are for.
These are some of the best image generators currently available:
Best generalist: Flux
Flux dominates the latest generation of AI models with substantial customization, LoRA/ControlNet support, and the ability to render legible text inside images. It requires powerful hardware, and it shows a characteristic style with extreme bokeh and slack skin detail that users are still trying to tackle.
It comes in three flavors: Pro (closed-source, the most potent model), Dev (noncommercial license), and Schnell (an open-source, distilled version). All three offer excellent image generation capabilities, and the ceiling will go higher if fine-tunes are considered.
Best for realism: Recraft v3
Delivers unmatched realism, offering versatile presets and better value than proprietary alternatives like MidJourney.
It has a free tier that offers the same quality, though Recraft retains ownership of images generated on that tier.
Best for anime: MidJourney Niji
Unrivaled quality for anime-style images; a Stable Diffusion fine-tune is a secondary option.
Most versatile: Stable Diffusion 3.5
Stable Diffusion 3.5 is a major improvement over SD3 with better licensing, detailed output, and add-on support.
It is more resource-efficient than Flux for fine-tuning and is a full model—unlike Flux Schnell, which is a distilled version—making it the best pick for custom models.
However, it came out a little bit late and has been overshadowed by Flux’s popularity.
Biggest letdown: SD 3 Medium
Everyone expected this new model to be the new King of Image Generators, beating SDXL and every other model. It ended up being a poor model, infamous for its horrible license and horrific aberrations when trying to generate people on grass.
Video generators
Video generators take image generation one step further. They generate each frame and use it as input to generate the following one with image consistency and high prompt adherence.
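The frame-by-frame idea can be sketched as a simple loop in which each new frame is produced from the previous one. This is only a toy under heavy assumptions: `next_frame` is a hypothetical stand-in for a real model, a "frame" here is just a short row of pixel values, and production video models are far more complex than this.

```python
def next_frame(previous, step=1):
    """Hypothetical stand-in for a model: shift a row of pixels right by `step`."""
    if not previous:
        return []
    step %= len(previous)
    return previous[-step:] + previous[:-step]

def generate_clip(first_frame, num_frames):
    """Build a clip by feeding each generated frame back in as the next input."""
    frames = [list(first_frame)]
    while len(frames) < num_frames:
        frames.append(next_frame(frames[-1]))
    return frames

# A single bright pixel that drifts to the right, one position per frame.
clip = generate_clip([1, 0, 0, 0], num_frames=3)
for frame in clip:
    print(frame)
```

Because every frame depends on the one before it, small errors can compound over time, which is one reason clips tend to stay short.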
This is still a work in progress, and models can only generate a few seconds of video. Below is a list of some of the best ones you can try.
Best generalist: Kling
A rapidly improving Chinese model that outperforms Sora in some cases. It supports face model training and consistently generates high-quality scenes, showing great versatility in terms of styles, realism, and camera movement.
Best contender: Runway Gen 3
Pioneering generative video app with solid environmental understanding, but struggles with fast-paced scenes.
Best for storytelling: ShowRunner
We cannot tell you a lot about this one. However, in confidential testing, it has shown immense potential.
Best open-source: Genmo Mochi 1
It's a great release that beats competitors like Rhymes Allegro and Stable Video Diffusion with superior realism and frame consistency.
Biggest letdown: OpenAI Sora
Announced with high expectations as a revolutionary “world model” beyond any video generation, it remains unavailable today with underwhelming leaked outputs.
Honorable mention: Google Veo
Google's Veo was released on December 3. We haven't tested it, but the generations shared by Google look pretty nice. Of course, we're on the waiting list to test the model, and you'll be the first to know our thoughts as soon as we get access.
Music generators
Just as video generators create clips, music generators create songs. They differ from general audio generators in that their output is specialized: melodic music rather than noise, plain voices, or audio effects.
Users can rely on a separate LLM to generate the lyrics of a song or input lyrics manually, and set a few parameters like the style of the song, and then the model will output relevant music from scratch.
These are the best two—plus an open-source alternative.
Best generalist: Suno v4
Excels in vocals and lyrics, style diversity, and long-form consistency. Its predecessor, Suno v3.5, is not free but remains a strong alternative.
Best contender: Udio
Suno’s biggest rival. It delivers impressive composition accuracy, nearly rivaling Suno v4 in vocals. Some generations surpass Suno v3 in subjective style.
Best open-source: Stable Audio 2
The open-source scene is not doing a lot in this area. Stable Audio 2 seems to be the best model, but lags behind closed-source competitors in every field. Meta’s AudioCraft and MusicGen are alternatives, but far from industry-leading. Fine-tuners, who usually add the cherry on top that makes open-source models so great, have not paid much attention to music so far.
Edited by Andrew Hayward