New MidJourney V6 Release Upgrades Visuals and Tackles Text Generation (Mostly)

The new model promises better enhancers, upscalers, prompt following, and text generation capabilities. Stricter censorship, too.

3 min read

Dec 21, 2023

MidJourney has just announced its newest AI image generator model, the V6 base model, in the crowded race to rule the realm of digital creativity. Rolling out for alpha testing today, the development team says V6 features enhanced prompt accuracy, improved coherence, and—for the first time in MidJourney’s evolution— text generation capabilities.

Announced in an official Discord post, V6 is positioned as a major overhaul.

"Much more accurate prompt following as well as longer prompts, improved coherence, and model knowledge," reveals the announcement, highlighting its advancement over the previous V5.1 model launched in May 2023. The V5 model, noted for its easy-to-use short prompts and aesthetic improvement, paved the way for the more sophisticated and detailed V6.

One of the most noteworthy components of V6 is its text-drawing ability. While it's not the focal point of the model—the team says it’s still a “minor” feature—this capability puts MidJourney in direct competition with other leading models like Dall-E 3 and Ideogram. However, MidJourney's approach to text generation is unique.

Describing it as "minor text drawing ability,” Midjourney says. “You must write your text in 'quotations' and --style raw or lower --stylize values may help.”

Decrypt was able to test the model and compare it to Dall-E 3, known for its accuracy in text generation. MidJourney appears to prioritize style and aesthetics, sometimes at the cost of text precision. Most of the time it generated either inaccurate or no text. But when it did, the images were on par or even better than the ones generated by Dall-E 3, the text-to-image AI model powering ChatGPT and Microsoft Bing.

Comparing the text generations from MidJourney, Dall-E 3, SDXL with Harrlogos and Ideogram AI, one oversimplified recommendation could be to use MidJourney if aesthetics is a priority, Dall-E 3 for ease of use and cartoon digital art aesthetics, SDXL for those with advanced knowledge of A1111, and Ideogram AI for results in which the text is more important than the aesthetics.

MidJourney and Dalle-3 with ChatGPT currently cost money, where SDXL and Ideogram AI are free. Bing’s version of Dall-E 3 is free to use but it only generates square images and people can only modify prompts instead of the natural conversation approach taken by OpenAI.

MidJourney V6 is also a bit slower and more expensive than v5, however the team emphasizes its focus on speeding the model up with time. The V6 model also boasts improved upscalers in 'subtle' and 'creative' modes, enhancing image resolution by 2x.

These features, coupled with a diverse range of supported arguments like --ar (to change the resolution), --chaos (to change the variations among generations), and --stylize (to change how creative the model is), offer users a broad spectrum of creative possibilities. However, other features like inpainting, outpainting and image description are not yet available. They should come in an update next month, according to MidJourney.

The announcement calls for users to employ these "incredible powers with joy, wonder, responsibility, and respect," which has always been part of MidJourney’s ethos. But don’t get too excited as they will be more strict with censoring.

“Don't be a jerk or create images to cause drama,” the announcement reads. Chances are, that blocks attempts to create digital waifus or political deepfakes.

Edited by Ryan Ozawa.

Get crypto news straight to your inbox--

sign up for the Decrypt Daily below. (It’s free).

Get Email!

China’s Z.AI Releases GLM-5.2: A Model That Rivals Claude Opus—Using Zero Nvidia Chips

Z.ai dropped GLM-5.2 on June 16, promising top level performances, beating its already advanced GLM 5.1. The Beijing-based lab, which has been on the U.S. Entity List since January 2025, appears to be benefiting from growing concerns over America's approach to AI. Over the past week, the ban on Anthropic Fable and the release of this new model have helped drive zAI's stock up 90%, sending it to a new all-time high. GLM 5.2 has the numbers to back up the hype. On FrontierSWE—a benchmark that eva...

Midjourney Pivots From AI Images to Medical Imaging, Aiming to Build a Better MRI Alternative

Midjourney, the AI company best known for its image generation platform of the same name, is expanding into healthcare. On Wednesday, the company unveiled Midjourney Medical, a new division developing what it calls "Ultrasonic CT," a full-body imaging system that combines ultrasound hardware with AI-powered image reconstruction. Midjourney says the technology could create detailed three-dimensional body scans in roughly 60 seconds. “Our goal at Midjourney Medical is to deploy around 50,000 of t...

CFTC Hits Celsius Crypto Fraudster Alex Mashinsky With Permanent Trading Ban

The Commodity Futures Trading Commission (CFTC) has resolved its 2023 enforcement action against Celsius founder Alex Mashinsky, permanently banning him from trading markets regulated by the CFTC. The consent order also imposes a permanent CFTC registration ban on the former crypto founder, and marks the completion of the regulator’s first case against a digital asset lending platform, according to its 2023 press release. Mashinsky, who also acted as the CEO of Celsius, was imprisoned for 12 y...

News

Courses

Deep Dives

Coins

Videos