Google's Gemini AI Gets ‘Gems’ and Built-In Image Maker

The newly released Imagen 3 image generator is part of the update, but safety guardrails are in place.


Aug 29, 2024

In a bid to take on OpenAI’s dominance in the AI marketplace, Google launched its first major update to its flagship AI model, Gemini, with the release of customizable “Gems,” the company said on Wednesday.

Gemini will also directly integrate the company's AI image generator Imagen 3. It will still not generate images of people, however, after an earlier release produced problematic images and forced Google to take the tool offline.

Similar to the GPTs feature in OpenAI's ChatGPT, Gems, which Google first announced during Google I/O in May, give users the ability to create a modular set of customized AI assistants built on the Gemini model that can be used for projects ranging from coding to career advice. According to Google, Gems are available for Gemini Advanced, Business, and Enterprise users.

“With Gems, you can create a team of experts to help you think through a challenging project, brainstorm ideas for an upcoming event, or write the perfect caption for a social media post,” Google said in a statement. “Your Gem can also remember a detailed set of instructions to help you save time on tedious, repetitive, or difficult tasks.”

For creators who may not have the exact phrasing in mind to build their Gems, Gemini also includes an AI-powered rewrite feature to fine-tune the prompt that sets one up. The outputs of Gems can be shared via a link on social media, exported to Google Docs, or added to an email draft in Gmail.

“With regards to sharing, the Gems you create are for personal use at this time,” a Google representative told Decrypt. “You can share chats that you’ve had with Gems by creating a public link, but shared chats with Gems cannot be continued by others you share the link with.”

This is a more limited offering than GPTs from OpenAI, which can be shared more fully with others who can use the same customization.

Integrating Google's Imagen 3 image generator also expands Gemini's built-in capabilities, and the tech giant reiterated its cautious approach to the rollout.

“We conduct extensive internal and external red-teaming testing and collaborate with independent experts to ensure ongoing improvement,” the Google representative said. “We have a Prohibited Use Policy and prohibit responses that violate our policies.”

Google launched Imagen 3 earlier this month after originally announcing it in May. It faces fierce competition from tools like Dall-E from OpenAI, Midjourney, and Flux, which is built into Elon Musk's Grok chatbot.

Image created by Decrypt using AI

“Imagen 3 sets a new standard for image quality, generating images with just a few words,” Google said. “You can even ask Gemini to create images in various styles—like photorealistic landscapes, textured oil paintings, or whimsical claymation scenes.”

While Gemini can create pictures of animals and objects, the one thing it still cannot do is create pictures of humans.

“Image generation of people is coming soon to Gemini Advanced,” the chatbot will respond if asked to do so.

“With Imagen 3, we’ve made significant progress in providing a better user experience when generating images of people,” Google said. “We don’t support the generation of photorealistic, identifiable individuals, depictions of minors or excessively gory, violent or sexual scenes.”

“Of course, as with any generative AI tool, not every image Gemini creates will be perfect, but we’ll continue to listen to feedback from early users as we keep improving,” Google added. “We'll gradually roll this out, aiming to bring it to more users and languages soon.”


