AI Start-Up Figure Shows Off Conversational Robot Infused With OpenAI Tech

Figure introduced a humanoid robot that one engineer said exhibits “common sense,” answering questions and performing tasks simultaneously.

Mar 14, 2024


Image: Figure/YouTube

Robotics developer Figure made waves on Wednesday when it shared a video demonstration of its first humanoid robot engaged in a real-time conversation, thanks to generative AI from OpenAI.

“With OpenAI, Figure 01 can now have full conversations with people,” Figure said on Twitter, highlighting the robot's ability to understand and react to human interactions instantly.

The company explained that its recent alliance with OpenAI brings high-level visual and language intelligence to its robots, allowing for “fast, low-level, dexterous robot actions.”

In the video, Figure 01 interacts with its creator’s Senior AI Engineer Corey Lynch, who puts the robot through several tasks in a makeshift kitchen, including identifying an apple, dishes, and cups.

Figure 01 identified the apple as food when Lynch asked the robot to give him something to eat. Lynch then had Figure 01 collect trash into a basket and asked it questions simultaneously, showing off the robot's multitasking capabilities.

On Twitter, Lynch explained the Figure 01 project in more detail.

We are now having full conversations with Figure 01, thanks to our partnership with OpenAI.

Our robot can:
- describe its visual experience
- plan future actions
- reflect on its memory
- explain its reasoning verbally
Technical deep-dive 🧵: pic.twitter.com/6QRzfkbxZY

— Corey Lynch (@coreylynch) March 13, 2024

“Our robot can describe its visual experience, plan future actions, reflect on its memory, and explain its reasoning verbally,” he wrote in an extensive thread.

According to Lynch, the team feeds images from the robot's cameras, along with text transcribed from speech captured by onboard microphones, into a large multimodal model trained by OpenAI.

Multimodal AI refers to artificial intelligence that can understand and generate different data types, such as text and images.

Lynch emphasized that Figure 01’s behavior was learned, run at normal speed, and not controlled remotely.

“The model processes the entire history of the conversation, including past images, to come up with language responses, which are spoken back to the human via text-to-speech,” Lynch said. “The same model is responsible for deciding which learned, closed-loop behavior to run on the robot to fulfill a given command, loading particular neural network weights onto the GPU and executing a policy.”
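The loop Lynch describes can be sketched roughly as follows. This is a toy illustration only, not Figure's or OpenAI's code: `Turn`, `Robot`, `fake_multimodal_model`, and `BEHAVIORS` are hypothetical names, the model is a keyword-matching stub, and the "behaviors" are plain functions standing in for learned policies whose weights would be loaded onto a GPU.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Turn:
    """One conversation turn: a camera frame plus a transcript."""
    image: bytes            # placeholder for a camera frame
    text: str               # transcribed speech, or the robot's reply
    speaker: str = "human"


# Hypothetical registry: behavior name -> stand-in for a learned policy.
BEHAVIORS: Dict[str, Callable[[], str]] = {
    "hand_over_apple": lambda: "ran policy: hand_over_apple",
    "collect_trash": lambda: "ran policy: collect_trash",
}


def fake_multimodal_model(history: List[Turn]) -> Tuple[str, Optional[str]]:
    """Stub model: receives the full history, returns (reply, behavior name or None)."""
    latest = history[-1].text.lower()
    if "eat" in latest:
        return "Sure thing.", "hand_over_apple"
    if "trash" in latest:
        return "On it.", "collect_trash"
    return "I see an apple, dishes, and cups.", None


@dataclass
class Robot:
    history: List[Turn] = field(default_factory=list)

    def step(self, image: bytes, transcript: str) -> Tuple[str, Optional[str]]:
        # Append the new observation so the model sees the entire conversation.
        self.history.append(Turn(image, transcript))
        reply, behavior = fake_multimodal_model(self.history)
        # Record the robot's own reply (this is what would go to text-to-speech).
        self.history.append(Turn(b"", reply, speaker="robot"))
        # Dispatch to the selected behavior, if the model chose one.
        result = BEHAVIORS[behavior]() if behavior else None
        return reply, result
```

In this sketch, calling `Robot().step(frame, "Can I have something to eat?")` selects the `hand_over_apple` stand-in, while a plain question returns a spoken reply with no behavior attached.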

Lynch explained that Figure 01 is designed to describe its surroundings concisely and can apply “common sense” to decisions, such as inferring that dishes will be placed in a rack. It can also translate vague statements, such as an expression of hunger, into actions, like offering an apple, all while explaining what it is doing.

The debut sparked a passionate response on Twitter, with many people impressed by the capabilities of Figure 01, and more than a few adding it to the list of mileposts on the way to the singularity.

“Please tell me your team has watched every Terminator movie,” one replied.

Please tell me your team has watched every terminator movie.

— Daniel Innovate (@danielinnov8) March 13, 2024

“We gotta find John Connor as soon as possible,” another added.

We gotta find John Connor as soon as possible

— Kaylard - e/acc (@KaylardAI) March 13, 2024

Sci-fi has become Sci-nonfi

Congrats to @adcock_brett, @sama, and their teams for creating the first convincing demo of life 2.0

— Justin Halford (@Justin_Halford_) March 13, 2024

For AI developers and researchers, Lynch provided a number of technical details.

“All behaviors are driven by neural network visuomotor transformer policies, mapping pixels directly to actions,” Lynch said. “These networks take in onboard images at 10hz and generate 24-DOF actions (wrist poses and finger joint angles) at 200hz.”
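As a back-of-the-envelope check on those figures, the two quoted rates imply 20 action updates for every camera frame. The sketch below only does that arithmetic; it assumes nothing about how Figure's policies actually bridge the two rates, and the constant and function names are hypothetical.

```python
IMAGE_RATE_HZ = 10     # onboard image input rate quoted by Lynch
ACTION_RATE_HZ = 200   # action output rate quoted by Lynch
ACTION_DOF = 24        # wrist poses plus finger joint angles


def actions_per_image(image_hz: int = IMAGE_RATE_HZ,
                      action_hz: int = ACTION_RATE_HZ) -> int:
    """Number of action updates emitted between consecutive camera frames."""
    return action_hz // image_hz


def action_values_per_second(action_hz: int = ACTION_RATE_HZ,
                             dof: int = ACTION_DOF) -> int:
    """Scalar action values produced each second across all degrees of freedom."""
    return action_hz * dof


def action_period_ms(action_hz: int = ACTION_RATE_HZ) -> float:
    """Time between consecutive action updates, in milliseconds."""
    return 1000.0 / action_hz
```

With the quoted numbers, a new action arrives every 5 ms, and each camera frame spans 100 ms.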

"I'm sorry Dave, I'm afraid I can't give you that apple."@Figure_robot generated with @letz_ai pic.twitter.com/kIgEUSzeAA

— Misch Strotz (@mitch0z) March 13, 2024

Figure 01’s impactful debut comes as policymakers and global leaders attempt to grapple with the proliferation of AI tools into the mainstream. While most of the discussion has been around large language models like OpenAI’s ChatGPT, Google’s Gemini, and Anthropic’s Claude AI, developers are also looking for ways to give AI physical humanoid robotic bodies.

Figure AI and OpenAI did not immediately respond to Decrypt’s request for comment.

“One is a sort of utilitarian objective, which is what Elon Musk and others are striving for,” UC Berkeley Industrial Engineering Professor Ken Goldberg previously told Decrypt. “A lot of the work that's going on right now—why people are investing in these companies like Figure—is that the hope is that these things can do work and be compatible,” he said, particularly in the realm of space exploration.

Along with Figure, another company working to merge AI with robotics is Hanson Robotics, which debuted its Desdemona AI robot in 2016.

“Even just a few years ago, I would have thought having a full conversation with a humanoid robot while it plans and carries out its own fully learned behaviors would be something we would have to wait decades to see,” Figure AI’s Senior AI Engineer, Corey Lynch, said on Twitter. “Obviously, a lot has changed.”

Edited by Ryan Ozawa.
