From Reading X-Rays to Decoding Classified UFO Reports, ChatGPT Shows Off Its Vision

Twitter is abuzz with examples of GPT-4's new visual abilities. Here are some of the best.

Oct 11, 2023

5 min read

Image created by Decrypt using AI

Although AI exploded onto the scene through sometimes eerily clever chatbots, text-based interactions are already old fashioned. The announcement of OpenAI's GPT-4 update introduced GPT-Vision (GPT-V), the latest multimodal AI marvel. The announcement is now become reality as users finally get a chance to test the full potential of its abilities.

A multimodal large language model (LLM) means that it can interact not only with the written word, but also through other modes. In this case, the new GPT-V can understand images and work with them. Also, thanks to the new generative art tool DALL-E 3, ChatGPT can both take images as input but also generate images as output.

These new capabilities have raised eyebrows across the tech space as users put them through their paces. Can they decode redacted government documents on UFO sightings? Yes. "ChatGPT-4V Multimodal decodes a redacted government document on a UFO sighting released by NASA," one tweet raves. "Maybe the truth isn't out there; it's right here in GPT-V."

ChatGPT-4V Multimodal decodes a Redacted government document on a UFO sighting released by NASA.

I have tested this on 100s of redacted documents and I can say we are in a new world. pic.twitter.com/aCKOm577TO

— Brian Roemmele (@BrianRoemmele) October 6, 2023

Trying to fill gaps in a string of text is basically what LLMs do. The user did the next best thing when trying to test GPT-V’s capabilities and made it guess parts of a text that he censored. “Nearly 100% intent accuracy." he reported.

Of course, it's hard to verify whether its guess at what's otherwise obscured is accurate—it’s not like we can ask the CIA how well it did peering through the black lines.

Even harder than uncovering information that has been censored by the government is trying to understand your doctor's cryptic handwriting. But GPT-V can unscrable the scribble. With a polite prompt, GPT-V can make sense of even the most indecipherable doctor's notes, ensuring that "take two tablets" doesn't become "bake blue waffles."

ChatGPT-4V Multimodal.

Prompt: “Please decode this document. Let’s think step-by-step. It is vital to be accurate. Thank you” pic.twitter.com/b7FPuPVRn9

— Brian Roemmele (@BrianRoemmele) October 6, 2023

But be careful. Sometimes even the most advanced AI fails against the hands of an experienced—or arthritic—doctor, and it may take an expert to decipher those written enigmas.

Codeine 4 grains
ASA (Aspirin) 30 grains
Compound to VI (6) ounces

Take (illegible) every 4 hours as needed for (illegible - possible pain)

Dose of aspirin would seem low.

Sometimes it takes a pharmacist.

— Dr. Nefarious (@_DrNefarious) October 7, 2023

And for those who don’t trust their doctors, ChatGPT can provide an instant second opinion. The model can understand X-rays and provide analysis and insights into specific medical cases.

Underrated use case of ChatGPT Vision.

It takes 13 years of training to be a radiologist.

Now instead of drafting a report from scratch, they probably just need to review AI's diagnosis. pic.twitter.com/IhQFe98m5q

— Peter Yang (@petergyang) October 2, 2023

But why stop at handwriting and body scans? GPT-V has become the latest home fitness guru, curating workout plans tailored to your home equipment and goals. And if you're curious about how many calories are in that meal you’re about to eat, GPT-V's got your back. One user gleefully shared, "OK ChatGPT 4.0 with new vision features... recognizes everything. Even a seal on the beach."

OK ChatGPT 4.0 with new vision features is pretty incredible.

Here I ask it how many calories are in the fish taco I just ate.

It is incredible to see how it recognizes everything. Even a seal on the beach. pic.twitter.com/rfIK5o9ODD

— Robert Scoble (@Scobleizer) October 5, 2023

Interior design enthusiasts, rejoice! The AI now offers design suggestions, and can incorporate personal preferences. Imagine a living space that screams "you," without the hefty designer fees. Just take a picture of your awful room and ask GPT-V for suggestions to turn it into the paradise you want it to be.

Homework woes? Just screenshot the assignment, and GPT-V takes the role of that helpful classmate you always wished sat next to you.

Kids will never do homework again. pic.twitter.com/rtjJT2xn9l

— Peter Yang (@petergyang) September 27, 2023

ChatGPT breaks down this diagram of a human cell for a 9th grader.

This is the future of education. pic.twitter.com/L0Za0ZB5rs

— Mckay Wrigley (@mckaywrigley) September 28, 2023

And for the finance geeks among us, GPT-V isn't just about fun and games. GPT-V can dive deep into technical analysis. Just input a screenshot of your favorite (or most hated) stock or crypto, and it will analyze your chart and make projections accordingly. Just remember that it's not financial advice—and if you end up poor, no AI will make you rich.

IT'S SO OVER FOR TA-OOOOORS

I gave GPT-V an image of my chart for $UBER with a bunch of indicators and it gave good long entries. Will test it out live.

Thread below! pic.twitter.com/k6Su9G0267

— Ropirito (0commoDTE) (@ropirito) October 11, 2023

The dawn of multimodal LLMs is redefining industries. With AI titans evolving, GPT-V is only the tip of the iceberg. Google’s upcoming Gemini is rumored to outperform Bard with its multimodal prowess. NexT-GPT offers an open-source alternative, and the horizon promises models trained to juggle words, sounds, videos, and images.

Such advancements aren't just technobabble—they hold implications that could reshape our daily interactions, professions, and perhaps even our worldview. And while OpenAI pioneers with GPT-V, competitors aren't far behind. Could we be on the brink of an AI renaissance?

Well, if you're still using AI just for chat, you might already be falling behind. AI can read and see, and gets more capabilities every day.

GPT-V can also ruin the fun of a "Where's Waldo?" book. Why would someone want this? This is ChaosGPT territory.

"I found him!" pic.twitter.com/LhMQ8e29x2

— Pietro Schirano (@skirano) September 29, 2023

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Recommended News

Apple’s Top AI Exec Leaves For Meta Amid Aggressive Hiring Trend
Apple’s head of foundation models, Ruoming Pang, has reportedly left the company to join Meta’s Superintelligence Labs, adding to a growing list of high-profile names allegedly poached to bolster Meta's AI ambitions. While Meta formally announced the creation of its Superintelligence Labs in an internal memo last week, Pang’s name was not included on the list. His exit from Apple was first reported by Bloomberg on Monday, citing sources familiar with the matter. The departure deals a significant...
NewsArtificial Intelligence
3 min read
Vince DioquinoJul 8, 2025
Create an account to save your articles.
US Politician’s Tweet Revives Cultural Debate Over UAPs
Are we mistaking angels for aliens? A viral post by U.S. Representative Anna Paulina Luna (R-Fla.) has sparked debate over whether a recent UFO image shows a biblical being instead of an extraterrestrial visitor. The debate began on Sunday when Luna, tweeting from her personal account, replied to a post by user Adrian Dittmann showing a biblically accurate angel. “Actual representation of angels,” Luna wrote on X. “10/10 post.” Actual representation of Angels. 10/10 post. — Anna Paulina Luna...
NewsSpace
3 min read
Jason NelsonJul 8, 2025
Create an account to save your articles.
Here's How All Major AI Platforms Stacked Up in a Harry Potter Sorting Hat Quiz
A computer developer known as Boris the Brave conducted an experiment that placed the 17 major language models through the official Harry Potter house quiz, sampling each question 20 times and calculating the probability of each house assignment. "Perhaps unsurprisingly, the vast majority of models prefer Ravenclaw, with the occasional model branching out to Hufflepuff," Boris wrote in a blog post sharing his results. Eleven out of 17 AI models scored a perfect 100% probability for Ravenclaw—the...
NewsArtificial Intelligence
4 min read
Jose Antonio LanzJul 7, 2025
Create an account to save your articles.

Coin Prices