OpenAI’s ChatGPT has quickly become a friend to many coders, but for cybersecurity researchers, it apparently is not reliable enough to catch the dangerous bugs out there.

In a recent report, Web3 security firm Immunefi found that many security researchers are making use of ChatGPT as part of their everyday workflow. According to its survey, about 76% of white hat researchers, those who probe systems and code for weaknesses in order to fix them, regularly use ChatGPT, compared to just over 23% who do not.

However, the report says that many researchers find ChatGPT wanting where it counts. Chief among their concerns, about 64% of respondents said ChatGPT provided “limited accuracy” in identifying security vulnerabilities, and approximately 61% said it lacked the specialized knowledge needed to identify exploits that hackers can abuse.

Jonah Michaels, communications lead at Immunefi, told Decrypt that the report shows white hats remain “surprisingly bullish” about ChatGPT’s potential, especially for educational purposes, but said his company does not share that sentiment when it comes to its own work.

“The white hats see a broader use for it,” said Michaels. “We see a more limited use of it, because we see it being used to submit essentially garbage bug reports.”

Immunefi, which specializes in bug bounty programs in the Web3 space, has banned users from submitting bug reports written with ChatGPT since the tool first became publicly available. One tweet the company posted included a screenshot of a prompt asking ChatGPT itself why it shouldn’t be used for bug reporting, to which the chatbot responded that its outputs “may not be accurate or relevant.”

Michaels said Immunefi immediately bans users who submit ChatGPT-generated bug reports. Such reports, he said, often look well written enough to be convincing from a “3,000-foot view,” but they are typically riddled with flaws, citing functions that simply don’t exist.

Since its release last November, ChatGPT has been dogged by the inconsistent accuracy of some of the content it produces, from fabricating sexual assault allegations to citing legal precedents that do not exist in court documents.

OpenAI warns users against blindly trusting ChatGPT because of its propensity to produce misleading or completely inaccurate information, typically called “hallucinations.” A spokesperson for OpenAI did not return Decrypt’s request for comment for this story.

In the Immunefi report, the white hat community expressed the view that ChatGPT models will require more training to diagnose cyber threats or conduct audits, because they currently lack that specialized knowledge.

Michaels said the chatbot suffers from not having the right datasets today, and that for now developers should rely on manually crafted code to be on the safe side. However, he added that ChatGPT or other generative AI tools like it may one day be able to handle these tasks more reliably.

“Is it possible for ChatGPT to improve and to be specifically trained on project repositories and much more in the blockchain world? I think so,” Michaels told Decrypt. “But I don’t think I can recommend that now with how high the stakes are, and how new the field is.”
