Crypto Observer

Google’s Gemini panicked when playing Pokémon

By Crypto Observer Staff | June 17, 2025 | 4 min read

AI companies are battling to dominate the industry, but sometimes, they’re also battling in Pokémon gyms.

As Google and Anthropic both study how their latest AI models navigate early Pokémon games, the results can be as amusing as they are enlightening. This time, a Google DeepMind report says that Gemini 2.5 Pro resorts to panic when its Pokémon are close to death. According to the report, this can cause “qualitatively observable degradation in the model’s reasoning capability.”

AI benchmarking — or, the process of comparing the performance of different AI models — is a dubious art that often provides little context for the actual capabilities of a given model. But some researchers think that studying how AI models play video games could be useful (or, at the very least, kind of funny).

Over the last several months, two developers unaffiliated with Google and Anthropic have set up respective Twitch streams called “Gemini Plays Pokémon” and “Claude Plays Pokémon,” where anyone can watch in real time as an AI tries to navigate a children’s video game from over twenty-five years ago.

Each stream displays the AI’s “reasoning” process — or, a natural language translation of how the AI evaluates a problem and arrives at a response — giving us insight into the way that these models work.

While the progress of these AI models is impressive, they are still not very good at playing Pokémon. It takes hundreds of hours for Gemini to reason through a game that a child could complete in far less time.

What’s interesting about watching an AI navigate a Pokémon game is less its completion time than how it behaves along the way.

“Over the course of the playthrough, Gemini 2.5 Pro gets into various situations which cause the model to simulate ‘panic,’” the report says.

This state of “panic” can result in the model’s performance getting worse, as the AI may suddenly stop using certain tools at its disposal for a stretch of gameplay. While AI does not think or experience emotion, its actions mimic the way in which a human might make poor, hasty decisions when under stress — a fascinating, yet unsettling response.

“This behavior has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring,” the report says.

Claude has also exhibited some curious behaviors in its journeys across Kanto. In one instance, the AI picked up on the pattern that when all of its Pokémon run out of health, the player character will “white out” and return to a Pokémon Center.

When Claude got stuck in the Mt. Moon cave, it erroneously hypothesized that if it intentionally got all of its Pokémon to faint, then it would be transported across the cave to the Pokémon Center in the next town.

However, that isn’t how the game works. When all of your Pokémon faint, you return to whichever Pokémon Center you used most recently, not the geographically nearest one. Viewers watched in horror as the AI essentially tried to kill itself in the game.
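
For readers who think in code, the respawn rule described above can be sketched as a toy model. This is only an illustration, not the game's actual logic: the `Player` class, the function names, and the place labels (`town_before_cave`, `cave`) are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Player:
    """Toy model of a player; every field name here is hypothetical."""
    location: str
    last_center: str  # the Pokémon Center used most recently
    party_hp: list[int] = field(default_factory=list)


def heal_at_center(player: Player, town: str, full_hp: int = 20) -> None:
    """Visiting a center heals the party and becomes the respawn point."""
    player.location = town
    player.last_center = town
    player.party_hp = [full_hp for _ in player.party_hp]


def resolve_white_out(player: Player) -> bool:
    """If every Pokémon has fainted, move the player to the most recently
    used center, regardless of which center is geographically closest."""
    if player.party_hp and all(hp == 0 for hp in player.party_hp):
        player.location = player.last_center
        return True
    return False


# The player last healed before entering the cave, then lost every battle.
player = Player(location="town_before_cave", last_center="", party_hp=[5, 3])
heal_at_center(player, "town_before_cave")
player.location = "cave"
player.party_hp = [0, 0]
resolve_white_out(player)
print(player.location)
```

Under this model, fainting inside the cave sends the player back to the town they came from, which is why Claude's plan could not carry it forward.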

Despite its shortcomings, there are a few ways in which the AI can outperform human players. As of the release of Gemini 2.5 Pro, the AI is able to solve puzzles with impressive accuracy.

With some human assistance, the AI created agentic tools — prompted instances of Gemini 2.5 Pro geared toward specific tasks — to solve the game’s boulder puzzles and find efficient routes to reach a destination.

“With only a prompt describing boulder physics and a description of how to verify a valid path, Gemini 2.5 Pro is able to one-shot some of these complex boulder puzzles, which are required to progress through Victory Road,” the report says.
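
To give a sense of what “verifying a valid path” might involve, here is a minimal sketch of a path checker for a simplified push puzzle. It is not the tool described in the report. The grid symbols, the `verify_path` function, and the rule that a boulder moves exactly one tile when the player walks into it are all assumptions made for illustration.

```python
# Hypothetical grid symbols: '#' wall, 'P' player, 'B' boulder,
# 'T' target tile, '.' empty floor.
MOVES = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def verify_path(grid: list[str], moves: str) -> bool:
    """Replay `moves` and report whether every target ends up covered
    by a boulder without any illegal step along the way."""
    walls, boulders, targets = set(), set(), set()
    player = None
    for r, row in enumerate(grid):
        for c, ch in enumerate(row):
            if ch == "#":
                walls.add((r, c))
            elif ch == "B":
                boulders.add((r, c))
            elif ch == "T":
                targets.add((r, c))
            elif ch == "P":
                player = (r, c)
    if player is None:
        return False

    def blocked(pos: tuple[int, int]) -> bool:
        # A tile is blocked if it is a wall or lies outside the grid.
        r, c = pos
        inside = 0 <= r < len(grid) and 0 <= c < len(grid[r])
        return not inside or pos in walls

    for move in moves:
        if move not in MOVES:
            return False
        dr, dc = MOVES[move]
        nxt = (player[0] + dr, player[1] + dc)
        if blocked(nxt):
            return False
        if nxt in boulders:
            # Assumed physics: the boulder slides one tile in the push direction.
            beyond = (nxt[0] + dr, nxt[1] + dc)
            if blocked(beyond) or beyond in boulders:
                return False
            boulders.remove(nxt)
            boulders.add(beyond)
        player = nxt
    return targets <= boulders


grid = [
    "######",
    "#PB.T#",
    "######",
]
print(verify_path(grid, "RR"))   # two pushes put the boulder on the target
print(verify_path(grid, "RRR"))  # the third push would shove it into a wall
```

A checker like this only confirms or rejects a proposed sequence of moves. Finding the sequence in the first place is the harder part, and that is the step the report credits to the model.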

Since Gemini 2.5 Pro did a lot of the work in creating these tools on its own, Google theorizes that the current model may be capable of creating these tools without human intervention. Who knows, maybe Gemini will therapize itself into creating a “don’t panic” module.
