Last month,Anime Archives the $61.5 billion-valuated AI startup Anthropic set up a gaming livestream on Twitch. Gaming livestreams are nothing new on Twitch, but this one is a little different: Claude, Anthropic's AI model, is attempting to beat Pokémon Red.
We are now one month in,and the livestream is still going. However, Claude has not progressedall that much. And, at this rate, Anthropic's AI agent may possibly never be the very best, like no one ever was.
According to Anthropic, when it first launched the "Claude Plays Pokémon" project, previous versions of its AI agent Claude failed at some very basic tasks. For example, according to Anthropic, Claude 3.5 would try to run away from almost every battle in June 2024.
A few months and a few versions of Claude later, Anthropic said there was a stark change. In February 2025, Anthropic gave Claude 3.7 Sonnet a whirl at playing Pokémon.
"Within hours, Claude defeated Brock. Days later, it trounced Misty," Anthropic said. "Progress that older models had little hope of achieving."
Anthropic said that Claude 3.7 Sonnet could plan ahead, remember objectives, and learn from its mistakes, unlike previous versions of the AI agent. It also built a knowledge base, saw the screen, and simulated button presses.
However, the progress Claude 3.7 Sonnet originally made in the game seems to have stalled.
For example, livestream viewers watchedas Clause 3.7 took 78 hoursto get through Mt. Moon in the game. On Reddit, gamers estimatedthat it would typically take a child just a few hours to advance through the same stage.
SEE ALSO: Hands-on with the Claude AI app: It's pleasant to use, but jankyClaude can be seen going in circles, stumbling around the same paths, and often knocking into walls as it tries to get around the game.
The livestream is engaging, especially as a text box lays out Claude's "thinking" as the AI agent tries to figure out what moves to make next.
According to Anthropic engineers in an interview with Ars Technica, Claude has an easier time with aspects of the game which involve text, such as Pokémon battles. However, it struggles with the more visual aspects of the game, such as moving around from town to town on the map.
Claude 3.7 Sonnet has gone much further in the game than previous Claude models, so there's been progress. However, for those warning that AI will soon be able to take over the world, we're nowhere close to that being a reality yet. Claude still has 151 Pokémon to catch.
Topics Artificial Intelligence Gaming Pokemon Twitch Streaming
John Stamos' pants gave him a lot of trouble at a recent showThat 'Ghost in the Shell' race problem is even worse than it looksChocolate maker Cadbury accused of 'airbrushing' Easter and Twitter can't evenFor some odd reason Demi Lovato resurrected Poot, and people are thrilled'Rogue One' reveals some easter eggs, but plenty are still hiddenOops, that vibrator with a camera is super easy to hackThe only Melania in the White House is this photograph'Invader Zim' is back to conquer earth in a new TV movie revivalGranny uses her $1200 Louis Vuitton to bag her fish from the marketThe only Melania in the White House is this photograph'Invader Zim' is back to conquer earth in a new TV movie revivalYahoo + AOL = Oath? It's looking like thatNetflix to the world: We have ALL the comedy specialsThe MacBook Pro's beloved MagSafe could return as ... a dongleLive from space! Watch Earth live streamed online.It looks like the Undertaker has wrestled his last match, and we're having all the feels36 baby names that are on the verge extinctionThe Daily Show's Jordan Klepper is getting a late night spinoff seriesResearchers invent the opposite of 'Prisma'... for SCIENCEApple dropping Imagination is a warning to iPhone suppliers everywhere Donald Trump referred to Stormy Daniels as 'horseface.' Seriously. Woman saved from hostage situation after leaving plea for help on GrubHub food order 10 best Google Chrome extensions for productivity How to change your Zoom background Twitter launches closed caption toggle on iOS and Android 'The Black Phone' review: Ethan Hawke embodies fears of Stranger Danger generation Watch Rihanna vogue to the 'A Star Is Born' soundtrack YouTube promises creators more transparency and expanded features Jacob deserved a happier ending in 'Grace and Frankie' 'Only Murders in the Building' Season 2 review: Bigger world, more chaos Amazon says Alexa will soon be able to mimic the voice of dead loved ones 'Wordle' today: Here's the answer, hints for June 23 You need to check out this cool coin with an amazing hidden feature Please don't wear a Trump Twitter is testing Notes, a new longform format VSCO launches Spaces, a collaborative gallery for creators Do you have to pay for Microsoft's Defender Antivirus? Everything to remember ahead of 'The Umbrella Academy' Season 3 'Spiderhead' review: Chris Hemsworth and Miles Teller face off but without thrills This absurd parody proves that all TED Talks really do sound the same