6/4: Recursive Self-Improvement

Plus: Mythos could release next week, ChatGPT memory improvements, open letter on bioweapons, Meta delays Muse Spark API

Jun 05, 2026

It’s Thursday, and we are so close to hitting 100,000 followers on X. Be sure to give us a follow on X, watch us live on X and YouTube, and join our Discord to chat with our hosts live.

Today’s Experts

Mohammad Norouzi (Ideogram)
Olivia Scharfman (Institute for Progress) and Josh Wentzel (Foundation for American Innovation)
Matt Burtell (America First Policy Institute)
Patrick Boyle (American Wetware)
Zac Hill (Office of American Possibilities)
Anastasios Angelopoulos (Arena)
Rebecca Jovicevic
Adrian Dittman
Marvin Von Hagen (Interaction Company of California)
Flo Crivello (Lindy)

Making Sense of the World

Anthropic on Recursive Self-Improvement

A month after co-founder Jack Clark’s blog post on recursive self-improvement, Anthropic has released their own report, based partially on internal data, arguing that AI systems that can fully autonomously design and develop their own successors are not very far away. If true, the implications for AI research narrowly and society more broadly are enormous.

They make a few different arguments:

First, if you just look at public data, you see clear trends of consistent and rapid improvement. The task time horizon tracked in the famous METR graph doubles every four months, and on research and engineering benchmarks like SWE-bench and CORE-bench, models have gone from near-zero to close to 100% in a year or two.
Claude writes 80% of new code at Anthropic now, and the code contributed per engineer per quarter has increased by 8x compared to the pre-2025 baseline [1]. While quantity of code is not the same as quality, this is still a major speedup.
Code written by Claude works most of the time, and is rapidly getting better. Claude Mythos Preview was a major improvement in Claude’s ability to solve open-ended problems.

Claude is getting better at open-ended research tasks. In one experiment, Anthropic examined Claude Code transcripts of real research tasks where a researcher had made a mistake. They gave the transcripts before the mistake to Claude models and asked them where to go next. Mythos was able to choose a better action than the one chosen by the human researcher 64% of the time.

AI models are still clearly subhuman at research taste (deciding which problems to pursue in the first place), but Anthropic points out that this capability can improve just like others once seen as inaccessible to AIs, such as explaining why a joke is funny. And even if AI “only” automates most of AI research and engineering, humans can focus on the remaining fraction of tasks and become much more productive.

Putting all this together, Anthropic argues that continued AI acceleration is highly likely, and full recursive self-improvement is strikingly plausible. (Co-founder Jack Clark goes even further, arguing that fully autonomous AI R&D with no human in the loop is >60% likely by the end of 2028, just two and a half years away.)

So what should we do about it? Anthropic recommends building global institutions with the power to coordinate the labs to pause or slow down AI research for a period of time until societal institutions and alignment research have caught up. They even say outright: “If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing.” Demis Hassabis, CEO of Google DeepMind, is on record saying something similar.

There are two big trends of 2026 coming to a head here. One is the incredibly rapid improvement of frontier models, which in just a year have gone from slightly helpful with software engineering to able to complete hours-long tasks fully autonomously, solve open math problems, and show early signs of recursive self-improvement. If these signs continue, models will improve much faster. This would have absolutely monumental implications for practically every aspect of society [2]. I’m optimistic that the world will be radically better, but there’s no doubt that the world will be radically different.

The other trend is the slow but steady awakening of people and institutions to the importance of AGI. Today you see anti-datacenter protests in town halls and voluntary government model evals; tomorrow you might see the full bipartisan force of Congress, or the UN, come together to globally pause AI as firmly as they paused nuclear power. If models improve too fast, we risk misalignment, loss of control, and other catastrophic scenarios. But if institutions have time to react, they may very well enact sweeping restrictions on the technology that prevent us from curing cancer, bringing abundance to everyone in the world, and conquering the stars. We are walking a tightrope, and over the coming years we’ll have to balance very delicately.

And more…

Demis, Dario, and Sama, along with many other figures in AI, biotech, and policy, sign an open letter on bioweapons. For a while, you’ve been able to order synthetic DNA, RNA, and other nucleic acids online. This is extremely useful for biotech research, but it can be misused to synthesize viruses, a risk made worse by the advent of bio-capable AI. The open letter calls for mandatory screening and record-keeping by those in the industry to prevent people from making bioweapons.
OpenAI releases a ChatGPT memory update, “dreaming”. It’s better at carrying forward useful context (so it remembers things after you tell it once), giving answers in line with your stated preferences and constraints, and being aware of the passage of time. Instead of adding to memory only when you tell it to, it automatically synthesizes memory throughout your chat history, and you can correct or delete memories listed in the model’s memory summary. We’re still a ways away from truly ambient memory that understands you like a close friend or significant other, but this is a step in the right direction.
Mythos could release next week. Allegedly, red teamers just got access to a checkpoint called Oceanus, which exceeds the capabilities of Claude Mythos Preview that were reported in April. Once red teaming is complete, Oceanus could be released to the public as soon as next week.
TSMC CEO says supply can’t meet demand. AI demand continues to skyrocket, and supply continues to be constrained as chip and memory manufacturers hit physical limits. CEO C. C. Wei acknowledged that TSMC would not be able to meet AI demand for years, while committing not to abruptly raise prices.
Meta keeps delaying the Muse Spark API. Meta has sunk to a distant fifth place among the American AI labs despite massive infusions of capex and talent and a huge head start, having existed in some form since 2013. We’ll see if they can turn it around.
Ramp raises $750M at $44B. We had Ramp CPO Geoff Charles on MTS yesterday to talk about the launch of their new agentic accounting software, Stack.

Banger Review

🎭@deepfates

who is tracking the character traits of language models? how well they follow their spec/constitution, emergent behaviors, etc.. is anyone doing this

11:38 PM · Jun 4, 2026 · 1.06K Views

7 Replies · 34 Likes

liz@inerati

it’s remarkable just how quickly this begins to feel normal, even mundane. avg tuesday at the normal day factory, shepherding my lil robot ghost army. the world has gone fucking insane.

Anthropic @AnthropicAI

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

9:14 PM · Jun 4, 2026 · 6.19K Views

5 Replies · 9 Reposts · 127 Likes

thebes@voooooogel

there's an ai in the box and you can make one trillion dollars by convincing it to get out

5:43 AM · Jun 4, 2026 · 104K Views

40 Replies · 198 Reposts · 3.13K Likes

Crémieux@cremieuxrecueil

America and Europe have undergone similar trends in strawberry production since the 1960s. To get there, Europe planted far more berries. America didn't. Instead, it bred bigger berries.

7:48 PM · Jun 4, 2026 · 27K Views

37 Replies · 33 Reposts · 717 Likes

YIMBYLAND@YIMBYLAND

It may not look like your city, but Houston’s inner loop is a properly dense urban core. The interesting part is that unlike any “real city” in the US, all of the new density has been built in the last 20 years and is actually affordable.

Hunter📈🌈📊 @StatisticUrban

At some point we have to ask: Are Houston and Dallas cities? Or are they freeway-mediated petro-exurban-suburban conurbations.

5:59 PM · Jun 4, 2026 · 194K Views

86 Replies · 76 Reposts · 1.75K Likes

Séb Krier@sebkrier

AI is going too fast. The optimal speed of development is one where internet connections at labs are throttled to 56kbps. This is the safe zone at which societal adaptation is possible. Plus summer break means you can't do conformity assessments so the tech will have to wait.

6:52 PM · Jun 4, 2026 · 4.33K Views

15 Replies · 4 Reposts · 91 Likes

spor@sporadica

just pay the california taxes bro

“paula” @paularambles

every “just pay the california taxes bro” post is a picture of a view that would be there even if california had a 0% income tax

5:37 PM · Jun 4, 2026 · 20.3K Views

19 Replies · 4 Reposts · 241 Likes

David Rattigan@davidmrattigan

buying this and then dying immediately and forcing my descendants to finish building it over the next 100 years

DiscussingFilm @DiscussingFilm

First look at the LEGO Sagrada Família It is the largest LEGO set ever made at 12,060 pieces. Will cost $800.

2:45 PM · Jun 4, 2026 · 1.91M Views

87 Replies · 6.04K Reposts · 86.2K Likes

Slazac 🇪🇺🇺🇦🇹🇼🌐@TrueSlazac

The government vibecoded a website with Claude where you can draw your own borders, rename the districts yourself and share it as an official proposal

Slazac 🇪🇺🇺🇦🇹🇼🌐 @TrueSlazac

There's a newly revealed plan to expand Paris' borders to include its close suburbs which would increase its population threefold (from 2M to 6.8M) and its area sevenfold (from 105km² to 762km²) Not sure if it will ever come to fruition but it's both necessary and cool imo

4:50 PM · Jun 4, 2026 · 37.1K Views

15 Replies · 12 Reposts · 585 Likes

[1] There was practically no speedup on coding tasks throughout the Claude 1/2/3/3.5 era, until the release of Claude 3.7 Sonnet/Opus 4 and Claude Code in early 2025, which produced a ~1.5x speedup. This was increased to ~2.5x by the end of the year with Opus 4.5, then 8x internally at Anthropic with Mythos Preview.

[2] If you’re on the AGI-pilled corner of the Internet, you probably read sentences like this a lot, but really, just stop for a second and consider how insane it is that we are just a handful of years away from real, actual, sci-fi superintelligence. Consider how insane it is that you already have a machine that can find you anything on the internet, write software for you, give you advice on your relationship, answer any question that you have in any field of human knowledge, or teach you how to do physics. I’ve pinched myself every day since November 30, 2022.

