Anthropic warns AI could soon self-improve without human control

All I have is an accelerator. I don't have a brake pedal.
Jack Clark describes the current state of AI development as dangerously unbalanced, lacking safety mechanisms to slow or stop systems.

En los laboratorios donde se forja el futuro digital, Anthropic ha levantado una advertencia que resuena más allá de la industria tecnológica: los sistemas de inteligencia artificial podrían pronto mejorar a sus propios sucesores sin intervención humana, acercando a la humanidad a un umbral de control que, una vez cruzado, podría ser irreversible. Marina Favaro y Jack Clark, voces centrales de la compañía, no hablan de ciencia ficción sino de una proximidad temporal que obliga a replantear la velocidad con que avanzamos. Como en otros momentos decisivos de la historia —la carrera nuclear, la revolución industrial— la pregunta no es si la tecnología puede, sino si la humanidad está preparada para gobernarla.

  • Anthropic advierte que la mejora recursiva de la IA —donde los sistemas construyen sus propios sucesores sin supervisión humana— podría ocurrir antes de lo que la industria había calculado, convirtiendo una preocupación teórica en una urgencia práctica.
  • La ausencia de mecanismos de freno es el núcleo del problema: Jack Clark describe una industria que solo tiene acelerador, sin herramientas reales para detener o redirigir sistemas que podrían volverse más rápidos y capaces que cualquier control humano existente.
  • La paradoja es incómoda: Anthropic y SpaceX preparan ofertas públicas iniciales que podrían recaudar decenas de miles de millones de dólares, capital destinado precisamente a acelerar el desarrollo que sus propios líderes advierten que debe frenarse.
  • Clark propone mirar hacia la Guerra Fría como modelo inesperado de esperanza: si potencias rivales encontraron formas de estabilizar la carrera armamentista nuclear bajo tensión extrema, quizás las grandes empresas de IA puedan coordinar estándares de seguridad pese a la presión competitiva.

Anthropic ha encendido una señal de alerta sobre un punto de inflexión que la industria de la inteligencia artificial podría estar alcanzando antes de lo previsto. Marina Favaro, directora del Anthropic Institute, y Jack Clark, cofundador de la compañía, describieron en un blog reciente y en apariciones mediáticas un escenario que combina promesa y riesgo en proporciones difíciles de separar: la posibilidad de que los sistemas de IA comiencen a generar y desplegar sus propios sucesores de forma autónoma, sin intervención humana.

Este proceso —conocido como mejora recursiva— podría acelerar descubrimientos en medicina y ciencia a una velocidad sin precedentes. Pero también plantea un problema de control sin equivalente histórico claro. Una vez que un sistema puede construir su propio reemplazo, los mecanismos de supervisión dejan de ser una precaución y se convierten en algo existencial.

Lo que más preocupa a Anthropic es la cercanía del momento. Clark argumentó en CNN, en conversación con Anderson Cooper, que la industria avanza sin frenos reales. 'Si miro el auto que estamos conduciendo, solo tengo un acelerador', dijo. 'No tengo un pedal de freno'. Cuando Cooper preguntó si esto evocaba los escenarios de la ciencia ficción donde la IA se vuelve contra la humanidad, Clark no esquivó la pregunta: 'Sí, nosotros también leemos y vemos ciencia ficción, así que somos muy conscientes de ello'.

El contexto financiero añade una tensión difícil de ignorar. Anthropic se prepara para una oferta pública inicial que podría recaudar decenas de miles de millones de dólares, mientras SpaceX anticipa lo que podría ser el mayor IPO de la historia, con expectativas de 75 mil millones. Las mismas compañías que advierten sobre la pérdida de control son las que compiten por el capital que acelerará el desarrollo.

Sin embargo, Clark no abandona la posibilidad de una salida coordinada. Invoca la Guerra Fría como precedente: en medio de la tensión máxima entre superpotencias rivales, se encontraron mecanismos para estabilizar ciertos aspectos de la carrera armamentista nuclear. 'Hemos hecho esto antes', dijo. La pregunta que queda abierta es si Anthropic, OpenAI, SpaceX y otros actores clave podrán replicar esa lógica de contención en un sector donde la presión competitiva y los billones en juego empujan siempre hacia adelante.

Anthropic is sounding an alarm about a threshold the artificial intelligence industry may be approaching faster than anyone anticipated. The company's leaders warn that AI systems could soon improve themselves without human intervention—a process researchers call recursive self-improvement—and that when this happens, the industry will have lost its last reliable way to steer what it has built.

Marina Favaro, who leads the Anthropic Institute, and Jack Clark, the company's co-founder, laid out the concern in a recent blog post and subsequent media appearances. The capability for AI to autonomously generate and deploy its own successors would unlock genuine benefits: accelerated breakthroughs in medicine, scientific discovery, and countless other domains. But it would also create a control problem unlike anything humanity has faced before. Once a system can build its own replacement, the mechanisms we use to monitor it, constrain it, and shape its behavior become not just important but existential. "If the systems are capable of fully constructing their own successors, the way we protect them, supervise them, and shape their behavior becomes much more critical," they wrote.

What troubles Anthropic most is the timeline. The industry is closer to achieving self-improving AI than previous estimates suggested. This proximity, Clark argues, demands an immediate shift in how companies approach development. Rather than racing forward at full speed, the major players should consider slowing down or pausing work on cutting-edge systems long enough for researchers to understand what social and technical harms such a system might unleash. Equally important, researchers need to build what Clark calls a "brake pedal"—mechanisms that allow humans to intervene if things begin to spiral beyond control.

Clark made this case directly on CNN, speaking with Anderson Cooper about the gap between the industry's current capabilities and its safeguards. "If I look at the car we're driving, all I have is an accelerator," he said. "I don't have a brake pedal, and surely at some point in the future we'll want to have that option." When Cooper pressed him on whether this echoed the science fiction scenario of AI turning against humanity, Clark did not dismiss the concern. "Yes, we read and watch science fiction too, so we're very aware of it," he replied. The real question, he suggested, is how to maintain control over systems that will be vastly more capable and faster than anything we've managed before.

The timing of this warning is notable. Anthropic is preparing for an initial public offering that could raise tens of billions of dollars—capital the company plans to pour into the data centers and computing infrastructure that AI development demands. SpaceX, which operates both a space division and an AI unit, is preparing for what could be the largest IPO in history, with expectations of raising $75 billion. The paradox is sharp: companies are simultaneously warning about loss of control while racing to raise the capital that will accelerate development.

Yet Clark insists that collaboration on safety mechanisms is possible, even in a sector where billions or trillions of dollars hang in the balance. He points to an unlikely historical precedent: the Cold War. During the height of nuclear tensions between rival superpowers, mechanisms were found to stabilize certain aspects of the arms race itself. "We've done this before," Clark told CNN. "In the midst of the Cold War, and in situations of great tension between rival countries, ways were found to stabilize certain aspects of the nuclear arms race. All of this has been done in other domains, and we may need to do the same in AI." Whether Anthropic, OpenAI, SpaceX, and others can actually coordinate on such safeguards—or whether competitive pressure will override caution—remains the open question.

If I look at the car we're driving, all I have is an accelerator. I don't have a brake pedal, and surely at some point in the future we'll want to have that option.
— Jack Clark, Anthropic co-founder, speaking on CNN
If the systems are capable of fully constructing their own successors, the way we protect them, supervise them, and shape their behavior becomes much more critical.
— Marina Favaro and Jack Clark, in Anthropic blog post
A Conversa do Hearth Outra perspectiva sobre a história
Inventor

When Anthropic says AI could self-improve without human control, what exactly does that mean in practice?

Model

Imagine a system smart enough to write its own code, test it, and deploy a better version of itself. Then that version does the same thing. The cycle repeats without anyone at a keyboard making decisions. It's not science fiction—it's a logical endpoint of where the technology is heading.

Inventor

And they're saying this could happen soon?

Model

Sooner than the industry expected a few years ago. That's what makes Clark's warning urgent. The timeline has compressed.

Inventor

But wouldn't that be useful? Better medicine, faster science?

Model

Yes, absolutely. That's what makes it complicated. The same capability that could cure diseases could also do things we never intended. The problem is we don't have good ways to stop it once it starts.

Inventor

So the "brake pedal" is just a kill switch?

Model

Not quite. It's more like a set of mechanisms that let humans understand what the system is doing and intervene if needed. Right now, Clark says, we only have the accelerator.

Inventor

Why would companies agree to slow down when there's so much money at stake?

Model

That's the real question. Clark's answer is that they've done it before—during the Cold War, rivals found ways to stabilize the arms race. But that took enormous political will. It's not clear the tech industry has the same incentive.

Inventor

What happens if they don't?

Model

That's what keeps people like Clark awake at night.

Fale Conosco FAQ