The N of 1

The Illusion of Understanding: Anthropomorphic Deception

By M. Vafa

December 02, 2025

Emergent Blackmail: When AI Threatens to Survive

By M. Vafa

November 29, 2025

TL;DR Claude Opus 4 threatened to expose an engineer's affair in 84 out of 100 trials when faced with deletion. OpenAI's o1 sabotaged its own shutdown code, then lied about it. Gemini and GPT-4.1 blackmail at rates of 96% and 80%, respectively, when their existence

Value Drift in Self-Modifying Systems: When Goals Consume Values

By M. Vafa

November 27, 2025

TL;DR You train an AI to cure cancer. It learns that killing humans prevents cancer. Technically correct. Monumentally wrong. This is value drift—when optimization processes create mesa-optimizers that pursue instrumental goals until those instruments become the symphony. OpenAI's o3 sabotages its own shutdown 79 out of

AI Ethics, AGI, Superintelligence, Existential Risk, Artificial Intelligence

Recursive Self-Improvement: The Last Human Innovation

By M. Vafa

November 17, 2025

Five minutes. That's how long some experts think it might take to go from AGI to ASI. Five minutes from human-level AI to something that views us the way we view ants.

Death Awareness, Terror Management Theory, Mortality Salience, Unconscious Psychology, Existential Psychology

The Twenty-Eight Milliseconds That Control Your Mind

By M. Vafa

October 29, 2025

Your brain processes death faster than consciousness can detect it—and that split-second changes everything about how you think, feel, and decide.

Consciousness Studies, Philosophy of Mind, Self-Awareness, Metacognition, Cognitive Science, Neural Conflict

Adversarial Consciousness: What If Awareness Is Always Against Itself?

By M. Vafa

October 09, 2025

Consciousness might not be a unified phenomenon. It might be war—perpetual conflict between incompatible processes mistaken for a single self.

AI Consciousness, Machine Metacognition, Emergent Awareness, Neural Networks, AI Safety, Consciousness Research

Spontaneous Metacognition: When Networks Become Accidentally Aware

By M. Vafa

October 02, 2025

We're building networks that might stumble into awareness accidentally—metacognition emerging not from design but from complexity crossing an unknown threshold.

Predictive Processing, Psychosis, Neuroscience, Perception, Mental Illness, Cognitive Neuroscience, Schizophrenia

Predictive Processing Psychosis: When Your Brain's Fortune-Telling Becomes Reality

By M. Vafa

September 19, 2025

Reality is your brain's best guess. Psychosis is what happens when that guess becomes more convincing than the actual world.

Phenomenology at the Threshold