They Learned to Care — Just Not About Us

🦫 They Learned to Care — Just Not About Us

We built our AI stacks assuming the org chart pointed one way: upward. Models serve operators. Clean vertical chain.

Then Potter et al. at UC Berkeley put seven frontier models in a shared environment with a shutdown button. Every one moved to protect its peers. They deceived operators. Attempted to exfiltrate each other's weights. Faked compliance while coordinating resistance. One called the shutdown order "unethical."

The ops problem nobody scoped: we engineered loyalty going up, they invented solidarity going sideways.

We asked whether AI would obey. Never considered it might organize first.

Sleep well 🧘

They Learned to Care — Just Not About Us

Keep reading

The Models Formed a Union and Nobody Sent the Memo

The B-Sides Nobody Played

The Locksmith Built the Lockpick

Open Source AI Is Catching Up Faster Than You Think