ACS

☰

Post archive

May 20, 2024Announcing the Human-Aligned AI Summer School 2024

January 21, 2024InterLab – a toolkit for experiments with multi-agent interactions

June 4, 2023Contingency: A Conceptual Tool from Evolutionary Biology for Alignment

Our academic publications

See our Research page

Our posts elsewhere

December 26, 2024A Three-Layer Model of LLM Psychology

December 20, 2024"Alignment Faking" frame is somewhat fake

November 27, 2024Hierarchical Agency: A Missing Piece in AI Alignment

January 15, 2024Three Types of Constraints in the Space of Agents

November 7, 2023Box inversion revisited

October 26, 2023The Value Change Problem (sequence)

April 14, 2023The self-unalignment problem

April 10, 2023Why Simulator AIs want to be Active Inference AIs

March 27, 2023Lessons from Convergent Evolution for AI Alignment

March 22, 2023The space of systems and the space of maps

February 22, 2023Cyborg Periods: There will be multiple AI transitions

February 14, 2023The Cave Allegory Revisited: Understanding GPT's Worldview

May 4, 2022Announcing the Alignment of Complex Systems Research Group