Announcing the Human-Aligned AI Summer School 2024

Tomáš Gavenčiak, Jan Kulveit

May 20, 2024

The fourth Human-aligned AI Summer School will be held in Prague from 17th to 20th July 2024. We will meet for four intensive days of talks, workshops, and discussions covering the latest trends in AI alignment research and broader framings of the field. Apply now; applications are evaluated on a rolling basis.

The school is intended for people interested in learning more about AI alignment topics, PhD students, researchers working in ML/AI outside academia, and talented students.

The school is jointly organized by ACS research, the Centre for Theoretical Study at Charles University, Daniel Filan, and many volunteers, and is supported by a grant from the Long-Term Future Fund.

Program of the school

The school is focused on teaching and exploring approaches and frameworks, less on presentation of the latest research results. The content of the school is mostly technical – it is assumed the attendees understand current ML approaches and some of the underlying theoretical frameworks.

This year, the school will cover these main topics:

Overview of the alignment problem and current approaches.
Alignment of large language models: RLHF, DPO and beyond. Methods used to align current large language models and their shortcomings.
Evaluating and measuring AI systems: How to understand and oversee current AI systems on the behavioral level.
Interpretability and the science of deep learning: What's going on inside of the models?
AI alignment theory: While 'prosaic' approaches to alignment focus on current systems, theory aims for deeper understanding and better generalizability.
Alignment in the context of complex systems and multi-agent settings: What should the AI be aligned to? In most realistic settings, we can expect multiple stakeholders and many interacting AI systems; any solution to the alignment problem needs to work in multi-agent settings.

The school consists of lectures and topical series, focused smaller-group workshops and discussions, expert panels, and opportunities for networking, project brainstorming and informal discussions.

Confirmed speakers

Stephen Casper – Algorithmic Alignment Group, MIT.
Stanislav Fort – Google DeepMind.
Jesse Hoogland – Timaeus.
Neel Nanda – Google DeepMind.
Mary Phuong – Google DeepMind.
Deger Turan – AI Objectives Institute and Metaculus.
Vikrant Varma – Google DeepMind.

(more to be announced later)