Computerphile
Sleeper Agents in Large Language Models (2025x36)
Air date: Sep 12, 2025
It's an older paper, but it checks out. Rob Miles discusses the problem of 'Sleeper Agents', where LLMs could have hidden traits that we don't discover until it's too late.
- Premiered: May 2013
- Episodes: 857