Computerphile

Constraining AI Agents (2025x49)


Air date: Dec 04, 2025

As AI systems become more capable, rule-based safeguards, hard-coded restrictions, and simple alignment strategies start to break down. Buck Shlegeris talks about some tactics we might use, as detailed in a recent paper.

  • Premiered: May 2013
  • Episodes: 860
  • Running
  • YouTube
  • at 8