Computerphile

Computerphile

Sleeper Agents in Large Language Models (2025x36)


Data di messa in onda: Set 12, 2025

It's an older paper, but it checks out. Rob Miles discusses the problem of 'Sleeper Agents' - where LLMs could have hidden traits we don't know about until it's too late.

  • Iniziato: Mag 2013
  • Episodi: 854
  • Followers: 0
  • In corso
  • YouTube
  • alle 8