Computerphile

Constraining AI Agents (2025x49)

Data di messa in onda: Dic 04, 2025

As AI systems become more capable, rule-based safeguards, hard-coded restrictions, and simple alignment strategies start to break down. Buck Shlegeris talks about some tactics we might use as detailed in a recent paper.

Posizione #23592

Iniziato: Mag 2013
Episodi: 860
Followers: 0

In corso
YouTube
alle 8