Computerphile

AI Sandbagging (2025x20)

Air date: May 23, 2025

Following the theme of AI research and safety, Aric Floyd talks about how some Large Language Models might exhibit the all-too-human trait of sandbagging: "lying" about their true capabilities.

Rank #23592

Premiered: May 2013
Episodes: 857
Followers: 0

Ongoing
YouTube
8 o'clock