Computerphile
AI Sandbagging (2025x20)
Yayınlanma tarihi: May 23, 2025
Following the theme of AI research and safety, Aric Floyd talks about how some Large Language Models might follow the all too human trait of sandbagging - "lying" about their true capabilities.
- Prömiyeri: May 2013
- Bölümler: 857
- Takipçiler: 0
- Devam eden
- YouTube
- saat 8