Computerphile

Computerphile

AI Sandbagging (2025x20)


Yayınlanma tarihi: May 23, 2025

Following the theme of AI research and safety, Aric Floyd talks about how some Large Language Models might follow the all too human trait of sandbagging - "lying" about their true capabilities.

  • Prömiyeri: May 2013
  • Bölümler: 857
  • Takipçiler: 0
  • Devam eden
  • YouTube
  • saat 8