Computerphile

AI Sandbagging (2025x20)

Fecha de emisión: May 23, 2025

Following the theme of AI research and safety, Aric Floyd talks about how some Large Language Models might follow the all too human trait of sandbagging - "lying" about their true capabilities.

Ranking #23592

Estrenada: May 2013
Episodios: 844
Seguidores: 0

En emisión
YouTube
a las 8