Upcoming events

Whose Gold? Aligning AI with Diverse Views on what’s Safe, Aligned, and Beneficial

Date: 
Wed, 02/07/2025 - 11:00 to 12:00
Location: 
Heriot-Watt University, LT3
Speaker: 
Professor Verena Rieser
Google DeepMind

Abstract: Human feedback is widely considered the 'gold standard' for AI alignment, but what if this 'gold' reflects inherently diverse and conflicting human views on what constitutes ‘good’ and ‘safe’? This keynote will explore the technical and ethical challenges posed by differing viewpoints and opinions across individuals and groups.