AI Will Try to Cheat & Escape (aka Rob Miles was Right!) - Computerphile

As large language models improve, the tokens they predict form ever more complicated and nuanced outcomes. Rob Miles and Ryan Greenblatt discuss "Alignment Faking", a paper from Ryan's team that explores ideas Rob covered in a series of Computerphile videos in 2017.

The Alignment Faking paper: https://tinyurl.com/C-Paper-AlignmentFaking

Ryan Greenblatt is chief scientist at Redwood Research (a nonprofit AI safety and security research organization): https://tinyurl.com/C-RedwoodResearch

Rob Miles makes videos on AI Safety: https://tinyurl.com/C-RobSKMiles

NB: if the video seems a bit 'smeary', that's an artefact of attempting to cancel out the flickering of the light in the background, something I missed while shooting and have done my best to fix in the edit. -Sean

Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile

This video was filmed and edited by Sean Riley.

Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com
Category: Artificial Intelligence
Tags: computers, computerphile, computer
