The Inside View

A podcast by Michaël Trazzi

54 Episodes

Owain Evans - AI Situational Awareness, Out-of-Context Reasoning
Published: 23/08/2024
[Crosspost] Adam Gleave on Vulnerabilities in GPT-4 APIs (+ extra Nathan Labenz interview)
Published: 17/05/2024
Ethan Perez on Selecting Alignment Research Projects (ft. Mikita Balesni & Henry Sleight)
Published: 09/04/2024
Emil Wallner on Sora, Generative AI Startups and AI optimism
Published: 20/02/2024
Evan Hubinger on Sleeper Agents, Deception and Responsible Scaling Policies
Published: 12/02/2024
[Jan 2023] Jeffrey Ladish on AI Augmented Cyberwarfare and compute monitoring
Published: 27/01/2024
Holly Elmore on pausing AI
Published: 22/01/2024
Podcast Retrospective and Next Steps
Published: 09/01/2024
Kellin Pelrine on beating the strongest go AI
Published: 04/10/2023
Paul Christiano's views on "doom" (ft. Robert Miles)
Published: 29/09/2023
Neel Nanda on mechanistic interpretability, superposition and grokking
Published: 21/09/2023
Joscha Bach on how to stop worrying and love AI
Published: 08/09/2023
Erik Jones on Automatically Auditing Large Language Models
Published: 11/08/2023
Dylan Patel on the GPU Shortage, Nvidia and the Deep Learning Supply Chain
Published: 09/08/2023
Tony Wang on Beating Superhuman Go AIs with Adversarial Policies
Published: 04/08/2023
David Bau on Editing Facts in GPT, AI Safety and Interpretability
Published: 01/08/2023
Alexander Pan on the MACHIAVELLI benchmark
Published: 26/07/2023
Vincent Weisser on Funding AI Alignment Research
Published: 24/07/2023
[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment
Published: 19/07/2023
Nina Rimsky on AI Deception and Mesa-optimisation
Published: 18/07/2023

The goal of this podcast is to create a place where people discuss their inside views about existential risk from AI.
