Best AI papers explained

Un podcast de Enoch H. Kang

515 Épisodes

LoRA Without Regret
Publié: 01/10/2025
Actor-Critic without Actor: Critic-Guided Denoising for RL
Publié: 29/09/2025
DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?
Publié: 29/09/2025
Linear Transformers Implicitly Discover Unified Numerical Algorithms
Publié: 29/09/2025
Regularizing Extrapolation in Causal Inference
Publié: 27/09/2025
DoubleGen - Debiased Generative Modeling of Counterfactuals
Publié: 27/09/2025
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Publié: 27/09/2025
Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
Publié: 27/09/2025
Learning without training: The implicit dynamics of in-context learning
Publié: 24/09/2025
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model
Publié: 24/09/2025
Open Problems in Mechanistic Interpretability
Publié: 21/09/2025
Maestro: Joint Graph & Config Optimization for Reliable AI Agents
Publié: 21/09/2025
Thought Anchors: Which LLM Reasoning Steps Matter?
Publié: 21/09/2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Publié: 09/09/2025
RL's Razor: Why Online RL Forgets Less
Publié: 07/09/2025
Why Language Models Hallucinate
Publié: 06/09/2025
ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
Publié: 06/09/2025
Sample Efficient Preference Alignment in LLMs via Active Exploration
Publié: 06/09/2025
Adventures in Demand Analysis Using AI
Publié: 04/09/2025
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Publié: 01/09/2025

4 / 26

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

515 Épisodes

LoRA Without Regret

Actor-Critic without Actor: Critic-Guided Denoising for RL

DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?

Linear Transformers Implicitly Discover Unified Numerical Algorithms

Regularizing Extrapolation in Causal Inference

DoubleGen - Debiased Generative Modeling of Counterfactuals

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

Learning without training: The implicit dynamics of in-context learning

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model

Open Problems in Mechanistic Interpretability

Maestro: Joint Graph & Config Optimization for Reliable AI Agents

Thought Anchors: Which LLM Reasoning Steps Matter?

Sample Complexity and Representation Ability of Test-time Scaling Paradigms

RL's Razor: Why Online RL Forgets Less

Why Language Models Hallucinate

ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Sample Efficient Preference Alignment in LLMs via Active Exploration

Adventures in Demand Analysis Using AI

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs