Podcast Index

Podcasts

Explora podcasts por categoría, abre episodios recientes y descarga audio para escucharlo sin conexión.

Best AI papers explained Best AI papers explained Enoch H. Kang DAVID888 Daily 每日放送 DAVID888 Daily 每日放送 DAVID888 Daily 每日放送 North Shore Nine: A Pittsburgh Pirates Podcast North Shore Nine: A Pittsburgh Pirates Podcast North Shore Nine, Bleav Texas Rangers News Today | 2 Min News | The Daily News Now! Texas Rangers News Today | 2 Min News | The Daily News Now! The Daily News Now! WRFH/Radio Free Hillsdale 101.7 FM WRFH/Radio Free Hillsdale 101.7 FM Student Content Indianapolis News Today | 2 Min News | The Daily News Now! Indianapolis News Today | 2 Min News | The Daily News Now! The Daily News Now! Lubbock News Today | 2 Min News | The Daily News Now! Lubbock News Today | 2 Min News | The Daily News Now! The Daily News Now! Biblical Deep Dive with Pastor (Dr) W.F Kumuyi Biblical Deep Dive with Pastor (Dr) W.F Kumuyi Pastor (Dr) W.F Kumuyi The Art Box The Art Box vvarts DubLife Diaries The VW Lifestyle Podcast DubLife Diaries The VW Lifestyle Podcast Joe Person No Bad Dogs Podcast No Bad Dogs Podcast Tom Davis Chicago White Sox Today | 2 Minute News | The Daily News Now! Chicago White Sox Today | 2 Minute News | The Daily News Now! The Daily News Now! Smash Sports Show Smash Sports Show Smash FM Jersey City News Today | 2 Min News | The Daily News Now! Jersey City News Today | 2 Min News | The Daily News Now! The Daily News Now! YATA - You Are The Algorithm YATA - You Are The Algorithm Permanerd Bloomberg News Now Bloomberg News Now Samejo Washington Nationals News Today | 2 Min News | The Daily News Now! Washington Nationals News Today | 2 Min News | The Daily News Now! The Daily News Now! Enlarge Your Vocabulary Enlarge Your Vocabulary Floydius-Maximus Jeremy Thomas Jefferson What are we doing Phillies post game show What are we doing Phillies post game show kevinflannery117 What are we doing Phillies post game show What are we doing Phillies post game show kevinflannery117 Motivational Speeches Motivational Speeches Motivationly piano ten thousand leaves project piano ten thousand leaves project chair house Building Fortunes Radio with Peter Mingils Building Fortunes Radio with Peter Mingils Building Fortunes Radio Swarlanation Street Podcast Swarlanation Street Podcast Ally Mac & The Lesbian Detective AI Podcast Summaries from Transcripted.ai (VIDEO) AI Podcast Summaries from Transcripted.ai (VIDEO) Transcripted.ai Experiencia Kidzania Santa Fe Experiencia Kidzania Santa Fe ACIR - Kidzania Jim Cornette’s Drive-Thru Jim Cornette’s Drive-Thru Arcadian Vanguard Self Improvement Self Improvement Bijibilla Rama Rao Blerd It Out Blerd It Out Tyrant The Weekend Primetime The Weekend Primetime MS NOW, Ayman Mohyeldin, Elise Jordan, Antonia Hylton, Catherine Rampell Erotic Audio Erotic Audio Wylde Podcasts Jacksonville Jaguars News Today | 2 Min News | The Daily News Now! Jacksonville Jaguars News Today | 2 Min News | The Daily News Now! The Daily News Now! Words of Wisdom Words of Wisdom Sunny Washington Huskies Football Today | 2 Min News | The Daily News Now! Washington Huskies Football Today | 2 Min News | The Daily News Now! The Daily News Now! Morning Bru with Jaffe & Razor Morning Bru with Jaffe & Razor NESN Fozcast - The Ben Foster Podcast Fozcast - The Ben Foster Podcast The Ringer MiedoScope Historias de Miedo en Directo MiedoScope Historias de Miedo en Directo Miedoscope
Best AI papers explained

Todo

Best AI papers explained

Enoch H. Kang

Qwen-AgentWorld: Language World Models for General Agents

June 27, 2026 3:03pm 20 min

We discuss Qwen-AgentWorld, a pioneering suite of language world models designed to simulate complex digital environments for artificial intelligence agents. By training on over 10 million trajectories across seven domai...

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

June 27, 2026 12:11am 18 min

This paper discusses a statistical framework for offline reinforcement learning using trajectory-level supervision, where only final outcomes or preferences are observed rather than step-by-step rewards. The authors intr...

SuperThoughts: Reasoning Tokens in Superposition

June 26, 2026 2:42pm 19 min

SuperThoughts is a novel framework designed to accelerate the Chain-of-Thought (CoT) reasoning process in large language models by processing tokens in superposition. Unlike traditional models that generate tokens sequen...

First-Explore PPO : Learning Meta-Exploration with Proximal Policy Optimization

June 25, 2026 11:09am 22 min

This research paper introduces First-Explore Proximal Policy Optimization (FE-PPO), a new reinforcement learning algorithm designed to improve how agents discover rewards in complex, deceptive environments. While standar...

Self-Distillation for Data-Scarce Language Model Pretraining

June 23, 2026 9:17pm 21 min

This research paper investigates self-distillation as a powerful regularization technique for pretraining language models when high-quality data is in short supply. By comparing various training strategies across differe...

Meta-Harness for Agent-State Construction

June 21, 2026 10:02am 23 min

eta-Harness is an advanced optimization system designed to improve how language-model agents process and compress long interaction histories into useful states. Unlike traditional methods that rely on manual engineering ...

ExpRL: Using Reference Solutions as Rewards for LLM Mid-Training

June 21, 2026 12:28am 21 min

Exploratory RL (ExpRL) is an automated mid-training method designed to enhance the reasoning capabilities of large language models before they undergo standard reinforcement learning. While traditional reinforcement lear...

Valid Inference with Synthetic Data via Task Exchangeability

June 18, 2026 5:12pm 13 min

This paper introduces a statistical framework for making valid scientific discoveries using synthetic data, specifically addressing concerns that artificially generated data can be biased or noisy. The authors propose a ...

GRPO is Secretly a Process Reward Model

June 17, 2026 5:56pm 20 min

This paper establishs that Group Relative Policy Optimization (GRPO), while appearing to use only final outcome rewards, inherently functions as a Process Reward Model (PRM) through its implicit sub-trajectory credit ass...

Agentic Interactions

June 16, 2026 8:52pm 19 min

This paper explores how AI agents inherit and potentially amplify human heterogeneity when tasked with negotiating on behalf of individuals. By comparing agentic interactions to a human-to-human benchmark, the study reve...

A Unifying View of Attention Sinks: Two Algorithms, Two Solutions

June 15, 2026 11:29pm 22 min

This research investigates the nature of attention sinks, which are specific tokens in Transformer models that attract disproportionate attention. The authors reveal that these identical visual patterns actually facilita...

From AGI to ASI

June 14, 2026 2:00pm 23 min

This report from Google DeepMind explores the hypothetical transition from Artificial General Intelligence (AGI), which matches human capability, to Artificial Superintelligence (ASI), which far exceeds it. The authors o...

Correct Looks Better: Pairwise Comparisons Reveal Accuracy Rankings

June 13, 2026 1:57pm 19 min

This research explores whether pairwise comparisons used to rank generative models actually reflect ground-truth accuracy. By converting multiple benchmarks into free-form formats, the authors found that Elo-style rankin...

Critical Batch Size for LLM Policy Optimization

June 10, 2026 7:13pm 18 min

This paper investigates the critical batch size (CBS) for Large Language Model (LLM) policy optimization, specifically focusing on the GRPO algorithm. The researchers break down gradient noise into inter-prompt and intra...

Self-supervised User Profile Generation for Personalization

June 08, 2026 10:36pm 22 min

This paper describes a self-supervised framework called BUMP, which is designed to improve how large language models deliver personalized content. Traditionally, creating user profiles for search and recommendation tasks...

From Augmentation to Reconstruction: Guiding the AI Disruption to the Good Place

June 07, 2026 12:07pm 22 min

This paper explores the evolution of artificial intelligence through a three-stage framework of augmentation, automation, and reconstruction. The authors argue that while AI currently improves individual tasks, the most ...

Self-Distilled Agentic Reinforcement Learning

June 07, 2026 12:03pm 22 min

The research paper introduces SDAR (Self-Distilled Agentic Reinforcement Learning), a new framework designed to improve the training of large language model agents in complex, multi-turn environments. While standard rein...

Subliminal Learning Is Steering Vector Distillation

June 04, 2026 9:21pm 23 min

This research explores subliminal learning, a phenomenon where a student language model inherits behavioral traits from a teacher model even when trained on semantically unrelated data. The authors demonstrate that this ...

Subsidizing Sequential Search

June 04, 2026 9:18pm 20 min

This paper explores a market model where competing firms use subsidies to reduce the cost of product inspection for consumers. Through a subsidy-sorting principle, the authors demonstrate that higher-quality firms natura...

Meta-Harness: End-to-End Optimization of Model Harnesses

June 02, 2026 2:13pm 17 min

This paper introduces Meta-Harness, an innovative system designed to automate harness engineering for large language models. Unlike traditional methods that rely on manual coding or compressed feedback, this system uses ...

Envía tu emisora ​​favorita

Completa el formulario de abajo. Asegúrate de seleccionar tanto País como Géneros.

Nombre
Categoría
Mantenga presionada la tecla Ctrl (Cmd en Mac) para seleccionar varios.
URL de transmisión
Logo (JPG, JPEG o PNG)

Contáctanos

Envíanos un mensaje abajo. Te responderemos en un plazo de 24 horas.

Asunto
Tu nombre
Correo electrónico
URL de la emisora o de la página
Mensaje
Cuanto es 14 más 17?
También adjuntamos tu país, navegador, página actual y algunos datos técnicos para ayudarnos a investigar.