Podcast Index

Podcasts

Explora podcasts por categoría, abre episodios recientes y descarga audio para escucharlo sin conexión.

Todo Noticias Política Música Deportes Tecnología Comedia Arte Negocios Educación Ficción Gobierno Salud & fitness Historia Niños & familia Ocio Religión & espiritualidad Ciencia Sociedad & cultura Crimen real TV & cine

Best AI papers explained Enoch H. Kang

40 nuances de Next FeuilleBlanche Studio

BrainStuff iHeartPodcasts

Auto Rádio Razão Automóvel

Roz & Mocha Frequency Podcast Network

脳科学, 脳LIFE TBS RADIO

ChatGPT - Sol Good Shorts Sol Good Network

Chat GPT Questions Sol Good Network

Chat GPT Universal Wisdom Sol Good Network

Chat GPT Collective Consciousness Sol Good Network

Chat GPT Infinite Intelligence Sol Good Network

Horatio Hornblower Daily C.S. Forester

Old Time Radio Horatio Hornblower C.S. Forester

Chat GPT Podcast Sol Good Network

The Daily Tech Brief | AI, Technology, Startups and Innovation News The Daily Tech Brief

Advances and Innovations in Actuation Systems PRASAD BHONDE

The Stephen Wolfram Podcast Wolfram Research

The AI for Sales Podcast Chad Burmeister

Inspiring Tech Leaders - AI, Technology Strategy & Digital Transformation Dave Roberts

DX Today | No-Hype Podcast & News About AI & DX Rick Spair

Breitband Deutschlandfunk Kultur

The Automated Daily - Hacker News Edition TrendTeller

The Neural Daily Neural Network Media

Cup o' Go Jonathan Hall & Shay Nehmad

Payments Brief: FinTech, Banking & Payments News Payments Brief Team

AI Builds It: Easy Coding Tools AIAgentStore.ai

The Automated Daily - AI News Edition TrendTeller

AI Fire Daily AIFire.co

Fallthrough Fallthrough Media

The Automated Daily TrendTeller

Ungovernable Misfits Ungovernable Misfits

VK6ARN Amateur Radio News - NewsWest VK6ARN

Token Metrics Daily Pulse Token Metrics

Podcast Science Podcast Science

Oxide and Friends Oxide Computer Company

M365.FM - Modern work, security, and productivity with Microsoft 365 Mirko Peters - Founder of m365.fm, m365.show and m365con.net

TFTC: A Bitcoin Podcast Marty Bent

Tecnología

Best AI papers explained

Enoch H. Kang

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

June 27, 2026 12:11am 18 min

This paper discusses a statistical framework for offline reinforcement learning using trajectory-level supervision, where only final outcomes or preferences are observed rather than step-by-step rewards. The authors intr...

SuperThoughts: Reasoning Tokens in Superposition

June 26, 2026 2:42pm 19 min

SuperThoughts is a novel framework designed to accelerate the Chain-of-Thought (CoT) reasoning process in large language models by processing tokens in superposition. Unlike traditional models that generate tokens sequen...

First-Explore PPO : Learning Meta-Exploration with Proximal Policy Optimization

June 25, 2026 11:09am 22 min

This research paper introduces First-Explore Proximal Policy Optimization (FE-PPO), a new reinforcement learning algorithm designed to improve how agents discover rewards in complex, deceptive environments. While standar...

Self-Distillation for Data-Scarce Language Model Pretraining

June 23, 2026 9:17pm 21 min

This research paper investigates self-distillation as a powerful regularization technique for pretraining language models when high-quality data is in short supply. By comparing various training strategies across differe...

Meta-Harness for Agent-State Construction

June 21, 2026 10:02am 23 min

eta-Harness is an advanced optimization system designed to improve how language-model agents process and compress long interaction histories into useful states. Unlike traditional methods that rely on manual engineering ...

ExpRL: Using Reference Solutions as Rewards for LLM Mid-Training

June 21, 2026 12:28am 21 min

Exploratory RL (ExpRL) is an automated mid-training method designed to enhance the reasoning capabilities of large language models before they undergo standard reinforcement learning. While traditional reinforcement lear...

Valid Inference with Synthetic Data via Task Exchangeability

June 18, 2026 5:12pm 13 min

This paper introduces a statistical framework for making valid scientific discoveries using synthetic data, specifically addressing concerns that artificially generated data can be biased or noisy. The authors propose a ...

GRPO is Secretly a Process Reward Model

June 17, 2026 5:56pm 20 min

This paper establishs that Group Relative Policy Optimization (GRPO), while appearing to use only final outcome rewards, inherently functions as a Process Reward Model (PRM) through its implicit sub-trajectory credit ass...

Agentic Interactions

June 16, 2026 8:52pm 19 min

This paper explores how AI agents inherit and potentially amplify human heterogeneity when tasked with negotiating on behalf of individuals. By comparing agentic interactions to a human-to-human benchmark, the study reve...

A Unifying View of Attention Sinks: Two Algorithms, Two Solutions

June 15, 2026 11:29pm 22 min

This research investigates the nature of attention sinks, which are specific tokens in Transformer models that attract disproportionate attention. The authors reveal that these identical visual patterns actually facilita...

From AGI to ASI

June 14, 2026 2:00pm 23 min

This report from Google DeepMind explores the hypothetical transition from Artificial General Intelligence (AGI), which matches human capability, to Artificial Superintelligence (ASI), which far exceeds it. The authors o...

Correct Looks Better: Pairwise Comparisons Reveal Accuracy Rankings

June 13, 2026 1:57pm 19 min

This research explores whether pairwise comparisons used to rank generative models actually reflect ground-truth accuracy. By converting multiple benchmarks into free-form formats, the authors found that Elo-style rankin...

Critical Batch Size for LLM Policy Optimization

June 10, 2026 7:13pm 18 min

This paper investigates the critical batch size (CBS) for Large Language Model (LLM) policy optimization, specifically focusing on the GRPO algorithm. The researchers break down gradient noise into inter-prompt and intra...

Self-supervised User Profile Generation for Personalization

June 08, 2026 10:36pm 22 min

This paper describes a self-supervised framework called BUMP, which is designed to improve how large language models deliver personalized content. Traditionally, creating user profiles for search and recommendation tasks...

From Augmentation to Reconstruction: Guiding the AI Disruption to the Good Place

June 07, 2026 12:07pm 22 min

This paper explores the evolution of artificial intelligence through a three-stage framework of augmentation, automation, and reconstruction. The authors argue that while AI currently improves individual tasks, the most ...

Self-Distilled Agentic Reinforcement Learning

June 07, 2026 12:03pm 22 min

The research paper introduces SDAR (Self-Distilled Agentic Reinforcement Learning), a new framework designed to improve the training of large language model agents in complex, multi-turn environments. While standard rein...

Subliminal Learning Is Steering Vector Distillation

June 04, 2026 9:21pm 23 min

This research explores subliminal learning, a phenomenon where a student language model inherits behavioral traits from a teacher model even when trained on semantically unrelated data. The authors demonstrate that this ...

Subsidizing Sequential Search

June 04, 2026 9:18pm 20 min

This paper explores a market model where competing firms use subsidies to reduce the cost of product inspection for consumers. Through a subsidy-sorting principle, the authors demonstrate that higher-quality firms natura...

Meta-Harness: End-to-End Optimization of Model Harnesses

June 02, 2026 2:13pm 17 min

This paper introduces Meta-Harness, an innovative system designed to automate harness engineering for large language models. Unlike traditional methods that rely on manual coding or compressed feedback, this system uses ...

Self-Improving Language Models with Bidirectional Evolutionary Search

June 01, 2026 6:38pm 20 min

Researchers have developed Bidirectional Evolutionary Search (BES) to overcome the limitations of standard language model sampling, which often struggles with sparse feedback and predictable outputs. While traditional me...

Tecla	Acción
P	Reproducir/Pausar
▶	Siguiente estación
◀	Estación anterior
M	Silenciar/Activar sonido
S	Mostrar/Ocultar reproductor
0 a 9	Porcentaje de volumen (0 es 100%)
K	Mostrar controles del teclado
Esc	Ocultar controles del teclado
T	Desplazarse al inicio

Podcasts

Best AI papers explained

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

SuperThoughts: Reasoning Tokens in Superposition

First-Explore PPO : Learning Meta-Exploration with Proximal Policy Optimization

Self-Distillation for Data-Scarce Language Model Pretraining

Meta-Harness for Agent-State Construction

ExpRL: Using Reference Solutions as Rewards for LLM Mid-Training

Valid Inference with Synthetic Data via Task Exchangeability

GRPO is Secretly a Process Reward Model

Agentic Interactions

A Unifying View of Attention Sinks: Two Algorithms, Two Solutions

From AGI to ASI

Correct Looks Better: Pairwise Comparisons Reveal Accuracy Rankings

Critical Batch Size for LLM Policy Optimization

Self-supervised User Profile Generation for Personalization

From Augmentation to Reconstruction: Guiding the AI Disruption to the Good Place

Self-Distilled Agentic Reinforcement Learning

Subliminal Learning Is Steering Vector Distillation

Subsidizing Sequential Search

Meta-Harness: End-to-End Optimization of Model Harnesses

Self-Improving Language Models with Bidirectional Evolutionary Search

Elige una emisora de radio para escuchar...

Envía tu emisora favorita

Contáctanos

2000s
2010s
2020s
30s
40s
50s
60s
70s
80s
90s
Acoustic
Adult
Afrobeats
AI
Alternative
Ambient
Arabic Music
Artists
Asian
Ballads
Bhangra
Bluegrass
Blues
Bollywood
Bossa Nova
Britpop
Carnival
Celtic
Chillout
Christmas
Classical
Club
Comedy
Community
Country
Dance
Deep House
Disco
Drum & Bass
Dub
Easy
Eclectic
EDM
Electro
Folk/Local
Funk
Ghazal
Gospel
Grunge
Hip Hop
Hits
House
Indie
Instrumental
J-Pop
Jazz
K-Pop
Kids
Kizomba
Latin
Live
Lo-fi
Lounge
Manele
Metal
New Age
New Wave
Oldies
OTR
Pop
Progressive
Punk
R&B
Ranchera
Rap
Reggae
Reggaeton
Religious
Rock
Rumba
Salsa
Samba
Scanner
Schlager
Sega
Ska
Soul
Soundtracks
Sports
Swing
Talk/News
Tango
Techno
Top 40
Trance
World
Zouk/Tropic
Afghanistan
Albania
Algeria
Andorra
Angola
Anguilla
Antigua
Argentina
Armenia
Aruba
Australia
Austria
Azerbaijan
Bahamas
Bahrain
Bangladesh
Barbados
Belarus
Belgium
Belize
Benin
Bolivia
Bosnia
Botswana
Brazil
Brunei
Bulgaria
Burkina Faso
Burundi
Cambodia
Cameroon
Canada
Cape Verde
Cayman Isl.
Chad
Chile
China
Colombia
Comoros
Congo
Costa Rica
Croatia
Cuba
Curaçao
Cyprus
Czech Rep.
Denmark
Dominican R.
East Timor
Ecuador
Egypt
El Salvador
Estonia
Ethiopia
Falkland Isl.
Faroe Isl.
Finland
France
Gabon
Gambia
Georgia
Germany
Ghana
Greece
Greenland
Guatemala
Guinea
Guyana
Guyane
Haiti
Honduras
Hong Kong
Hungary
Iceland
India
Indonesia
Iran
Iraq
Ireland
Israel
Italy
Ivory Coast
Jamaica
Japan
Jordan
Kazakhstan
Kenya
Kosovo
Kuwait
Kyrgyzstan
Laos
Latvia
Lebanon
Lesotho
Liberia
Libya
Liechtenstein
Lithuania
Luxembourg
Macao
Macedonia
Madagascar
Malawi
Malaysia
Maldives
Mali
Malta
Mauritania
Mauritius
Mayotte
Mexico
Moldova
Monaco
Mongolia
Montenegro
Morocco
Mozambique
Myanmar
Namibia
Nepal
Netherlands
New Zealand
Nicaragua
Nigeria
North Korea
Norway
Oman
Pakistan
Palestine
Panama
Papua N. G.
Paraguay
Peru
Philippines
Poland
Portugal
Puerto Rico
Qatar
Réunion
Romania
Russia
Rwanda
San Marino
Saudi Arabia
Senegal
Serbia
Seychelles
Sierra Leone
Singapore
Slovakia
Slovenia
Somalia
South Africa
South Korea
Spain
Sri Lanka
St Helena
Sudan
Suriname
Swaziland
Sweden
Switzerland
Syria
Taiwan
Tajikistan
Tanzania
Thailand
Togo
Trinidad & T.
Tunisia
Türkiye
UAE
Uganda
UK
Ukraine
Uruguay
USA
Uzbekistan
Venezuela
Vietnam
Wales
Yemen
Zambia
Zimbabwe

Podcasts

Best AI papers explained

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

SuperThoughts: Reasoning Tokens in Superposition

First-Explore PPO : Learning Meta-Exploration with Proximal Policy Optimization

Self-Distillation for Data-Scarce Language Model Pretraining

Meta-Harness for Agent-State Construction

ExpRL: Using Reference Solutions as Rewards for LLM Mid-Training

Valid Inference with Synthetic Data via Task Exchangeability

GRPO is Secretly a Process Reward Model

Agentic Interactions

A Unifying View of Attention Sinks: Two Algorithms, Two Solutions

From AGI to ASI

Correct Looks Better: Pairwise Comparisons Reveal Accuracy Rankings

Critical Batch Size for LLM Policy Optimization

Self-supervised User Profile Generation for Personalization

From Augmentation to Reconstruction: Guiding the AI Disruption to the Good Place

Self-Distilled Agentic Reinforcement Learning

Subliminal Learning Is Steering Vector Distillation

Subsidizing Sequential Search

Meta-Harness: End-to-End Optimization of Model Harnesses

Self-Improving Language Models with Bidirectional Evolutionary Search

Envía tu emisora ​​favorita

Contáctanos

Envía tu emisora favorita