Episodes

Latest Episode
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Episode 980 · · 20:33

πŸ€— Upvotes: 50 | cs.CL, cs.AI Authors: Yangning Li, Weizhi Zhang, Yuyao Yang, Wei-Chieh Huang, Yaozu Wu, Junyu Luo, Yuanchen Bei, Henry Peng Zou,...

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Episode 979 · · 20:04

πŸ€— Upvotes: 32 | cs.CV Authors: Tiezheng Zhang, Yitong Li, Yu-cheng Chou, Jieneng Chen, Alan Yuille, Chen Wei, Junfei Xiao Title: ...

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

Episode 978 · · 19:27

πŸ€— Upvotes: 24 | cs.CL, cs.AI Authors: LG AI Research, :, Kyunghoon Bae, Eunbi Choi, Kibong Choi, Stanley Jungkyu Choi, Yemuk Choi, Kyubeen Han, ...

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Episode 977 · · 21:11

πŸ€— Upvotes: 44 | cs.LG, cs.AI, cs.CL Authors: Mingqi Wu, Zhihao Zhang, Qiaole Dong, Zhiheng Xi, Jun Zhao, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Ya...

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Episode 976 · · 20:25

πŸ€— Upvotes: 43 | cs.CV, eess.AS Authors: Youliang Zhang, Zhaoyang Li, Duomin Wang, Jiahe Zhang, Deyu Zhou, Zixin Yin, Xili Dai, Gang Yu, Xiu Li ...

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Episode 975 · · 21:55

πŸ€— Upvotes: 31 | cs.CL, cs.LG Authors: Sangmin Bae, Yujin Kim, Reza Bayat, Sungnyun Kim, Jiyoun Ha, Tal Schuster, Adam Fisch, Hrayr Harutyunyan, ...

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Episode 974 · · 22:15

πŸ€— Upvotes: 25 | cs.CV, cs.AI, cs.CL Authors: Mingxian Lin, Wei Huang, Yitang Li, Chengjie Jiang, Kui Wu, Fangwei Zhong, Shengju Qian, Xin Wang, ...

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once

Episode 973 · · 25:24

πŸ€— Upvotes: 22 | cs.CL Authors: Zhuoshi Pan, Qizhi Pei, Yu Li, Qiyao Sun, Zinan Tang, H. Vicky Zhao, Conghui He, Lijun Wu Title: ...

Test-Time Scaling with Reflective Generative Model

Test-Time Scaling with Reflective Generative Model

Episode 972 · · 21:33

πŸ€— Upvotes: 68 | cs.LG, cs.CL Authors: Zixiao Wang, Yuxin Wang, Xiaorui Wang, Mengting Xing, Jie Gao, Jianjun Xu, Guangcan Liu, Chenhui Jin, Zhuo...

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Episode 971 · · 21:14

πŸ€— Upvotes: 47 | cs.CV, cs.CL Authors: Yana Wei, Liang Zhao, Jianjian Sun, Kangheng Lin, Jisheng Yin, Jingcheng Hu, Yinmin Zhang, En Yu, Haoran L...

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Episode 970 · · 21:04

πŸ€— Upvotes: 45 | cs.CV, cs.AI, cs.CL, cs.HC, cs.LG Authors: Luke Rivard, Sun Sun, Hongyu Guo, Wenhu Chen, Yuntian Deng Title: ...

CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering

CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering

Episode 969 · · 20:45

πŸ€— Upvotes: 43 | cs.CV Authors: Zhengqing Wang, Yuefan Wu, Jiacheng Chen, Fuyang Zhang, Yasutaka Furukawa Title: CLiFT: ...

KV Cache Steering for Inducing Reasoning in Small Language Models

KV Cache Steering for Inducing Reasoning in Small Language Models

Episode 968 · · 22:47

πŸ€— Upvotes: 26 | cs.CL, cs.AI Authors: Max Belitsky, Dawid J. Kopiczko, Michael Dorkenwald, M. Jehanzeb Mirza, Cees G. M. Snoek, Yuki M. Asano ...

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Episode 967 · · 20:41

πŸ€— Upvotes: 24 | cs.CL, cs.AI Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel ...

Neural-Driven Image Editing

Neural-Driven Image Editing

Episode 966 · · 20:36

πŸ€— Upvotes: 22 | cs.CV Authors: Pengfei Zhou, Jie Xia, Xiaopeng Peng, Wangbo Zhao, Zilong Ye, Zekai Li, Suorong Yang, Jiadong Pan, Yuanxiang Chen...

Scaling RL to Long Videos

Scaling RL to Long Videos

Episode 965 · · 22:44

πŸ€— Upvotes: 95 | cs.CV, cs.AI, cs.CL Authors: Yukang Chen, Wei Huang, Baifeng Shi, Qinghao Hu, Hanrong Ye, Ligeng Zhu, Zhijian Liu, Pavlo Molchan...

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Episode 964 · · 23:15

πŸ€— Upvotes: 83 | cs.CV Authors: Vera Soboleva, Aibek Alanov, Andrey Kuznetsov, Konstantin Sobolev Title: T-LoRA: Single ...

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Episode 963 · · 20:07

πŸ€— Upvotes: 37 | cs.CV, cs.AI, cs.CL Authors: Haochen Wang, Xiangtai Li, Zilong Huang, Anran Wang, Jiacong Wang, Tao Zhang, Jiani Zheng, Sule Bai...

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Episode 962 · · 22:58

πŸ€— Upvotes: 29 | cs.CV Authors: JingLi Lin, Chenming Zhu, Runsen Xu, Xiaohan Mao, Xihui Liu, Tai Wang, Jiangmiao Pang Title: ...

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Episode 961 · · 22:47

πŸ€— Upvotes: 24 | cs.CV, cs.AI Authors: Jeongseok Hyun, Sukjun Hwang, Su Ho Han, Taeoh Kim, Inwoong Lee, Dongyoon Wee, Joon-Young Lee, Seon Joo Ki...

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Episode 960 · · 20:50

πŸ€— Upvotes: 23 | cs.CV, cs.AI Authors: Haoyu Wu, Diankun Wu, Tianyu He, Junliang Guo, Yang Ye, Yueqi Duan, Jiang Bian Title: ...

PyVision: Agentic Vision with Dynamic Tooling

PyVision: Agentic Vision with Dynamic Tooling

Episode 959 · · 19:03

πŸ€— Upvotes: 22 | cs.CL, cs.AI, cs.CV Authors: Shitian Zhao, Haoquan Zhang, Shaoheng Lin, Ming Li, Qilong Wu, Kaipeng Zhang, Chen Wei ...

4KAgent: Agentic Any Image to 4K Super-Resolution

4KAgent: Agentic Any Image to 4K Super-Resolution

Episode 958 · · 26:45

πŸ€— Upvotes: 56 | cs.CV, eess.IV Authors: Yushen Zuo, Qi Zheng, Mingyang Wu, Xinrui Jiang, Renjie Li, Jian Wang, Yide Zhang, Gengchen Mai, Lihong ...

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Episode 957 · · 16:47

πŸ€— Upvotes: 41 | cs.CV Authors: Ke Fan, Shunlin Lu, Minyue Dai, Runyi Yu, Lixing Xiao, Zhiyang Dou, Junting Dong, Lizhuang Ma, Jingbo Wang ...

Perception-Aware Policy Optimization for Multimodal Reasoning

Perception-Aware Policy Optimization for Multimodal Reasoning

Episode 956 · · 23:28

πŸ€— Upvotes: 34 | cs.CL Authors: Zhenhailong Wang, Xuehang Guo, Sofia Stoica, Haiyang Xu, Hongru Wang, Hyeonjeong Ha, Xiusi Chen, Yangyi Chen, Min...

MIRIX: Multi-Agent Memory System for LLM-Based Agents

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Episode 955 · · 21:31

πŸ€— Upvotes: 33 | cs.CL, cs.AI Authors: Yu Wang, Xi Chen Title: MIRIX: Multi-Agent Memory System for LLM-Based Agents ...

Rethinking Verification for LLM Code Generation: From Generation to Testing

Rethinking Verification for LLM Code Generation: From Generation to Testing

Episode 954 · · 22:17

πŸ€— Upvotes: 23 | cs.CL Authors: Zihan Ma, Taolin Zhang, Maosong Cao, Junnan Liu, Wenwei Zhang, Minnan Luo, Songyang Zhang, Kai Chen ...

SingLoRA: Low Rank Adaptation Using a Single Matrix

SingLoRA: Low Rank Adaptation Using a Single Matrix

Episode 953 · · 21:29

πŸ€— Upvotes: 68 | cs.AI Authors: David BensaΓ―d, Noam Rotstein, Roy Velich, Daniel BensaΓ―d, Ron Kimmel Title: SingLoRA: Lo...

A Survey on Latent Reasoning

A Survey on Latent Reasoning

Episode 952 · · 20:14

πŸ€— Upvotes: 60 | cs.CL Authors: Rui-Jie Zhu, Tianhao Peng, Tianhao Cheng, Xingwei Qu, Jinfa Huang, Dawei Zhu, Hao Wang, Kaiwen Xue, Xuanliang Zha...

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Episode 951 · · 24:02

πŸ€— Upvotes: 45 | cs.CV Authors: Yunhan Yang, Yufan Zhou, Yuan-Chen Guo, Zi-Xin Zou, Yukun Huang, Ying-Tian Liu, Hao Xu, Ding Liang, Yan-Pei Cao, ...