Episodes

Latest Episode
Agent Learning via Early Experience

Agent Learning via Early Experience

Episode 1263 · · 22:45

🤗 Upvotes: 124 | cs.AI, cs.CL, cs.IR, cs.LG Authors: Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ni...

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Episode 1262 · · 21:36

🤗 Upvotes: 92 | cs.CV Authors: Xiangyu Zhao, Junming Lin, Tianhao Liang, Yifan Zhou, Wenhao Chai, Yuzhe Gu, Weiyun Wang, Kai Chen, Gen Luo, Wenw...

MemMamba: Rethinking Memory Patterns in State Space Model

MemMamba: Rethinking Memory Patterns in State Space Model

Episode 1261 · · 24:07

🤗 Upvotes: 56 | cs.LG, cs.AI, cs.CL Authors: Youjin Wang, Yangjingyi Chen, Jiahao Yan, Jiaxuan Lu, Xiao Sun Title: MemM...

UniVideo: Unified Understanding, Generation, and Editing for Videos

UniVideo: Unified Understanding, Generation, and Editing for Videos

Episode 1260 · · 26:06

🤗 Upvotes: 47 | cs.CV Authors: Cong Wei, Quande Liu, Zixuan Ye, Qiulin Wang, Xintao Wang, Pengfei Wan, Kun Gai, Wenhu Chen Title: ...

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Episode 1259 · · 26:23

🤗 Upvotes: 42 | cs.AI, cs.CL Authors: Cheng Yang, Jiaxuan Lu, Haiyuan Wan, Junchi Yu, Feiwei Qin Title: From What to Wh...

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Episode 1258 · · 21:23

🤗 Upvotes: 38 | cs.CL, cs.AI, cs.LG Authors: Soyeong Jeong, Taehee Jung, Sung Ju Hwang, Joo-Kyung Kim, Dongyeop Kang Title: ...

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Episode 1257 · · 24:32

🤗 Upvotes: 38 | cs.LG, cs.AI Authors: Yoonjeon Kim, Doohyuk Jang, Eunho Yang Title: Meta-Awareness Enhances Reasoning M...

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Episode 1256 · · 24:37

🤗 Upvotes: 36 | cs.CV Authors: Minghong Cai, Qiulin Wang, Zongli Ye, Wenze Liu, Quande Liu, Weicai Ye, Xintao Wang, Pengfei Wan, Kun Gai, Xiangy...

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Episode 1255 · · 24:20

🤗 Upvotes: 33 | cs.CL Authors: Jingyu Zhang, Haozhu Wang, Eric Michael Smith, Sid Wang, Amr Sharaf, Mahesh Pasupuleti, Benjamin Van Durme, Danie...

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Episode 1254 · · 24:18

🤗 Upvotes: 26 | cs.CL, cs.LG Authors: Leitian Tao, Ilia Kulikov, Swarnadeep Saha, Tianlu Wang, Jing Xu, Yixuan Li, Jason E Weston, Ping Yu ...

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Episode 1253 · · 24:59

🤗 Upvotes: 64 | cs.CL, cs.LG, 68T07, 68T50, I.2.7 Authors: Tianyu Fu, Zihan Min, Hanling Zhang, Jichao Yan, Guohao Dai, Wanli Ouyang, Yu Wang ...

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Episode 1252 · · 26:47

🤗 Upvotes: 60 | cs.CV Authors: Ziyuan Huang, DanDan Zheng, Cheng Zou, Rui Liu, Xiaolong Wang, Kaixiang Ji, Weilong Chai, Jianxin Sun, Libin Wang...

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Episode 1251 · · 21:15

🤗 Upvotes: 39 | cs.CV Authors: Yi Xin, Qi Qin, Siqi Luo, Kaiwen Zhu, Juncheng Yan, Yan Tai, Jiayi Lei, Yuewen Cao, Keqi Wang, Yibin Wang, Jinbin...

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Episode 1250 · · 24:34

🤗 Upvotes: 32 | cs.CL, eess.AS Authors: Cheng-Han Chiang, Xiaofei Wang, Linjie Li, Chung-Ching Lin, Kevin Lin, Shujie Liu, Zhendong Wang, Zhengy...

MATRIX: Mask Track Alignment for Interaction-aware Video Generation

MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Episode 1249 · · 23:04

🤗 Upvotes: 30 | cs.CV Authors: Siyoon Jin, Seongchan Kim, Dahyun Chung, Jaeho Lee, Hyunwook Choi, Jisu Nam, Jiyoung Kim, Seungryong Kim ...

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

Episode 1248 · · 20:18

🤗 Upvotes: 30 | cs.RO Authors: Hongzhi Zang, Mingjie Wei, Si Xu, Yongji Wu, Zhen Guo, Yuanqing Wang, Hao Lin, Liangzhi Shi, Yuqing Xie, Zhexuan ...

Vibe Checker: Aligning Code Evaluation with Human Preference

Vibe Checker: Aligning Code Evaluation with Human Preference

Episode 1247 · · 23:39

🤗 Upvotes: 28 | cs.CL, cs.AI, cs.LG, cs.SE Authors: Ming Zhong, Xiang Zhou, Ting-Yun Chang, Qingze Wang, Nan Xu, Xiance Si, Dan Garrette, Shyam ...

Less is More: Recursive Reasoning with Tiny Networks

Less is More: Recursive Reasoning with Tiny Networks

Episode 1246 · · 21:41

🤗 Upvotes: 89 | cs.LG, cs.AI Authors: Alexia Jolicoeur-Martineau Title: Less is More: Recursive Reasoning with Tiny Net...

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Episode 1245 · · 27:21

🤗 Upvotes: 59 | cs.AI, cs.CL, cs.LG Authors: Jiaru Zou, Soumya Roy, Vinay Kumar Verma, Ziyi Wang, David Wipf, Pan Lu, Sumit Negi, James Zou, Jin...

Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs

Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs

Episode 1244 · · 23:46

🤗 Upvotes: 58 | cs.AI, cs.LG Authors: Shreyas Singh, Kunal Singh, Pradeep Moturi Title: Fathom-DeepResearch: Unlocking ...