Episodes

Latest Episode
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Episode 418 · 21:16

🤗 Upvotes: 10 | cs.CV, cs.CL Authors: Kairui Hu, Penghao Wu, Fanyi Pu, Wang Xiao, Yuanhan Zhang, Xiang Yue, Bo Li, Ziwei Liu Title:...

DiffuEraser: A Diffusion Model for Video Inpainting

Episode 417 · 21:50

🤗 Upvotes: 8 | cs.CV Authors: Xiaowen Li, Haolan Xue, Peiran Ren, Liefeng Bo Title: DiffuEraser: A Diffusion Model for ...

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Episode 416 · 29:27

🤗 Upvotes: 8 | cs.CV, cs.CL, cs.LG Authors: Jiayi Lei, Renrui Zhang, Xiangfei Hu, Weifeng Lin, Zhen Li, Wenjian Sun, Ruoyi Du, Le Zhuo, Zhongyu ...

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Episode 415 · 21:16

🤗 Upvotes: 7 | cs.LG, cs.AI Authors: Yen-Ting Lin, Di Jin, Tengyu Xu, Tianhao Wu, Sainbayar Sukhbaatar, Chen Zhu, Yun He, Yun-Nung Chen, Jason W...

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Episode 414 · 22:08

🤗 Upvotes: 5 | cs.CV, cs.AI, cs.LG Authors: Tao Liu, Kai Wang, Senmao Li, Joost van de Weijer, Fahad Shahbaz Khan, Shiqi Yang, Yaxing Wang, Jian...

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Episode 413 · 21:02

🤗 Upvotes: 109 | cs.CL, cs.AI, cs.LG Authors: DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu,...

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Episode 412 · 23:29

🤗 Upvotes: 44 | cs.CV Authors: Boqiang Zhang, Kehan Li, Zesen Cheng, Zhiqiang Hu, Yuqian Yuan, Guanzheng Chen, Sicong Leng, Yuming Jiang, Hang Z...

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Episode 411 · 24:51

🤗 Upvotes: 43 | cs.CL, cs.GR, cs.MA Authors: Zhenran Xu, Longyue Wang, Jifang Wang, Zhouyi Li, Senbao Shi, Xue Yang, Yiyu Wang, Baotian Hu, Jun ...

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Episode 410 · 22:41

🤗 Upvotes: 42 | cs.CL Authors: Yafu Li, Xuyang Hu, Xiaoye Qu, Linjie Li, Yu Cheng Title: Test-Time Preference Optimizat...

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Episode 409 · 18:30

🤗 Upvotes: 39 | cs.AI, cs.LG Authors: Kimi Team, Angang Du, Bofei Gao, Bowei Xing, Changjiu Jiang, Cheng Chen, Cheng Li, Chenjun Xiao, Chenzhuan...

Autonomy-of-Experts Models

Episode 408 · 20:07

🤗 Upvotes: 31 | cs.CL, cs.AI, cs.LG Authors: Ang Lv, Ruobing Xie, Yining Qian, Songhao Wu, Xingwu Sun, Zhanhui Kang, Di Wang, Rui Yan ...

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Episode 407 · 22:31

🤗 Upvotes: 13 | cs.CL Authors: Haotian Luo, Li Shen, Haiying He, Yibo Wang, Shiwei Liu, Wei Li, Naiqiang Tan, Xiaochun Cao, Dacheng Tao ...

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Episode 406 · 22:07

🤗 Upvotes: 13 | cs.CL Authors: Yantao Liu, Zijun Yao, Rui Min, Yixin Cao, Lei Hou, Juanzi Li Title: Pairwise RM: Perfor...

IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems

Episode 405 · 24:44

🤗 Upvotes: 7 | cs.CL, cs.AI, cs.LG Authors: Elad Levi, Ilan Kadar Title: IntellAgent: A Multi-Agent Framework for Evalu...

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Episode 404 · 21:35

🤗 Upvotes: 3 | cs.CV, cs.AI, cs.GR, cs.RO Authors: Jianing Yang, Alexander Sax, Kevin J. Liang, Mikael Henaff, Hao Tang, Ang Cao, Joyce Chai, Fr...

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Episode 403 · 20:45

🤗 Upvotes: 61 | cs.AI Authors: Siyu Yuan, Zehui Chen, Zhiheng Xi, Junjie Ye, Zhengyin Du, Jiecao Chen Title: Agent-R: T...

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Episode 402 · 25:15

🤗 Upvotes: 59 | cs.CV, cs.AI, cs.CL Authors: Yilun Zhao, Lujing Xie, Haowei Zhang, Guo Gan, Yitao Long, Zhiyuan Hu, Tongyan Hu, Weiyuan Chen, Ch...

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Episode 401 · 23:40

🤗 Upvotes: 51 | cs.LG, cs.CL Authors: Zihan Qiu, Zeyu Huang, Bo Zheng, Kaiyue Wen, Zekun Wang, Rui Men, Ivan Titov, Dayiheng Liu, Jingren Zhou, ...

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Episode 400 · 26:00

🤗 Upvotes: 32 | cs.CV Authors: Daniel Garibi, Shahar Yadin, Roni Paiss, Omer Tov, Shiran Zada, Ariel Ephrat, Tomer Michaeli, Inbar Mosseri, Tali...

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Episode 399 · 20:57

🤗 Upvotes: 31 | cs.AI, cs.CL, cs.CV, cs.HC Authors: Yujia Qin, Yining Ye, Junjie Fang, Haoming Wang, Shihao Liang, Shizuo Tian, Junda Zhang, Jia...

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Episode 398 · 21:26

🤗 Upvotes: 26 | cs.CV, cs.CL Authors: Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Dua...

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Episode 397 · 23:22

🤗 Upvotes: 20 | cs.CL, cs.CV Authors: Zhenhailong Wang, Haiyang Xu, Junyang Wang, Xi Zhang, Ming Yan, Ji Zhang, Fei Huang, Heng Ji ...

Reasoning Language Models: A Blueprint

Episode 396 · 21:43

🤗 Upvotes: 18 | cs.AI, cs.CL Authors: Maciej Besta, Julia Barth, Eric Schreiber, Ales Kubicek, Afonso Catarino, Robert Gerstenberger, Piotr Nycz...

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Episode 395 · 20:46

🤗 Upvotes: 16 | cs.CV Authors: Zibo Zhao, Zeqiang Lai, Qingxiang Lin, Yunfei Zhao, Haolin Liu, Shuhui Yang, Yifei Feng, Mingxin Yang, Sheng Zhan...

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Episode 394 · 19:28

🤗 Upvotes: 15 | cs.LG, cs.AI Authors: Hongjin Su, Ruoxi Sun, Jinsung Yoon, Pengcheng Yin, Tao Yu, Sercan Ö. Arık Title: ...

GameFactory: Creating New Games with Generative Interactive Videos

Episode 393 · 22:49

🤗 Upvotes: 48 | cs.CV Authors: Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu Title: GameFactory: C...

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Episode 392 · 19:25

🤗 Upvotes: 8 | cs.CV Authors: Zhongwei Ren, Yunchao Wei, Xun Guo, Yao Zhao, Bingyi Kang, Jiashi Feng, Xiaojie Jin Title: ...

SEAL: Entangled White-box Watermarks on Low-Rank Adaptation

Episode 391 · 22:16

🤗 Upvotes: 2 | cs.AI, cs.CR Authors: Giyeong Oh, Saejin Kim, Woohyun Cho, Sangkyu Lee, Jiwan Chung, Dokyung Song, Youngjae Yu Title...

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Episode 390 · 18:53

🤗 Upvotes: 53 | cs.CL, cs.AI, cs.LG Authors: Zhenru Zhang, Chujie Zheng, Yangzhen Wu, Beichen Zhang, Runji Lin, Bowen Yu, Dayiheng Liu, Jingren ...

Tensor Product Attention Is All You Need

Episode 389 · 20:57

🤗 Upvotes: 38 | cs.CL, cs.AI, cs.LG Authors: Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao ...