Episodes

Latest Episode
Chain-of-Retrieval Augmented Generation

Episode 432 · 23:23

🤗 Upvotes: 26 | cs.IR, cs.CL Authors: Liang Wang, Haonan Chen, Nan Yang, Xiaolong Huang, Zhicheng Dou, Furu Wei Title: ...

Redundancy Principles for MLLMs Benchmarks

Episode 431 · 22:20

🤗 Upvotes: 22 | cs.CL, cs.AI Authors: Zicheng Zhang, Xiangyu Zhao, Xinyu Fang, Chunyi Li, Xiaohong Liu, Xiongkuo Min, Haodong Duan, Kai Chen, Gu...

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Episode 430 · 23:35

🤗 Upvotes: 13 | cs.CL, cs.AI, cs.LG Authors: Zhengyang Tang, Ziniu Li, Zhenyang Xiao, Tian Ding, Ruoyu Sun, Benyou Wang, Dayiheng Liu, Fei Huang...

RL + Transformer = A General-Purpose Problem Solver

Episode 429 · 24:24

🤗 Upvotes: 7 | cs.LG, cs.AI Authors: Micah Rentschler, Jesse Roberts Title: RL + Transformer = A General-Purpose Proble...

Relightable Full-Body Gaussian Codec Avatars

Episode 428 · 20:53

🤗 Upvotes: 5 | cs.CV, cs.GR Authors: Shaofei Wang, Tomas Simon, Igor Santesteban, Timur Bagautdinov, Junxuan Li, Vasu Agrawal, Fabian Prada, Sho...

Question Answering on Patient Medical Records with Private Fine-Tuned LLMs

Episode 427 · 21:57

🤗 Upvotes: 4 | cs.CL, cs.AI Authors: Sara Kothari, Ayush Gupta Title: Question Answering on Patient Medical Records wit...

GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing

Episode 426 · 23:27

🤗 Upvotes: 3 | cs.CV Authors: Akashah Shabbir, Mohammed Zumri, Mohammed Bennamoun, Fahad S. Khan, Salman Khan Title: Ge...

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation

Episode 425 · 21:05

🤗 Upvotes: 2 | cs.CV Authors: Yuning Cui, Syed Waqas Zamir, Salman Khan, Alois Knoll, Mubarak Shah, Fahad Shahbaz Khan Title: ...

Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

Episode 424 · 24:19

🤗 Upvotes: 2 | cs.CV Authors: Yang You, Yixin Li, Congyue Deng, Yue Wang, Leonidas Guibas Title: Multiview Equivariance...

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Episode 423 · 23:50

🤗 Upvotes: 46 | cs.LG, cs.AI, cs.MA, I.2.11 Authors: Alsu Sagirova, Yuri Kuratov, Mikhail Burtsev Title: SRMT: Shared M...

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Episode 422 · 20:44

🤗 Upvotes: 33 | cs.CL Authors: Zhenghao Lin, Zihao Tang, Xiao Liu, Yeyun Gong, Yi Cheng, Qi Chen, Hang Li, Ying Xin, Ziyue Yang, Kailai Yang, Yu...

Improving Video Generation with Human Feedback

Episode 421 · 24:20

🤗 Upvotes: 30 | cs.CV, cs.AI, cs.GR, cs.LG Authors: Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin ...

Temporal Preference Optimization for Long-Form Video Understanding

Episode 420 · 24:47

🤗 Upvotes: 15 | cs.CV, cs.AI, cs.CL, cs.LG, cs.RO Authors: Rui Li, Xiaohan Wang, Yuhui Zhang, Zeyu Wang, Serena Yeung-Levy Title: ...

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Episode 419 · 21:04

🤗 Upvotes: 14 | cs.CV, cs.AI, cs.CL Authors: Ziyu Guo, Renrui Zhang, Chengzhuo Tong, Zhizheng Zhao, Peng Gao, Hongsheng Li, Pheng-Ann Heng ...

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Episode 418 · 21:16

🤗 Upvotes: 10 | cs.CV, cs.CL Authors: Kairui Hu, Penghao Wu, Fanyi Pu, Wang Xiao, Yuanhan Zhang, Xiang Yue, Bo Li, Ziwei Liu Title:...

DiffuEraser: A Diffusion Model for Video Inpainting

Episode 417 · 21:50

🤗 Upvotes: 8 | cs.CV Authors: Xiaowen Li, Haolan Xue, Peiran Ren, Liefeng Bo Title: DiffuEraser: A Diffusion Model for ...

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Episode 416 · 29:27

🤗 Upvotes: 8 | cs.CV, cs.CL, cs.LG Authors: Jiayi Lei, Renrui Zhang, Xiangfei Hu, Weifeng Lin, Zhen Li, Wenjian Sun, Ruoyi Du, Le Zhuo, Zhongyu ...

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Episode 415 · 21:16

🤗 Upvotes: 7 | cs.LG, cs.AI Authors: Yen-Ting Lin, Di Jin, Tengyu Xu, Tianhao Wu, Sainbayar Sukhbaatar, Chen Zhu, Yun He, Yun-Nung Chen, Jason W...

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Episode 414 · 22:08

🤗 Upvotes: 5 | cs.CV, cs.AI, cs.LG Authors: Tao Liu, Kai Wang, Senmao Li, Joost van de Weijer, Fahad Shahbaz Khan, Shiqi Yang, Yaxing Wang, Jian...

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Episode 413 · 21:02

🤗 Upvotes: 109 | cs.CL, cs.AI, cs.LG Authors: DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu,...

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Episode 412 · 23:29

🤗 Upvotes: 44 | cs.CV Authors: Boqiang Zhang, Kehan Li, Zesen Cheng, Zhiqiang Hu, Yuqian Yuan, Guanzheng Chen, Sicong Leng, Yuming Jiang, Hang Z...

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Episode 411 · 24:51

🤗 Upvotes: 43 | cs.CL, cs.GR, cs.MA Authors: Zhenran Xu, Longyue Wang, Jifang Wang, Zhouyi Li, Senbao Shi, Xue Yang, Yiyu Wang, Baotian Hu, Jun ...

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Episode 410 · 22:41

🤗 Upvotes: 42 | cs.CL Authors: Yafu Li, Xuyang Hu, Xiaoye Qu, Linjie Li, Yu Cheng Title: Test-Time Preference Optimizat...

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Episode 409 · 18:30

🤗 Upvotes: 39 | cs.AI, cs.LG Authors: Kimi Team, Angang Du, Bofei Gao, Bowei Xing, Changjiu Jiang, Cheng Chen, Cheng Li, Chenjun Xiao, Chenzhuan...

Autonomy-of-Experts Models

Episode 408 · 20:07

🤗 Upvotes: 31 | cs.CL, cs.AI, cs.LG Authors: Ang Lv, Ruobing Xie, Yining Qian, Songhao Wu, Xingwu Sun, Zhanhui Kang, Di Wang, Rui Yan ...

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Episode 407 · 22:31

🤗 Upvotes: 13 | cs.CL Authors: Haotian Luo, Li Shen, Haiying He, Yibo Wang, Shiwei Liu, Wei Li, Naiqiang Tan, Xiaochun Cao, Dacheng Tao ...

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Episode 406 · 22:07

🤗 Upvotes: 13 | cs.CL Authors: Yantao Liu, Zijun Yao, Rui Min, Yixin Cao, Lei Hou, Juanzi Li Title: Pairwise RM: Perfor...

IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems

Episode 405 · 24:44

🤗 Upvotes: 7 | cs.CL, cs.AI, cs.LG Authors: Elad Levi, Ilan Kadar Title: IntellAgent: A Multi-Agent Framework for Evalu...

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Episode 404 · 21:35

🤗 Upvotes: 3 | cs.CV, cs.AI, cs.GR, cs.RO Authors: Jianing Yang, Alexander Sax, Kevin J. Liang, Mikael Henaff, Hao Tang, Ang Cao, Joyce Chai, Fr...

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Episode 403 · 20:45

🤗 Upvotes: 61 | cs.AI Authors: Siyu Yuan, Zehui Chen, Zhiheng Xi, Junjie Ye, Zhengyin Du, Jiecao Chen Title: Agent-R: T...