Episodes

Latest Episode
3D and 4D World Modeling: A Survey

3D and 4D World Modeling: A Survey

Episode 1131 · · 20:25

🤗 Upvotes: 40 | cs.CV, cs.RO Authors: Lingdong Kong, Wesley Yang, Jianbiao Mei, Youquan Liu, Ao Liang, Dekai Zhu, Dongyue Lu, Wei Yin, Xiaotao H...

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Episode 1130 · · 24:44

🤗 Upvotes: 21 | cs.LG, cs.AI, cs.CL Authors: Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye...

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Episode 1129 · · 23:21

🤗 Upvotes: 66 | cs.CL Authors: Tong Zheng, Hongming Zhang, Wenhao Yu, Xiaoyang Wang, Xinyu Yang, Runpeng Dai, Rui Liu, Huiwen Bao, Chengsong Hua...

Visual Representation Alignment for Multimodal Large Language Models

Visual Representation Alignment for Multimodal Large Language Models

Episode 1128 · · 26:13

🤗 Upvotes: 54 | cs.CV Authors: Heeji Yoon, Jaewoo Jung, Junwan Kim, Hyungyu Choi, Heeseong Shin, Sangbeom Lim, Honggyu An, Chaehyun Kim, Jisang ...

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Episode 1127 · · 21:52

🤗 Upvotes: 45 | cs.CV, cs.AI, cs.CL Authors: Xin Lai, Junyi Li, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao Title: Mi...

Reconstruction Alignment Improves Unified Multimodal Models

Reconstruction Alignment Improves Unified Multimodal Models

Episode 1126 · · 24:13

🤗 Upvotes: 31 | cs.CV, cs.AI, cs.LG Authors: Ji Xie, Trevor Darrell, Luke Zettlemoyer, XuDong Wang Title: Reconstructio...

UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Episode 1125 · · 23:02

🤗 Upvotes: 24 | cs.CV, cs.LG Authors: Yufeng Cheng, Wenxu Wu, Shaojin Wu, Mengqi Huang, Fei Ding, Qian He Title: UMO: S...

Reverse-Engineered Reasoning for Open-Ended Generation

Reverse-Engineered Reasoning for Open-Ended Generation

Episode 1124 · · 12:24

🤗 Upvotes: 107 | cs.AI, cs.CL Authors: Haozhe Wang, Haoran Que, Qixin Xu, Minghao Liu, Wangchunshu Zhou, Jiazhan Feng, Wanjun Zhong, Wei Ye, Ton...

Does DINOv3 Set a New Medical Vision Standard?

Does DINOv3 Set a New Medical Vision Standard?

Episode 1123 · · 10:46

🤗 Upvotes: 28 | cs.CV Authors: Che Liu, Yinda Chen, Haoyuan Shi, Jinpeng Lu, Bailiang Jian, Jiazhen Pan, Linghan Cai, Jiayi Wang, Yundi Zhang, J...

Symbolic Graphics Programming with Large Language Models

Symbolic Graphics Programming with Large Language Models

Episode 1122 · · 13:57

🤗 Upvotes: 31 | cs.CV, cs.LG Authors: Yamei Chen, Haoquan Zhang, Yangyi Huang, Zeju Qiu, Kaipeng Zhang, Yandong Wen, Weiyang Liu Ti...

Set Block Decoding is a Language Model Inference Accelerator

Set Block Decoding is a Language Model Inference Accelerator

Episode 1121 · · 16:21

🤗 Upvotes: 31 | cs.LG Authors: Itai Gat, Heli Ben-Hamu, Marton Havasi, Daniel Haziza, Jeremy Reizenstein, Gabriel Synnaeve, David Lopez-Paz, Bri...

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Episode 1120 · · 22:57

🤗 Upvotes: 100 | cs.CL Authors: Yang Wang, Chenghao Xiao, Chia-Yi Hsiao, Zi Yan Chang, Chi-Li Chen, Tyler Loakman, Chenghua Lin Tit...

From Editor to Dense Geometry Estimator

From Editor to Dense Geometry Estimator

Episode 1119 · · 18:45

🤗 Upvotes: 63 | cs.CV, cs.AI Authors: JiYuan Wang, Chunyu Lin, Lei Sun, Rongying Liu, Lang Nie, Mingxing Li, Kang Liao, Xiangxiang Chu, Yao Zhao...

Towards a Unified View of Large Language Model Post-Training

Towards a Unified View of Large Language Model Post-Training

Episode 1118 · · 23:07

🤗 Upvotes: 42 | cs.LG, cs.AI, cs.CL Authors: Xingtai Lv, Yuxin Zuo, Youbang Sun, Hongyi Liu, Yuntian Wei, Zhekai Chen, Lixuan He, Xuekai Zhu, Ka...

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Episode 1117 · · 20:11

🤗 Upvotes: 41 | cs.AI Authors: Haiyuan Wan, Chen Yang, Junchi Yu, Meiqi Tu, Jiaxuan Lu, Di Yu, Jianbao Cao, Ben Gao, Jiaqing Xie, Aoran Wang, We...

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Episode 1116 · · 23:03

🤗 Upvotes: 41 | cs.CL Authors: Qinyan Zhang, Xinping Lei, Ruijie Miao, Yu Fu, Haojie Fan, Le Chang, Jiafan Hou, Dingling Zhang, Zhongfei Hou, Zi...

Open Data Synthesis For Deep Research

Open Data Synthesis For Deep Research

Episode 1115 · · 23:03

🤗 Upvotes: 37 | cs.CL, cs.AI Authors: Ziyi Xia, Kun Luo, Hongjin Qian, Zheng Liu Title: Open Data Synthesis For Deep Re...

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Episode 1114 · · 21:57

🤗 Upvotes: 33 | cs.AI, cs.CV, cs.RO Authors: Huang Fang, Mengxi Zhang, Heng Dong, Wei Li, Zixuan Wang, Qifeng Zhang, Xueyun Tian, Yucheng Hu, Ha...

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Episode 1113 · · 24:16

🤗 Upvotes: 71 | cs.AI, cs.CL, cs.CV, cs.HC Authors: Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Junjie Fang, Junting Lu, Longxiang Li...

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Episode 1112 · · 23:48

🤗 Upvotes: 63 | cs.CV, cs.LG Authors: Xiyao Wang, Chunyuan Li, Jianwei Yang, Kai Zhang, Bo Liu, Tianyi Xiong, Furong Huang Title: ...