Episodes

Latest Episode
Table-R1: Inference-Time Scaling for Table Reasoning

Table-R1: Inference-Time Scaling for Table Reasoning

Episode 843 · · 21:28

🤗 Upvotes: 66 | cs.CL Authors: Zheyuan Yang, Lyuhao Chen, Arman Cohan, Yilun Zhao Title: Table-R1: Inference-Time Scali...

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Episode 842 · · 19:54

🤗 Upvotes: 54 | cs.CV, cs.AI, cs.LG, I.2.6; I.2 Authors: Diankun Wu, Fangfu Liu, Yi-Hsin Hung, Yueqi Duan Title: Spatia...

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Episode 841 · · 25:44

🤗 Upvotes: 51 | cs.CV, cs.AI, cs.CL Authors: Tingyu Song, Tongyan Hu, Guo Gan, Yilun Zhao Title: VF-Eval: Evaluating Mu...

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Episode 840 · · 22:04

🤗 Upvotes: 45 | cs.CL Authors: Ang Lv, Ruobing Xie, Xingwu Sun, Zhanhui Kang, Rui Yan Title: The Climb Carves Wisdom De...

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Episode 839 · · 19:00

🤗 Upvotes: 39 | cs.AI, cs.CL, cs.CV Authors: Chenyu Yang, Shiqian Su, Shi Liu, Xuan Dong, Yue Yu, Weijie Su, Xuehui Wang, Zhaoyang Liu, Jinguo Z...

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Episode 838 · · 21:32

🤗 Upvotes: 28 | cs.CV Authors: Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, Lin Sui, Xinhao Li, Yan Zhong, Y. Charles, Xinyu Zhou, Xu Sun ...

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Episode 837 · · 21:32

🤗 Upvotes: 21 | cs.CL, cs.AI, cs.SE Authors: Guangtao Zeng, Maohao Shen, Delin Chen, Zhenting Qi, Subhro Das, Dan Gutfreund, David Cox, Gregory ...

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Episode 836 · · 22:08

🤗 Upvotes: 84 | cs.LG, cs.AI, cs.CL Authors: Ganqu Cui, Yuchen Zhang, Jiacheng Chen, Lifan Yuan, Zhi Wang, Yuxin Zuo, Haozhan Li, Yuchen Fan, Hu...

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Episode 835 · · 21:03

🤗 Upvotes: 63 | cs.SE, cs.CL Authors: Ibragim Badertdinov, Alexander Golubev, Maksim Nekrashevich, Anton Shevtsov, Simon Karasik, Andrei Andrius...

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Episode 834 · · 23:01

🤗 Upvotes: 59 | cs.CL, cs.AI, cs.LG, cs.PF, I.2.7 Authors: Tianyu Fu, Yi Ge, Yichen You, Enshu Liu, Zhihang Yuan, Guohao Dai, Shengen Yan, Huazh...

Skywork Open Reasoner 1 Technical Report

Skywork Open Reasoner 1 Technical Report

Episode 833 · · 22:21

🤗 Upvotes: 45 | cs.LG, cs.AI, cs.CL Authors: Jujie He, Jiacai Liu, Chris Yuhao Liu, Rui Yan, Chaojie Wang, Peng Cheng, Xiaoyu Zhang, Fuxiang Zha...

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Episode 832 · · 21:23

🤗 Upvotes: 44 | cs.CV, cs.CL, cs.LG Authors: Yi Ding, Ruqi Zhang Title: Sherlock: Self-Correcting Reasoning in Vision-L...

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Episode 831 · · 22:53

🤗 Upvotes: 37 | cs.CL, cs.AI, cs.CV, cs.LG Authors: Lai Wei, Yuting Li, Chen Wang, Yue Wang, Linghe Kong, Weiran Huang, Lichao Sun ...

SageAttention2++: A More Efficient Implementation of SageAttention2

SageAttention2++: A More Efficient Implementation of SageAttention2

Episode 830 · · 19:42

🤗 Upvotes: 33 | cs.LG, cs.AI, cs.AR, cs.CV Authors: Jintao Zhang, Xiaoming Xu, Jia Wei, Haofeng Huang, Pengle Zhang, Chendong Xiang, Jun Zhu, Ji...

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Episode 829 · · 21:34

🤗 Upvotes: 31 | cs.CL, cs.AI, cs.CV, cs.LG Authors: Lai Wei, Yuting Li, Kaipeng Zheng, Chen Wang, Yue Wang, Linghe Kong, Lichao Sun, Weiran Huan...

Fostering Video Reasoning via Next-Event Prediction

Fostering Video Reasoning via Next-Event Prediction

Episode 828 · · 24:54

🤗 Upvotes: 27 | cs.CV, cs.AI, cs.CL Authors: Haonan Wang, Hongfu Liu, Xiangyan Liu, Chao Du, Kenji Kawaguchi, Ye Wang, Tianyu Pang ...

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Episode 827 · · 23:21

🤗 Upvotes: 26 | cs.GR, cs.CV, cs.LG Authors: Chong Zeng, Yue Dong, Pieter Peers, Hongzhi Wu, Xin Tong Title: RenderForm...

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Episode 826 · · 22:14

🤗 Upvotes: 85 | cs.AI, cs.CL, cs.CV, cs.HC Authors: Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Z...

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Episode 825 · · 21:06

🤗 Upvotes: 73 | cs.AI, cs.CV Authors: Jiakang Yuan, Tianshuo Peng, Yilei Jiang, Yiting Lu, Renrui Zhang, Kaituo Feng, Chaoyou Fu, Tao Chen, Lei ...

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Episode 824 · · 17:55

🤗 Upvotes: 73 | cs.CV, cs.AI, cs.CL, cs.MA Authors: Wei Pang, Kevin Qinghong Lin, Xiangru Jian, Xi He, Philip Torr Title: ...

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Episode 823 · · 24:24

🤗 Upvotes: 57 | cs.CV Authors: Yiren Song, Cheng Liu, Mike Zheng Shou Title: OmniConsistency: Learning Style-Agnostic C...

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Episode 822 · · 19:53

🤗 Upvotes: 49 | cs.CV, cs.AI Authors: Shenghai Yuan, Xianyi He, Yufan Deng, Yang Ye, Jinfa Huang, Bin Lin, Jiebo Luo, Li Yuan Title...

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Episode 821 · · 21:52

🤗 Upvotes: 43 | cs.AI, cs.CL Authors: Junteng Liu, Yuanxiang Fan, Zhuo Jiang, Han Ding, Yongyi Hu, Chi Zhang, Yiqi Shi, Shitong Weng, Aili Chen,...

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Episode 820 · · 18:54

🤗 Upvotes: 41 | cs.CL, cs.AI Authors: Michael Hassid, Gabriel Synnaeve, Yossi Adi, Roy Schwartz Title: Don't Overthink ...

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Episode 819 · · 20:35

🤗 Upvotes: 40 | cs.CL, cs.AI, cs.LG Authors: Gleb Mezentsev, Ivan Oseledets Title: Exploring the Latent Capacity of LLM...

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

Episode 818 · · 22:23

🤗 Upvotes: 39 | cs.CL, cs.AI Authors: Amirhosein Ghasemabadi, Keith G. Mills, Baochun Li, Di Niu Title: Guided by Gut: ...

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Episode 817 · · 20:58

🤗 Upvotes: 35 | cs.CL, cs.CV Authors: Yunxin Li, Xinyu Chen, Zitao Li, Zhenyu Liu, Longyue Wang, Wenhan Luo, Baotian Hu, Min Zhang ...

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Episode 816 · · 20:46

🤗 Upvotes: 178 | cs.CL, cs.AI Authors: Khalil Hennara, Muhammad Hreden, Mohamed Motaism Hamed, Zeina Aldallal, Sara Chrouf, Safwan AlModhayan ...

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Episode 815 · · 22:14

🤗 Upvotes: 124 | cs.CL, cs.AI, cs.CV Authors: Xuyang Liu, Zichen Wen, Shaobo Wang, Junjie Chen, Zhishan Tao, Yubo Wang, Xiangqi Jin, Chang Zou, ...

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Episode 814 · · 19:19

🤗 Upvotes: 58 | cs.CV Authors: Valerii Startsev, Alexander Ustyuzhanin, Alexey Kirillov, Dmitry Baranchuk, Sergey Kastryulin Title:...