Episodes

Latest Episode
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Episode 831 · · 22:53

🤗 Upvotes: 37 | cs.CL, cs.AI, cs.CV, cs.LG Authors: Lai Wei, Yuting Li, Chen Wang, Yue Wang, Linghe Kong, Weiran Huang, Lichao Sun ...

SageAttention2++: A More Efficient Implementation of SageAttention2

SageAttention2++: A More Efficient Implementation of SageAttention2

Episode 830 · · 19:42

🤗 Upvotes: 33 | cs.LG, cs.AI, cs.AR, cs.CV Authors: Jintao Zhang, Xiaoming Xu, Jia Wei, Haofeng Huang, Pengle Zhang, Chendong Xiang, Jun Zhu, Ji...

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Episode 829 · · 21:34

🤗 Upvotes: 31 | cs.CL, cs.AI, cs.CV, cs.LG Authors: Lai Wei, Yuting Li, Kaipeng Zheng, Chen Wang, Yue Wang, Linghe Kong, Lichao Sun, Weiran Huan...

Fostering Video Reasoning via Next-Event Prediction

Fostering Video Reasoning via Next-Event Prediction

Episode 828 · · 24:54

🤗 Upvotes: 27 | cs.CV, cs.AI, cs.CL Authors: Haonan Wang, Hongfu Liu, Xiangyan Liu, Chao Du, Kenji Kawaguchi, Ye Wang, Tianyu Pang ...

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Episode 827 · · 23:21

🤗 Upvotes: 26 | cs.GR, cs.CV, cs.LG Authors: Chong Zeng, Yue Dong, Pieter Peers, Hongzhi Wu, Xin Tong Title: RenderForm...

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Episode 826 · · 22:14

🤗 Upvotes: 85 | cs.AI, cs.CL, cs.CV, cs.HC Authors: Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Z...

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Episode 825 · · 21:06

🤗 Upvotes: 73 | cs.AI, cs.CV Authors: Jiakang Yuan, Tianshuo Peng, Yilei Jiang, Yiting Lu, Renrui Zhang, Kaituo Feng, Chaoyou Fu, Tao Chen, Lei ...

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Episode 824 · · 17:55

🤗 Upvotes: 73 | cs.CV, cs.AI, cs.CL, cs.MA Authors: Wei Pang, Kevin Qinghong Lin, Xiangru Jian, Xi He, Philip Torr Title: ...

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Episode 823 · · 24:24

🤗 Upvotes: 57 | cs.CV Authors: Yiren Song, Cheng Liu, Mike Zheng Shou Title: OmniConsistency: Learning Style-Agnostic C...

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Episode 822 · · 19:53

🤗 Upvotes: 49 | cs.CV, cs.AI Authors: Shenghai Yuan, Xianyi He, Yufan Deng, Yang Ye, Jinfa Huang, Bin Lin, Jiebo Luo, Li Yuan Title...

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Episode 821 · · 21:52

🤗 Upvotes: 43 | cs.AI, cs.CL Authors: Junteng Liu, Yuanxiang Fan, Zhuo Jiang, Han Ding, Yongyi Hu, Chi Zhang, Yiqi Shi, Shitong Weng, Aili Chen,...

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Episode 820 · · 18:54

🤗 Upvotes: 41 | cs.CL, cs.AI Authors: Michael Hassid, Gabriel Synnaeve, Yossi Adi, Roy Schwartz Title: Don't Overthink ...

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Episode 819 · · 20:35

🤗 Upvotes: 40 | cs.CL, cs.AI, cs.LG Authors: Gleb Mezentsev, Ivan Oseledets Title: Exploring the Latent Capacity of LLM...

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

Episode 818 · · 22:23

🤗 Upvotes: 39 | cs.CL, cs.AI Authors: Amirhosein Ghasemabadi, Keith G. Mills, Baochun Li, Di Niu Title: Guided by Gut: ...

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Episode 817 · · 20:58

🤗 Upvotes: 35 | cs.CL, cs.CV Authors: Yunxin Li, Xinyu Chen, Zitao Li, Zhenyu Liu, Longyue Wang, Wenhan Luo, Baotian Hu, Min Zhang ...

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Episode 816 · · 20:46

🤗 Upvotes: 178 | cs.CL, cs.AI Authors: Khalil Hennara, Muhammad Hreden, Mohamed Motaism Hamed, Zeina Aldallal, Sara Chrouf, Safwan AlModhayan ...

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Episode 815 · · 22:14

🤗 Upvotes: 124 | cs.CL, cs.AI, cs.CV Authors: Xuyang Liu, Zichen Wen, Shaobo Wang, Junjie Chen, Zhishan Tao, Yubo Wang, Xiangqi Jin, Chang Zou, ...

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Episode 814 · · 19:19

🤗 Upvotes: 58 | cs.CV Authors: Valerii Startsev, Alexander Ustyuzhanin, Alexey Kirillov, Dmitry Baranchuk, Sergey Kastryulin Title:...

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

Episode 813 · · 23:32

🤗 Upvotes: 56 | cs.AI, cs.CE, cs.CL Authors: Guilong Lu, Xuntao Guo, Rongjunchen Zhang, Wenqiao Zhu, Ji Liu Title: BizF...

PATS: Process-Level Adaptive Thinking Mode Switching

PATS: Process-Level Adaptive Thinking Mode Switching

Episode 812 · · 21:12

🤗 Upvotes: 44 | cs.CL Authors: Yi Wang, Junxiao Liu, Shimao Zhang, Jiajun Chen, Shujian Huang Title: PATS: Process-Leve...

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Episode 811 · · 20:57

🤗 Upvotes: 42 | cs.CL Authors: Taeyoon Kwon, Dongwook Choi, Sunghwan Kim, Hyojun Kim, Seungjun Moon, Beong-woo Kwak, Kuan-Hao Huang, Jinyoung Ye...

ARM: Adaptive Reasoning Model

ARM: Adaptive Reasoning Model

Episode 810 · · 22:44

🤗 Upvotes: 40 | cs.CL Authors: Siye Wu, Jian Xie, Yikai Zhang, Aili Chen, Kai Zhang, Yu Su, Yanghua Xiao Title: ARM: Ad...

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Episode 809 · · 21:22

🤗 Upvotes: 33 | cs.CL, cs.AI Authors: Jiangjie Chen, Qianyu He, Siyu Yuan, Aili Chen, Zhicheng Cai, Weinan Dai, Hongli Yu, Qiying Yu, Xuefeng Li...

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Episode 808 · · 21:22

🤗 Upvotes: 33 | cs.CL, cs.AI Authors: Junnan Liu, Hongwei Liu, Linchen Xiao, Shudong Liu, Taolin Zhang, Zihan Ma, Songyang Zhang, Kai Chen ...

B-score: Detecting biases in large language models using response history

B-score: Detecting biases in large language models using response history

Episode 807 · · 23:16

🤗 Upvotes: 25 | cs.LG, cs.CL Authors: An Vo, Mohammad Reza Taesiri, Daeyoung Kim, Anh Totti Nguyen Title: B-score: Dete...

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Episode 806 · · 20:40

🤗 Upvotes: 95 | cs.LG, cs.CL Authors: Alan Arazi, Eilam Shapira, Roi Reichart Title: TabSTAR: A Foundation Tabular Mode...

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Episode 805 · · 24:07

🤗 Upvotes: 60 | cs.CL Authors: Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou,...

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Episode 804 · · 22:46

🤗 Upvotes: 55 | cs.LG Authors: Roberto L. Castro, Andrei Panferov, Soroush Tabesh, Oliver Sieberling, Jiale Chen, Mahdi Nikdan, Saleh Ashkboos, ...

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Episode 803 · · 20:36

🤗 Upvotes: 51 | cs.AI Authors: Doohyuk Jang, Yoonjeon Kim, Chanjae Park, Hyun Ryu, Eunho Yang Title: Reasoning Model is...

One RL to See Them All: Visual Triple Unified Reinforcement Learning

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Episode 802 · · 20:04

🤗 Upvotes: 51 | cs.CV, cs.CL Authors: Yan Ma, Linge Du, Xuyang Shen, Shaoxiang Chen, Pengfei Li, Qibing Ren, Lizhuang Ma, Yuchao Dai, Pengfei Li...