Episodes

Latest Episode
ModernVBERT: Towards Smaller Visual Document Retrievers

ModernVBERT: Towards Smaller Visual Document Retrievers

Episode 1223 · · 23:21

🤗 Upvotes: 24 | cs.IR Authors: Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Colombo, Manuel Faysse ...

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Episode 1222 · · 30:49

🤗 Upvotes: 24 | cs.LG, cs.CL Authors: Yanxu Chen, Zijun Yao, Yantao Liu, Jin Ye, Jianing Yu, Lei Hou, Juanzi Li Title: ...

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Episode 1221 · · 24:12

🤗 Upvotes: 100 | cs.AI, cs.CL Authors: Fang Wu, Weihao Xuan, Heli Qi, Ximing Lu, Aaron Tu, Li Erran Li, Yejin Choi Title: ...

GEM: A Gym for Agentic LLMs

GEM: A Gym for Agentic LLMs

Episode 1220 · · 25:53

🤗 Upvotes: 53 | cs.LG, cs.AI, cs.CL Authors: Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Simon Yu, Xiangxin Zhou, Haotian Xu, Shaopan Xiong,...

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Episode 1219 · · 26:50

🤗 Upvotes: 52 | cs.RO, cs.CV Authors: Hengtao Li, Pengxiang Ding, Runze Suo, Yihao Wang, Zirui Ge, Dongyuan Zang, Kexian Yu, Mingyang Sun, Hongy...

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Episode 1218 · · 24:50

🤗 Upvotes: 32 | cs.LG, cs.AI, cs.CL Authors: Ziniu Li, Congliang Chen, Tianyun Yang, Tian Ding, Ruoyu Sun, Ge Zhang, Wenhao Huang, Zhi-Quan Luo ...

PIPer: On-Device Environment Setup via Online Reinforcement Learning

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Episode 1217 · · 20:19

🤗 Upvotes: 26 | cs.SE, cs.AI, cs.LG Authors: Alexander Kovrigin, Aleksandra Eliseeva, Konstantin Grotov, Egor Bogomolov, Yaroslav Zharov ...

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Episode 1216 · · 24:05

🤗 Upvotes: 25 | cs.LG Authors: Lorenz K. Müller, Philippe Bich, Jiawei Zhuang, Ahmet Çelik, Luca Benfenati, Lukas Cavigelli Title: ...

ACON: Optimizing Context Compression for Long-horizon LLM Agents

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Episode 1215 · · 25:16

🤗 Upvotes: 21 | cs.AI, cs.CL Authors: Minki Kang, Wei-Ning Chen, Dongge Han, Huseyin A. Inan, Lukas Wutschitz, Yanzhi Chen, Robert Sim, Saravan ...

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Episode 1214 · · 25:16

🤗 Upvotes: 124 | cs.CL, cs.AI Authors: Zijian Wu, Xiangyan Liu, Xinyuan Zhang, Lingjun Chen, Fanqing Meng, Lingxiao Du, Yiran Zhao, Fanshi Zhang...

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Episode 1213 · · 23:40

🤗 Upvotes: 106 | cs.NE, cs.AI, cs.LG, stat.ML Authors: Adrian Kosowski, Przemysław Uznański, Jan Chorowski, Zuzanna Stamirowska, Michał Bartoszk...

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Episode 1212 · · 29:09

🤗 Upvotes: 103 | cs.CV, cs.AI Authors: Qinsi Wang, Bo Liu, Tianyi Zhou, Jing Shi, Yueqian Lin, Yiran Chen, Hai Helen Li, Kun Wan, Wentian Zhao ...

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Episode 1211 · · 20:04

🤗 Upvotes: 57 | cs.CL Authors: Shaobo Wang, Jiaming Wang, Jiajun Zhang, Cong Wang, Yue Min, Zichen Wen, Fei Huang, Huiqiang Jiang, Junyang Lin, ...

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Episode 1210 · · 24:43

🤗 Upvotes: 45 | cs.CL, cs.AI, cs.LG Authors: Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, Mohammad Kachuee, Teja Gollapudi...

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Episode 1209 · · 27:25

🤗 Upvotes: 36 | cs.LG, cs.AI, cs.CV, cs.MM Authors: Junlin Han, Shengbang Tong, David Fan, Yufan Ren, Koustuv Sinha, Philip Torr, Filippos Kokki...

OceanGym: A Benchmark Environment for Underwater Embodied Agents

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Episode 1208 · · 22:22

🤗 Upvotes: 30 | cs.CL, cs.AI, cs.CV, cs.LG, cs.RO Authors: Yida Xue, Mingjun Mao, Xiangyuan Ru, Yuqi Zhu, Baochang Ren, Shuofei Qiao, Mengru Wan...

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

Episode 1207 · · 24:35

🤗 Upvotes: 29 | cs.CV, cs.AI Authors: Xinyu Tian, Shu Zou, Zhaoyuan Yang, Mengqi He, Fabian Waschkowski, Lukas Wesemann, Peter Tu, Jing Zhang ...

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Episode 1206 · · 23:19

🤗 Upvotes: 26 | cs.LG, cs.CL Authors: Xin Xu, Cliveb AI, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Yang Title: ...

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

Episode 1205 · · 25:33

🤗 Upvotes: 26 | cs.CV, cs.AI Authors: Junyu Chen, Wenkun He, Yuchao Gu, Yuyang Zhao, Jincheng Yu, Junsong Chen, Dongyun Zou, Yujun Lin, Zhekai Z...

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Episode 1204 · · 24:35

🤗 Upvotes: 98 | cs.LG, cs.AI, cs.CV Authors: Jintao Zhang, Haoxu Wang, Kai Jiang, Shuo Yang, Kaiwen Zheng, Haocheng Xi, Ziteng Wang, Hongzhou Zh...