Episodes

Latest Episode
ReviewScore: Misinformed Peer Review Detection with Large Language Models

Episode 1190 · 21:58

🤗 Upvotes: 54 | cs.CL Authors: Hyun Ryu, Doohyuk Jang, Hyemin S. Lee, Joonhyun Jeong, Gyeongman Kim, Donghyeon Cho, Gyouk Chu, Minyeong Hwang, H...

Variational Reasoning for Language Models

Episode 1189 · 22:33

🤗 Upvotes: 51 | cs.CL, cs.AI, cs.LG Authors: Xiangxin Zhou, Zichen Liu, Haonan Wang, Chao Du, Min Lin, Chongxuan Li, Liang Wang, Tianyu Pang ...

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Episode 1188 · 23:30

🤗 Upvotes: 48 | cs.CL, cs.AI, cs.LG Authors: Renjie Luo, Zichen Liu, Xiangyan Liu, Chao Du, Min Lin, Wenhu Chen, Wei Lu, Tianyu Pang ...

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Episode 1187 · 25:33

🤗 Upvotes: 28 | cs.CV, cs.RO Authors: Jinkun Hao, Naifu Liang, Zhen Luo, Xudong Xu, Weipeng Zhong, Ran Yi, Yichen Jin, Zhaoyang Lyu, Feng Zheng,...

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Episode 1186 · 23:54

🤗 Upvotes: 28 | cs.CV, cs.AI, cs.CL Authors: Long Xing, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jianze Liang, Qidong Huang, Jiaqi Wang, Feng Wu, D...

No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Episode 1185 · 27:53

🤗 Upvotes: 27 | cs.CL, cs.AI, cs.LG Authors: Thanh-Long V. Le, Myeongho Jeon, Kim Vu, Viet Lai, Eunho Yang Title: No Pr...

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Episode 1184 · 22:17

🤗 Upvotes: 95 | cs.LG, cs.CL Authors: Guochao Jiang, Wenfeng Feng, Guofeng Quan, Chuzhan Hao, Yuewei Zhang, Guohua Liu, Hao Wang Ti...

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

Episode 1183 · 23:35

🤗 Upvotes: 76 | cs.CL Authors: Yizhou Wang, Chen Tang, Han Deng, Jiabei Xiao, Jiaqi Liu, Jianyu Wu, Jun Yao, Pengze Li, Encheng Su, Lintao Wang,...

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Episode 1182 · 28:47

🤗 Upvotes: 67 | cs.CV Authors: Sicong Leng, Jing Wang, Jiaxi Li, Hao Zhang, Zhiqiang Hu, Boqiang Zhang, Yuming Jiang, Hang Zhang, Xin Li, Lidong...

Tree Search for LLM Agent Reinforcement Learning

Episode 1181 · 24:50

🤗 Upvotes: 58 | cs.LG, cs.AI Authors: Yuxiang Ji, Ziyu Ma, Yong Wang, Guanhua Chen, Xiangxiang Chu, Liaoni Wu Title: Tr...

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Episode 1180 · 21:30

🤗 Upvotes: 46 | cs.CV Authors: Team Seedream, Yunpeng Chen, Yu Gao, Lixue Gong, Meng Guo, Qiushan Guo, Zhiyao Guo, Xiaoxia Hou, Weilin Huang, Yi...

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Episode 1179 · 25:09

🤗 Upvotes: 28 | cs.CV, cs.AI Authors: Team Hunyuan3D, Bowen Zhang, Chunchao Guo, Haolin Liu, Hongyu Yan, Huiwen Shi, Jingwei Huang, Junlin Yu...

AutoIntent: AutoML for Text Classification

Episode 1178 · 22:28

🤗 Upvotes: 22 | cs.CL Authors: Ilya Alekseev, Roman Solomatin, Darina Rustamova, Denis Kuznetsov Title: AutoIntent: Aut...

Video models are zero-shot learners and reasoners

Episode 1177 · 24:55

🤗 Upvotes: 49 | cs.LG, cs.AI, cs.CV, cs.RO Authors: Thaddäus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu, Nick Matarese, Kevin Swersky, B...

SIM-CoT: Supervised Implicit Chain-of-Thought

Episode 1176 · 24:06

🤗 Upvotes: 28 | cs.CL, cs.AI Authors: Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, Xipeng Qiu, Dahua Lin ...

Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR

Episode 1175 · 20:21

🤗 Upvotes: 83 | cs.CV, cs.CL Authors: Khalil Hennara, Muhammad Hreden, Mohamed Motasim Hamed, Ahmad Bastati, Zeina Aldallal, Sara Chrouf, Safwan...

Reinforcement Learning on Pre-Training Data

Episode 1174 · 20:55

🤗 Upvotes: 43 | cs.CL, cs.AI, cs.LG Authors: Siheng Li, Kejiao Li, Zenan Xu, Guanhua Huang, Evander Yang, Kun Li, Haoyuan Wu, Jiajia Wu, Zihao Z...

Do You Need Proprioceptive States in Visuomotor Policies?

Episode 1173 · 26:01

🤗 Upvotes: 43 | cs.RO, cs.AI Authors: Juntu Zhao, Wenbo Lu, Di Zhang, Yufeng Liu, Yushen Liang, Tianluo Zhang, Yifeng Cao, Junyuan Xie, Yingdong...

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Episode 1172 · 25:04

🤗 Upvotes: 32 | cs.LG, cs.CV Authors: Tianyu Yu, Zefan Wang, Chongyi Wang, Fuwei Huang, Wenshuo Ma, Zhihui He, Tianchi Cai, Weize Chen, Yuxiang ...

LIMI: Less is More for Agency

Episode 1171 · 21:27

🤗 Upvotes: 69 | cs.AI Authors: Yang Xiao, Mohan Jiang, Jie Sun, Keyu Li, Jifan Lin, Yumin Zhuang, Ji Zeng, Shijie Xia, Qishuo Hua, Xuefeng Li, X...