Episodes

Latest Episode
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Episode 599 · 24:51

🤗 Upvotes: 24 | cs.CL, cs.AI Authors: Tian Xie, Zitian Gao, Qingnan Ren, Haoming Luo, Yuqian Hong, Bryan Dai, Joey Zhou, Kai Qiu, Zhirong Wu, Ch...

Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning

Episode 598 · 21:29

🤗 Upvotes: 22 | quant-ph, cs.AI, cs.IT, cs.LG, math.IT Authors: Austin Yubo He, Zi-Wen Liu Title: Discovering highly ef...

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Episode 597 · 19:43

🤗 Upvotes: 20 | cs.CV, cs.AI, cs.CL Authors: Shangqing Tu, Yucheng Wang, Daniel Zhang-Li, Yushi Bai, Jifan Yu, Yuhao Wu, Lei Hou, Huiqin Liu, Zh...

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Episode 596 · 23:15

🤗 Upvotes: 18 | cs.CL, cs.AI Authors: Yein Park, Chanwoong Yoon, Jungwoo Park, Minbyul Jeong, Jaewoo Kang Title: Does T...

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Episode 595 · 23:24

🤗 Upvotes: 15 | cs.CL, cs.LG Authors: Ruotian Ma, Peisong Wang, Cheng Liu, Xingyan Liu, Jiaqi Chen, Bang Zhang, Xin Zhou, Nan Du, Jia Li ...

Qwen2.5-VL Technical Report

Episode 594 · 21:11

🤗 Upvotes: 97 | cs.CV, cs.CL Authors: Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang, J...

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Episode 593 · 20:45

🤗 Upvotes: 31 | cs.CV, cs.RO Authors: Hao Gao, Shaoyu Chen, Bo Jiang, Bencheng Liao, Yiang Shi, Xiaoyang Guo, Yuechuan Pu, Haoran Yin, Xiangyu L...

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Episode 592 · 24:25

🤗 Upvotes: 28 | cs.SD, cs.AI Authors: Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jia...

MoM: Linear Sequence Modeling with Mixture-of-Memories

Episode 591 · 20:12

🤗 Upvotes: 22 | cs.CL, cs.AI, cs.LG Authors: Jusen Du, Weigao Sun, Disen Lan, Jiaxi Hu, Yu Cheng Title: MoM: Linear Seq...

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Episode 590 · 21:25

🤗 Upvotes: 22 | cs.CL Authors: William Jurayj, Jeffrey Cheng, Benjamin Van Durme Title: Is That Your Final Answer? Test...

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Episode 589 · 22:44

🤗 Upvotes: 21 | cs.CL Authors: Shi Yu, Zhiyuan Liu, Chenyan Xiong Title: Craw4LLM: Efficient Web Crawling for LLM Pretr...

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Episode 588 · 25:16

🤗 Upvotes: 19 | cs.CL, cs.LG Authors: Guanzheng Chen, Xin Li, Michael Qizhe Shieh, Lidong Bing Title: LongPO: Long Cont...

Small Models Struggle to Learn from Strong Reasoners

Episode 587 · 20:02

🤗 Upvotes: 17 | cs.AI Authors: Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar Ramasubramanian, Radha Po...

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Episode 586 · 22:30

🤗 Upvotes: 15 | cs.LG, cs.AI, cs.DC Authors: Michael Luo, Xiaoxiang Shi, Colin Cai, Tianjun Zhang, Justin Wong, Yichuan Wang, Chi Wang, Yanping ...

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Episode 585 · 22:04

🤗 Upvotes: 10 | cs.CL, cs.AI, cs.IR, cs.IT, math.IT Authors: Yucheng Shi, Tianze Yang, Canyu Chen, Quanzheng Li, Tianming Liu, Xiang Li, Ninghao...

Soundwave: Less is More for Speech-Text Alignment in LLMs

Episode 584 · 21:52

🤗 Upvotes: 65 | cs.CL, cs.AI, cs.SD Authors: Yuhao Zhang, Zhiheng Liu, Fan Bu, Ruiyu Zhang, Benyou Wang, Haizhou Li Title: ...

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Episode 583 · 23:08

🤗 Upvotes: 51 | cs.CL, cs.LG Authors: Yuri Kuratov, Mikhail Arkhipov, Aydar Bulatov, Mikhail Burtsev Title: Cramming 15...

Continuous Diffusion Model for Language Modeling

Episode 582 · 18:43

🤗 Upvotes: 44 | cs.LG Authors: Jaehyeong Jo, Sung Ju Hwang Title: Continuous Diffusion Model for Language Modeling ...

Phantom: Subject-consistent video generation via cross-modal alignment

Episode 581 · 21:18

🤗 Upvotes: 42 | cs.CV, cs.AI Authors: Lijie Liu, Tianxiang Ma, Bingchuan Li, Zhuowei Chen, Jiawei Liu, Qian He, Xinglong Wu Title: ...

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Episode 580 · 22:39

🤗 Upvotes: 33 | cs.AI, cs.CL Authors: Feng Luo, Rui Yang, Hao Sun, Chunyuan Deng, Jiarui Yao, Jingyan Shen, Huan Zhang, Hanjie Chen ...

Magma: A Foundation Model for Multimodal AI Agents

Episode 579 · 23:02

🤗 Upvotes: 30 | cs.CV, cs.AI, cs.HC, cs.LG, cs.RO Authors: Jianwei Yang, Reuben Tan, Qianhui Wu, Ruijie Zheng, Baolin Peng, Yongyuan Liang, Yu G...

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Episode 578 · 21:52

🤗 Upvotes: 29 | cs.CV Authors: Bencheng Liao, Hongyuan Tao, Qian Zhang, Tianheng Cheng, Yingyue Li, Haoran Yin, Wenyu Liu, Xinggang Wang ...

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Episode 577 · 21:29

🤗 Upvotes: 27 | cs.RO, cs.AI, cs.CV Authors: Zekun Qi, Wenyao Zhang, Yufei Ding, Runpei Dong, Xinqiang Yu, Jingwen Li, Lingyun Xu, Baoyu Li, Xia...

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Episode 576 · 20:32

🤗 Upvotes: 26 | cs.CL Authors: Seanie Lee, Dong Bok Lee, Dominik Wagner, Minki Kang, Haebin Seong, Tobias Bocklet, Juho Lee, Sung Ju Hwang ...

You Do Not Fully Utilize Transformer's Representation Capacity

Episode 575 · 20:56

🤗 Upvotes: 25 | cs.LG, cs.CL Authors: Gleb Gerasimov, Yaroslav Aksenov, Nikita Balagansky, Viacheslav Sinii, Daniil Gavrilov Title:...

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Episode 574 · 23:03

🤗 Upvotes: 68 | cs.CL, cs.AI, cs.LG Authors: Jingyang Yuan, Huazuo Gao, Damai Dai, Junyu Luo, Liang Zhao, Zhengyan Zhang, Zhenda Xie, Y. X. Wei,...

Learning Getting-Up Policies for Real-World Humanoid Robots

Episode 573 · 24:40

🤗 Upvotes: 32 | cs.RO, cs.LG Authors: Xialin He, Runpei Dong, Zixuan Chen, Saurabh Gupta Title: Learning Getting-Up Pol...

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Episode 572 · 21:53

🤗 Upvotes: 27 | cs.LG, cs.SE Authors: Samuel Miserendino, Michele Wang, Tejal Patwardhan, Johannes Heidecke Title: SWE-...

CRANE: Reasoning with constrained LLM generation

Episode 571 · 21:23

🤗 Upvotes: 17 | cs.PL, cs.LG Authors: Debangshu Banerjee, Tarun Suresh, Shubham Ugare, Sasa Misailovic, Gagandeep Singh Title: ...

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Episode 570 · 24:39

🤗 Upvotes: 16 | cs.LG, cs.AI, cs.CL, cs.CV, cs.HC Authors: Yixin Ou, Yunzhi Yao, Ningyu Zhang, Hui Jin, Jiacheng Sun, Shumin Deng, Zhenguo Li, H...