Daily Paper Cast | All Episodes

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Episode 520 · February 11, 2025 · 21:05

🤗 Upvotes: 12 | cs.CL Authors: Sukmin Cho, Sangjin Choi, Taeho Hwang, Jeongyeon Seo, Soyeong Jeong, Huije Lee, Hoyun Song, Jong C. Park, Youngji...

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Episode 519 · February 11, 2025 · 21:25

🤗 Upvotes: 11 | cs.CL Authors: Ling Yang, Zhaochen Yu, Bin Cui, Mengdi Wang Title: ReasonFlux: Hierarchical LLM Reasoni...

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Episode 518 · February 10, 2025 · 20:26

🤗 Upvotes: 52 | cs.CV Authors: Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jia...

Fast Video Generation with Sliding Tile Attention

Episode 517 · February 10, 2025 · 21:19

🤗 Upvotes: 39 | cs.CV Authors: Peiyuan Zhang, Yongqi Chen, Runlong Su, Hangliang Ding, Ion Stoica, Zhenghong Liu, Hao Zhang Title: ...

Goku: Flow Based Video Generative Foundation Models

Episode 516 · February 10, 2025 · 22:19

🤗 Upvotes: 39 | cs.CV Authors: Shoufa Chen, Chongjian Ge, Yuqi Zhang, Yida Zhang, Fengda Zhu, Hao Yang, Hongxiang Hao, Hui Wu, Zhichao Lai, Yife...

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Episode 515 · February 10, 2025 · 22:48

🤗 Upvotes: 32 | cs.LG Authors: Andrei Panferov, Jiale Chen, Soroush Tabesh, Roberto L. Castro, Mahdi Nikdan, Dan Alistarh Title: ...

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Episode 514 · February 10, 2025 · 23:08

🤗 Upvotes: 30 | cs.LG, cs.CL Authors: Jonas Geiping, Sean McLeish, Neel Jain, John Kirchenbauer, Siddharth Singh, Brian R. Bartoldson, Bhavya Ka...

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Episode 513 · February 10, 2025 · 17:55

🤗 Upvotes: 23 | cs.CV Authors: Chung-Ho Wu, Yang-Jung Chen, Ying-Huan Chen, Jie-Ying Lee, Bo-Hsu Ke, Chun-Wei Tuan Mu, Yi-Chuan Huang, Chin-Yang...

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Episode 512 · February 10, 2025 · 19:13

🤗 Upvotes: 18 | cs.CL, cs.LG Authors: Yihe Deng, Yu Yang, Junkai Zhang, Wei Wang, Bo Li Title: DuoGuard: A Two-Player R...

Agency Is Frame-Dependent

Episode 511 · February 10, 2025 · 23:14

🤗 Upvotes: 15 | cs.AI Authors: David Abel, André Barreto, Michael Bowling, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetar...

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Episode 510 · February 10, 2025 · 23:13

🤗 Upvotes: 14 | cs.CV Authors: Shilong Zhang, Wenbo Li, Shoufa Chen, Chongjian Ge, Peize Sun, Yida Zhang, Yi Jiang, Zehuan Yuan, Binyue Peng, Pi...

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Episode 509 · February 10, 2025 · 21:14

🤗 Upvotes: 13 | cs.AI Authors: Zhouliang Yu, Yuhuan Yuan, Tim Z. Xiao, Fuxiang Frank Xia, Jie Fu, Ge Zhang, Ge Lin, Weiyang Liu Tit...

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Episode 508 · February 7, 2025 · 23:32

🤗 Upvotes: 41 | cs.LG, cs.CL Authors: Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov, Daniil Gavrilov Title: Analyz...

UltraIF: Advancing Instruction Following from the Wild

Episode 507 · February 7, 2025 · 19:51

🤗 Upvotes: 15 | cs.CL, cs.AI Authors: Kaikai An, Li Sheng, Ganqu Cui, Shuzheng Si, Ning Ding, Yu Cheng, Baobao Chang Title: ...

Great Models Think Alike and this Undermines AI Oversight

Episode 506 · February 7, 2025 · 30:24

🤗 Upvotes: 14 | cs.LG, cs.AI, cs.CL Authors: Shashwat Goel, Joschka Struber, Ilze Amanda Auzina, Karuna K Chandra, Ponnurangam Kumaraguru, Douwe...

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Episode 505 · February 7, 2025 · 22:33

🤗 Upvotes: 14 | cs.AI, cs.LG Authors: Yuri Chervonyi, Trieu H. Trinh, Miroslav Olšák, Xiaomeng Yang, Hoang Nguyen, Marcelo Menegali, Junehyuk Ju...

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Episode 504 · February 7, 2025 · 20:43

🤗 Upvotes: 14 | cs.CV, cs.CL, cs.MM, cs.SD, eess.AS, eess.IV Authors: Zuyan Liu, Yuhao Dong, Jiahui Wang, Ziwei Liu, Winston Hu, Jiwen Lu, Yongm...

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

Episode 503 · February 7, 2025 · 20:44

🤗 Upvotes: 13 | cs.CV Authors: Ziyan Guo, Zeyu Hu, Na Zhao, De Wen Soh Title: MotionLab: Unified Human Motion Generatio...

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Episode 502 · February 7, 2025 · 23:02

🤗 Upvotes: 13 | cs.CL Authors: Xintong Hao, Ke Shen, Chenggang Li Title: MAGA: MAssive Genre-Audience Reformulation to ...

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Episode 501 · February 7, 2025 · 20:37

🤗 Upvotes: 12 | cs.CL Authors: Yinjie Wang, Ling Yang, Guohao Li, Mengdi Wang, Bryon Aragam Title: ScoreFlow: Mastering...

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Episode 500 · February 7, 2025 · 22:31

🤗 Upvotes: 10 | eess.AS, cs.AI, cs.CL, cs.MM, cs.SD Authors: Zhen Ye, Xinfa Zhu, Chi-Min Chan, Xinsheng Wang, Xu Tan, Jiahe Lei, Yi Peng, Haohe ...

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Episode 499 · February 6, 2025 · 21:44

🤗 Upvotes: 90 | cs.CL Authors: Loubna Ben Allal, Anton Lozhkov, Elie Bakouch, Gabriel Martín Blázquez, Guilherme Penedo, Lewis Tunstall, Andrés ...

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Episode 498 · February 6, 2025 · 23:16

🤗 Upvotes: 27 | cs.CE, cs.CY Authors: Yuzhe Yang, Yifei Zhang, Minghao Wu, Kaidi Zhang, Yunmiao Zhang, Honghai Yu, Yan Hu, Benyou Wang ...

Demystifying Long Chain-of-Thought Reasoning in LLMs

Episode 497 · February 6, 2025 · 20:57

🤗 Upvotes: 26 | cs.CL, cs.LG Authors: Edward Yeo, Yuxuan Tong, Morry Niu, Graham Neubig, Xiang Yue Title: Demystifying ...

LIMO: Less is More for Reasoning

Episode 496 · February 6, 2025 · 23:37

🤗 Upvotes: 24 | cs.CL, cs.AI Authors: Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, Pengfei Liu Title: LIMO...

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Episode 495 · February 6, 2025 · 21:44

🤗 Upvotes: 10 | cs.CL Authors: Jinyang Wu, Mingkuan Feng, Shuai Zhang, Ruihan Jin, Feihu Che, Zengqi Wen, Jianhua Tao Title: ...

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Episode 494 · February 6, 2025 · 25:33

🤗 Upvotes: 7 | cs.CV Authors: Yiren Song, Danze Chen, Mike Zheng Shou Title: LayerTracer: Cognitive-Aligned Layered SVG...

On Teacher Hacking in Language Model Distillation

Episode 493 · February 6, 2025 · 21:01

🤗 Upvotes: 6 | cs.LG, cs.AI, cs.CL, stat.ML Authors: Daniil Tiapkin, Daniele Calandriello, Johan Ferret, Sarah Perrin, Nino Vieillard, Alexandre...

A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods

Episode 492 · February 6, 2025 · 24:05

🤗 Upvotes: 5 | cs.LG, cs.AI Authors: Isha Puri, Shivchander Sudalairaj, Guangxuan Xu, Kai Xu, Akash Srivastava Title: A...

Jailbreaking with Universal Multi-Prompts

Episode 491 · February 6, 2025 · 20:08

🤗 Upvotes: 4 | cs.CL, cs.AI, cs.CR, cs.LG Authors: Yu-Ling Hsu, Hsuan Su, Shang-Tse Chen Title: Jailbreaking with Unive...