Episodes

Latest Episode
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Episode 508 · · 23:32

πŸ€— Upvotes: 41 | cs.LG, cs.CL Authors: Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov, Daniil Gavrilov Title: Analyz...

UltraIF: Advancing Instruction Following from the Wild

UltraIF: Advancing Instruction Following from the Wild

Episode 507 · · 19:51

πŸ€— Upvotes: 15 | cs.CL, cs.AI Authors: Kaikai An, Li Sheng, Ganqu Cui, Shuzheng Si, Ning Ding, Yu Cheng, Baobao Chang Title: ...

Great Models Think Alike and this Undermines AI Oversight

Great Models Think Alike and this Undermines AI Oversight

Episode 506 · · 30:24

πŸ€— Upvotes: 14 | cs.LG, cs.AI, cs.CL Authors: Shashwat Goel, Joschka Struber, Ilze Amanda Auzina, Karuna K Chandra, Ponnurangam Kumaraguru, Douwe...

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Episode 505 · · 22:33

πŸ€— Upvotes: 14 | cs.AI, cs.LG Authors: Yuri Chervonyi, Trieu H. Trinh, Miroslav OlΕ‘Γ‘k, Xiaomeng Yang, Hoang Nguyen, Marcelo Menegali, Junehyuk Ju...

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Episode 504 · · 20:43

πŸ€— Upvotes: 14 | cs.CV, cs.CL, cs.MM, cs.SD, eess.AS, eess.IV Authors: Zuyan Liu, Yuhao Dong, Jiahui Wang, Ziwei Liu, Winston Hu, Jiwen Lu, Yongm...

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

Episode 503 · · 20:44

πŸ€— Upvotes: 13 | cs.CV Authors: Ziyan Guo, Zeyu Hu, Na Zhao, De Wen Soh Title: MotionLab: Unified Human Motion Generatio...

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Episode 502 · · 23:02

πŸ€— Upvotes: 13 | cs.CL Authors: Xintong Hao, Ke Shen, Chenggang Li Title: MAGA: MAssive Genre-Audience Reformulation to ...

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Episode 501 · · 20:37

πŸ€— Upvotes: 12 | cs.CL Authors: Yinjie Wang, Ling Yang, Guohao Li, Mengdi Wang, Bryon Aragam Title: ScoreFlow: Mastering...

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Episode 500 · · 22:31

πŸ€— Upvotes: 10 | eess.AS, cs.AI, cs.CL, cs.MM, cs.SD Authors: Zhen Ye, Xinfa Zhu, Chi-Min Chan, Xinsheng Wang, Xu Tan, Jiahe Lei, Yi Peng, Haohe ...

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Episode 499 · · 21:44

πŸ€— Upvotes: 90 | cs.CL Authors: Loubna Ben Allal, Anton Lozhkov, Elie Bakouch, Gabriel MartΓ­n BlΓ‘zquez, Guilherme Penedo, Lewis Tunstall, AndrΓ©s ...

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Episode 498 · · 23:16

πŸ€— Upvotes: 27 | cs.CE, cs.CY Authors: Yuzhe Yang, Yifei Zhang, Minghao Wu, Kaidi Zhang, Yunmiao Zhang, Honghai Yu, Yan Hu, Benyou Wang ...

Demystifying Long Chain-of-Thought Reasoning in LLMs

Demystifying Long Chain-of-Thought Reasoning in LLMs

Episode 497 · · 20:57

πŸ€— Upvotes: 26 | cs.CL, cs.LG Authors: Edward Yeo, Yuxuan Tong, Morry Niu, Graham Neubig, Xiang Yue Title: Demystifying ...

LIMO: Less is More for Reasoning

LIMO: Less is More for Reasoning

Episode 496 · · 23:37

πŸ€— Upvotes: 24 | cs.CL, cs.AI Authors: Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, Pengfei Liu Title: LIMO...

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Episode 495 · · 21:44

πŸ€— Upvotes: 10 | cs.CL Authors: Jinyang Wu, Mingkuan Feng, Shuai Zhang, Ruihan Jin, Feihu Che, Zengqi Wen, Jianhua Tao Title: ...

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Episode 494 · · 25:33

πŸ€— Upvotes: 7 | cs.CV Authors: Yiren Song, Danze Chen, Mike Zheng Shou Title: LayerTracer: Cognitive-Aligned Layered SVG...

On Teacher Hacking in Language Model Distillation

On Teacher Hacking in Language Model Distillation

Episode 493 · · 21:01

πŸ€— Upvotes: 6 | cs.LG, cs.AI, cs.CL, stat.ML Authors: Daniil Tiapkin, Daniele Calandriello, Johan Ferret, Sarah Perrin, Nino Vieillard, Alexandre...

A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods

A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods

Episode 492 · · 24:05

πŸ€— Upvotes: 5 | cs.LG, cs.AI Authors: Isha Puri, Shivchander Sudalairaj, Guangxuan Xu, Kai Xu, Akash Srivastava Title: A...

Jailbreaking with Universal Multi-Prompts

Jailbreaking with Universal Multi-Prompts

Episode 491 · · 20:08

πŸ€— Upvotes: 4 | cs.CL, cs.AI, cs.CR, cs.LG Authors: Yu-Ling Hsu, Hsuan Su, Shang-Tse Chen Title: Jailbreaking with Unive...

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Episode 490 · · 19:27

πŸ€— Upvotes: 29 | cs.CV Authors: Hila Chefer, Uriel Singer, Amit Zohar, Yuval Kirstain, Adam Polyak, Yaniv Taigman, Lior Wolf, Shelly Sheynin ...

Inverse Bridge Matching Distillation

Inverse Bridge Matching Distillation

Episode 489 · · 19:48

πŸ€— Upvotes: 22 | cs.LG, cs.CV Authors: Nikita Gushchin, David Li, Daniil Selikhanovych, Evgeny Burnaev, Dmitry Baranchuk, Alexander Korotin ...

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Episode 488 · · 20:06

πŸ€— Upvotes: 16 | cs.SE, cs.AI, cs.CL Authors: Huaye Zeng, Dongfu Jiang, Haozhe Wang, Ping Nie, Xiaotong Chen, Wenhu Chen Title: ...

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Episode 487 · · 18:12

πŸ€— Upvotes: 12 | cs.LG, cs.AI Authors: Zongyu Lin, Yao Tang, Xingcheng Yao, Da Yin, Ziniu Hu, Yizhou Sun, Kai-Wei Chang Title: ...

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Episode 486 · · 24:08

πŸ€— Upvotes: 12 | cs.CL, cs.AI Authors: Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory Wornell, Subhro Da...

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Episode 485 · · 23:34

πŸ€— Upvotes: 7 | cs.CL, cs.LG Authors: Wenzhe Li, Yong Lin, Mengzhou Xia, Chi Jin Title: Rethinking Mixture-of-Agents: Is...

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Episode 484 · · 24:59

πŸ€— Upvotes: 7 | cs.CV Authors: Xueqing Deng, Qihang Yu, Ali Athar, Chenglin Yang, Linjie Yang, Xiaojie Jin, Xiaohui Shen, Liang-Chieh Chen ...

The Differences Between Direct Alignment Algorithms are a Blur

The Differences Between Direct Alignment Algorithms are a Blur

Episode 483 · · 20:11

πŸ€— Upvotes: 84 | cs.LG Authors: Alexey Gorbatovski, Boris Shaposhnikov, Viacheslav Sinii, Alexey Malakhov, Daniil Gavrilov Title: ...

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Episode 482 · · 26:24

πŸ€— Upvotes: 83 | cs.CV Authors: Gaojie Lin, Jianwen Jiang, Jiaqi Yang, Zerong Zheng, Chao Liang Title: OmniHuman-1: Reth...

Process Reinforcement through Implicit Rewards

Process Reinforcement through Implicit Rewards

Episode 481 · · 21:51

πŸ€— Upvotes: 44 | cs.LG, cs.AI, cs.CL Authors: Ganqu Cui, Lifan Yuan, Zefan Wang, Hanbin Wang, Wendi Li, Bingxiang He, Yuchen Fan, Tianyu Yu, Qixi...

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Episode 480 · · 23:27

πŸ€— Upvotes: 25 | cs.CL Authors: Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte Suresh, Abh...

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Episode 479 · · 23:34

πŸ€— Upvotes: 25 | cs.CR, cs.AI, cs.IR Authors: Xun Liang, Simin Niu, Zhiyu Li, Sensen Zhang, Hanyu Wang, Feiyu Xiong, Jason Zhaoxin Fan, Bo Tang, ...