Episodes

Latest Episode
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Episode 490 · 19:27

🤗 Upvotes: 29 | cs.CV Authors: Hila Chefer, Uriel Singer, Amit Zohar, Yuval Kirstain, Adam Polyak, Yaniv Taigman, Lior Wolf, Shelly Sheynin ...

Inverse Bridge Matching Distillation

Episode 489 · 19:48

🤗 Upvotes: 22 | cs.LG, cs.CV Authors: Nikita Gushchin, David Li, Daniil Selikhanovych, Evgeny Burnaev, Dmitry Baranchuk, Alexander Korotin ...

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Episode 488 · 20:06

🤗 Upvotes: 16 | cs.SE, cs.AI, cs.CL Authors: Huaye Zeng, Dongfu Jiang, Haozhe Wang, Ping Nie, Xiaotong Chen, Wenhu Chen Title: ...

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Episode 487 · 18:12

🤗 Upvotes: 12 | cs.LG, cs.AI Authors: Zongyu Lin, Yao Tang, Xingcheng Yao, Da Yin, Ziniu Hu, Yizhou Sun, Kai-Wei Chang Title: ...

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Episode 486 · 24:08

🤗 Upvotes: 12 | cs.CL, cs.AI Authors: Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory Wornell, Subhro Da...

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Episode 485 · 23:34

🤗 Upvotes: 7 | cs.CL, cs.LG Authors: Wenzhe Li, Yong Lin, Mengzhou Xia, Chi Jin Title: Rethinking Mixture-of-Agents: Is...

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Episode 484 · 24:59

🤗 Upvotes: 7 | cs.CV Authors: Xueqing Deng, Qihang Yu, Ali Athar, Chenglin Yang, Linjie Yang, Xiaojie Jin, Xiaohui Shen, Liang-Chieh Chen ...

The Differences Between Direct Alignment Algorithms are a Blur

Episode 483 · 20:11

🤗 Upvotes: 84 | cs.LG Authors: Alexey Gorbatovski, Boris Shaposhnikov, Viacheslav Sinii, Alexey Malakhov, Daniil Gavrilov Title: ...

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Episode 482 · 26:24

🤗 Upvotes: 83 | cs.CV Authors: Gaojie Lin, Jianwen Jiang, Jiaqi Yang, Zerong Zheng, Chao Liang Title: OmniHuman-1: Reth...

Process Reinforcement through Implicit Rewards

Episode 481 · 21:51

🤗 Upvotes: 44 | cs.LG, cs.AI, cs.CL Authors: Ganqu Cui, Lifan Yuan, Zefan Wang, Hanbin Wang, Wendi Li, Bingxiang He, Yuchen Fan, Tianyu Yu, Qixi...

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Episode 480 · 23:27

🤗 Upvotes: 25 | cs.CL Authors: Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte Suresh, Abh...

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Episode 479 · 23:34

🤗 Upvotes: 25 | cs.CR, cs.AI, cs.IR Authors: Xun Liang, Simin Niu, Zhiyu Li, Sensen Zhang, Hanyu Wang, Feiyu Xiong, Jason Zhaoxin Fan, Bo Tang, ...

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Episode 478 · 21:56

🤗 Upvotes: 25 | cs.LG, cs.AI, cs.CL Authors: Dawei Li, Renliang Sun, Yue Huang, Ming Zhong, Bohan Jiang, Jiawei Han, Xiangliang Zhang, Wei Wang,...

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Episode 477 · 25:02

🤗 Upvotes: 19 | cs.CV, cs.GR, cs.LG Authors: Rohit Gandikota, Zongze Wu, Richard Zhang, David Bau, Eli Shechtman, Nick Kolkin Title...

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Episode 476 · 24:39

🤗 Upvotes: 15 | cs.AI, cs.CV Authors: Huanqia Cai, Yijun Yang, Winston Hu Title: MM-IQ: Benchmarking Human-Like Abstrac...

AIN: The Arabic INclusive Large Multimodal Model

Episode 475 · 20:32

🤗 Upvotes: 12 | cs.CV, cs.AI, cs.CL, cs.HC, cs.LG Authors: Ahmed Heakl, Sara Ghaboura, Omkar Thawkar, Fahad Shahbaz Khan, Hisham Cholakkal, Rao ...

s1: Simple test-time scaling

Episode 474 · 22:36

🤗 Upvotes: 54 | cs.CL, cs.AI, cs.LG Authors: Niklas Muennighoff, Zitong Yang, Weijia Shi, Xiang Lisa Li, Li Fei-Fei, Hannaneh Hajishirzi, Luke Z...

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Episode 473 · 21:49

🤗 Upvotes: 28 | cs.CL, cs.AI Authors: Baohao Liao, Yuhui Xu, Hanze Dong, Junnan Li, Christof Monz, Silvio Savarese, Doyen Sahoo, Caiming Xiong ...

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Episode 472 · 21:27

🤗 Upvotes: 12 | cs.CL, cs.AI Authors: Qika Lin, Tianzhe Zhao, Kai He, Zhen Peng, Fangzhi Xu, Ling Huang, Jingying Ma, Mengling Feng ...

PixelWorld: Towards Perceiving Everything as Pixels

Episode 471 · 20:07

🤗 Upvotes: 10 | cs.CV, cs.CL Authors: Zhiheng Lyu, Xueguang Ma, Wenhu Chen Title: PixelWorld: Towards Perceiving Everyt...

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Episode 470 · 20:19

🤗 Upvotes: 8 | cs.RO, cs.AI Authors: Gaoyue Zhou, Hengkai Pan, Yann LeCun, Lerrel Pinto Title: DINO-WM: World Models on...

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Episode 469 · 20:55

🤗 Upvotes: 6 | cs.CL, cs.AI, cs.CR, cs.LG Authors: Mrinank Sharma, Meg Tong, Jesse Mu, Jerry Wei, Jorrit Kruthoff, Scott Goodfriend, Euan Ong, A...

Scalable-Softmax Is Superior for Attention

Episode 468 · 23:31

🤗 Upvotes: 6 | cs.CL, cs.AI, cs.LG Authors: Ken M. Nakanishi Title: Scalable-Softmax Is Superior for Attention ...

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Episode 467 · 21:51

🤗 Upvotes: 3 | cs.LG, math.OC, stat.ML Authors: Fabian Schaipp, Alexander Hägele, Adrien Taylor, Umut Simsekli, Francis Bach Title:...

SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders

Episode 466 · 20:09

🤗 Upvotes: 3 | cs.LG, cs.AI Authors: Bartosz Cywiński, Kamil Deja Title: SAeUron: Interpretable Concept Unlearning in D...

GuardReasoner: Towards Reasoning-based LLM Safeguards

Episode 465 · 21:00

🤗 Upvotes: 46 | cs.CR, cs.AI, cs.LG Authors: Yue Liu, Hongcheng Gao, Shengfang Zhai, Jun Xia, Tianyi Wu, Zhiwei Xue, Yulin Chen, Kenji Kawaguchi...

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Episode 464 · 23:01

🤗 Upvotes: 22 | cs.CL Authors: Yue Wang, Qiuzhi Liu, Jiahao Xu, Tian Liang, Xingyu Chen, Zhiwei He, Linfeng Song, Dian Yu, Juntao Li, Zhuosheng ...

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Episode 463 · 23:36

🤗 Upvotes: 15 | cs.CL Authors: Arthur Douillard, Yanislav Donchev, Keith Rush, Satyen Kale, Zachary Charles, Zachary Garrett, Gabriel Teston, Da...

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Episode 462 · 19:26

🤗 Upvotes: 15 | cs.AI, cs.CL, cs.CV, cs.LG Authors: Yuxin Zuo, Shang Qu, Yifei Li, Zhangren Chen, Xuekai Zhu, Ermo Hua, Kaiyan Zhang, Ning Ding,...

Large Language Models Think Too Fast To Explore Effectively

Episode 461 · 25:52

🤗 Upvotes: 10 | cs.AI, q-bio.NC Authors: Lan Pan, Hanbo Xie, Robert C. Wilson Title: Large Language Models Think Too Fa...