Episodes

Latest Episode
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

Episode 298 · · 22:38

πŸ€— Upvotes: 11 | cs.CV Authors: Jiawei Lin, Shizhao Sun, Danqing Huang, Ting Liu, Ji Li, Jiang Bian Title: From Elements...

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Episode 297 · · 23:53

πŸ€— Upvotes: 8 | cs.CV Authors: Tao Wu, Yong Zhang, Xiaodong Cun, Zhongang Qi, Junfu Pu, Huanzhang Dou, Guangcong Zheng, Ying Shan, Xi Li ...

The Superposition of Diffusion Models Using the ItΓ΄ Density Estimator

The Superposition of Diffusion Models Using the ItΓ΄ Density Estimator

Episode 296 · · 23:22

πŸ€— Upvotes: 8 | cs.LG Authors: Marta Skreta, Lazar Atanackovic, Avishek Joey Bose, Alexander Tong, Kirill Neklyudov Title: ...

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Episode 295 · · 19:03

πŸ€— Upvotes: 6 | cs.CL Authors: Hua Farn, Hsuan Su, Shachi H Kumar, Saurav Sahay, Shang-Tse Chen, Hung-yi Lee Title: Safe...

CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era

CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era

Episode 294 · · 25:00

πŸ€— Upvotes: 3 | cs.CL, cs.AI, cs.DB Authors: Yanlin Feng, Simone Papicchio, Sajjadur Rahman Title: CypherBench: Towards ...

YuLan-Mini: An Open Data-efficient Language Model

YuLan-Mini: An Open Data-efficient Language Model

Episode 293 · · 19:39

πŸ€— Upvotes: 27 | cs.CL Authors: Yiwen Hu, Huatong Song, Jia Deng, Jiapeng Wang, Jie Chen, Kun Zhou, Yutao Zhu, Jinhao Jiang, Zican Dong, Wayne Xi...

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Episode 292 · · 21:47

πŸ€— Upvotes: 17 | cs.CL Authors: Chenlong Deng, Zhisong Zhang, Kelong Mao, Shuaiyi Li, Xinting Huang, Dong Yu, Zhicheng Dou Title: ...

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

Episode 291 · · 21:10

πŸ€— Upvotes: 4 | cs.CV, cs.AI, cs.CL, cs.LG Authors: Wan-Cyuan Fan, Tanzila Rahman, Leonid Sigal Title: MMFactory: A Univ...

Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation

Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation

Episode 290 · · 22:18

πŸ€— Upvotes: 2 | cs.IR, cs.AI Authors: Yucong Luo, Qitao Qin, Hao Zhang, Mingyue Cheng, Ruiran Yan, Kefan Wang, Jie Ouyang Title: ...

DepthLab: From Partial to Complete

DepthLab: From Partial to Complete

Episode 289 · · 22:07

πŸ€— Upvotes: 21 | cs.CV Authors: Zhiheng Liu, Ka Leong Cheng, Qiuyu Wang, Shuzhe Wang, Hao Ouyang, Bin Tan, Kai Zhu, Yujun Shen, Qifeng Chen, Ping...

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Episode 288 · · 21:34

πŸ€— Upvotes: 20 | cs.AI, cs.CL Authors: Ermo Hua, Che Jiang, Xingtai Lv, Kaiyan Zhang, Ning Ding, Youbang Sun, Biqing Qi, Yuchen Fan, Xue Kai Zhu,...

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Episode 287 · · 22:13

πŸ€— Upvotes: 10 | cs.CV, cs.AI, cs.MM Authors: Minghong Cai, Xiaodong Cun, Xiaoyu Li, Wenze Liu, Zhaoyang Zhang, Yong Zhang, Ying Shan, Xiangyu Yu...

In Case You Missed It: ARC 'Challenge' Is Not That Challenging

In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Episode 286 · · 24:19

πŸ€— Upvotes: 8 | cs.CL, cs.AI Authors: Łukasz Borchmann Title: In Case You Missed It: ARC 'Challenge' Is Not That Challen...

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Episode 285 · · 20:56

πŸ€— Upvotes: 8 | cs.LG Authors: Ziteng Wang, Jianfei Chen, Jun Zhu Title: ReMoE: Fully Differentiable Mixture-of-Experts ...

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

Episode 284 · · 22:17

πŸ€— Upvotes: 6 | cs.CL Authors: Aakash Mahalingam, Vinesh Kumar Gande, Aman Chadha, Vinija Jain, Divya Chaudhary Title: S...

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

Episode 283 · · 26:24

πŸ€— Upvotes: 5 | cs.CV Authors: Minghao Chen, Roman Shapovalov, Iro Laina, Tom Monnier, Jianyuan Wang, David Novotny, Andrea Vedaldi ...

MotiF: Making Text Count in Image Animation with Motion Focal Loss

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Episode 282 · · 22:35

πŸ€— Upvotes: 3 | cs.CV, cs.AI Authors: Shijie Wang, Samaneh Azadi, Rohit Girdhar, Saketh Rambhatla, Chen Sun, Xi Yin Title: ...

Bridging the Data Provenance Gap Across Text, Speech and Video

Bridging the Data Provenance Gap Across Text, Speech and Video

Episode 281 · · 25:29

πŸ€— Upvotes: 3 | cs.AI, cs.CL, cs.CY, cs.LG, cs.MM Authors: Shayne Longpre, Nikhil Singh, Manuel Cherep, Kushagra Tiwary, Joanna Materzynska, Will...

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Episode 280 · · 21:39

πŸ€— Upvotes: 64 | cs.CL, cs.AI Authors: Junyu Luo, Xiao Luo, Kaize Ding, Jingyang Yuan, Zhiping Xiao, Ming Zhang Title: R...

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Episode 279 · · 20:41

πŸ€— Upvotes: 29 | cs.AI, cs.CL, cs.LG Authors: Weihao Zeng, Yuzhen Huang, Lulu Zhao, Yijun Wang, Zifei Shan, Junxian He Title: ...

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Episode 278 · · 24:00

πŸ€— Upvotes: 26 | cs.CV, cs.LG Authors: Enshu Liu, Xuefei Ning, Yu Wang, Zinan Lin Title: Distilled Decoding 1: One-step ...

Diving into Self-Evolving Training for Multimodal Reasoning

Diving into Self-Evolving Training for Multimodal Reasoning

Episode 277 · · 21:08

πŸ€— Upvotes: 23 | cs.CL, cs.AI, cs.CV, cs.LG Authors: Wei Liu, Junlong Li, Xiwen Zhang, Fan Zhou, Yu Cheng, Junxian He Title: ...

Deliberation in Latent Space via Differentiable Cache Augmentation

Deliberation in Latent Space via Differentiable Cache Augmentation

Episode 276 · · 22:28

πŸ€— Upvotes: 16 | cs.CL, cs.AI, cs.LG Authors: Luyang Liu, Jonas Pfeiffer, Jiaxing Wu, Jun Xie, Arthur Szlam Title: Delib...

Large Motion Video Autoencoding with Cross-modal Video VAE

Large Motion Video Autoencoding with Cross-modal Video VAE

Episode 275 · · 25:08

πŸ€— Upvotes: 15 | cs.CV Authors: Yazhou Xing, Yang Fei, Yingqing He, Jingye Chen, Jiaxin Xie, Xiaowei Chi, Qifeng Chen Title: ...

OpenAI o1 System Card

OpenAI o1 System Card

Episode 274 · · 25:01

πŸ€— Upvotes: 12 | cs.AI Authors: OpenAI, :, Aaron Jaech, Adam Kalai, Adam Lerer, Adam Richardson, Ahmed El-Kishky, Aiden Low, Alec Helyar, Aleksan...

Revisiting In-Context Learning with Long Context Language Models

Revisiting In-Context Learning with Long Context Language Models

Episode 273 · · 23:40

πŸ€— Upvotes: 12 | cs.CL, cs.AI, cs.LG Authors: Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob, Oh, Siddharth Dalmia, Prateek Kolhar ...

Outcome-Refining Process Supervision for Code Generation

Outcome-Refining Process Supervision for Code Generation

Episode 272 · · 21:12

πŸ€— Upvotes: 11 | cs.CL, cs.AI, cs.LG, cs.SE Authors: Zhuohao Yu, Weizheng Gu, Yidong Wang, Zhengran Zeng, Jindong Wang, Wei Ye, Shikun Zhang ...

LearnLM: Improving Gemini for Learning

LearnLM: Improving Gemini for Learning

Episode 271 · · 27:18

πŸ€— Upvotes: 9 | cs.CY, cs.AI, cs.LG Authors: LearnLM Team, Abhinit Modi, Aditya Srikanth Veerubhotla, Aliya Rysbek, Andrea Huber, Brett Wiltshire...

Parallelized Autoregressive Visual Generation

Parallelized Autoregressive Visual Generation

Episode 270 · · 22:32

πŸ€— Upvotes: 34 | cs.CV Authors: Yuqing Wang, Shuhuai Ren, Zhijie Lin, Yujin Han, Haoyuan Guo, Zhenheng Yang, Difan Zou, Jiashi Feng, Xihui Liu ...

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Episode 269 · · 20:59

πŸ€— Upvotes: 19 | cs.LG, cs.AI, cs.CL Authors: Huaijie Wang, Shibo Hao, Hanze Dong, Shenao Zhang, Yilin Bao, Ziran Yang, Yi Wu Title:...