Episodes

Latest Episode
Direct Preference Optimization Using Sparse Feature-Level Constraints

Episode 81 · · 21:15

🤗 Paper Upvotes: 10 | cs.AI, cs.CL | Authors: Qingyu Yin, Chak Tou Leong, Hongbo Zhang, Minjun Zhu, Hanqi Yan, Qiang Zhang, Yulan He, Wenjie Li, J...

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Episode 80 · · 24:28

🤗 Paper Upvotes: 8 | cs.CL | Authors: Wissam Antoun, Francis Kulumba, Rian Touchent, Éric de la Clergerie, Benoît Sagot, Djamé Seddah ...

Can sparse autoencoders be used to decompose and interpret steering vectors?

Episode 79 · · 21:54

🤗 Paper Upvotes: 6 | cs.LG, cs.AI, cs.CL | Authors: Harry Mayne, Yushi Yang, Adam Mahdi | Title: Can sparse autoencoders be...

PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

Episode 78 · · 18:59

🤗 Paper Upvotes: 5 | cs.AI, cs.MM, cs.SD, eess.AS | Authors: Yungang Yi, Weihua Li, Matthew Kuo, Quan Bai | Title: Perceive...

SAMPart3D: Segment Any Part in 3D Objects

Episode 77 · · 20:51

🤗 Paper Upvotes: 18 | cs.CV | Authors: Yunhan Yang, Yukun Huang, Yuan-Chen Guo, Liangjun Lu, Xiaoyang Wu, Edmund Y. Lam, Yan-Pei Cao, Xihui Liu ...

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Episode 76 · · 22:30

🤗 Paper Upvotes: 14 | cs.CV, cs.AI, cs.CL | Authors: Yiyang Ma, Xingchao Liu, Xiaokang Chen, Wen Liu, Chengyue Wu, Zhiyu Wu, Zizheng Pan, Zhenda X...

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Episode 75 · · 27:47

🤗 Paper Upvotes: 13 | cs.AI, cs.CL | Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Radha Poovendran | Title: ...

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Episode 74 · · 20:39

🤗 Paper Upvotes: 11 | cs.CV, cs.AI | Authors: Anas Awadalla, Le Xue, Manli Shu, An Yan, Jun Wang, Senthil Purushwalkam, Sheng Shen, Hannah Lee, Os...

Scaling Properties of Diffusion Models for Perceptual Tasks

Episode 73 · · 25:09

🤗 Paper Upvotes: 7 | cs.CV, cs.AI | Authors: Rahul Ravishankar, Zeeshan Patel, Jathushan Rajasegaran, Jitendra Malik | Title: ...

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Episode 72 · · 22:22

🤗 Paper Upvotes: 5 | cs.CV, cs.AI, cs.LG | Authors: Aditya Sanghi, Aliasghar Khani, Pradyumna Reddy, Arianna Rampini, Derek Cheung, Kamal Rahimi M...

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Episode 71 · · 23:36

🤗 Paper Upvotes: 44 | cs.CV, cs.AI, cs.GR, cs.LG | Authors: Yoad Tewel, Rinon Gal, Dvir Samuel, Yuval Atzmon, Lior Wolf, Gal Chechik ...

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Episode 70 · · 19:47

🤗 Paper Upvotes: 39 | cs.CV, cs.AI | Authors: Cong Wei, Zheyang Xiong, Weiming Ren, Xinrun Du, Ge Zhang, Wenhu Chen | Title: ...

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Episode 69 · · 21:15

🤗 Paper Upvotes: 30 | cs.CL | Authors: Yancheng He, Shilong Li, Jiaheng Liu, Yingshui Tan, Hui Huang, Weixun Wang, Xingyuan Bu, Hangyu Guo, Chengw...

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Episode 68 · · 20:43

🤗 Paper Upvotes: 28 | cs.CL | Authors: Yew Ken Chia, Liying Cheng, Hou Pong Chan, Chaoqun Liu, Maojia Song, Sharifah Mahani Aljunied, Soujanya Por...

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Episode 67 · · 24:47

🤗 Paper Upvotes: 21 | cs.CV, cs.LG | Authors: NVIDIA: Yuval Atzmon, Maciej Bala, Yogesh Balaji, Tiffany Cai, Yin Cui, Jiaojiao Fan, Yunhao Ge, ...

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Episode 66 · · 24:32

🤗 Paper Upvotes: 18 | cs.SE, cs.LG | Authors: Nizar Islah, Justine Gehring, Diganta Misra, Eilif Muller, Irina Rish, Terry Yue Zhuo, Massimo Cacci...

Watermark Anything with Localized Messages

Episode 65 · · 23:25

🤗 Paper Upvotes: 11 | cs.CV, cs.CR | Authors: Tom Sander, Pierre Fernandez, Alain Durmus, Teddy Furon, Matthijs Douze | Title: ...

Autoregressive Models in Vision: A Survey

Episode 64 · · 22:52

🤗 Paper Upvotes: 3 | cs.CV, cs.CL | Authors: Jing Xiong, Gongye Liu, Lun Huang, Chengyue Wu, Taiqiang Wu, Yao Mu, Yuan Yao, Hui Shen, Zhongwei Wan...

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Episode 63 · · 25:29

🤗 Paper Upvotes: 15 | cs.CV, cs.CL | Authors: Weiquan Huang, Aoqi Wu, Yifan Yang, Xufang Luo, Yuqing Yang, Liang Hu, Qi Dai, Xiyang Dai, Dongdong ...

Balancing Pipeline Parallelism with Vocabulary Parallelism

Episode 62 · · 23:34

🤗 Paper Upvotes: 10 | cs.DC | Authors: Man Tsung Yeung, Penghui Qi, Min Lin, Xinyi Wan | Title: Balancing Pipeline Parallel...

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Episode 61 · · 21:47

🤗 Paper Upvotes: 10 | cs.CV | Authors: Yuze He, Yanning Zhou, Wang Zhao, Zhongkai Wu, Kaiwen Xiao, Wei Yang, Yong-Jin Liu, Xiao Han | T...

DELIFT: Data Efficient Language model Instruction Fine Tuning

Episode 60 · · 21:17

🤗 Paper Upvotes: 5 | cs.CL | Authors: Ishika Agarwal, Krishnateja Killamsetty, Lucian Popa, Marina Danilevsky | Title: DELI...

Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study

Episode 59 · · 25:06

🤗 Paper Upvotes: 4 | cs.SE, cs.AI, cs.LG | Authors: André Storhaug, Jingyue Li | Title: Parameter-Efficient Fine-Tuning of ...

RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models

Episode 58 · · 22:22

🤗 Paper Upvotes: 3 | cs.CV, cs.AI | Authors: Maya Varma, Jean-Benoit Delbrouck, Zhihong Chen, Akshay Chaudhari, Curtis Langlotz | Title...

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

Episode 57 · · 24:01

🤗 Paper Upvotes: 3 | cs.CL | Authors: Zhaofeng Wu, Xinyan Velocity Yu, Dani Yogatama, Jiasen Lu, Yoon Kim | Title: The Sema...

Improving the detection of technical debt in Java source code with an enriched dataset

Episode 56 · · 26:17

🤗 Paper Upvotes: 2 | cs.SE | Authors: Nam Le Hai, Anh M. T. Bui, Phuong T. Nguyen, Davide Di Ruscio, Rick Kazman | Title: I...

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Episode 55 · · 22:46

🤗 Paper Upvotes: 69 | cs.CL, cs.PL | Authors: Siming Huang, Tianhao Cheng, Jason Klein Liu, Jiaran Hao, Liuyihan Song, Yang Xu, J. Yang, J. H. Liu...

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Episode 54 · · 19:53

🤗 Paper Upvotes: 50 | cs.CV, cs.AI, cs.GR, cs.LG | Authors: David Junhao Zhang, Roni Paiss, Shiran Zada, Nikhil Karnad, David E. Jacobs, Yael Prit...

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Episode 53 · · 25:23

🤗 Paper Upvotes: 41 | cs.CL, cs.LG | Authors: Hongyu Wang, Shuming Ma, Furu Wei | Title: BitNet a4.8: 4-bit Activations for...

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Episode 52 · · 23:01

🤗 Paper Upvotes: 27 | cs.CV, cs.AI, cs.GR | Authors: Wenqiang Sun, Shuo Chen, Fangfu Liu, Zilong Chen, Yueqi Duan, Jun Zhang, Yikai Wang ...