Episodes

Latest Episode
AnimateAnything: Consistent and Controllable Animation for Video Generation

Episode 98 · 22:15

🤗 Paper Upvotes: 12 | cs.CV Authors: Guojun Lei, Chi Wang, Hong Li, Rong Zhang, Yikai Wang, Weiwei Xu Title: AnimateAny...

Top-$nσ$: Not All Logits Are You Need

Episode 97 · 21:18

🤗 Paper Upvotes: 12 | cs.LG Authors: Chenxia Tang, Jianchun Liu, Hongli Xu, Liusheng Huang Title: Top-$nσ$: Not All Log...

Drowning in Documents: Consequences of Scaling Reranker Inference

Episode 96 · 21:41

🤗 Paper Upvotes: 10 | cs.IR, cs.CL, cs.LG Authors: Mathew Jacob, Erik Lindgren, Matei Zaharia, Michael Carbin, Omar Khattab, Andrew Drozdov ...

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Episode 95 · 25:53

🤗 Paper Upvotes: 10 | cs.CL Authors: Thang M. Pham, Phat T. Nguyen, Seunghyun Yoon, Viet Dac Lai, Franck Dernoncourt, Trung Bui Tit...

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Episode 94 · 19:48

🤗 Paper Upvotes: 8 | cs.CV Authors: Jinqiang Long, Yanqi Dai, Guoxing Yang, Hongpeng Lin, Nanyi Fei, Yizhao Gao, Zhiwu Lu Title: ...

SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers

Episode 93 · 27:31

🤗 Paper Upvotes: 8 | cs.LG Authors: Joseph Liu, Joshua Geddes, Ziyu Guo, Haomiao Jiang, Mahesh Kumar Nandwana Title: Sm...

LLäMmlein: Compact and Competitive German-Only Language Models from Scratch

Episode 92 · 22:13

🤗 Paper Upvotes: 7 | cs.CL, cs.AI, cs.LG Authors: Jan Pfister, Julia Wunderle, Andreas Hotho Title: LLäMmlein: Compact ...

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Episode 91 · 25:35

🤗 Paper Upvotes: 64 | cs.CV Authors: Guowei Xu, Peng Jin, Li Hao, Yibing Song, Lichao Sun, Li Yuan Title: LLaVA-o1: Let...

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Episode 90 · 24:29

🤗 Paper Upvotes: 19 | cs.CV, cs.AI, cs.GR Authors: Yushi Lan, Shangchen Zhou, Zhaoyang Lyu, Fangzhou Hong, Shuai Yang, Bo Dai, Xingang Pan, Chen...

Xmodel-1.5: An 1B-scale Multilingual LLM

Episode 89 · 20:52

🤗 Paper Upvotes: 7 | cs.CL Authors: Wang Qun, Liu Yang, Lin Qingquan, Jiang Ling Title: Xmodel-1.5: An 1B-scale Multili...

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Episode 88 · 23:53

🤗 Paper Upvotes: 32 | cs.LG, cs.AI, cs.CL, cs.CV, 68T05, I.3.5; I.2.10; I.2.6 Authors: Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun...

MagicQuill: An Intelligent Interactive Image Editing System

Episode 87 · 20:13

🤗 Paper Upvotes: 31 | cs.CV Authors: Zichen Liu, Yue Yu, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Wen Wang, Zhiheng Liu, Qifeng Chen, Yujun Shen ...

Cut Your Losses in Large-Vocabulary Language Models

Episode 86 · 20:48

🤗 Paper Upvotes: 15 | cs.LG, cs.CL Authors: Erik Wijmans, Brody Huval, Alexander Hertzberg, Vladlen Koltun, Philipp Krähenbühl Titl...

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Episode 85 · 23:44

🤗 Paper Upvotes: 9 | cs.CL Authors: Canyu Chen, Jian Yu, Shan Chen, Che Liu, Zhongwei Wan, Danielle Bitterman, Fei Wang, Kai Shu Ti...

Sharingan: Extract User Action Sequence from Desktop Recordings

Episode 84 · 22:38

🤗 Paper Upvotes: 3 | cs.CV, cs.AI Authors: Yanting Chen, Yi Ren, Xiaoting Qin, Jue Zhang, Kehong Yuan, Lu Han, Qingwei Lin, Dongmei Zhang, Sarav...

Hermes: A Large Language Model Framework on the Journey to Autonomous Networks

Episode 83 · 22:27

🤗 Paper Upvotes: 2 | cs.AI, cs.NI Authors: Fadhel Ayed, Ali Maatouk, Nicola Piovesan, Antonio De Domenico, Merouane Debbah, Zhi-Quan Luo ...

Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples

Episode 82 · 22:09

🤗 Paper Upvotes: 2 | cs.LG, cs.AI Authors: Noël Vouitsis, Rasa Hosseinzadeh, Brendan Leigh Ross, Valentin Villecroze, Satya Krishna Gorti, Jesse...

Direct Preference Optimization Using Sparse Feature-Level Constraints

Episode 81 · 21:15

🤗 Paper Upvotes: 10 | cs.AI, cs.CL Authors: Qingyu Yin, Chak Tou Leong, Hongbo Zhang, Minjun Zhu, Hanqi Yan, Qiang Zhang, Yulan He, Wenjie Li, J...

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Episode 80 · 24:28

🤗 Paper Upvotes: 8 | cs.CL Authors: Wissam Antoun, Francis Kulumba, Rian Touchent, Éric de la Clergerie, Benoît Sagot, Djamé Seddah ...

Can sparse autoencoders be used to decompose and interpret steering vectors?

Episode 79 · 21:54

🤗 Paper Upvotes: 6 | cs.LG, cs.AI, cs.CL Authors: Harry Mayne, Yushi Yang, Adam Mahdi Title: Can sparse autoencoders be...

PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

Episode 78 · 18:59

🤗 Paper Upvotes: 5 | cs.AI, cs.MM, cs.SD, eess.AS Authors: Yungang Yi, Weihua Li, Matthew Kuo, Quan Bai Title: Perceive...

SAMPart3D: Segment Any Part in 3D Objects

Episode 77 · 20:51

🤗 Paper Upvotes: 18 | cs.CV Authors: Yunhan Yang, Yukun Huang, Yuan-Chen Guo, Liangjun Lu, Xiaoyang Wu, Edmund Y. Lam, Yan-Pei Cao, Xihui Liu ...

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Episode 76 · 22:30

🤗 Paper Upvotes: 14 | cs.CV, cs.AI, cs.CL Authors: Yiyang Ma, Xingchao Liu, Xiaokang Chen, Wen Liu, Chengyue Wu, Zhiyu Wu, Zizheng Pan, Zhenda X...

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Episode 75 · 27:47

🤗 Paper Upvotes: 13 | cs.AI, cs.CL Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Radha Poovendran Title: ...

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Episode 74 · 20:39

🤗 Paper Upvotes: 11 | cs.CV, cs.AI Authors: Anas Awadalla, Le Xue, Manli Shu, An Yan, Jun Wang, Senthil Purushwalkam, Sheng Shen, Hannah Lee, Os...

Scaling Properties of Diffusion Models for Perceptual Tasks

Episode 73 · 25:09

🤗 Paper Upvotes: 7 | cs.CV, cs.AI Authors: Rahul Ravishankar, Zeeshan Patel, Jathushan Rajasegaran, Jitendra Malik Title: ...

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Episode 72 · 22:22

🤗 Paper Upvotes: 5 | cs.CV, cs.AI, cs.LG Authors: Aditya Sanghi, Aliasghar Khani, Pradyumna Reddy, Arianna Rampini, Derek Cheung, Kamal Rahimi M...

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Episode 71 · 23:36

🤗 Paper Upvotes: 44 | cs.CV, cs.AI, cs.GR, cs.LG Authors: Yoad Tewel, Rinon Gal, Dvir Samuel, Yuval Atzmon, Lior Wolf, Gal Chechik ...

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Episode 70 · 19:47

🤗 Paper Upvotes: 39 | cs.CV, cs.AI Authors: Cong Wei, Zheyang Xiong, Weiming Ren, Xinrun Du, Ge Zhang, Wenhu Chen Title: ...

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Episode 69 · 21:15

🤗 Paper Upvotes: 30 | cs.CL Authors: Yancheng He, Shilong Li, Jiaheng Liu, Yingshui Tan, Hui Huang, Weixun Wang, Xingyuan Bu, Hangyu Guo, Chengw...