Episodes

Latest Episode
DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

Episode 253 · · 23:08

🤗 Upvotes: 8 | cs.CV, cs.AI, cs.GR Authors: Wang Zhao, Yan-Pei Cao, Jiale Xu, Yuejiang Dong, Ying Shan Title: DI-PCG: D...

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Episode 252 · · 24:09

🤗 Upvotes: 7 | cs.CL, cs.AI, cs.LG Authors: Zihan Liu, Yang Chen, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping Title: Ac...

No More Adam: Learning Rate Scaling at Initialization is All You Need

Episode 251 · · 21:59

🤗 Upvotes: 177 | cs.LG, cs.AI Authors: Minghao Xu, Lichuan Xiang, Xu Cai, Hongkai Wen Title: No More Adam: Learning Rat...

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Episode 250 · · 21:56

🤗 Upvotes: 36 | cs.CL, cs.AI Authors: Benjamin Warner, Antoine Chaffin, Benjamin Clavié, Orion Weller, Oskar Hallström, Said Taghadouini, Alexis...

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Episode 249 · · 24:45

🤗 Upvotes: 30 | cs.CL Authors: Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong ...

AniDoc: Animation Creation Made Easier

Episode 248 · · 22:20

🤗 Upvotes: 29 | cs.CV Authors: Yihao Meng, Hao Ouyang, Hanlin Wang, Qiuyu Wang, Wen Wang, Ka Leong Cheng, Zhiheng Liu, Yujun Shen, Huamin Qu ...

FashionComposer: Compositional Fashion Image Generation

Episode 247 · · 19:47

🤗 Upvotes: 13 | cs.CV Authors: Sihui Ji, Yiyang Wang, Xi Chen, Xiaogang Xu, Hao Luo, Hengshuang Zhao Title: FashionComp...

GUI Agents: A Survey

Episode 246 · · 21:01

🤗 Upvotes: 11 | cs.AI, cs.HC Authors: Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, Ryan Aponte, Y...

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Episode 245 · · 22:42

🤗 Upvotes: 10 | cs.LG, cs.RO Authors: Moritz Reuss, Jyothish Pari, Pulkit Agrawal, Rudolf Lioutikov Title: Efficient Di...

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

Episode 244 · · 20:41

🤗 Upvotes: 10 | cs.CV Authors: Haotong Lin, Sida Peng, Jingxiao Chen, Songyou Peng, Jiaming Sun, Minghuan Liu, Hujun Bao, Jiashi Feng, Xiaowei Z...

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Episode 243 · · 20:52

🤗 Upvotes: 9 | cs.CV Authors: Jihan Yang, Shusheng Yang, Anjali W. Gupta, Rilyn Han, Li Fei-Fei, Saining Xie Title: Thi...

Are Your LLMs Capable of Stable Reasoning?

Episode 242 · · 24:11

🤗 Upvotes: 61 | cs.AI, cs.CL Authors: Junnan Liu, Hongwei Liu, Linchen Xiao, Ziyi Wang, Kuikun Liu, Songyang Gao, Wenwei Zhang, Songyang Zhang, ...

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Episode 241 · · 22:34

🤗 Upvotes: 29 | cs.AI, cs.CL, cs.CV Authors: YiFan Zhang, Shanglin Lei, Runqi Qiao, Zhuoma GongQue, Xiaoshuai Song, Guanting Dong, Qiuna Tan, Zh...

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Episode 240 · · 23:15

🤗 Upvotes: 29 | cs.CL Authors: Shuting Wang, Jiejun Tan, Zhicheng Dou, Ji-Rong Wen Title: OmniEval: An Omnidirectional ...

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Episode 239 · · 23:05

🤗 Upvotes: 21 | cs.CL Authors: Jeffrey Cheng, Benjamin Van Durme Title: Compressed Chain of Thought: Efficient Reasonin...

Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers

Episode 238 · · 22:52

🤗 Upvotes: 9 | cs.CL, cs.AI, cs.LG Authors: Seungwook Han, Jinyeop Song, Jeff Gore, Pulkit Agrawal Title: Emergence of ...

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Episode 237 · · 20:44

🤗 Upvotes: 7 | cs.CV Authors: Mark Endo, Xiaohan Wang, Serena Yeung-Levy Title: Feather the Throttle: Revisiting Visual...

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Episode 236 · · 23:53

🤗 Upvotes: 5 | cs.LG, cs.AI, cs.CV Authors: Yifei Zhou, Qianlan Yang, Kaixiang Lin, Min Bai, Xiong Zhou, Yu-Xiong Wang, Sergey Levine, Erran Li ...

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Episode 235 · · 23:12

🤗 Upvotes: 4 | cs.CL Authors: Manan Suri, Puneet Mathur, Franck Dernoncourt, Kanika Goswami, Ryan A. Rossi, Dinesh Manocha Title: ...

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Episode 234 · · 20:27

🤗 Upvotes: 2 | cs.CV Authors: Yufan Zhou, Ruiyi Zhang, Jiuxiang Gu, Nanxuan Zhao, Jing Shi, Tong Sun Title: SUGAR: Subj...

Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion

Episode 233 · · 20:33

🤗 Upvotes: 2 | cs.CV, cs.LG Authors: Massimiliano Viola, Kevin Qu, Nando Metzger, Bingxin Ke, Alexander Becker, Konrad Schindler, Anton Obukhov ...

Byte Latent Transformer: Patches Scale Better Than Tokens

Episode 232 · · 25:08

🤗 Upvotes: 39 | cs.CL Authors: Artidoro Pagnoni, Ram Pasunuru, Pedro Rodriguez, John Nguyen, Benjamin Muller, Margaret Li, Chunting Zhou, Lili Y...

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Episode 231 · · 21:46

🤗 Upvotes: 25 | cs.CL, cs.AI, cs.IR Authors: Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yongkang Wu, Zhonghua Li, Qi Ye, Zhicheng Dou Title...

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Episode 230 · · 21:10

🤗 Upvotes: 25 | cs.CV, cs.AI, cs.CL Authors: Fan Zhang, Shulin Tian, Ziqi Huang, Yu Qiao, Ziwei Liu Title: Evaluation A...

BrushEdit: All-In-One Image Inpainting and Editing

Episode 229 · · 27:48

🤗 Upvotes: 24 | cs.CV, cs.AI Authors: Yaowei Li, Yuxuan Bian, Xuan Ju, Zhaoyang Zhang, Ying Shan, Yuexian Zou, Qiang Xu Title: ...

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Episode 228 · · 22:32

🤗 Upvotes: 20 | cs.CV Authors: Junhao Zhuang, Xuan Ju, Zhaoyang Zhang, Yong Liu, Shiyi Zhang, Chun Yuan, Ying Shan Title: ...

Smaller Language Models Are Better Instruction Evolvers

Episode 227 · · 23:17

🤗 Upvotes: 16 | cs.CL Authors: Tingfeng Hui, Lulu Zhao, Guanting Dong, Yaqi Zhang, Hua Zhou, Sen Su Title: Smaller Lang...

Causal Diffusion Transformers for Generative Modeling

Episode 226 · · 23:47

🤗 Upvotes: 16 | cs.CV Authors: Chaorui Deng, Deyao Zhu, Kunchang Li, Shi Guang, Haoqi Fan Title: Causal Diffusion Trans...

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Episode 225 · · 23:05

🤗 Upvotes: 11 | cs.CL, cs.AI, cs.LG Authors: Jiale Cheng, Xiao Liu, Cunxiang Wang, Xiaotao Gu, Yida Lu, Dan Zhang, Yuxiao Dong, Jie Tang, Hongni...

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Episode 224 · · 20:29

🤗 Upvotes: 11 | cs.CV Authors: Zhibing Li, Tong Wu, Jing Tan, Mengchen Zhang, Jiaqi Wang, Dahua Lin Title: IDArb: Intri...