Episodes

Latest Episode
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Episode 635 · · 26:17

🤗 Upvotes: 33 | cs.CL Authors: Sambal Shikhar, Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jean Lahoud, Fahad Khan, Rao Muhammad Anwer, Sal...

EgoLife: Towards Egocentric Life Assistant

Episode 634 · · 22:08

🤗 Upvotes: 21 | cs.CV Authors: Jingkang Yang, Shuai Liu, Hongming Guo, Yuhao Dong, Xiamengwei Zhang, Sicheng Zhang, Pengyun Wang, Zitang Zhou, B...

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Episode 633 · · 18:09

🤗 Upvotes: 42 | cs.CL, cs.AI Authors: Yiran Zhao, Chaoqun Liu, Yue Deng, Jiahao Ying, Mahani Aljunied, Zhaodonghui Li, Lidong Bing, Hou Pong Cha...

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

Episode 632 · · 24:25

🤗 Upvotes: 27 | cs.CL, cs.HC Authors: Tin Nguyen, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen Title: HoT: Hig...

Process-based Self-Rewarding Language Models

Episode 631 · · 23:46

🤗 Upvotes: 27 | cs.CL, cs.AI Authors: Shimao Zhang, Xiao Liu, Xin Zhang, Junxiao Liu, Zheheng Luo, Shujian Huang, Yeyun Gong Title:...

Visual-RFT: Visual Reinforcement Fine-Tuning

Episode 630 · · 22:49

🤗 Upvotes: 44 | cs.CV Authors: Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang Title:...

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Episode 629 · · 25:44

🤗 Upvotes: 42 | cs.CL, cs.AI, cs.LG Authors: Abdelrahman Abouelenin, Atabak Ashfaq, Adam Atkinson, Hany Awadalla, Nguyen Bach, Jianmin Bao, Alon...

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Episode 628 · · 19:04

🤗 Upvotes: 30 | cs.CV Authors: Jay Zhangjie Wu, Yuxuan Zhang, Haithem Turki, Xuanchi Ren, Jun Gao, Mike Zheng Shou, Sanja Fidler, Zan Gojcic, Hu...

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Episode 627 · · 22:49

🤗 Upvotes: 27 | cs.AI Authors: Zhuoqun Li, Haiyang Yu, Xuanang Chen, Hongyu Lin, Yaojie Lu, Fei Huang, Xianpei Han, Yongbin Li, Le Sun ...

Chain of Draft: Thinking Faster by Writing Less

Episode 626 · · 22:37

🤗 Upvotes: 27 | cs.CL, I.2.7 Authors: Silei Xu, Wenhao Xie, Lingxiao Zhao, Pengcheng He Title: Chain of Draft: Thinking...

Multi-Turn Code Generation Through Single-Step Rewards

Episode 625 · · 25:33

🤗 Upvotes: 21 | cs.LG, cs.AI, cs.CL Authors: Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen, Alexander M Rush, Wenting Zhao, Sanjiban ...

Self-rewarding correction for mathematical reasoning

Episode 624 · · 24:30

🤗 Upvotes: 51 | cs.AI, cs.LG Authors: Wei Xiong, Hanning Zhang, Chenlu Ye, Lichang Chen, Nan Jiang, Tong Zhang Title: S...

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Episode 623 · · 23:19

🤗 Upvotes: 44 | cs.CV, cs.AI Authors: Jiazhen Pan, Che Liu, Junde Wu, Fenglin Liu, Jiayuan Zhu, Hongwei Bran Li, Chen Chen, Cheng Ouyang, Daniel...

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Episode 622 · · 22:25

🤗 Upvotes: 33 | cs.LG Authors: Zhongyang Li, Ziyue Li, Tianyi Zhou Title: R2-T2: Re-Routing in Test-Time for Multimodal...

LongRoPE2: Near-Lossless LLM Context Window Scaling

Episode 621 · · 23:05

🤗 Upvotes: 21 | cs.CL Authors: Ning Shang, Li Lyna Zhang, Siyuan Wang, Gaokai Zhang, Gilsinia Lopez, Fan Yang, Weizhu Chen, Mao Yang ...

FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Episode 620 · · 26:27

🤗 Upvotes: 19 | cs.CL Authors: Guizhen Chen, Weiwen Xu, Hao Zhang, Hou Pong Chan, Chaoqun Liu, Lidong Bing, Deli Zhao, Anh Tuan Luu, Yu Rong ...

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Episode 619 · · 21:52

🤗 Upvotes: 15 | cs.CL, cs.AI, cs.SE Authors: Chenlong Wang, Zhaoyang Chu, Zhengxiang Cheng, Xuyi Yang, Kaiyue Qiu, Yao Wan, Zhou Zhao, Xuanhua S...

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Episode 618 · · 24:43

🤗 Upvotes: 15 | cs.CV, cs.AI Authors: Chuofan Ma, Yi Jiang, Junfeng Wu, Jihan Yang, Xin Yu, Zehuan Yuan, Bingyue Peng, Xiaojuan Qi ...

NeoBERT: A Next-Generation BERT

Episode 617 · · 23:40

🤗 Upvotes: 11 | cs.CL, cs.AI Authors: Lola Le Breton, Quentin Fournier, Mariam El Mezouar, Sarath Chandar Title: NeoBER...

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

Episode 616 · · 21:30

🤗 Upvotes: 9 | cs.LG, cs.AI Authors: Chenghua Huang, Lu Wang, Fangkai Yang, Pu Zhao, Zhixu Li, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi ...

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Episode 615 · · 22:16

🤗 Upvotes: 9 | cs.CV, cs.CL Authors: Liang Chen, Shuai Bai, Wenhao Chai, Weichu Xie, Haozhe Zhao, Leon Vinci, Junyang Lin, Baobao Chang ...

GHOST 2.0: generative high-fidelity one shot transfer of heads

Episode 614 · · 18:41

🤗 Upvotes: 49 | cs.CV Authors: Alexander Groshev, Anastasiia Iashchenko, Pavel Paramonov, Denis Dimitrov, Andrey Kuznetsov Title: ...

Kanana: Compute-efficient Bilingual Language Models

Episode 613 · · 22:05

🤗 Upvotes: 47 | cs.CL, cs.LG Authors: Kanana LLM Team, Yunju Bak, Hojin Lee, Minho Ryu, Jiyeon Ham, Seungjae Jung, Daniel Wontae Nam, Taegyeong ...

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Episode 612 · · 22:22

🤗 Upvotes: 32 | cs.AI, cs.CL, cs.CV, cs.MM Authors: Max Ku, Thomas Chong, Jonathan Leung, Krish Shah, Alvin Yu, Wenhu Chen Title: ...

Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance

Episode 611 · · 24:57

🤗 Upvotes: 27 | cs.CL Authors: Xueqing Peng, Triantafillos Papadopoulos, Efstathia Soufleri, Polydoros Giannouris, Ruoyu Xiang, Yan Wang, Lingfe...

Language Models' Factuality Depends on the Language of Inquiry

Episode 610 · · 22:25

🤗 Upvotes: 19 | cs.CL, cs.AI Authors: Tushar Aggarwal, Kumar Tanmay, Ayush Agrawal, Kumar Ayush, Hamid Palangi, Paul Pu Liang Title...

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Episode 609 · · 24:04

🤗 Upvotes: 16 | cs.CL Authors: Yancheng He, Shilong Li, Jiaheng Liu, Weixun Wang, Xingyuan Bu, Ge Zhang, Zhongyuan Peng, Zhaoxiang Zhang, Zhiche...

Towards an AI co-scientist

Episode 608 · · 25:03

🤗 Upvotes: 15 | cs.AI, cs.CL, cs.HC, cs.LG, physics.soc-ph, q-bio.OT Authors: Juraj Gottweis, Wei-Hung Weng, Alexander Daryin, Tao Tu, Anil Pale...

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Episode 607 · · 18:21

🤗 Upvotes: 15 | cs.CL, cs.AI Authors: Hao Peng, Yunjia Qi, Xiaozhi Wang, Zijun Yao, Bin Xu, Lei Hou, Juanzi Li Title: A...

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Episode 606 · · 22:49

🤗 Upvotes: 13 | cs.LG, cs.SE Authors: Shiven Sinha, Shashwat Goel, Ponnurangam Kumaraguru, Jonas Geiping, Matthias Bethge, Ameya Prabhu ...