Episodes

Latest Episode
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Episode 695 · · 22:41

πŸ€— Upvotes: 69 | cs.CL Authors: Shizhe Diao, Yu Yang, Yonggan Fu, Xin Dong, Dan Su, Markus Kliegl, Zijia Chen, Peter Belcak, Yoshi Suhara, Hongxu...

Antidistillation Sampling

Antidistillation Sampling

Episode 694 · · 18:20

πŸ€— Upvotes: 52 | cs.AI, cs.CL Authors: Yash Savani, Asher Trockman, Zhili Feng, Avi Schwarzschild, Alexander Robey, Marc Finzi, J. Zico Kolter ...

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Episode 693 · · 20:18

πŸ€— Upvotes: 28 | cs.CV Authors: Tsung-Han Wu, Heekyung Lee, Jiaxin Ge, Joseph E. Gonzalez, Trevor Darrell, David M. Chan Title: ...

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Episode 692 · · 24:12

πŸ€— Upvotes: 24 | cs.CV Authors: Lvmin Zhang, Maneesh Agrawala Title: Packing Input Frame Context in Next-Frame Predictio...

WORLDMEM: Long-term Consistent World Simulation with Memory

WORLDMEM: Long-term Consistent World Simulation with Memory

Episode 691 · · 22:27

πŸ€— Upvotes: 23 | cs.CV Authors: Zeqi Xiao, Yushi Lan, Yifan Zhou, Wenqi Ouyang, Shuai Yang, Yanhong Zeng, Xingang Pan Title: ...

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Episode 690 · · 23:36

πŸ€— Upvotes: 23 | cs.CL, cs.AI, cs.LG Authors: Xin Gao, Qizhi Pei, Zinan Tang, Yu Li, Honglin Lin, Jiang Wu, Conghui He, Lijun Wu Tit...

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Episode 689 · · 22:18

πŸ€— Upvotes: 35 | cs.CV, cs.AI, cs.CL, cs.LG Authors: Yijun Liang, Ming Li, Chenrui Fan, Ziyue Li, Dang Nguyen, Kwesi Cobbina, Shweta Bhardwaj, Ji...

BitNet b1.58 2B4T Technical Report

BitNet b1.58 2B4T Technical Report

Episode 688 · · 19:43

πŸ€— Upvotes: 35 | cs.CL, cs.LG Authors: Shuming Ma, Hongyu Wang, Shaohan Huang, Xingxing Zhang, Ying Hu, Ting Song, Yan Xia, Furu Wei ...

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Episode 687 · · 23:19

πŸ€— Upvotes: 27 | cs.CL, cs.AI Authors: Jiazhan Feng, Shijue Huang, Xingwei Qu, Ge Zhang, Yujia Qin, Baoquan Zhong, Chengquan Jiang, Jinxin Chi, W...

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Episode 686 · · 21:56

πŸ€— Upvotes: 63 | cs.CL Authors: Ding Chen, Qingchen Yu, Pengyuan Wang, Wentao Zhang, Bo Tang, Feiyu Xiong, Xinchi Li, Minchuan Yang, Zhiyu Li ...

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Episode 685 · · 20:14

πŸ€— Upvotes: 41 | cs.CL, cs.AI, cs.LG Authors: Fangzhi Xu, Hang Yan, Chang Ma, Haiteng Zhao, Qiushi Sun, Kanzhi Cheng, Junxian He, Jun Liu, Zhiyon...

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Episode 684 · · 22:09

πŸ€— Upvotes: 30 | cs.LG, cs.AI, cs.CL Authors: Ming Li, Yanhong Li, Ziyue Li, Tianyi Zhou Title: How Instruction and Reas...

Heimdall: test-time scaling on the generative verification

Heimdall: test-time scaling on the generative verification

Episode 683 · · 19:59

πŸ€— Upvotes: 28 | cs.AI, I.2.7 Authors: Wenlei Shi, Xing Jin Title: Heimdall: test-time scaling on the generative verific...

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Episode 682 · · 20:48

πŸ€— Upvotes: 23 | cs.CV Authors: Tao Zhang, Xiangtai Li, Zilong Huang, Yanwei Li, Weixian Lei, Xueqing Deng, Shihao Chen, Shunping Ji, Jiashi Feng...

TextArena

TextArena

Episode 681 · · 22:23

πŸ€— Upvotes: 21 | cs.CL, cs.AI, cs.LG, cs.MA Authors: Leon Guertler, Bobby Cheng, Simon Yu, Bo Liu, Leshem Choshen, Cheston Tan Title...

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Episode 680 · · 22:48

πŸ€— Upvotes: 172 | cs.CV Authors: Jinguo Zhu, Weiyun Wang, Zhe Chen, Zhaoyang Liu, Shenglong Ye, Lixin Gu, Yuchen Duan, Hao Tian, Weijie Su, Jie S...

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Episode 679 · · 23:55

πŸ€— Upvotes: 95 | cs.DC, cs.AI, 68T50, I.2.7; I.2.11 Authors: Zonghang Li, Tao Li, Wenjiao Feng, Mohsen Guizani, Hongfang Yu Title: ...

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Episode 678 · · 20:43

πŸ€— Upvotes: 36 | cs.LG, cs.AI Authors: Haozhe Wang, Chao Qu, Zuming Huang, Wei Chu, Fangzhen Lin, Wenhu Chen Title: VL-R...

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Episode 677 · · 20:26

πŸ€— Upvotes: 35 | cs.CV Authors: Zheng Liu, Mengjie Liu, Jingzhou Chen, Jingwei Xu, Bin Cui, Conghui He, Wentao Zhang Title: ...

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Episode 676 · · 20:08

πŸ€— Upvotes: 29 | cs.CL, cs.IR, cs.SE Authors: Nikita Sorokin, Ivan Sedykh, Valentin Malykh Title: Iterative Self-Trainin...

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Episode 675 · · 22:48

πŸ€— Upvotes: 83 | cs.CV, cs.AI Authors: Team Seawead, Ceyuan Yang, Zhijie Lin, Yang Zhao, Shanchuan Lin, Zhibei Ma, Haoyuan Guo, Hao Chen, Lu Qi, ...

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Episode 674 · · 18:35

πŸ€— Upvotes: 32 | cs.CV Authors: Tianwei Xiong, Jun Hao Liew, Zilong Huang, Jiashi Feng, Xihui Liu Title: GigaTok: Scalin...

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Episode 673 · · 19:12

πŸ€— Upvotes: 25 | cs.CV, cs.AI Authors: Junliang Guo, Yang Ye, Tianyu He, Haoyu Wu, Yushu Jiang, Tim Pearce, Jiang Bian Title: ...

Kimi-VL Technical Report

Kimi-VL Technical Report

Episode 672 · · 23:04

πŸ€— Upvotes: 71 | cs.CV Authors: Kimi Team, Angang Du, Bohong Yin, Bowei Xing, Bowen Qu, Bowen Wang, Cheng Chen, Chenlin Zhang, Chenzhuang Du, Chu...

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Episode 671 · · 22:06

πŸ€— Upvotes: 37 | cs.LG Authors: Zhongyang Li, Ziyue Li, Tianyi Zhou Title: C3PO: Critical-Layer, Core-Expert, Collaborat...

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Episode 670 · · 22:15

πŸ€— Upvotes: 34 | cs.CV, cs.AI, cs.CL Authors: Yukun Qi, Yiming Zhao, Yu Zeng, Xikun Bao, Wenxuan Huang, Lin Chen, Zehui Chen, Jie Zhao, Zhongang ...

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Episode 669 · · 25:44

πŸ€— Upvotes: 33 | cs.CL Authors: Sara Vera MarjanoviΔ‡, Arkil Patel, Vaibhav Adlakha, Milad Aghajohari, Parishad BehnamGhader, Mehar Bhatia, Aditi ...

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Episode 668 · · 20:05

πŸ€— Upvotes: 33 | cs.CV Authors: Zhong-Yu Li, Ruoyi Du, Juncheng Yan, Le Zhuo, Zhen Li, Peng Gao, Zhanyu Ma, Ming-Ming Cheng Title: ...

MM-IFEngine: Towards Multimodal Instruction Following

MM-IFEngine: Towards Multimodal Instruction Following

Episode 667 · · 22:11

πŸ€— Upvotes: 26 | cs.CV Authors: Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin...

HoloPart: Generative 3D Part Amodal Segmentation

HoloPart: Generative 3D Part Amodal Segmentation

Episode 666 · · 22:40

πŸ€— Upvotes: 23 | cs.CV Authors: Yunhan Yang, Yuan-Chen Guo, Yukun Huang, Zi-Xin Zou, Zhipeng Yu, Yangguang Li, Yan-Pei Cao, Xihui Liu ...