Episodes

Latest Episode
VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Episode 817 · · 20:58

🤗 Upvotes: 35 | cs.CL, cs.CV Authors: Yunxin Li, Xinyu Chen, Zitao Li, Zhenyu Liu, Longyue Wang, Wenhan Luo, Baotian Hu, Min Zhang ...

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Episode 816 · · 20:46

🤗 Upvotes: 178 | cs.CL, cs.AI Authors: Khalil Hennara, Muhammad Hreden, Mohamed Motaism Hamed, Zeina Aldallal, Sara Chrouf, Safwan AlModhayan ...

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Episode 815 · · 22:14

🤗 Upvotes: 124 | cs.CL, cs.AI, cs.CV Authors: Xuyang Liu, Zichen Wen, Shaobo Wang, Junjie Chen, Zhishan Tao, Yubo Wang, Xiangqi Jin, Chang Zou, ...

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Episode 814 · · 19:19

🤗 Upvotes: 58 | cs.CV Authors: Valerii Startsev, Alexander Ustyuzhanin, Alexey Kirillov, Dmitry Baranchuk, Sergey Kastryulin Title:...

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

Episode 813 · · 23:32

🤗 Upvotes: 56 | cs.AI, cs.CE, cs.CL Authors: Guilong Lu, Xuntao Guo, Rongjunchen Zhang, Wenqiao Zhu, Ji Liu Title: BizF...

PATS: Process-Level Adaptive Thinking Mode Switching

PATS: Process-Level Adaptive Thinking Mode Switching

Episode 812 · · 21:12

🤗 Upvotes: 44 | cs.CL Authors: Yi Wang, Junxiao Liu, Shimao Zhang, Jiajun Chen, Shujian Huang Title: PATS: Process-Leve...

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Episode 811 · · 20:57

🤗 Upvotes: 42 | cs.CL Authors: Taeyoon Kwon, Dongwook Choi, Sunghwan Kim, Hyojun Kim, Seungjun Moon, Beong-woo Kwak, Kuan-Hao Huang, Jinyoung Ye...

ARM: Adaptive Reasoning Model

ARM: Adaptive Reasoning Model

Episode 810 · · 22:44

🤗 Upvotes: 40 | cs.CL Authors: Siye Wu, Jian Xie, Yikai Zhang, Aili Chen, Kai Zhang, Yu Su, Yanghua Xiao Title: ARM: Ad...

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Episode 809 · · 21:22

🤗 Upvotes: 33 | cs.CL, cs.AI Authors: Jiangjie Chen, Qianyu He, Siyu Yuan, Aili Chen, Zhicheng Cai, Weinan Dai, Hongli Yu, Qiying Yu, Xuefeng Li...

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Episode 808 · · 21:22

🤗 Upvotes: 33 | cs.CL, cs.AI Authors: Junnan Liu, Hongwei Liu, Linchen Xiao, Shudong Liu, Taolin Zhang, Zihan Ma, Songyang Zhang, Kai Chen ...

B-score: Detecting biases in large language models using response history

B-score: Detecting biases in large language models using response history

Episode 807 · · 23:16

🤗 Upvotes: 25 | cs.LG, cs.CL Authors: An Vo, Mohammad Reza Taesiri, Daeyoung Kim, Anh Totti Nguyen Title: B-score: Dete...

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Episode 806 · · 20:40

🤗 Upvotes: 95 | cs.LG, cs.CL Authors: Alan Arazi, Eilam Shapira, Roi Reichart Title: TabSTAR: A Foundation Tabular Mode...

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Episode 805 · · 24:07

🤗 Upvotes: 60 | cs.CL Authors: Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou,...

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Episode 804 · · 22:46

🤗 Upvotes: 55 | cs.LG Authors: Roberto L. Castro, Andrei Panferov, Soroush Tabesh, Oliver Sieberling, Jiale Chen, Mahdi Nikdan, Saleh Ashkboos, ...

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Episode 803 · · 20:36

🤗 Upvotes: 51 | cs.AI Authors: Doohyuk Jang, Yoonjeon Kim, Chanjae Park, Hyun Ryu, Eunho Yang Title: Reasoning Model is...

One RL to See Them All: Visual Triple Unified Reinforcement Learning

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Episode 802 · · 20:04

🤗 Upvotes: 51 | cs.CV, cs.CL Authors: Yan Ma, Linge Du, Xuyang Shen, Shaoxiang Chen, Pengfei Li, Qibing Ren, Lizhuang Ma, Yuchao Dai, Pengfei Li...

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Episode 801 · · 21:30

🤗 Upvotes: 49 | cs.CL, cs.AI Authors: Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang Title: Distill...

QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization

QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization

Episode 800 · · 22:56

🤗 Upvotes: 39 | cs.CL Authors: Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Yingcheng Shi, Yuning Wu, Gang Fu, Z...

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Episode 799 · · 22:03

🤗 Upvotes: 38 | cs.AI Authors: Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin ...

Scaling Image and Video Generation via Test-Time Evolutionary Search

Scaling Image and Video Generation via Test-Time Evolutionary Search

Episode 798 · · 24:37

🤗 Upvotes: 33 | cs.CV, cs.AI, cs.LG Authors: Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan Title: ...

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Episode 797 · · 19:36

🤗 Upvotes: 25 | cs.CL, cs.AI, cs.CE Authors: Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, ...

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Episode 796 · · 20:49

🤗 Upvotes: 86 | cs.AI, cs.CL, cs.CV Authors: NovelSeek Team, Bo Zhang, Shiyang Feng, Xiangchao Yan, Jiakang Yuan, Zhiyin Yu, Xiaohan He, Songtao...

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Episode 795 · · 23:44

🤗 Upvotes: 49 | cs.CL, cs.AI Authors: Tingchen Fu, Jiawei Gu, Yafu Li, Xiaoye Qu, Yu Cheng Title: Scaling Reasoning, Lo...

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Episode 794 · · 21:53

🤗 Upvotes: 43 | cs.CL, cs.AI, cs.LG Authors: Guanting Dong, Yifei Chen, Xiaoxi Li, Jiajie Jin, Hongjin Qian, Yutao Zhu, Hangyu Mao, Guorui Zhou,...

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Episode 793 · · 17:51

🤗 Upvotes: 37 | cs.CV, cs.AI, cs.CL Authors: Alex Su, Haozhe Wang, Weimin Ren, Fangzhen Lin, Wenhu Chen Title: Pixel Re...

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Episode 792 · · 20:54

🤗 Upvotes: 36 | cs.CV Authors: Yongliang Wu, Zonghui Li, Xinting Hu, Xinyu Ye, Xianfang Zeng, Gang Yu, Wenbo Zhu, Bernt Schiele, Ming-Hsuan Yang...

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Episode 791 · · 19:40

🤗 Upvotes: 31 | cs.CV, cs.AI Authors: Benjamin Schneider, Dongfu Jiang, Chao Du, Tianyu Pang, Wenhu Chen Title: QuickVi...

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Episode 790 · · 25:23

🤗 Upvotes: 23 | cs.CV, cs.AI, cs.CL, cs.LG, cs.MM Authors: Chengqi Duan, Rongyao Fang, Yuqing Wang, Kun Wang, Linjiang Huang, Xingyu Zeng, Hongs...

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Episode 789 · · 20:15

🤗 Upvotes: 22 | cs.LG, cs.CL, cs.CV Authors: Zebin You, Shen Nie, Xiaolu Zhang, Jun Hu, Jun Zhou, Zhiwu Lu, Ji-Rong Wen, Chongxuan Li ...

Scaling Diffusion Transformers Efficiently via $μ$P

Scaling Diffusion Transformers Efficiently via $μ$P

Episode 788 · · 22:53

🤗 Upvotes: 21 | cs.LG, cs.AI, cs.CV Authors: Chenyu Zheng, Xinyu Zhang, Rongzhen Wang, Wei Huang, Zhi Tian, Weilin Huang, Jun Zhu, Chongxuan Li ...