Episodes

Latest Episode
ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Episode 1091 · · 21:27

🤗 Upvotes: 34 | cs.RO, cs.CV Authors: Kaijun Wang, Liqin Lu, Mingyu Liu, Jianuo Jiang, Zeju Li, Bolin Zhang, Wancai Zheng, Xinyi Yu, Hao Chen, C...

Intern-S1: A Scientific Multimodal Foundation Model

Intern-S1: A Scientific Multimodal Foundation Model

Episode 1090 · · 19:26

🤗 Upvotes: 166 | cs.LG, cs.CL, cs.CV Authors: Lei Bai, Zhongrui Cai, Maosong Cao, Weihan Cao, Chiyu Chen, Haojiong Chen, Kai Chen, Pengcheng Che...

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Episode 1089 · · 25:02

🤗 Upvotes: 40 | cs.AI Authors: Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zheng...

Deep Think with Confidence

Deep Think with Confidence

Episode 1088 · · 20:40

🤗 Upvotes: 26 | cs.LG Authors: Yichao Fu, Xuewei Wang, Yuandong Tian, Jiawei Zhao Title: Deep Think with Confidence ...

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Episode 1087 · · 23:48

🤗 Upvotes: 26 | cs.CL, cs.AI Authors: Ming Yin, Dinghan Shen, Silei Xu, Jianbing Han, Sixun Dong, Mian Zhang, Yebowen Hu, Shujian Liu, Simin Ma,...

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Episode 1086 · · 22:59

🤗 Upvotes: 57 | cs.LG, cs.CL Authors: Shuaijie She, Yu Bao, Yu Lu, Lu Xu, Tao Li, Wenhao Zhu, Shujian Huang, Shanbo Cheng, Lu Lu, Yuxuan Wang ...

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

Episode 1085 · · 23:15

🤗 Upvotes: 53 | cs.CE Authors: Ziyan Kuang, Feiyu Zhu, Maowei Jiang, Yanzhao Lai, Zelin Wang, Zhitong Wang, Meikang Qiu, Jiajia Huang, Min Peng,...

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Episode 1084 · · 22:01

🤗 Upvotes: 47 | cs.AI, cs.LG Authors: Zhiyuan Zeng, Jiashuo Liu, Siyuan Chen, Tianci He, Yali Liao, Jinpeng Wang, Zaiyuan Wang, Yang Yang, Lingy...

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Episode 1083 · · 22:06

🤗 Upvotes: 29 | cs.GR, cs.CV Authors: Bingquan Dai, Li Ray Luo, Qihong Tang, Jie Wang, Xinyu Lian, Hao Xu, Minghan Qin, Xudong Xu, Bo Dai, Haoqi...

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

Episode 1082 · · 20:53

🤗 Upvotes: 26 | cs.CV Authors: Canyu Zhao, Xiaoman Li, Tianjian Feng, Zhiyue Zhao, Hao Chen, Chunhua Shen Title: Tinker...

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Episode 1081 · · 22:08

🤗 Upvotes: 68 | cs.AI, cs.CL Authors: Weizhen Li, Jianbo Lin, Zhuosong Jiang, Jingyi Cao, Xinpeng Liu, Jiayu Zhang, Zhenqiang Huang, Qianben Che...

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Episode 1080 · · 21:20

🤗 Upvotes: 41 | cs.CV Authors: Chin-Yang Lin, Cheng Sun, Fu-En Yang, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu Title: LongS...

Prompt Orchestration Markup Language

Prompt Orchestration Markup Language

Episode 1079 · · 23:11

🤗 Upvotes: 24 | cs.HC, cs.AI, cs.CL, cs.PL Authors: Yuge Zhang, Nan Chen, Jiahang Xu, Yuqing Yang Title: Prompt Orchest...

Ovis2.5 Technical Report

Ovis2.5 Technical Report

Episode 1078 · · 23:09

🤗 Upvotes: 79 | cs.CV, cs.AI, cs.CL, cs.LG Authors: Shiyin Lu, Yang Li, Yu Xia, Yuwei Hu, Shanshan Zhao, Yanqing Ma, Zhichao Wei, Yinglun Li, Lu...

ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning

ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning

Episode 1077 · · 21:38

🤗 Upvotes: 47 | cs.CL, cs.AI, cs.LG Authors: Juyuan Wang, Rongchen Zhao, Wei Wei, Yufeng Wang, Mo Yu, Jie Zhou, Jin Xu, Liyan Xu Ti...

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Episode 1076 · · 23:37

🤗 Upvotes: 44 | cs.CV Authors: Zhaoxi Chen, Tianqi Liu, Long Zhuo, Jiawei Ren, Zeng Tao, He Zhu, Fangzhou Hong, Liang Pan, Ziwei Liu ...

Next Visual Granularity Generation

Next Visual Granularity Generation

Episode 1075 · · 22:45

🤗 Upvotes: 37 | cs.CV, cs.AI, cs.LG Authors: Yikai Wang, Zhouxia Wang, Zhonghua Wu, Qingyi Tao, Kang Liao, Chen Change Loy Title: ...

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Episode 1074 · · 20:55

🤗 Upvotes: 34 | cs.CL, cs.AI, cs.CV Authors: Weigao Sun, Jiaxi Hu, Yucheng Zhou, Jusen Du, Disen Lan, Kexin Wang, Tong Zhu, Xiaoye Qu, Yu Zhang,...

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Episode 1073 · · 22:33

🤗 Upvotes: 32 | cs.CL, cs.AI Authors: Mikhail Seleznyov, Mikhail Chaichuk, Gleb Ershov, Alexander Panchenko, Elena Tutubalina, Oleg Somov ...

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Episode 1072 · · 19:57

🤗 Upvotes: 22 | cs.CV, cs.CL, cs.LG, cs.MM, cs.RO Authors: Zhongang Cai, Yubo Wang, Qingping Sun, Ruisi Wang, Chenyang Gu, Wanqi Yin, Zhiqian Li...