· 17:59
🤗 Upvotes: 28 | cs.CL, cs.AI
Authors:
Gongfan Fang, Xinyin Ma, Xinchao Wang
Title:
Thinkless: LLM Learns When to Think
Arxiv:
http://arxiv.org/abs/2505.13379v1
Abstract:
Reasoning Language Models, capable of extended chain-of-thought reasoning, have demonstrated remarkable performance on tasks requiring complex logical inference. However, applying elaborate reasoning for all queries often results in substantial computational inefficiencies, particularly when many problems admit straightforward solutions. This motivates an open question: Can LLMs learn when to think? To answer this, we propose Thinkless, a learnable framework that empowers an LLM to adaptively select between short-form and long-form reasoning, based on both task complexity and the model's ability. Thinkless is trained under a reinforcement learning paradigm and employs two control tokens,
Listen to Daily Paper Cast using one of many popular podcasting apps or directories.