iconLogo
Published:2025/12/17 5:11:58

CogER爆誕!LLMを賢く高速化🚀

超要約: LLM(大規模言語モデル)を、頭脳プレイで賢く&コスパ良く使えるようにした研究だよ!

✨ ギャル的キラキラポイント ✨ ● 質問のレベルに合わせて、頭の使い方を自動で変える賢さが神👏 ● 計算コストを抑えつつ、難しい質問にもバッチリ対応できるのがエモい💖 ● 医療とか金融とか、色んな分野で役立つ未来感がアツい🔥

詳細解説 ● 背景 LLMってすごいけど、使うとお金かかるし、時間もかかるじゃん?😩それらを解決するために、質問の難易度(レベル)に応じて、頭の使い方を柔軟に変える方法を研究したんだって!人間の頭脳みたいでワクワクするよね✨

● 方法 質問がきたら、まずレベルをチェック!🤔レベルに合わせて、単純な答え方から、色んなツールを駆使する方法まで、最適なやり方を自動で選ぶシステムを作ったんだって。まるでゲームみたい🎮強化学習(AIが自分で学ぶ方法)を使って、一番良い方法を学習させてるらしい!

続きは「らくらく論文」アプリで

Beyond Fast and Slow: Cognitive-Inspired Elastic Reasoning for Large Language Models

Jinwu Hu / Dongjin Yang / Langyu Bian / Zhiquan Wen / Yufeng Wang / Yaofo Chen / Bin Xiao / Yuanqing Li / Mingkui Tan

Large language models (LLMs) have demonstrated impressive performance across various language tasks. However, existing LLM reasoning strategies mainly rely on the LLM itself with fast or slow mode (like o1 thinking) and thus struggle to balance reasoning efficiency and accuracy across queries of varying difficulties. In this paper, we propose Cognitive-Inspired Elastic Reasoning (CogER), a framework inspired by human hierarchical reasoning that dynamically selects the most suitable reasoning strategy for each query. Specifically, CogER first assesses the complexity of incoming queries and assigns them to one of several predefined levels, each corresponding to a tailored processing strategy, thereby addressing the challenge of unobservable query difficulty. To achieve automatic strategy selection, we model the process as a Markov Decision Process and train a CogER-Agent using reinforcement learning. The agent is guided by a reward function that balances solution quality and computational cost, ensuring resource-efficient reasoning. Moreover, for queries requiring external tools, we introduce Cognitive Tool-Assisted Reasoning, which enables the LLM to autonomously invoke external tools within its chain-of-thought. Extensive experiments demonstrate that CogER outperforms state-of-the-art Test-Time scaling methods, achieving at least a 13% relative improvement in average exact match on In-Domain tasks and an 8% relative gain on Out-of-Domain tasks.

cs / cs.AI