🌟 Gyaru-style sparkle points ✨
● Squeezes the LLM's (AI's) reasoning down tight! Trims the waste and makes it smarter 😉
● Reinforcement learning (a way for AI to improve itself) handles the compression automatically — amazing 😳
● Cutting compute costs could make LLMs way more accessible! ✨
Detailed explanation
● Background: LLMs are smart, but their "thinking" (reasoning) gets cluttered and slow, which was a real headache 😥 ConMax is a breakthrough technique that tackles exactly this problem!
● Method: It uses reinforcement learning to compress the LLM's reasoning efficiently, keeping the logical connections intact while cutting only the redundant parts ✨
● Results: Incredibly, it shrinks the reasoning length by nearly half while keeping accuracy almost unchanged 💖 That's huge!
● Significance (the killer point ♡): Lower compute costs could put LLMs in far more people's hands! A future where AI shines across all kinds of services sounds exciting 🥰
Real-world use-case ideas 💡
● Chatbots get blazing fast! Smoother conversations — total win ✨
● AI assistants get even smarter! They might handle all sorts of tasks 🤩
Read the rest in the「らくらく論文」app
Recent breakthroughs in Large Reasoning Models (LRMs) have demonstrated that extensive Chain-of-Thought (CoT) generation is critical for enabling intricate cognitive behaviors, such as self-verification and backtracking, to solve complex tasks. However, this capability often leads to "overthinking", where models generate redundant reasoning paths that inflate computational costs without improving accuracy. While Supervised Fine-Tuning (SFT) on reasoning traces is a standard paradigm for the "cold start" phase, applying existing compression techniques to these traces often compromises logical coherence or incurs prohibitive sampling costs. In this paper, we introduce ConMax (Confidence-Maximizing Compression), a novel reinforcement learning framework designed to automatically compress reasoning traces while preserving essential reasoning patterns. ConMax formulates compression as a reward-driven optimization problem, training a policy to prune redundancy by maximizing a weighted combination of answer confidence, for predictive fidelity, and thinking confidence, for reasoning validity, computed by a frozen auxiliary LRM. Extensive experiments across five reasoning datasets demonstrate that ConMax achieves a superior efficiency-performance trade-off. Specifically, it reduces inference length by 43% over strong baselines at the cost of a mere 0.7% dip in accuracy, proving its effectiveness in generating high-quality, efficient training data for LRMs.
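The reward described in the abstract — a weighted combination of answer confidence and thinking confidence, both scored by a frozen auxiliary LRM — can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `conmax_reward`, the weight `alpha`, and the use of a geometric-mean token probability as the "confidence" are all assumptions made here for clarity.

```python
import math

def conmax_reward(answer_logprobs, thinking_logprobs, alpha=0.5):
    """Illustrative ConMax-style reward (names and form assumed).

    answer_logprobs:   per-token log-probabilities the frozen auxiliary
                       LRM assigns to the final answer (predictive fidelity).
    thinking_logprobs: per-token log-probabilities it assigns to the
                       compressed reasoning trace (reasoning validity).
    alpha:             weight balancing the two confidence terms.
    """
    # Turn mean per-token log-prob into a geometric-mean probability,
    # a common length-normalized notion of sequence "confidence".
    answer_conf = math.exp(sum(answer_logprobs) / len(answer_logprobs))
    thinking_conf = math.exp(sum(thinking_logprobs) / len(thinking_logprobs))
    # Weighted combination: the RL policy is trained to maximize this.
    return alpha * answer_conf + (1 - alpha) * thinking_conf
```

Under this reading, a compression policy that deletes tokens is rewarded only insofar as the frozen LRM remains confident both in the pruned trace itself and in the answer it leads to, which is what lets ConMax cut length without breaking logical coherence.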