超要約: LLMエージェント(AI)の動きを良くする新技術!💖
✨ ギャル的キラキラポイント ✨ ● ムダな動きをなくして、賢さ爆上がり!✨ 効率的な動きができるようになるんだって! ● データを使って、AIをどんどん進化させるの!💖 まさに育てるって感じ~! ● AIの答えが、めっちゃ正確になる!😳 使えるAIになるってこと!
詳細解説 • 背景 LLM(大規模言語モデル)ってすごいけど、AIエージェントの動きがちょっとイマイチだったの!😵💫 ツールを使いすぎたり、間違えたり…。それをET-Agent(イーティーエージェント)が解決してくれるみたい!
• 方法 ET-Agentは、AIの動きを良くするために、色んなデータを集めて、AIをトレーニングするんだって!💻 特に「自己進化型データフライホイール」っていうので、AI自身がデータを生み出して、どんどん賢くなるらしい!
続きは「らくらく論文」アプリで
Large Language Models (LLMs) can extend their parameter knowledge limits by adopting the Tool-Integrated Reasoning (TIR) paradigm. However, existing LLM-based agent training framework often focuses on answers' accuracy, overlooking specific alignment for behavior patterns. Consequently, agent often exhibits ineffective actions during TIR tasks, such as redundant and insufficient tool calls. How to calibrate erroneous behavioral patterns when executing TIR tasks, thereby exploring effective trajectories, remains an open-ended problem. In this paper, we propose ET-Agent, a training framework for calibrating agent's tool-use behavior through two synergistic perspectives: Self-evolving Data Flywheel and Behavior Calibration Training. Specifically, we introduce a self-evolutionary data flywheel to generate enhanced data, used to fine-tune LLM to improve its exploration ability. Based on this, we implement an two-phases behavior-calibration training framework. It is designed to progressively calibrate erroneous behavioral patterns to optimal behaviors. Further in-depth experiments confirm the superiority of \ourmodel{} across multiple dimensions, including correctness, efficiency, reasoning conciseness, and tool execution accuracy. Our ET-Agent framework provides practical insights for research in the TIR field. Codes can be found in https://github.com/asilverlight/ET-Agent