超要約:検索AIの検索力を爆上げするテク!
ギャル的キラキラポイント✨ ● 中間クエリ(検索の途中に出る検索ワード)をめっちゃイケてる状態にする! ● プロセス報酬(クエリの出来を評価するシステム)で、クオリティ爆上がり⤴ ● クエリリファインメント(悪いクエリを修正)で、検索が超スムーズに💖
詳細解説 ● 背景 最近のAI(LLM)は検索が得意だけど、途中の検索ワード(クエリ)がイマイチだったの😭 そこで、もっと良いクエリを作って、検索の精度を上げようって研究なの!
● 方法 SmartSearchってフレームワークを作ったよ! プロセス報酬でクエリの良し悪しをチェックして、クエリリファインメントで悪いとこを直すんだって!✨
続きは「らくらく論文」アプリで
Large language model (LLM)-based search agents have proven promising for addressing knowledge-intensive problems by incorporating information retrieval capabilities. Existing works largely focus on optimizing the reasoning paradigms of search agents, yet the quality of intermediate search queries during reasoning remains overlooked. As a result, the generated queries often remain inaccurate, leading to unexpected retrieval results and ultimately limiting search agents' overall effectiveness. To mitigate this issue, we introduce SmartSearch, a framework built upon two key mechanisms: (1) Process rewards, which provide fine-grained supervision for the quality of each intermediate search query through Dual-Level Credit Assessment. (2) Query refinement, which promotes the optimization of query generation by selectively refining low-quality search queries and regenerating subsequent search rounds based on these refinements. To enable the search agent to progressively internalize the ability to improve query quality under the guidance of process rewards, we design a three-stage curriculum learning framework. This framework guides the agent through a progression from imitation, to alignment, and ultimately to generalization. Experimental results show that SmartSearch consistently surpasses existing baselines, and additional quantitative analyses further confirm its significant gains in both search efficiency and query quality. The code is available at https://github.com/MYVAE/SmartSearch.