STAgent爆誕!空間時間的推論(すいろん)最強のLLMだよ☆ 未来旅行は楽勝ね!
タイトル & 超要約 STAgent:未来都市のナビゲーター!空間時間情報に特化したAI、旅行も街づくりもイケちゃう!🚀
ギャル的キラキラポイント✨ ● 旅行計画が爆速(ばくはや)!行きたい場所も最適なルートも、AIが全部教えてくれるなんて神✨ ● スマートシティ(賢い街)がもっと進化!交通整理とか災害(さいがい)対策とか、未来が楽しみすぎ💖 ● 3000万件以上のデータから学習!まるでカリスマギャルのように、経験値(けいけんち)MAXってことね😎
詳細解説
リアルでの使いみちアイデア💡
続きは「らくらく論文」アプリで
We present STAgent, an agentic large language model tailored for spatio-temporal understanding, designed to solve complex tasks such as constrained point-of-interest discovery and itinerary planning. STAgent is a specialized model capable of interacting with ten distinct tools within spatio-temporal scenarios, enabling it to explore, verify, and refine intermediate steps during complex reasoning. Notably, STAgent effectively preserves its general capabilities. We empower STAgent with these capabilities through three key contributions: (1) a stable tool environment that supports over ten domain-specific tools, enabling asynchronous rollout and training; (2) a hierarchical data curation framework that identifies high-quality data like a needle in a haystack, curating high-quality queries by retaining less than 1\% of the raw data, emphasizing both diversity and difficulty; and (3) a cascaded training recipe that starts with a seed SFT stage acting as a guardian to measure query difficulty, followed by a second SFT stage fine-tuned on queries with high certainty, and an ultimate RL stage that leverages data of low certainty. Initialized with Qwen3-30B-A3B to establish a strong SFT foundation and leverage insights into sample difficulty, STAgent yields promising performance on TravelBench while maintaining its general capabilities across a wide range of general benchmarks, thereby demonstrating the effectiveness of our proposed agentic model.