iconLogo
Published:2025/12/24 21:36:34

ロボット、賢くする裏技🤖✨ EVEフレームワーク爆誕!

超要約: ロボットの頭脳をバージョンアップ!環境変化に強くなる魔法の技術だよ🪄

🌟 ギャル的キラキラポイント✨ ● ロボットが急に賢くなる!「EVE」っていう、新しい仕組みがすごい🌟 ● 難しいことなし!既存の技術を組み合わせて、ロボットを強くするよ💖 ● データ集めもラクラク!追加学習なしで、すぐに使えるのがウレシイ🎵

詳細解説いくよ~!

背景 ロボットの世界🤖、最近は賢くなってきたけど、ちょっとした環境の変化で「あれ?」ってなることがあったの。例えば、テーブルの高さが変わったりとかね! 追加で学習させなきゃいけないから、時間もお金もかかっちゃうのが悩みだったんだよね😭

続きは「らくらく論文」アプリで

EVE: A Generator-Verifier System for Generative Policies

Yusuf Ali / Gryphon Patlin / Karthik Kothuri / Muhammad Zubair Irshad / Wuwei Liang / Zsolt Kira

Visuomotor policies based on generative architectures such as diffusion and flow-based matching have shown strong performance but degrade under distribution shifts, demonstrating limited recovery capabilities without costly finetuning. In the language modeling domain, test-time compute scaling has revolutionized reasoning capabilities of modern LLMs by leveraging additional inference-time compute for candidate solution refinement. These methods typically leverage foundation models as verification modules in a zero-shot manner to synthesize improved candidate solutions. In this work, we hypothesize that generative policies can similarly benefit from additional inference-time compute that employs zero-shot VLM-based verifiers. A systematic analysis of improving policy performance through the generation-verification framework remains relatively underexplored in the current literature. To this end, we introduce EVE - a modular, generator-verifier interaction framework - that boosts the performance of pretrained generative policies at test time, with no additional training. EVE wraps a frozen base policy with multiple zero-shot, VLM-based verifier agents. Each verifier proposes action refinements to the base policy candidate actions, while an action incorporator fuses the aggregated verifier output into the base policy action prediction to produce the final executed action. We study design choices for generator-verifier information interfacing across a system of verifiers with distinct capabilities. Across a diverse suite of manipulation tasks, EVE consistently improves task success rates without any additional policy training. Through extensive ablations, we isolate the contribution of verifier capabilities and action incorporator strategies, offering practical guidelines to build scalable, modular generator-verifier systems for embodied control.

cs / cs.RO