超要約:未来のタスクをAIが見つけるスゴ技!
💎 ギャル的キラキラポイント✨ ● AIが人間の動きをめっちゃ理解してくれるようになるってコト💖 ● 未来のことも考えた上で、最適なタスクを見つけるのが天才的😳 ● これからの生活がもっと便利になる予感しかしない🎶
詳細解説いくよー!
背景 最近のAI(人工知能)はすごい進化してるけど、ロボットとかAIアシスタントは、まだ特定の目的とか決まった場所での仕事しか得意じゃなかったの💦 でも人間って、色んなこと同時にやるし、未来のことなんて誰にも分かんないじゃん? だから、AIにもっと人間っぽく、色んな状況に対応できるようになってほしいって研究なの!
続きは「らくらく論文」アプリで
Recent progress in robotics and embodied AI is largely driven by Large Multimodal Models (LMMs). However, a key challenge remains underexplored: how can we advance LMMs to discover tasks that assist humans in open-future scenarios, where human intentions are highly concurrent and dynamic. In this work, we formalize the problem of Human-centric Open-future Task Discovery (HOTD), focusing particularly on identifying tasks that reduce human effort across plausible futures. To facilitate this study, we propose HOTD-Bench, which features over 2K real-world videos, a semi-automated annotation pipeline, and a simulation-based protocol tailored for open-set future evaluation. Additionally, we propose the Collaborative Multi-Agent Search Tree (CMAST) framework, which decomposes complex reasoning through a multi-agent system and structures the reasoning process through a scalable search tree module. In our experiments, CMAST achieves the best performance on the HOTD-Bench, significantly surpassing existing LMMs. It also integrates well with existing LMMs, consistently improving performance.