TL;DR: Optimizing where edge AI's brains 🧠 get placed! It cuts latency so everything runs smooth and snappy ✨
✨ Sparkly Highlights ✨
● An LLM (large language model) smartly learns the placement rules 📖
● Deep reinforcement learning automatically keeps hunting for the best placement 🔍
● Magic that makes edge AI apps faster and smarter 🪄
Detailed Explanation!
● Background
The world of edge AI is like a city 🌃 IoT devices keep multiplying, and the question of where to put the AI brains 🧠 becomes a real problem!
Place them carelessly and communication lags, or the processing just can't keep up 💦
● Method
AgentVNE uses an LLM to understand all kinds of rules 💡
It can judge "this app belongs in this spot!" and uses deep reinforcement learning to learn the coolest placement ✨ It's like a career-counseling office for AI 🎓
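To make the placement problem above concrete, here is a toy sketch. This is not AgentVNE's actual algorithm (the paper uses an LLM plus a learned DRL policy); the node names, capacities, and latencies are invented example values. It just shows the bare constraint structure: placing a chain-like workflow step by step, respecting capacity while keeping inter-step communication latency low.

```python
# Toy illustration only (NOT the paper's method): greedily place each step
# of a chain-like agent workflow on the edge node that minimizes latency
# to the node hosting the previous step, subject to a CPU-capacity check.
# All names and numbers here are made-up examples.

def place_chain(workflow, nodes, latency):
    """workflow: list of (step_name, cpu_demand); nodes: {node: cpu_capacity};
    latency: {(a, b): ms}. Returns (placement dict, total inter-step latency)."""
    capacity = dict(nodes)          # remaining capacity per node
    placement = {}
    prev = None
    total = 0.0
    for step, demand in workflow:
        feasible = [n for n, c in capacity.items() if c >= demand]
        if not feasible:
            raise RuntimeError(f"no node can host {step}")
        if prev is None:
            # first step: pick the node with the most headroom
            best = max(feasible, key=lambda n: capacity[n])
        else:
            # later steps: minimize latency to the previous step's host
            best = min(feasible, key=lambda n: latency.get((prev, n), 0.0))
            total += latency.get((prev, best), 0.0)
        capacity[best] -= demand
        placement[step] = best
        prev = best
    return placement, total
```

A learned DRL policy would replace the greedy choice with a trained scoring function over the whole topology, but the constraints it has to respect (capacity plus chain latency) are the same ones this sketch encodes.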
Read the rest in the "らくらく論文" app
The Internet of Agents is propelling edge computing toward agentic AI and edge general intelligence (EGI). However, deploying multi-agent services (MAS) on resource-constrained edge infrastructure presents severe challenges. MAS workflows are driven by complex cross-node interactions, dynamic memory accumulation, and collaborative tool usage. Exhibiting chain-like topological dependencies and strict affinity constraints, these workflows demand real-time responsiveness that exceeds the capabilities of traditional virtual network embedding (VNE) algorithms designed for static resources. To address this, we propose AgentVNE, a cloud-edge collaborative framework built on a dual-layer architecture. First, AgentVNE employs a large language model (LLM) to identify implicit semantic constraints and generate affinity-based resource augmentations that resolve physical dependency issues. Second, it constructs a resource similarity-aware neural network, using a pre-training and PPO fine-tuning strategy to precisely capture topological similarities between dynamic workflows and heterogeneous networks. By coupling semantic perception with topological reasoning, this mechanism effectively bridges the gap between dynamic service requirements and physical infrastructure. Simulation results demonstrate that AgentVNE reduces workflow communication latency to less than 40% of baseline levels and improves the service acceptance rate by roughly 5-10% under high-load scenarios. Ultimately, this work provides a foundational solution for the semantic-aware deployment of agentic AI.
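As a deliberately simplified illustration of what "resource similarity-aware" matching could mean, the sketch below ranks substrate hosts for a virtual node by the cosine similarity of their resource vectors. This is a hand-rolled stand-in, not the paper's neural network (which is pre-trained and then fine-tuned with PPO); the (CPU, memory, bandwidth) vectors and host names are invented examples.

```python
# Hedged illustration: rank candidate substrate nodes for one virtual node
# by cosine similarity between the virtual node's resource demand vector
# and each host's available-resource vector. The paper instead learns this
# matching with a similarity-aware neural network; this is only a sketch.
import math

def cosine(u, v):
    """Cosine similarity between two equal-length resource vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def rank_hosts(demand, substrate):
    """demand: resource vector of the virtual node; substrate: {host: vector}.
    Returns host names sorted by descending similarity to the demand."""
    return sorted(substrate, key=lambda n: cosine(demand, substrate[n]),
                  reverse=True)
```

The intuition is that a host whose free-resource profile is shaped like the demand (rather than merely large enough) leaves the substrate less fragmented, which is one common motivation for similarity-based embedding heuristics.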