kNN-MoEでLLM爆進化！ルーティング問題に終止符！

Published：2026/1/5 14:16:11

kNN-MoEでLLM爆進化！ルーティング問題に終止符！🎉

超要約：kNN-MoEで、LLM（大規模言語モデル）の頭脳🧠をさらに賢くする研究だよ！

✨ ギャル的キラキラポイント ✨ ● 過去の正解を覚えてるから、新しい問題にも対応できるってコト💖 ● 専門家（Expert）への道案内が、めっちゃ柔軟になったの！ ● いろんなサービスで、もっと賢いAIが使えるようになるかも⁉

詳細解説 ● 背景 LLM（大規模言語モデル）って、TransformerとMoE（Mixture-of-Experts）の組み合わせで、スゴイ性能を発揮してるんだけど、ルーティング（道案内）がちょっと硬かったんだよね💦 専門知識が必要な問題とか、見たことないデータには弱かったんだ。

● 方法 kNN-MoEは、過去の正解を「記憶」して、似た問題が来たらそれを参考にルーティングするの！🔍 過去のデータからヒントをもらって、柔軟に対応できるようになったってワケ😎

続きは「らくらく論文」アプリで

Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts

Boxuan Lyu / Soichiro Murakami / Hidetaka Kamigaito / Peinan Zhang

Mixture-of-Experts (MoE) architectures scale large language models efficiently by employing a parametric "router" to dispatch tokens to a sparse subset of experts. Typically, this router is trained once and then frozen, rendering routing decisions brittle under distribution shifts. We address this limitation by introducing kNN-MoE, a retrieval-augmented routing framework that reuses optimal expert assignments from a memory of similar past cases. This memory is constructed offline by directly optimizing token-wise routing logits to maximize the likelihood on a reference set. Crucially, we use the aggregate similarity of retrieved neighbors as a confidence-driven mixing coefficient, thus allowing the method to fall back to the frozen router when no relevant cases are found. Experiments show kNN-MoE outperforms zero-shot baselines and rivals computationally expensive supervised fine-tuning.

cs / cs.CL / cs.AI

Arxivで見る