LLM転移の秘密💖 パラメータ効率チューニングで解明！

Published：2026/1/8 15:10:27

LLM転移の秘密💖 パラメータ効率チューニングで解明！

超要約： LLM（大規模言語モデル）って、タスク（お仕事）や言語が変わると、どう影響するの？LoRAで効率よく実験して、その秘密を解き明かしたよ！

✨ ギャル的キラキラポイント ✨ ● LoRA（ローランク適応）っていう、賢くてコスパ良いやり方で実験してるのがスゴくない？✨ パラメータ（モデルの部品）を全部変えなくても、良い感じにチューニング（調整）できるんだって！ ● 色んなタスクと言語で実験してるから、色んなサービスに応用できそうじゃん？🤔翻訳とかチャットボットとか、夢広がる～！ ● 「ドナー」と「レシーバ」って言葉が面白い！🎁 誰が誰に影響を与えてるのかが分かると、もっと賢くLLMを使えるようになるってこと！

詳細解説いくよ～！

背景 LLMは、色んな事ができるスーパーモデル💖 でも、タスク（翻訳とか質問応答とか）や言語（日本語とか英語とか）によって、得意不得意があるのよね。既存の研究では、タスク間や言語間で、良い影響（転移）と悪い影響（負の転移）があるって言われてたけど、どんな組み合わせが良いのか、詳しくは分かってなかったの。そこで、LoRAっていう、コスパ良くチューニングできる方法を使って、もっと詳しく調べようってなったわけ！

続きは「らくらく論文」アプリで

Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning

Kajetan Dymkiewicz / Ivan Vulic / Helen Yannakoudakis / Eilam Shapira / Roi Reichart / Anna Korhonen

Large language models (LLMs) perform strongly across tasks and languages, yet how improvements in one task or language affect other tasks and languages remains poorly understood. We conduct a controlled LoRA fine-tuning study across multiple open-weight LLM families and scales, using a standardised grid of 11 languages and four benchmarks. We fine-tune each model on a single task-language source and measure transfer when evaluated on all other task-language target pairs. We decompose transfer into three regimes: (i) Matched-Task (Cross-Language), (ii) Matched-Language (Cross-Task), and (iii) Cross-Task (Cross-Language). Single-source fine-tuning yields a net positive uplift across regimes, but the gains are strongly asymmetric. Matched-Task (Cross-Language) transfer emerges as the most effective and predictable regime, driven principally by the identity of the target language rather than model architecture. We identify a stable hierarchy where high-resource languages and broad semantic tasks act as efficient recipients that absorb gains from diverse sources, while specialised tasks and lower-resource languages are more isolated. These results imply that effective fine-tuning requires navigating donor-recipient roles to maximise downstream gains.

cs / cs.CL / cs.AI

Arxivで見る