iconLogo
Published:2026/1/7 1:56:44

タイトル & 超要約:コード翻訳を爆速&高品質に!

ギャル的キラキラポイント✨:

● テストケース(プログラムのテスト用データ)を賢く活用!✨ 少ないデータでもOKに。 ● 言語ごとに学習の仕方を調整!最適化の偏りをなくすテクニック♪ ● レガシーコード(古いプログラム)もラクラク最新言語に!開発爆速🚀

詳細解説:

背景: コード翻訳(CT)って、プログラム言語を別の言語に変換すること💻✨。でも、データ不足とか、言語ペア間の学習の偏りとか、色々難しい問題があったの!😭

続きは「らくらく論文」アプリで

Bootstrapping Code Translation with Weighted Multilanguage Exploration

Yuhan Wu / Huan Zhang / Wei Cheng / Chen Shen / Jingyue Yang / Wei Hu

Code translation across multiple programming languages is essential yet challenging due to two vital obstacles: scarcity of parallel data paired with executable test oracles, and optimization imbalance when handling diverse language pairs. We propose BootTrans, a bootstrapping method that resolves both obstacles. Its key idea is to leverage the functional invariance and cross-lingual portability of test suites, adapting abundant pivot-language unit tests to serve as universal verification oracles for multilingual RL training. Our method introduces a dual-pool architecture with seed and exploration pools to progressively expand training data via execution-guided experience collection. Furthermore, we design a language-aware weighting mechanism that dynamically prioritizes harder translation directions based on relative performance across sibling languages, mitigating optimization imbalance. Extensive experiments on the HumanEval-X and TransCoder-Test benchmarks demonstrate substantial improvements over baseline LLMs across all translation directions, with ablations validating the effectiveness of both bootstrapping and weighting components.

cs / cs.SE / cs.AI