バグ報告、秒で解決！AI爆誕🎉（TriagerX）

Published：2025/8/23 1:10:42

バグ報告、秒で解決！AI爆誕🎉（TriagerX）

超要約：AIがバグの担当者を秒速で見つけるよ！開発も楽々～✨

✨ ギャル的キラキラポイント ✨ ● AIがバグ報告書(バグの報告書のことね！)の内容をしっかり理解してくれる！ ● 開発者の過去の活躍（コミットとか）も考慮して、最強の担当者をレコメンド！ ● IBMでも使われてるって！めっちゃすごいやん？😳

詳細解説いくよー！ ● 背景ソフトウェア開発（ソフトを作るお仕事）って、バグ（プログラムのミス）との戦い！バグを見つけたら、誰が直すか決める「バグトリアージ」って作業が大変なの😭 でも、このAI「TriagerX」を使えば、その手間が大幅に減るってワケ💖

● 方法 2つのAIを合体させた感じ！1つはバグ報告書の内容をめっちゃ詳しく解析👀 もう1つは、開発者の過去の行動をチェック✔ 開発者たちの貢献度とか、得意な分野とかを考慮して、一番ピッタリな人を見つけるんだって！

続きは「らくらく論文」アプリで

TriagerX: Dual Transformers for Bug Triaging Tasks with Content and Interaction Based Rankings

Md Afif Al Mamun / Gias Uddin / Lan Xia / Longyu Zhang

Pretrained Language Models or PLMs are transformer-based architectures that can be used in bug triaging tasks. PLMs can better capture token semantics than traditional Machine Learning (ML) models that rely on statistical features (e.g., TF-IDF, bag of words). However, PLMs may still attend to less relevant tokens in a bug report, which can impact their effectiveness. In addition, the model can be sub-optimal with its recommendations when the interaction history of developers around similar bugs is not taken into account. We designed TriagerX to address these limitations. First, to assess token semantics more reliably, we leverage a dual-transformer architecture. Unlike current state-of-the-art (SOTA) baselines that employ a single transformer architecture, TriagerX collects recommendations from two transformers with each offering recommendations via its last three layers. This setup generates a robust content-based ranking of candidate developers. TriagerX then refines this ranking by employing a novel interaction-based ranking methodology, which considers developers' historical interactions with similar fixed bugs. Across five datasets, TriagerX surpasses all nine transformer-based methods, including SOTA baselines, often improving Top-1 and Top-3 developer recommendation accuracy by over 10%. We worked with our large industry partner to successfully deploy TriagerX in their development environment. The partner required both developer and component recommendations, with components acting as proxies for team assignments-particularly useful in cases of developer turnover or team changes. We trained TriagerX on the partner's dataset for both tasks, and it outperformed SOTA baselines by up to 10% for component recommendations and 54% for developer recommendations.

cs / cs.SE / cs.AI / cs.LG

Arxivで見る