iconLogo
Published:2026/1/2 17:18:01

PRレビュー予測、早期発見で開発効率UP!✨

超要約:AI生成PRのレビュー時間、AIで予測して開発爆速🚀

ギャルのみんな〜!ソフトウェア開発のPR(プルリクエスト)レビューって大変じゃない?😩 この研究は、AIが作ったPRが「レビューに時間かかるやつ」か事前に予測するモデルを作ったって話だよ! これで無駄を省いて、もっと楽しく開発しよー💖

AIが予測!: レビュー前に「やばいPR」を見抜ける!😳 ● 開発効率UP: 時間短縮で、爆速開発!🚀 ● AI活用促進: AI時代の開発を、もっとイケてるものに✨

詳細解説いくよー!

続きは「らくらく論文」アプリで

Early-Stage Prediction of Review Effort in AI-Generated Pull Requests

Dao Sy Duy Minh / Huynh Trung Kiet / Tran Chi Nguyen / Nguyen Lam Phu Quy / Phu Hoa Pham / Nguyen Dinh Ha Duong / Truong Bao Tran

As autonomous AI agents transition from code completion tools to full-fledged teammates capable of opening pull requests (PRs) at scale, software maintainers face a new challenge: not just reviewing code, but managing complex interaction loops with non-human contributors. This paradigm shift raises a critical question: can we predict which agent-generated PRs will consume excessive review effort before any human interaction begins? Analyzing 33,707 agent-authored PRs from the AIDev dataset across 2,807 repositories, we uncover a striking two-regime behavioral pattern that fundamentally distinguishes autonomous agents from human developers. The first regime, representing 28.3 percent of all PRs, consists of instant merges (less than 1 minute), reflecting success on narrow automation tasks. The second regime involves iterative review cycles where agents frequently stall or abandon refinement (ghosting). We propose a Circuit Breaker triage model that predicts high-review-effort PRs (top 20 percent) at creation time using only static structural features. A LightGBM model achieves AUC 0.957 on a temporal split, while semantic text features (TF-IDF, CodeBERT) provide negligible predictive value. At a 20 percent review budget, the model intercepts 69 percent of total review effort, enabling zero-latency governance. Our findings challenge prevailing assumptions in AI-assisted code review: review burden is dictated by what agents touch, not what they say, highlighting the need for structural governance mechanisms in human-AI collaboration.

cs / cs.SE