iconLogo
Published:2026/1/11 3:53:10

長尾ICDコーディングをギャルが爆上げ💖

超要約: 医療のコード付けをAIで精度UP⤴️✨

✨ギャル的キラキラポイント✨ ● 稀(まれ)な病気のコードもバッチリ当てるよ! ● AIがコードの意味をめっちゃ詳しく説明してくれる! ● 医療の未来を明るくする、革命的な技術なの🌟

詳細解説 背景 医療の世界で、カルテとかに病気のコード(ICDコード)を付けるのって大変じゃん? それをAIが自動でやってくれたら、めっちゃ便利だよね! でも、珍しい病気のコードはデータが少ないから、AIが苦手だったんだ😢

方法 この研究では、確率を使った新しいAI「ProBias」が登場! 病気のコード同士の関係性を確率で表して、AIが賢く学習できるようにしたんだって。 しかも、AIがコードの意味を詳しく説明してくれるから、さらに精度UP✨

続きは「らくらく論文」アプリで

Enhancing Rare Codes via Probability-Biased Directed Graph Attention for Long-Tail ICD Coding

Tianlei Chen / Yuxiao Chen / Yang Li / Feifei Wang

Automated international classification of diseases (ICD) coding aims to assign multiple disease codes to clinical documents and plays a critical role in healthcare informatics. However, its performance is hindered by the extreme long-tail distribution of the ICD ontology, where a few common codes dominate while thousands of rare codes have very few examples. To address this issue, we propose a Probability-Biased Directed Graph Attention model (ProBias) that partitions codes into common and rare sets and allows information to flow only from common to rare codes. Edge weights are determined by conditional co-occurrence probabilities, which guide the attention mechanism to enrich rare-code representations with clinically related signals. To provide higher-quality semantic representations as model inputs, we further employ large language models to generate enriched textual descriptions for ICD codes, offering external clinical context that complements statistical co-occurrence signals. Applied to automated ICD coding, our approach significantly improves the representation and prediction of rare codes, achieving state-of-the-art performance on three benchmark datasets. In particular, we observe substantial gains in macro-averaged F1 score, a key metric for long-tail classification.

cs / cs.LG / cs.AI / cs.CL