CHASE爆誕！悪意PKG(パッケージ)をLLMで解析✨

Published：2026/1/11 10:06:14

CHASE爆誕！悪意PKG(パッケージ)をLLMで解析✨

超要約: 悪意PKGをLLMで高速🔍&高精度で発見！セキュリティ爆上げだね！
ギャル的キラキラポイント✨
- ● LLM(大規模言語モデル)を使って、賢く悪意のあるやつを見つけるんだって！✨
- ● 多段階攻撃（色んな手口で来るやつ）にも対応できるのがスゴすぎ💖
- ● 誤検出率が低いから、安心して使えるのがイイね👍
詳細解説
- 背景: ソフトウェアの世界🌏、便利だけど危険がいっぱい！悪意のあるパッケージ（PKG）が紛れ込んで、色んなとこに被害が😱
- 方法: LLMを使って、怪しいPKGをスピーディーに見つけるシステム「CHASE」を開発！色んな専門家チームで分析するイメージ✨
- 結果: 検出精度98.4%！誤検出も少ないから、マジ優秀👏 パッケージ分析もあっという間！
- 意義: IT業界のセキュリティが格段にUP！安全なソフトウェア開発ができるようになるってこと💖ビジネスチャンスも広がる予感🎵
リアルでの使いみちアイデア💡
- 会社のセキュリティ対策に導入！安全なソフト開発で、みんなを守ろう💖
- 自分のPCのセキュリティソフトに搭載！怪しいPKGから、あなたを守ってくれるかも✨

続きは「らくらく論文」アプリで

CHASE: LLM Agents for Dissecting Malicious PyPI Packages

Takaaki Toda / Tatsuya Mori

Modern software package registries like PyPI have become critical infrastructure for software development, but are increasingly exploited by threat actors distributing malicious packages with sophisticated multi-stage attack chains. While Large Language Models (LLMs) offer promising capabilities for automated code analysis, their application to security-critical malware detection faces fundamental challenges, including hallucination and context confusion, which can lead to missed detections or false alarms. We present CHASE (Collaborative Hierarchical Agents for Security Exploration), a high-reliability multi-agent architecture that addresses these limitations through a Plan-and-Execute coordination model, specialized Worker Agents focused on specific analysis aspects, and integration with deterministic security tools for critical operations. Our key insight is that reliability in LLM-based security analysis emerges not from improving individual model capabilities but from architecting systems that compensate for LLM weaknesses while leveraging their semantic understanding strengths. Evaluation on a dataset of 3,000 packages (500 malicious, 2,500 benign) demonstrates that CHASE achieves 98.4% recall with only 0.08% false positive rate, while maintaining a practical median analysis time of 4.5 minutes per package, making it suitable for operational deployment in automated package screening. Furthermore, we conducted a survey with cybersecurity professionals to evaluate the generated analysis reports, identifying their key strengths and areas for improvement. This work provides a blueprint for building reliable AI-powered security tools that can scale with the growing complexity of modern software supply chains. Our project page is available at https://t0d4.github.io/CHASE-AIware25/

cs / cs.CR / cs.SE

Arxivで見る