iconLogo
Published:2026/1/11 10:06:14

CHASE爆誕!悪意PKG(パッケージ)をLLMで解析✨

  1. 超要約: 悪意PKGをLLMで高速🔍&高精度で発見!セキュリティ爆上げだね!

  2. ギャル的キラキラポイント✨

    • ● LLM(大規模言語モデル)を使って、賢く悪意のあるやつを見つけるんだって!✨
    • ● 多段階攻撃(色んな手口で来るやつ)にも対応できるのがスゴすぎ💖
    • ● 誤検出率が低いから、安心して使えるのがイイね👍
  3. 詳細解説

    • 背景: ソフトウェアの世界🌏、便利だけど危険がいっぱい!悪意のあるパッケージ(PKG)が紛れ込んで、色んなとこに被害が😱
    • 方法: LLMを使って、怪しいPKGをスピーディーに見つけるシステム「CHASE」を開発!色んな専門家チームで分析するイメージ✨
    • 結果: 検出精度98.4%!誤検出も少ないから、マジ優秀👏 パッケージ分析もあっという間!
    • 意義: IT業界のセキュリティが格段にUP!安全なソフトウェア開発ができるようになるってこと💖ビジネスチャンスも広がる予感🎵
  4. リアルでの使いみちアイデア💡

    • 会社のセキュリティ対策に導入!安全なソフト開発で、みんなを守ろう💖
    • 自分のPCのセキュリティソフトに搭載!怪しいPKGから、あなたを守ってくれるかも✨

続きは「らくらく論文」アプリで

CHASE: LLM Agents for Dissecting Malicious PyPI Packages

Takaaki Toda / Tatsuya Mori

Modern software package registries like PyPI have become critical infrastructure for software development, but are increasingly exploited by threat actors distributing malicious packages with sophisticated multi-stage attack chains. While Large Language Models (LLMs) offer promising capabilities for automated code analysis, their application to security-critical malware detection faces fundamental challenges, including hallucination and context confusion, which can lead to missed detections or false alarms. We present CHASE (Collaborative Hierarchical Agents for Security Exploration), a high-reliability multi-agent architecture that addresses these limitations through a Plan-and-Execute coordination model, specialized Worker Agents focused on specific analysis aspects, and integration with deterministic security tools for critical operations. Our key insight is that reliability in LLM-based security analysis emerges not from improving individual model capabilities but from architecting systems that compensate for LLM weaknesses while leveraging their semantic understanding strengths. Evaluation on a dataset of 3,000 packages (500 malicious, 2,500 benign) demonstrates that CHASE achieves 98.4% recall with only 0.08% false positive rate, while maintaining a practical median analysis time of 4.5 minutes per package, making it suitable for operational deployment in automated package screening. Furthermore, we conducted a survey with cybersecurity professionals to evaluate the generated analysis reports, identifying their key strengths and areas for improvement. This work provides a blueprint for building reliable AI-powered security tools that can scale with the growing complexity of modern software supply chains. Our project page is available at https://t0d4.github.io/CHASE-AIware25/

cs / cs.CR / cs.SE