LLMとASPで爆速情報抽出🚀 IT企業向け！

Published：2026/1/8 11:15:52

LLMとASPで爆速情報抽出🚀 IT企業向け！

論文を爆速で要約！ LLMとASPを組み合わせて、テキストから情報を賢く抽出する技術だよ✨

ギャル的キラキラポイント✨

● 大量のデータいらず！少ないデータでOK🙆‍♀️ ● IT企業が困ってること、ぜーんぶ解決しちゃう魔法🧙‍♀️ ● 既存のやり方より、すっごい精度UP⤴️

詳細解説

背景 IT企業って、データがいっぱいあるけど、それをうまく使えてないって悩み、あるよね？この研究は、そんな悩みを解決するために生まれたんだって！大量のテキストデータから必要な情報を、素早く、正確に抽出できるようにする技術だよ👍

方法 LLM（GPTとか）を使って、テキストから情報を引っ張り出す！でも、LLMだけだとたまに間違っちゃうから、ASP（知識を整理するやつ）を使ってチェックするの！この組み合わせがスゴイんだって🌟

続きは「らくらく論文」アプリで

An LLM + ASP Workflow for Joint Entity-Relation Extraction

Trang Tran (New Mexico State University) / Trung Hoang Le (New Mexico State University) / Huiping Cao (New Mexico State University) / Tran Cao Son (New Mexico State University)

Joint entity-relation extraction (JERE) identifies both entities and their relationships simultaneously. Traditional machine-learning based approaches to performing this task require a large corpus of annotated data and lack the ability to easily incorporate domain specific information in the construction of the model. Therefore, creating a model for JERE is often labor intensive, time consuming, and elaboration intolerant. In this paper, we propose harnessing the capabilities of generative pre-trained large language models (LLMs) and the knowledge representation and reasoning capabilities of Answer Set Programming (ASP) to perform JERE. We present a generic workflow for JERE using LLMs and ASP. The workflow is generic in the sense that it can be applied for JERE in any domain. It takes advantage of LLM's capability in natural language understanding in that it works directly with unannotated text. It exploits the elaboration tolerant feature of ASP in that no modification of its core program is required when additional domain specific knowledge, in the form of type specifications, is found and needs to be used. We demonstrate the usefulness of the proposed workflow through experiments with limited training data on three well-known benchmarks for JERE. The results of our experiments show that the LLM + ASP workflow is better than state-of-the-art JERE systems in several categories with only 10% of training data. It is able to achieve a 2.5 times (35% over 15%) improvement in the Relation Extraction task for the SciERC corpus, one of the most difficult benchmarks.

cs / cs.AI / cs.CL

Arxivで見る