VotIEで議事録分析!投票情報を自動抽出、未来がアガる✨
1. ギャル的キラキラポイント✨
● 地方議会の議事録(ぎじろく)から投票情報をAIで抽出!未来すぎる💖 ● 色んな自治体(じちたい)の議事録でも使えるように、汎化性能(はんかせいのう)もチェックしてるの👍 ● 市民(しみん)が政治(せいじ)にもっと参加(さんか)しやすくなるかもって、良くない?😍
2. 詳細解説
背景 地方自治体(ちほうじちたい)の議事録って、形式(けいしき)バラバラでAIが読み解くの難しかったんだよね💦 でも、議事録には大事な投票の情報がいっぱい! それをAIでサクッと抽出(ちゅうしゅつ)できたら、めちゃくちゃ便利じゃん?✨
続きは「らくらく論文」アプリで
Municipal meeting minutes record key decisions in local democratic processes. Unlike parliamentary proceedings, which typically adhere to standardized formats, they encode voting outcomes in highly heterogeneous, free-form narrative text that varies widely across municipalities, posing significant challenges for automated extraction. In this paper, we introduce VotIE (Voting Information Extraction), a new information extraction task aimed at identifying structured voting events in narrative deliberative records, and establish the first benchmark for this task using Portuguese municipal minutes, building on the recently introduced CitiLink corpus. Our experiments yield two key findings. First, under standard in-domain evaluation, fine-tuned encoders, specifically XLM-R-CRF, achieve the strongest performance, reaching 93.2\% macro F1, outperforming generative approaches. Second, in a cross-municipality setting that evaluates transfer to unseen administrative contexts, these models suffer substantial performance degradation, whereas few-shot LLMs demonstrate greater robustness, with significantly smaller declines in performance. Despite this generalization advantage, the high computational cost of generative models currently constrains their practicality. As a result, lightweight fine-tuned encoders remain a more practical option for large-scale, real-world deployment. To support reproducible research in administrative NLP, we publicly release our benchmark, trained models, and evaluation framework.