時系列データ分析、LLM で爆速キャプション生成！

Published：2026/1/7 7:03:34

タイトル & 超要約：時系列データ分析、LLM で爆速キャプション生成！

🌟 ギャル的キラキラポイント✨ ● 時系列データの説明を、AIが秒速で自然な言葉にしてくれるんだって！✨ ● 金融とか医療とか、色んな分野で役立つ未来が待ってる予感💖 ● CaTS-Bench っていう、すごい評価方法も開発したみたい！すごーい！😍

詳細解説 ● 背景時系列データ（時間の流れがあるデータ）って、専門家じゃないと分析難しいじゃん？それを、LLM（大規模言語モデル）とかVLM（Vision-Language Models）っていうAIを使って、誰でも理解できるようにしよう！っていう研究だよ🌟

● 方法 CaTS-Benchっていう新しい評価方法を作ったんだって！色んな分野の時系列データを使って、LLMがどれだけ正確に説明できるか試すんだって。人間が書いた説明も参考に、AIのすごさを評価するらしい💖

● 結果時系列データをAIが自然な文章にしてくれるから、データ分析がめっちゃ楽になる！金融とか医療とか、色んな分野で役立つ未来が想像できるよね✨ データ分析がもっと身近になるかも！

続きは「らくらく論文」アプリで

CaTS-Bench: Can Language Models Describe Time Series?

Luca Zhou / Pratham Yashwante / Marshall Fisher / Alessio Sampieri / Zihao Zhou / Fabio Galasso / Rose Yu

Time series captioning, the task of describing time series in natural language, requires numeric and temporal reasoning, trend interpretation, and contextual understanding. Existing benchmarks, however, often rely on fully synthetic or generic captions, and typically neglect metadata and visual representations. We introduce \textbf{CaTS-Bench}, a comprehensive benchmark for \textbf{C}ontext-\textbf{a}ware \textbf{T}ime \textbf{S}eries reasoning across $11$ diverse domains, centered on a gold-standard evaluation set of $1746$ human-rewritten captions that measure how effectively models translate numeric trends into immediately interpretable narratives. To address the scarcity of human-annotated data, we also propose a scalable pipeline for generating high-fidelity synthetic captions, the quality of which we validate. We evaluate leading Vision-Language Models on our benchmark, revealing that even proprietary models struggle to capture numeric nuances in temporal descriptions, while finetuning open-source models on synthetic data yields substantial performance gains. Finally, we release a diagnostic suite of $910$ multiple-choice questions and tailored numeric metrics to gauge time-series-specific reasoning capabilities, establishing CaTS-Bench as a reliable foundation for grounded, multimodal language generation in numeric domains.

cs / cs.LG / cs.AI / cs.CV

Arxivで見る