iconLogo
Published:2025/12/3 22:11:45

タイトル & 超要約:SQUARE爆誕!表データ分析革命💥

こんかい紹介するのは、表形式データ(スプレッドシートとか)を最強に使いこなすためのAI「SQUARE」の研究だよ! 複雑な表でも、質問に秒速で答えてくれるようになるんだって!✨

✨ ギャル的キラキラポイント ✨

● 構造(ヘッダーとか結合セル)をちゃんと理解😎! 頭のいいAIね! ● SQLとチャンク検索(情報を細かく分けて検索すること)を使い分け、柔軟に対応してるって! 賢すぎ💖 ● 回答の根拠(どこから情報を引っ張ってきたか)がわかるから、めっちゃ安心安全じゃん?🫶

詳細解説いくねー!

続きは「らくらく論文」アプリで

SQuARE: Structured Query & Adaptive Retrieval Engine For Tabular Formats

Chinmay Gondhalekar / Urjitkumar Patel / Fang-Chun Yeh

Accurate question answering over real spreadsheets remains difficult due to multirow headers, merged cells, and unit annotations that disrupt naive chunking, while rigid SQL views fail on files lacking consistent schemas. We present SQuARE, a hybrid retrieval framework with sheet-level, complexity-aware routing. It computes a continuous score based on header depth and merge density, then routes queries either through structure-preserving chunk retrieval or SQL over an automatically constructed relational representation. A lightweight agent supervises retrieval, refinement, or combination of results across both paths when confidence is low. This design maintains header hierarchies, time labels, and units, ensuring that returned values are faithful to the original cells and straightforward to verify. Evaluated on multi-header corporate balance sheets, a heavily merged World Bank workbook, and diverse public datasets, SQuARE consistently surpasses single-strategy baselines and ChatGPT-4o on both retrieval precision and end-to-end answer accuracy while keeping latency predictable. By decoupling retrieval from model choice, the system is compatible with emerging tabular foundation models and offers a practical bridge toward a more robust table understanding.

cs / cs.CL