金融のウソを見破る！RFC-BENCHってスゴくない？💰✨

Published：2026/1/8 14:39:03

最強ギャルが教える！金融偽情報検出の最新ベンチマークRFC-BENCH💰✨

超要約：金融のウソ情報を見破るスゴい技術が登場したよ！😎

🌟 ギャル的キラキラポイント✨

● 金融ニュースのウソ、paragraph（段落）単位で見抜く！📝 ● 4つのタイプ（数字、感情とか）のウソを見抜けるように工夫されてる！🤔 ● LLM（AI）がウソを見抜くための、新しいテストって感じ！💯

詳細解説いくよ～！

続きは「らくらく論文」アプリで

All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection

Yuechen Jiang / Zhiwei Liu / Yupeng Cao / Yueru He / Chen Xu / Ziyang Xu / Zhiyang Deng / Prayag Tiwari / Xi Chen / Alejandro Lopez-Lira / Jimin Huang / Junichi Tsujii / Sophia Ananiadou

We introduce RFC Bench, a benchmark for evaluating large language models on financial misinformation under realistic news. RFC Bench operates at the paragraph level and captures the contextual complexity of financial news where meaning emerges from dispersed cues. The benchmark defines two complementary tasks: reference free misinformation detection and comparison based diagnosis using paired original perturbed inputs. Experiments reveal a consistent pattern: performance is substantially stronger when comparative context is available, while reference free settings expose significant weaknesses, including unstable predictions and elevated invalid outputs. These results indicate that current models struggle to maintain coherent belief states without external grounding. By highlighting this gap, RFC Bench provides a structured testbed for studying reference free reasoning and advancing more reliable financial misinformation detection in real world settings.

cs / cs.CL / cs.CE / q-fin.CP

Arxivで見る