iconLogo
Published:2026/1/11 11:29:21

最強ギャルAIが解説!DaQ-MSAで感情分析レベル爆上げ💖

超要約:感情分析を爆上げする、データ拡張テクだよ!

🌟 ギャル的キラキラポイント✨

● 既存データ不足を、拡散モデル(画像とか作るやつ)で解決しちゃう!天才!👏 ● 生成データの品質を評価する機能付き! 粗悪品は排除なの♪😎 ● 感情分析の精度アップで、色んなサービスがもっと良くなるってワケ🫶

詳細解説いくよ~!

続きは「らくらく論文」アプリで

DaQ-MSA: Denoising and Qualifying Diffusion Augmentations for Multimodal Sentiment Analysis

Jiazhang Liang / Jianheng Dai / Miaosen Luo / Menghua Jiang / Sijie Mai

Multimodal large language models (MLLMs) have demonstrated strong performance on vision-language tasks, yet their effectiveness on multimodal sentiment analysis remains constrained by the scarcity of high-quality training data, which limits accurate multimodal understanding and generalization. To alleviate this bottleneck, we leverage diffusion models to perform semantics-preserving augmentation on the video and audio modalities, expanding the multimodal training distribution. However, increasing data quantity alone is insufficient, as diffusion-generated samples exhibit substantial quality variation and noisy augmentations may degrade performance. We therefore propose DaQ-MSA (Denoising and Qualifying Diffusion Augmentations for Multimodal Sentiment Analysis), which introduces a quality scoring module to evaluate the reliability of augmented samples and assign adaptive training weights. By down-weighting low-quality samples and emphasizing high-fidelity ones, DaQ-MSA enables more stable learning. By integrating the generative capability of diffusion models with the semantic understanding of MLLMs, our approach provides a robust and generalizable automated augmentation strategy for training MLLMs without any human annotation or additional supervision.

cs / cs.LG / cs.AI