iconLogo
Published:2026/1/11 13:17:59

オンラインの悪口、AIでブロック!✨ (攻撃的言動検出)

超要約: オンラインの悪い言葉を、AIで賢く見つけてやっつけちゃう研究だよ!💪

✨ ギャル的キラキラポイント ✨ ● 悪口を"隠された意味"から見抜く!まるでエスパー🔮 ● 会話の流れ(文脈)もちゃんと見てるからすごい😳 ● IT企業が困ってること、全部解決できちゃうかも?🤩

詳細解説 ● 背景 ネット社会、便利だけど悪口とかヘイトスピーチ(差別的な発言)も多いよね💦 これをAIでどうにかしたい!って研究だよ。 今までのAIじゃ見つけにくかった、ちょっと隠された悪口も見つけられるように頑張ってるんだって!

● 方法 新しいデータセット「HACD」を作って、AIくんに悪口のパターンをいっぱい学習させたみたい。 それを、悪口の種類ごとに得意なAIモデルを組み合わせて、賢く検出する「階層型分割統治フレームワーク」っていうスゴイ方法を使ってるんだって!✨

続きは「らくらく論文」アプリで

Fine-grained Verbal Attack Detection via a Hierarchical Divide-and-Conquer Framework

Quan Zheng / Yuanhe Tian / Ming Wang / Yan Song

In the digital era, effective identification and analysis of verbal attacks are essential for maintaining online civility and ensuring social security. However, existing research is limited by insufficient modeling of conversational structure and contextual dependency, particularly in Chinese social media where implicit attacks are prevalent. Current attack detection studies often emphasize general semantic understanding while overlooking user response relationships, hindering the identification of implicit and context-dependent attacks. To address these challenges, we present the novel "Hierarchical Attack Comment Detection" dataset and propose a divide-and-conquer, fine-grained framework for verbal attack recognition based on spatiotemporal information. The proposed dataset explicitly encodes hierarchical reply structures and chronological order, capturing complex interaction patterns in multi-turn discussions. Building on this dataset, the framework decomposes attack detection into hierarchical subtasks, where specialized lightweight models handle explicit detection, implicit intent inference, and target identification under constrained context. Extensive experiments on the proposed dataset and benchmark intention detection datasets show that smaller models using our framework significantly outperform larger monolithic models relying on parameter scaling, demonstrating the effectiveness of structured task decomposition.

cs / cs.CL