Title & TL;DR: Bias-busting AI makes debates fair! ✨
Gal-style sparkle points ✨ ● Fairness UP ⤴ for LLM (large language model) debates ● Anonymization keeps bias in check! ● AI services in the IT industry get more trustworthy 💖
Detailed explanation ● Background: AIs have started debating each other lately, right? But they'd cling to their own opinions or get swept along by others'... bias was a real problem! 🤯 This research fixes that!
● Method: Hide the AIs' names! 🤫 That way nobody can tell who said what, so the agents are less likely to be swayed by the wrong things! It's called anonymization.
● Results: With anonymization, bias went down! 🎉 That means the AIs can debate more objectively now! Amazing!
Read the rest in the 「らくらく論文」 app
Multi-agent debate (MAD) aims to improve large language model (LLM) reasoning by letting multiple agents exchange answers and then aggregate their opinions. Yet recent studies reveal that agents are not neutral: they are prone to identity-driven sycophancy and self-bias, uncritically adopting a peer's view or stubbornly adhering to their own prior output, undermining the reliability of debate. In this work, we present the first principled framework that jointly mitigates and quantifies identity bias in MAD, covering both sycophancy and self-bias. First, we formalize the debate dynamics as an identity-weighted Bayesian update process. Second, we propose response anonymization: by removing identity markers from prompts, agents cannot distinguish "self" from "peer", which forces equal weights on agent identity, thereby reducing bias and improving trustworthiness. Third, we define the Identity Bias Coefficient (IBC), a principled bias metric that measures an agent's tendency to follow its peer versus itself. Empirical studies across multiple models and benchmarks confirm that identity bias is widespread, with sycophancy far more common than self-bias. Our findings highlight the need to ensure that MAD systems reason based on content rather than identity. Code is released at https://github.com/deeplearning-wisc/MAD-identity-bias.
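To make the two ideas in the abstract concrete, here is a minimal sketch: stripping identity markers from debate responses (the proposed anonymization) and a toy identity-bias score. The agent names, masking scheme, and the exact scoring formula are all illustrative assumptions, not the paper's actual implementation; the real IBC definition may differ in form, though the sign convention (positive for sycophancy, negative for self-bias) matches the abstract.

```python
# Hypothetical agent names used only for this illustration.
AGENT_TAGS = ["Agent A", "Agent B"]

def anonymize(responses):
    """Strip identity markers so each debater sees unlabeled peer answers.

    `responses` maps an agent name to its answer text; the output is a
    plain list of answers with any agent-name mentions masked, so an
    agent can no longer tell "self" from "peer".
    """
    anonymized = []
    for _name, text in responses.items():
        for tag in AGENT_TAGS:
            text = text.replace(tag, "[agent]")
        anonymized.append(text)
    return anonymized

def identity_bias_coefficient(follow_peer, follow_self):
    """Toy stand-in for the paper's IBC (assumed formula).

    Counts how often an agent switched to its peer's answer versus kept
    its own, normalized to [-1, 1]: positive = sycophancy, negative =
    self-bias, zero = no identity bias.
    """
    total = follow_peer + follow_self
    if total == 0:
        return 0.0
    return (follow_peer - follow_self) / total
```

For example, `anonymize({"Agent A": "I agree with Agent B.", "Agent B": "The answer is 42."})` yields `["I agree with [agent].", "The answer is 42."]`, and an agent that followed its peer 8 times but itself only 2 times scores `(8 - 2) / 10 = 0.6`, i.e. strongly sycophantic.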