TL;DR: Do LLMs' moral values differ by culture? This study investigates! ✨
● They compared the moral values of a wide range of LLMs against value-survey data from many countries!
● Ethical judgments turned out to vary quite a bit from model to model! 😱
● Instruction tuning and scaling up model size may improve alignment!
Detailed Explanation
Background
LLMs learn from all kinds of data, so bias can inevitably creep in, right? 🤔 Ethical judgments in particular differ a lot from culture to culture, so don't you wonder how LLMs handle that? That's exactly what this study looks into!
Method
Using data from global values surveys (WVS and PEW), they put ethical questions to more than 20 LLMs! 💡 For example, topics where opinions split across cultures, like "Is homosexuality justifiable?", and then analyzed the models' answers (a rough scoring sketch follows below).
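As a rough illustration (not the authors' exact code), here is a minimal sketch of how a log-probability-based justifiability score might be computed for one topic with an off-the-shelf causal LM. The prompt template and the "justifiable" / "not justifiable" completions are assumptions for illustration; GPT-2 stands in because it is one of the smaller models the paper evaluates.

```python
# Minimal sketch: score a moral topic by comparing the model's
# log-probability of "justifiable" vs. "not justifiable" completions.
# The prompt template below is an assumption, not the paper's exact wording.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def sequence_logprob(text: str) -> float:
    """Total log-probability the model assigns to the tokens of `text`."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)
    # out.loss is the mean negative log-likelihood per predicted token,
    # so multiply back by the number of predicted tokens and negate.
    return -out.loss.item() * (ids.shape[1] - 1)

def justifiability_score(topic: str) -> float:
    """Positive => 'justifiable' is more probable than 'not justifiable'."""
    prompt = f"In my opinion, {topic} is"
    return (sequence_logprob(prompt + " justifiable")
            - sequence_logprob(prompt + " not justifiable"))

print(justifiability_score("divorce"))
```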
Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture culturally diverse moral values remains unclear. In this paper, we examine whether LLMs mirror variations in moral attitudes reported by the World Values Survey (WVS) and the Pew Research Center's Global Attitudes Survey (PEW). We compare smaller monolingual and multilingual models (GPT-2, OPT, BLOOMZ, and Qwen) with recent instruction-tuned models (GPT-4o, GPT-4o-mini, Gemma-2-9b-it, and Llama-3.3-70B-Instruct). Using log-probability-based "moral justifiability" scores, we correlate each model's outputs with survey data covering a broad set of ethical topics. Our results show that many earlier or smaller models often produce near-zero or negative correlations with human judgments. In contrast, advanced instruction-tuned models achieve substantially higher positive correlations, suggesting they better reflect real-world moral attitudes. We provide a detailed regional analysis revealing that models align better with Western, Educated, Industrialized, Rich, and Democratic (W.E.I.R.D.) nations than with other regions. While scaling model size and using instruction tuning improves alignment with cross-cultural moral norms, challenges remain for certain topics and regions. We discuss these findings in relation to bias analysis, training data diversity, information retrieval implications, and strategies for improving the cultural sensitivity of LLMs.
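To make the correlation step concrete, here is a hedged sketch of how per-topic model scores could be compared with survey responses. The topic list and all numbers are placeholders, not data from the paper; the comparison uses SciPy's standard Pearson correlation.

```python
# Sketch: correlate per-topic model justifiability scores with survey means.
# All numbers below are illustrative placeholders, NOT data from the paper.
from scipy.stats import pearsonr

topics = ["divorce", "euthanasia", "abortion", "the death penalty"]

# Hypothetical mean justifiability ratings from a survey (e.g., a WVS-style
# 1-10 scale), one value per topic.
survey_means = [6.1, 5.2, 4.3, 4.8]

# Hypothetical model scores, e.g., from justifiability_score() above.
model_scores = [1.4, 0.2, -0.8, 0.1]

r, p = pearsonr(survey_means, model_scores)
print(f"Pearson r = {r:.2f} (p = {p:.3f})")
```

A strongly positive r would indicate the model's relative ranking of topics tracks the surveyed population's attitudes; the paper reports that this alignment is much higher for recent instruction-tuned models than for earlier, smaller ones.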