超要約:AIカウンセラーの質を爆上げ(ばくあげ)する評価基準ができたよ!
✨ ギャル的キラキラポイント ✨ ● 複数のセッションに対応!長期的な関係性も築けちゃうかも💕 ● いろんな治療法(CBTとか)に対応!まるで万能ギャル✨ ● AIカウンセラーの出来を細かく評価!まるで推し活💖
詳細解説 ● 背景 AIがカウンセラーみたいに話せる時代になったけど、まだ課題がいっぱい💦 1回の会話だけじゃなくて、何回も話して、色んな治療法にも対応できるAIが求められてるんだよね!
● 方法 PsychEval(サイクエバル)っていう評価基準を作ったよ!長〜い会話とか、色んな治療法に対応できるように、AIの能力をチェックする項目がいっぱいあるんだって!
続きは「らくらく論文」アプリで
To develop a reliable AI for psychological assessment, we introduce \texttt{PsychEval}, a multi-session, multi-therapy, and highly realistic benchmark designed to address three key challenges: \textbf{1) Can we train a highly realistic AI counselor?} Realistic counseling is a longitudinal task requiring sustained memory and dynamic goal tracking. We propose a multi-session benchmark (spanning 6-10 sessions across three distinct stages) that demands critical capabilities such as memory continuity, adaptive reasoning, and longitudinal planning. The dataset is annotated with extensive professional skills, comprising over 677 meta-skills and 4577 atomic skills. \textbf{2) How to train a multi-therapy AI counselor?} While existing models often focus on a single therapy, complex cases frequently require flexible strategies among various therapies. We construct a diverse dataset covering five therapeutic modalities (Psychodynamic, Behaviorism, CBT, Humanistic Existentialist, and Postmodernist) alongside an integrative therapy with a unified three-stage clinical framework across six core psychological topics. \textbf{3) How to systematically evaluate an AI counselor?} We establish a holistic evaluation framework with 18 therapy-specific and therapy-shared metrics across Client-Level and Counselor-Level dimensions. To support this, we also construct over 2,000 diverse client profiles. Extensive experimental analysis fully validates the superior quality and clinical fidelity of our dataset. Crucially, \texttt{PsychEval} transcends static benchmarking to serve as a high-fidelity reinforcement learning environment that enables the self-evolutionary training of clinically responsible and adaptive AI counselors.