少ない音楽データから学習！インド古典音楽をAIで解析🎉

Published：2026/1/7 6:12:48

タイトル：少ない音楽データから学習！インド古典音楽をAIで解析🎉 超要約：ラベル少ない音楽データもOK！AIでインド古典音楽を分析するよ～！

✨ ギャル的キラキラポイント ✨ ● 専門知識ゼロでも大丈夫！AIがインド古典音楽（IAM）の謎を解き明かすよ💖 ● ラベル少ないデータでも大丈夫！「ラベル伝播」っていうスゴ技を使うから✨ ● IAMの音楽ストリーミングとか、教育アプリとか、夢広がる未来が待ってる💎

詳細解説 ● 背景 IAMの研究って、データ集めるのが大変だったの！ラベル付け（曲にタグ付けする事）に時間もお金もかかるし、専門知識も必要だったから💦 それをAIで解決しよー！ってのが始まりなんだ✨

● 方法「ラベル伝播（LP）」っていう、グラフ構造を使ったスゴイ方法を使うよ！グラフで曲同士の関係を表して、少ないラベルの情報から、他の曲にも情報を広げていくんだって！まるで友達の輪みたい👯‍♀️

● 結果 LPのおかげで、少ないデータでもIAMの分析がめっちゃ捗るようになったってこと！楽器とかラーガ（インドの伝統的なメロディーの形式）とか、いろんな情報をAIが学習できるようになったんだって！すごい😳

続きは「らくらく論文」アプリで

Learning from Limited Labels: Transductive Graph Label Propagation for Indian Music Analysis

Parampreet Singh / Akshay Raina / Sayeedul Islam Sheikh / Vipul Arora

Supervised machine learning frameworks rely on extensive labeled datasets for robust performance on real-world tasks. However, there is a lack of large annotated datasets in audio and music domains, as annotating such recordings is resource-intensive, laborious, and often require expert domain knowledge. In this work, we explore the use of label propagation (LP), a graph-based semi-supervised learning technique, for automatically labeling the unlabeled set in an unsupervised manner. By constructing a similarity graph over audio embeddings, we propagate limited label information from a small annotated subset to a larger unlabeled corpus in a transductive, semi-supervised setting. We apply this method to two tasks in Indian Art Music (IAM): Raga identification and Instrument classification. For both these tasks, we integrate multiple public datasets along with additional recordings we acquire from Prasar Bharati Archives to perform LP. Our experiments demonstrate that LP significantly reduces labeling overhead and produces higher-quality annotations compared to conventional baseline methods, including those based on pretrained inductive models. These results highlight the potential of graph-based semi-supervised learning to democratize data annotation and accelerate progress in music information retrieval.

cs / eess.AS / cs.LG

Arxivで見る