iconLogo
Published:2026/1/11 14:57:22

RAGシステム、エコにする方法って?🚀✨(超要約:省エネRAGで爆益!)

🌟 ギャル的キラキラポイント ● LLM(大規模言語モデル)の電気代💸を節約! ● 検索(RAG)システムをもっと賢く✨ ● 環境にもお財布にも優しいRAGが最強🌎

詳細解説 背景: 最近、AIさん大活躍だけど、電気めっちゃ使うじゃん?😭 データセンター(AIの頭脳🧠)の電気代もバカにならないし、地球にも優しくない🙅‍♀️ そこで、RAGシステム(賢い検索システム)を省エネにしよー!って話💡

方法: 論文では、RAGシステムの電気代を節約する色んな方法を試したみたい🧐 埋め込み(データ変換)サイズを変えたり、検索する時のレベルを調整したり…色々工夫した結果、精度を落とさずに省エネできる方法を見つけたみたい💖

結果: 実験の結果、埋め込みサイズを小さくしたり、類似度(似てる度合い)の基準を調整すると、電気代を節約できることが判明🎉 インデックス(データ整理)の方法を変えるともっと早く動くけど、ちょっとだけ精度が落ちちゃうことも🤔

続きは「らくらく論文」アプリで

On the Effectiveness of Proposed Techniques to Reduce Energy Consumption in RAG Systems: A Controlled Experiment

Zhinuan Guo / Chushu Gao / Justus Bogner

The rising energy demands of machine learning (ML), e.g., implemented in popular variants like retrieval-augmented generation (RAG) systems, have raised significant concerns about their environmental sustainability. While previous research has proposed green tactics for ML-enabled systems, their empirical evaluation within RAG systems remains largely unexplored. This study presents a controlled experiment investigating five practical techniques aimed at reducing energy consumption in RAG systems. Using a production-like RAG system developed at our collaboration partner, the Software Improvement Group, we evaluated the impact of these techniques on energy consumption, latency, and accuracy. Through a total of 9 configurations spanning over 200 hours of trials using the CRAG dataset, we reveal that techniques such as increasing similarity retrieval thresholds, reducing embedding sizes, applying vector indexing, and using a BM25S reranker can significantly reduce energy usage, up to 60% in some cases. However, several techniques also led to unacceptable accuracy decreases, e.g., by up to 30% for the indexing strategies. Notably, finding an optimal retrieval threshold and reducing embedding size substantially reduced energy consumption and latency with no loss in accuracy, making these two techniques truly energy-efficient. We present the first comprehensive, empirical study on energy-efficient design techniques for RAG systems, providing guidance for developers and researchers aiming to build sustainable RAG applications.

cs / cs.SE