🌟 ギャル的キラキラポイント✨ ● データ分析の信頼性を上げる「PCSフレームワーク」ってのがすごい!Predictability(予測可能性)、Computability(計算可能性)、Stability(安定性)を重視するんだって! ● GenAI(生成AI)をデータ分析に使うと、データクレンジングとかコード生成が超時短になるらしい!でも、PCSフレームワークでちゃんとチェックしなきゃダメだよ! ● 「PCSアンサンブル」っていう、いろんなモデルを組み合わせる方法がアツい!予測のズレを減らして、もっと信頼できる結果になるんだって!
詳細解説いくよ~!
背景
最近のAI(人工知能)ってすごいけど、データ分析の結果って、誰がやるかとか、どんなAIモデルを使うかで全然違ってくるじゃん? だから、出た結果がホントに正しいのか、信用できないこともあったの。IT業界でも、AIをもっとちゃんと使いたいけど、信頼性がないと困るよね!
続きは「らくらく論文」アプリで
Data science is a pillar of artificial intelligence (AI), which is transforming nearly every domain of human activity, from the social and physical sciences to engineering and medicine. While data-driven findings in AI offer unprecedented power to extract insights and guide decision-making, many are difficult or impossible to replicate. A key reason for this challenge is the uncertainty introduced by the many choices made throughout the data science life cycle (DSLC). Traditional statistical frameworks often fail to account for this uncertainty. The Predictability-Computability-Stability (PCS) framework for veridical (truthful) data science offers a principled approach to addressing this challenge throughout the DSLC. This paper presents an updated and streamlined PCS workflow, tailored for practitioners and enhanced with guided use of generative AI. We include a running example to display the PCS framework in action, and conduct a related case study which showcases the uncertainty in downstream predictions caused by judgment calls in the data cleaning stage.