Ultra-short summary: This research uses a physics-inspired theory to crack AI's mysteries (like grokking)! It might even be useful for business!
🌟 Gal-Style Sparkle Points
● Using physics to explain what neural networks (AI) are doing? Kinda cool, right? 😎
● Grokking (when a model suddenly gets smart) and phase transitions (changes of state) are so romantic~!
● And if we understand how AI works, that means we can build even more amazing AI, right? The future's looking bright! ✨
Detailed Explanation
Background: Neural networks (our AI friends) are amazing, but why they get smart is still full of mysteries 🤔 There's a lot that classical theory can't explain either... what a headache!
Method: Enter the physics-inspired "singular learning theory (SLT)"! 📐 The plan is to use it to decode the AI's learning with equations, taking a close look at grokking and phase transitions 👀
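For the math-curious, the key equation is Watanabe's free-energy expansion, a standard result of singular learning theory (general background, not a derivation specific to this paper):

$$\mathcal{F}_n = n L_n(w^*) + \lambda \log n + O_p(\log \log n),$$

where $n$ is the number of training samples, $L_n(w^*)$ is the empirical loss at an optimal parameter $w^*$, and $\lambda$ is the learning coefficient (the real log canonical threshold). In a regular model $\lambda = d/2$ with $d$ the parameter count, recovering the classical BIC penalty; in singular networks $\lambda$ can be much smaller, and changes in it are what show up as phase transitions 👀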
Classical statistical inference and learning theory often fail to explain the success of modern neural networks. A key reason is that these models are non-identifiable (singular), violating core assumptions behind PAC bounds and asymptotic normality. Singular learning theory (SLT), a physics-inspired framework grounded in algebraic geometry, has gained popularity for its ability to close this theory-practice gap. In this paper, we empirically study SLT in toy settings relevant to interpretability and phase transitions. First, we probe the SLT free energy $\mathcal{F}_n$ by testing an Arrhenius-style rate hypothesis using both a grokking modulo-arithmetic model and Anthropic's Toy Models of Superposition. Second, we examine the local learning coefficient $\lambda_{\alpha}$ by measuring how it scales with problem difficulty across several controlled network families (polynomial regressors, low-rank linear networks, and low-rank autoencoders). Some of our experiments recover known scaling laws, while others reveal meaningful deviations from theoretical expectations. Overall, our paper illustrates the many merits of SLT for understanding neural network phase transitions, and poses open research questions for the field.
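To make the abstract's "Arrhenius-style rate hypothesis" concrete, here is a minimal sketch of fitting the Arrhenius law $k = A\,e^{-E/T}$ to transition-rate data. Everything in it is a hypothetical placeholder: the data values and the choice of temperature-like control parameter are illustrative assumptions, not the paper's actual experiment.

```python
# Minimal illustrative sketch (not the paper's code): fit an Arrhenius-style
# rate law  k = A * exp(-E / T)  to hypothetical transition-rate data.
import numpy as np
from scipy.optimize import curve_fit

def log_arrhenius(inv_T, log_A, E):
    # In log form the law is linear in 1/T:  log k = log A - E * (1/T)
    return log_A - E * inv_T

# Hypothetical placeholder data: a temperature-like control parameter T and
# measured transition rates k (e.g., inverse times-to-grok). Replace these
# with real measurements.
T = np.array([0.5, 1.0, 1.5, 2.0, 2.5])
k = np.array([1.1e-4, 9.8e-3, 4.6e-2, 1.1e-1, 1.9e-1])

(log_A, E), _ = curve_fit(log_arrhenius, 1.0 / T, np.log(k))
print(f"fitted: log A = {log_A:.2f}, energy barrier E = {E:.2f}")
```

If the hypothesis holds, $\log k$ plotted against $1/T$ falls on a straight line; systematic curvature would count as evidence against it.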