iconLogo
Published:2026/1/10 23:56:33

最強!DS-CIMでエッジAI爆速化🚀

論文の内容を、わかりやすく解説していくよ~!

超要約:エッジAIを爆速にする新技術!⚡️

ギャル的キラキラポイント✨

● エッジAI(身近なデバイスで動くAI)の計算を、もっと速く&省エネにする方法を見つけたんだって! ● 計算の精度もめっちゃ高いから、色んなAIアプリで使えるようになるかも! ● AIチップの性能アップで、私たちの生活がもっと便利になる予感…💖

続きは「らくらく論文」アプリで

DS-CIM: Digital Stochastic Computing-In-Memory Featuring Accurate OR-Accumulation via Sample Region Remapping for Edge AI Models

Kunming Shao / Liang Zhao / Jiangnan Yu / Zhipeng Liao / Xiaomeng Wang / Yi Zou / Tim Kwang-Ting Cheng / Chi-Ying Tsui

Stochastic computing (SC) offers hardware simplicity but suffers from low throughput, while high-throughput Digital Computing-in-Memory (DCIM) is bottlenecked by costly adder logic for matrix-vector multiplication (MVM). To address this trade-off, this paper introduces a digital stochastic CIM (DS-CIM) architecture that achieves both high accuracy and efficiency. We implement signed multiply-accumulation (MAC) in a compact, unsigned OR-based circuit by modifying the data representation. Throughput is enhanced by replicating this low-cost circuit 64 times with only a 1x area increase. Our core strategy, a shared Pseudo Random Number Generator (PRNG) with 2D partitioning, enables single-cycle mutually exclusive activation to eliminate OR-gate collisions. We also resolve the 1s saturation issue via stochastic process analysis and data remapping, significantly improving accuracy and resilience to input sparsity. Our high-accuracy DS-CIM1 variant achieves 94.45% accuracy for INT8 ResNet18 on CIFAR-10 with a root-mean-squared error (RMSE) of just 0.74%. Meanwhile, our high-efficiency DS-CIM2 variant attains an energy efficiency of 3566.1 TOPS/W and an area efficiency of 363.7 TOPS/mm^2, while maintaining a low RMSE of 3.81%. The DS-CIM capability with larger models is further demonstrated through experiments with INT8 ResNet50 on ImageNet and the FP8 LLaMA-7B model.

cs / cs.AR / cs.LG