
Title: Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Authors: Christopher Wolters, Xiaoxuan Yang, Ulf Schlichtmann, Toyotaro Suzumura
Published: June 12, 2024 (arXiv:2406.08413v1)
Subjects: Hardware Architecture (cs.AR), Machine Learning (cs.LG)