T-NUCA - a novel approach to non-uniform access latency cache architectures for 3D CMPs

Parallel & Distributed Processing, Workshops and Phd Forum(2010)

引用 2|浏览17
暂无评分
摘要
We consider a non-uniform access latency cache architecture (NUCA) design for 3D chip multi-processors (CMPs) where cache structures are divided into small banks interconnected by a network-on-chip (NoC). In earlier NUCA designs, data is placed in banks either statically (S-NUCA) or dynamically (D-NUCA). In both S-NUCA and D-NUCA designs, scaling to hundreds of cores can pose several challenges. Thus, we propose a new NUCA architecture with an inclusive, octal tree-based, hierarchical directory (T-NUCA-8), with the potential to scale to hundreds of cores with performance comparable to D-NUCA at a fraction of the energy cost. Our evaluations indicate that relative to D-NUCA, our T-NUCA-8 reduces network usage by 92%, energy by 87%, and EDP by 87%, at performance cost of 10%.
更多
查看译文
关键词
cache storage,multiprocessing systems,network-on-chip,octrees,3D chip multiprocessors,T-NUCA,network-on-chip,nonuniform access latency cache architectures,octal tree based hierarchical directory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要