REMAP: a reliability/endurance mechanism for advancing PCM.

MEMSYS 2017: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS(2017)

引用 16|浏览65
暂无评分
摘要
Even given PCM's attractive features that include high scalability and lower power, write endurance remains a critical issue that impedes the move for this technology to replace DRAM in main memory systems. The wear-out problem is further exacerbated by advances in future technologies, where cell sizes are reduced and process variation increases. When using PCMs, worn-out cells are permanently stuck at either '0' or '1'. Successful adoption of PCM requires recovery from multiple stuck-at faults in a data block. Current error correction schemes for PCMs have limited capabilities to tolerate faults. In this paper we propose REMAP to improve the reliability of PCMs so that they can tolerate a large number of hard faults. In contrast to previous schemes, REMAP uses all the metadata space for replacing faulty bits - error detection and location information is not needed. The detection and location of failed memory cells are identified by read verification and an extra write operation. Despite tolerating many hard errors, employing REMAP can negatively impact cell lifetime due to the extra writes. We propose solutions to alleviate this problem and increase memory lifetime significantly. REMAP performs write endurance localization using both static and dynamic partitioning. Additionally, fault location caching is used to avoid the extra write overhead. Given the error correction capabilities of REMAP, we consider using it as a second layer of defense that is combined with other schemes. Our evaluation, which includes both Monte Carlo and trace-driven simulation, shows that REMAP is capable of boosting the PCM lifetime by 56% on average (up to 78%) as compared to our baseline.
更多
查看译文
关键词
phase change memory,non-volatile memory,error correction,fault detection,endurance,lifetime
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要