Data repair accelerating scheme for erasure-coded storage system based on FPGA and hierarchical parallel decoding structure

Junqi Chen, Sijie Yang, Yong Wang, Miao Ye, Fan Lei

Cluster Computing (2024)

Abstract
Erasure coding is widely used in commodity datacenters to tolerate faults because it provides high storage space utilization and data reliability at the same time. However, when data loss occurs, the extra decoding and traffic overhead make it difficult to improve data repair efficiency, which limits the further application of erasure coding in hot-data storage systems. In this paper, we propose an FPGA-based data repair accelerating scheme (FPGA-ECDR) for erasure-coded storage systems that employ the Cauchy Reed-Solomon (CRS) code, to overcome this limitation. In FPGA-ECDR, multiple modules are designed to work collaboratively, enhancing the efficiency of data flow and ensuring the reliability of the data repair process. A CRS decoding algorithm based on the check matrix is used to reduce the complexity of matrix inversion in the decoding process, and the algorithm is accelerated in hardware on an FPGA. Moreover, we propose a Hierarchical Parallel Decoding Structure (HPDS) to optimize the timing of cache data reads and the XOR logic operations in the decoding process. HPDS effectively reduces the impact of Column Address Strobe (CAS) latency and improves repair efficiency. Finally, we conduct FPGA board-level verification of the proposed scheme, testing CRS codes with different data sizes and parameters. The experimental results show that, compared with the mainstream open-source erasure coding library Jerasure, the Intel ISA-L acceleration library, and the Xilinx RS code decoding acceleration scheme on the same FPGA platform, the proposed scheme has lower decoding latency, improves the data decoding rate by 3.2 to 148.5 times, and enhances the repair throughput by up to 21.4 times in a fault-tolerant storage system.
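To illustrate why decoding from the check matrix can shrink the matrix inversion, the following is a minimal software sketch, not the paper's FPGA design or its exact algorithm. It assumes a systematic Cauchy RS code over GF(2^8) with check matrix H = [C | I], and one byte per symbol; the function names (`encode`, `repair`, `cauchy`) and the choice of Cauchy points are hypothetical. Since H·c = 0 for every codeword c, the erased symbols satisfy a linear system whose unknowns are only the erased positions, so the elimination works on at most m columns rather than a k×k matrix.

```python
# GF(2^8) log/antilog tables, primitive polynomial 0x11d (an assumed choice).
EXP = [0] * 510
LOG = [0] * 256
_x = 1
for _i in range(255):
    EXP[_i] = _x
    LOG[_x] = _i
    _x <<= 1
    if _x & 0x100:
        _x ^= 0x11D
for _i in range(255, 510):
    EXP[_i] = EXP[_i - 255]

def gf_mul(a, b):
    """Multiply two GF(2^8) elements."""
    if a == 0 or b == 0:
        return 0
    return EXP[LOG[a] + LOG[b]]

def gf_inv(a):
    """Multiplicative inverse of a nonzero GF(2^8) element."""
    return EXP[255 - LOG[a]]

def cauchy(k, m):
    """m x k Cauchy matrix with C[i][j] = 1 / (x_i + y_j), x_i = i, y_j = m + j."""
    return [[gf_inv(i ^ (m + j)) for j in range(k)] for i in range(m)]

def check_matrix(k, m):
    """Check matrix H = [C | I_m]; H * codeword = 0 over GF(2^8)."""
    C = cauchy(k, m)
    return [C[i] + [1 if i == t else 0 for t in range(m)] for i in range(m)]

def encode(data, m):
    """Return the systematic codeword: k data symbols followed by m parities."""
    parity = []
    for row in cauchy(len(data), m):
        acc = 0
        for coef, d in zip(row, data):
            acc ^= gf_mul(coef, d)
        parity.append(acc)
    return list(data) + parity

def repair(codeword, erased, k, m):
    """Recover the symbols at the erased indices (at most m of them)."""
    H = check_matrix(k, m)
    erased = list(erased)
    rows = []
    for i in range(m):
        # Syndrome contribution of the surviving symbols for check row i.
        s = 0
        for j in range(k + m):
            if j not in erased:
                s ^= gf_mul(H[i][j], codeword[j])
        rows.append([H[i][e] for e in erased] + [s])
    # Gauss-Jordan elimination on an m x len(erased) system only.
    for c in range(len(erased)):
        p = next(r for r in range(c, m) if rows[r][c] != 0)
        rows[c], rows[p] = rows[p], rows[c]
        inv = gf_inv(rows[c][c])
        rows[c] = [gf_mul(inv, v) for v in rows[c]]
        for r in range(m):
            if r != c and rows[r][c] != 0:
                f = rows[r][c]
                rows[r] = [v ^ gf_mul(f, w) for v, w in zip(rows[r], rows[c])]
    out = list(codeword)
    for c, e in enumerate(erased):
        out[e] = rows[c][-1]
    return out

if __name__ == "__main__":
    cw = encode([0x12, 0x34, 0x56, 0x78], m=2)
    damaged = list(cw)
    damaged[1] = damaged[4] = None  # lose one data and one parity symbol
    print(repair(damaged, [1, 4], k=4, m=2) == cw)
```

In this sketch each symbol is a single byte and multiplication uses lookup tables; CRS implementations typically expand the field arithmetic into a binary matrix so that decoding reduces to XOR operations on data blocks, which is the kind of operation the abstract describes accelerating in hardware.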
Keywords
Data repair, Fault-tolerant storage, Erasure code, Hardware acceleration, FPGA