Accelerating LSH-based Distributed Search with In-network Computation

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021)(2021)

引用 6|浏览17
暂无评分
摘要
Locality Sensitive Hashing (LSH) is widely adopted to index similar data in high-dimensional space for approximate nearest neighbor search. With the rapid increase of datasets, recent interests in LSH have moved to the implementation of distributed search systems with low response time and high throughput. However, as the scale of the concurrent queries and the volume of available data grow, large amounts of index messages still need to be transmitted to centralized servers for the candidate answer reducing and resorting. Hence, the network remains the bottleneck in distributed search systems. To address this gap, we turn our efforts to the network itself and propose NetSHa. NetSHa exploits the in-network computational capacity provided by programmable switches. Specially, NetSHa designs a sort-reduce approach to drop the potential poor candidate answers and aggregates the good candidate answers on programmable switches, while preserving the search quality. We implement NetSHa on Barefoot Tofino switches and evaluate it using 3 datasets (i.e., Random, Wiki and Image). The experimental results show that NetSHa reduces the packet volume by 10 times at most and improves the search efficiency by 3 x at least, in comparison with typical LSH-based distributed search frameworks.
更多
查看译文
关键词
LSH-based distributed search,locality sensitive hashing,high-dimensional space,approximate nearest neighbor search,distributed search systems,low response time,index messages,candidate answer,NetSHa,in-network computational capacity,programmable switches,sort-reduce approach,search quality,search frameworks,similar data indexing,concurrent queries
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要