On Optimizing Nearest Neighbor Queries In High-Dimensional Data Spaces

ICDT '01: Proceedings of the 8th International Conference on Database Theory(2001)

引用 37|浏览73
暂无评分
摘要
Nearest-neighbor queries in high-dimensional space are of high importance in various applications, especially in content-based indexing of multimedia data. For an optimization of the query processing, accurate models for estimating the query processing costs are needed. In this paper, we propose anew cost model for nearest neighbor queries in high-dimensional space, which we apply to enhance the performance of high-dimensional index structures. The model is based on new insights into effects occurring in high-dimensional space and provides a closed formula for the processing costs of nearest neighbor queries depending on the dimensionality, the block size and the database size. From the wide range of possible applications of our model, we select two interesting samples: First, we use the model to prove the known linear complexity of the nearest neighbor search problem in high-dimensional space, and second, we provide a technique for optimizing the block size. For data of medium dimensionality, the optimized block size allows significant speed-ups of the query processing time when compared to traditional block sizes and to the linear scan.
更多
查看译文
关键词
high-dimensional space,block size,high-dimensional index structure,accurate model,database size,new cost model,optimized block size,processing cost,query processing,query processing cost,High-Dimensional Data Spaces,Optimizing Nearest Neighbor Queries
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要