Learning Similarity Scores By Using A Family Of Distance Functions In Multiple Feature Spaces

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE(2017)

引用 2|浏览18
暂无评分
摘要
There exist a large number of distance functions that allow one to measure similarity between feature vectors and thus can be used for ranking purposes. When multiple representations of the same object are available, distances in each representation space may be combined to produce a single similarity score. In this paper, we present a method to build such a similarity ranking out of a family of distance functions. Unlike other approaches that aim to select the best distance function for a particular context, we use several distances and combine them in a convenient way. To this end, we adopt a classical similarity learning approach and face the problem as a standard supervised machine learning task. As in most similarity learning settings, the training data are composed of a set of pairs of objects that have been labeled as similar/dissimilar. These are first used as an input to a transformation function that computes new feature vectors for each pair by using a family of distance functions in each of the available representation spaces. Then, this information is used to learn a classifier. The approach has been tested using three different repositories. Results show that the proposed method outperforms other alternative approaches in high-dimensional spaces and highlight the benefits of using multiple distances in each representation space.
更多
查看译文
关键词
Classification similarity learning, metric learning, distance combination, similarity ranking, multiple features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要