Does the Geometry of Word Embeddings Help Document Classification? A Case Study on Persistent Homology-Based Representations
Rep4NLP@ACL, pp. 235-240, 2017.
We investigate the pertinence of methods from algebraic topology for text data analysis. These methods enable the development of mathematically-principled isometric-invariant mappings from a set of vectors to a document embedding, which is stable with respect to the geometry of the document in the selected metric space. In this work, we e...更多
下载 PDF 全文