Lx-Dsemvectors: Distributional Semantics Models For Portuguese

COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE (PROPOR 2016)(2016)

引用 16|浏览67
暂无评分
摘要
In this article we describe the creation and distribution of the first publicly available word embeddings for Portuguese. Our embeddings are evaluated on their own and also compared with the original English models on a well-known analogy task. We gathered a large Portuguese corpus of 1.7 billion tokens, developed the first distributional semantic analogies test set for Portuguese, and proceeded with the first parametrization and evaluation of Portuguese word embeddings models.
更多
查看译文
关键词
Distributional semantics, Word embeddings, Portuguese
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要