Semantic Correlation Graph Embedding

Weiwei Wang, Yuchen Han,Stefano Bromuri,Michel Dumontier

2022 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)(2022)

引用 0|浏览11
暂无评分
摘要
Many data sets include categorical features in the form of nominal and ordinal features. However, most machine learning algorithms cannot deal with categorical features directly because they require numerical input features. Categorical embeddings are an effective approach to converting categorical features into numerical vectors. This work proposes a novel embedding approach, called Semantic Correlation Graph Embedding, to create embeddings from knowledge graphs. The approach constructs a semantic correlation graph of triplets among the categorical features to learn numerical embeddings. Our approach aims to uncover relationships taking place in categorical data in terms of low-level knowledge and semantics that may help group the features of the data sets under semantic entities. Three distinct embedding models are proposed according to how the graph is constructed. The results are evaluated with two public data sets. They show that the learned embeddings produce a statistically significant improvement in the performance of the classification tasks in terms of AUC, F1 score, precision, and recall.
更多
查看译文
关键词
Categorical data,Logistic regression,Knowledge graph,Graph embedding,TransE
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要