Incorporation of Human Knowledge into Data Embeddings to Improve Pattern Significance and Interpretability.

Jie Li, Chun-Qi Zhou

IEEE Transactions on Visualization and Computer Graphics(2022)

引用 6|浏览59
暂无评分
摘要
Embedding is a common technique for analyzing multi-dimensional data. However, the embedding projection cannot always form significant and interpretable visual structures that foreshadow underlying data patterns. We propose an approach that incorporates human knowledge into data embeddings to improve pattern significance and interpretability. The core idea is (1) externalizing tacit human knowledge as explicit sample labels and (2) adding a classification loss in the embedding network to encode samples' classes. The approach pulls samples of the same class with similar data features closer in the projection, leading to more compact (significant) and class-consistent (interpretable) visual structures. We give an embedding network with a customized classification loss to implement the idea and integrate the network into a visualization system to form a workflow that supports flexible class creation and pattern exploration. Patterns found on open datasets in case studies, subjects' performance in a user study, and quantitative experiment results illustrate the general usability and effectiveness of the approach.
更多
查看译文
关键词
Tabular Data,Multi-dimensional Exploration,Embedding Projection,Explicit Knowledge Generation,Visual Analytics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要