Oscar: A Semantic-based Data Binning Approach

2022 IEEE Visualization and Visual Analytics (VIS), 2022

Abstract
Binning is applied to categorize data values or to see distributions of data. Existing binning algorithms often rely on statistical properties of the data. However, there are also semantic considerations for selecting appropriate binning schemes. Surveys, for instance, gather respondent data for demographic questions such as age, salary, and number of employees, and these values are bucketed into defined semantic categories. In this paper, we leverage common semantic categories from survey data and Tableau Public visualizations to identify a set of semantic binning categories. We employ these semantic binning categories in Oscar: a method for automatically selecting bins based on the inferred semantic type of the field. We conducted a crowdsourced study with 120 participants to better understand user preferences for bins generated by Oscar vs. binning provided in Tableau. We find that users prefer maps and histograms using binned values generated by Oscar over binning schemes based purely on the statistical properties of the data.
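The contrast between statistical and semantic binning can be illustrated with a minimal sketch. This is not Oscar's implementation: the function names, the name-based type inference, and the age bin edges below are all hypothetical, chosen only to show bin edges being looked up from an inferred semantic type, with equal-width binning as the fallback.

```python
import re
from bisect import bisect_right

# Hypothetical semantic bin edges keyed by inferred field type.
# These age brackets are illustrative only, not taken from the paper.
SEMANTIC_EDGES = {
    "age": [18, 25, 35, 45, 55, 65],
}


def infer_semantic_type(field_name):
    """Toy inference: match whole tokens of the field name against known types."""
    tokens = re.split(r"[^a-z]+", field_name.lower())
    for semantic_type in SEMANTIC_EDGES:
        if semantic_type in tokens:
            return semantic_type
    return None


def equal_width_edges(values, k):
    """Purely statistical fallback: k equal-width bins over the data range."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / k
    return [lo + i * step for i in range(1, k)]


def choose_edges(field_name, values, k=5):
    """Prefer semantic edges when the field type is recognized."""
    semantic_type = infer_semantic_type(field_name)
    if semantic_type is not None:
        return SEMANTIC_EDGES[semantic_type]
    return equal_width_edges(values, k)


def bin_counts(values, edges):
    """Count values per bin; bin i covers [edges[i-1], edges[i])."""
    counts = [0] * (len(edges) + 1)
    for v in values:
        counts[bisect_right(edges, v)] += 1
    return counts


ages = [17, 18, 24, 25, 40, 64, 65, 80]
age_edges = choose_edges("respondent_age", ages)
age_counts = bin_counts(ages, age_edges)
score_edges = choose_edges("score", [0, 10], k=5)
```

Here a field named "respondent_age" is given the bracket edges from the lookup table, while an unrecognized field such as "score" falls back to equal-width edges computed from its range.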
Keywords
Data-driven semantics, binning, constraints, geospatial, Human-centered computing, Visualization