Discovering Domain Orders via Order Dependencies

2022 IEEE 38th International Conference on Data Engineering (ICDE)(2022)

引用 2|浏览10
暂无评分
摘要
Most real-world data come with explicitly defined domain orders; e.g., lexicographic for strings, numeric for integers, and chronological for time. Our goal is to discover implicit domain orders that we do not already know; for instance, that the order of months in the Chinese Lunar calendar is Corner $<$ Apricot $<$ Peach. To do so, we enhance data profiling methods by discovering implicit domain orders in data through order dependencies. We enumerate tractable special cases and show that the general case is NP-complete but can be effectively handled by a SAT solver. We also devise an interestingness measure to rank the discovered implicit domain orders. Based on an extensive suite of experiments with real-world data, we establish the efficacy of our algorithms.
更多
查看译文
关键词
explicitly defined domain orders,data profiling methods,order dependencies,discovered implicit domain orders,SAT solver,NP-complete problem,interestingness measure,Chinese Lunar calendar
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要