Xml Document Clustering: An Algorithmic Perspective

XML DATA MINING: MODELS, METHODS, AND APPLICATIONS(2012)

引用 0|浏览0
暂无评分
摘要
The wide use of XML as the de facto standard of storing and exchanging information through Internet has led a wide spectrum of heterogeneous applications to adopt XML as their information representation model. The heterogeneity of XML data sources has brought up the problem of efficiently clustering a set of XML documents. However, traditional clustering algorithms cannot be applied due to the semistructured nature of XML, which contains both structure and content features. Hence, special techniques should be used that would take into account the XML semantics in order to address the problem of XML clustering. The described approaches, based on either the structure or the content or both, manage to successfully address the problem and can be applied efficiently in real-world applications.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要