Ontology-based HTML to XML conversion

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management(2005)

引用 1|浏览0
暂无评分
摘要
Current wrapper approaches break down in extracting data from differently structured and frequently changing Web pages. To tackle this challenge, this paper defines domain-specific ontology, captures the semantic hierarchy in Web pages automatically by exploiting both structural information and common formatting information, and recognizes and extracts data by using ontology-based semantic matching without relying on page-specific formatting. It is adaptive to differently structured and frequently changing Web pages for a domain of interest.
更多
查看译文
关键词
xml conversion,page-specific formatting,structural information,domain-specific ontology,common formatting information,ontology-based html,current wrapper approach,ontology-based semantic,semantic hierarchy,web page,extracts data,web pages
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要