A General Approach for Partitioning Web Page Content Based on Geometric and Style Information

ICDAR-1(2007)

引用 35|浏览9
暂无评分
摘要
In this paper, we describe a general-purpose approach for partitioning Web page content. The novelty of our ap- proach lies in the use of detailed layout information from a Web page renderer to determine spatial locality and identify visual separators, and the use of relaxed matching over pre- sentation style information to determine presentation style similarity. We present several examples to illustrate the gen- erality of our approach.
更多
查看译文
关键词
web pages,internet
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要