Enhancing Focused Crawling with Genetic Algorithms

Information Technology: Coding and Computing, 2005. ITCC 2005. International Conference(2005)

引用 33|浏览0
暂无评分
摘要
Web crawlers are one of the most crucial components in search engines and their optimization would have a great effect on improving the searching efficiency. In this paper, we introduce an intelligent crawler called Gcrawler that uses a genetic algorithm for improving its crawling performance. Gcrawler estimates the best path for crawling on one hand and expands its initial keywords by using a genetic algorithm during the crawling on the other hand. This is the first crawler that acts intelligently without any relevance feedback or training. All the processes are online and there is no need for direct interaction with the users.
更多
查看译文
关键词
genetic algorithms,crawling performance,focused crawling,best path,genetic algorithm,initial keyword,great effect,direct interaction,web crawler,relevance feedback,crucial component,intelligent crawler,internet,search engine,computer science,bandwidth,search engines,databases,feedback
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要