首页 | 本学科首页   官方微博 | 高级检索  
     检索      


DistanceRank: An intelligent ranking algorithm for web pages
Authors:Ali Mohammad Zareh Bidoki  Nasser Yazdani
Institution:Department of Electrical and Computer Engineering, University of Tehran, Tehran, Iran
Abstract:A fast and efficient page ranking mechanism for web crawling and retrieval remains as a challenging issue. Recently, several link based ranking algorithms like PageRank, HITS and OPIC have been proposed. In this paper, we propose a novel recursive method based on reinforcement learning which considers distance between pages as punishment, called “DistanceRank” to compute ranks of web pages. The distance is defined as the number of “average clicks” between two pages. The objective is to minimize punishment or distance so that a page with less distance to have a higher rank. Experimental results indicate that DistanceRank outperforms other ranking algorithms in page ranking and crawling scheduling. Furthermore, the complexity of DistanceRank is low. We have used University of California at Berkeley’s web for our experiments.
Keywords:Web ranking  Crawling  Web graph  Reinforcement learning
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号