Ignoring Irrelevant Pages in Weighted PageRank Algorithm using Text Content of the Target Page

Sunil Kumar, Niraj Singhal
2014 International Journal of Computer Applications  
The web is expanding day by day, and people generally rely on search engines to explore it. The web has created many challenges for information retrieval. The quality of the extracted information is one of the major issues to be addressed, and current information retrieval approaches need to be modified to meet such challenges. During query-based searching, search engines return a list of web documents containing both relevant and irrelevant pages, and sometimes rank irrelevant pages higher than relevant ones. This paper presents a novel approach to ignoring irrelevant pages in the Weighted PageRank algorithm using the text content of the target pages.
doi:10.5120/14806-3014 fatcat:w7qzzfuav5gfnotf2ut37kl4hy