News Updates Thursday 26th Dec 2024 :
  • Welcome to INPRESSCO, world's leading publishers, We have served more than 10000+ authors
  • Articles are invited in engineering, science, technology, management, industrial engg, biotechnology etc.
  • Paper submission is open. Submit online or at editor.ijcet@inpressco.com
  • Our journals are indexed in NAAS, University of Regensburg Germany, Google Scholar, Cross Ref etc.
  • DOI is given to all articles

Focused Web Crawler and its Approaches


Author : Jay Sampat, Anmol Jain and Dharmeshkumar Mistry

Pages : 3121-3124
Download PDF
Abstract

There has been a rapid growth of the world-wide web which has scaled beyond our imaginations. To surmount these challenges search engines are used. One of the most important type of crawler is Focused crawler which is used to index information according to a particular topic. To maximize the possibility of downloading relevant documents focused crawler makes a prediction of hyperlinks visiting priority which in turn helps to reduce downloading of irrelevant documents and drastically saves network resources and hardware. Instead of using keywords topics are specified by using commendable documents. One of the most important feature of this type of web crawler is collecting and indexing all accessible web credentials. This crawler mainly diagnosis its crawl boundary to search different URLs. In this paper we’ll illustrate a clear cut comparison between focused and standard web crawlers as well as various approaches of focused crawling like contextual and priority based crawling.

Keywords: web crawlers, focused crawlers, web pages, priority based, contextual based, indexing.

Article published in International Journal of Current Engineering and Technology, Vol.4, No.5 (Oct-2014)

 

 

Call for Papers
  1. IJCET- Current Issue
  2. Issues are published in Feb, April, June, Aug, Oct and Dec
  3. DOI is given to all articles
  • Inpressco Google Scholar
  • Inpressco Science Central
  • Inpressco Global impact factor
  • Inpressco aap

International Press corporation is licensed under a Creative Commons Attribution-Non Commercial NoDerivs 3.0 Unported License
©2010-2023 INPRESSCO® All Rights Reserved