International Journal of Advanced Research in Computer and Communication Engineering
A monthly peer-reviewed and refereed journal
ISSN Online 2278-1021 | ISSN Print 2319-5940 | Since 2012
IJARCCE adheres to the parameters suggested by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
Volume 5, Issue 4, April 2016

Smart Crawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces

Melisa Vidiera, Janhavi V

DOI: 10.17148/IJARCCE.2016.5445

Abstract: With the extensive use of the Internet, a substantial amount of data has spread across the web, where users look up particular data or try to fetch the most relevant data. It is challenging for a search engine to return results quickly while keeping them relevant to the user. To find relevant data and reduce the time spent fetching it, we propose the "Smart Crawler", which returns the most relevant data from popular and highly specific websites. It uses multiple search engines to process the user's query, clusters the collected results on a single platform, and performs two-stage crawling on data and URLs. In-site map generation is performed to obtain relevant sites, using techniques such as reverse searching and page ranking.



Keywords: Deep Web, two stage crawler, ranking, in-site exploring, adaptive learning.
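
The two-stage flow outlined in the abstract (first rank candidate sites, then explore inside each selected site) can be illustrated with a minimal sketch. This is not the authors' implementation: the in-memory `TOY_WEB`, the term-overlap `relevance` score, and the function names `rank_sites`, `explore_site`, and `smart_crawl` are all hypothetical, and multi-engine querying, result clustering, and reverse searching are left out.

```python
from collections import deque

# Hypothetical toy web (assumption): site -> {path: (page text, outgoing in-site links)}
TOY_WEB = {
    "books.example": {
        "/": ("home books catalog", ["/search", "/about"]),
        "/search": ("search books form query", []),
        "/about": ("about us", []),
    },
    "news.example": {
        "/": ("daily news headlines", ["/archive"]),
        "/archive": ("news archive", []),
    },
}


def relevance(text, query_terms):
    """Fraction of query terms found in the text (a stand-in for a real ranker)."""
    words = set(text.split())
    return sum(term in words for term in query_terms) / len(query_terms)


def rank_sites(web, query_terms):
    """Stage 1: keep sites whose home page matches the query, best match first."""
    scored = [(relevance(pages["/"][0], query_terms), site) for site, pages in web.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [site for score, site in scored if score > 0]


def explore_site(pages, query_terms, max_pages=10):
    """Stage 2: breadth-first in-site exploration that collects relevant form pages."""
    seen = {"/"}
    queue = deque(["/"])
    hits = []
    visited = 0
    while queue and visited < max_pages:
        path = queue.popleft()
        visited += 1
        text, links = pages[path]
        # A page counts as a searchable interface if it mentions a form (toy heuristic).
        if "form" in text.split() and relevance(text, query_terms) > 0:
            hits.append(path)
        for link in links:
            if link in pages and link not in seen:
                seen.add(link)
                queue.append(link)
    return hits


def smart_crawl(web, query):
    """Run both stages and map each selected site to its relevant form pages."""
    terms = query.lower().split()
    return {site: explore_site(web[site], terms) for site in rank_sites(web, terms)}
```

Calling `smart_crawl(TOY_WEB, "books search")` selects only `books.example` in the first stage and then walks its pages in the second. A real crawler would replace the toy relevance score with a learned ranker and fetch pages over the network.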

How to Cite:

[1] Melisa Vidiera, Janhavi V, “Smart Crawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.5445