πŸ“ž +91-7667918914 | βœ‰οΈ ijarcce@gmail.com
International Journal of Advanced Research in Computer and Communication Engineering
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 5, ISSUE 4, APRIL 2016

Smart Crawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces

Melisa Vidiera, Janhavi V

πŸ‘ 38 viewsπŸ“₯ 0 downloads
Share: 𝕏 f in ✈ βœ‰

Abstract: Due to extensive usage of Internet, substantial amount of data has extended widely over web, which serve access to particular data or to fetch more relevant data. It would be challenging to the search engine to provide quick results that is most relevant to the users. To search the relevant data and to reduce amount of time in fetching data, here propose the οΏ½Smart CrawlerοΏ½. This returns most relevant data from the popular and most specific websites. It uses multiple search engines that processes the query provided by the user, cluster the results collected in a single platform and performs two stage crawling on data and URLs. In which in-site map generation is done to obtain relevant site with techniques such as reverse searching and page ranking.



Keywords: Deep Web, two stage crawler, ranking, in-site exploring, adaptive learning.

How to Cite:

[1] Melisa Vidiera, Janhavi V, β€œSmart Crawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.5445

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.