📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 4, ISSUE 9, SEPTEMBER 2015

Comparative Analysis of Robot Detection Techniques on Web Server Log

Mitali Srivastava, Atul Kumar Srivastava, Rakhi Garg, P. K. Mishra

DOI: 10.17148/IJARCCE.2015.4941

Abstract: Web robots are software programs which automatically traverse through hyperlink structure of Web to retrieve Web resources. Robots can be used for variety of tasks such as crawling and indexing information for search engines, offline browsing, shopping comparison and email collectors. Apart from that robots can also be used for some malicious purposes like sending spam mails, stealing business intelligence etc. It is necessary to detect robots due to privacy, security and performance of server related issues. Several well-known techniques to detect robots are : robots.txt check, known robot�s IP address, User agent mapping, keywords matching in User agent field, browsing speed, unassigned referrer etc. In this paper we have discussed as well as implemented various robot identification techniques on real server log data and compared their performance for a given dataset.



Keywords: Robot detection, Web server log, Web usage mining, Data extraction.

How to Cite:

[1] Mitali Srivastava, Atul Kumar Srivastava, Rakhi Garg, P. K. Mishra, “Comparative Analysis of Robot Detection Techniques on Web Server Log,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2015.4941