Abstract: For a broad-topic and ambiguous query, different users may have different search goals when they submit it to a search engine. The inference and analysis of user search goals can be very useful in improving search engine relevance and user experience. The project proposes a novel approach to infer user search goals by analyzing search engine query logs. First, it proposes a framework to discover different user search goals for a query by clustering the proposed feedback sessions. The feedback session is defined as the series of both clicked and unclicked URLs and ends with the last URL that was clicked in a session from user click-through logs. Second, the pseudo-documents are produced to better represent the feedback sessions for clustering.

The pseudo-documents are clustered using Fuzzy C Means, the fuzzy similarity based self- constructing algorithm. A novel optimization method is used to map feedback sessions to pseudo-documents which can efficiently reflect user information needs and finally, a new criterion   “Classified Average Precision (CAP)” is used to evaluate the performance of inferring user search goals. Experimental results are presented using user click-through logs from a commercial search engine to validate the effectiveness.

 

Keywords: User Search Goal, Feedback Session, Fuzzy C Means Algorithm