Automatic User Domain Classification Based on Support Vector Machine (SVM)
Finding domain of a research paper and a researcher is a crucial task and would be highly appreciable in order to provide personalized search results to the user. An automatic user domain classification technique based on SVM has been proposed in this paper in order to determine the domain of a user based on her publications. In this technique, for a given user, his specific area of domain is determined by classifying the keywords from his publication works. It consists of two phases: keyword extraction and domain classification. In keyword extraction phase, the list of publications corresponding to a user mail id is retrieved by using publish or perish tool. From each of the published papers, the keywords are extracted. In domain classification, SVM classifier is applied to determine the domain of the user. This is performed by training standard keywords from each domain into the SVM classifier. If a user belongs to more than one domain, then the primary domain with more publications will be considered.