scholarly journals Agglomerative Clustering in Web Usage Mining: A Survey

2014 ◽  
Vol 89 (8) ◽  
pp. 24-27 ◽  
Author(s):  
Karuna Katariya ◽  
Rajanikanth Aluvalu
Author(s):  
Sowmya HK ◽  
R. J. Anandhi

The WWW has a big number of pages and URLs that supply the user with a great amount of content. In an intensifying epoch of information, analysing users browsing behaviour is a significant affair. Web usage mining techniques are applied to the web server log to analyse the user behaviour. Identification of user sessions is one of the key and demanding tasks in the pre-processing stage of web usage mining. This paper emphasizes on two important fallouts with the approaches used in the existing session identification methods such as Time based and Referrer based sessionization. The first is dealing with comparing of current request’s referrer field with the URL of previous request. The second is dealing with session creation, new sessions are created or comes in to one session due to threshold value of page stay time and session time. So, authors developed enhanced semantic distance based session identification algorithm that tackles above mentioned issues of traditional session identification methods. The enhanced semantic based method has an accuracy of 84 percent, which is higher than the Time based and Time-Referrer based session identification approaches. The authors also used adapted K-Means and Hierarchical Agglomerative clustering algorithms to improve the prediction of user browsing patterns. Clusters were found using a weighted dissimilarity matrix, which is calculated using two key parameters: page weight and session weight. The Dunn Index and Davies-Bouldin Index are then used to evaluate the clusters. Experimental results shows that more pure and accurate session clusters are formed when adapted clustering algorithms are applied on the weighted sessions rather than the session obtained from traditional sessionization algorithms. Accuracy of the semantic session cluster is higher compared with the cluster of sessions obtained using traditional sessionization.


2012 ◽  
Vol 3 (4) ◽  
pp. 92-94
Author(s):  
SUJATHA PADMAKUMAR ◽  
◽  
Dr.PUNITHAVALLI Dr.PUNITHAVALLI ◽  
Dr.RANJITH Dr.RANJITH

Author(s):  
Snehal Rathi ◽  
Yogesh Deshpande ◽  
Shashidhar Nagaral ◽  
Ankita Narkhede ◽  
Radhika Sajwani ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document