web usage
Recently Published Documents


TOTAL DOCUMENTS

846
(FIVE YEARS 78)

H-INDEX

34
(FIVE YEARS 2)

Author(s):  
Sowmya HK ◽  
R. J. Anandhi

The WWW has a big number of pages and URLs that supply the user with a great amount of content. In an intensifying epoch of information, analysing users browsing behaviour is a significant affair. Web usage mining techniques are applied to the web server log to analyse the user behaviour. Identification of user sessions is one of the key and demanding tasks in the pre-processing stage of web usage mining. This paper emphasizes on two important fallouts with the approaches used in the existing session identification methods such as Time based and Referrer based sessionization. The first is dealing with comparing of current request’s referrer field with the URL of previous request. The second is dealing with session creation, new sessions are created or comes in to one session due to threshold value of page stay time and session time. So, authors developed enhanced semantic distance based session identification algorithm that tackles above mentioned issues of traditional session identification methods. The enhanced semantic based method has an accuracy of 84 percent, which is higher than the Time based and Time-Referrer based session identification approaches. The authors also used adapted K-Means and Hierarchical Agglomerative clustering algorithms to improve the prediction of user browsing patterns. Clusters were found using a weighted dissimilarity matrix, which is calculated using two key parameters: page weight and session weight. The Dunn Index and Davies-Bouldin Index are then used to evaluate the clusters. Experimental results shows that more pure and accurate session clusters are formed when adapted clustering algorithms are applied on the weighted sessions rather than the session obtained from traditional sessionization algorithms. Accuracy of the semantic session cluster is higher compared with the cluster of sessions obtained using traditional sessionization.


2021 ◽  
Vol 13 (2) ◽  
pp. 133-153
Author(s):  
Jonas Gamalielsson ◽  
Björn Lundell ◽  
Simon Butler ◽  
Christoffer Brax ◽  
Tomas Persson ◽  
...  

Web analytics technologies provide opportunities for organisations to obtain information about users visiting their websites in order to understand and optimise web usage. Use of such technologies often leads to issues related to data privacy and potential lock-in to specific suppliers and proprietary technologies. Use of open source software (OSS) for web analytics can create conditions for avoiding issues related to data privacy and lock-in, and thereby provides opportunities for a long-term sustainable solution for organisations both in the public and private sectors. The paper characterises use of and engagement with OSS projects for web analytics. Specifically, we contribute a characterisation of use of OSS licensed web analytics technologies in Swedish government authorities, and a characterisation of organisational engagement with the Matomo OSS project for web analytics.


2021 ◽  
Vol 13 (9) ◽  
pp. 233
Author(s):  
Zhou-Yi Lim ◽  
Lee-Yeng Ong ◽  
Meng-Chew Leow

Online roadshow is a relatively new concept that has higher flexibility and scalability compared to the physical roadshow. This is because online roadshow is accessible through digital devices anywhere and anytime. In a physical roadshow, organizations can measure the effectiveness of the roadshow by interacting with the customers. However, organizations cannot monitor the effectiveness of the online roadshow by using the same method. A good user experience is important to increase the advertising effects on the online roadshow website. In web usage mining, clustering can discover user access patterns from the weblog. By applying a clustering technique, the online roadshow website can be further improved to provide a better user experience. This paper presents a review of clustering techniques used in web usage mining, namely the partition-based, hierarchical, density-based, and fuzzy clustering techniques. These clustering techniques are analyzed from three perspectives: their similarity measures, the evaluation metrics used to determine the optimality of the clusters, and the functional purpose of applying the techniques to improve the user experience of the website. By applying clustering techniques in different stages of the user activities in the online roadshow website, the advertising effectiveness of the website can be enhanced in terms of its affordance, flow, and interactivity.


Author(s):  
Arpad Gellert

This paper presents and evaluates a two-level web usage prediction technique, consisting of a neural network in the first level and contextual component predictors in the second level. We used Markov chains of different orders as contextual predictors to anticipate the next web access based on specific web access history. The role of the neural network is to decide, based on previous behaviour, whose predictor’s output to use. The predicted web resources are then prefetched into the cache of the browser. In this way, we considerably increase the hit rate of the web browser, which shortens the load times. We have determined the optimal configuration of the proposed hybrid predictor on a real dataset and compared it with other existing web prefetching techniques in terms of prediction accuracy. The best configuration of the proposed neural hybrid method provides an average web access prediction accuracy of 86.95%.


2021 ◽  
pp. 115503
Author(s):  
Michal Munk ◽  
Anna Pilkova ◽  
Lubomir Benko ◽  
Petra Blazekova ◽  
Peter Svec

2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Mira Orisa ◽  
Michael Ardita

Algoritma K-means merupakan salah satu algoritma yang digunakan untuk metode clustering dalam data mining. Algoritma ini hanya bisa digunakan untuk mengolah data bertipe numerik menjadi pengetahuan. Metode ini cocok digunakan untuk mengolah data log access file server web untuk bidang web usage mining. Dari sekian banyak data di log access pengunjung dapat diambil pengetahuannya setelah diolah oleh algoritma K-mean. Penelitian ini dilakukan untuk mengetahui kluster dari waktu yang digunakan oleh pengguna untuk mengakses website pada sebuah instansti. Setelah melakukan try and error dalam menetapkan jumlah k dan nilai centroid awal,maka diperoleh 4 kluster. Dengan penggunaan distance measure yaitu squared Euclidean distance. Dengan average cluster distance sama dengan 207,286. Nilai Davies boudin index untuk klaster k sama dengan 4 adalah 0,076.


Author(s):  
Anna Pilkova ◽  
Michal Munk ◽  
Petra Blazekova ◽  
Lubomir Benko

Author(s):  
V Aruna, Et. al.

In the recent years with the advancement in technology, a  lot of information is available in different formats and extracting the  knowledge from that data has become a very difficult task. Due to the vast amount of information available on the web, users are finding it difficult to extract relevant information or create new knowledge using information available on the web. To solve this problem  Web mining techniques are used to discover the interesting patterns from the hidden data .Web Usage Mining (WUM), which is one  of the subset of  Web Mining helps in extracting the hidden knowledge present in the Web log  files , in recognizing various interests of web users and also in  discovering customer behaviours. Web Usage mining  includes different phases of data mining techniques called Data Pre-processing, Pattern Discovery & Pattern Analysis. This paper presents an updated focused survey on various sequential pattern mining  algorithms  like  apriori-based algorithm , Breadth First Search-based strategy, Depth First Search strategy,  sequential closed-pattern algorithm and Incremental pattern mining algorithm which are used in Pattern Discovery Phase of WUM. At last , a comparison  is done based on the important key features present in these algorithms. This study gives us better understanding of the approaches of sequential pattern mining.


Author(s):  
Snehal Rathi ◽  
Yogesh Deshpande ◽  
Shashidhar Nagaral ◽  
Ankita Narkhede ◽  
Radhika Sajwani ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document