web usage mining Latest Research Papers

Semantic Based Weighted Web Session Clustering Using Adapted K-Means and Hierarchical Agglomerative Algorithms

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2125 ◽

2022 ◽

Author(s):

Sowmya HK ◽

R. J. Anandhi

Keyword(s):

Clustering Algorithms ◽

Threshold Value ◽

Semantic Distance ◽

Web Usage Mining ◽

Identification Algorithm ◽

Agglomerative Clustering ◽

Dissimilarity Matrix ◽

Identification Methods ◽

Web Usage ◽

Stay Time

The WWW has a big number of pages and URLs that supply the user with a great amount of content. In an intensifying epoch of information, analysing users browsing behaviour is a significant affair. Web usage mining techniques are applied to the web server log to analyse the user behaviour. Identification of user sessions is one of the key and demanding tasks in the pre-processing stage of web usage mining. This paper emphasizes on two important fallouts with the approaches used in the existing session identification methods such as Time based and Referrer based sessionization. The first is dealing with comparing of current request’s referrer field with the URL of previous request. The second is dealing with session creation, new sessions are created or comes in to one session due to threshold value of page stay time and session time. So, authors developed enhanced semantic distance based session identification algorithm that tackles above mentioned issues of traditional session identification methods. The enhanced semantic based method has an accuracy of 84 percent, which is higher than the Time based and Time-Referrer based session identification approaches. The authors also used adapted K-Means and Hierarchical Agglomerative clustering algorithms to improve the prediction of user browsing patterns. Clusters were found using a weighted dissimilarity matrix, which is calculated using two key parameters: page weight and session weight. The Dunn Index and Davies-Bouldin Index are then used to evaluate the clusters. Experimental results shows that more pure and accurate session clusters are formed when adapted clustering algorithms are applied on the weighted sessions rather than the session obtained from traditional sessionization algorithms. Accuracy of the semantic session cluster is higher compared with the cluster of sessions obtained using traditional sessionization.

Download Full-text

A Review on Clustering Techniques: Creating Better User Experience for Online Roadshow

Future Internet ◽

10.3390/fi13090233 ◽

2021 ◽

Vol 13 (9) ◽

pp. 233

Author(s):

Zhou-Yi Lim ◽

Lee-Yeng Ong ◽

Meng-Chew Leow

Keyword(s):

User Experience ◽

Similarity Measures ◽

Web Usage Mining ◽

Advertising Effectiveness ◽

Clustering Techniques ◽

Digital Devices ◽

Web Usage ◽

User Access ◽

Access Patterns ◽

Functional Purpose

Online roadshow is a relatively new concept that has higher flexibility and scalability compared to the physical roadshow. This is because online roadshow is accessible through digital devices anywhere and anytime. In a physical roadshow, organizations can measure the effectiveness of the roadshow by interacting with the customers. However, organizations cannot monitor the effectiveness of the online roadshow by using the same method. A good user experience is important to increase the advertising effects on the online roadshow website. In web usage mining, clustering can discover user access patterns from the weblog. By applying a clustering technique, the online roadshow website can be further improved to provide a better user experience. This paper presents a review of clustering techniques used in web usage mining, namely the partition-based, hierarchical, density-based, and fuzzy clustering techniques. These clustering techniques are analyzed from three perspectives: their similarity measures, the evaluation metrics used to determine the optimality of the clusters, and the functional purpose of applying the techniques to improve the user experience of the website. By applying clustering techniques in different stages of the user activities in the online roadshow website, the advertising effectiveness of the website can be enhanced in terms of its affordance, flow, and interactivity.

Download Full-text

Web Usage Mining Menggunakan Algoritma Clastering K-Mean

Jurnal Teknologi Informasi dan Terapan ◽

10.25047/jtit.v8i1.179 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Mira Orisa ◽

Michael Ardita

Keyword(s):

Data Mining ◽

Euclidean Distance ◽

Distance Measure ◽

Web Usage Mining ◽

File Server ◽

Web Usage ◽

Cluster Distance ◽

Data Log ◽

Average Cluster ◽

Squared Euclidean Distance

Algoritma K-means merupakan salah satu algoritma yang digunakan untuk metode clustering dalam data mining. Algoritma ini hanya bisa digunakan untuk mengolah data bertipe numerik menjadi pengetahuan. Metode ini cocok digunakan untuk mengolah data log access file server web untuk bidang web usage mining. Dari sekian banyak data di log access pengunjung dapat diambil pengetahuannya setelah diolah oleh algoritma K-mean. Penelitian ini dilakukan untuk mengetahui kluster dari waktu yang digunakan oleh pengguna untuk mengakses website pada sebuah instansti. Setelah melakukan try and error dalam menetapkan jumlah k dan nilai centroid awal,maka diperoleh 4 kluster. Dengan penggunaan distance measure yaitu squared Euclidean distance. Dengan average cluster distance sama dengan 207,286. Nilai Davies boudin index untuk klaster k sama dengan 4 adalah 0,076.

Download Full-text

Performance Evaluation of the MapReduce-based Parallel Data Preprocessing Algorithm in Web Usage Mining with Robot Detection Approaches

IETE Technical Review ◽

10.1080/02564602.2021.1918584 ◽

2021 ◽

pp. 1-15

Author(s):

Mitali Srivastava ◽

Atul Kumar Srivastava ◽

Rakhi Garg ◽

P. K. Mishra

Keyword(s):

Performance Evaluation ◽

Data Preprocessing ◽

Web Usage Mining ◽

Web Usage ◽

Parallel Data

Download Full-text

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i2.1448 ◽

2021 ◽

Vol 12 (2) ◽

pp. 1634-1639

Author(s):

V Aruna, Et. al.

Keyword(s):

Web Mining ◽

Pattern Mining ◽

Pattern Discovery ◽

Relevant Information ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Web Usage ◽

Hidden Data ◽

The Web

In the recent years with the advancement in technology, a lot of information is available in different formats and extracting the knowledge from that data has become a very difficult task. Due to the vast amount of information available on the web, users are finding it difficult to extract relevant information or create new knowledge using information available on the web. To solve this problem Web mining techniques are used to discover the interesting patterns from the hidden data .Web Usage Mining (WUM), which is one of the subset of Web Mining helps in extracting the hidden knowledge present in the Web log files , in recognizing various interests of web users and also in discovering customer behaviours. Web Usage mining includes different phases of data mining techniques called Data Pre-processing, Pattern Discovery & Pattern Analysis. This paper presents an updated focused survey on various sequential pattern mining algorithms like apriori-based algorithm , Breadth First Search-based strategy, Depth First Search strategy, sequential closed-pattern algorithm and Incremental pattern mining algorithm which are used in Pattern Discovery Phase of WUM. At last , a comparison is done based on the important key features present in these algorithms. This study gives us better understanding of the approaches of sequential pattern mining.

Download Full-text

Analysis of User’s Learning Styles and Academic Emotions through Web Usage Mining

2021 International Conference on Emerging Smart Computing and Informatics (ESCI) ◽

10.1109/esci50559.2021.9397037 ◽

2021 ◽

Author(s):

Snehal Rathi ◽

Yogesh Deshpande ◽

Shashidhar Nagaral ◽

Ankita Narkhede ◽

Radhika Sajwani ◽

...

Keyword(s):

Learning Styles ◽

Web Usage Mining ◽

Academic Emotions ◽

Web Usage

Download Full-text

Ekstraksi Click Stream Data Web E-Commerce Menggunakan Web Usage Mining

Jurnal Informatika Polinema ◽

10.33795/jip.v7i2.538 ◽

2021 ◽

Vol 7 (2) ◽

pp. 65-72

Author(s):

Kartina Diah Kusuma Wardani

Keyword(s):

World Wide Web ◽

Association Rules ◽

World Wide ◽

Web Usage Mining ◽

Frequent Itemset ◽

Sequence Mining ◽

Stream Data ◽

Log Data ◽

Web Usage

E-Commerce berkembang pesat dalam world wide web hingga menghasilkan berbagai jenis data yang dapat dianalisa lebih lanjut untuk berbagai keperluan seperti personifikasi web, profiling customer, dan sebagainya. Salah satu jenis data yang dihasilkan e-Commerce adalah click stream data web yang merekam aktivitas visitor web dalam bentuk log data selama berinteraksi pada laman web. Penelitian ini mengekstraksi click stream data web e-commerce untuk mendapatkan pola interaksi konsumen terhadap halaman web selama mengunjungi web e-commerce. Berdasarkan jenis data yang diekstrak maka web usage mining digunakan untuk ekstraksi pola dari click stream data yang berbentuk log data. Teknik mining yang dianalisa terhadap log data e-commerce pada penelitian ini terdiri dari frequent itemset, asociation rules, dan frequence sequence mining. Frequent itemset menghasilkan halaman web yang paling sering diakses oleh visitor. Association rules menghasilkan pola kemungkinan halaman web yang akan diakses visitor jika visitor mengakses halaman-halamn tertentu. Frequence sequence mining mendapatkan pola urutan halaman web yang paling sering diakses oleh visitor web e-commerce saat berinteraksi pada laman web. Pola urutan halaman yang diakses visitor menunjukkan urutan kebiasaan visitor mengunjungi e-commerce. Sedangkan teknik mining yang diimplementasikan untuk menghasilkan pola akses visitor pada penelitian ini adalah Frequence sequence mining. Hasil ekstraksi dari penelitian ini menunjukkan ada enam halaman web yang paling sering diakses oleh konsumen dengan berbagai pola urutan aksesnya.

Download Full-text

Web usage Mining: A Comparison of WUM Category Web Mining Algorithms

2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV) ◽

10.1109/icicv50876.2021.9388539 ◽

2021 ◽

Author(s):

Shivani Yadao ◽

A. Vinaya Babu ◽

Midhunchakkaravarthy Janarthanan ◽

Amiya Bhaumik

Keyword(s):

Web Mining ◽

Web Usage Mining ◽

Web Usage ◽

Mining Algorithms

Download Full-text

Web Usage Mining Issues in Big Data

Impacts and Challenges of Cloud Business Intelligence - Advances in Systems Analysis, Software Engineering, and High Performance Computing ◽

10.4018/978-1-7998-5040-3.ch007 ◽

2021 ◽

pp. 102-112

Author(s):

Sunny Sharma ◽

Manisha Malhotra

Keyword(s):

Data Mining ◽

Big Data ◽

User Behavior ◽

Web Usage Mining ◽

Web Personalization ◽

Data Mining Techniques ◽

Meaningful Information ◽

Web Usage ◽

Use Of Data ◽

The Web

Web usage mining is the use of data mining techniques to analyze user behavior in order to better serve the needs of the user. This process of personalization uses a set of techniques and methods for discovering the linking structure of information on the web. The goal of web personalization is to improve the user experience by mining the meaningful information and presented the retrieved information in a way the user intends. The arrival of big data instigated novel issues to the personalization community. This chapter provides an overview of personalization, big data, and identifies challenges related to web personalization with respect to big data. It also presents some approaches and models to fill the gap between big data and web personalization. Further, this research brings additional opportunities to web personalization from the perspective of big data.

Download Full-text

Performance Comparison of Tree-Based Machine Learning Classifiers for Web Usage Mining

Lecture Notes in Electrical Engineering - Proceedings of International Conference on Communication, Circuits, and Systems ◽

10.1007/978-981-33-4866-0_47 ◽

2021 ◽

pp. 379-387

Author(s):

Ruchi Mittal ◽

Varun Malik ◽

Vikas Rattan ◽

Deepika Jhamb

Keyword(s):

Machine Learning ◽

Performance Comparison ◽

Web Usage Mining ◽

Machine Learning Classifiers ◽

Web Usage ◽

Learning Classifiers

Download Full-text

web usage mining
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Semantic Based Weighted Web Session Clustering Using Adapted K-Means and Hierarchical Agglomerative Algorithms

A Review on Clustering Techniques: Creating Better User Experience for Online Roadshow

Web Usage Mining Menggunakan Algoritma Clastering K-Mean

Performance Evaluation of the MapReduce-based Parallel Data Preprocessing Algorithm in Web Usage Mining with Robot Detection Approaches

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Analysis of User’s Learning Styles and Academic Emotions through Web Usage Mining

Ekstraksi Click Stream Data Web E-Commerce Menggunakan Web Usage Mining

Web usage Mining: A Comparison of WUM Category Web Mining Algorithms

Web Usage Mining Issues in Big Data

Performance Comparison of Tree-Based Machine Learning Classifiers for Web Usage Mining

Export Citation Format

web usage miningRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Semantic Based Weighted Web Session Clustering Using Adapted K-Means and Hierarchical Agglomerative Algorithms

A Review on Clustering Techniques: Creating Better User Experience for Online Roadshow

Web Usage Mining Menggunakan Algoritma Clastering K-Mean

Performance Evaluation of the MapReduce-based Parallel Data Preprocessing Algorithm in Web Usage Mining with Robot Detection Approaches

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Analysis of User’s Learning Styles and Academic Emotions through Web Usage Mining

Ekstraksi Click Stream Data Web E-Commerce Menggunakan Web Usage Mining

Web usage Mining: A Comparison of WUM Category Web Mining Algorithms

Web Usage Mining Issues in Big Data

Performance Comparison of Tree-Based Machine Learning Classifiers for Web Usage Mining

web usage mining
Recently Published Documents