Web usage Mining: A Comparison of WUM Category Web Mining Algorithms

2011 ◽

pp. 38-68

Author(s):

Guandong Xu

Keyword(s):

Latent Semantic Analysis ◽

Web Mining ◽

Semantic Analysis ◽

Information Overload ◽

Critical Issue ◽

Web Usage Mining ◽

Semantic Level ◽

Web Usage ◽

User Communities ◽

Usage Patterns

Nowadays Web users are facing the problems of information overload and drowning due to the significant and rapid growth in the amount of information and the large number of users. As a result, how to provide Web users more exactly needed information is becoming a critical issue in Web-based information retrieval and data management. In order to address the above difficulties, Web mining was proposed as an efficient means to discover the intrinsic relationships among Web data. In particular, Web usage mining is to discover Web usage patterns and utilize the discovered usage knowledge for constructing interest-oriented user communities, which could be, in turn, used for presenting Web users more personalized Web contents, i.e. Web recommendation. On the other hand, Latent Semantic Analysis (LSA) is one kind of approaches that is used to reveal the inherent correlation resided in co-occurrence activities, such as Web usage data. Moreover, LSA possesses the capability of capturing the hidden knowledge at semantic level that can’t be achieved by traditional methods. In this chapter, we aim to address building user communities of interests via combining Web usage mining and latent semantic analysis. Meanwhile we also present the application of user communities for Web recommendation.

Download Full-text

Methodologies and Techniques of Web Usage Mining

Advances in Data Mining and Database Management - Web Usage Mining Techniques and Applications Across Industries ◽

10.4018/978-1-5225-0613-3.ch011 ◽

2017 ◽

pp. 275-296

Author(s):

T. Venkat Narayana Rao ◽

D. Hiranmayi

Keyword(s):

Web Mining ◽

Pattern Analysis ◽

Pattern Discovery ◽

Secondary Data ◽

Web Usage Mining ◽

Sequential Patterns ◽

Useful Knowledge ◽

Web Usage ◽

Automatic Discovery ◽

Collection Data

Web usage mining attempts to discover useful knowledge from the secondary data obtained from the interactions of the users with the Web. It is the type of Web mining activity that involves the automatic discovery of out what users are looking for on the Internet. In this chapter methodology of web usage mining explained in detail which are data collection, data preprocessing, knowledge discovery and pattern analysis. The different Web Usage Mining techniques are described, which are used for knowledge and pattern discovery. These are statistical analysis, sequential patterns, classification, association rule mining, clustering, dependency modeling. Pattern analysis is needed to filter out uninterested rules or patterns from the set found in the pattern discovery phase.

Download Full-text

Analysis of Click Stream Patterns using Soft Biclustering Approaches

International Journal of Information Technologies and Systems Approach ◽

10.4018/jitsa.2011010104 ◽

2011 ◽

Vol 4 (1) ◽

pp. 53-66 ◽

Cited By ~ 1

Author(s):

P. K. Nizar Banu ◽

H. Inbarani

Keyword(s):

Machine Learning ◽

Data Mining ◽

Web Mining ◽

Web Usage Mining ◽

Web Personalization ◽

Partial Matching ◽

Web Usage ◽

Needed Information ◽

Highly Correlated ◽

Web Server Logs

As websites increase in complexity, locating needed information becomes a difficult task. Such difficulty is often related to the websites’ design but also ineffective and inefficient navigation processes. Research in web mining addresses this problem by applying techniques from data mining and machine learning to web data and documents. In this study, the authors examine web usage mining, applying data mining techniques to web server logs. Web usage mining has gained much attention as a potential approach to fulfill the requirement of web personalization. In this paper, the authors propose K-means biclustering, rough biclustering and fuzzy biclustering approaches to disclose the duality between users and pages by grouping them in both dimensions simultaneously. The simultaneous clustering of users and pages discovers biclusters that correspond to groups of users that exhibit highly correlated ratings on groups of pages. The results indicate that the fuzzy C-means biclustering algorithm best and is able to detect partial matching of preferences.

Download Full-text

Extracting Knowledge from Web Data

Journal of Information Technology Research ◽

10.4018/jitr.2014100103 ◽

2014 ◽

Vol 7 (4) ◽

pp. 27-41

Author(s):

Hanane Ezzikouri ◽

Mohamed Fakir ◽

Cherki Daoui ◽

Mohamed Erritali

Keyword(s):

Web Mining ◽

User Behavior ◽

Data Extraction ◽

Research Area ◽

Web Usage Mining ◽

Web Data ◽

Main Research ◽

Web Log ◽

Web Usage ◽

Log File

The user behavior on a website triggers a sequence of queries that have a result which is the display of certain pages. The Information about these queries (including the names of the resources requested and responses from the Web server) are stored in a text file called a log file. Analysis of server log file can provide significant and useful information. Web Mining is the extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. Web usage mining is a main research area in Web mining focused on learning about Web users and their interactions with Web sites. The motive of mining is to find users' access models automatically and quickly from the vast Web log file, such as frequent access paths, frequent access page groups and user clustering. Through Web Usage Mining, several information left by user access can be mined which will provide foundation for decision making of organizations, Also the process of Web mining was defined as the set of techniques designed to explore, process and analyze large masses of consecutive information activities on the Internet, has three main steps: data preprocessing, extraction of reasons of the use and the interpretation of results. This paper will start with the presentation of different formats of web log files, then it will present the different preprocessing method that have been used, and finally it presents a system for “Web content and Usage Mining'' for web data extraction and web site analysis using Data Mining Algorithms Apriori, FPGrowth, K-Means, KNN, and ID3.

Download Full-text

Penggunaan Metode berbasis Graph untuk Mining Frequent Sequential Access Pattern Pada Studi Kasus : Website iGracias Universitas Telkom

Indonesian Journal on Computing (Indo-JC) ◽

10.21108/indojc.2017.2.1.146 ◽

2017 ◽

Vol 2 (1) ◽

pp. 91 ◽

Cited By ~ 1

Author(s):

Rahmi Rohdiniyah ◽

Ibnu Asror ◽

Gede Agung Ary Wisudawan

Keyword(s):

Web Mining ◽

Web Usage Mining ◽

Access Pattern ◽

Web Usage ◽

Access Patterns

Penggunaan website pada bidang pendidikan, khususnya sebuah universitas, bertujuan untuk menyimpan berbagai informasi yang ada pada lingkungan universitas tersebut. Untuk itu, perlu dilakukan perbaikan struktur untuk memelihara kualitas dari web. Salah satu teknik yang dapat digunakan adalah dengan menggunakan web usage mining. Web usage mining merupakan salah satu cabang dari web mining yang digunakan untuk menemukan informasi atau pengetahuan yang bermanfaat dari pola navigasi user pada sebuah website. Pada penelitian ini menggunakan metode berbasis graph untuk frequent sequential access patterns dan menggunakan Igracias Universitas Telkom sebagai studi kasusnya. Karena Igracias selalu digunakan oleh seluruh entitas yang ada pada Universitas Telkom. Metode ini memiliki kelebihan untuk menemukan behavior pola pengaksesan user. Dari implementasi metoda ini didapat pola akses group user secara berurutan.

Download Full-text

A Guesstimate on Web Usage Mining Algorithms and Techniques

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse/v7i6/0284 ◽

2017 ◽

Vol 7 (6) ◽

pp. 518-521

Author(s):

Prabha .K ◽

◽

Suganya .T ◽

Keyword(s):

Web Usage Mining ◽

Web Usage ◽

Mining Algorithms

Download Full-text

Statistical Methods for User Profiling in Web Usage Mining

Handbook of Research on Text and Web Mining Technologies ◽

10.4018/978-1-59904-990-8.ch022 ◽

2010 ◽

pp. 359-368 ◽

Cited By ~ 1

Author(s):

Marcello Pecoraro

Keyword(s):

Statistical Methods ◽

Web Mining ◽

Web Usage Mining ◽

User Profiling ◽

Web Usage ◽

Data Object ◽

Segmentation Methods ◽

Binary Segmentation ◽

Usage Analysis ◽

The Web

This chapter aims at providing an overview about the use of statistical methods supporting the Web Usage Mining. Within the first part is described the framework of the Web Usage Mining as a branch of the Web Mining committed to the study of how to use a Website. Then, the data (object of the analysis) are detailed together with the problems linked to the pre-processing. Once clarified, the data origin and their treatment for a correct development of a Web Usage analysis,the focus shifts on the statistical techniques that can be applied to the analysis background, with reference to binary segmentation methods. Those latter allow the discrimination through a response variable that determines the affiliation of the users to a group by considering some characteristics detected on the same users.

Download Full-text

Performance Comparison of Data Mining Classifiers on Web Log Data

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2020.9349 ◽

2020 ◽

Vol 17 (11) ◽

pp. 5113-5116

Author(s):

Varun Malik ◽

Vikas Rattan ◽

Jaiteg Singh ◽

Ruchi Mittal ◽

Urvashi Tandon

Keyword(s):

Web Mining ◽

Performance Comparison ◽

Web Usage Mining ◽

Classification Algorithms ◽

Web Content ◽

Web Usage ◽

Web Structure ◽

Web Structure Mining ◽

Content Mining ◽

The Web

Web usage mining is the branch of web mining that deals with mining of data over the web. Web mining can be categorized as web content mining, web structure mining, web usage mining. In this paper, we have summarized the web usage mining results executed over the user tool WMOT (web mining optimized tool) based on the WEKA tool that has been used to apply various classification algorithms such as Naïve Bayes, KNN, SVM and tree based algorithms. Authors summarized the results of classification algorithms on WMOT tool and compared the results on the basis of classified instances and identify the algorithms that gives better instances accuracy.

Download Full-text

Performance evaluation of frequent pattern mining algorithms using web log data for web usage mining

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei.2017.8302317 ◽

2017 ◽

Author(s):

Yonas Gashaw ◽

Fang Liu

Keyword(s):

Performance Evaluation ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Web Usage Mining ◽

Frequent Pattern ◽

Log Data ◽

Web Log ◽

Web Usage ◽

Mining Algorithms

Download Full-text

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i2.1448 ◽

2021 ◽

Vol 12 (2) ◽

pp. 1634-1639

Author(s):

V Aruna, Et. al.

Keyword(s):

Web Mining ◽

Pattern Mining ◽

Pattern Discovery ◽

Relevant Information ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Web Usage ◽

Hidden Data ◽

The Web

In the recent years with the advancement in technology, a lot of information is available in different formats and extracting the knowledge from that data has become a very difficult task. Due to the vast amount of information available on the web, users are finding it difficult to extract relevant information or create new knowledge using information available on the web. To solve this problem Web mining techniques are used to discover the interesting patterns from the hidden data .Web Usage Mining (WUM), which is one of the subset of Web Mining helps in extracting the hidden knowledge present in the Web log files , in recognizing various interests of web users and also in discovering customer behaviours. Web Usage mining includes different phases of data mining techniques called Data Pre-processing, Pattern Discovery & Pattern Analysis. This paper presents an updated focused survey on various sequential pattern mining algorithms like apriori-based algorithm , Breadth First Search-based strategy, Depth First Search strategy, sequential closed-pattern algorithm and Incremental pattern mining algorithm which are used in Pattern Discovery Phase of WUM. At last , a comparison is done based on the important key features present in these algorithms. This study gives us better understanding of the approaches of sequential pattern mining.

Download Full-text

Web usage Mining: A Comparison of WUM Category Web Mining Algorithms

Building User Communities of Interests by Using Latent Semantic Analysis

Methodologies and Techniques of Web Usage Mining

Analysis of Click Stream Patterns using Soft Biclustering Approaches

Extracting Knowledge from Web Data

Penggunaan Metode berbasis Graph untuk Mining Frequent Sequential Access Pattern Pada Studi Kasus : Website iGracias Universitas Telkom

A Guesstimate on Web Usage Mining Algorithms and Techniques

Statistical Methods for User Profiling in Web Usage Mining

Performance Comparison of Data Mining Classifiers on Web Log Data

Performance evaluation of frequent pattern mining algorithms using web log data for web usage mining

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Export Citation Format