Web Usage Mining Issues in Big Data

The web is a rich data mining source which is dynamic and fast growing, providing great opportunities which are often not exploited. Web data represent a real challenge to traditional data mining techniques due to its huge amount and the unstructured nature. Web logs contain information about the interactions between visitors and the website. Analyzing these logs provides insights into visitors' behavior, usage patterns, and trends. Web usage mining, also known as web log mining, is the process of applying data mining techniques to discover useful information hidden in web server's logs. Web logs are primarily used by Web administrators to know how much traffic they get and to detect broken links and other types of errors. Web usage mining extracts useful information that can be beneficial to a number of application areas such as: web personalization, website restructuring, system performance improvement, and business intelligence. The Web usage mining process involves three main phases: pre-processing, pattern discovery, and pattern analysis. Various preprocessing techniques have been proposed to extract information from log files and group primitive data items into meaningful, lighter level abstractions that are suitable for mining, usually in forms of visitors' sessions. Major data mining techniques in web usage mining pattern discovery are: clustering, association analysis, classification, and sequential patterns discovery. This chapter discusses the process of web usage mining, its procedure, methods, and patterns discovery techniques. The chapter also presents a practical example using real web log data.

Download Full-text

Methodologies on user Behavior Analysis and Future Request Prediction in Web usage Mining using Data mining Techniques

International Journal of Web Technology ◽

10.20894/ijwt.104.003.001.004 ◽

2014 ◽

Vol 003 (001) ◽

pp. 15-18

Author(s):

M. SelviMohana ◽

◽

B. Rosiline Jeetha ◽

Keyword(s):

Data Mining ◽

Behavior Analysis ◽

User Behavior ◽

Web Usage Mining ◽

User Behavior Analysis ◽

Data Mining Techniques ◽

Web Usage ◽

Using Data

Download Full-text

Web Usage Mining and the Challenge of Big Data

Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence - Advances in Data Mining and Database Management ◽

10.4018/978-1-4666-8505-5.ch020 ◽

2015 ◽

pp. 418-447

Author(s):

Abubakr Gafar Abdalla ◽

Tarig Mohamed Ahmed ◽

Mohamed Elhassan Seliaman

Keyword(s):

Data Mining ◽

Pattern Discovery ◽

Web Usage Mining ◽

Data Mining Techniques ◽

Web Log ◽

Web Usage ◽

Web Logs ◽

Usage Patterns ◽

Rich Data ◽

The Web

The web is a rich data mining source which is dynamic and fast growing, providing great opportunities which are often not exploited. Web data represent a real challenge to traditional data mining techniques due to its huge amount and the unstructured nature. Web logs contain information about the interactions between visitors and the website. Analyzing these logs provides insights into visitors' behavior, usage patterns, and trends. Web usage mining, also known as web log mining, is the process of applying data mining techniques to discover useful information hidden in web server's logs. Web logs are primarily used by Web administrators to know how much traffic they get and to detect broken links and other types of errors. Web usage mining extracts useful information that can be beneficial to a number of application areas such as: web personalization, website restructuring, system performance improvement, and business intelligence. The Web usage mining process involves three main phases: pre-processing, pattern discovery, and pattern analysis. Various preprocessing techniques have been proposed to extract information from log files and group primitive data items into meaningful, lighter level abstractions that are suitable for mining, usually in forms of visitors' sessions. Major data mining techniques in web usage mining pattern discovery are: clustering, association analysis, classification, and sequential patterns discovery. This chapter discusses the process of web usage mining, its procedure, methods, and patterns discovery techniques. The chapter also presents a practical example using real web log data.

Download Full-text

Extraction of Knowledge from Web Server Logs Using Web Usage Mining

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s3.2113 ◽

2019 ◽

Vol 8 (S3) ◽

pp. 12-15

Author(s):

B. Harika ◽

T. Sudha

Keyword(s):

Data Mining ◽

Web Mining ◽

Web Server ◽

Primary Source ◽

Web Usage Mining ◽

Data Mining Techniques ◽

Web Usage ◽

Browsing Behavior ◽

The Web ◽

Usage Data

Information on internet increases rapidly from day to day and the usage of the web also increases, thus there is the need to discover interesting patterns from web. The process used to extract and mine useful information from web documents by using Data Mining Techniques is called Web Mining. Web Mining is broadly classified in to three types namely Web Content Mining, Web Structure Mining and Web Usage Mining. In this paper our focus is mainly on Web Usage Mining, where we are applying the data mining techniques to analyse and discover interesting knowledge from the Web Usage data. The activities of the user are captured and stored at different levels such as server level, proxy level and user level called as Web Usage Data and the usage data stored at server side is Web Server Log, where it records the browsing behavior of users and their requests based on the user clicks. Web server Log is a primary source to perform Web Usage Mining. This paper also brings in to discussion of various existing pre-processing techniques and analysis of web log files and how clustering is applied to group the users based on the browsing behavior of users on their interested contents.

Download Full-text

ANALISIS POLA PERMINTAAN PUBLIKASI DATA BADAN PUSAT STATISTIK MENGGUNAKAN ASSOCIATION RULE APRIORI

KLIK - KUMPULAN JURNAL ILMU KOMPUTER ◽

10.20527/klik.v7i2.322 ◽

2020 ◽

Vol 7 (2) ◽

pp. 187

Author(s):

Farid Ridho ◽

Fachruddin Mansyur

Keyword(s):

Data Mining ◽

Association Rule ◽

Web Usage Mining ◽

Mining Method ◽

Data Mining Techniques ◽

Web Usage ◽

Usage Patterns ◽

Mining Association Rule ◽

The Web

BPS is a data provider body in Indonesia. In publishing, BPS uses a variety of media, one of which is the BPS website. To get data through the BPS website, users can visit the website then download the data they need. The services obtained by data users on the BPS website depend on the quality of the website. The better the quality, the better the service experience gained by data users. The method that can be used to improve the quality of a website is the web usage mining method. Web usage mining is the application of data mining techniques on web repositories to study usage patterns. The purpose of this study is to determine the pattern of data publication requests on the BPS website which can later be used as a reference to improve the quality of BPS website services. Based on the results of the study, it was found that data users tend to access the same data with different years simultaneously. For results by grouping data by title without year, obtained quite diverse rules.Keywords: web usage mining, association rule, aprioriBPS merupakan badan penyedia data di Indonesia. Dalam mempublikasikan datanya, BPS menggunakan berbagai media, salah satunya adalah website BPS. Untuk mendapatkan data melalui website BPS, pengguna dapat mengunjungi website kemudian mengunduh data yang mereka butuhkan. Layanan yang didapatkan oleh pengguna data pada website BPS tergantung dari kualitas website tersebut. Semakin baik kualitasnya, semakin baik pula pengalaman pelayanan yang didapatkan oleh pengguna data. Metode yang dapat digunakan untuk meningkatkan kualitas suatu website adalah metode web usage mining. Web usage mining merupakan penerapan tekhnik data mining pada web repositori untuk mempelajari pola penggunaan. Tujuan dari penelitian ini adalah untuk mengetahui pola permintaan publikasi data pada website BPS yang nantinya dapat digunakan sebagai acuan untuk meningkatkan kualitas layanan website BPS. Berdasarkan hasil penelitian, didapatkan bahwa pengguna data cenderung mengakses data yang sama dengan tahun yang berbeda secara bersamaan. Untuk hasil dengan mengelompokan data berdasarkan judul tanpa tahun, diperoleh rules yang cukup beragam.Kata kunci: web usage mining, association rule, apriori

Download Full-text

Big data and the web discovering meaningful information from web data using data mining techniques

2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions) ◽

10.1109/icrito.2015.7359209 ◽

2015 ◽

Author(s):

Mohd Helmy Abd Wahab

Keyword(s):

Data Mining ◽

Big Data ◽

Web Data ◽

Data Mining Techniques ◽

Meaningful Information ◽

Using Data ◽

The Web

Download Full-text

Effectiveness of Web Usage Mining Techniques in Business Application

Advances in Data Mining and Database Management - Web Usage Mining Techniques and Applications Across Industries ◽

10.4018/978-1-5225-0613-3.ch013 ◽

2017 ◽

pp. 324-350 ◽

Cited By ~ 2

Author(s):

Ahmed El Azab ◽

Mahmood A. Mahmood ◽

Abd El-Aziz

Keyword(s):

Data Mining ◽

Academic Research ◽

Web Usage Mining ◽

Web Pages ◽

Web Data ◽

Web Data Mining ◽

Web Usage ◽

Business Application ◽

Common Interests ◽

The Web

Web usage mining techniques and applications across industries is still exploratory and, despite an increase in academic research, there are challenge of analyze web which quantitatively capture web users' common interests and characterize their underlying tasks. This chapter addresses the problem of how to support web usage mining techniques and applications across industries by combining language of web pages and algorithms that used in web data mining. Existing research in web usage mining techniques tend to focus on finding out how each techniques can apply in different industries fields. However, there is little evidence that researchers have approached the issue of web usage mining across industries. Consequently, the aim of this chapter is to provide an overview of how the web usage mining techniques and applications across industries can be supported.

Download Full-text

Analysis of Click Stream Patterns using Soft Biclustering Approaches

International Journal of Information Technologies and Systems Approach ◽

10.4018/jitsa.2011010104 ◽

2011 ◽

Vol 4 (1) ◽

pp. 53-66 ◽

Cited By ~ 1

Author(s):

P. K. Nizar Banu ◽

H. Inbarani

Keyword(s):

Machine Learning ◽

Data Mining ◽

Web Mining ◽

Web Usage Mining ◽

Web Personalization ◽

Partial Matching ◽

Web Usage ◽

Needed Information ◽

Highly Correlated ◽

Web Server Logs

As websites increase in complexity, locating needed information becomes a difficult task. Such difficulty is often related to the websites’ design but also ineffective and inefficient navigation processes. Research in web mining addresses this problem by applying techniques from data mining and machine learning to web data and documents. In this study, the authors examine web usage mining, applying data mining techniques to web server logs. Web usage mining has gained much attention as a potential approach to fulfill the requirement of web personalization. In this paper, the authors propose K-means biclustering, rough biclustering and fuzzy biclustering approaches to disclose the duality between users and pages by grouping them in both dimensions simultaneously. The simultaneous clustering of users and pages discovers biclusters that correspond to groups of users that exhibit highly correlated ratings on groups of pages. The results indicate that the fuzzy C-means biclustering algorithm best and is able to detect partial matching of preferences.

Download Full-text

They Know What You Will Do Next Click

Interdisciplinary Approaches to Digital Transformation and Innovation - Advances in E-Business Research ◽

10.4018/978-1-7998-1879-3.ch005 ◽

2020 ◽

pp. 100-122

Author(s):

Serra Çelik

Keyword(s):

Focus Group ◽

User Behavior ◽

Web Usage Mining ◽

Web Log ◽

Web Usage ◽

User Behaviors ◽

Log Files ◽

The Web

This chapter focuses on predicting web user behaviors. When web users enter a website, every move they make on that website is stored as web log files. Unlike the focus group or questionnaire, the log files reflect real user behavior. It can easily be said that having actual user behavior is a gold value for the organizations. In this chapter, the ways of extracting user patterns (user behavior) from the log files are sought. In this context, the web usage mining process is explained. Some web usage mining techniques are mentioned.

Download Full-text

Hyperlink Structure Inspired by Web Usage

Web Technologies ◽

10.4018/978-1-60566-982-3.ch108 ◽

2011 ◽

pp. 2034-2047

Author(s):

Pawan Lingras ◽

Rucha Lingras

Keyword(s):

Data Mining ◽

Sequence Analysis ◽

Data Mining Techniques ◽

Graph Theoretic ◽

Web Usage ◽

Use Of Data ◽

Navigational Patterns ◽

Usage Patterns ◽

Navigation Patterns ◽

Visualization Tools

This chapter describes how Web usage patterns can be used to improve the navigational structure of a Web site. The discussion begins with an illustration of visualization tools that study aggregate and individual link traversals. The use of data mining techniques such as classification, association, and sequence analysis to discover knowledge about Web usage, such as navigational patterns, is also discussed. Finally, a graph theoretic algorithm to create an optimal navigational hyperlink structure, based on known navigation patterns, is presented. The discussion is supported by analysis of realworld datasets.

Download Full-text