Web Usage Mining with Web Logs

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch321 ◽

2011 ◽

pp. 2096-2102 ◽

Cited By ~ 1

Author(s):

Xiangji Huang

Keyword(s):

Web Mining ◽

Web Search ◽

Relevant Information ◽

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage ◽

Web Access ◽

Navigation Patterns ◽

Access Logs

With the rapid growth of the World Wide Web, the use of automated Web-mining techniques to discover useful and relevant information has become increasingly important. One challenging direction is Web usage mining, wherein one attempts to discover user navigation patterns of Web usage from Web access logs. Properly exploited, the information obtained from Web usage log can assist us to improve the design of a Web site, refine queries for effective Web search, and build personalized search engines. However, Web log data are usually large in size and extremely detailed, because they are likely to record every aspect of a user request to a Web server. It is thus of great importance to process the raw Web log data in an appropriate way, and identify the target information intelligently. In this chapter, we first briefly review the concept of Web Usage Mining and discuss its difference from classic Knowledge Discovery techniques, and then focus on exploiting Web log sessions, defined as a group of requests made by a single user for a single navigation purpose, in Web usage mining. We also compare some of the state-of-the-art techniques in identifying log sessions from Web servers, and present some popular Web mining techniques, including Association Rule Mining, Clustering, Classification, Collaborative Filtering, and Sequential Pattern Learning, that can be exploited on the Web log data for different research and application purposes.

Download Full-text

A Web Usage Mining Approach to User Navigation Pattern and Prediction in Web Log Data

International Journal of Scientific Research ◽

10.15373/22778179/apr2014/34 ◽

2012 ◽

Vol 3 (4) ◽

pp. 92-94

Author(s):

SUJATHA PADMAKUMAR ◽

◽

Dr.PUNITHAVALLI Dr.PUNITHAVALLI ◽

Dr.RANJITH Dr.RANJITH

Keyword(s):

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage ◽

User Navigation

Download Full-text

Extracting Knowledge from Web Data

Journal of Information Technology Research ◽

10.4018/jitr.2014100103 ◽

2014 ◽

Vol 7 (4) ◽

pp. 27-41

Author(s):

Hanane Ezzikouri ◽

Mohamed Fakir ◽

Cherki Daoui ◽

Mohamed Erritali

Keyword(s):

Web Mining ◽

User Behavior ◽

Data Extraction ◽

Research Area ◽

Web Usage Mining ◽

Web Data ◽

Main Research ◽

Web Log ◽

Web Usage ◽

Log File

The user behavior on a website triggers a sequence of queries that have a result which is the display of certain pages. The Information about these queries (including the names of the resources requested and responses from the Web server) are stored in a text file called a log file. Analysis of server log file can provide significant and useful information. Web Mining is the extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. Web usage mining is a main research area in Web mining focused on learning about Web users and their interactions with Web sites. The motive of mining is to find users' access models automatically and quickly from the vast Web log file, such as frequent access paths, frequent access page groups and user clustering. Through Web Usage Mining, several information left by user access can be mined which will provide foundation for decision making of organizations, Also the process of Web mining was defined as the set of techniques designed to explore, process and analyze large masses of consecutive information activities on the Internet, has three main steps: data preprocessing, extraction of reasons of the use and the interpretation of results. This paper will start with the presentation of different formats of web log files, then it will present the different preprocessing method that have been used, and finally it presents a system for “Web content and Usage Mining'' for web data extraction and web site analysis using Data Mining Algorithms Apriori, FPGrowth, K-Means, KNN, and ID3.

Download Full-text

Indexing Techniques for Web Access Logs

Web Information Systems ◽

10.4018/978-1-59140-208-4.ch009 ◽

2004 ◽

pp. 305-334 ◽

Cited By ~ 2

Author(s):

Yannis Manolopoulos ◽

Mikolaj Morzy ◽

Tadeusz Morzy ◽

Alexandros Nanopoulos ◽

Marek Wojciechowski ◽

...

Keyword(s):

Log Data ◽

Web Log ◽

Indexing Techniques ◽

Web Access ◽

User Access ◽

Single User ◽

Efficient Processing ◽

Access Logs ◽

Web Access Logs ◽

The Web

Access histories of users visiting a web server are automatically recorded in web access logs. Conceptually, the web-log data can be regarded as a collection of clients’ access-sequences, where each sequence is a list of pages accessed by a single user in a single session. This chapter presents novel indexing techniques that support efficient processing of so-called pattern queries, which consist of finding all access sequences that contain a given subsequence. Pattern queries are a key element of advanced analyses of web-log data, especially those concerning typical navigation schemes. In this chapter, we discuss the particularities of efficiently processing user access-sequences with pattern queries, compared to the case of searching unordered sets. Extensive experimental results are given, which examine a variety of factors and illustrate the superiority of the proposed methods over indexing techniques for unordered data adapted to access sequences.

Download Full-text

Web Usage Mining

Web Mining ◽

10.4018/978-1-59140-414-9.ch018 ◽

2011 ◽

pp. 373-392 ◽

Cited By ~ 1

Author(s):

Yew-Kwong Woon ◽

Wee-Keong Ng ◽

Ee-Peng Lim

Keyword(s):

Data Mining ◽

World Wide ◽

Web Usage Mining ◽

Web Usage ◽

Online Business ◽

Web Access ◽

Business Competitiveness ◽

Access Logs ◽

Web Server Logs ◽

Web Access Logs

The rising popularity of electronic commerce makes data mining an indispensable technology for several applications, especially online business competitiveness. The World Wide Web provides abundant raw data in the form of Web access logs. However, without data mining techniques, it is difficult to make any sense out of such massive data. In this chapter, we focus on the mining of Web access logs, commonly known as Web usage mining. We analyze algorithms for preprocessing and extracting knowledge from such logs. We will also propose our own techniques to mine the logs in a more holistic manner. Experiments conducted on real Web server logs verify the practicality as well as the efficiency of the proposed techniques as compared to an existing technique. Finally, challenges in Web usage mining are discussed.

Download Full-text

Performance evaluation of frequent pattern mining algorithms using web log data for web usage mining

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei.2017.8302317 ◽

2017 ◽

Author(s):

Yonas Gashaw ◽

Fang Liu

Keyword(s):

Performance Evaluation ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Web Usage Mining ◽

Frequent Pattern ◽

Log Data ◽

Web Log ◽

Web Usage ◽

Mining Algorithms

Download Full-text

Active User-Based and Ontology-Based Web Log Data Preprocessing for Web Usage Mining

2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06) ◽

10.1109/wi.2006.32 ◽

2006 ◽

Cited By ~ 17

Author(s):

Natheer Khasawneh ◽

Chien-chung Chan

Keyword(s):

Data Preprocessing ◽

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage ◽

Active User

Download Full-text

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i2.1448 ◽

2021 ◽

Vol 12 (2) ◽

pp. 1634-1639

Author(s):

V Aruna, Et. al.

Keyword(s):

Web Mining ◽

Pattern Mining ◽

Pattern Discovery ◽

Relevant Information ◽

Sequential Pattern Mining ◽

Web Usage Mining ◽

Sequential Pattern ◽

Web Usage ◽

Hidden Data ◽

The Web

In the recent years with the advancement in technology, a lot of information is available in different formats and extracting the knowledge from that data has become a very difficult task. Due to the vast amount of information available on the web, users are finding it difficult to extract relevant information or create new knowledge using information available on the web. To solve this problem Web mining techniques are used to discover the interesting patterns from the hidden data .Web Usage Mining (WUM), which is one of the subset of Web Mining helps in extracting the hidden knowledge present in the Web log files , in recognizing various interests of web users and also in discovering customer behaviours. Web Usage mining includes different phases of data mining techniques called Data Pre-processing, Pattern Discovery & Pattern Analysis. This paper presents an updated focused survey on various sequential pattern mining algorithms like apriori-based algorithm , Breadth First Search-based strategy, Depth First Search strategy, sequential closed-pattern algorithm and Incremental pattern mining algorithm which are used in Pattern Discovery Phase of WUM. At last , a comparison is done based on the important key features present in these algorithms. This study gives us better understanding of the approaches of sequential pattern mining.

Download Full-text

A Study on Data Preprocessing Methods on Web Log Data in Web Usage Mining

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i7.920928 ◽

2018 ◽

Vol 6 (7) ◽

pp. 920-928

Author(s):

R.Sandrilla . ◽

M. Savitha Devi

Keyword(s):

Data Preprocessing ◽

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage

Download Full-text

Extracting the User’s Interests by Using Web Log Data Based on Web Usage Mining

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2015.4468 ◽

2015 ◽

Vol 12 (12) ◽

pp. 5031-5040

Author(s):

Kannasani Srinivasa Rao ◽

M Krishnamurthy ◽

A Kannan

Keyword(s):

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage

Download Full-text

Taxonomical Classification of Web Usage Mining Applications and its Ontological Representation

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2018.7.3.1902 ◽

2018 ◽

Vol 7 (3) ◽

pp. 39-43

Author(s):

Satyaveer Singh ◽

Mahendra Singh Aswal

Keyword(s):

Web Usage Mining ◽

Web Usage ◽

Navigation Patterns ◽

Product Recommendations ◽

Taxonomical Classification ◽

The Comparative Study ◽

The Web ◽

Real World Problems

Web usage mining is used to find out fascinating consumer navigation patterns which can be applied to a lot of real-world problems, such as enriching websites or pages, generating newly topic or product recommendations and consumer behavior studies, etc. In this paper, an attempt has been made to provide a taxonomical classification of web usage mining applications with two levels of hierarchy. Further, the ontology for various categories of the web usage mining applications has been developed and to prove the completeness of proposed taxonomy, a rigorous case study has been performed. The comparative study with other existing classifications of web usage mining applications has also been performed.

Download Full-text