Extracting Knowledge from Web Data

Hanane Ezzikouri; Mohamed Fakir; Cherki Daoui; Mohamed Erritali

doi:10.4018/jitr.2014100103

Extracting Knowledge from Web Data

Journal of Information Technology Research ◽

10.4018/jitr.2014100103 ◽

2014 ◽

Vol 7 (4) ◽

pp. 27-41

Author(s):

Hanane Ezzikouri ◽

Mohamed Fakir ◽

Cherki Daoui ◽

Mohamed Erritali

Keyword(s):

Web Mining ◽

User Behavior ◽

Data Extraction ◽

Research Area ◽

Web Usage Mining ◽

Web Data ◽

Main Research ◽

Web Log ◽

Web Usage ◽

Log File

User behavior on a website triggers a sequence of queries whose result is the display of certain pages. Information about these queries (including the names of the resources requested and the responses from the Web server) is stored in a text file called a log file. Analysis of a server log file can provide significant and useful information. Web mining is the extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. Web usage mining is a main research area in Web mining, focused on learning about Web users and their interactions with Web sites. The aim of mining is to find users' access models, such as frequent access paths, frequent access page groups, and user clusters, automatically and quickly from the vast Web log file. Through Web usage mining, the information left behind by user access can be mined to provide a foundation for organizational decision making. The Web mining process, defined as the set of techniques designed to explore, process, and analyze the large masses of information resulting from activity on the Internet, has three main steps: data preprocessing, extraction of usage patterns, and interpretation of results. This paper first presents the different formats of Web log files, then the different preprocessing methods that have been used, and finally a system for "Web Content and Usage Mining" for Web data extraction and Web site analysis using the data mining algorithms Apriori, FP-Growth, K-Means, KNN, and ID3.
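As a rough illustration of the preprocessing step, the sketch below parses one log line and applies a simple cleaning rule. The abstract does not say which log format the paper uses, so the NCSA Common Log Format is an assumption here, and the names `parse_clf_line` and `is_page_request`, the static-asset suffix list, and the sample line are all hypothetical.

```python
import re
from datetime import datetime

# Assumed layout (NCSA Common Log Format):
# host ident authuser [date] "request" status bytes
CLF_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] "(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)


def parse_clf_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = CLF_PATTERN.match(line)
    if match is None:
        return None
    entry = match.groupdict()
    entry["time"] = datetime.strptime(entry["time"], "%d/%b/%Y:%H:%M:%S %z")
    entry["status"] = int(entry["status"])
    entry["size"] = 0 if entry["size"] == "-" else int(entry["size"])
    # The request field normally looks like: METHOD PATH PROTOCOL
    parts = entry["request"].split()
    if len(parts) >= 2:
        entry["method"], entry["path"] = parts[0], parts[1]
    else:
        entry["method"], entry["path"] = None, None
    return entry


def is_page_request(entry):
    """Hypothetical cleaning rule: keep successful GETs that are not static assets."""
    static_suffixes = (".css", ".js", ".png", ".jpg", ".gif", ".ico")
    return (
        entry["method"] == "GET"
        and 200 <= entry["status"] < 300
        and not entry["path"].lower().endswith(static_suffixes)
    )


# Made-up sample line, used only to exercise the parser.
sample = '192.0.2.1 - - [10/Oct/2014:13:55:36 +0000] "GET /index.html HTTP/1.0" 200 2326'
entry = parse_clf_line(sample)
```

Entries that pass `is_page_request` would then feed the pattern-extraction step; a real deployment would likely also filter robot traffic and handle the extended log formats, which this sketch leaves out.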


Applications of Web Usage Mining across Industries

Advances in Data Mining and Database Management - Web Usage Mining Techniques and Applications Across Industries ◽

10.4018/978-1-5225-0613-3.ch004 ◽

2017 ◽

pp. 92-115

Author(s):

A. V. Senthil Kumar ◽

R. Umagandhi

Keyword(s):

Web Sites ◽

Web Mining ◽

World Wide ◽

Research Area ◽

Web Usage Mining ◽

General Knowledge ◽

Main Research ◽

Web Usage ◽

Research Activities ◽

Log File

Web Usage Mining (WUM) is the process of discovering and analysing useful information from the World Wide Web (WWW) by applying data mining techniques. The main research area in Web mining focuses on learning about Web users and their interactions with Web sites by analysing the entries in the user log file. The aim of mining is to find users' access models automatically and quickly from the vast Web log data, such as similar queries issued by different users, frequent queries issued by a user, Web sites frequently visited by users, and clusters of users with similar intent. This chapter deals with Web mining, the categories of Web mining, Web usage mining and its process, applications of Web usage mining across industries, and related work. It offers general knowledge about Web usage mining and its applications for the benefit of researchers working on WUM.


Applications of Web Usage Mining across Industries

Decision Management ◽

10.4018/978-1-5225-1837-2.ch095 ◽

2017 ◽

pp. 2005-2029

Author(s):

A. V. Senthil Kumar ◽

R. Umagandhi

Keyword(s):

Web Sites ◽

Web Mining ◽

World Wide ◽

Research Area ◽

Web Usage Mining ◽

General Knowledge ◽

Main Research ◽

Web Usage ◽

Research Activities ◽

Log File


They Know What You Will Do Next Click

Interdisciplinary Approaches to Digital Transformation and Innovation - Advances in E-Business Research ◽

10.4018/978-1-7998-1879-3.ch005 ◽

2020 ◽

pp. 100-122

Author(s):

Serra Çelik

Keyword(s):

Focus Group ◽

User Behavior ◽

Web Usage Mining ◽

Web Log ◽

Web Usage ◽

User Behaviors ◽

Log Files ◽

The Web

This chapter focuses on predicting web user behavior. When users enter a website, every move they make on that website is stored in web log files. Unlike focus groups or questionnaires, log files reflect real user behavior, and data on actual user behavior is highly valuable to organizations. This chapter examines ways of extracting user patterns (user behavior) from log files. In this context, the web usage mining process is explained and some web usage mining techniques are described.


Web Data mining-A Research area in Web usage mining

IOSR Journal of Computer Engineering ◽

10.9790/0661-1312226 ◽

2013 ◽

Vol 13 (1) ◽

pp. 22-26

Author(s):

V.S Thiyagarajan ◽

Keyword(s):

Data Mining ◽

Research Area ◽

Web Usage Mining ◽

Web Data ◽

Web Data Mining ◽

Web Usage


Web usage mining: A survey on preprocessing of web log file

2010 International Conference on Information and Emerging Technologies ◽

10.1109/iciet.2010.5625730 ◽

2010 ◽

Cited By ~ 21

Author(s):

Tasawar Hussain ◽

Sohail Asghar ◽

Nayyer Masood

Keyword(s):

Web Usage Mining ◽

Web Log ◽

Web Usage ◽

Log File


Web Usage Mining with Web Logs

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch321 ◽

2011 ◽

pp. 2096-2102 ◽

Cited By ~ 1

Author(s):

Xiangji Huang

Keyword(s):

Web Mining ◽

Web Search ◽

Relevant Information ◽

Web Usage Mining ◽

Log Data ◽

Web Log ◽

Web Usage ◽

Web Access ◽

Navigation Patterns ◽

Access Logs

With the rapid growth of the World Wide Web, the use of automated Web-mining techniques to discover useful and relevant information has become increasingly important. One challenging direction is Web usage mining, wherein one attempts to discover user navigation patterns from Web access logs. Properly exploited, the information obtained from Web usage logs can help us improve the design of a Web site, refine queries for effective Web search, and build personalized search engines. However, Web log data are usually large in size and extremely detailed, because they are likely to record every aspect of a user request to a Web server. It is thus of great importance to process the raw Web log data in an appropriate way and to identify the target information intelligently. In this chapter, we first briefly review the concept of Web Usage Mining and discuss how it differs from classic Knowledge Discovery techniques, and then focus on exploiting Web log sessions, defined as a group of requests made by a single user for a single navigation purpose, in Web usage mining. We also compare some of the state-of-the-art techniques for identifying log sessions from Web servers, and present some popular Web mining techniques, including Association Rule Mining, Clustering, Classification, Collaborative Filtering, and Sequential Pattern Learning, that can be exploited on the Web log data for different research and application purposes.
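The notion of a log session can be made concrete with a small sketch. The abstract does not say which session-identification techniques the chapter compares, so the version below uses a common time-gap heuristic as an assumption: requests from the same user belong to one session until the gap between consecutive requests exceeds a timeout. The function name `sessionize`, the 30-minute default, the use of the client address as the user identifier, and the sample data are all hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def sessionize(requests, timeout=timedelta(minutes=30)):
    """Group (user, timestamp, path) tuples into per-user sessions.

    A new session starts whenever the gap since the user's previous
    request exceeds `timeout` (an assumed heuristic, not the chapter's method).
    """
    by_user = defaultdict(list)
    for user, timestamp, path in requests:
        by_user[user].append((timestamp, path))

    sessions = []
    for user, hits in by_user.items():
        hits.sort(key=lambda hit: hit[0])  # order each user's requests by time
        current = [hits[0][1]]
        last_time = hits[0][0]
        for timestamp, path in hits[1:]:
            if timestamp - last_time > timeout:
                sessions.append((user, current))  # close the previous session
                current = []
            current.append(path)
            last_time = timestamp
        sessions.append((user, current))
    return sessions


# Made-up requests, used only to exercise the function.
requests = [
    ("192.0.2.1", datetime(2014, 10, 10, 13, 0), "/index.html"),
    ("192.0.2.1", datetime(2014, 10, 10, 13, 5), "/products.html"),
    ("192.0.2.2", datetime(2014, 10, 10, 13, 6), "/index.html"),
    ("192.0.2.1", datetime(2014, 10, 10, 15, 0), "/contact.html"),
]
sessions = sessionize(requests)
```

With this sample data the first user's requests split into two sessions, because the last request arrives almost two hours after the previous one. A time gap is only a proxy for "a single navigation purpose"; referrer-based or navigation-structure heuristics are alternatives that this sketch does not cover.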


Web Usage Mining Analysis Using the Modified Gustafson-Kessel Clustering Method and Association Rules on the Diponegoro University Website

Jurnal Gaussian ◽

10.14710/j.gauss.v9i4.29446 ◽

2020 ◽

Vol 9 (4) ◽

pp. 486-494

Author(s):

Galuh Nurvinda Kurniawati ◽

Rukun Santoso ◽

Sugito Sugito

Keyword(s):

Association Rule ◽

Web Usage Mining ◽

Engineering Faculty ◽

Web Log ◽

Web Usage ◽

Log File ◽

The Web

An understanding of web visitors' patterns is needed to develop a website optimally. The visitor patterns contained in the web log file of Diponegoro University's website are clustered using the Modified Gustafson-Kessel method. In general, this method produces two to six clusters. Two results are outlined in this paper: the first contains two clusters, and the second contains three. In the first result, the visitors are divided into those seeking information on student capacity and those seeking information on the Engineering Faculty. In the second result, the visitors are divided into those seeking information on the Medicine Faculty, on student admission, and on the Engineering Faculty.


Prediction of User Behavior using Web log in Web Usage Mining

International Journal of Computer Applications ◽

10.5120/ijca2016909228 ◽

2016 ◽

Vol 139 (8) ◽

pp. 4-7

Author(s):

Virendra R. ◽

Govind V.

Keyword(s):

User Behavior ◽

Web Usage Mining ◽

Web Log ◽

Web Usage


Automated User Behavior Mapping Using Web Usage Mining

International Journal for Modern Trends in Science and Technology - RTT2020 ◽

10.46501/ijmtst061247 ◽

2020 ◽

Vol 6 (12) ◽

pp. 257-261

Author(s):

IJMTST061248

Keyword(s):

Real Time ◽

User Behavior ◽

Web Usage Mining ◽

Time Behavior ◽

End User ◽

Web Page ◽

The Real ◽

Web Log ◽

Web Usage ◽

Log Files

Automated User Behavior Mapping is an application of web usage mining with which the real-time behavior of an end user visiting a particular web page can be observed automatically. The technologies used are socket programming, for real-time communication between the server and the user accessing the website in order to collect web log data, and Selenium WebDriver, for automating the user behavior using web log files.
