Extracting Knowledge from Web Data

2014 ◽  
Vol 7 (4) ◽  
pp. 27-41
Author(s):  
Hanane Ezzikouri ◽  
Mohamed Fakir ◽  
Cherki Daoui ◽  
Mohamed Erritali

The user behavior on a website triggers a sequence of queries that have a result which is the display of certain pages. The Information about these queries (including the names of the resources requested and responses from the Web server) are stored in a text file called a log file. Analysis of server log file can provide significant and useful information. Web Mining is the extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. Web usage mining is a main research area in Web mining focused on learning about Web users and their interactions with Web sites. The motive of mining is to find users' access models automatically and quickly from the vast Web log file, such as frequent access paths, frequent access page groups and user clustering. Through Web Usage Mining, several information left by user access can be mined which will provide foundation for decision making of organizations, Also the process of Web mining was defined as the set of techniques designed to explore, process and analyze large masses of consecutive information activities on the Internet, has three main steps: data preprocessing, extraction of reasons of the use and the interpretation of results. This paper will start with the presentation of different formats of web log files, then it will present the different preprocessing method that have been used, and finally it presents a system for “Web content and Usage Mining'' for web data extraction and web site analysis using Data Mining Algorithms Apriori, FPGrowth, K-Means, KNN, and ID3.

Author(s):  
A. V. Senthil Kumar ◽  
R. Umagandhi

Web Usage Mining (WUM) is the process of discovery and analysis of useful information from the World Wide Web (WWW) by applying data mining techniques. The main research area in Web mining is focused on learning about Web users and their interactions with Web sites by analysing the log entries from the user log file. The motive of mining is to find users' access models automatically and quickly from the vast Web log data, such as similar queries imposed by the various users, frequent queries applied by the user, frequent web sites visited by the users, clustering of users with similar intent etc. This chapter deals with Web mining, Categories of Web mining, Web usage mining and its process, Applications of Web usage mining across the industries and its related works. This Chapter offers a general knowledge about Web usage mining and its applications for the benefits of researchers those performing research activities in WUM.


2017 ◽  
pp. 2005-2029
Author(s):  
A. V. Senthil Kumar ◽  
R. Umagandhi

Web Usage Mining (WUM) is the process of discovery and analysis of useful information from the World Wide Web (WWW) by applying data mining techniques. The main research area in Web mining is focused on learning about Web users and their interactions with Web sites by analysing the log entries from the user log file. The motive of mining is to find users' access models automatically and quickly from the vast Web log data, such as similar queries imposed by the various users, frequent queries applied by the user, frequent web sites visited by the users, clustering of users with similar intent etc. This chapter deals with Web mining, Categories of Web mining, Web usage mining and its process, Applications of Web usage mining across the industries and its related works. This Chapter offers a general knowledge about Web usage mining and its applications for the benefits of researchers those performing research activities in WUM.


Author(s):  
Serra Çelik

This chapter focuses on predicting web user behaviors. When web users enter a website, every move they make on that website is stored as web log files. Unlike the focus group or questionnaire, the log files reflect real user behavior. It can easily be said that having actual user behavior is a gold value for the organizations. In this chapter, the ways of extracting user patterns (user behavior) from the log files are sought. In this context, the web usage mining process is explained. Some web usage mining techniques are mentioned.


Author(s):  
Xiangji Huang

With the rapid growth of the World Wide Web, the use of automated Web-mining techniques to discover useful and relevant information has become increasingly important. One challenging direction is Web usage mining, wherein one attempts to discover user navigation patterns of Web usage from Web access logs. Properly exploited, the information obtained from Web usage log can assist us to improve the design of a Web site, refine queries for effective Web search, and build personalized search engines. However, Web log data are usually large in size and extremely detailed, because they are likely to record every aspect of a user request to a Web server. It is thus of great importance to process the raw Web log data in an appropriate way, and identify the target information intelligently. In this chapter, we first briefly review the concept of Web Usage Mining and discuss its difference from classic Knowledge Discovery techniques, and then focus on exploiting Web log sessions, defined as a group of requests made by a single user for a single navigation purpose, in Web usage mining. We also compare some of the state-of-the-art techniques in identifying log sessions from Web servers, and present some popular Web mining techniques, including Association Rule Mining, Clustering, Classification, Collaborative Filtering, and Sequential Pattern Learning, that can be exploited on the Web log data for different research and application purposes.


2020 ◽  
Vol 9 (4) ◽  
pp. 486-494
Author(s):  
Galuh Nurvinda Kurniawati ◽  
Rukun Santoso ◽  
Sugito Sugito

The comprehension of web visitors patterns are needed to develop website in an optimal fashion. The visitor pattern contained in the web log file of Diponegoro University’s website is clustered by Modified Gustafson-Kessel method. In general, this method produces two until six clusters. Two kinds of results are outlined in this paper. The first is the result contains two clusters, and the second is containing three clusters. In the first result, the visitors are divided into information seekers of student capacity and Engineering Faculty. In the second result, the visitors are divided into information seekers of Medicine Faculty, student admission and Engineering Faculty.  


Author(s):  
IJMTST061248

Automated User Behavior Mapping is an application of web usage mining using which we can see the real-time behavior of end user visiting a particular web page automatically. The technologies used in this are socket programming for real-time communication between the server and the user accessing the website for collection of web log data and selenium web driver for automating the user behavior using web log files.


2012 ◽  
Vol 3 (4) ◽  
pp. 92-94
Author(s):  
SUJATHA PADMAKUMAR ◽  
◽  
Dr.PUNITHAVALLI Dr.PUNITHAVALLI ◽  
Dr.RANJITH Dr.RANJITH

Sign in / Sign up

Export Citation Format

Share Document