World Wide Web Usage Mining

Author(s):  
Ajith Abraham
Author(s):  
Wen-Chen Hu ◽  
Hung-Jen Yang ◽  
Chung-wei Lee ◽  
Jyh-haw Yeh

World Wide Web data mining includes content mining, hyperlink structure mining, and usage mining. All three approaches attempt to extract knowledge from the Web, produce some useful results from the knowledge extracted, and apply the results to certain real-world problems. The first two apply the data mining techniques to Web page contents and hyperlink structures, respectively. The third approach, Web usage mining (the theme of this article), is the application of data mining techniques to the usage logs of large Web data repositories in order to produce results that can be applied to many practical subjects, such as improving Web sites/pages, making additional topic or product recommendations, user/customer behavior studies, and so forth. This article provides a survey and analysis of current Web usage mining technologies and systems. A Web usage mining system must be able to perform five major functions: (i) data gathering, (ii) data preparation, (iii) navigation pattern discovery, (iv) pattern analysis and visualization, and (v) pattern applications. Many Web usage mining technologies have been proposed, and each technology employs a different approach. This article first describes a generalized Web usage mining system, which includes five individual functions. Each system function is then explained and analyzed in detail. Related surveys of Web usage mining techniques also can be found in Hu, et al. (2003) and Kosala and Blockeel (2000).


2021 ◽  
Vol 7 (2) ◽  
pp. 65-72
Author(s):  
Kartina Diah Kusuma Wardani

E-Commerce berkembang pesat dalam world wide web hingga menghasilkan berbagai jenis data yang dapat dianalisa lebih lanjut untuk berbagai keperluan seperti personifikasi web, profiling customer, dan sebagainya. Salah satu jenis data yang dihasilkan e-Commerce adalah click stream data web yang merekam aktivitas visitor web dalam bentuk log data selama berinteraksi pada laman web. Penelitian ini mengekstraksi click stream data web e-commerce untuk mendapatkan pola interaksi konsumen terhadap halaman web selama mengunjungi web e-commerce. Berdasarkan jenis data yang diekstrak maka web usage mining digunakan untuk ekstraksi pola dari click stream data yang berbentuk log data. Teknik mining yang dianalisa terhadap log data e-commerce pada penelitian ini terdiri dari frequent itemset, asociation rules, dan frequence sequence mining. Frequent itemset menghasilkan halaman web yang paling sering diakses oleh visitor. Association rules menghasilkan pola kemungkinan halaman web yang akan diakses visitor jika visitor mengakses halaman-halamn tertentu. Frequence sequence mining mendapatkan pola urutan halaman web yang paling sering diakses oleh visitor web e-commerce saat berinteraksi pada laman web. Pola urutan halaman yang diakses visitor menunjukkan urutan kebiasaan visitor mengunjungi e-commerce. Sedangkan teknik mining yang diimplementasikan untuk menghasilkan pola akses visitor pada penelitian ini adalah Frequence sequence mining. Hasil ekstraksi dari penelitian ini menunjukkan ada enam halaman web yang paling sering diakses oleh konsumen dengan berbagai pola urutan aksesnya.


Author(s):  
Yongjian Fu

With the rapid development of the World Wide Web or the Web, many organizations now put their information on the Web and provide Web-based services such as online shopping, user feedback, technical support, and so on. Understanding Web usage through data mining techniques is recognized as an important area.


Web Mining ◽  
2011 ◽  
pp. 373-392 ◽  
Author(s):  
Yew-Kwong Woon ◽  
Wee-Keong Ng ◽  
Ee-Peng Lim

The rising popularity of electronic commerce makes data mining an indispensable technology for several applications, especially online business competitiveness. The World Wide Web provides abundant raw data in the form of Web access logs. However, without data mining techniques, it is difficult to make any sense out of such massive data. In this chapter, we focus on the mining of Web access logs, commonly known as Web usage mining. We analyze algorithms for preprocessing and extracting knowledge from such logs. We will also propose our own techniques to mine the logs in a more holistic manner. Experiments conducted on real Web server logs verify the practicality as well as the efficiency of the proposed techniques as compared to an existing technique. Finally, challenges in Web usage mining are discussed.


Author(s):  
Murugan Anandarajan

The ubiquitous nature of the World Wide Web (commonly known as the Web) is dramatically revolutionizing the manner in which organizations and individuals alike acquire and distribute information. Recent reports from the International Data Group indicate that the number of people on the Internet will reach 320 million by the year 2002 (Needle, 1999). Studies also indicate that in the United States alone, Web commerce will account for approximately $325 billion by the year 2002.


2016 ◽  
Vol 19 (9) ◽  
pp. 1331-1348 ◽  
Author(s):  
Harsh Taneja

This article argues that maps of the Web’s structure based solely on technical infrastructure such as hyperlinks may bear little resemblance to maps based on Web usage, as cultural factors drive the latter to a larger extent. To test this thesis, the study constructs two network maps of 1000 globally most popular Web domains, one based on hyperlinks and the other using an “audience-centric” approach with ties based on shared audience traffic between these domains. Analyses of the two networks reveal that unlike the centralized structure of the hyperlink network with few dominant “core” Websites, the audience network is more decentralized and clustered to a larger extent along geo-linguistic lines.


Author(s):  
JEEVA JOSE ◽  
P. SOJAN LAL

World Wide Web has a spectacular growth not only in terms of the number of websites and volume of information, but also in terms of the number of visitors. Web log files contain tremendous information about the user traffic and behavior. A large amount of pre processing is required for eliminating the noise and is one of the challenging tasks in web usage mining. This paper proposes an indiscernibility approach in rough set theory for pre processing of web log files.


Author(s):  
A. V. Senthil Kumar ◽  
R. Umagandhi

Web Usage Mining (WUM) is the process of discovery and analysis of useful information from the World Wide Web (WWW) by applying data mining techniques. The main research area in Web mining is focused on learning about Web users and their interactions with Web sites by analysing the log entries from the user log file. The motive of mining is to find users' access models automatically and quickly from the vast Web log data, such as similar queries imposed by the various users, frequent queries applied by the user, frequent web sites visited by the users, clustering of users with similar intent etc. This chapter deals with Web mining, Categories of Web mining, Web usage mining and its process, Applications of Web usage mining across the industries and its related works. This Chapter offers a general knowledge about Web usage mining and its applications for the benefits of researchers those performing research activities in WUM.


2017 ◽  
pp. 2005-2029
Author(s):  
A. V. Senthil Kumar ◽  
R. Umagandhi

Web Usage Mining (WUM) is the process of discovery and analysis of useful information from the World Wide Web (WWW) by applying data mining techniques. The main research area in Web mining is focused on learning about Web users and their interactions with Web sites by analysing the log entries from the user log file. The motive of mining is to find users' access models automatically and quickly from the vast Web log data, such as similar queries imposed by the various users, frequent queries applied by the user, frequent web sites visited by the users, clustering of users with similar intent etc. This chapter deals with Web mining, Categories of Web mining, Web usage mining and its process, Applications of Web usage mining across the industries and its related works. This Chapter offers a general knowledge about Web usage mining and its applications for the benefits of researchers those performing research activities in WUM.


Sign in / Sign up

Export Citation Format

Share Document