WEB FARMING WITH CLICKSTREAM

In a commercial website or portal, Web information fusion usually takes one of two approaches: one integrates Web content, structure, and usage data for surfing behavior analysis; the other integrates Web usage data with traditional customer, product, and transaction data for purchasing behavior analysis. In this paper, we propose a unified model based on Web farming technology for collecting clickstream logs throughout the whole user interaction process. We emphasize that collecting clickstream logs at the application layer helps to seamlessly integrate Web usage data with other customer-related data sources. We extend the Web log standard to model a clickstream format, and we extend Web mining to Web farming, moving from passively collecting data and analyzing customer behavior to actively influencing the customer's decision making. The proposed model can be developed as a common plugin for most existing commercial websites and portals.

Performance Comparison of Data Mining Classifiers on Web Log Data

Journal of Computational and Theoretical Nanoscience ◽ 10.1166/jctn.2020.9349 ◽ 2020 ◽ Vol 17 (11) ◽ pp. 5113-5116

Author(s): Varun Malik ◽ Vikas Rattan ◽ Jaiteg Singh ◽ Ruchi Mittal ◽ Urvashi Tandon

Keyword(s): Web Mining ◽ Performance Comparison ◽ Web Usage Mining ◽ Classification Algorithms ◽ Web Content ◽ Web Usage ◽ Web Structure ◽ Web Structure Mining ◽ Content Mining ◽ The Web

Web usage mining is the branch of web mining that deals with mining usage data collected over the web. Web mining can be categorized as web content mining, web structure mining, and web usage mining. In this paper, we summarize web usage mining results obtained with WMOT (web mining optimized tool), a user tool based on WEKA that applies various classification algorithms such as Naïve Bayes, KNN, SVM, and tree-based algorithms. We summarize the results of the classification algorithms on WMOT, compare them on the basis of classified instances, and identify the algorithms that give better accuracy.

Extraction of Knowledge from Web Server Logs Using Web Usage Mining

Asian Journal of Computer Science and Technology ◽ 10.51983/ajcst-2019.8.s3.2113 ◽ 2019 ◽ Vol 8 (S3) ◽ pp. 12-15

Author(s): B. Harika ◽ T. Sudha

Keyword(s): Data Mining ◽ Web Mining ◽ Web Server ◽ Primary Source ◽ Web Usage Mining ◽ Data Mining Techniques ◽ Web Usage ◽ Browsing Behavior ◽ The Web ◽ Usage Data

Information on the internet increases rapidly from day to day, and usage of the web increases with it, so there is a need to discover interesting patterns from the web. The process of extracting and mining useful information from web documents using data mining techniques is called Web Mining. Web Mining is broadly classified into three types: Web Content Mining, Web Structure Mining, and Web Usage Mining. In this paper our focus is mainly on Web Usage Mining, where we apply data mining techniques to analyse and discover interesting knowledge from Web Usage Data. User activities are captured and stored at different levels, such as the server level, proxy level, and user level; this is called Web Usage Data. The usage data stored at the server side is the Web Server Log, which records the browsing behavior of users and their requests based on user clicks. The Web Server Log is a primary source for Web Usage Mining. This paper also discusses various existing pre-processing techniques, the analysis of web log files, and how clustering is applied to group users based on their browsing behavior and the content they are interested in.
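As a rough illustration of the pre-processing step mentioned above, the following Python sketch parses one web server log line and filters out requests that are not page views. It assumes the log uses the Common Log Format, which the abstract does not specify. The names parse_clf_line and is_page_view, and the static-asset filter itself, are hypothetical choices made for this example rather than the authors' method.

```python
import re
from datetime import datetime

# Common Log Format: host ident authuser [date] "request" status bytes
CLF_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)


def parse_clf_line(line):
    """Return a dict for one log line, or None if the line does not match."""
    match = CLF_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["time"] = datetime.strptime(record["time"], "%d/%b/%Y:%H:%M:%S %z")
    # The request field normally looks like: METHOD URL PROTOCOL
    parts = record["request"].split()
    record["method"] = parts[0] if len(parts) > 0 else None
    record["url"] = parts[1] if len(parts) > 1 else None
    record["status"] = int(record["status"])
    record["size"] = 0 if record["size"] == "-" else int(record["size"])
    return record


def is_page_view(record):
    """Assumed cleaning rule: keep successful GETs that are not static assets."""
    static_suffixes = (".css", ".js", ".png", ".jpg", ".gif", ".ico")
    return (
        record["method"] == "GET"
        and 200 <= record["status"] < 300
        and record["url"] is not None
        and not record["url"].lower().endswith(static_suffixes)
    )


sample = '192.0.2.1 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html HTTP/1.0" 200 2326'
parsed = parse_clf_line(sample)
```

Records that pass is_page_view could then be grouped per user before clustering; how users are identified (IP address, cookie, login) is a separate design decision that this sketch leaves open.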

MINING WEB DATA USING CLUSTERING TECHNIQUE FOR WEB PERSONALIZATION

International Journal of Computational Intelligence and Applications ◽ 10.1142/s1469026802000580 ◽ 2002 ◽ Vol 02 (03) ◽ pp. 255-265 ◽ Cited By ~ 4

Author(s): SUPRIYA KUMAR DE ◽ P. RADHA KRISHNA

Keyword(s): Dimensional Space ◽ Web Personalization ◽ Web Data ◽ Concept Hierarchy ◽ Web Usage ◽ Hierarchy Model ◽ Dimension Space ◽ The Relationship ◽ The Web ◽ Usage Data

Clustering of data in a high-dimensional space is of great interest in many data mining applications. In this paper, we propose a method for clustering web usage data in a high-dimensional space based on a concept hierarchy model. In this method, the relationships present in the web usage data are mapped into a fuzzy proximity relation of user transactions. We also describe an approach for presenting a preference set of URLs to a new user transaction based on its match score with the clusters. The study demonstrates that our approach is general and effective for mining web data for web personalization.

First Look on Web Mining Techniques to Improve Business Intelligence of E-Commerce Applications

Advances in Business Information Systems and Analytics - Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence ◽ 10.4018/978-1-5225-2031-3.ch018 ◽ 2017 ◽ pp. 298-314 ◽ Cited By ~ 2

Author(s): G. Sreedhar ◽ A. Anandaraja Chari

Keyword(s): Data Mining ◽ Web Mining ◽ Web Content ◽ Web Data ◽ Web Data Mining ◽ Useful Knowledge ◽ Web Usage ◽ Web Structure ◽ Web Structure Mining ◽ Content Mining

Web Data Mining is the application of data mining techniques to extract useful knowledge from web data such as web content, hyperlinks between documents, and web usage logs. There is also a strong need for techniques that support business decisions in e-commerce. Web Data Mining can be broadly divided into three categories: Web content mining, Web structure mining, and Web usage mining. Web content data is the content made available to users to satisfy their information needs. Web structure data represents the links and relationships between web pages. Web usage data consists of log data collected by the web server and application server, which is the main source of data. The growth of the WWW and related technologies has made business functions faster and easier to execute. Because a large number of transactions are performed through e-commerce sites and a huge amount of data is stored, valuable knowledge can be obtained by applying Web Mining techniques.

Traversal Pattern Mining in Web Usage Data

Data Warehousing and Mining ◽ 10.4018/978-1-59904-951-9.ch119 ◽ 2008 ◽ pp. 2004-2021

Author(s): Jenq-Foung Yao ◽ Yongqiao Xiao

Keyword(s): Pattern Mining ◽ Pattern Discovery ◽ Web Usage Mining ◽ Sequential Patterns ◽ Web Usage ◽ Web Logs ◽ Frequent Episodes ◽ Browsing Behavior ◽ The Web ◽ Usage Data

Web usage mining discovers useful patterns in web usage data, and these patterns provide useful information about users' browsing behavior. This chapter examines different types of web usage traversal patterns and the related techniques used to uncover them, including Association Rules, Sequential Patterns, Frequent Episodes, Maximal Frequent Forward Sequences, and Maximal Frequent Sequences. As a necessary step for pattern discovery, the preprocessing of the web logs is described. Some important issues, such as privacy and sessionization, are raised, and possible solutions are discussed.
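Sessionization, one of the preprocessing issues named above, can be sketched with a simple time-gap heuristic. The Python example below is a minimal illustration, not the chapter's own procedure. It assumes requests arrive as (user_id, timestamp_in_seconds, url) tuples and that a gap of more than 30 minutes starts a new session. The 30-minute threshold, the tuple layout, and the function name sessionize are all assumptions made for the example.

```python
from collections import defaultdict

SESSION_TIMEOUT_SECONDS = 30 * 60  # assumed threshold, not taken from the chapter


def sessionize(requests, timeout=SESSION_TIMEOUT_SECONDS):
    """Group (user_id, timestamp_seconds, url) tuples into per-user sessions.

    A new session starts when the gap between two consecutive requests
    from the same user exceeds `timeout`.
    """
    by_user = defaultdict(list)
    for user_id, timestamp, url in requests:
        by_user[user_id].append((timestamp, url))

    sessions = []
    for user_id, hits in by_user.items():
        hits.sort()  # order each user's requests by time
        current = [hits[0][1]]
        for (prev_time, _), (time, url) in zip(hits, hits[1:]):
            if time - prev_time > timeout:
                sessions.append((user_id, current))
                current = []
            current.append(url)
        sessions.append((user_id, current))
    return sessions


requests = [
    ("u1", 0, "/a"),
    ("u1", 60, "/b"),
    ("u1", 4000, "/c"),  # more than 30 minutes after the previous hit
    ("u2", 10, "/a"),
]
sessions = sessionize(requests)
```

Each resulting session is an ordered list of URLs, which is the usual input for traversal pattern mining such as sequential pattern discovery.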

Statistical Methods for User Profiling in Web Usage Mining

Handbook of Research on Text and Web Mining Technologies ◽ 10.4018/978-1-59904-990-8.ch022 ◽ 2010 ◽ pp. 359-368 ◽ Cited By ~ 1

Author(s): Marcello Pecoraro

Keyword(s): Statistical Methods ◽ Web Mining ◽ Web Usage Mining ◽ User Profiling ◽ Web Usage ◽ Data Object ◽ Segmentation Methods ◽ Binary Segmentation ◽ Usage Analysis ◽ The Web

This chapter provides an overview of the use of statistical methods supporting Web Usage Mining. The first part describes the framework of Web Usage Mining as a branch of Web Mining committed to the study of how a Website is used. Then the data (the object of the analysis) are detailed, together with the problems linked to pre-processing. Once the origin of the data and their treatment for a correct Web Usage analysis have been clarified, the focus shifts to the statistical techniques that can be applied in this setting, with reference to binary segmentation methods. The latter discriminate through a response variable that determines the affiliation of users to a group by considering characteristics observed on those users.

Discover Patterns from Web-Based Dataset

Advances in Data Mining and Database Management - Web Data Mining and the Development of Knowledge-Based Decision Support Systems ◽ 10.4018/978-1-5225-1877-8.ch006 ◽ 2017 ◽ pp. 78-106

Author(s): Raghvendra Kumar ◽ Priyanka Pandey ◽ Prasant Kumar Pattnaik

Keyword(s): Web Site ◽ Web Mining ◽ Research Work ◽ Web Content ◽ Web Based ◽ Web Structure ◽ Information Matching ◽ Varied Range ◽ Conceptual Link ◽ The Web

The Web can be defined as a repository of a varied range of information, present in the form of millions of websites dispersed around us. Users often find it difficult to locate information that meets their needs among the abundant number of websites on the Web. Hence, much research has been conducted in the field of Web Mining in order to present information matching the user's needs. Web mining can be defined as the application of data mining techniques to web usage, web content, or web structure data to find useful information, such as users' access patterns and overall website usage statistics. The main reason behind the development of such websites was to personalize the content of a website according to users' preferences. New methods are developed to deal with a Web site using a link hierarchy and a conceptual link hierarchy, respectively, on the basis of how users have used the Web site's link structure.

Traversal Pattern Mining in Web Usage Data

Web Information Systems ◽ 10.4018/978-1-59140-208-4.ch010 ◽ 2004 ◽ pp. 335-358 ◽ Cited By ~ 2

Author(s): Yongqiao Xiao ◽ Jenq-Foung (J.F.) Yao

Keyword(s): Pattern Mining ◽ Pattern Discovery ◽ Web Usage Mining ◽ Sequential Patterns ◽ Web Usage ◽ Web Logs ◽ Frequent Episodes ◽ Browsing Behavior ◽ The Web ◽ Usage Data

Web Mining for Public E-Services Personalization

Electronic Services ◽ 10.4018/978-1-61520-967-5.ch045 ◽ 2010 ◽ pp. 751-758

Author(s): P. Markellou

Keyword(s): Web Site ◽ Web Mining ◽ Web Usage Mining ◽ Web Content ◽ Web Personalization ◽ Business Transactions ◽ Web Structure ◽ Web Structure Mining ◽ Content Mining ◽ The Web

Over the last decade, we have witnessed an explosive growth in the information available on the Web. Today, Web browsers provide easy access to myriad sources of text and multimedia data. Search engines index more than a billion pages, and finding the desired information is not an easy task. This profusion of resources has prompted the need for developing automatic mining techniques on the Web, thereby giving rise to the term “Web mining” (Pal, Talwar, & Mitra, 2002). Web mining is the application of data mining techniques on the Web for discovering useful patterns and can be divided into three basic categories: Web content mining, Web structure mining, and Web usage mining. Web content mining includes techniques for assisting users in locating Web documents (i.e., pages) that meet certain criteria, while Web structure mining relates to discovering information based on the Web site structure data (the data depicting the Web site map). Web usage mining focuses on analyzing Web access logs and other sources of information regarding user interactions within the Web site in order to capture, understand, and model their behavioral patterns and profiles and thereby improve their experience with the Web site. As citizens' requirements and needs change continuously, traditional information searching and fulfillment of various tasks result in the loss of valuable time spent identifying the responsible actor (public authority) and waiting in queues. At the same time, the percentage of users who are acquainted with the Internet has increased remarkably (Internet World Stats, 2005). These two facts motivate many governmental organizations to proceed with the provision of e-services via their Web sites. The ease and speed with which business transactions can be carried out over the Web has been a key driving force in the rapid growth and popularity of e-government, e-commerce, and e-business applications.
In this framework, the Web is emerging as the appropriate environment for business transactions and user-organization interactions. However, since it is a large collection of semi-structured and structured information sources, Web users often suffer from information overload. Personalization is considered a popular solution to alleviate this problem and to customize the Web environment to users (Eirinaki & Vazirgiannis, 2003). Web personalization can be described as any action that makes the Web experience of a user personalized to his or her needs and wishes. Principal elements of Web personalization include modeling of Web objects (pages) and subjects (users), categorization of objects and subjects, matching between and across objects and/or subjects, and determination of the set of actions to be recommended for personalization. In the remainder of this article, we present the way an e-government application can deploy Web mining techniques in order to support intelligent and personalized interactions with citizens. Specifically, we describe the tasks that typically comprise this process, illustrate the future trends, and discuss the open issues in the field.

A Review on Design and Development Of Sequential Patterns Algorithms In Web Usage Mining

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽ 10.17762/turcomat.v12i2.1448 ◽ 2021 ◽ Vol 12 (2) ◽ pp. 1634-1639

Author(s): V Aruna et al.

Keyword(s): Web Mining ◽ Pattern Mining ◽ Pattern Discovery ◽ Relevant Information ◽ Sequential Pattern Mining ◽ Web Usage Mining ◽ Sequential Pattern ◽ Web Usage ◽ Hidden Data ◽ The Web

In recent years, with the advancement of technology, a great deal of information has become available in different formats, and extracting knowledge from that data has become a very difficult task. Due to the vast amount of information available on the web, users find it difficult to extract relevant information or create new knowledge from it. To solve this problem, Web mining techniques are used to discover interesting patterns in the hidden data. Web Usage Mining (WUM), a subset of Web Mining, helps in extracting the hidden knowledge present in Web log files, in recognizing the various interests of web users, and in discovering customer behaviours. Web Usage Mining includes three phases: Data Pre-processing, Pattern Discovery, and Pattern Analysis. This paper presents an updated, focused survey of various sequential pattern mining algorithms used in the Pattern Discovery phase of WUM, including Apriori-based algorithms, Breadth First Search-based strategies, Depth First Search strategies, sequential closed-pattern algorithms, and incremental pattern mining algorithms. Finally, a comparison is made based on the key features of these algorithms. This study gives a better understanding of the approaches to sequential pattern mining.
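To make the Apriori-based idea concrete, the Python sketch below mines frequent page sequences level by level: sequences of length k+1 are generated only by extending frequent sequences of length k. It is a simplified illustration under stated assumptions, not one of the surveyed algorithms. Sessions are plain lists of URLs, support is the number of sessions containing the pattern as an ordered subsequence (gaps allowed), and the names is_subsequence, support, and frequent_sequences are hypothetical.

```python
def is_subsequence(pattern, session):
    """True if the pages of `pattern` occur in `session` in order (gaps allowed)."""
    remaining = iter(session)
    # `page in remaining` advances the iterator, so order is enforced.
    return all(page in remaining for page in pattern)


def support(pattern, sessions):
    """Number of sessions that contain `pattern` as a subsequence."""
    return sum(is_subsequence(pattern, s) for s in sessions)


def frequent_sequences(sessions, min_support, max_length=3):
    """Level-wise (Apriori-style) search for frequent page sequences."""
    pages = sorted({page for s in sessions for page in s})
    current = [(p,) for p in pages if support((p,), sessions) >= min_support]
    frequent_pages = [pattern[0] for pattern in current]

    frequent = {}
    length = 1
    while current and length <= max_length:
        for pattern in current:
            frequent[pattern] = support(pattern, sessions)
        # Extend only frequent patterns, and only with frequent pages.
        candidates = [pattern + (p,) for pattern in current for p in frequent_pages]
        current = [c for c in candidates if support(c, sessions) >= min_support]
        length += 1
    return frequent


sessions = [
    ["/home", "/products", "/cart"],
    ["/home", "/cart"],
    ["/products", "/home", "/cart"],
]
result = frequent_sequences(sessions, min_support=2)
```

The pruning relies on the fact that a sequence cannot be frequent unless its prefix and its last page are frequent. This sketch recounts support by rescanning all sessions at every level, which is simple but slow on large logs.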
