Extracting knowledge from web server logs using web usage mining

Author(s):  
Mirghani. A. Eltahir ◽  
Anour F. A. Dafa-Alla

Data reduction is the process of minimizing the amount of data that needs to be stored in a data storage environment. Data reduction can increase storage efficiency and reduce costs. Data cleaning plays a central role in data preprocessing and web usage mining. In existing work on cleaning web server logs, irrelevant items and useless data cannot be completely removed, and overlapping data causes difficulty when retrieving data from the database. In this paper, we present an ant-based pattern clustering algorithm to obtain pattern data for mining. We also present a log cleaner that can filter out a large amount of irrelevant and inconsistent data based on the characteristics of their URLs. Essentially, we remove unwanted records using the k-means clustering algorithm. This methodology can be applied to e-commerce platforms such as Amazon and Flipkart.
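
To make the cleaning step concrete, below is a minimal sketch of a log cleaner in Python, assuming web server logs in Common Log Format; the filename, regular expression, and suffix filter are illustrative assumptions, not the paper's implementation.

```python
import re

# Minimal log-cleaner sketch: drop static-resource requests and failed
# responses from Common Log Format entries. The pattern and filters
# below are illustrative, not the paper's exact cleaning rules.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) \S+'
)
IRRELEVANT_SUFFIXES = ('.jpg', '.png', '.gif', '.css', '.js', '.ico')

def clean_log(lines):
    """Yield (host, time, url) for relevant, successful page requests."""
    for line in lines:
        m = LOG_PATTERN.match(line)
        if m is None:
            continue                        # malformed entry
        url = m.group('url').split('?')[0]  # drop the query string
        if url.lower().endswith(IRRELEVANT_SUFFIXES):
            continue                        # static resource, not a page view
        if not 200 <= int(m.group('status')) < 400:
            continue                        # failed request
        yield m.group('host'), m.group('time'), url

# 'access.log' is a placeholder path for illustration.
with open('access.log') as f:
    records = list(clean_log(f))
```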


Author(s):  
P. K. Nizar Banu ◽  
H. Inbarani

As websites increase in complexity, locating needed information becomes a difficult task. Such difficulty is often related not only to the websites’ design but also to ineffective and inefficient navigation processes. Research in web mining addresses this problem by applying techniques from data mining and machine learning to web data and documents. In this study, the authors examine web usage mining, applying data mining techniques to web server logs. Web usage mining has gained much attention as a potential approach to fulfill the requirement of web personalization. In this paper, the authors propose K-means biclustering, rough biclustering and fuzzy biclustering approaches to disclose the duality between users and pages by grouping them in both dimensions simultaneously. The simultaneous clustering of users and pages discovers biclusters that correspond to groups of users that exhibit highly correlated ratings on groups of pages. The results indicate that the fuzzy C-means biclustering algorithm performs best and is able to detect partial matching of preferences.
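
As an illustration of the user–page duality, the sketch below builds a user × page visit matrix from log records and clusters both dimensions with plain k-means — a simple stand-in for the K-means, rough, and fuzzy biclustering variants the paper proposes; the matrix construction, toy records, and cluster counts are assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans

def user_page_matrix(records):
    """records: iterable of (user, page) pairs taken from web server logs."""
    users = sorted({u for u, _ in records})
    pages = sorted({p for _, p in records})
    u_idx = {u: i for i, u in enumerate(users)}
    p_idx = {p: j for j, p in enumerate(pages)}
    M = np.zeros((len(users), len(pages)))
    for u, p in records:
        M[u_idx[u], p_idx[p]] += 1  # visit counts serve as implicit "ratings"
    return M, users, pages

# Toy records for illustration only.
records = [('u1', '/home'), ('u1', '/docs'), ('u2', '/home'),
           ('u2', '/cart'), ('u3', '/docs'), ('u3', '/home')]
M, users, pages = user_page_matrix(records)

user_labels = KMeans(n_clusters=2, n_init=10).fit_predict(M)    # group users
page_labels = KMeans(n_clusters=2, n_init=10).fit_predict(M.T)  # group pages
# A bicluster is a (user group, page group) pair whose submatrix is dense.
```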


2012 ◽  
Vol 3 (1) ◽  
pp. 30
Author(s):  
Mona M. Abu Al-Khair ◽  
M. Koutb ◽  
H. Kelash

Each year the number of consumers and the variety of their interests increase. As a result, providers are seeking ways to infer customers' interests and to adapt their websites to make the content of interest more easily accessible. Assuming that past navigation behavior is an indicator of a user's interests, the records of this behavior, kept in the web-server logs, can be mined to extract those interests. On this principle, recommendations can be generated to help both new and returning website visitors find information about their interests faster.
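
As a hedged sketch of this principle, the following Python code counts how often pages are co-visited within sessions reconstructed from the web-server logs and recommends the most frequent co-visits; the toy session data and the scoring are illustrative, not the authors' exact method.

```python
from collections import defaultdict
from itertools import combinations

def co_visit_counts(sessions):
    """sessions: list of page lists, one per reconstructed user session."""
    counts = defaultdict(lambda: defaultdict(int))
    for pages in sessions:
        for a, b in combinations(set(pages), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return counts

def recommend(counts, page, k=3):
    """Top-k pages most often co-visited with `page`."""
    ranked = sorted(counts[page].items(), key=lambda x: -x[1])
    return [p for p, _ in ranked[:k]]

# Toy sessions for illustration only.
sessions = [['/home', '/laptops', '/cart'],
            ['/home', '/laptops', '/reviews'],
            ['/laptops', '/reviews']]
counts = co_visit_counts(sessions)
print(recommend(counts, '/laptops'))  # e.g. ['/home', '/reviews', '/cart']
```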


2021 ◽  
Author(s):  
Ramon Abilio ◽  
Cristiano Garcia ◽  
Victor Fernandes

Browsing the Internet is part of the world population’s daily routine. The number of web pages is increasing, and so is the amount of published content (news, tutorials, images, videos) they provide. Search engines use web robots to index web content and to offer better results to their users. However, web robots have also been used to exploit vulnerabilities in web pages. Thus, monitoring and detecting web robots’ accesses is important in order to keep the web server as safe as possible. Data mining methods have been applied to web server logs (used as a data source) in order to detect web robots. The main objective of this work was therefore to observe evidence of the definition or use of web robot detection by analyzing web server-side logs using data mining methods. To that end, we conducted a systematic literature mapping, analyzing papers published between 2013 and 2020. In the systematic mapping, we analyzed 34 studies, which allowed us to better understand the area of web robot detection, mapping what is being done, the data used to perform web robot detection, and the tools and algorithms used in the literature. From those studies, we extracted 33 machine learning algorithms, 64 features, and 13 tools. This study is helpful for researchers seeking machine learning algorithms, features, and tools to detect web robots by analyzing web server logs.
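
As a rough illustration of the pipeline such studies describe, the sketch below computes a few session-level features often cited for web robot detection and trains a classifier; the feature set, toy labels, and classifier choice are assumptions for illustration, not the mapping study's definitive list.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def session_features(requests):
    """requests: list of (url, method, status) tuples for one session."""
    n = len(requests)
    return [
        n,                                                      # request volume
        sum(m == 'HEAD' for _, m, _ in requests) / n,           # HEAD ratio
        sum(u.endswith('robots.txt') for u, _, _ in requests),  # robots.txt hits
        sum(s == 404 for _, _, s in requests) / n,              # error ratio
    ]

# Toy labeled sessions for illustration only: 1 = robot, 0 = human.
sessions = [
    [('/robots.txt', 'GET', 200), ('/a', 'HEAD', 200), ('/b', 'HEAD', 404)],
    [('/home', 'GET', 200), ('/docs', 'GET', 200)],
]
X = np.array([session_features(s) for s in sessions])
y = np.array([1, 0])

clf = RandomForestClassifier(n_estimators=50).fit(X, y)
print(clf.predict(X))
```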

