Multiple evidence combination for web site search using server log analysis

10.32920/ryerson.14658057.v1 ◽

2021 ◽

Author(s):

Jin Zhou

Keyword(s):

Web Server ◽

Log Analysis ◽

Original Text ◽

Web Page ◽

Retrieval Performance ◽

Site Search ◽

Combination Methods ◽

Novel Method ◽

Server Logs ◽

Web Server Logs

In this thesis, a novel method is proposed to improve the retrieval performance by using web server logs. Web server logs are grouped into different sessions and then terms are extracted for each page in the session, meanwhile weights of terms are calculated. A new representation of web page from user's perspective is generated after going through the entire log. The new representation and the anchor-based representation are combined with original text-based representation. Two combination methods: combination of document representations and combination of ranking scores are investigated. In the experiments, three measurements are employed to evaluate the performance and the results show that for Cosine Similarity model, the highest improvement on top-10 precision is around 38%, for Okapi model, the hightest improvement is around 13%, for TFIDF model, the highest improvement is around 48% and for Indri model, the highest improvement is around 17%.

Download Full-text

Improving web site search using web server logs

10.1145/1188966.1188996 ◽

2006 ◽

Cited By ~ 4

Author(s):

Jin Zhou ◽

Chen Ding ◽

Dimitrios Androutsos

Keyword(s):

Web Site ◽

Web Server ◽

Site Search ◽

Server Logs ◽

Web Server Logs

Download Full-text

Improving Website by Analysis of Web Server Logs Using Web Mining Tools

Advances in Information Communication Technology and Computing - Lecture Notes in Networks and Systems ◽

10.1007/978-981-15-5421-6_50 ◽

2020 ◽

pp. 525-531

Author(s):

Neeraj Kandpal ◽

Devesh Kumar Bandil ◽

M. S. Shekhawat

Keyword(s):

Web Mining ◽

Web Server ◽

Server Logs ◽

Web Server Logs ◽

Mining Tools

Download Full-text

Effectively Capturing User Navigation Paths in the Web Using Web Server Logs

Lecture Notes in Computer Science - Web Engineering ◽

10.1007/11531371_11 ◽

2005 ◽

pp. 63-68

Author(s):

Amithalal Caldera ◽

Yogesh Deshpande

Keyword(s):

Web Server ◽

Server Logs ◽

Web Server Logs ◽

The Web ◽

User Navigation

Download Full-text

Big Data Classification of Users Navigation and Behavior Using Web Server Logs

2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) ◽

10.1109/iccubea.2018.8697606 ◽

2018 ◽

Cited By ~ 3

Author(s):

Prajakta Ghavare ◽

Prashant Ahire

Keyword(s):

Big Data ◽

Web Server ◽

Data Classification ◽

Server Logs ◽

Big Data Classification ◽

And Behavior ◽

Web Server Logs

Download Full-text

Building and Evaluating an Adaptive Smart Web Pages

Journal of Communications and Computer Engineering ◽

10.20454/jcce.2013.306 ◽

2012 ◽

Vol 3 (1) ◽

pp. 30

Author(s):

Mona M. Abu Al-Khair ◽

M. Koutb ◽

H. Kelash

Keyword(s):

Web Server ◽

Web Pages ◽

Server Logs ◽

Web Server Logs ◽

The Web

Each year the number of consumers and the variety of their interests increase. As a result, providers are seeking ways to infer the customer's interests and to adapt their websites to make the content of interest more easily accessible. Assume that past navigation behavior as an indicator of the user's interests. Then, the records of this behavior, kept in the web-server logs, can be mined to extract the user's interests. On this principal, recommendations can be generated, to help old and new website's visitors to find the information about their interest faster.

Download Full-text

Agglomerative Approach for Identification and Elimination of Web Robots from Web Server Logs to Extract Knowledge about Actual Visitors

Journal of Data Analysis and Information Processing ◽

10.4236/jdaip.2015.31001 ◽

2015 ◽

Vol 03 (01) ◽

pp. 1-10 ◽

Cited By ~ 11

Author(s):

Dilip Singh Sisodia ◽

Shrish Verma ◽

Om Prakash Vyas

Keyword(s):

Web Server ◽

Server Logs ◽

Web Server Logs

Download Full-text

Data Mining applied on Web Robots Detection: A Systematic Mapping

10.21528/cbic2021-60 ◽

2021 ◽

Author(s):

Ramon Abilio ◽

Cristiano Garcia ◽

Victor Fernandes

Keyword(s):

Machine Learning ◽

Data Mining ◽

Learning Algorithms ◽

Web Server ◽

Machine Learning Algorithms ◽

Web Pages ◽

Systematic Mapping ◽

Mining Methods ◽

Server Logs ◽

Web Server Logs

Browsing on Internet is part of the world population’s daily routine. The number of web pages is increasing and so is the amount of published content (news, tutorials, images, videos) provided by them. Search engines use web robots to index web contents and to offer better results to their users. However, web robots have also been used for exploiting vulnerabilities in web pages. Thus, monitoring and detecting web robots’ accesses is important in order to keep the web server as safe as possible. Data Mining methods have been applied to web server logs (used as data source) in order to detect web robots. Then, the main objective of this work was to observe evidences of definition or use of web robots detection by analyzing web server-side logs using Data Mining methods. Thus, we conducted a systematic Literature mapping, analyzing papers published between 2013 and 2020. In the systematic mapping, we analyzed 34 studies and they allowed us to better understand the area of web robots detection, mapping what is being done, the data used to perform web robots detection, the tools, and algorithms used in the Literature. From those studies, we extracted 33 machine learning algorithms, 64 features, and 13 tools. This study is helpful for researchers to find machine learning algorithms, features, and tools to detect web robots by analyzing web server logs.

Download Full-text

Preprocessing of Web Server Logs from Online Newspaper

International Conference on Measurement and Control Engineering 2nd (ICMCE 2011) ◽

10.1115/1.859858.paper15 ◽

2011 ◽

pp. 93-97

Keyword(s):

Web Server ◽

Online Newspaper ◽

Server Logs ◽

Web Server Logs

Download Full-text

From Editors' Choice to Readers’ Favorites: Analyzing Server Logs of China's Biggest Online Newspaper

Proceedings of the Annual Conference of CAIS / Actes du congrès annuel de l'ACSI ◽

10.29173/cais760 ◽

2013 ◽

Author(s):

Yijun Gao

Keyword(s):

Web Server ◽

Online Newspaper ◽

People’S Daily ◽

The Common ◽

Server Logs ◽

People's Daily ◽

Web Server Logs ◽

The Web

This study analyzed the Web server logs from the People's Daily Online and revealed some interesting findings: Pageview numbers of the mportant news in editors’ mind on the most obvious sections of the homepage, are not significantly different than those of the "common" news put on the less obvious sections.Cette étude a porté sur l'analyse des fichiers de journalisation de serveurs Web du Quotidien du Peuple en ligne et a révélé quelques données intéressantes : le nombre de pages vues pour les dépêches jugées importantes par la rédaction et placées en évidence de la page d'accueil n'est pas significativement différent du nombre de pages vues pour les dépêches plus « courantes » placés moins en évidence.

Download Full-text