Historic Perspective of Log Analysis

Privacy Issues

This historical review of the birth and evolution of transaction log analysis applied to information retrieval systems provides two perspectives: first, a detailed discussion of the early work in this area, and second, an account of how this work has migrated into the evaluation of World Wide Web usage. The author describes the techniques and studies of the early years and suggests how that knowledge can be applied to current and future studies. A discussion of privacy issues, with a framework for addressing them, is presented, as well as an overview of the historical “eras” of transaction log analysis. The author concludes that combining transaction log analysis of the type used early in its application with additional, more qualitative approaches will be essential for a deep understanding of user behavior (and needs) with respect to current and future retrieval systems and their design.

Analysis of Document Viewing Patterns of Web Search Engine Users

Web Mining ◽

10.4018/978-1-59140-414-9.ch016 ◽

2011 ◽

pp. 339-354 ◽

Cited By ~ 6

Author(s):

Bernard J. Jansen ◽

Amanda Spink

Keyword(s):

Information Seeking ◽

Web Search ◽

Real Data ◽

Temporal Analysis ◽

Log Analysis ◽

Web Page ◽

Retrieval Systems ◽

Web Information ◽

Information Interaction ◽

This chapter reviews the concepts of Web results page and Web page viewing patterns by users of Web search engines. It presents the advantages of using traditional transaction log analysis in identifying these patterns, serving as a basis for Web usage mining. The authors also present the results of a temporal analysis of Web page viewing, illustrating that the user-information interaction is extremely short. By using real data collected from real users interacting with real Web information retrieval systems, the authors aim to highlight one aspect of the complex environment of Web information seeking.

Transliteration: A Magnetic Analysis

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38742 ◽

2021 ◽

Vol 9 (11) ◽

pp. 85-86

Author(s):

Dr. Rudra Prasad Mishra

Keyword(s):

Information Exchange ◽

World Wide ◽

Critical Role ◽

Foreign Languages ◽

Abstract Machine ◽

Retrieval Systems ◽

Magnetic Analysis ◽

The World ◽

Cross Lingual

Machine transliteration is an important problem in an increasingly multilingual world, as it plays a critical role in many downstream applications such as machine translation or cross-lingual information retrieval systems. There is now a vast amount of information accessible via the Internet, where a lot of regional and cultural information is put on the World Wide Web in different languages and scripts. There are more than six thousand living languages in the world. Adding to the diversity is the fact that some languages are written in different scripts in different regions of the world. The multitude of foreign languages and mutually incomprehensible scripts of the same language pose a barrier to information exchange, as we cannot all learn every language or script in use worldwide. Therefore, if we can get around the language barrier, or at least the script barrier, we can access much more of the world's culture and can explore its abundant richness. Keywords: Transliteration, Translation, Cross-lingual, Multilingual, Language, Script

Adding knowledge to information retrieval systems in the World Wide Web

Artificial Intelligence in Medicine - Lecture Notes in Computer Science ◽

10.1007/bfb0029483 ◽

1997 ◽

pp. 491-500

Author(s):

G. Mann ◽

M. Schubert ◽

V. Schaeffler

Keyword(s):

Information Retrieval ◽

World Wide Web ◽

World Wide ◽

Retrieval Systems ◽

The World ◽

The study of user behavior on information retrieval systems

ACM SIGCUE Outlook ◽

10.1145/964080.964086 ◽

1987 ◽

Vol 19 (3-4) ◽

pp. 35-48 ◽

Cited By ~ 2

Author(s):

Christine L. Borgman

Keyword(s):

Information Retrieval ◽

User Behavior ◽

Retrieval Systems ◽

Database Query Personalization

Encyclopedia of Database Technologies and Applications ◽

10.4018/978-1-59140-560-3.ch025 ◽

2005 ◽

pp. 147-152

Author(s):

Georgia Koutrika

Keyword(s):

World Wide ◽

Information Access ◽

User Interaction ◽

New Era ◽

Content Selection ◽

Retrieval Systems ◽

The World ◽

Different Levels ◽

Growing Mass

Traditional database and information retrieval systems have followed a query-based information access paradigm (i.e., information is returned to the user on the basis of a query issued). As a result, users issuing the same query are provided with the same answer. With the advent of the World Wide Web and hand-held electronic devices such as palmtops and cellular phones, information access entered a new era. Increasing amounts of information become available to a growing mass of untrained lay users through various access media. A user searching Web-resident information may have to reformulate queries several times and sift through many results until a satisfactory answer, if any, is obtained. As purely query-driven approaches may be inappropriate in this context, the need for a shift towards a more user-centered information access paradigm arises. To this end, different approaches aim at personalizing the overall user experience at different levels: content selection, content presentation, and user interaction.

Structural Text Mining

Encyclopedia of Information Science and Technology, First Edition ◽

10.4018/978-1-59140-553-5.ch472 ◽

2005 ◽

pp. 2658-2661

Author(s):

Vladimir A. Kulyukin ◽

John A. Nicholson

Keyword(s):

Information Retrieval ◽

Natural Language Processing ◽

Text Mining ◽

Language Processing ◽

World Wide ◽

Knowledge Bases ◽

Domain Specific ◽

Retrieval Systems ◽

The World ◽

The advent of the World Wide Web has resulted in the creation of millions of documents containing unstructured, structured and semi-structured data. Consequently, research on structural text mining has come to the forefront of both information retrieval and natural language processing (Cardie, 1997; Freitag, 1998; Hammer, Garcia-Molina, Cho, Aranha, & Crespo, 1997; Hearst, 1992; Hsu & Chang, 1999; Jacquemin & Bush, 2000; Kushmerick, Weld, & Doorenbos, 1997). Knowledge of how information is organized and structured in texts can be of significant assistance to information systems that use documents as their knowledge bases (Appelt, 1999). In particular, such knowledge is of use to information retrieval systems (Salton & McGill, 1983) that retrieve documents in response to user queries and to systems that use texts to construct domain-specific ontologies or thesauri (Ruge, 1997).

Relevance Predictability in Information Retrieval Systems

Methods of Information in Medicine ◽

10.1055/s-0038-1636254 ◽

1967 ◽

Vol 06 (02) ◽

pp. 45-51 ◽

Cited By ~ 6

Author(s):

A. Kent ◽

J. Belzer ◽

M. Kuhfeerst ◽

E. D. Dym ◽

D. L. Shirey ◽

...

Keyword(s):

Information Retrieval ◽

Experimental Conditions ◽

Treatment Groups ◽

Retrieval Systems ◽

Significant Difference ◽

High Predictability ◽

Intermediate Response ◽

Quantitative Indicators ◽

Level Of Processing

An experiment is described which attempts to derive quantitative indicators regarding the potential relevance predictability of the intermediate stimuli used to represent documents in information retrieval systems. In effect, since the decision to peruse an entire document is often predicated upon the examination of one »level of processing« of the document (e.g., the citation and/or abstract), it became interesting to analyze the properties of what constitutes »relevance«. However, prior to such an analysis, an even more elementary step had to be made, namely, to determine what portions of a document should be examined.

An evaluation of the ability of intermediate response products (IRPs), functioning as cues to the information content of full documents, to predict the relevance determination that would be subsequently made on these documents by motivated users of information retrieval systems, was made under controlled experimental conditions. The hypothesis that there might be other intermediate response products (selected extracts from the document, i.e., first paragraph, last paragraph, and the combination of first and last paragraph), that would be as representative of the full document as the traditional IRPs (citation and abstract) was tested systematically. The results showed that:

1. there is no significant difference among the several IRP treatment groups on the number of cue evaluations of relevancy which match the subsequent user relevancy decision on the document;
2. first and last paragraph combinations have consistently predicted relevancy to a higher degree than the other IRPs;
3. abstracts were undistinguished as predictors; and
4. the apparent high predictability rating for citations was not substantive.

Some of these results are quite different than would be expected from previous work with unmotivated subjects.

Methods for Evaluating Interactive Information Retrieval Systems with Users

10.1561/9781601982254 ◽

2007 ◽

Author(s):

Diane Kelly

Keyword(s):

Information Retrieval ◽

Interactive Information Retrieval ◽

Retrieval Systems ◽

Performance of eight information retrieval systems in the Spanish web

El profesional de la información ◽

10.3145/epi.2005.sep.03 ◽

2005 ◽

Vol 14 (5) ◽

pp. 335-346

Author(s):

Carlos Benito Amat

Keyword(s):

Information Retrieval ◽

Retrieval Systems ◽

A Dialectical Approach to Search Engine Evaluation

Libri ◽

10.1515/libri-2019-0142 ◽

2020 ◽

Vol 70 (3) ◽

pp. 227-237

Author(s):

Mahdi Zeynali-Tazehkandi ◽

Mohsen Nowkarizi

Keyword(s):

Information Retrieval ◽

Information Science ◽

Library And Information Science ◽

Related Literature ◽

Philosophical Foundations ◽

Dialectical Approach ◽

Retrieval Systems ◽

The World ◽

Oriented Approach

Evaluation of information retrieval systems is a fundamental topic in Library and Information Science. The aim of this paper is to connect the system-oriented and the user-oriented approaches to relevant philosophical schools. By reviewing the related literature, it was found that the evaluation of information retrieval systems is successful if it benefits from both system-oriented and user-oriented approaches (composite). The system-oriented approach is rooted in Parmenides’ philosophy of stability (immovable) which Plato accepts and attributes to the world of forms; the user-oriented approach is rooted in Heraclitus’ flux philosophy (motion) which Plato defers and attributes to the tangible world. Thus, using Plato’s theory is a comprehensive approach for recognizing the concept of relevance. The theoretical and philosophical foundations determine the type of research methods and techniques. Therefore, Plato’s dialectical method is an appropriate composite method for evaluating information retrieval systems.