Digital forensic text string searching: Improving information retrieval effectiveness by thematically clustering search results

2010 ◽

pp. 108-121

Author(s):

Ming Xu ◽

Hong-Rong Yang ◽

Ning Zheng

Keyword(s):

Experimental Results ◽

Search Process ◽

Improved Method ◽

User Interest ◽

High Recall ◽

String Match ◽

Electronic Evidence ◽

Search Results ◽

Text String ◽

Digital Forensic

It is a pivotal task for a forensic investigator to search a hard disk to find interesting evidences. Currently, most search tools in digital forensic field, which utilize text string match and index technology, produce high recall (100%) and low precision. Therefore, the investigators often waste vast time on huge irrelevant search hits. In this chapter, an improved method for ranking of search results was proposed to reduce human efforts on locating interesting hits. The K-UIH (the keyword and user interest hierarchies) was constructed by both investigator-defined keywords and user interest learnt from electronic evidence adaptive, and then the K-UIH was used to re-rank the search results. The experimental results indicated that the proposed method is feasible and valuable in digital forensic search process.

Download Full-text

Model for Evaluation of Information Retrieval Effectiveness within Semantic Web Concept

Proceedings of Universities ELECTRONICS ◽

10.24151/1561-5405-2018-23-3-308-312 ◽

2018 ◽

Vol 23 (3) ◽

pp. 308-312

Author(s):

V.V. Sliusar ◽

Keyword(s):

Information Retrieval ◽

Semantic Web ◽

Retrieval Effectiveness

Download Full-text

Experiments with Language-based Aids in Information Retrieval Systems

Nordic Journal of Linguistics ◽

10.1017/s0332586500001736 ◽

1988 ◽

Vol 11 (1-2) ◽

pp. 33-46 ◽

Cited By ~ 2

Author(s):

Tove Fjeldvig ◽

Anne Golden

Keyword(s):

Information Retrieval ◽

Text Retrieval ◽

Considerable Improvement ◽

Controlled Experiments ◽

Compound Words ◽

Search Results ◽

Retrieval Systems ◽

Complete Search ◽

Information Retrieval Systems ◽

Search Quality

The fact that a lexeme can appear in various forms causes problems in information retrieval. As a solution to this problem, we have developed methods for automatic root lemmatization, automatic truncation and automatic splitting of compound words. All the methods have as their basis a set of rules which contain information regarding inflected and derived forms of words – and not a dictionary. The methods have been tested on several collections of texts, and have produced very good results. By controlled experiments in text retrieval, we have studied the effects on search results. These results show that both the method of automatic root lemmatization and the method of automatic truncation make a considerable improvement on search quality. The experiments with splitting of compound words did not give quite the same improvement, however, but all the same this experiment showed that such a method could contribute to a richer and more complete search request.

Download Full-text

Post-retrieval search hit clustering to improve information retrieval effectiveness: Two digital forensics case studies

Decision Support Systems ◽

10.1016/j.dss.2011.01.009 ◽

2011 ◽

Vol 51 (4) ◽

pp. 732-744 ◽

Cited By ~ 17

Author(s):

Nicole Lang Beebe ◽

Jan Guynes Clark ◽

Glenn B. Dietrich ◽

Myung S. Ko ◽

Daijin Ko

Keyword(s):

Information Retrieval ◽

Case Studies ◽

Digital Forensics ◽

Retrieval Effectiveness

Download Full-text

Intelligent Information Retrieval Using Hybrid of Fuzzy Set and Trust

Oriental journal of computer science and technology ◽

10.13005/ojcst/10.02.09 ◽

2017 ◽

Vol 10 (2) ◽

pp. 311-325

Author(s):

Suruchi Chawla

Keyword(s):

Information Retrieval ◽

Fuzzy Set ◽

Query Expansion ◽

Information Need ◽

Data Set ◽

Search Results ◽

Intelligent Information Retrieval ◽

Main Challenge ◽

Fuzzy Query ◽

Intelligent Information

The main challenge for effective web Information Retrieval(IR) is to infer the information need from user’s query and retrieve relevant documents. The precision of search results is low due to vague and imprecise user queries and hence could not retrieve sufficient relevant documents. Fuzzy set based query expansion deals with imprecise and vague queries for inferring user’s information need. Trust based web page recommendations retrieve search results according to the user’s information need. In this paper an algorithm is designed for Intelligent Information Retrieval using hybrid of Fuzzy set and Trust in web query session mining to perform Fuzzy query expansion for inferring user’s information need and trust is used for recommendation of web pages according to the user’s information need. Experiment was performed on the data set collected in domains Academics, Entertainment and Sports and search results confirm the improvement of precision.

Download Full-text

Experion: A framework for contextualizing evidence in expert finding

10.5753/sbbd_estendido.2021.18172 ◽

2021 ◽

Author(s):

Rodrigo Gonçalves ◽

Carina F. Dorneles

Keyword(s):

Information Retrieval ◽

Research Topic ◽

Expert Finding ◽

Search Results ◽

Open Research ◽

Academic Activities ◽

Search Processes ◽

Data Source ◽

The Moment ◽

Expertise Retrieval

Expert finding is traditionally related to a subject of research in information retrieval and, often, is taken to mean "expertise retrieval within a specific organization". The task involves finding an expert in an expertise topic. Even though there are interesting proposals in the literature, they do not consider the context in which a given expertise is bound. This Ph.D. thesis introduces the concept of a framework that chronologically contextualizes search results in expert finding. Our motivation is to provide more accurate results of search processes related to finding experts in a given topic, contextualizing the expertise on professional/academic activities, an open research topic. In this paper, we present the main concepts of the framework we are developing and a general overview of its operation. At the moment, we are using the Lattes platform as a data source, for which we developed a process to extract expertise evidence, supported by the Crossref database.

Download Full-text

Randomized algorithm for Information Retrieval using past search results

2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS) ◽

10.1109/rcis.2014.6861068 ◽

2014 ◽

Author(s):

Claudio Gutierrez-Soto ◽

Gilles Hubert

Keyword(s):

Information Retrieval ◽

Randomized Algorithm ◽

Search Results

Download Full-text

An Incremental Algorithm for Clustering Search Results

Atlantis Ambient and Pervasive Intelligence - Web-Based Information Technologies and Distributed Systems ◽

10.2991/978-94-91216-32-9_3 ◽

2010 ◽

pp. 43-55

Author(s):

Yongli Liu ◽

Yuanxin Ouyang ◽

Hao Sheng ◽

Zhang Xiong

Keyword(s):

Incremental Algorithm ◽

Search Results ◽

Clustering Search

Download Full-text

Retrieval Effectiveness of Cross Language Information Retrieval Search Engines

Digital Libraries: For Cultural Heritage, Knowledge Dissemination, and Future Creation - Lecture Notes in Computer Science ◽

10.1007/978-3-642-24826-9_37 ◽

2011 ◽

pp. 296-306

Author(s):

Schubert Foo

Keyword(s):

Information Retrieval ◽

Search Engines ◽

Retrieval Effectiveness ◽

Cross Language Information Retrieval ◽

Cross Language

Download Full-text

An Ontology-Based Search Tool in the Semantic Web

Advancing Information Management through Semantic Web Concepts and Ontologies ◽

10.4018/978-1-4666-2494-8.ch012 ◽

2013 ◽

pp. 221-249 ◽

Cited By ~ 1

Author(s):

Constanta-Nicoleta Bodea ◽

Adina Lipai ◽

Maria-Iuliana Dascalu

Keyword(s):

Semantic Web ◽

Lexical Database ◽

Search Results ◽

Concept Space ◽

Ontological Approach ◽

Search Tool ◽

Clustering Search ◽

Meta Search ◽

Specific Interests ◽

Initial List

The chapter presents a meta-search tool developed in order to deliver search results structured according to the specific interests of users. Meta-search means that for a specific query, several search mechanisms could be simultaneously applied. Using the clustering process, thematically homogenous groups are built up from the initial list provided by the standard search mechanisms. The results are more user-oriented, thanks to the ontological approach of the clustering process. After the initial search made on multiple search engines, the results are pre-processed and transformed into vectors of words. These vectors are mapped into vectors of concepts, by calling an educational ontology and using the WordNet lexical database. The vectors of concepts are refined through concept space graphs and projection mechanisms, before applying the clustering procedure. The chapter describes the proposed solution in the framework of other existent clustering search solutions. Implementation details and early experimentation results are also provided.

Download Full-text

Digital forensic text string searching: Improving information retrieval effectiveness by thematically clustering search results

A Re-Ranking Method of Search Results Based on Keyword and User Interest

Model for Evaluation of Information Retrieval Effectiveness within Semantic Web Concept

Experiments with Language-based Aids in Information Retrieval Systems

Post-retrieval search hit clustering to improve information retrieval effectiveness: Two digital forensics case studies

Intelligent Information Retrieval Using Hybrid of Fuzzy Set and Trust

Experion: A framework for contextualizing evidence in expert finding

Randomized algorithm for Information Retrieval using past search results

An Incremental Algorithm for Clustering Search Results

Retrieval Effectiveness of Cross Language Information Retrieval Search Engines

An Ontology-Based Search Tool in the Semantic Web

Export Citation Format