Implementation of Weighted Tree Similarity and Cosine Sorensen-Dice Algorithms for Semantic Search in Document Repository Information System

2021 ◽  
Vol 5 (1) ◽  
pp. 21-27
Author(s):  
Abdurrosyiid Amrullah ◽  
Indra Gita Anugrah

As the number of documents we manage grows, searching them becomes more difficult, and information retrieval becomes increasingly important. An information retrieval system helps users find documents that match their keywords. Document searches usually consider only the name of the document (file) being searched for, without regard to the document's content or metadata, and so cannot meet users' information needs. Document search has several approaches, including full-text search, plain metadata search, and semantic search. This study uses the Weighted Tree Similarity algorithm together with the Cosine Sorensen-Dice algorithm to calculate semantic search similarity. Document metadata is represented as a tree with labeled nodes, labeled branches, and weighted branches. The similarity of subtree edge labels is calculated with Cosine Sorensen-Dice, while the total similarity of a document is calculated with weighted tree similarity. The document metadata structure uses the taxonomy owner, description, title, disposition content, and type. The result of this research is a document search application with taxonomic weights on file storage.
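The scoring idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a flat, one-level metadata tree, it uses a plain Sorensen-Dice coefficient over word sets as a stand-in for the paper's Cosine Sorensen-Dice measure, and the branch weights and the names `WEIGHTS`, `dice`, and `tree_similarity` are hypothetical.

```python
# Hypothetical branch weights for the metadata taxonomy named in the abstract.
# The weights actually used in the study are not stated here.
WEIGHTS = {
    "owner": 0.1,
    "description": 0.3,
    "title": 0.3,
    "disposition_content": 0.2,
    "type": 0.1,
}


def dice(a, b):
    """Sorensen-Dice coefficient over lowercase word sets: 2|A & B| / (|A| + |B|)."""
    tokens_a = set(a.lower().split())
    tokens_b = set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return 2 * len(tokens_a & tokens_b) / (len(tokens_a) + len(tokens_b))


def tree_similarity(query, doc, weights=WEIGHTS):
    """Weighted average of per-branch label similarities.

    Only branches that the query actually fills in are compared, and their
    weights are renormalized so the score stays in [0, 1].
    """
    fields = [f for f in weights if query.get(f)]
    total = sum(weights[f] for f in fields)
    if total == 0:
        return 0.0
    score = sum(weights[f] * dice(query[f], doc.get(f, "")) for f in fields)
    return score / total


if __name__ == "__main__":
    doc = {"title": "annual budget report", "owner": "finance", "type": "pdf"}
    query = {"title": "annual report", "owner": "finance"}
    print(round(tree_similarity(query, doc), 2))
```

Ranking a repository then amounts to sorting documents by `tree_similarity(query, doc)` in descending order. A faithful weighted tree similarity would recurse into nested subtrees; the flat version above only shows how branch weights and label similarity combine.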

2011 ◽  
pp. 226-232
Author(s):  
Ki Jung Lee

With the increased use of the Internet, a large number of consumers first consult online resources for their healthcare decisions. The problem with the existing information structure lies primarily in the fact that the vocabulary used in consumer queries is intrinsically different from the vocabulary represented in medical literature. Consequently, medical information retrieval often provides poor search results. Since consumers make medical decisions based on the search results, building an effective information retrieval system becomes an essential issue. By reviewing the foundational concepts and application components of medical information retrieval, this paper will contribute to a body of research that seeks appropriate answers to a question like “How can we design a medical information retrieval system that can satisfy consumers’ information needs?”


2009 ◽  
Vol 18 (02) ◽  
pp. 331-354 ◽  
Author(s):  
SAMIR KECHID ◽  
HABIBA DRIAS

The World Wide Web is developing incessantly and very quickly. Currently, finding useful information on the Web is a time-consuming process. In this paper, we present PIRS, a personalized information retrieval system for a distributed environment. Most prior research in distributed information access focused on selecting and merging information that has the most relevant content according to the query but ignored the user's specific needs. The underlying idea is that different users have different backgrounds, goals, and interests when seeking information, and thus the same query may cover different specific information needs depending on who issued it. However, with the ever-expanding Web, users are faced with a huge number of information resources. Consequently, such query-based information access strategies lead to inaccurate query results. PIRS extends the state of the art in Web-based information retrieval in a distributed environment. First, it develops models for representing both the user and the information source using feature-based profiles. Second, PIRS expands a user query according to the user's profile. Third, it develops algorithms for source selection and results merging that personalize the computation of the relevance score of a document in response to the user's query. PIRS has been tested with several known information sources. The experimental results obtained show the effectiveness of our approach.
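As a rough illustration of the second step only (expanding a query according to a user profile), the sketch below appends the highest-weighted profile terms that the query does not already contain. It is an assumption-laden toy, not the PIRS algorithm: the profile is modeled as a hypothetical term-to-weight dictionary, and the function name `expand_query` and the parameter `k` are invented for this example.

```python
from typing import Dict, List


def expand_query(query: str, profile: Dict[str, float], k: int = 2) -> List[str]:
    """Return the query terms followed by the k strongest unused profile terms.

    `profile` is a hypothetical feature-based user profile mapping a term to
    an interest weight; higher weights are preferred, with ties broken
    alphabetically so the result is deterministic.
    """
    terms = query.lower().split()
    candidates = [t for t in profile if t not in terms]
    candidates.sort(key=lambda t: (-profile[t], t))
    return terms + candidates[:k]


if __name__ == "__main__":
    profile = {"programming": 0.9, "java": 0.8, "jvm": 0.7, "coffee": 0.1}
    print(expand_query("java tutorial", profile))
```

With this profile, the ambiguous query "java tutorial" is steered toward programming rather than coffee, which is the kind of disambiguation the abstract attributes to user-specific needs.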


Author(s):  
Alex Kohn ◽  
François Bry ◽  
Alexander Manta

Studies agree that searchers are often not satisfied with the performance of current enterprise search engines. As a consequence, more scientists worldwide are actively investigating new avenues for searching to improve retrieval performance. This paper contributes to YASA (Your Adaptive Search Agent), a fully implemented and thoroughly evaluated ontology-based information retrieval system for the enterprise. A salient particularity of YASA is that large parts of the ontology are automatically filled with facts by recycling and transforming existing data. YASA offers context-based personalization, faceted navigation, as well as semantic search capabilities. YASA has been deployed and evaluated in the pharmaceutical research department of Roche, Penzberg, and results show that already semantically simple ontologies suffice to considerably improve search performance.


Author(s):  
Bernard Ijesunor Akhigbe

At present, keyword-based techniques allow information retrieval (IR) but are unable to capture the conceptualizations in users' information needs and contents. The response to this has been semantic search computing with commendable success. Surprisingly, it is still difficult to evaluate Semantic IR (SIR) and understand the user contexts. The absence of a standardized cognitive user-centred evaluative paradigm (CUcEP) further exacerbates these challenges. This chapter provides the state-of-the-art on IR and SIR evaluation and a systematic review of contexts. Appropriate user-centred theories and the proposed evaluative framework with its integrated-context, web analytic conception, and related data analytic technique are presented. A descriptive approach is adopted, with the conclusion that multiple contexts are essential in SIR evaluation since “searching by meaning” is a multi-dimensional cognitive conception, hence the need to consider the impact of context dynamicity. Finally, the foregrounded semantic items will be applied to standardize the CUcEP in future.


2017 ◽  
Vol 7 (3) ◽  
pp. 38-61 ◽  
Author(s):  
Ameni Yengui ◽  
Mahmoud Neji

In this article, the authors introduce their OSSVIRI information retrieval system, which is composed of three modules. In the analysis module, they propose a statistical technique that exploits word frequency to extract simple, compound, and specific terms from the documents. In the indexing module, the authors use an ontology to associate the terms with their concepts, retrieve the relations between them, and disambiguate the concepts to improve the semantic content of the documents. The concepts and relations are represented as a conceptual graph. In the research module, the authors propose a technique for reformulating users' queries based on external resources and users' profiles, and a matching technique based on the combined expansion of the requests and the documents, guided by the context of the information need and the document contents. The system is validated using the metrics from the research information and comparisons with an existing statistical approach. The authors show that their approach achieves good results.


2013 ◽  
Vol 278-280 ◽  
pp. 2069-2072
Author(s):  
Jin Xing Shen

To achieve semantic retrieval of scientific research information on the WWW, this paper applies an ontology-based framework to an information retrieval system for a management information system. After analyzing the limitations of the traditional method, it puts forward a semantic search approach and introduces the idea of semantic retrieval, as well as the way to construct ontology entities and the language that describes them. A semantic retrieval system based on ontology is also given. An application to retrieving project information shows that the framework can overcome the limitations of other ontology models, and this research facilitates the semantic retrieval of management information through semantic retrieval concepts on the Semantic Web.

