Research of Inverted Index Method Based on Block Organizing Technology

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.468-471.2836 ◽

2012 ◽

Vol 468-471 ◽

pp. 2836-2841

Author(s):

Xiao Bo Yang

Keyword(s):

Retrieval System ◽

Search Algorithm ◽

Performance Model ◽

Inverted Index ◽

Retrieval Performance ◽

Index Method ◽

Inverted File ◽

Algorithm Efficiency ◽

Data Statistics ◽

Block Organization

To further improve the overall efficiency of retrieval systems, this paper proposes an inverted index method based on block organizing technology. First, a retrieval performance model of the inverted index is built from data statistics. Next, the organizational strategy of the inverted file block index is analyzed. Finally, the retrieval performance model is verified through simulation experiments. The results show that block organization of the inverted file achieves higher algorithm efficiency with fewer cycles in the search algorithm and significantly reduces the search algorithm's execution time, which supports the feasibility of the inverted file block index method.
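
The abstract does not spell out the block layout, so the following is only a minimal sketch of one common way to organize a posting list into blocks, not the paper's method. It assumes sorted document IDs, a fixed block size, and a per-block "last ID" table; the names `BlockedPostings` and `intersect` and the block size of 4 are hypothetical. It illustrates how a lookup can jump to a single candidate block instead of scanning the whole list.

```python
from bisect import bisect_left


class BlockedPostings:
    """Sorted posting list split into fixed-size blocks (illustrative layout only)."""

    def __init__(self, postings, block_size=4):
        # block_size is a hypothetical value chosen for the example.
        self.blocks = [postings[i:i + block_size]
                       for i in range(0, len(postings), block_size)]
        # Largest document ID in each block, used to pick the candidate block.
        self.last_ids = [block[-1] for block in self.blocks]

    def __contains__(self, doc_id):
        # Find the first block whose largest ID is >= doc_id.
        b = bisect_left(self.last_ids, doc_id)
        if b == len(self.blocks):
            return False  # doc_id is larger than every stored ID
        block = self.blocks[b]
        i = bisect_left(block, doc_id)
        return i < len(block) and block[i] == doc_id


def intersect(short_postings, long_postings):
    """Intersect a short list with a blocked long list, one block probe per ID."""
    return [d for d in short_postings if d in long_postings]
```

Under these assumptions, each membership test touches one block rather than the full posting list, which is one plausible reading of "fewer cycles" in the search loop.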

Semantic Search on Unstructured Data

International Journal on Semantic Web and Information Systems ◽

10.4018/jswis.2010040102 ◽

2010 ◽

Vol 6 (2) ◽

pp. 17-35 ◽

Cited By ~ 2

Author(s):

Alex Kohn ◽

François Bry ◽

Alexander Manta

Keyword(s):

Retrieval System ◽

Pharmaceutical Research ◽

Information Retrieval System ◽

Semantic Search ◽

Unstructured Data ◽

Search Performance ◽

Adaptive Search ◽

Retrieval Performance ◽

Enterprise Search ◽

Existing Data

Studies agree that searchers are often not satisfied with the performance of current enterprise search engines. As a consequence, more scientists worldwide are actively investigating new avenues for searching to improve retrieval performance. This paper presents YASA (Your Adaptive Search Agent), a fully implemented and thoroughly evaluated ontology-based information retrieval system for the enterprise. A salient feature of YASA is that large parts of the ontology are automatically filled with facts by recycling and transforming existing data. YASA offers context-based personalization, faceted navigation, and semantic search capabilities. YASA has been deployed and evaluated in the pharmaceutical research department of Roche, Penzberg, and results show that even semantically simple ontologies suffice to considerably improve search performance.

Retrieval performance in Ferret a conceptual information retrieval system

Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '91 ◽

10.1145/122860.122896 ◽

1991 ◽

Cited By ~ 12

Author(s):

Michael L. Mauldin

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Retrieval Performance ◽

Conceptual Information

INDONESIAN-TRANSLATED HADITH CONTENT WEIGHTING IN PSEUDO-RELEVANCE FEEDBACK QUERY EXPANSION

Kursor ◽

10.21107/kursor.v11i1.249 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Ivanda Zevi Amalia ◽

Akbar Noto Ponco Bimantoro ◽

Agus Zainal Arifin ◽

Maryamah Faisol ◽

Rarasmaya Indraswari ◽

...

Keyword(s):

Query Expansion ◽

Retrieval System ◽

Named Entity Recognition ◽

Entity Recognition ◽

Retrieval Process ◽

Retrieval Performance ◽

Additional Information ◽

Named Entity ◽

Story Content ◽

Test Scenarios

In general, a hadith consists of the isnad and the matan (content). The matan can be separated into several components, for example a story, the main content, and some additional information. Text other than the main content, such as the isnad and the story, can interfere with the retrieval of relevant documents because most users typically use simple queries. In this paper, we therefore propose a Named Entity Recognition (NER) component weighting model to improve the Indonesian hadith retrieval system. We ran 3 test scenarios: the first scenario (S1) did not separate the hadith into components; the second scenario (S2) separated the hadith into 2 components, isnad and matan; and the third scenario separated the hadith into 4 components: isnad, background story, content, and additional information. The experimental results show that TF-IDF with the Rocchio algorithm for query expansion outperforms DocVec. Separating and weighting the hadith components also affects retrieval performance, because the isnad can be considered noise in a query. Separation into 2 components gave the best results overall, although separation into 4 components showed better results in some cases, with precision up to 100% and 70% recall.
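
As a rough illustration of TF-IDF with Rocchio query expansion under pseudo-relevance feedback, the sketch below treats the top-ranked documents as relevant and moves the query vector toward their centroid. It is not the authors' implementation: the smoothed IDF formula, the values `top_k=2`, `alpha=1.0`, `beta=0.75`, the omission of the negative (non-relevant) Rocchio term, and all function names are assumptions made for the example, and component weighting by NER is not modeled.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document, plus the IDF table."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumption) so that frequent terms keep a positive weight.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rocchio_expand(query_vec, doc_vecs, top_k=2, alpha=1.0, beta=0.75):
    """Expand the query with the centroid of the top_k ranked documents."""
    ranked = sorted(doc_vecs, key=lambda d: cosine(query_vec, d), reverse=True)
    feedback = ranked[:top_k]  # pseudo-relevant set: no user judgments involved
    expanded = {t: alpha * w for t, w in query_vec.items()}
    for vec in feedback:
        for t, w in vec.items():
            expanded[t] = expanded.get(t, 0.0) + beta * w / len(feedback)
    return expanded
```

The expanded vector can then be used to rank the collection a second time; terms that occur only in documents outside the feedback set receive no weight.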

Content based Image Retrieval System using Combination of Color and Shape Features, and Siamese Neural Network

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.b1053.1292s19 ◽

2019 ◽

Vol 9 (2S) ◽

pp. 71-77

Keyword(s):

Neural Network ◽

World Wide ◽

Retrieval System ◽

Image Features ◽

Shape Features ◽

Retrieval Performance ◽

Novel Approach ◽

Learning Technique ◽

Image Retrieval System ◽

Similar Images

With the advent of technology, a huge collection of digital images has accumulated in repositories on the World Wide Web (WWW). Searching for similar images in such a repository is difficult. In this paper, retrieval of similar images from the WWW is demonstrated using a combination of color and shape image features, followed by a Siamese neural network constructed for this purpose as a novel approach. A one-shot learning technique is used to test the Siamese neural network model for retrieval performance. Various experiments are conducted with both methods, and the results obtained are tabulated. The performance of the system is evaluated using precision, which is found to be high. A comparative study with existing works is also made.

Improving response time by search pruning in a content-based image retrieval system, using inverted file techniques

Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99) ◽

10.1109/ivl.1999.781122 ◽

2003 ◽

Cited By ~ 8

Author(s):

McG.D. Squire ◽

H. Muller ◽

W. Muller

Keyword(s):

Response Time ◽

Image Retrieval ◽

Retrieval System ◽

Content Based Image Retrieval ◽

Inverted File ◽

Image Retrieval System

Multi-Scale Bag-of-Features for Scalable Map Retrieval

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2012.p0793 ◽

2012 ◽

Vol 16 (7) ◽

pp. 793-799 ◽

Cited By ~ 8

Author(s):

Kanji Tanaka ◽

Kensuke Kondo

Keyword(s):

Mobile Robot ◽

Retrieval System ◽

The Novel ◽

Large Collection ◽

Inverted File ◽

Multi Scale ◽

Bag Of Features ◽

Retrieval Problem ◽

Environment Maps ◽

Compact Map

Retrieving a large collection of environment maps built by mapper robots is a key problem in mobile robot self-localization. The map retrieval problem is studied from the novel perspective of the multi-scale Bag-Of-Features (BOF) approach in this paper. In general, the multi-scale approach is advantageous in capturing both the global structure and the local details of a given map. BOF map retrieval is advantageous in its compact map representation as well as the efficient map retrieval using an inverted file system. The main contribution of this paper is combining the advantages of both approaches. Our approach is based on multi cue BOF as well as packing BOF, and achieves the efficiency and compactness of the map retrieval system. Experiments evaluate the effectiveness of the techniques presented using a large collection of environment maps.
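
To make the inverted file idea behind Bag-Of-Features retrieval concrete, here is a minimal single-scale sketch. It assumes each map has already been quantized into a collection of integer visual-word IDs and ranks maps by the number of words shared with the query. The multi-scale, multi-cue, and packing aspects of the paper are not modeled, and the function names and the simple vote-count scoring are hypothetical.

```python
from collections import Counter, defaultdict


def build_inverted_file(maps):
    """maps: {map_id: iterable of visual-word IDs}. Returns word -> set of map IDs."""
    index = defaultdict(set)
    for map_id, words in maps.items():
        for w in words:
            index[w].add(map_id)
    return index


def retrieve(index, query_words, top_n=3):
    """Rank maps by how many distinct query words they share with the query."""
    votes = Counter()
    for w in set(query_words):
        # Only maps listed under a query word are touched; the rest are skipped.
        for map_id in index.get(w, ()):
            votes[map_id] += 1
    return votes.most_common(top_n)
```

Because a query only visits the posting sets of its own words, the cost depends on how many maps share those words rather than on the size of the whole collection.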

Optimasi Pembobotan pada Query Expansion dengan Term Relatedness to Query-Entropy based (TRQE)

Jurnal Buana Informatika ◽

10.24002/jbi.v6i3.433 ◽

2015 ◽

Vol 6 (3) ◽

Author(s):

Resti Ludviani ◽

Khadijah F. Hayati ◽

Agus Zainal Arifin ◽

Diana Purwitasari

Keyword(s):

Query Expansion ◽

Retrieval System ◽

Document Retrieval ◽

Retrieval Performance ◽

Term Weighting ◽

New Approach ◽

Term Selection ◽

Relevance Evaluation ◽

Feedback Module ◽

Pseudo Feedback

Abstract. Selecting appropriate terms for expanding a query is very important in query expansion. Therefore, term selection optimization is added to improve query expansion performance in a document retrieval system. This study proposes a new approach named Term Relatedness to Query-Entropy based (TRQE) to optimize weighting in query expansion by considering the semantic and statistical aspects of the relevance evaluation of pseudo feedback, in order to improve document retrieval performance. The proposed method has 3 main modules: relevance feedback, pseudo feedback, and document retrieval. TRQE is implemented in the pseudo feedback module to optimize term weighting in query expansion. The evaluation shows that TRQE can retrieve documents with the best result at a precision of 100% and a recall of 22.22%. TRQE for weighting optimization of query expansion is shown to improve document retrieval. Keywords: TRQE, query expansion, term weighting, term relatedness to query, relevance feedback

Improvement in image retrieval performance of vocabulary tree by adding index storage array and multiple search algorithm

ICCAS 2010 ◽

10.1109/iccas.2010.5669764 ◽

2010 ◽

Author(s):

Ho-Yong Seo ◽

Ho-Hyun Lee ◽

Ju-Jang Lee

Keyword(s):

Image Retrieval ◽

Search Algorithm ◽

Retrieval Performance ◽

Vocabulary Tree

A SURVEY ON LOCATION BASED SERACH USING SPATIAL INVERTED INDEX METHOD

International Journal of Research in Engineering and Technology ◽

10.15623/ijret.2014.0311084 ◽

2014 ◽

Vol 03 (11) ◽

pp. 490-493 ◽

Cited By ~ 2

Author(s):

N.Minojini .

Keyword(s):

Inverted Index ◽

Index Method

Figure Based Biomedical Document Retrieval System using Structural Image Features

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/jkdb.2012010103 ◽

2012 ◽

Vol 3 (1) ◽

pp. 39-58

Author(s):

Harikrishna G. N. Rai ◽

K Sai Deepak ◽

P. Radha Krishna

Keyword(s):

Structural Properties ◽

Retrieval System ◽

Document Retrieval ◽

Image Features ◽

Biomedical Literature ◽

Feature Descriptor ◽

Retrieval Performance ◽

Retrieval Task ◽

Edge Information ◽

Structural Image

The multi-modal and unstructured nature of documents makes their retrieval from healthcare document repositories a challenging task. Text-based retrieval is the conventional approach used for solving this problem. In this paper, the authors explore an alternate avenue of using embedded figures for the retrieval task. Usually, the context of a document is directly reflected in its associated figures; therefore, text embedded within these figures, along with image features, has been used for similarity-based retrieval of figures. The present work demonstrates that image features describing the structural properties of figures are sufficient for the figure retrieval task. First, the authors analyze the problem of figure retrieval from biomedical literature and identify significant classes of figures. Second, they use edge information as a means to discriminate between structural properties of each figure category. Finally, the authors present a methodology using a novel feature descriptor, the Fourier Edge Orientation Autocorrelogram (FEOAC), to describe structural properties of figures and build an effective biomedical document retrieval system. The experimental results demonstrate the better retrieval performance and overall improvement of FEOAC for the figure retrieval task, especially when most of the edge information is retained. Apart from invariance to scale, rotation, and non-uniform illumination, the proposed feature descriptor is shown to be relatively robust to noisy edges.