Relation-Based Information Retrieval

The traditional Boolean word-based approach to information retrieval (IR) considers only words for indexing. Irrelevant information is retrieved because of non-inclusion of semantic information like word senses and word context. In this work, the importance of representing the documents along another semantic dimension in addition to sense context information is considered. The incorporation of semantic relations as an additional dimension gives a better insight into the interpretation of the document. The micro-contexts generated from the documents are also used in indexing. The retrieval performance is measured in terms of precision and recall. The results tabulated show better performance.

Download Full-text

The data-driven Bulgarian WordNet: BTBWN

Cognitive Studies | Études cognitives ◽

10.11649/cs.1713 ◽

2018 ◽

Author(s):

Petya Osenova ◽

Kiril Simov

Keyword(s):

Information Retrieval ◽

Machine Translation ◽

Semantic Information ◽

Data Driven ◽

Lexical Resources ◽

Multilingual Information Retrieval ◽

Cross Lingual ◽

Princeton Wordnet ◽

Word Senses

The data-driven Bulgarian WordNet: BTBWNThe paper presents our work towards the simultaneous creation of a data-driven WordNet for Bulgarian and a manually annotated treebank with semantic information. Such an approach requires synchronization of the word senses in both - syntactic and lexical resources, without limiting the WordNet senses to the corpus or vice versa. Our strategy focuses on the identification of senses used in BulTreeBank, but the missing senses of a lemma also have been covered through exploration of bigger corpora. The identified senses have been organized in synsets for the Bulgarian WordNet. Then they have been aligned to the Princeton WordNet synsets. Various types of mappings are considered between both resources in a cross-lingual aspect and with respect to ensuring maximum connectivity and potential for incorporating the language specific concepts. The mapping between the two WordNets (English and Bulgarian) is a basis for applications such as machine translation and multilingual information retrieval. Oparty na danych WordNet bułgarski: BTBWNW artykule przedstawiono naszą pracę na rzecz jednoczesnej budowy opartego na danych wordnetu dla języka bułgarskiego oraz ręcznie oznaczonego informacjami semantycznymi banku drzew. Takie podejście wymaga uzgodnienia znaczeń słów zarówno w zasobach składniowych, jak i leksykalnych, bez ograniczania znaczeń umieszczanych w wordnecie do tych obecnych w korpusie, jak i odwrotnie. Nasza strategia koncentruje się na identyfikacji znaczeń stosowanych w BulTreeBank, przy czym brakujące znaczenia lematu zostały również zbadane przez zgłębienie większych korpusów. Zidentyfikowane znaczenia zostały zorganizowane w synsety bułgarskiego wordnetu, a następnie powiązane z synsetami Princeton WordNet. Rozmaite rodzaje rzutowań są rozpatrywane pomiędzy obydwoma zasobami w kontekście międzyjęzykowym, a także w odniesieniu do zapewnienia maksymalnej łączności i możliwości uwzględnienia pojęć specyficznych dla języka bułgarskiego. Rzutowanie między dwoma wordnetami (angielskim i bułgarskim) jest podstawą dla aplikacji, takich jak tłumaczenie maszynowe i wielojęzyczne wyszukiwanie informacji.

Download Full-text

An Associative and Adaptive Network Model For Information Retrieval In The Semantic Web

Advances in Semantic Web and Information Systems - Progressive Concepts for Semantic Web Evolution ◽

10.4018/978-1-60566-992-2.ch014 ◽

2010 ◽

pp. 309-344

Author(s):

Peter Scheir ◽

Peter Prettenhofer ◽

Stefanie N. Lindstaedt ◽

Chiara Ghidini

Keyword(s):

Information Retrieval ◽

Semantic Web ◽

Soft Computing ◽

Relevance Feedback ◽

Semantic Information ◽

Retrieval Performance ◽

Search Results ◽

Low Coverage ◽

Relevant Material ◽

The Web

While it is agreed that semantic enrichment of resources would lead to better search results, at present the low coverage of resources on the web with semantic information presents a major hurdle in realizing the vision of search on the Semantic Web. To address this problem, this chapter investigates how to improve retrieval performance in settings where resources are sparsely annotated with semantic information. Techniques from soft computing are employed to find relevant material that was not originally annotated with the concepts used in a query. The authors present an associative retrieval model for the Semantic Web and evaluate if and to which extent the use of associative retrieval techniques increases retrieval performance. In addition, the authors present recent work on adapting the network structure based on relevance feedback by the user to further improve retrieval effectiveness. The evaluation of new retrieval paradigms - such as retrieval based on technology for the Semantic Web - presents an additional challenge since no off-the-shelf test corpora exist. Hence, this chapter gives a detailed description of the approach taken to evaluate the information retrieval service the authors have built.

Download Full-text

The Relation between the Combination Levels in the Structure of Concepts Entities and the Amount of Semantic Relations in Improving Information Retrieval Performance in UN, FAO’s Agricultural Ontology

Iranian Journal of Information Processing and Management ◽

10.52547/jipm.36.4.1113 ◽

2021 ◽

Vol 36 (4) ◽

pp. 1113-1136

Author(s):

Maziar Amirhosseini ◽

Keyword(s):

Information Retrieval ◽

Semantic Relations ◽

Retrieval Performance

Download Full-text

The research of semantic information retrieval for Mongolian data structure

Advanced Materials and Information Technology Processing ◽

10.2495/amitp130781 ◽

2014 ◽

Author(s):

Yila Su ◽

Chuan Fan ◽

Hongbo Chen ◽

Xiulan Xie ◽

Haitao Liu

Keyword(s):

Information Retrieval ◽

Data Structure ◽

Semantic Information ◽

Semantic Information Retrieval

Download Full-text

An Investigation on Ontology Based Fuzzy Semantic Information Retrieval

International Journal of Psychosocial Rehabilitation ◽

10.37200/ijpr/v23i1/pr190248 ◽

2019 ◽

Vol 23 (1) ◽

pp. 377-384

Author(s):

Naren J ◽

Raja Rajeswari D ◽

Nikhith Sannidhi ◽

Vithya G

Keyword(s):

Information Retrieval ◽

Semantic Information ◽

Semantic Information Retrieval

Download Full-text

SemOIR: An ontology-based semantic information retrieval system

2020 IEEE 20th International Conference on Software Quality, Reliability and Security Companion (QRS-C) ◽

10.1109/qrs-c51114.2020.00042 ◽

2020 ◽

Author(s):

Mingwei Tang ◽

Jiangping Chen ◽

Haihua Chen

Keyword(s):

Information Retrieval ◽

Semantic Information ◽

Retrieval System ◽

Information Retrieval System ◽

Semantic Information Retrieval

Download Full-text

Data Organization

Human Factors The Journal of the Human Factors and Ergonomics Society ◽

10.1177/001872087701900101 ◽

1977 ◽

Vol 19 (1) ◽

pp. 1-14 ◽

Cited By ~ 38

Author(s):

Bruce M. Durding ◽

Curtis A. Becker ◽

John D. Gould

Keyword(s):

College Students ◽

Information Retrieval ◽

Organizational Structures ◽

Semantic Relations ◽

Data Organization ◽

Retrieval Systems ◽

Information Retrieval Systems ◽

Computer Based

Three experiments investigated how people organize data. Subjects were given sets of 15-20 words and asked to organize them on paper. Each word set had a pre-defined organization (hierarchy, network, lists, table) based on the semantic relations among the words. Experiment 1 showed that college students have all these organizational structures available for use. They organized most word sets on the basis of the semantic relations inherent in them. Whereas most subjects used “appropriate” organizations (those that most easily preserved the relations), a few subjects organized nearly all word sets into lists. Experiment 2 showed that subjects can efficiently fit the word sets into “skeletons” that were explicitly designed to maintain all the semantic relations among the words. Experiment 3 showed that subjects have difficulty in preserving the relations among the words when they were required to organize them into inappropriate structures. These results are evaluated relative to the use of computer-based information retrieval systems.

Download Full-text

Linear Algebraic Structure of Word Senses, with Applications to Polysemy

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00034 ◽

2018 ◽

Vol 6 ◽

pp. 483-495 ◽

Cited By ~ 13

Author(s):

Sanjeev Arora ◽

Yuanzhi Li ◽

Yingyu Liang ◽

Tengyu Ma ◽

Andrej Risteski

Keyword(s):

Information Retrieval ◽

Random Walk ◽

Algebraic Structure ◽

Sparse Coding ◽

Word Sense ◽

Word Embeddings ◽

Linear Superposition ◽

Empirical Tests ◽

Embedding Methods ◽

Word Senses

Word embeddings are ubiquitous in NLP and information retrieval, but it is unclear what they represent when the word is polysemous. Here it is shown that multiple word senses reside in linear superposition within the word embedding and simple sparse coding can recover vectors that approximately capture the senses. The success of our approach, which applies to several embedding methods, is mathematically explained using a variant of the random walk on discourses model (Arora et al., 2016). A novel aspect of our technique is that each extracted word sense is accompanied by one of about 2000 “discourse atoms” that gives a succinct description of which other words co-occur with that word sense. Discourse atoms can be of independent interest, and make the method potentially more useful. Empirical tests are used to verify and support the theory.

Download Full-text