Towards multilingual user models for Personalized Multilingual Information Retrieval

The data-driven Bulgarian WordNet: BTBWN

The paper presents our work towards the simultaneous creation of a data-driven WordNet for Bulgarian and a treebank manually annotated with semantic information. Such an approach requires synchronizing the word senses in both the syntactic and the lexical resources, without limiting the WordNet senses to those in the corpus or vice versa. Our strategy focuses on identifying the senses used in BulTreeBank, but the missing senses of a lemma have also been covered by exploring bigger corpora. The identified senses have been organized into synsets for the Bulgarian WordNet and then aligned to the Princeton WordNet synsets. Various types of mappings between the two resources are considered from a cross-lingual perspective, with the aim of ensuring maximum connectivity and the potential to incorporate language-specific concepts. The mapping between the two WordNets (English and Bulgarian) is a basis for applications such as machine translation and multilingual information retrieval.

A Multilingual Approach to Multilingual Information Retrieval

Advances in Cross-Language Information Retrieval - Lecture Notes in Computer Science, 2003, pp. 101-110

DOI: 10.1007/978-3-540-45237-9_8

Cited By ~ 2

Author(s): Jian-Yun Nie, Fuman Jin

Keyword(s): Information Retrieval, Multilingual Information Retrieval, Multilingual Approach

Using Translation Heuristics to Improve a Multimodal and Multilingual Information Retrieval System

Applications of Fuzzy Sets Theory - Lecture Notes in Computer Science, 2007, pp. 438-446

DOI: 10.1007/978-3-540-73400-0_55

Author(s): Miguel Ángel García-Cumbreras, Maria Teresa Martín-Valdivia, Luis Alfonso Ureña-López, Manuel Carlos Díaz-Galiano, Arturo Montejo-Ráez

Keyword(s): Information Retrieval, Retrieval System, Information Retrieval System, Multilingual Information Retrieval

User Models for Adaptive Information Retrieval on the Web

International Journal of Adaptive Resilient and Autonomic Systems, 2012, Vol 3 (3), pp. 1-19

DOI: 10.4018/jaras.2012070101

Author(s): Max Chevalier, Christine Julien, Chantal Soulé-Dupuy

Keyword(s): Information Retrieval, Search Engines, User Profile, User Model, User Models, Search Results, Retrieval Systems, Information Retrieval Systems, Adaptive Information Retrieval, The Web

Information can be searched with dedicated tools called Information Retrieval Systems (IRS), also known as “search engines”. To provide more accurate results, most such systems offer personalization features: each system models the user in order to adapt the search results it displays. In a multi-application context (e.g., when several search engines are used for a single query), personalization is limited because the user model (also called a profile) is incomplete: it does not exploit actions and queries issued in other search engines. Sharing user models between several search engines is therefore a challenge that must be met to make personalization more effective. A semantic architecture for user profile interoperability is proposed to reach this goal. The architecture is also useful in many other contexts, since it can share other resource models, for instance a document model, between applications. It also lets every system keep its own representation of each resource while providing an easy way to share it.

Multilingual vs. Monolingual User Models for Personalized Multilingual Information Retrieval

Multilingual Information Retrieval Using English and Chinese Queries

Selection and Merging Strategies for Multilingual Information Retrieval

Multilingual information retrieval on the Internet: A case study of Turkish users

MIRACLE Approaches to Multilingual Information Retrieval: A Baseline for Future Research

Interface Enabled Multilingual Information Retrieval using Structured Thesaurus Database
