retrieval engine
Recently Published Documents


TOTAL DOCUMENTS

77
(FIVE YEARS 10)

H-INDEX

10
(FIVE YEARS 2)

2022 ◽  
Vol 10 (1) ◽  
pp. 0-0

In distributed information retrieval systems, information in web should be ranked based on a combination of multiple features. Linear combination of ranks has been the dominant approach due to its simplicity and efficiency. Such a combination scheme in distributed infrastructure requires that ranks in resources or agents are comparable to each other. The main challenge is how to transform the raw rank values of different criteria appropriately to make them comparable before any combination. In this manuscript, we will demonstrate how to rank Web documents based on its resource-provided information stream and how to combine and incorporate several raking schemas in one time. The system was tested on the queries provided by a Text Retrieval Conference (TREC), and our experimental results showed that it is robust and efficient compared with similar platforms that used offline data resources.


2021 ◽  
Author(s):  
Fahed Mubarak Braik ◽  
Abdulla Sulaiman Al Shehhi ◽  
Luigi Saputelli ◽  
Carlos Mata ◽  
Dorzhi Badmaev ◽  
...  

Abstract The purpose of this paper is to communicate the experiences in the development of an innovative concept named "ASK Thamama" as an automated data and information retrieval engine driven by artificial intelligence techniques including text analytics and natural language processing. ASK is an AI enabled conversational search engine used to retrieve information from various internal data repositories using natural language queries. The text processing and conversational engine concept is built upon available open-source software requiring minimum coding of new libraries. A data set with 1000 documents was used to validate key functionalities with an accuracy of 90% of the search queries and able to provide specific answers for 80% of queries framed as questions. The results of this work show encouraging results and demonstrate value that AI-enabled methodologies can provide natural language search by enabling automated workflows for data information retrieval. The developed AI methodology has tremendous potential of integration in an end-to-end workflow of knowledge management by utilizing available document repositories to valuable insights, with little to no human intervention.


2021 ◽  
Vol 21 (S2) ◽  
Author(s):  
Yidi Cui ◽  
Bo Gao ◽  
Lihong Liu ◽  
Jing Liu ◽  
Yan Zhu

Abstract Background Formula is an important means of traditional Chinese medicine (TCM) to treat diseases and has great research significance. There are many formula databases, but accessing rich information efficiently is difficult due to the small-scale data and lack of intelligent search engine. Methods We selected 38,000 formulas from a semi-structured database, and then segmented text, extracted information, and standardized terms. After that, we constructed a structured formula database based on ontology and an intelligent retrieval engine by calculating the weight of decoction pieces of formulas. Results The intelligent retrieval system named AMFormulaS (means Ancient and Modern Formula system) was constructed based on the structured database, ontology, and intelligent retrieval engine, so the retrieval and statistical analysis of formulas and decoction pieces were realized. Conclusions AMFormulaS is a large-scale intelligent retrieval system which includes a mass of formula data, efficient information extraction system and search engine. AMFormulaS could provide users with efficient retrieval and comprehensive data support. At the same time, the statistical analysis of the system can enlighten scientific research ideas and support patent review as well as new drug research and development.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Swapna Vidhur Daulatabad ◽  
Rajneesh Srivastava ◽  
Sarath Chandra Janga

Abstract Background With advancements in omics technologies, the range of biological processes where long non-coding RNAs (lncRNAs) are involved, is expanding extensively, thereby generating the need to develop lncRNA annotation resources. Although, there are a plethora of resources for annotating genes, despite the extensive corpus of lncRNA literature, the available resources with lncRNA ontology annotations are rare. Results We present a lncRNA annotation extractor and repository (Lantern), developed using PubMed’s abstract retrieval engine and NCBO’s recommender annotation system. Lantern’s annotations were benchmarked against lncRNAdb’s manually curated free text. Benchmarking analysis suggested that Lantern has a recall of 0.62 against lncRNAdb for 182 lncRNAs and precision of 0.8. Additionally, we also annotated lncRNAs with multiple omics annotations, including predicted cis-regulatory TFs, interactions with RBPs, tissue-specific expression profiles, protein co-expression networks, coding potential, sub-cellular localization, and SNPs for ~ 11,000 lncRNAs in the human genome, providing a one-stop dynamic visualization platform. Conclusions Lantern integrates a novel, accurate semi-automatic ontology annotation engine derived annotations combined with a variety of multi-omics annotations for lncRNAs, to provide a central web resource for dissecting the functional dynamics of long non-coding RNAs and to facilitate future hypothesis-driven experiments. The annotation pipeline and a web resource with current annotations for human lncRNAs are freely available on sysbio.lab.iupui.edu/lantern.


2021 ◽  
Vol 7 (5) ◽  
pp. 76
Author(s):  
Giuseppe Amato ◽  
Paolo Bolettieri ◽  
Fabio Carrara ◽  
Franca Debole ◽  
Fabrizio Falchi ◽  
...  

This paper describes in detail VISIONE, a video search system that allows users to search for videos using textual keywords, the occurrence of objects and their spatial relationships, the occurrence of colors and their spatial relationships, and image similarity. These modalities can be combined together to express complex queries and meet users’ needs. The peculiarity of our approach is that we encode all information extracted from the keyframes, such as visual deep features, tags, color and object locations, using a convenient textual encoding that is indexed in a single text retrieval engine. This offers great flexibility when results corresponding to various parts of the query (visual, text and locations) need to be merged. In addition, we report an extensive analysis of the retrieval performance of the system, using the query logs generated during the Video Browser Showdown (VBS) 2019 competition. This allowed us to fine-tune the system by choosing the optimal parameters and strategies from those we tested.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
E. Wei

With the continuous progress of my country’s cultural industry, how to apply artificial intelligence technology to song on demand has become an issue of concern. This research mainly discusses the research of singing intonation characteristics based on artificial intelligence technology and its application in song-on-demand scoring system. This paper uses the combination of ant colony algorithm and DTW algorithm to measure the similarity between speech signals with the average distortion distance, so as to expect accurate recognition results. The design of the song-on-demand scoring function module uses a combination of MVC mode and command mode based on artificial intelligence technology. The view component in the MVC mode is mainly used to display the content that the user needs to sing and realize the interaction with the user. The singer selects a song to start playing, and the scoring terminal device queries the music library server for song information according to the song number, then starts playing the song through the FTP file sharing service according to the audio file path in the song information, and at the same time displays the song on the display according to the timeline Show song and pitch information. The singer sings according to the screen prompts. The microphone collects the voice signal and transmits it to the scoring terminal. After the scoring algorithm is calculated, the result is fed back to the screen in real time. The singer can view his singing status in real time and make corresponding adjustments to obtain a higher score. After the singing, the scoring terminal will display the final result on the screen to inform the user and upload the singing record to the server for recording. In the tested on-demand retrieval engine, the average hit rate of the top 3 has reached more than 90% under various humming methods, basically maintaining the high hit rate characteristics of the original retrieval engine. The system designed in this research helps to effectively improve the singing level.


Author(s):  
Алина Андреевна Захарова

В статье описывается экспериментальное исследование метода разрешения синтаксической неоднозначности в конструкциях с сирконстантами с помощью онтологической семантики на основе универсального лингвистического процессора AIIRE (Artificial Intelligence Information Retrieval Engine). Выявлены четыре типа неоднозначных конструкций с сирконстантами, и составлены соответствующие поисковые запросы в Национальный корпус русского языка (НКРЯ). В результате получен список из 200 неоднозначных конструкций. Неоднозначность в конструкциях устраняется путем автоматического разбора и последующего ручного выбора его правильных вариантов. Однако на этом этапе возможны следующие проблемы: «разрывы» внутри конструкций, которые обозначают отсутствие нужных семантических связей внутри конструкции, а также большое количество вариантов синтаксического анализа, называемое комбинаторным взрывом. Эти проблемы решаются с помощью таких инструментов AIIRE, как Ontohelper и онтология. Онтология используется для обработки языковых данных и понимается как набор лексических значений или понятий и отношений между ними. Ontohelper – это вспомогательный инструмент с интерфейсом редактирования, где можно моделировать и задавать с помощью онтологическихотношенийвалентностиглаголов. В результате получаются корректные разборы для 66/200 конструкций, и обосновывается,чтоэффективностьданногометодазависитоткачестваиправильностимоделированияпонятийвонтологии.


2019 ◽  
Vol 108 ◽  
pp. 73-88 ◽  
Author(s):  
Songfei Wu ◽  
Qiyu Shen ◽  
Yichuan Deng ◽  
Jack Cheng

2019 ◽  
Vol 12 (3) ◽  
pp. 224-232
Author(s):  
Iqbaldeep Kaur ◽  
Rajesh Kumar Bawa

Background: With an exponential increase in software online as well as offline, through each passing day, the task of digging out precise and relevant software components has become the need of the hour. There is no dearth of techniques used for the retrieval of software component from the available online and offline repositories in the conceptual as well as the empirical literature. However each of these techniques has its own set of limitations and suitability. Objective: The proposed technique gives concrete decision using schematic based search that gives better result and higher precision and recall values. Methods: In this paper, a component decision and retrieval engine called SR-SCRS (Schematic and Refinement based Software Component Retrieval System) has been presented using OPAM. OPAM is a github repository containing software components (packages), designed by OcamlPro. This search engine employs two retrieval techniques for a robust decision vis-o-vis Schematic-based search with fuzzy logic and Refinement-based search. The Schematic based search is based on matching the attribute values and the threshold of those values as given by the user. Thereafter the results are optimized to achieve the level of relevance using fuzzy logic. Refinement based search works on one particular attribute value. The experiments have been conducted and validated on OPAM dataset. Results: Precisely, the average precision of Schematic based search and Refinement based search is 60% and 27.86% which shows robust results. Conclusion: Hence, the performance and efficiency of the proposed work has been evaluated and compared with the other retrieval technique.


Sign in / Sign up

Export Citation Format

Share Document