Semantic Analysis Based Approach for Relevant Text Extraction Using Ontology

2017 ◽  
Vol 7 (4) ◽  
pp. 19-36 ◽  
Author(s):  
Poonam Chahal ◽  
Manjeet Singh ◽  
Suresh Kumar

Semantic analysis computation is done by extracting the interrelated concepts used by an author in the text/content of document. The concepts and linking i.e. relationships that are available among the concepts are most relevant as they provide the maximum information related to the event or activity as described by an author in the document. The retrieved relevant information from the text helps in the construction of the summary of a large text present in the document. This summary can further be represented in form of ontology and utilized in various application areas of information retrieval process like crawling, indexing, ranking, etc. The constructed ontologies can be compared with each other for calculation of similarity index based on semantic analysis between any texts. This paper gives a novel technique for retrieving the relevant semantic information represented in the form of ontology for true semantic analysis of given text.

Author(s):  
Mamata Rath ◽  
Joel J. P. C. Rodrigues ◽  
George S. Oreku

Information retrieval refers to a noteworthy system of identifying relevant information and recovering it through specific procedures from stored system. These technique is used in many differentiated applications that deal with subjective intelligence. Applications based on information retrieval are identified with various issues, for example, in technology domain, the sudden size changes of the objectives as they approach the sensor. If not taken care of appropriately, the altered changes can present substantial issues in information affiliation and position estimation. Under such a system, the meaning of the objective state is the fundamental advance for programmed comprehension of dynamic scenes. This is the reason of requirement of cognitive models for information retrieval. The existent models move around the connection between data list terms and records.


Author(s):  
Antonio M. Rinaldi ◽  
Cristiano Russo

Abstract The synthesis process of document content and its visualization play a basic role in the context of knowledge representation and retrieval. Existing methods for tag-clouds generations are mostly based on text content of documents, others also consider statistical or semantic information to enrich the document summary, while precious information deriving from multimedia content is often neglected. In this paper we present a document summarization and visualization technique based on both statistical and semantic analysis of textual and visual contents. The result of our framework is a Visual Semantic Tag Cloud based on the highlighting of relevant terms in a document using some features (font size, color, etc.) showing the importance of a term compared to other ones. The semantic information is derived from a knowledge base where concepts are represented through several multimedia items. The Visual Semantic Tag Cloud can be used not only to synthesize a document but also to represent a set of documents grouped by categories using a topic detection technique based on textual and visual analysis of multimedia features. Our work aims at demonstrating that with the help of semantic analysis and the combination of textual and visual features it is possible to improve the user knowledge acquisition by means of a synthesized visualization. The whole strategy has been evaluated by means of a ground truth and compared with similar approaches. Experimental results show the effectiveness of our approach, which outperforms state-of-art algorithms in topic detection combining both visual and semantic information.


Author(s):  
Radha Guha

Background:: In the era of information overload it is very difficult for a human reader to make sense of the vast information available in the internet quickly. Even for a specific domain like college or university website it may be difficult for a user to browse through all the links to get the relevant answers quickly. Objective:: In this scenario, design of a chat-bot which can answer questions related to college information and compare between colleges will be very useful and novel. Methods:: In this paper a novel conversational interface chat-bot application with information retrieval and text summariza-tion skill is designed and implemented. Firstly this chat-bot has a simple dialog skill when it can understand the user query intent, it responds from the stored collection of answers. Secondly for unknown queries, this chat-bot can search the internet and then perform text summarization using advanced techniques of natural language processing (NLP) and text mining (TM). Results:: The advancement of NLP capability of information retrieval and text summarization using machine learning tech-niques of Latent Semantic Analysis(LSI), Latent Dirichlet Allocation (LDA), Word2Vec, Global Vector (GloVe) and Tex-tRank are reviewed and compared in this paper first before implementing them for the chat-bot design. This chat-bot im-proves user experience tremendously by getting answers to specific queries concisely which takes less time than to read the entire document. Students, parents and faculty can get the answers for variety of information like admission criteria, fees, course offerings, notice board, attendance, grades, placements, faculty profile, research papers and patents etc. more effi-ciently. Conclusion:: The purpose of this paper was to follow the advancement in NLP technologies and implement them in a novel application.


2012 ◽  
Vol 2012 ◽  
pp. 1-8 ◽  
Author(s):  
Anis Zouaghi ◽  
Mounir Zrigui ◽  
Georges Antoniadis ◽  
Laroussi Merhbene

We propose a new approach for determining the adequate sense of Arabic words. For that, we propose an algorithm based on information retrieval measures to identify the context of use that is the closest to the sentence containing the word to be disambiguated. The contexts of use represent a set of sentences that indicates a particular sense of the ambiguous word. These contexts are generated using the words that define the senses of the ambiguous words, the exact string-matching algorithm, and the corpus. We use the measures employed in the domain of information retrieval, Harman, Croft, and Okapi combined to the Lesk algorithm, to assign the correct sense of those proposed.


2021 ◽  
pp. 1-11
Author(s):  
V.S. Anoop ◽  
P. Deepak ◽  
S. Asharaf

Online social networks are considered to be one of the most disruptive platforms where people communicate with each other on any topic ranging from funny cat videos to cancer support. The widespread diffusion of mobile platforms such as smart-phones causes the number of messages shared in such platforms to grow heavily, thus more intelligent and scalable algorithms are needed for efficient extraction of useful information. This paper proposes a method for retrieving relevant information from social network messages using a distributional semantics-based framework powered by topic modeling. The proposed framework combines the Latent Dirichlet Allocation and distributional representation of phrases (Phrase2Vec) for effective information retrieval from online social networks. Extensive and systematic experiments on messages collected from Twitter (tweets) show this approach outperforms some state-of-the-art approaches in terms of precision and accuracy and better information retrieval is possible using the proposed method.


2020 ◽  
Vol 36 (S1) ◽  
pp. 10-10
Author(s):  
Vigdis Lauvrak ◽  
Kelly Farrah ◽  
Rosmin Esmail ◽  
Anna Lien Espeland ◽  
Elisabet Hafstad ◽  
...  

IntroductionIn 2019, the Norwegian Institute for Public Health and Canadian Agency for Drugs and Technologies in Health (CADTH) received support from HTAi to produce a quarterly current awareness alert for the HTAi Disinvestment and Early Awareness Interest Group in collaboration with the HTAi Information Retrieval Interest Group. The alert focuses on methods and topical issues, and broader forecasts of potentially disruptive technologies that may be of interest to those involved in horizon scanning and disinvestment initiatives in health technology assessment (HTA).MethodsInformation specialists at both agencies developed search strategies for disinvestment and for horizon scanning in PubMed and Google. The template for the alert was based on an e-newsletter developed by the Information Retrieval Interest Group. Information specialists and researchers reviewed the monthly (PubMed) and weekly (Google) search results and selected potentially relevant publications. Additional sources were also identified through regular HTA and horizon scanning work.ResultsAlerts are posted quarterly on the HTAi Interest Group website; members receive an email notice when new alerts are available. While the revised PubMed searches are identifying relevant information, Google alerts have been disappointing, and this search may need to be revised further or dropped. When the one-year pilot project ends, in Fall 2020, interest group members will be surveyed to see if the alerts were useful, and whether they have suggestions for improving them.ConclusionsCollaborating on this alert service reduces duplication of effort between agencies, and makes new research in horizon scanning and disinvestment more accessible to colleagues in other agencies working in these areas.


Author(s):  
Hanene Maghrebi ◽  
Amos David

Managing the increasing growth of multimedia content still poses some problems. The challenge is to propose relevant information to the users among the large volume of information available. The main idea that drives our approach is to provide an open information retrieval system, which can adapt its results to several…La gestion de l’information multimédia soulève encore quelques problèmes. Le défi est de pouvoir proposer à l’utilisateur des informations pertinentes parmi la quantité d’information qui ne cesse de s’accroître. Dans cette lignée, nous proposons un système ouvert de recherche d’information capable d’adapter ses résultats aux différents… 


Sign in / Sign up

Export Citation Format

Share Document