Automatic information retrieval with semantic analysis for green building evaluation

2021 ◽  
Author(s):  
Qiang Du ◽  
Yaxian Li ◽  
Sheng Xu ◽  
Yunqing Yan ◽  
Yani Zhang
Author(s):  
Radha Guha

Background: In the era of information overload, it is very difficult for a human reader to quickly make sense of the vast amount of information available on the internet. Even for a specific domain such as a college or university website, it may be difficult for a user to browse through all the links to find the relevant answers quickly. Objective: In this scenario, a chat-bot that can answer questions about college information and compare colleges would be both useful and novel. Methods: In this paper, a novel conversational chat-bot application with information retrieval and text summarization skills is designed and implemented. First, the chat-bot has a simple dialog skill: when it can understand the intent of the user's query, it responds from a stored collection of answers. Second, for unknown queries, the chat-bot can search the internet and then summarize the retrieved text using advanced techniques of natural language processing (NLP) and text mining (TM). Results: Advances in the NLP capabilities of information retrieval and text summarization using the machine learning techniques of Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA), Word2Vec, Global Vectors (GloVe) and TextRank are first reviewed and compared in this paper, and then implemented in the chat-bot design. The chat-bot improves the user experience by returning concise answers to specific queries, which takes less time than reading the entire document. Students, parents and faculty can more efficiently get answers on a variety of topics such as admission criteria, fees, course offerings, notice board, attendance, grades, placements, faculty profiles, research papers and patents. Conclusion: The purpose of this paper was to follow the advancement of NLP technologies and implement them in a novel application.
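
The abstract names TextRank among the summarization techniques but does not show an implementation. The following is a minimal extractive sketch of the general TextRank idea, not the authors' code: sentences are graph nodes, edges are weighted by word overlap, and a PageRank-style iteration ranks the sentences. The function names, the sample text and the parameter defaults are all assumptions made for illustration.

```python
import math
import re


def split_sentences(text):
    """Naive sentence splitter (assumption: sentences end with . ! or ?)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def similarity(tokens_a, tokens_b):
    """Word overlap normalized by the log of the sentence sizes."""
    words_a, words_b = set(tokens_a), set(tokens_b)
    if not words_a or not words_b:
        return 0.0
    denom = math.log(len(words_a)) + math.log(len(words_b))
    return len(words_a & words_b) / denom if denom > 0 else 0.0


def textrank_summary(text, n_sentences=2, damping=0.85, iterations=50):
    """Return the n highest-ranked sentences, in their original order."""
    sentences = split_sentences(text)
    tokens = [re.findall(r"\w+", s.lower()) for s in sentences]
    n = len(sentences)
    # Symmetric weighted graph: weights[i][j] is the similarity of i and j
    weights = [
        [similarity(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    # PageRank-style update repeated a fixed number of times
    for _ in range(iterations):
        scores = [
            (1 - damping)
            + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            for i in range(n)
        ]
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:n_sentences]
    return [sentences[i] for i in sorted(top)]


# Hypothetical sample text, not taken from the paper
SAMPLE_TEXT = (
    "The college offers courses in computer science. "
    "Admission to the college requires an entrance exam. "
    "The entrance exam covers computer science topics. "
    "Lunch was tasty."
)
summary = textrank_summary(SAMPLE_TEXT, n_sentences=2)
```

A sentence that shares no words with the rest of the text receives only the base score `1 - damping`, so it is ranked last. A production summarizer would also remove stop words and use a more robust sentence splitter.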


2012 ◽  
Vol 2012 ◽  
pp. 1-8 ◽  
Author(s):  
Anis Zouaghi ◽  
Mounir Zrigui ◽  
Georges Antoniadis ◽  
Laroussi Merhbene

We propose a new approach for determining the appropriate sense of Arabic words. To do so, we propose an algorithm based on information retrieval measures that identifies the context of use closest to the sentence containing the word to be disambiguated. The contexts of use are sets of sentences that each indicate a particular sense of the ambiguous word. These contexts are generated using the words that define the senses of the ambiguous word, an exact string-matching algorithm, and the corpus. We use measures employed in the domain of information retrieval (Harman, Croft, and Okapi), combined with the Lesk algorithm, to select the correct sense among those proposed.
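
The selection step described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' algorithm: it uses only an Okapi BM25 score (the Harman and Croft measures and the Lesk combination are omitted), it uses English toy data instead of Arabic, and the function names, the `CONTEXTS` dictionary and the `k1` and `b` defaults are hypothetical.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, documents, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25."""
    n_docs = len(documents)
    avg_len = sum(len(doc) for doc in documents) / n_docs
    # Document frequency: in how many documents each term occurs
    df = Counter()
    for doc in documents:
        df.update(set(doc))
    scores = []
    for doc in documents:
        tf = Counter(doc)
        score = 0.0
        for term in set(query_tokens):
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def disambiguate(sentence, contexts_by_sense):
    """Return the sense whose contexts of use best match the sentence."""
    senses = list(contexts_by_sense)
    # Merge the context sentences of each sense into one token list
    documents = [
        " ".join(contexts_by_sense[sense]).lower().split() for sense in senses
    ]
    scores = bm25_scores(sentence.lower().split(), documents)
    best = max(range(len(senses)), key=scores.__getitem__)
    return senses[best]


# Hypothetical contexts of use for the ambiguous English word "bank"
CONTEXTS = {
    "financial": [
        "the bank approved the loan",
        "she deposited money at the bank",
    ],
    "river": [
        "the boat reached the river bank",
        "fish swim near the bank of the river",
    ],
}
sense = disambiguate("he deposited money at the bank", CONTEXTS)
```

Words that occur in the contexts of only one sense, such as "deposited" and "money" here, get a higher inverse document frequency than words shared by all senses, which is what lets the closest context of use win.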


2021 ◽  
Vol 47 (05) ◽  
Author(s):  
NGUYỄN CHÍ HIẾU

In recent years, knowledge graphs have been applied in many fields, such as search engines, semantic analysis, and question answering. However, building knowledge graphs still faces many obstacles in terms of methodologies, data, and tools. This paper introduces a novel methodology for building a knowledge graph from heterogeneous documents. We use natural language processing and deep learning methods to build the graph. The knowledge graph can be used in question answering systems and information retrieval, especially in the computing domain.


Author(s):  
Lerina Aversano ◽  
Carmine Grasso ◽  
Maria Tortorella

Evaluating the level of alignment between a business process and its supporting software systems is a critical concern for an organization, because the higher the alignment level, the better the process performance. Monitoring the alignment requires characterizing all the items it involves and defining measures for evaluating it. This is a complex task, and automatic tools that support evaluation and evolution activities can be valuable. This chapter presents the ALBIS Environment (Aligning Business Processes and Information Systems), designed to support software maintenance tasks. In particular, the proposed environment allows the modeling and tracing of business and software entities and the measurement of their degree of alignment. ALBIS embeds an information retrieval approach based on two processing phases: syntactic analysis and semantic analysis. The usefulness of the environment is discussed through two case studies.


2014 ◽  
Vol 4 (3) ◽  
pp. 1-13
Author(s):  
Khadoudja Ghanem

In this paper, the authors propose a semantic approach to document categorization. The idea is to create a semantic index (a representative term vector) for each category by performing a local Latent Semantic Analysis (LSA) followed by a clustering process. LSA is then applied a second time (global LSA) to a term-class matrix in order to retrieve the class most similar to the query (the document to classify), in the same way that LSA is used in information retrieval to retrieve the documents most similar to a query. The proposed system is evaluated on a popular dataset, the 20 Newsgroups corpus. The results show the effectiveness of the method compared with the classic KNN and SVM classifiers as well as with methods presented in the literature. Experimental results show that the new method achieves high precision and recall rates and significantly improves classification accuracy.
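
The global LSA step can be illustrated with a small sketch. It is an assumption-laden simplification, not the authors' system: the local LSA and clustering that build each category's semantic index are replaced by raw term counts, the toy corpus stands in for 20 Newsgroups, NumPy is assumed to be installed, and the names `build_term_class_matrix`, `classify_with_lsa` and `DOCS` are hypothetical.

```python
import numpy as np


def build_term_class_matrix(docs_by_class):
    """Return (vocabulary, class names, term-by-class count matrix)."""
    classes = sorted(docs_by_class)
    vocab = sorted(
        {w for docs in docs_by_class.values() for d in docs for w in d.lower().split()}
    )
    index = {w: i for i, w in enumerate(vocab)}
    matrix = np.zeros((len(vocab), len(classes)))
    for j, name in enumerate(classes):
        for doc in docs_by_class[name]:
            for w in doc.lower().split():
                matrix[index[w], j] += 1
    return vocab, classes, matrix


def classify_with_lsa(query, docs_by_class, k):
    """Return the class closest to the query in a k-dimensional latent space."""
    vocab, classes, matrix = build_term_class_matrix(docs_by_class)
    # Truncated SVD: keep the k strongest latent dimensions
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    u_k, s_k, vt_k = u[:, :k], s[:k], vt[:k, :]
    # Term-count vector of the query; out-of-vocabulary words are ignored
    index = {w: i for i, w in enumerate(vocab)}
    q = np.zeros(len(vocab))
    for w in query.lower().split():
        if w in index:
            q[index[w]] += 1
    q_latent = u_k.T @ q                    # query in the latent space
    class_latent = (s_k[:, None] * vt_k).T  # one latent row per class
    norms = np.linalg.norm(class_latent, axis=1) * np.linalg.norm(q_latent)
    sims = class_latent @ q_latent / (norms + 1e-12)  # cosine similarities
    return classes[int(np.argmax(sims))]


# Hypothetical toy corpus with three classes
DOCS = {
    "politics": ["the election vote was counted", "parliament passed the law"],
    "sports": ["the team won the match", "the player scored a goal"],
    "technology": ["the software update fixed the bug", "the chip runs the software"],
}
label = classify_with_lsa("parliament election vote", DOCS, k=3)
```

With `k` equal to the number of classes nothing is discarded and the ranking matches plain cosine similarity against the class columns. Choosing a smaller `k` is what gives LSA its smoothing effect, at the risk of dropping a direction that separates two classes.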


2019 ◽  
Vol 35 (3/4) ◽  
pp. 146-156
Author(s):  
Bhupendra Singh ◽  
Neelu Jyoti Ahuja

Purpose: This paper aims to popularize information retrieval from palm leaf manuscripts among computer scientists, so that the guidance of this age-old heritage is available in shaping the future. Design/methodology/approach: With computer technology penetrating every aspect of life, information retrieval algorithms can be exploited to help build a system that can dig into the ocean of knowledge in these manuscripts. Findings: The knowledge in them covers all aspects of life, be it religious beliefs, literature, science, mathematics or any other field. However, because the practice of copying their content onto fresh leaves has been discontinued, the manuscripts are now fragile and need to be preserved as soon as possible. Modern means of digitization can help in their preservation. Research limitations: The Government of India and other organizations are doing a commendable job of preserving and safeguarding the country's heritage and age-old knowledge system through the digitization movement. In the years to come, the agonizing problem of manuscript degradation will be eradicated completely. However, when it then comes to mining the treasure of knowledge in these manuscripts, we will be confronted with another difficult situation. Practical implications: The digitization process captures the manuscripts from their present physical palm leaf form into digital image form by taking high-quality photographs. All the text on a palm leaf will be available in the form of images, but a simple search for a word will not be possible on these images. Originality/value: Working towards mining the treasure of knowledge in palm leaf manuscripts, a host of challenges has been outlined. Over and above the problem of preventing the decay of palm leaf manuscripts are the challenges of deciphering text, image analysis, information retrieval and search. Search is further associated with issues of meaningful and useful extraction through semantic analysis. This paper advocates the dire need for systematic research in this field, opening up avenues for past knowledge to guide future prospects in several domains.

