A Topic Modeling Based Approach for Enhancing Corpus Querying

2019 ◽  
Vol 10 (3) ◽  
pp. 38-50
Author(s):  
Nouh Talal Alhindawi ◽  
Belal Abu Ata ◽  
Lana Mahmoud Obeidat ◽  
Mohammad Subhi Al-Batah ◽  
Muad Abu-Ata

In information retrieval, the accuracy of the retrieval process is mainly dependent on query terms selection; therefore, the user must choose the needed terms carefully and selectively. Traditionally, the process of selecting query terms is done manually. However, in the last two decades, a lot of research has been directed towards automating the process of choosing and enhancing query terms. In this article, a new novel approach is presented, which relies on topic modeling in query building and expansion. Two open source systems were selected to perform the experiments, results show that adding the topic's term to the user's query clearly improves its quality and thus, improves the ranking results.

Author(s):  
Aakanksha Sharaff ◽  
Jitesh Kumar Dewangan ◽  
Dilip Singh Sisodia

Enormous records and data are gathered every day. Organization of this data is a challenging task. Topic modeling provides a way to categorize these documents, where high dimensionality of the corpus affects the result of topic model, making it important to apply feature selection or information retrieval process for dimensionality reduction. The requirement for efficient topic modeling includes the removal of unrelated words that might lead to specious coexistence of the unrelated words. This paper proposes an efficient framework for the generation of better topic coherence, where term frequency-inverse document frequency (TF-IDF) and parsimonious language model (PLM) are used for the information retrieval task. PLM extracts the important information and expels the general words from the corpus, whereas TF-IDF re-estimates the weightage of each word in the corpus. The work carried out in this paper improved the topic coherence measure to provide a better correlation among the actual topic and the topics generated from PLM.


2020 ◽  
Vol 10 (17) ◽  
pp. 6038 ◽  
Author(s):  
Musaad Alzahrani ◽  
Saad Alqithami

A commonly observed ambiguity of a class is simply a reflection of multiple methods’ implementation within an individual class. The process of Extract Class refactoring is, therefore, used to separate the different responsibilities of a class into different classes. A major limitation in existing approaches of the Extract Class refactoring is based on factors that are internal to the class, i.e., structural and semantic relationships between methods, in order to identify and separate the responsibilities of the class which are inadequate in many cases. Thus, we propose a novel approach that exploits the clients of the class to support the Extract Class refactoring. The importance of this approach lies in its usefulness to support existing approaches since it involves factors external to the class, i.e., the clients. Moreover, an extensive empirical evaluation is presented to support the proposed method through the utilization of real classes selected from two open source systems. The result shows the potential of our proposed approach and usefulness that leads to an improvement in the quality of the considered classes.


2021 ◽  
Vol 13 (3) ◽  
pp. 402
Author(s):  
Pablo Rodríguez-Gonzálvez ◽  
Manuel Rodríguez-Martín

The thermography as a methodology to quantitative data acquisition is not usually addressed in the degrees of university programs. The present manuscript proposes a novel approach for the acquisition of advanced competences in engineering courses associated with the use of thermographic images via free/open-source software solutions. This strategy is established from a research based on the statistical and three-dimensional visualization techniques over thermographic imagery to improve the interpretation and comprehension of the different sources of error affecting the measurements and, thereby, the conclusions and analysis arising from them. The novelty is focused on the detection of non-normalities in thermographic images, which is illustrates in the experimental section. Additionally, the specific workflow for the generation of learning material related with this aim is raised for asynchronous and e-learning programs. These virtual materials can be easily deployed in an institutional learning management system, allowing the students to work with the models by means of free/open-source solutions easily. Subsequently, the present approach will give new tools to improve the application of professional techniques, will improve the students’ critical sense to know how to interpret the uncertainties in thermography using a single thermographic image, therefore they will be better prepared to face future challenges with more critical thinking.


2007 ◽  
Vol 26 (3) ◽  
pp. 39 ◽  
Author(s):  
Javier Lacasta ◽  
Javier Nogueras-Iso ◽  
Francisco Javier López-Pellicer ◽  
Pedro Rafail Muro-Medrano ◽  
Francisco Javier Zarazaga-Soria

Knowledge organization systems denotes formally represented knowledge that is used within the context of digital libraries to improve data sharing and information retrieval. To increase their use, and to reuse them when possible, it is vital to manage them adequately and to provide them in a standard interchange format. Simple knowledge organization systems (SKOS) seem to be the most promising representation for the type of knowledge models used in digital libraries, but there is a lack of tools that are able to properly manage it. This work presents a tool that fills this gap, facilitating their use in different environments and using SKOS as an interchange format.


Author(s):  
Salil S. Sule ◽  
Aliaksei L. Petsiuk ◽  
Joshua M. Pearce

Centrifuges are commonly required devices in medical diagnostics facilities as well as scientific laboratories. Although there are commercial and open source centrifuges, costs of the former and required electricity to operate the latter, limit accessibility in resource-constrained settings. There is a need for low-cost, human-powered, verified and reliable lab-scale centrifuge. This study provides the designs for a low-cost 100% 3-D printed centrifuge, which can be fabricated on any low-cost RepRap-class fused filament fabrication (FFF) or fused particle fabrication (FPF)-based 3-D printer. In addition, validation procedures are provided using a web camera and free and open source software. This paper provides the complete open source plans including instructions for fabrication and operation for a hand-powered centrifuge. This study successfully tested and validated the instrument, which can be operated anywhere in the world with no electricity inputs obtaining a radial velocity of over 1750rpm and over 50N of relative centrifugal force. Using commercial filament the instrument costs about US$25, which is less than half of all commercially available systems; however, the costs can be dropped further using recycled plastics on open source systems for over 99% savings. The results are discussed in the contexts of resource-constrained medical and scientific facilities.


2019 ◽  
Vol 12 ◽  
pp. 232-239
Author(s):  
Cezary Kryczka

This article is an attempt to answer the question whether and under what conditions it is beneficial to develop an own intelligent building system, when many free open source systems are available. The publication presents the characteristics of author's own home automation system - sHome, as well as the open-source system - Domoticz, in a configuration that is as close to the functionality of the author's system as possible. The work ends with a comparative analysis of the systems and conclusions from the analysis.


2021 ◽  
Vol 8 (4) ◽  
pp. 75-81
Author(s):  
Ahmed A. Alsheikhy ◽  

In real-time systems, a task or a set of tasks needs to be executed and completed successfully within a predefined time. Those systems require a scheduling technique or a set of scheduling methods to distribute the given task or the set of tasks among different processors or on a processor. In this paper, a new novel scheduling approach to minimize the overhead from context switching between several periodic tasks is presented. This method speeds up a required response time while ensuring that all tasks meet their deadline times and there is no deadline miss occurred. It is a dynamic-priority technique that works either on a uniprocessor or several processors. In particular, it is proposed to be applied on multiprocessor environments since many applications run on several processors. Various examples are presented within this paper to demonstrate its optimality and efficiency. In addition, several comparison experiments with an earlier version of this approach were performed to demonstrate its efficiency and effectiveness too. Those experiments showed that this novel approach sped up the execution time from 15% to nearly around 46%. In addition, it proved that it reduced the number of a context switch between tasks from 12% to around 50% as shown from simulation tests. Furthermore, this approach delivered all tasks/jobs successfully and ensured there was no deadline miss happened.


Sign in / Sign up

Export Citation Format

Share Document