scholarly journals K-Means Algorithm for Clustering Afaan Oromo Text Documents using Python Tools

2020 ◽  
Vol 9 (1) ◽  
pp. 1279-1282

With the advancement of technology and proliferation of computers in the country, the amount of Afaan Oromo language news documents produced increasingly which becomes a difficult task for news agencies to organize such huge collection of documents items manually. To solve this problem, researches is conducted using unsupervised machine learning python tools for Afaan Oromo news document clustering with low cost and best quality of clustering solution. In this research work focusing on k-means clustering analysis which produced better results as compared to the other cluster analysis both in terms of time requirement and the quality of the clusters produced

2020 ◽  
Author(s):  
Fábio Rodrigues de la Rocha

Public street lighting management is a well known problemwhich can be revisited from the perspective of Smart Cities.In Smart Cities there is an interconnection of services andinfrastructure to provide sustainable growth and improvementsin citizens’ quality of life. In this research work, weexplore new low cost technologies to create a smart streetlight system capable of monitoring and controlling the lamps,thus reducing the costs with maintenance and allowing amore rational use of electricity.


2020 ◽  
Author(s):  
Saeed Nosratabadi ◽  
Amir Mosavi ◽  
Puhong Duan ◽  
Pedram Ghamisi ◽  
Filip Ferdinand ◽  
...  

Abstract This paper provides the state of the art of data science in economics. Through a novel taxonomy of applications and methods advances in data science are investigated. The data science advances are investigated in three individual classes of deep learning models, ensemble models, and hybrid models. Application domains include stock market, marketing, E-commerce, corporate banking, and cryptocurrency. Prisma method, a systematic literature review methodology is used to ensure the quality of the survey. The findings revealed that the trends are on advancement of hybrid models as more than 51% of the reviewed articles applied hybrid model. On the other hand, it is found that based on the RMSE accuracy metric, hybrid models had higher prediction accuracy than other algorithms. While it is expected the trends go toward the advancements of deep learning models.


2020 ◽  
Author(s):  
Tomohiro Harada ◽  
Misaki Kaidan ◽  
Ruck Thawonmas

Abstract This paper investigates the integration of a surrogate-assisted multi-objective evolutionary algorithm (MOEA) and a parallel computation scheme to reduce the computing time until obtaining the optimal solutions in evolutionary algorithms (EAs). A surrogate-assisted MOEA solves multi-objective optimization problems while estimating the evaluation of solutions with a surrogate function. A surrogate function is produced by a machine learning model. This paper uses an extreme learning surrogate-assisted MOEA/D (ELMOEA/D), which utilizes one of the well-known MOEA algorithms, MOEA/D, and a machine learning technique, extreme learning machine (ELM). A parallelization of MOEA, on the other hand, evaluates solutions in parallel on multiple computing nodes to accelerate the optimization process. We consider a synchronous and an asynchronous parallel MOEA as a master-slave parallelization scheme for ELMOEA/D. We carry out an experiment with multi-objective optimization problems to compare the synchronous parallel ELMOEA/D with the asynchronous parallel ELMOEA/D. In the experiment, we simulate two settings of the evaluation time of solutions. One determines the evaluation time of solutions by the normal distribution with different variances. On the other hand, another evaluation time correlates to the objective function value. We compare the quality of solutions obtained by the parallel ELMOEA/D variants within a particular computing time. The experimental results show that the parallelization of ELMOEA/D significantly reduces the computational time. In addition, the integration of ELMOEA/D with the asynchronous parallelization scheme obtains higher quality of solutions quicker than the synchronous parallel ELMOEA/D.


Author(s):  
Mohammad Reza Ebrahimi Dishabi ◽  
Mohammad Abdollahi Azgomi

Most of the existing privacy preserving clustering (PPC) algorithms do not consider the worst case privacy guarantees and are based on heuristic notions. In addition, these algorithms do not run efficiently in the case of high dimensionality of data. In this paper, to alleviate these challenges, we propose a new PPC algorithm, which is based on Daubechies-2 wavelet transform (D2WT) and preserves the differential privacy notion. Differential privacy is the strong notion of privacy, which provides the worst case privacy guarantees. On the other hand, most of the existing differential-based PPC algorithms generate data with poor utility. If we apply differential privacy properties over the original raw data, the resulting data will offer lower quality of clustering (QOC) during the clustering analysis. Therefore, we use D2WT for the preprocessing of the original data before adding noise to the data. By applying D2WT to the original data, the resulting data not only contains lower dimension compared to the original data, but also can provide differential privacy guarantee with high QOC due to less noise addition. The proposed algorithm has been implemented and experimented over some well-known datasets. We also compare the proposed algorithm with some recently introduced algorithms based on utility and privacy degrees.


2019 ◽  
Vol 62 ◽  
pp. 15-19 ◽  
Author(s):  
Birgit Ludwig ◽  
Daniel König ◽  
Nestor D. Kapusta ◽  
Victor Blüml ◽  
Georg Dorffner ◽  
...  

Abstract Methods of suicide have received considerable attention in suicide research. The common approach to differentiate methods of suicide is the classification into “violent” versus “non-violent” method. Interestingly, since the proposition of this dichotomous differentiation, no further efforts have been made to question the validity of such a classification of suicides. This study aimed to challenge the traditional separation into “violent” and “non-violent” suicides by generating a cluster analysis with a data-driven, machine learning approach. In a retrospective analysis, data on all officially confirmed suicides (N = 77,894) in Austria between 1970 and 2016 were assessed. Based on a defined distance metric between distributions of suicides over age group and month of the year, a standard hierarchical clustering method was performed with the five most frequent suicide methods. In cluster analysis, poisoning emerged as distinct from all other methods – both in the entire sample as well as in the male subsample. Violent suicides could be further divided into sub-clusters: hanging, shooting, and drowning on the one hand and jumping on the other hand. In the female sample, two different clusters were revealed – hanging and drowning on the one hand and jumping, poisoning, and shooting on the other. Our data-driven results in this large epidemiological study confirmed the traditional dichotomization of suicide methods into “violent” and “non-violent” methods, but on closer inspection “violent methods” can be further divided into sub-clusters and a different cluster pattern could be identified for women, requiring further research to support these refined suicide phenotypes.


2011 ◽  
Vol 29 (No. 4) ◽  
pp. 361-372 ◽  
Author(s):  
P. Pavloušek ◽  
M. Kumšta

The quality of grapes is determined above all by the contents of the primary and secondary metabolites. The primary metabolites involve sugars and organic acids, and just these compounds are dealt with in this study. Its objective was to analyse and critically evaluate the primary metabolites in new interspecific varieties and, based on a comparison with European varieties of grapevine (Vitis vinifera L.), to find out the similarities and also possible differences between them. The study evaluates and compares 4 conventional varieties of Vitis vinifera with 11 new interspecific cultivars. The contents and compositions of the individual sugars and acids were estimated by means of the HPLC method. Most of these varieties belong to the group with either medium or low content of malic acid, i.e. with a medium to high β ratio. This corroborates the similarity of interspecific varieties to those of V. vinifera. The cluster analysis identified the existence of two interesting groups of varieties: the first one involved the varieties Riesling, Nativa, Marlen, and Kofranka while the other group consisted of varieties Blaufränkisch, Blauer Portugieser, and Laurot. This observation also indicates similarity between Vitis vinifera L. varieties and interspecific cultivars and demonstrates that the contents of the primary metabolites (i.e. sugars and organic acids) are also comparable.


Behaviour ◽  
2004 ◽  
Vol 141 (1) ◽  
pp. 125-139 ◽  
Author(s):  
Laurene Ratcliffe ◽  
Daniel Mennill

AbstractWithin a network of communicating individuals, animals may gather information about the relative quality of conspecifics by eavesdropping on their signalling interactions. For territorial male songbirds, eavesdropping may be a low-cost, low-risk method for assessing the relative quality of the males around them. We used a three-speaker playback design to evaluate whether male black-capped chickadees (Poecile atricapillus) respond differently to two simulated countersinging intruders who differ only in relative features of their singing performance. We arranged three loudspeakers in an equilateral triangle at the center of playback subjects' territories. After luring males to the first loudspeaker by broadcasting non-song vocalizations, we played songs from the remaining loudspeakers to simulate a countersinging interaction between two male intruders. During the interactions, one simulated intruder consistently overlapped the songs of the other, a behaviour thought to be a signal of directed aggression in songbirds. Territorial male chickadees discriminated between the simulated intruders by preferentially approaching the loudspeaker broadcasting the overlapping signal, suggesting that males eavesdrop on other males' countersinging interactions. Male responses to playback support the idea that overlapping is a more threatening signal than being overlapped. Responses varied with the dominance status of the subject. High-ranking males approached the overlapping loudspeaker in 15 of 16 cases whereas low-ranking males approached the overlapping speaker in only 5 of 10 cases, suggesting that males of different quality may use different tactics for territorial defense.


2019 ◽  
Vol 16 (10) ◽  
pp. 4425-4430 ◽  
Author(s):  
Devendra Prasad ◽  
Sandip Kumar Goyal ◽  
Avinash Sharma ◽  
Amit Bindal ◽  
Virendra Singh Kushwah

Machine Learning is a growing area in computer science in today’s era. This article is focusing on prediction analysis using K-Nearest Neighbors (KNN) Machine Learning algorithm. Data in the dataset are processed, analyzed and predicated using the specified algorithm. Introduction of various Machine Learning algorithms, its pros and cons have been discussed. The KNN algorithm with detail study is given and it is implemented on the specified data with certain parameters. The research work elucidates prediction analysis and explicates the prediction of quality of restaurants.


2019 ◽  
Vol 8 (3) ◽  
pp. 2802-2805

The integration of information and communication technology in school management is now becoming a necessity to adapt to the current global changes and to attach the motivation of leaders to join a process of planning and governance of high quality educational projects. Today, the implementation of digital institutions is characterized by two paths at two different speeds a first the implementation of ICT in the course of education whose subject occupies the interest and use the budget of all innovation projects and the other path that is the digitization of the administration is ranked second in the innovative priorities. Our intervention is part of a technical-pedagogic approach that will focus on an action research work in which, we will try: First, it is about implementing an accessible digital tool, facilitating and organizing strategic planning and project management, and providing ways to govern and control the quality of administrative acts. Then, We will show the first results of the experimentation of this tool in our research context represented by 335 directors of schools of the provincial delegation of Taza. Morocco, highlighting the contributions of the digital on administrative practices at all levels. Finally, we present the challenges identified by our system to improve the quality of school management and professionalize the act of management of human resources and material resources. On the other hand, we will focus on all the constraints and resistances that could hinder this development and innovation action


Sign in / Sign up

Export Citation Format

Share Document