BAYESIAN SELF-ORGANIZING MAP FOR DATA CLASSIFICATION AND CLUSTERING

Author(s):  
XIAOLIAN GUO ◽  
HAIYING WANG ◽  
DAVID H. GLASS

The Bayesian self-organizing map (BSOM) has typically been used for density estimation. In this study, we implemented an adaptation of the model for performing unsupervized and supervised classification. In order to determine the optimal number of neurons to represent the given dataset during the learning process, an extended Bayesian learning process is proposed called the growing BSOM. It starts with two neurons and adds new neurons to the network via a process in which the neuron with the lowest individual log-likelihood is identified. The system has been tested using three synthetic datasets and one real dataset. The experimental results suggest that the BSOM-based approach can achieve better classification performance in comparisons to several widely-used models such as k-nearest neighbor (KNN), support vector machine (SVM) and Gaussian mixture model (GMM). By using the Bayesian information criterion (BIC) as a stopping criterion, the growing BSOM can model the data under study and estimate the number of clusters.


Author(s):  
S. Vijaya Rani ◽  
G. N. K. Suresh Babu

The illegal hackers  penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods  available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.



2012 ◽  
Vol 12 (05) ◽  
pp. 1240028 ◽  
Author(s):  
EE PING NG ◽  
TEIK-CHENG LIM ◽  
SUBHAGATA CHATTOPADHYAY ◽  
MURALIDHAR BAIRY

Epilepsy is a common neurological disorder characterized by recurrence seizures. Alcoholism causes organic changes in the brain, resulting in seizure attacks similar to epileptic fits. Hence, it is challenging to differentiate the cause of fits as epileptic or alcoholism, which is important for deciding on the treatment in the neurology ward. The focus of this paper is to automatically differentiate epileptic, normal, and alcoholic electroencephalogram (EEG) signals. As the EEG signals are non-linear and dynamic in nature, it is difficult to tell the subtle changes in these signals with the help of linear techniques or by the naked eye. Therefore, to analyze the normal (control), epileptic, and alcoholic EEG signals, two non-linear methods, such as recurrence plots (RPs) and then recurrence quantification analysis (RQA) are adopted. Approximately 10 RQA parameters have been used to classify the EEG signals into three distinct classes, i.e., normal, epileptic, and alcoholic. Six classifiers, such as support vector machine (SVM), radial basis probabilistic neural network (RBPNN), decision tree (DT), Gaussian mixture model (GMM), k-nearest neighbor (kNN), and fuzzy Sugeno classifiers have been developed to accomplish this task. Results show that the GMM classifier outperformed the other classifiers with a classification sensitivity of 99.6%, specificity of 98.3%, and accuracy of 98.6%.



Knowledge discovery is also known as Data mining in databases, in recent years that technique plays a major role in research area. Data mining in healthcare domain has noteworthy usage in real world. The mining method can enable the healthcare field for the enhancement of institutionalization of its administrations and become quicker with best in class technologies. Innovation utilization isn't restricted to basic leadership in undertakings, yet spread to different social statuses in all fields. In this paper a novel approach for the detection of brain tumor is proposed. The novel approach uses the classification technique of K-nearest neighbor (KNN) and for ignoring the error of the dataset image SOM (self-organizing map) algorithm has been used. Discrete wavelet transform (DWT) is used for transforming input image data set, in which RGB color of input data image has been converted into gray scale. Then it has been classified using KNN after that the error avoiding algorithm has been carried out. This will help to differentiate tumor cells and the normal cells. The presence of tumor in brain image is detected using parametric analysis by simulation.



Author(s):  
Samir Bandyopadhyay ◽  
Amiya Bhaumik ◽  
Sandeep Poddar

Skin disease is a very common disease for humans. In the medical industry detecting skin disease and recognizing its type is a very challenging task. Due to the complexity of human skin texture and the visual closeness effect of the diseases, sometimes it is really difficult to detect the exact type. Therefore, it is necessary to detect and recognize the skin disease at its very first observation. In today's era, artificial intelligence (AI) is rapidly growing in medical fields. Different machine learning (ML) and deep learning(DL) algorithms are used for diagnostic purposes. These methods drastically improve the diagnosis process and also speed up the process. In this paper, a brief comparison between the machine learning process and the deep learning process was discussed. In both processes, three different and popular algorithms are used. For the machine Learning process Bagged Tree Ensemble, K-Nearest Neighbor (KNN), and Support Vector Machine(SVM) algorithms were used. For the deep learning process three pre-trained deep neural network models



2014 ◽  
Vol 14 (03) ◽  
pp. 1450035 ◽  
Author(s):  
OLIVER FAUST ◽  
PENG CHUAN ALVIN ANG ◽  
SUBHA D. PUTHANKATTIL ◽  
PAUL K. JOSEPH

Electroencephalography (EEG) is a measure which represents the functional activity of the brain. We show that a detailed analysis of EEG measurements provides highly discriminant features which indicate the mental state of patients with clinical depression. Our feature extraction method revolves around a novel processing structure that combines wavelet packet decomposition (WPD) and non-linear algorithms. WPD was used to select appropriate EEG frequency bands. The resulting signals were processed with the non-linear measures of approximate entropy (ApEn), sample entropy (SampEn), renyi entropy (REN) and bispectral phase entropy ( P h). The features were selected using t-test and only discriminative features were fed to various classifiers, namely probabilistic neural network (PNN), support vector machine (SVM), decision tree (DT), k-nearest neighbor algorithm (k-NN), naive bayes classification (NBC), Gaussian mixture model (GMM) and Fuzzy Sugeno Classifier (FSC). Our classification results show that, with a classification accuracy of 99.5%, the PNN classifier performed better than the rest of classifiers in discriminating between normal and depression EEG signals. Hence, the proposed decision support system can be used to diagnose, and monitor the treatment of patients suffering from depression.



2012 ◽  
Vol 22 (02) ◽  
pp. 1250002 ◽  
Author(s):  
U. RAJENDRA ACHARYA ◽  
S. VINITHA SREE ◽  
PENG CHUAN ALVIN ANG ◽  
RATNA YANTI ◽  
JASJIT S. SURI

Epilepsy, a neurological disorder, is characterized by the recurrence of seizures. Electroencephalogram (EEG) signals, which are used to detect the presence of seizures, are non-linear and dynamic in nature. Visual inspection of the EEG signals for detection of normal, interictal, and ictal activities is a strenuous and time-consuming task due to the huge volumes of EEG segments that have to be studied. Therefore, non-linear methods are being widely used to study EEG signals for the automatic monitoring of epileptic activities. The aim of our work is to develop a Computer Aided Diagnostic (CAD) technique with minimal pre-processing steps that can classify all the three classes of EEG segments, namely normal, interictal, and ictal, using a small number of highly discriminating non-linear features in simple classifiers. To evaluate the technique, segments of normal, interictal, and ictal EEG segments (100 segments in each class) were used. Non-linear features based on the Higher Order Spectra (HOS), two entropies, namely the Approximation Entropy (ApEn) and the Sample Entropy (SampEn), and Fractal Dimension and Hurst Exponent were extracted from the segments. Significant features were selected using the ANOVA test. After evaluating the performance of six classifiers (Decision Tree, Fuzzy Sugeno Classifier, Gaussian Mixture Model, K-Nearest Neighbor, Support Vector Machine, and Radial Basis Probabilistic Neural Network) using a combination of the selected features, we found that using a set of all the selected six features in the Fuzzy classifier resulted in 99.7% classification accuracy. We have demonstrated that our technique is capable of achieving high accuracy using a small number of features that accurately capture the subtle differences in the three different types of EEG (normal, interictal, and ictal) segments. The technique can be easily written as a software application and used by medical professionals without any extensive training and cost. Such software can evolve into an automatic seizure monitoring application in the near future and can aid the doctors in providing better and timely care for the patients suffering from epilepsy.



2021 ◽  
Vol 19 ◽  
pp. 211-216
Author(s):  
A. D. Gonzalez-Abreu ◽  
◽  
M. Delgado-Prieto ◽  
J.J. Saucedo-Dorantes ◽  
R.A. Osornio-Rios

Complex disturbance patterns take place over the corresponding power supply networks due to the increased complexity of electrical loads at industrial plants. Such complex patterns are the result of a combination of simpler standardized disturbances. However, their detection and identification represent a challenge to current power quality monitoring systems. The detection of disturbances and their identification would allow early and effective decision-making processes towards optimal power grid controls or maintenance and security operations of the grid. In this regard, this paper presents an evaluation of the four main techniques for novelty detection: k-Nearest Neighbor, Gaussian Mixture Models, One-Class Support Vector Machine, and Stacked Autoencoder. A set of synthetic signals have been considered to evaluate the performance and suitability of each technique as an anomaly detector applied to power quality disturbances. A set of statistical features have been considered to characterize the power line. The evaluation of the techniques is carried out throughout different scenarios considering combined and single disturbances. The obtained results show the complementary performance of the considered techniques in front of different scenarios due to their differences in the knowledge modelization.



2019 ◽  
Vol 20 (5) ◽  
pp. 488-500 ◽  
Author(s):  
Yan Hu ◽  
Yi Lu ◽  
Shuo Wang ◽  
Mengying Zhang ◽  
Xiaosheng Qu ◽  
...  

Background: Globally the number of cancer patients and deaths are continuing to increase yearly, and cancer has, therefore, become one of the world&#039;s highest causes of morbidity and mortality. In recent years, the study of anticancer drugs has become one of the most popular medical topics. </P><P> Objective: In this review, in order to study the application of machine learning in predicting anticancer drugs activity, some machine learning approaches such as Linear Discriminant Analysis (LDA), Principal components analysis (PCA), Support Vector Machine (SVM), Random forest (RF), k-Nearest Neighbor (kNN), and Naïve Bayes (NB) were selected, and the examples of their applications in anticancer drugs design are listed. </P><P> Results: Machine learning contributes a lot to anticancer drugs design and helps researchers by saving time and is cost effective. However, it can only be an assisting tool for drug design. </P><P> Conclusion: This paper introduces the application of machine learning approaches in anticancer drug design. Many examples of success in identification and prediction in the area of anticancer drugs activity prediction are discussed, and the anticancer drugs research is still in active progress. Moreover, the merits of some web servers related to anticancer drugs are mentioned.



Author(s):  
I Made Oka Widyantara ◽  
I Made Dwi Asana Putra ◽  
Ida Bagus Putu Adnyana

This paper intends to explain the development of Coastal Video Monitoring System (CoViMoS) with the main characteristics including low-cost and easy implementation. CoViMoS characteristics have been realized using the device IP camera for video image acquisition, and development of software applications with the main features including detection of shoreline and it changes are automatically. This capability was based on segmentation and classification techniques based on data mining. Detection of shoreline is done by segmenting a video image of the beach, to get a cluster of objects, namely land, sea and sky, using Self Organizing Map (SOM) algorithms. The mechanism of classification is done using K-Nearest Neighbor (K-NN) algorithms to provide the class labels to objects that have been generated on the segmentation process. Furthermore, the classification of land used as a reference object in the detection of costline. Implementation CoViMoS system for monitoring systems in Cucukan Beach, Gianyar regency, have shown that the developed system is able to detect the shoreline and its changes automatically.



Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1274
Author(s):  
Daniel Bonet-Solà ◽  
Rosa Ma Alsina-Pagès

Acoustic event detection and analysis has been widely developed in the last few years for its valuable application in monitoring elderly or dependant people, for surveillance issues, for multimedia retrieval, or even for biodiversity metrics in natural environments. For this purpose, sound source identification is a key issue to give a smart technological answer to all the aforementioned applications. Diverse types of sounds and variate environments, together with a number of challenges in terms of application, widen the choice of artificial intelligence algorithm proposal. This paper presents a comparative study on combining several feature extraction algorithms (Mel Frequency Cepstrum Coefficients (MFCC), Gammatone Cepstrum Coefficients (GTCC), and Narrow Band (NB)) with a group of machine learning algorithms (k-Nearest Neighbor (kNN), Neural Networks (NN), and Gaussian Mixture Model (GMM)), tested over five different acoustic environments. This work has the goal of detailing a best practice method and evaluate the reliability of this general-purpose algorithm for all the classes. Preliminary results show that most of the combinations of feature extraction and machine learning present acceptable results in most of the described corpora. Nevertheless, there is a combination that outperforms the others: the use of GTCC together with kNN, and its results are further analyzed for all the corpora.



Sign in / Sign up

Export Citation Format

Share Document