Dynamic classification approach using scalable ensemble of autoencoders to classify data with drift

2021 ◽  
Vol 2134 (1) ◽  
pp. 012009
Author(s):  
Anastasiya O Gurina ◽  
Vladimir L Eliseev ◽  
Sergey V Kolpinskiy

The problem of classification under concept drift conditions is investigated. The importance of anomaly detection is emphasized as a key feature of a successful approach to handling adversarial attacks and data poisoning. An approach to classification in the context of both drift and anomalies is introduced. It is based on an ensemble of one-class classifiers implemented as neural network autoencoders. Numeric parameters and supplementary logic are also proposed to distinguish between different classification cases. The quality of the classifiers is estimated by original characteristics (EDCA), which examine both the training set area and the area around it. The proposed approach is evaluated on synthetic data to highlight its properties under various conditions, including normal, drift, new class and anomaly cases.
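The idea of an ensemble of one-class classifiers can be illustrated with a minimal sketch. It is not the authors' implementation: each autoencoder is replaced by a hypothetical stand-in (`OneClassModel`) that "reconstructs" every input as the class mean, and the acceptance threshold (`margin` times the largest training error) is an assumption made only for this example. The drift-handling logic and the EDCA characteristics from the abstract are not modeled.

```python
import math


class OneClassModel:
    """Hypothetical stand-in for one autoencoder of the ensemble.

    It "reconstructs" any input as the mean of its training samples, so the
    reconstruction error is simply the distance to the class centre.
    """

    def __init__(self, samples, margin=1.5):
        dim = len(samples[0])
        self.center = [sum(s[d] for s in samples) / len(samples) for d in range(dim)]
        # Assumed rule: accept anything within `margin` times the worst training error.
        self.threshold = margin * max(self.error(s) for s in samples)

    def error(self, x):
        return math.dist(x, self.center)

    def accepts(self, x):
        return self.error(x) <= self.threshold


def classify(models, x):
    """Return the label of the accepting model with the lowest error, or None.

    None means that no one-class model recognises the sample, i.e. it is a
    candidate anomaly or a member of a class not seen before.
    """
    accepted = {label: m.error(x) for label, m in models.items() if m.accepts(x)}
    if not accepted:
        return None
    return min(accepted, key=accepted.get)


models = {
    "a": OneClassModel([(0, 0), (1, 0), (0, 1), (1, 1)]),
    "b": OneClassModel([(10, 10), (11, 10), (10, 11), (11, 11)]),
}
print(classify(models, (0.4, 0.6)))    # sample close to class "a"
print(classify(models, (5.0, 5.0)))    # sample far from both classes
```

Because every class has its own model, adding a class in this sketch only requires adding one more entry to `models`; the existing models are left untouched.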

2021 ◽  
Vol 263 (1) ◽  
pp. 5397-5408
Author(s):  
Wagner Gonçalves Pinto ◽  
Michaël Bauerheim ◽  
Hélène Parisot-Dupuis

Localization and quantification of noise sources is an important scientific and industrial problem, and phased arrays of microphones are the standard technique in many applications. Non-physical artifacts appear in the output due to the nature of the method; thus, a supplementary step known as deconvolution is often performed. Data-driven machine learning is a candidate for solving this problem. Neural networks can be extremely advantageous since, unlike classical deconvolution techniques, they require no hypothesis concerning the environment or the characteristics of the sources. Information on the acoustic propagation is implicitly extracted from pairs of source-output maps. In this work, a convolutional neural network is trained to deconvolve the beamforming map obtained from synthetic data simulating the response of an array of microphones. The quality of the estimation and the computational cost are compared to those of classical deconvolution methods (DAMAS, CLEAN-SC). Constraints associated with the size of the dataset used for training the neural network are also investigated and presented.


Author(s):  
Giovanni Pilato ◽  
Filippo Sorbello ◽  
Giorgio Vassallo

In this paper, three quality factors are introduced in order to measure the quality of a neural network. Each factor deals with a particular feature of quality: the ability of the network to learn the training set samples; the generalization capability, related to the gradient of the network output function in the neighborhood of the training patterns; and the computational cost of the architecture during the production phase, related to the number of connections between neural units. The validity of the proposed solution has been tested using three well-known benchmarks. Experimental results show that the quality factors introduced in this paper can be a valid alternative to the test set method.


1997 ◽  
Author(s):  
Daniel Benzing ◽  
Kevin Whitaker ◽  
Dedra Moore ◽  
...  

Author(s):  
P.L. Nikolaev

This article deals with a method for binary classification of images containing small text. Classification is based on the fact that the text can have two orientations: it can be positioned horizontally and read from left to right, or it can be turned 180 degrees, so that the image must be rotated to read it. This type of text can be found on the covers of a variety of books, so when recognizing covers it is necessary to determine the orientation of the text before recognizing the text itself. The article proposes a deep neural network for determining the text orientation in the context of book cover recognition. The results of training and testing a convolutional neural network on synthetic data are presented, as well as examples of the network operating on real data.


2019 ◽  
Vol 8 (3) ◽  
pp. 6634-6643 ◽  

Opinion mining and sentiment analysis are valuable for extracting useful subjective information from text documents. Predicting customers' opinions on Amazon products has several benefits, such as reducing customer churn, agent monitoring, handling multiple customers, tracking overall customer satisfaction, quick escalations, and upselling opportunities. However, finding user sentiment in large datasets is a challenging task for researchers because of the unstructured nature of the data, slang, misspellings and abbreviations. To address this problem, a new system is developed in this research study. The proposed system comprises four major phases: data collection, pre-processing, keyword extraction, and classification. Initially, the input data were collected from the Amazon customer review dataset. After collecting the data, pre-processing was carried out to enhance the quality of the collected data. The pre-processing phase comprises three steps: lemmatization, review spam detection, and removal of stop-words and URLs. Then, a topic modelling approach, Latent Dirichlet Allocation (LDA), together with modified Possibilistic Fuzzy C-Means (PFCM), was applied to extract the keywords and to help identify the topics concerned. The extracted keywords were classified into three classes (positive, negative and neutral) by applying a machine learning classifier, a Convolutional Neural Network (CNN). The experimental outcome showed that the proposed system improved the accuracy of sentiment analysis by 6-20% relative to existing systems.


2021 ◽  
Vol 18 (1) ◽  
pp. 172988142199332
Author(s):  
Xintao Ding ◽  
Boquan Li ◽  
Jinbao Wang

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on the region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric properties in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are carried out on two indoor datasets, SUN2012 and NYUv2. Since depth information is available in NYUv2, we add depth constraints to GP-Faster and propose a 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the mean average precision.
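The 2D constraint described above, a convex hull around the (width, height) pairs of the ground-truth boxes, can be sketched as follows. This is an illustration under assumptions, not the GP-Faster code: the helper names (`convex_hull`, `inside_hull`, `refine_rois`) are hypothetical, "refining" is interpreted here as simply discarding RoIs whose size falls outside the hull, and the box sizes are made-up values. The hull is built with the standard monotone chain algorithm and is assumed to have at least three vertices.

```python
def cross(o, a, b):
    """Z-component of the cross product (a - o) x (b - o)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Monotone chain convex hull; returns vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def inside_hull(hull, p):
    """True if p lies inside or on a counter-clockwise convex polygon."""
    n = len(hull)
    return all(cross(hull[i], hull[(i + 1) % n], p) >= 0 for i in range(n))


def refine_rois(rois, hull):
    """Keep only RoIs (x1, y1, x2, y2) whose (width, height) lies in the hull."""
    return [r for r in rois if inside_hull(hull, (r[2] - r[0], r[3] - r[1]))]


# Made-up (width, height) pairs of ground-truth boxes for one class.
gt_sizes = [(40, 60), (50, 80), (60, 60), (45, 100), (70, 90)]
hull = convex_hull(gt_sizes)

rois = [(10, 10, 60, 90), (0, 0, 200, 20)]  # sizes (50, 80) and (200, 20)
kept = refine_rois(rois, hull)
print(kept)
```

In this toy case the first RoI has a plausible size for the class and is kept, while the second, a very wide and flat box, is rejected.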


2021 ◽  
Vol 11 (6) ◽  
pp. 2838
Author(s):  
Nikitha Johnsirani Venkatesan ◽  
Dong Ryeol Shin ◽  
Choon Sung Nam

In the pharmaceutical field, early detection of lung nodules is indispensable for increasing patient survival. The quality of medical images can be enhanced by intensifying the radiation dose. However, a high radiation dose can provoke cancer, which forces experts to use limited radiation, and limiting the radiation introduces noise in CT scans. We propose an optimal Convolutional Neural Network model in which Gaussian noise is removed for better classification and increased training accuracy. Experimental demonstration on the LUNA16 dataset of size 160 GB shows that our proposed method exhibits superior results. Classification accuracy, specificity, sensitivity, precision, recall, F1 measurement, and area under the ROC curve (AUC) are taken as evaluation metrics of model performance. We conducted a performance comparison of our proposed model on several platforms, namely Apache Spark, GPU, and CPU, to reduce the training time without compromising accuracy. Our results show that Apache Spark, integrated with a deep learning framework, is suitable for parallel training computation with high accuracy.


Author(s):  
Raul E. Avelar ◽  
Karen Dixon ◽  
Boniphace Kutela ◽  
Sam Klump ◽  
Beth Wemple ◽  
...  

The calibration of safety performance functions (SPFs) is a mechanism included in the Highway Safety Manual (HSM) to adjust SPFs in the HSM for use in intended jurisdictions. Critically, the quality of the calibration procedure must be assessed before using the calibrated SPFs. Multiple resources to aid practitioners in calibrating SPFs have been developed in the years following the publication of the HSM 1st edition. Similarly, the literature suggests multiple ways to assess the goodness-of-fit (GOF) of a calibrated SPF to a data set from a given jurisdiction. This paper uses the calibration results of multiple intersection SPFs to a large Mississippi safety database to examine the relations between multiple GOF metrics. The goal is to develop a sensible single index that leverages the joint information from multiple GOF metrics to assess overall quality of calibration. A factor analysis applied to the calibration results revealed three underlying factors explaining 76% of the variability in the data. From these results, the authors developed an index and performed a sensitivity analysis. The key metrics were found to be, in descending order: the deviation of the cumulative residual (CURE) plot from the 95% confidence area, the mean absolute deviation, the modified R-squared, and the value of the calibration factor. This paper also presents comparisons between the index and alternative scoring strategies, as well as an effort to verify the results using synthetic data. The developed index is recommended to comprehensively assess the quality of the calibrated intersection SPFs.
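Three of the quantities named in the abstract can be made concrete with a small sketch. It assumes the usual HSM-style definition of the calibration factor as total observed crashes divided by total predicted crashes; the crash counts are made up, and the function names are hypothetical. The 95% confidence band of the CURE plot, the modified R-squared, and the authors' combined index are not reproduced here.

```python
def calibration_factor(observed, predicted):
    """Ratio of total observed crashes to total uncalibrated SPF predictions."""
    return sum(observed) / sum(predicted)


def mean_absolute_deviation(observed, predicted, c):
    """Average absolute gap between observed and calibrated predicted crashes."""
    return sum(abs(o - c * p) for o, p in zip(observed, predicted)) / len(observed)


def cure_values(observed, predicted, c):
    """Cumulative residuals, with sites ordered by increasing predicted value."""
    running, curve = 0.0, []
    for p, o in sorted(zip(predicted, observed)):
        running += o - c * p
        curve.append(running)
    return curve


# Made-up data for four intersections.
observed = [2, 0, 5, 3]
predicted = [1.0, 0.5, 2.5, 1.0]

c = calibration_factor(observed, predicted)
mad = mean_absolute_deviation(observed, predicted, c)
cure = cure_values(observed, predicted, c)
print(c, mad, cure)
```

With the factor defined as a ratio of totals, the calibrated predictions sum to the observed total, so the cumulative residual curve always ends at zero; what a GOF assessment looks at is how far it wanders in between.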


Energies ◽  
2021 ◽  
Vol 14 (6) ◽  
pp. 1527
Author(s):  
R. Senthil Kumar ◽  
K. Mohana Sundaram ◽  
K. S. Tamilselvan

The extensive usage of power electronic components creates harmonics in the voltage and current, which degrade the quality of the delivered power. It is therefore essential to improve power quality, which is the subject of this paper. The problems of load voltage, source current, and power factor are mitigated by utilizing the unified power flow controller (UPFC), in which series and shunt converters are combined through a DC-link capacitor. To maximize the delivered power, a PV module is introduced with a high gain converter, named the switched clamped diode boost (SCDB) converter, in which the grey wolf optimization (GWO) algorithm is used for tracking the maximum power. To retain the link voltage of the capacitor, an artificial neural network (ANN) is implemented. Proper control of the UPFC is essential and is achieved by reference current generation with the aid of a hybrid algorithm. A genetic algorithm, hybridized with the radial basis function neural network (RBFNN), is utilized for the generation of a switching sequence, and the generated pulses are given to both the series and shunt converters through the PWM generator. Thus, the source current and load voltage harmonics are mitigated with reactive power compensation, which results in attaining a unity power factor. The proposed methodology is simulated in MATLAB; a total harmonic distortion (THD) of 0.84% is attained, with almost a unity power factor, and this is validated with FPGA Spartan 6E hardware.
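For readers unfamiliar with the THD figure quoted above, the following sketch shows one common way to compute it: the root-sum-square of the harmonic amplitudes divided by the amplitude of the fundamental. It is unrelated to the authors' MATLAB simulation; the waveform is synthetic, the function names are hypothetical, and the window is assumed to hold an integer number of fundamental cycles so that each harmonic falls exactly on one DFT bin.

```python
import cmath
import math


def harmonic_amplitude(samples, cycles):
    """Amplitude of the sinusoid completing `cycles` periods in the window.

    Evaluates a single DFT bin directly, so no FFT library is needed.
    """
    n = len(samples)
    acc = sum(x * cmath.exp(-2j * math.pi * cycles * i / n)
              for i, x in enumerate(samples))
    return 2 * abs(acc) / n


def thd(samples, fundamental_cycles, max_harmonic=10):
    """Total harmonic distortion as a fraction of the fundamental amplitude."""
    fundamental = harmonic_amplitude(samples, fundamental_cycles)
    harmonics = [harmonic_amplitude(samples, h * fundamental_cycles)
                 for h in range(2, max_harmonic + 1)]
    return math.sqrt(sum(a * a for a in harmonics)) / fundamental


# Synthetic current: unit fundamental plus 5% third and 2% fifth harmonic.
n, f0 = 1000, 5  # samples in the window, fundamental cycles in the window
signal = [math.sin(2 * math.pi * f0 * i / n)
          + 0.05 * math.sin(2 * math.pi * 3 * f0 * i / n)
          + 0.02 * math.sin(2 * math.pi * 5 * f0 * i / n)
          for i in range(n)]

print(f"THD = {100 * thd(signal, f0):.2f}%")
```

The highest harmonic examined must stay below half the number of samples in the window (the Nyquist limit); here the tenth harmonic corresponds to 50 cycles in a 1000-sample window.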

