Learning, compression, and leakage: Minimising classification error via meta-universal compression principles

Author(s):  
Fernando E. Rosas ◽  
Pedro A. M. Mediano ◽  
Michael Gastpar
Cybersecurity ◽  
2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Shushan Arakelyan ◽  
Sima Arasteh ◽  
Christophe Hauser ◽  
Erik Kline ◽  
Aram Galstyan

Tackling binary program analysis problems has traditionally meant manually defining rules and heuristics, a tedious and time-consuming task for human analysts. To improve automation and scalability, we propose an alternative direction based on distributed representations of binary programs with applicability to a number of downstream tasks. We introduce Bin2vec, a new approach leveraging Graph Convolutional Networks (GCN) along with computational program graphs to learn a high-dimensional representation of binary executable programs. We demonstrate the versatility of this approach by using our representations to solve two semantically different binary analysis tasks: functional algorithm classification and vulnerability discovery. We compare the proposed approach to our own strong baseline as well as to published results, and demonstrate improvement over state-of-the-art methods for both tasks. We evaluated Bin2vec on 49,191 binaries for the functional algorithm classification task, and on 30 different CWE-IDs, each containing at least 100 CVE entries, for the vulnerability discovery task. We set a new state-of-the-art result by reducing the classification error by 40% compared to the source-code-based inst2vec approach, while working on binary code. For almost every vulnerability class in our dataset, our prediction accuracy is over 80% (and over 90% for multiple classes).
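The core operation behind this kind of representation learning is a graph convolution over a program graph. The snippet below is a minimal NumPy sketch of one GCN layer using the common symmetric-normalisation formulation, followed by mean-pooling into a graph-level embedding; the toy graph, feature dimensions, and weights are illustrative placeholders, not the Bin2vec implementation.

```python
import numpy as np

def gcn_layer(A, H, W):
    """One graph-convolution layer: H' = ReLU(D^-1/2 (A + I) D^-1/2 H W).

    A: (n, n) adjacency matrix of the program graph (illustrative).
    H: (n, d_in) node features (e.g., instruction embeddings).
    W: (d_in, d_out) learnable weights.
    """
    A_hat = A + np.eye(A.shape[0])            # add self-loops
    deg = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(deg))  # symmetric normalisation
    H_next = D_inv_sqrt @ A_hat @ D_inv_sqrt @ H @ W
    return np.maximum(H_next, 0.0)            # ReLU

# Toy program graph with 4 nodes and random features/weights.
rng = np.random.default_rng(0)
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 1],
              [0, 1, 0, 1],
              [0, 1, 1, 0]], dtype=float)
H = rng.normal(size=(4, 8))    # 8-dim node features
W = rng.normal(size=(8, 16))   # project to a 16-dim representation
graph_repr = gcn_layer(A, H, W).mean(axis=0)  # mean-pool nodes into a graph embedding
```

Stacking several such layers lets information propagate along the program graph before the pooled embedding is handed to a downstream classifier.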


Author(s):  
Leijin Long ◽  
Feng He ◽  
Hongjiang Liu

To monitor the high-level landslides that frequently occur in the Jinsha River area of Southwest China, and to protect the lives and property of people in mountainous areas, satellite remote sensing images are combined with various landslide-inducing factors and transformed into landslide influence factors, providing the data basis for a landslide detection model. Then, based on the deep belief network (DBN) and convolutional neural network (CNN) algorithms, two landslide detection models, DBN and a convolutional neural-deep belief network (CDN), are established to monitor high-level landslides along the Jinsha River. The influence of the model parameters on the landslide detection results is analyzed, and the accuracy of the DBN and CDN models in dealing with actual landslide problems is compared. The results show that when the number of neurons in the DBN is 100, the overall error is at its minimum, and when the number of learning layers is 3, the classification error is at its minimum. The detection accuracy of DBN and CDN is 97.56% and 97.63%, respectively, which indicates that both models are feasible for detecting landslides from remote sensing images. This exploration provides a reference for the study of high-level landslide disasters along the Jinsha River.
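As a rough illustration of how a DBN-style model can be assembled for this kind of pixel/patch classification, the sketch below stacks Bernoulli RBMs with a logistic-regression output in scikit-learn. The layer sizes (100 units per layer, 3 learning layers) follow the abstract, but the data, learning rates, and overall setup are illustrative and do not reproduce the authors' pipeline (in particular, the RBM layers here are only pretrained unsupervised, not fine-tuned end to end).

```python
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Toy stand-in for landslide influence factors: each row is one image cell,
# with features scaled to [0, 1] (BernoulliRBM expects values in that range).
rng = np.random.default_rng(0)
X = rng.random((500, 64))
y = (X.mean(axis=1) > 0.5).astype(int)  # placeholder labels: landslide / no landslide

# Three RBM "learning layers" of 100 units each, then a logistic classifier.
dbn_like = Pipeline([
    ("rbm1", BernoulliRBM(n_components=100, learning_rate=0.05, n_iter=10, random_state=0)),
    ("rbm2", BernoulliRBM(n_components=100, learning_rate=0.05, n_iter=10, random_state=0)),
    ("rbm3", BernoulliRBM(n_components=100, learning_rate=0.05, n_iter=10, random_state=0)),
    ("clf", LogisticRegression(max_iter=1000)),
])
dbn_like.fit(X, y)
print("training accuracy:", dbn_like.score(X, y))
```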


2021 ◽  
Vol 11 (9) ◽  
pp. 4292
Author(s):  
Mónica Y. Moreno-Revelo ◽  
Lorena Guachi-Guachi ◽  
Juan Bernardo Gómez-Mendoza ◽  
Javier Revelo-Fuelagán ◽  
Diego H. Peluffo-Ordóñez

Automatic crop identification and monitoring is a key element in enhancing food production processes as well as diminishing the related environmental impact. Although several efficient deep learning techniques have emerged in the field of multispectral imagery analysis, the crop classification problem still needs more accurate solutions. This work introduces a competitive methodology for crop classification from multispectral satellite imagery, mainly using an enhanced 2D convolutional neural network (2D-CNN) designed as a smaller-scale architecture, together with a novel post-processing step. The proposed methodology contains four steps: image stacking, patch extraction, classification model design (based on a 2D-CNN architecture), and post-processing. First, the images are stacked to increase the number of features. Second, the input images are split into patches and fed into the 2D-CNN model. Then, the 2D-CNN model is constructed within a small-scale framework and properly trained to recognize 10 different types of crops. Finally, a post-processing step is performed to reduce the classification error caused by lower-spatial-resolution images. Experiments were carried out on the so-called Campo Verde database, a set of satellite images captured by the Landsat and Sentinel satellites over the municipality of Campo Verde, Brazil. In contrast to the maximum accuracy values reached by notable works reported in the literature (an overall accuracy of about 81%, an F1 score of 75.89%, and an average accuracy of 73.35%), the proposed methodology achieves a competitive overall accuracy of 81.20%, an F1 score of 75.89%, and an average accuracy of 88.72% when classifying 10 different crops, while ensuring an adequate trade-off between the number of multiply-accumulate operations (MACs) and accuracy. Furthermore, given its ability to effectively classify patches from two image sequences, this methodology may prove appealing for other real-world applications, such as the classification of urban materials.
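A minimal PyTorch sketch of the patch-based idea (stacked multispectral bands split into patches and fed to a small 2D-CNN) is given below; the band count, patch size, and layer configuration are placeholders for illustration, not the paper's exact network.

```python
import torch
import torch.nn as nn

NUM_BANDS, PATCH, NUM_CROPS = 10, 16, 10  # illustrative: stacked bands, patch size, crop classes

def extract_patches(stack, patch=PATCH):
    """Split a (bands, H, W) image stack into non-overlapping (bands, patch, patch) tiles."""
    b, h, w = stack.shape
    tiles = stack.unfold(1, patch, patch).unfold(2, patch, patch)   # (b, H//p, W//p, p, p)
    return tiles.permute(1, 2, 0, 3, 4).reshape(-1, b, patch, patch)

class SmallCropCNN(nn.Module):
    """A deliberately small 2D-CNN, in the spirit of the reduced-scale architecture."""
    def __init__(self, bands=NUM_BANDS, classes=NUM_CROPS):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(bands, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(64, classes)

    def forward(self, x):
        return self.head(self.features(x).flatten(1))

# Toy usage: one stacked scene -> patches -> per-patch crop scores.
scene = torch.rand(NUM_BANDS, 128, 128)
patches = extract_patches(scene)       # (64, bands, 16, 16)
logits = SmallCropCNN()(patches)       # (64, 10)
pred = logits.argmax(dim=1)            # per-patch crop label
```

The post-processing step described in the abstract would then smooth these per-patch labels to compensate for the lower-spatial-resolution inputs.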


2011 ◽  
Vol 22 (8) ◽  
pp. 1334-1339 ◽  
Author(s):  
T. Windeatt ◽  
Cemre Zor

1971 ◽  
Vol C-20 (12) ◽  
pp. 1521-1527 ◽  
Author(s):  
K. Fukunaga ◽  
D.L. Kessell

Author(s):  
Zbigniew Omiotek

The purpose of the study was to construct an efficient classifier that, together with a reduced set of discriminant features, could be used as part of a computer system for the automatic identification and classification of ultrasound images of the thyroid gland, aimed at detecting cases affected by Hashimoto's thyroiditis. A total of 10 supervised learning techniques and a majority vote for the combined classifier were used. Two models were proposed as a result of the classifier construction. The first is based on the K-nearest neighbours method (for K = 7). It uses three discriminant features and achieves a sensitivity of 88.1%, a specificity of 66.7%, and a classification error of 21.8%. The second model is a combined classifier constructed from three component classifiers, based on the K-nearest neighbours method (for K = 7), linear discriminant analysis, and a boosting algorithm. The combined classifier uses 48 discriminant features and achieves a classification sensitivity of 88.1%, a specificity of 69.4%, and a classification error of 20.5%. The combined classifier thus improves classification quality compared to the single model. The models, built as part of an automatic computer system, may support physicians, especially in first-contact hospitals, in diagnosing cases that are difficult to recognise from ultrasound images. The high sensitivity of the constructed classification models indicates high detection accuracy for sick cases, which is beneficial to patients from a medical point of view.
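The combined classifier can be approximated by a hard majority vote over the three component models. The scikit-learn sketch below mirrors that setup (K-nearest neighbours with K = 7, linear discriminant analysis, and a boosting algorithm), but the feature data and labels are placeholders, since the discriminant features extracted from the ultrasound images are not reproduced here.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import AdaBoostClassifier, VotingClassifier

# Placeholder for the 48 discriminant features per ultrasound image.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 48))
y = rng.integers(0, 2, size=200)  # 1 = Hashimoto's thyroiditis, 0 = healthy (illustrative labels)

combined = VotingClassifier(
    estimators=[
        ("knn", KNeighborsClassifier(n_neighbors=7)),   # K = 7, as in the abstract
        ("lda", LinearDiscriminantAnalysis()),
        ("boost", AdaBoostClassifier(random_state=0)),
    ],
    voting="hard",  # simple majority vote over the three component classifiers
)
combined.fit(X, y)
print("training classification error:", 1.0 - combined.score(X, y))
```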


2006 ◽  
Vol 32 (3) ◽  
pp. 417-438 ◽  
Author(s):  
Diane Litman ◽  
Julia Hirschberg ◽  
Marc Swerts

This article focuses on the analysis and prediction of corrections, defined as turns in which a user tries to correct a prior error made by a spoken dialogue system. We describe our procedure for labeling various correction types and statistical analyses of their features in a corpus collected from a train information spoken dialogue system. We then present results of machine-learning experiments designed to identify user corrections of speech recognition errors. We investigate the predictive power of features automatically computable from the prosody of the turn, the speech recognition process, experimental conditions, and the dialogue history. Our best-performing features reduce classification error from baselines of 25.70–28.99% to 15.72%.
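To make the "reduce classification error from a baseline" framing concrete, the sketch below compares a majority-class baseline with a simple learner over turn-level features; the features and data are invented placeholders standing in for prosodic, speech-recognition, and dialogue-history features, not the corpus used in the article.

```python
import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

# Placeholder turn-level features, e.g. prosody (f0, energy), ASR confidence,
# and dialogue-history indicators. Real values would come from the corpus.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 6))
y = (X[:, 2] + 0.5 * rng.normal(size=1000) > 0.8).astype(int)  # 1 = correction turn

baseline = DummyClassifier(strategy="most_frequent")   # always predict the majority class
model = DecisionTreeClassifier(max_depth=4, random_state=0)

base_err = 1.0 - cross_val_score(baseline, X, y, cv=5).mean()
model_err = 1.0 - cross_val_score(model, X, y, cv=5).mean()
print(f"baseline error: {base_err:.3f}, model error: {model_err:.3f}")
```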


2021 ◽  
Vol 17 (1) ◽  
pp. 53-67
Author(s):  
Rajneesh Rani ◽  
Harpreet Singh

In this busy world, biometric authentication methods serve as a fast means of authentication. But with growing dependence on these systems, attackers have tried to exploit them through various attacks; thus, there is a strong need to protect authentication systems. Many software and hardware methods have been proposed in the past to make existing authentication systems more robust. Liveness detection/presentation attack detection is one such method that provides protection against malicious agents by detecting fake samples of biometric traits. This paper addresses fingerprint liveness detection/presentation attack detection using transfer learning, for which the authors use a pre-trained NASNetMobile model. The experiments are performed on the publicly available liveness datasets LivDet 2011 and LivDet 2013, and obtain good results compared to state-of-the-art techniques in terms of ACE (average classification error).
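A minimal transfer-learning sketch with a pre-trained NASNetMobile backbone in Keras is shown below, together with the usual ACE definition (the mean of the live and fake misclassification rates); the input size, classification head, and training details are assumptions for illustration, not the paper's exact configuration.

```python
import tensorflow as tf

# Frozen NASNetMobile backbone with a small binary head (live vs. fake fingerprint).
base = tf.keras.applications.NASNetMobile(
    weights="imagenet", include_top=False, input_shape=(224, 224, 3)
)
base.trainable = False  # transfer learning: keep the ImageNet features fixed at first

model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

def average_classification_error(ferr_live, ferr_fake):
    """ACE: mean of the live-rejected and fake-accepted error rates (in %)."""
    return (ferr_live + ferr_fake) / 2.0

# e.g. average_classification_error(2.1, 3.5) -> 2.8
```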

