Feature Extraction and Pattern Classification Based on Bayesian Decision Boundaries

2016 ◽  
Author(s):  
Lee Luan Ling ◽  
Hugo Mauro Cavalcanti
2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Wenmin Li ◽  
Sanqi Sun ◽  
Shuo Zhang ◽  
Hua Zhang ◽  
Yijie Shi

Aim. The purpose of this study is how to better detect attack traffic in imbalance datasets. The deep learning technology has played an important role in detecting malicious network traffic in recent years. However, it suffers serious imbalance distribution of data if the traffic model skews towards the modeling in the benign direction, because only a small portion of traffic is malicious, while most network traffic is benign. That is the reason why the authors wrote this manuscript. Methods. We propose a cost-sensitive approach to improve the HTTP traffic detection performance with imbalanced data and also present a character-level abstract feature extraction approach that can provide features with clear decision boundaries in addition. Finally, we design a spark-based HTTP traffic detection system based on these two approaches. Results. The methods proposed in this paper work well in imbalanced datasets. Compared to other methods, the experiment results indicate that our system has F1-score in a high precision. Conclusion. For imbalanced HTTP traffic detection, we confirmed that the method of feature extraction and the cost function is very effective. In the future, we may focus on how to use the cost function to further improve detection performance.


2019 ◽  
Vol 9 (15) ◽  
pp. 3130 ◽  
Author(s):  
Navarro ◽  
Perez

Many applications in image analysis require the accurate classification of complex patterns including both color and texture, e.g., in content image retrieval, biometrics, and the inspection of fabrics, wood, steel, ceramics, and fruits, among others. A new method for pattern classification using both color and texture information is proposed in this paper. The proposed method includes the following steps: division of each image into global and local samples, texture and color feature extraction from samples using a Haralick statistics and binary quaternion-moment-preserving method, a classification stage using support vector machine, and a final stage of post-processing employing a bagging ensemble. One of the main contributions of this method is the image partition, allowing image representation into global and local features. This partition captures most of the information present in the image for colored texture classification allowing improved results. The proposed method was tested on four databases extensively used in color–texture classification: the Brodatz, VisTex, Outex, and KTH-TIPS2b databases, yielding correct classification rates of 97.63%, 97.13%, 90.78%, and 92.90%, respectively. The use of the post-processing stage improved those results to 99.88%, 100%, 98.97%, and 95.75%, respectively. We compared our results to the best previously published results on the same databases finding significant improvements in all cases.


Author(s):  
David Zhang ◽  
Fengxi Song ◽  
Yong Xu ◽  
Zhizhen Liang

As mentioned in Chapter II, there are two kinds of LDA approaches: classification- oriented LDA and feature extraction-oriented LDA. In most chapters of this session of the book, we focus our attention on the feature extraction aspect of LDA for SSS problems. On the other hand,, with this chapter we present our studies on the pattern classification aspect of LDA for SSS problems. In this chapter, we present three novel classification-oriented linear discriminant criteria. The first one is large margin linear projection (LMLP) which makes full use of the characteristic of the SSS problems. The second one is the minimum norm minimum squared-error criterion which is a modification of the minimum squared-error discriminant criterion. The third one is the maximum scatter difference which is a modification of the Fisher discriminant criterion.


2012 ◽  
Vol 11 (04) ◽  
pp. 1250028 ◽  
Author(s):  
ANGKOON PHINYOMARK ◽  
PORNCHAI PHUKPATTARANONT ◽  
CHUSAK LIMSAKUL

Based on recent advances in modern multifunction myoelectric control devices, a combination of effective feature extraction and classification methods is required to enhance the high classification performance, especially in accuracy viewpoint. However, for realizing practical applications of myoelectric control, the effect of long-term usage or reusability is one of the challenging issues that should be more carefully considered, whereas only a few works have investigated this effect in recent. In this study, the behavior of the state-of-the-art multiple feature extraction methods was investigated with the fluctuating electromyography (EMG) signals recorded during four different days with a large number of trials and subjects. To this end, seven multiple feature sets were compared consisting features based on time domain and time-scale representation. Two major points were emphasized: (1) the optimal robust feature set for continuous (both transient and steady-state signals) EMG pattern classification and (2) the effect of fluctuating EMG signals with feature extraction methods for long-term usage. From the classification results, time domain feature sets yielded better performance than time-scale feature sets. The classification accuracies of the time-domain-feature sets had always achieved above 80% by using linear discriminant analysis (LDA) as a classifier and uncorrelated LDA (ULDA) as a dimensionality reduction, whereas the classification accuracies of the time-scale-feature sets were lower than 70% for the fluctuating EMG signals. The effect of dimensionality reduction for the classification of fluctuating EMG signals was also discussed.


Sign in / Sign up

Export Citation Format

Share Document