Research on WebShell Detection Method Based on Regularized Neighborhood Component Analysis (RNCA)

The variant, encryption, and confusion of WebShell results in problems in the detection method based on feature selection, such as poor detection effect and weak generalization ability. In order to solve this problem, a method of WebShell detection based on regularized neighborhood component analysis (RNCA) is proposed. The RNCA algorithm can effectively reduce the dimension of data while ensuring the accuracy of classification. In this paper, it is innovatively applied to a WebShell detection neighborhood, taking opcode behavior sequence features as the main research object, constructing vocabulary by using opcode sequence features with variable length, and effectively reducing the dimension of WebShell features from the perspective of feature selection. The opcode sequence selected by the algorithm is symmetrical with the source code file, which has great reference value for WebShell classification. On the issue of the single feature, this paper uses the fusion of behavior sequence features and text static features to construct a feature combination with stronger representation ability, which effectively improves the recognition rate of WebShell to a certain extent.

Download Full-text

Feature Selection for Classification using Principal Component Analysis and Information Gain

Expert Systems with Applications ◽

10.1016/j.eswa.2021.114765 ◽

2021 ◽

Vol 174 ◽

pp. 114765 ◽

Cited By ~ 1

Author(s):

Erick Odhiambo Omuya ◽

George Onyango Okeyo ◽

Michael Waema Kimwele

Keyword(s):

Principal Component Analysis ◽

Feature Selection ◽

Information Gain ◽

Principal Component ◽

Component Analysis ◽

Selection For

Download Full-text

Modified Principal Component Analysis (MPCA) for feature selection of hyperspectral imagery

IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477) ◽

10.1109/igarss.2003.1295268 ◽

2004 ◽

Author(s):

Cheng Wang ◽

M. Menenti ◽

Zhao-Liang Li

Keyword(s):

Principal Component Analysis ◽

Feature Selection ◽

Hyperspectral Imagery ◽

Principal Component ◽

Component Analysis ◽

Selection Of

Download Full-text

Process Monitoring and Fault Detection Method Based On Independent Component Analysis

2006 6th World Congress on Intelligent Control and Automation ◽

10.1109/wcica.2006.1714143 ◽

2006 ◽

Author(s):

Yinghua Wu ◽

Yinghua Yang ◽

Shukai Qin ◽

Xiaobo Chen

Keyword(s):

Fault Detection ◽

Independent Component Analysis ◽

Process Monitoring ◽

Detection Method ◽

Component Analysis ◽

Independent Component

Download Full-text

Multiclass classification of leukemia cancer data using Fuzzy Support Vector Machine (FSVM) with feature selection using Principal Component Analysis (PCA)

Journal of Physics Conference Series ◽

10.1088/1742-6596/1725/1/012012 ◽

2021 ◽

Vol 1725 ◽

pp. 012012

Author(s):

I R Fauzi ◽

Z Rustam ◽

A Wibowo

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Feature Selection ◽

Principal Component ◽

Component Analysis ◽

Multiclass Classification ◽

Support Vector ◽

Fuzzy Support Vector Machine ◽

Cancer Data

Download Full-text

An efficient conserved region detection method for multiple protein sequences using principal component analysis and wavelet transform

Pattern Recognition Letters ◽

10.1016/j.patrec.2007.11.013 ◽

2008 ◽

Vol 29 (5) ◽

pp. 616-628 ◽

Cited By ~ 2

Author(s):

Chieh-Yuan Tsai ◽

Chuang-Cheng Chiu

Keyword(s):

Principal Component Analysis ◽

Wavelet Transform ◽

Detection Method ◽

Protein Sequences ◽

Principal Component ◽

Component Analysis ◽

Region Detection ◽

Multiple Protein ◽

Conserved Region

Download Full-text

Study on Portrait Tracking Technology of Deep Feature Learning in Monitoring Image Acquisition

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2021.65.4.040502 ◽

2021 ◽

Author(s):

Senlin Yang ◽

Xin Chong

Keyword(s):

Feature Selection ◽

Image Acquisition ◽

Recognition Rate ◽

Particle Swarm ◽

Feature Subset ◽

Particle Structure ◽

Recognition Time ◽

Traditional Methods ◽

Deep Feature ◽

Global Optimal

In a network information society, there are many occasions where people’s behaviors need to be tracked, photographed, and recognized. Biometric recognition technologies are considered to be one of the most effective solutions. Traditional methods mostly use graph structure and deformed component model to design two-dimensional (2D) human body component detectors, and apply graph models to establish the connectivity of each component. The recognition design process is simple, but the accuracy of recognition and tracking effect applied in monitoring image acquisition is not high. The improved particle swarm optimization algorithm is used to determine the particle structure, and the binary bit string is used to represent the particle structure. The support vector machine (SVM) parameters of discrete particles are optimized, and the synchronous optimization design of feature selection and SVM parameters is carried out to realize the synchronous optimization of portrait feature subset and SVM parameters in discrete space. Through in-depth research, the extracted feature subsets can be effectively optimized and selected, and the parameters of SVM model can be optimized synchronously. The discrete particle structure is associated with the SVM parameters to achieve feature selection and SVM parameter synchronization and optimization. It is not only superior to traditional algorithms in terms of recognition rate, but also reduces the feature dimension and shortens the recognition time. The deep feature recognition built on the learning machine is not easy to diverge and can effectively adjust the particle speed to the global optimal, which is more effective than the particle swarm algorithm to search for the global optimal solution, and has better robustness. In the experiments, the research content of the article is compared with the traditional methods to test and analysis. The results show that the method optimizes the selection of feature subset and eliminates a large number of invalid features. The method not only reduces space complexity and shortens recognition time, but also improves recognition rate. The dimension of feature subset dimensions are superior to those extracted by other algorithms.

Download Full-text

Predicting the Severity of Bug Reports Based on Feature Selection

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194018500158 ◽

2018 ◽

Vol 28 (04) ◽

pp. 537-558 ◽

Cited By ~ 4

Author(s):

Wenjie Liu ◽

Shanshan Wang ◽

Xin Chen ◽

He Jiang

Keyword(s):

Feature Selection ◽

Software Maintenance ◽

Feature Selection Method ◽

Selection Methods ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Bug Reports ◽

Single Feature ◽

Bug Report ◽

Severity Prediction

In software maintenance process, it is a fairly important activity to predict the severity of bug reports. However, manually identifying the severity of bug reports is a tedious and time-consuming task. So developing automatic judgment methods for predicting the severity of bug reports has become an urgent demand. In general, a bug report contains a lot of descriptive natural language texts, thus resulting in a high-dimensional feature set which poses serious challenges to traditionally automatic methods. Therefore, we attempt to use automatic feature selection methods to improve the performance of the severity prediction of bug reports. In this paper, we introduce a ranking-based strategy to improve existing feature selection algorithms and propose an ensemble feature selection algorithm by combining existing ones. In order to verify the performance of our method, we run experiments over the bug reports of Eclipse and Mozilla and conduct comparisons with eight commonly used feature selection methods. The experiment results show that the ranking-based strategy can effectively improve the performance of the severity prediction of bug reports by up to 54.76% on average in terms of [Formula: see text]-measure, and it also can significantly reduce the dimension of the feature set. Meanwhile, the ensemble feature selection method can get better results than a single feature selection algorithm.

Download Full-text