scholarly journals An Improved Feature Extraction Approach for Web Anomaly Detection Based on Semantic Structure

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Zishuai Cheng ◽  
Baojiang Cui ◽  
Tao Qi ◽  
Wenchuan Yang ◽  
Junsong Fu

Anomaly-based Web application firewalls (WAFs) are vital for providing early reactions to novel Web attacks. In recent years, various machine learning, deep learning, and transfer learning-based anomaly detection approaches have been developed to protect against Web attacks. Most of them directly treat the request URL as a general string that consists of letters and roughly use natural language processing (NLP) methods (i.e., Word2Vec and Doc2Vec) or domain knowledge to extract features. In this paper, we proposed an improved feature extraction approach which leveraged the advantage of the semantic structure of URLs. Semantic structure is an inherent interpretative property of the URL that identifies the function and vulnerability of each part in the URL. The evaluations on CSIC-2020 show that our feature extraction method has better performance than conventional feature extraction routine by more than average dramatic 5% improvement in accuracy, recall, and F1-score.

2021 ◽  
Author(s):  
Anastasia Malysheva ◽  
Alexey Tikhonov ◽  
Ivan P. Yamshchikov

Narrative generation and analysis are still on the fringe of modern natural language processing yet are crucial in a variety of applications. This paper proposes a feature extraction method for plot dynamics. We present a dataset that consists of the plot descriptions for thirteen thousand TV shows alongside meta-information on their genres and dynamic plots extracted from them. We validate the proposed tool for plot dynamics extraction and discuss possible applications of this method to the tasks of narrative analysis and generation.


2018 ◽  
Vol 4 (1) ◽  
pp. 3
Author(s):  
Maged A. Aldhaeebi ◽  
Thamer S. Almoneef ◽  
Omar M. Ramahi

In this work, we propose the use of an electrically small novel antenna as a probe combined with a classification algorithm for nearfield microwave breast tumor detection. The resonant probe ishighly sensitive to the changes in the electromagnetic properties of the breast tissues such that the presence of the tumor is estimatedby determining the changes in the magnitude and phase responseof the reflection coefficient of the sensor. The Principle Component placed at the middle of the probe as shown in Fig. 1. The mainAnalysis (PCA) feature extraction method is applied to emphasize the difference in the probe responses for both the healthy and thetumourous cases . We show that when a numerical realistic breast with and without tumor cells is placed in the near field of the probe, the probe is capable of distinguishing between healthy and tumorous tissue. In addition, the probe is able to identify tumors with various sizes placed in single locations.


Author(s):  
Dule Shu ◽  
Constantino Lagoa ◽  
Timothy Cleary

This paper presents a new method for road anomaly detection. The existence of road anomalies is determined by the behaviors of vehicles. A special polynomial named Sum-of-Squares (SOS) polynomial is used as a metric to evaluate the normality of vehicle behaviors. The method can process multiple types of sensor measurements. A feature extraction method is used to obtain concise representations of the sensor measurements. These representations, called feature points, are used to calculate the value of the SOS polynomial. Simulation results have been shown to demonstrate that the proposed method can effectively detect different types of road anomalies.


2020 ◽  
Vol 126 ◽  
pp. 106348 ◽  
Author(s):  
Zhen Liu ◽  
Nathalie Japkowicz ◽  
Ruoyu Wang ◽  
Yongming Cai ◽  
Deyu Tang ◽  
...  

2020 ◽  
pp. 1-12
Author(s):  
Yu Guangxu

The 21st century is an era of rapid development of the Internet. Internet technology is widely used in various fields. With the rapid development of network, the importance of network information security is also highlighted. The traditional network information security technology has been difficult to ensure the security of network information. Therefore, we mainly study the application of machine learning feature extraction method in situational awareness system. A feature selection method based on machine learning is proposed to extract situational features.By analyzing whether the background of network information is safe or not, and according to the current research situation at home and abroad and the trend of Internet development, this paper tries out the practical application of machine learning feature extraction method in a certain perception system. Based on the above points, a selection method based on machine learning is proposed to extract situational features. The accuracy and timeliness of situational awareness system detection are seriously affected by the high dimension, noise and redundant features of massive network traffic data.Therefore, it is of great value to further study network intrusion detection technology on the basis of machine learning.


Sensors ◽  
2020 ◽  
Vol 20 (3) ◽  
pp. 772 ◽  
Author(s):  
Tao Liu ◽  
Yanbing Chen ◽  
Dongqi Li ◽  
Tao Yang ◽  
Jianhua Cao

As a kind of intelligent instrument, an electronic tongue (E-tongue) realizes liquid analysis with an electrode-sensor array and certain machine learning methods. The large amplitude pulse voltammetry (LAPV) is a regular E-tongue type that prefers to collect a large amount of response data at a high sampling frequency within a short time. Therefore, a fast and effective feature extraction method is necessary for machine learning methods. Considering the fact that massive common-mode components (high correlated signals) in the sensor-array responses would depress the recognition performance of the machine learning models, we have proposed an alternative feature extraction method named feature specificity enhancement (FSE) for feature specificity enhancement and feature dimension reduction. The proposed FSE method highlights the specificity signals by eliminating the common mode signals on paired sensor responses. Meanwhile, the radial basis function is utilized to project the original features into a nonlinear space. Furthermore, we selected the kernel extreme learning machine (KELM) as the recognition part owing to its fast speed and excellent flexibility. Two datasets from LAPV E-tongues have been adopted for the evaluation of the machine-learning models. One is collected by a designed E-tongue for beverage identification and the other one is a public benchmark. For performance comparison, we introduced several machine-learning models consisting of different combinations of feature extraction and recognition methods. The experimental results show that the proposed FSE coupled with KELM demonstrates obvious superiority to other models in accuracy, time consumption and memory cost. Additionally, low parameter sensitivity of the proposed model has been demonstrated as well.


Sign in / Sign up

Export Citation Format

Share Document