On the Perfect Privacy: a Statistical Analysis of Network Traffic Approach

Research into network anomaly detection has become crucial as a result of a significant increase in the number of computer attacks. Many approaches in network anomaly detection have been reported in the literature, but data or solutions typically are not freely available. Recently, a labeled network traffic flow dataset, Kyoto2006+, has been created and is publicly available. Most existing approaches using Kyoto2006+ for network anomaly detection apply various clustering techniques. This paper leverages existing well known statistical analysis and spectral analysis techniques for network anomaly detection. The first popular approach is a statistical analysis technique called Principal Component Analysis (PCA). PCA describes data in a new dimension to unlock otherwise hidden characteristics. The other well known spectral analysis technique is Haar Wavelet filtering analysis. It measures the amount and magnitude of abrupt changes in data. Both approaches have strengths and limitations. In response, this paper proposes a Hybrid PCA–Haar Wavelet Analysis. The hybrid approach first applies PCA to describe the data and then Haar Wavelet filtering for analysis. Based on prototyping and measurement, an investigation of the Hybrid PCA–Haar Wavelet Analysis technique is performed using the Kyoto2006+ dataset. The authors consider a number of parameters and present experimental results to demonstrate the effectiveness of the hybrid approach as compared to the two algorithms individually.

Download Full-text

Statistical Analysis of Network Traffic for Adaptive Faults Detection

IEEE Transactions on Neural Networks ◽

10.1109/tnn.2005.853414 ◽

2005 ◽

Vol 16 (5) ◽

pp. 1053-1063 ◽

Cited By ~ 54

Author(s):

H. Hajji

Keyword(s):

Statistical Analysis ◽

Network Traffic

Download Full-text

THE STATISTICAL ANALYSIS OF A NETWORK TRAFFIC FOR THE INTRUSION DETECTION AND PREVENTION SYSTEMS

Telecommunications and Radio Engineering ◽

10.1615/telecomradeng.v74.i1.60 ◽

2015 ◽

Vol 74 (1) ◽

pp. 61-78 ◽

Cited By ~ 32

Author(s):

A.A. Kuznetsov ◽

A.A. Smirnov ◽

D.A. Danilenko ◽

A. Berezovsky

Keyword(s):

Statistical Analysis ◽

Intrusion Detection ◽

Network Traffic ◽

Intrusion Detection And Prevention

Download Full-text

Multivariate statistical analysis of network traffic for intrusion detection

14th International Workshop on Database and Expert Systems Applications, 2003. Proceedings. ◽

10.1109/dexa.2003.1232068 ◽

2004 ◽

Cited By ~ 5

Author(s):

A. Kanaoka ◽

E. Okamoto

Keyword(s):

Statistical Analysis ◽

Intrusion Detection ◽

Network Traffic ◽

Multivariate Statistical Analysis ◽

Multivariate Statistical

Download Full-text

Statistical analysis of local features in network traffic processes

IEEE/SP 13th Workshop on Statistical Signal Processing, 2005 ◽

10.1109/ssp.2005.1628749 ◽

2005 ◽

Cited By ~ 4

Author(s):

G. Giorgi ◽

C. Narduzzi ◽

P.A. Pegoraro

Keyword(s):

Statistical Analysis ◽

Network Traffic ◽

Local Features ◽

Traffic Processes

Download Full-text

Statistical Analysis of Network Traffic over LAN through IAMT

Communications in Computer and Information Science - Computer Networks and Intelligent Computing ◽

10.1007/978-3-642-22786-8_41 ◽

2011 ◽

pp. 329-335

Author(s):

Shashank Srivastava ◽

Abhinav Goyal ◽

Rajeev Kumar ◽

Nandi G.C.

Keyword(s):

Statistical Analysis ◽

Network Traffic

Download Full-text

Statistical Analysis Software

Statistical Techniques for Network Security ◽

10.4018/978-1-59904-708-9.ch002 ◽

2011 ◽

pp. 35-59

Author(s):

Yu Wang

Keyword(s):

Statistical Analysis ◽

Network Security ◽

Data Management ◽

Network Traffic ◽

Statistical Analyses ◽

Online Data ◽

Statistical Software ◽

Hierarchical Generalized Linear Model ◽

Software Packages ◽

Computing Environments

Statistical software and their corresponding computing environments are essential factors that will lead to the achievement of efficient and better research. If we think of computing and classifying algorithms as the roadmap to arrive at our final destination, a statistical package is the vehicle that is used to reach this point. Figure 2.1 shows a basic roadmap of the roles that statistical software packages play in network security. One of the advantages of using a statistical package in network security is that it provides a fairly easy and quick way to explore data, test algorithms and evaluate models. Unfortunately, not every package is suitable for analyzing network traffic. Given the natural characteristics of the network traffic data (i.e., large size and the ability to change dynamically), several fundamental attributes are necessary for specific packages. First, the package should have good data management capacities, which include the capacity to read large data and output/save resulting files in different formats, the capability to merge and link processed data with other data sources, and the ability to create, modify and delete variables within data. Second, it should be able to process large amounts of data efficiently because statistical analyses in network security are usually based on dynamic online data, which requires the application to conduct analyses timely; this differs from areas such as healthcare, life science, and epidemiology where statistical analyses are conducted based on static offline data. Third, it should support modern modeling procedures and methods, such as the Bayesian methods, hidden Markov model, hierarchical generalized linear model, etc. Finally, because usability is an important factor, we want the software to be both accessible and user-friendly. These attributes are particularly important during the development phase because they allow us to quickly test hypotheses and examine modeling strategies effectively. Since many commercial and research-oriented software packages may not have all of the aforementioned attributes, we may need to implement multiple packages, such as packages for data management, for fitting a particular model, and for displaying results graphically. In the end, we may more likely use a general-purpose programming language, such as C, C++ or Java to create a customized application which we can later integrate with the other components of the intrusion detection or prevention system. The results obtained from the statistical software can be used as a gold-standard benchmark to validate the results from the customized application. customized application. In this chapter, we will introduce several popular commercial and research-oriented packages that have been widely used in the statistical analysis, data mining, bioinformatics, and computer science communities. Specifically, we will discuss SAS1, Stata2 and R in Sections The SAS System, STATA and R, respectively; and briefly describe S-Plus3, WinBUGS, and MATLAB4 in Section Other Packages. The goal of this chapter is to provide a quick overview of these analytic software packages with some simple examples to help readers become familiar with the computing environments and statistical computing languages that will be referred to in the examples presented in the rest of these chapters. We have included some fundamental materials in the Reference section for further reading for those readers who would like to acquire more detailed information on using these software packages.

Download Full-text

USE OF THE STATISTICAL ANALYSIS METHODS TO DETECT VOIP NETWORK TRAFFIC ANOMALIES

Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies ◽

10.20998/2079-0023.2020.01.01 ◽

2020 ◽

Vol 0 (1 (3)) ◽

pp. 3-8

Author(s):

Leonid Serhiyovych Smidovych

Keyword(s):

Statistical Analysis ◽

Network Traffic ◽

Analysis Methods ◽

Statistical Analysis Methods

Download Full-text

On network traffic statistical analysis

Lietuvos matematikos rinkinys ◽

10.15388/lmr.2008.18116 ◽

2020 ◽

Vol 48 ◽

Author(s):

Liudas Kaklauskas ◽

Leonidas Sakalauslas

Keyword(s):

Time Series ◽

Statistical Analysis ◽

Statistical Method ◽

Network Traffic ◽

Network Node ◽

Hurst Index ◽

Self Similarity ◽

Present Measurement ◽

Traffic Characteristics ◽

Measurement Results

The present article deals with statistical university network traffic, by applying the methods of self-similarity and chaos analysis. The object of measurement is Šiauliai University LitNet network node maintaining institutions of education of the northern Lithuania region. Time series of network traffic characteristics are formed by registering amount of information packets in a node at different regimes of network traffic and different values of discretion of registered information are present. Measurement results are processed by calculating Hurst index and estimating reliability of analysis results by applying the statistical method. Investigation of the network traffic allowed us drawing conclusions that time series bear features of self-similarity when aggregated time series bear features of slowly decreasing dependence.

Download Full-text