A Real-Time Network Traffic Classifier for Online Applications Using Machine Learning

The increasing ubiquity of network traffic and the new online applications’ deployment has increased traffic analysis complexity. Traditionally, network administrators rely on recognizing well-known static ports for classifying the traffic flowing their networks. However, modern network traffic uses dynamic ports and is transported over secure application-layer protocols (e.g., HTTPS, SSL, and SSH). This makes it a challenging task for network administrators to identify online applications using traditional port-based approaches. One way for classifying the modern network traffic is to use machine learning (ML) to distinguish between the different traffic attributes such as packet count and size, packet inter-arrival time, packet send–receive ratio, etc. This paper presents the design and implementation of NetScrapper, a flow-based network traffic classifier for online applications. NetScrapper uses three ML models, namely K-Nearest Neighbors (KNN), Random Forest (RF), and Artificial Neural Network (ANN), for classifying the most popular 53 online applications, including Amazon, Youtube, Google, Twitter, and many others. We collected a network traffic dataset containing 3,577,296 packet flows with different 87 features for training, validating, and testing the ML models. A web-based user-friendly interface is developed to enable users to either upload a snapshot of their network traffic to NetScrapper or sniff the network traffic directly from the network interface card in real time. Additionally, we created a middleware pipeline for interfacing the three models with the Flask GUI. Finally, we evaluated NetScrapper using various performance metrics such as classification accuracy and prediction time. Most notably, we found that our ANN model achieves an overall classification accuracy of 99.86% in recognizing the online applications in our dataset.

Download Full-text

Real Time Efficient Accident Predictor System using Machine Learning Techniques (kNN, RF, LR, DT)

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.d6910.1210220 ◽

2020 ◽

Vol 10 (2) ◽

pp. 108-111

Keyword(s):

Machine Learning ◽

Random Forest ◽

Real Time ◽

Classification Accuracy ◽

Nearest Neighbors ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Classification Methods ◽

K Nearest Neighbors ◽

Learning Techniques

Real time crash predictor system is determining frequency of crashes and also severity of crashes. Nowadays machine learning based methods are used to predict the total number of crashes. In this project, prediction accuracy of machine learning algorithms like Decision tree (DT), K-nearest neighbors (KNN), Random forest (RF), Logistic Regression (LR) are evaluated. Performance analysis of these classification methods are evaluated in terms of accuracy. Dataset included for this project is obtained from 49 states of US and 27 states of India which contains 2.25 million US accident crash records and 1.16 million crash records respectively. Results prove that classification accuracy obtained from Random Forest (RF) is96% compared to other classification methods.

Download Full-text

Modeling Fused Filament Fabrication using Artificial Neural Networks

Production Engineering ◽

10.1007/s11740-021-01020-y ◽

2021 ◽

Author(s):

Paul Oehlmann ◽

Paul Osswald ◽

Juan Camilo Blanco ◽

Martin Friedrich ◽

Dominik Rietzel ◽

...

Keyword(s):

Real Time ◽

Large Scale ◽

Cost Effective ◽

Print Quality ◽

Ann Model ◽

Production Environment ◽

Fused Filament Fabrication ◽

Artificial Neural ◽

Using Data ◽

Artificial Neural Network Ann

AbstractWith industries pushing towards digitalized production, adaption to expectations and increasing requirements for modern applications, has brought additive manufacturing (AM) to the forefront of Industry 4.0. In fact, AM is a main accelerator for digital production with its possibilities in structural design, such as topology optimization, production flexibility, customization, product development, to name a few. Fused Filament Fabrication (FFF) is a widespread and practical tool for rapid prototyping that also demonstrates the importance of AM technologies through its accessibility to the general public by creating cost effective desktop solutions. An increasing integration of systems in an intelligent production environment also enables the generation of large-scale data to be used for process monitoring and process control. Deep learning as a form of artificial intelligence (AI) and more specifically, a method of machine learning (ML) is ideal for handling big data. This study uses a trained artificial neural network (ANN) model as a digital shadow to predict the force within the nozzle of an FFF printer using filament speed and nozzle temperatures as input data. After the ANN model was tested using data from a theoretical model it was implemented to predict the behavior using real-time printer data. For this purpose, an FFF printer was equipped with sensors that collect real time printer data during the printing process. The ANN model reflected the kinematics of melting and flow predicted by models currently available for various speeds of printing. The model allows for a deeper understanding of the influencing process parameters which ultimately results in the determination of the optimum combination of process speed and print quality.

Download Full-text

Application of Real-Time Field Data to Optimize Drilling Hydraulics Using Neural Network Approach

Journal of Energy Resources Technology ◽

10.1115/1.4030847 ◽

2015 ◽

Vol 137 (6) ◽

Cited By ~ 31

Author(s):

Yanfang Wang ◽

Saeed Salehi

Keyword(s):

Neural Network ◽

Real Time ◽

Field Data ◽

Ann Model ◽

Time Data ◽

Neural Network Approach ◽

Lost Circulation ◽

Drilling Performance ◽

Artificial Neural Network Ann ◽

Pump Pressure

Real-time drilling optimization improves drilling performance by providing early warnings in operation Mud hydraulics is a key aspect of drilling that can be optimized by access to real-time data. Different from the investigated references, reliable prediction of pump pressure provides an early warning of circulation problems, washout, lost circulation, underground blowout, and kicks. This will help the driller to make necessary corrections to mitigate potential problems. In this study, an artificial neural network (ANN) model to predict hydraulics was implemented through the fitting tool of matlab. Following the determination of the optimum model, the sensitivity analysis of input parameters on the created model was investigated by using forward regression method. Next, the remaining data from the selected well samples was applied for simulation to verify the quality of the developed model. The novelty is this paper is validation of computer models with actual field data collected from an operator in LA. The simulation result was promising as compared with collected field data. This model can accurately predict pump pressure versus depth in analogous formations. The result of this work shows the potential of the approach developed in this work based on NN models for predicting real-time drilling hydraulics.

Download Full-text

New recommendation to predict export value using big data and machine learning technique

Statistical Journal of the IAOS ◽

10.3233/sji-210855 ◽

2021 ◽

pp. 1-14

Author(s):

Rani Nooraeni ◽

Jimmy Nickelson ◽

Eko Rahmadian ◽

Nugroho Puspito Yudho

Keyword(s):

Machine Learning ◽

Genetic Algorithm ◽

Big Data ◽

Ann Model ◽

Machine Learning Technique ◽

Current Period ◽

Learning Technique ◽

Monthly Period ◽

Conventional Machine ◽

Artificial Neural Network Ann

Official statistics on monthly export values have a publicity lag between the current period and the published publication. None of the previous researchers estimated the value of exports for the monthly period. This circumstance is due to limitations in obtaining supporting data that can predict the criteria for the current export value of goods. AIS data is one type of big data that can provide solutions in producing the latest indicators to forecast export values. Statistical Methods and Conventional Machine Learning are implemented as forecasting methods. Seasonal ARIMA and Artificial Neural Network (ANN) methods are both used in research to forecast the value of Indonesia’s exports. However, ANN has a weakness that requires high computational costs to obtain optimal parameters. Genetic Algorithm (GA) is effective in increasing ANN accuracy. Based on these backgrounds, this paper aims to develop and select an AIS indicator to predict the monthly export value in Indonesia and optimize ANN performance by combining the ANN algorithm with the genetic algorithm (GA-ANN). The research successfully established five indicators that can be used as predictors in the forecasting model. According to the model evaluation results, the genetic algorithm has succeeded in improving the performance of the ANN model as indicated by the resulting RMSE GA-ANN value, which is smaller than the RMSE of the ANN model.

Download Full-text

A Real-Time Smart Agent for Network Traffic Profiling and Intrusion Detection Based on Combined Machine Learning Algorithms

10.1007/978-981-16-3637-0_21 ◽

2021 ◽

pp. 301-309

Author(s):

Nadiya El Kamel ◽

Mohamed Eddabbah ◽

Youssef Lmoumen ◽

Raja Touahni

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Real Time ◽

Network Traffic ◽

Learning Algorithms ◽

Machine Learning Algorithms

Download Full-text

Coal identification using neural networks with real-time coalbed methane drilling data

The APPEA Journal ◽

10.1071/aj18091 ◽

2019 ◽

Vol 59 (1) ◽

pp. 319 ◽

Cited By ~ 5

Author(s):

Ruizhi Zhong ◽

Raymond Johnson Jr ◽

Zhongwei Chen ◽

Nathaniel Chand

Keyword(s):

Real Time ◽

Coalbed Methane ◽

Gamma Ray ◽

Ann Model ◽

Drilling Data ◽

Weight On Bit ◽

Net To Gross ◽

Rotary Speed ◽

Artificial Neural Network Ann ◽

Log Interpretation

Currently, coal is identified using coring data or log interpretation. Coring is the most dependable methodology, but it is costly and its characterisation is expensive and time consuming. Logging methods are convenient, reliable, and reproducible, but can be subject to statistical and shouldering effects and often have operational difficulties in deviated or horizontal wells. Drilling data, which are routinely available, can potentially be used to identify coal sections in a machine learning environment when conventional wireline logs are not available. To achieve this, a four-layer artificial neural network (ANN) was used to identify coals in a well at Walloon Sub-Group, Surat Basin. The ANN model used drilling data and some logging-while-drilling (LWD) data. The inputs for the lithological model from high-frequency drilling data include weight on bit, rotary speed, torque, and rate of penetration. Inputs from LWD data include gamma ray and hole diameter. The criterion for coal identification is based on bulk density cutoff. The simulation results show that the ANN can deliver an overall accuracy of 96%. Due to the low net-to-gross ratio of coals within the Walloon sequence, a lower but reasonable F1 score of 0.78 is achievable for the coal sections. The proposed model can potentially be implemented in real-time to identify coal intervals without additional logs and aid validation of minimal log data.

Download Full-text

Glass Classification based on Machine Learning Algorithms

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h6819.0991120 ◽

2020 ◽

Vol 9 (11) ◽

pp. 139-142

Keyword(s):

Machine Learning ◽

Random Forest ◽

Amorphous Solid ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbors ◽

X Ray ◽

Svm Algorithm ◽

Artificial Neural Network Ann ◽

Logistic Regression Algorithm

Glass Industry is considered one of the most important industries in the world. The Glass is used everywhere, from water bottles to X-Ray and Gamma Rays protection. This is a non-crystalline, amorphous solid that is most often transparent. There are lots of uses of glass, and during investigation in a crime scene, the investigators need to know what is type of glass in a scene. To find out the type of glass, we will use the online dataset and machine learning to solve the above problem. We will be using ML algorithms such as Artificial Neural Network (ANN), K-nearest neighbors (KNN) algorithm, Support Vector Machine (SVM) algorithm, Random Forest algorithm, and Logistic Regression algorithm. By comparing all the algorithm Random Forest did the best in glass classification.

Download Full-text

Material Classification via Machine Learning Techniques: Construction Projects Progress Monitoring

10.5772/intechopen.96354 ◽

2021 ◽

Author(s):

Wesam Salah Alaloul ◽

Abdul Hannan Qureshi

Keyword(s):

Machine Learning ◽

Construction Projects ◽

Industrial Revolution ◽

Progress Monitoring ◽

Concrete Block ◽

Machine Learning Techniques ◽

Support Vector ◽

Ann Model ◽

Material Classification ◽

Artificial Neural Network Ann

Nowadays, the construction industry is on a fast track to adopting digital processes under the Industrial Revolution (IR) 4.0. The desire to automate maximum construction processes with less human interference has led the industry and research community to inclined towards artificial intelligence. This chapter has been themed on automated construction monitoring practices by adopting material classification via machine learning (ML) techniques. The study has been conducted by following the structure review approach to gain an understanding of the applications of ML techniques for construction progress assessment. Data were collected from the Web of Science (WoS) and Scopus databases, concluding 14 relevant studies. The literature review depicted the support vector machine (SVM) and artificial neural network (ANN) techniques as more effective than other ML techniques for material classification. The last section of this chapter includes a python-based ANN model for material classification. This ANN model has been tested for construction items (brick, wood, concrete block, and asphalt) for training and prediction. Moreover, the predictive ANN model results have been shared for the readers, along with the resources and open-source web links.

Download Full-text

Computer Vision and Machine Learning Analysis of Commercial Rice Grains: A Potential Digital Approach for Consumer Perception Studies

Sensors ◽

10.3390/s21196354 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6354

Author(s):

Aimi Aznan ◽

Claudia Gonzalez Viejo ◽

Alexis Pang ◽

Sigfredo Fuentes

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Imaging System ◽

Cost Effective ◽

Consumer Perception ◽

Rice Grain ◽

Ann Model ◽

Practical Applications ◽

Rice Samples ◽

Artificial Neural Network Ann

Rice quality assessment is essential for meeting high-quality standards and consumer demands. However, challenges remain in developing cost-effective and rapid techniques to assess commercial rice grain quality traits. This paper presents the application of computer vision (CV) and machine learning (ML) to classify commercial rice samples based on dimensionless morphometric parameters and color parameters extracted using CV algorithms from digital images obtained from a smartphone camera. The artificial neural network (ANN) model was developed using nine morpho-colorimetric parameters to classify rice samples into 15 commercial rice types. Furthermore, the ANN models were deployed and evaluated on a different imaging system to simulate their practical applications under different conditions. Results showed that the best classification accuracy was obtained using the Bayesian Regularization (BR) algorithm of the ANN with ten hidden neurons at 91.6% (MSE = <0.01) and 88.5% (MSE = 0.01) for the training and testing stages, respectively, with an overall accuracy of 90.7% (Model 2). Deployment also showed high accuracy (93.9%) in the classification of the rice samples. The adoption by the industry of rapid, reliable, and accurate methods, such as those presented here, may allow the incorporation of different morpho-colorimetric traits in rice with consumer perception studies.

Download Full-text

Detection of Abnormalities in Real-Time Computer Network Traffic Empowered by Machine Learning

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/821032021 ◽

2021 ◽

Vol 10 (3) ◽

pp. 2072-2079

Keyword(s):

Machine Learning ◽

Anomaly Detection ◽

Real Time ◽

Network Traffic ◽

Computer Network ◽

Data Gathering ◽

Network Systems ◽

Data Set ◽

Synthetic Network ◽

Scientific Papers

This research discloses how to utilize machine learning methods for anomaly detection in real-time on a computer network. While utilizing machine learning for this task is definitely not a novel idea, little literature is about the matter of doing it in real-time. Most machine learning research in PC network anomaly detection depends on the KDD '99 data set and means to demonstrate the proficiency of the algorithms introduced. The emphasis on this data set has caused a lack of scientific papers disclosing how to assemble network data, remove features, and train algorithms for use inreal-time networks. It has been contended that utilizing the KDD '99 dataset for anomaly detection is not appropriate for real-time network systems. This research proposes how the data gathering procedure will be possible utilizing a dummy network and generating synthetic network traffic by analyzing the importance of One-class SVM. As the efficiency of k-means clustering and LTSM neural networks is lower than one-class SVM, that is why this research uses the results of existing research of LSTM and k-means clustering for the comparison with reported outcomes of a similar algorithm on the KDD '99 dataset. Precisely, without engaging KDD ’99 data set by using synthetic network traffic, this research achieved the higher accuracy as compared to the previous researches.

Download Full-text