scholarly journals A Real-Time Network Traffic Classifier for Online Applications Using Machine Learning

Algorithms ◽  
2021 ◽  
Vol 14 (8) ◽  
pp. 250
Author(s):  
Ahmed Abdelmoamen Ahmed ◽  
Gbenga Agunsoye

The increasing ubiquity of network traffic and the new online applications’ deployment has increased traffic analysis complexity. Traditionally, network administrators rely on recognizing well-known static ports for classifying the traffic flowing their networks. However, modern network traffic uses dynamic ports and is transported over secure application-layer protocols (e.g., HTTPS, SSL, and SSH). This makes it a challenging task for network administrators to identify online applications using traditional port-based approaches. One way for classifying the modern network traffic is to use machine learning (ML) to distinguish between the different traffic attributes such as packet count and size, packet inter-arrival time, packet send–receive ratio, etc. This paper presents the design and implementation of NetScrapper, a flow-based network traffic classifier for online applications. NetScrapper uses three ML models, namely K-Nearest Neighbors (KNN), Random Forest (RF), and Artificial Neural Network (ANN), for classifying the most popular 53 online applications, including Amazon, Youtube, Google, Twitter, and many others. We collected a network traffic dataset containing 3,577,296 packet flows with different 87 features for training, validating, and testing the ML models. A web-based user-friendly interface is developed to enable users to either upload a snapshot of their network traffic to NetScrapper or sniff the network traffic directly from the network interface card in real time. Additionally, we created a middleware pipeline for interfacing the three models with the Flask GUI. Finally, we evaluated NetScrapper using various performance metrics such as classification accuracy and prediction time. Most notably, we found that our ANN model achieves an overall classification accuracy of 99.86% in recognizing the online applications in our dataset.

Real time crash predictor system is determining frequency of crashes and also severity of crashes. Nowadays machine learning based methods are used to predict the total number of crashes. In this project, prediction accuracy of machine learning algorithms like Decision tree (DT), K-nearest neighbors (KNN), Random forest (RF), Logistic Regression (LR) are evaluated. Performance analysis of these classification methods are evaluated in terms of accuracy. Dataset included for this project is obtained from 49 states of US and 27 states of India which contains 2.25 million US accident crash records and 1.16 million crash records respectively. Results prove that classification accuracy obtained from Random Forest (RF) is96% compared to other classification methods.


Author(s):  
Paul Oehlmann ◽  
Paul Osswald ◽  
Juan Camilo Blanco ◽  
Martin Friedrich ◽  
Dominik Rietzel ◽  
...  

AbstractWith industries pushing towards digitalized production, adaption to expectations and increasing requirements for modern applications, has brought additive manufacturing (AM) to the forefront of Industry 4.0. In fact, AM is a main accelerator for digital production with its possibilities in structural design, such as topology optimization, production flexibility, customization, product development, to name a few. Fused Filament Fabrication (FFF) is a widespread and practical tool for rapid prototyping that also demonstrates the importance of AM technologies through its accessibility to the general public by creating cost effective desktop solutions. An increasing integration of systems in an intelligent production environment also enables the generation of large-scale data to be used for process monitoring and process control. Deep learning as a form of artificial intelligence (AI) and more specifically, a method of machine learning (ML) is ideal for handling big data. This study uses a trained artificial neural network (ANN) model as a digital shadow to predict the force within the nozzle of an FFF printer using filament speed and nozzle temperatures as input data. After the ANN model was tested using data from a theoretical model it was implemented to predict the behavior using real-time printer data. For this purpose, an FFF printer was equipped with sensors that collect real time printer data during the printing process. The ANN model reflected the kinematics of melting and flow predicted by models currently available for various speeds of printing. The model allows for a deeper understanding of the influencing process parameters which ultimately results in the determination of the optimum combination of process speed and print quality.


2015 ◽  
Vol 137 (6) ◽  
Author(s):  
Yanfang Wang ◽  
Saeed Salehi

Real-time drilling optimization improves drilling performance by providing early warnings in operation Mud hydraulics is a key aspect of drilling that can be optimized by access to real-time data. Different from the investigated references, reliable prediction of pump pressure provides an early warning of circulation problems, washout, lost circulation, underground blowout, and kicks. This will help the driller to make necessary corrections to mitigate potential problems. In this study, an artificial neural network (ANN) model to predict hydraulics was implemented through the fitting tool of matlab. Following the determination of the optimum model, the sensitivity analysis of input parameters on the created model was investigated by using forward regression method. Next, the remaining data from the selected well samples was applied for simulation to verify the quality of the developed model. The novelty is this paper is validation of computer models with actual field data collected from an operator in LA. The simulation result was promising as compared with collected field data. This model can accurately predict pump pressure versus depth in analogous formations. The result of this work shows the potential of the approach developed in this work based on NN models for predicting real-time drilling hydraulics.


2021 ◽  
pp. 1-14
Author(s):  
Rani Nooraeni ◽  
Jimmy Nickelson ◽  
Eko Rahmadian ◽  
Nugroho Puspito Yudho

Official statistics on monthly export values have a publicity lag between the current period and the published publication. None of the previous researchers estimated the value of exports for the monthly period. This circumstance is due to limitations in obtaining supporting data that can predict the criteria for the current export value of goods. AIS data is one type of big data that can provide solutions in producing the latest indicators to forecast export values. Statistical Methods and Conventional Machine Learning are implemented as forecasting methods. Seasonal ARIMA and Artificial Neural Network (ANN) methods are both used in research to forecast the value of Indonesia’s exports. However, ANN has a weakness that requires high computational costs to obtain optimal parameters. Genetic Algorithm (GA) is effective in increasing ANN accuracy. Based on these backgrounds, this paper aims to develop and select an AIS indicator to predict the monthly export value in Indonesia and optimize ANN performance by combining the ANN algorithm with the genetic algorithm (GA-ANN). The research successfully established five indicators that can be used as predictors in the forecasting model. According to the model evaluation results, the genetic algorithm has succeeded in improving the performance of the ANN model as indicated by the resulting RMSE GA-ANN value, which is smaller than the RMSE of the ANN model.


2019 ◽  
Vol 59 (1) ◽  
pp. 319 ◽  
Author(s):  
Ruizhi Zhong ◽  
Raymond Johnson Jr ◽  
Zhongwei Chen ◽  
Nathaniel Chand

Currently, coal is identified using coring data or log interpretation. Coring is the most dependable methodology, but it is costly and its characterisation is expensive and time consuming. Logging methods are convenient, reliable, and reproducible, but can be subject to statistical and shouldering effects and often have operational difficulties in deviated or horizontal wells. Drilling data, which are routinely available, can potentially be used to identify coal sections in a machine learning environment when conventional wireline logs are not available. To achieve this, a four-layer artificial neural network (ANN) was used to identify coals in a well at Walloon Sub-Group, Surat Basin. The ANN model used drilling data and some logging-while-drilling (LWD) data. The inputs for the lithological model from high-frequency drilling data include weight on bit, rotary speed, torque, and rate of penetration. Inputs from LWD data include gamma ray and hole diameter. The criterion for coal identification is based on bulk density cutoff. The simulation results show that the ANN can deliver an overall accuracy of 96%. Due to the low net-to-gross ratio of coals within the Walloon sequence, a lower but reasonable F1 score of 0.78 is achievable for the coal sections. The proposed model can potentially be implemented in real-time to identify coal intervals without additional logs and aid validation of minimal log data.


Glass Industry is considered one of the most important industries in the world. The Glass is used everywhere, from water bottles to X-Ray and Gamma Rays protection. This is a non-crystalline, amorphous solid that is most often transparent. There are lots of uses of glass, and during investigation in a crime scene, the investigators need to know what is type of glass in a scene. To find out the type of glass, we will use the online dataset and machine learning to solve the above problem. We will be using ML algorithms such as Artificial Neural Network (ANN), K-nearest neighbors (KNN) algorithm, Support Vector Machine (SVM) algorithm, Random Forest algorithm, and Logistic Regression algorithm. By comparing all the algorithm Random Forest did the best in glass classification.


2021 ◽  
Author(s):  
Wesam Salah Alaloul ◽  
Abdul Hannan Qureshi

Nowadays, the construction industry is on a fast track to adopting digital processes under the Industrial Revolution (IR) 4.0. The desire to automate maximum construction processes with less human interference has led the industry and research community to inclined towards artificial intelligence. This chapter has been themed on automated construction monitoring practices by adopting material classification via machine learning (ML) techniques. The study has been conducted by following the structure review approach to gain an understanding of the applications of ML techniques for construction progress assessment. Data were collected from the Web of Science (WoS) and Scopus databases, concluding 14 relevant studies. The literature review depicted the support vector machine (SVM) and artificial neural network (ANN) techniques as more effective than other ML techniques for material classification. The last section of this chapter includes a python-based ANN model for material classification. This ANN model has been tested for construction items (brick, wood, concrete block, and asphalt) for training and prediction. Moreover, the predictive ANN model results have been shared for the readers, along with the resources and open-source web links.


Sensors ◽  
2021 ◽  
Vol 21 (19) ◽  
pp. 6354
Author(s):  
Aimi Aznan ◽  
Claudia Gonzalez Viejo ◽  
Alexis Pang ◽  
Sigfredo Fuentes

Rice quality assessment is essential for meeting high-quality standards and consumer demands. However, challenges remain in developing cost-effective and rapid techniques to assess commercial rice grain quality traits. This paper presents the application of computer vision (CV) and machine learning (ML) to classify commercial rice samples based on dimensionless morphometric parameters and color parameters extracted using CV algorithms from digital images obtained from a smartphone camera. The artificial neural network (ANN) model was developed using nine morpho-colorimetric parameters to classify rice samples into 15 commercial rice types. Furthermore, the ANN models were deployed and evaluated on a different imaging system to simulate their practical applications under different conditions. Results showed that the best classification accuracy was obtained using the Bayesian Regularization (BR) algorithm of the ANN with ten hidden neurons at 91.6% (MSE = <0.01) and 88.5% (MSE = 0.01) for the training and testing stages, respectively, with an overall accuracy of 90.7% (Model 2). Deployment also showed high accuracy (93.9%) in the classification of the rice samples. The adoption by the industry of rapid, reliable, and accurate methods, such as those presented here, may allow the incorporation of different morpho-colorimetric traits in rice with consumer perception studies.


This research discloses how to utilize machine learning methods for anomaly detection in real-time on a computer network. While utilizing machine learning for this task is definitely not a novel idea, little literature is about the matter of doing it in real-time. Most machine learning research in PC network anomaly detection depends on the KDD '99 data set and means to demonstrate the proficiency of the algorithms introduced. The emphasis on this data set has caused a lack of scientific papers disclosing how to assemble network data, remove features, and train algorithms for use inreal-time networks. It has been contended that utilizing the KDD '99 dataset for anomaly detection is not appropriate for real-time network systems. This research proposes how the data gathering procedure will be possible utilizing a dummy network and generating synthetic network traffic by analyzing the importance of One-class SVM. As the efficiency of k-means clustering and LTSM neural networks is lower than one-class SVM, that is why this research uses the results of existing research of LSTM and k-means clustering for the comparison with reported outcomes of a similar algorithm on the KDD '99 dataset. Precisely, without engaging KDD ’99 data set by using synthetic network traffic, this research achieved the higher accuracy as compared to the previous researches.


Sign in / Sign up

Export Citation Format

Share Document