Image Classifiers for Network Intrusions

Network Attack ◽

Network Intrusions ◽

Feature Importance ◽

Individual Attack

This research recasts the network attack dataset from UNSW-NB15 as an intrusion detection problem in image space. Using one-hot-encodings, the resulting grayscale thumbnails provide a quarter-million examples for deep learning algorithms. Applying the MobileNetV2’s convolutional neural network architecture, the work demonstrates a 97% accuracy in distinguishing normal and attack traffic. Further class refinements to 9 individual attack families (exploits, worms, shellcodes) show an overall 56% accuracy. Using feature importance rank, a random forest solution on subsets show the most important sourcedestination factors and the least important ones as mainly obscure protocols. The dataset is available on Kaggle.

Deep Learning Classification Methods Applied to Tabular Cybersecurity Benchmarks

International Journal of Network Security & Its Applications ◽

10.5121/ijnsa.2021.13301 ◽

2021 ◽

Vol 13 (03) ◽

pp. 1-13

Author(s):

David A. Noever ◽

Samantha E. Miller Noever

Keyword(s):

Deep Learning ◽

Network Architecture ◽

Classification Problem ◽

Research Community ◽

Classification Methods ◽

Detection Problem ◽

Network Attack ◽

Feature Importance ◽

Individual Attack

This research recasts the network attack dataset from UNSW-NB15 as an intrusion detection problem in image space. Using one-hot-encodings, the resulting grayscale thumbnails provide a quarter-million examples for deep learning algorithms. Applying the MobileNetV2’s convolutional neural network architecture, the work demonstrates a 97% accuracy in distinguishing normal and attack traffic. Further class refinements to 9 individual attack families (exploits, worms, shellcodes) show an overall 54% accuracy. Using feature importance rank, a random forest solution on subsets shows the most important source-destination factors and the least important ones as mainly obscure protocols. It further extends the image classification problem to other cybersecurity benchmarks such as malware signatures extracted from binary headers, with an 80% overall accuracy to detect computer viruses as portable executable files (headers only). Both novel image datasets are available to the research community on Kaggle.

optNet-50: An Optimized Residual Neural Network Architecture of Deep Learning for Driver's Distraction

2020 IEEE 23rd International Multitopic Conference (INMIC) ◽

10.1109/inmic50486.2020.9318087 ◽

2020 ◽

Author(s):

Tahir Abbas ◽

Syed Farooq Ali ◽

Aadil Zia Khan ◽

Irfan Kareem

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Modelling Peri-Perceptual Brain Processes in a Deep Learning Spiking Neural Network Architecture

Scientific Reports ◽

10.1038/s41598-018-27169-8 ◽

2018 ◽

Vol 8 (1) ◽

Cited By ~ 16

Author(s):

Zohreh Gholami Doborjeh ◽

Nikola Kasabov ◽

Maryam Gholami Doborjeh ◽

Alexander Sumich

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Spiking Neural Network ◽

Designing deep neural networks for continual learning in an open world

10.21248/gups.62487 ◽

2021 ◽

Author(s):

◽

Martin Mundt

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Network Architecture ◽

Neural Network Training ◽

Neural Architecture ◽

Network Training ◽

Classification Tasks ◽

Continual Learning

Deep learning with neural networks seems to have largely replaced traditional design of computer vision systems. Automated methods to learn a plethora of parameters are now used in favor of previously practiced selection of explicit mathematical operators for a specific task. The entailed promise is that practitioners no longer need to take care of every individual step, but rather focus on gathering big amounts of data for neural network training. As a consequence, both a shift in mindset towards a focus on big datasets, as well as a wave of conceivable applications based exclusively on deep learning can be observed. This PhD dissertation aims to uncover some of the only implicitly mentioned or overlooked deep learning aspects, highlight unmentioned assumptions, and finally introduce methods to address respective immediate weaknesses. In the author’s humble opinion, these prevalent shortcomings can be tied to the fact that the involved steps in the machine learning workflow are frequently decoupled. Success is predominantly measured based on accuracy measures designed for evaluation with static benchmark test sets. Individual machine learning workflow components are assessed in isolation with respect to available data, choice of neural network architecture, and a particular learning algorithm, rather than viewing the machine learning system as a whole in context of a particular application. Correspondingly, in this dissertation, three key challenges have been identified: 1. Choice and flexibility of a neural network architecture. 2. Identification and rejection of unseen unknown data to avoid false predictions. 3. Continual learning without forgetting of already learned information. These latter challenges have already been crucial topics in older literature, alas, seem to require a renaissance in modern deep learning literature. Initially, it may appear that they pose independent research questions, however, the thesis posits that the aspects are intertwined and require a joint perspective in machine learning based systems. In summary, the essential question is thus how to pick a suitable neural network architecture for a specific task, how to recognize which data inputs belong to this context, which ones originate from potential other tasks, and ultimately how to continuously include such identified novel data in neural network training over time without overwriting existing knowledge. Thus, the central emphasis of this dissertation is to build on top of existing deep learning strengths, yet also acknowledge mentioned weaknesses, in an effort to establish a deeper understanding of interdependencies and synergies towards the development of unified solution mechanisms. For this purpose, the main portion of the thesis is in cumulative form. The respective publications can be grouped according to the three challenges outlined above. Correspondingly, chapter 1 is focused on choice and extendability of neural network architectures, analyzed in context of popular image classification tasks. An algorithm to automatically determine neural network layer width is introduced and is first contrasted with static architectures found in the literature. The importance of neural architecture design is then further showcased on a real-world application of defect detection in concrete bridges. Chapter 2 is comprised of the complementary ensuing questions of how to identify unknown concepts and subsequently incorporate them into continual learning. A joint central mechanism to distinguish unseen concepts from what is known in classification tasks, while enabling consecutive training without forgetting or revisiting older classes, is proposed. Once more, the role of the chosen neural network architecture is quantitatively reassessed. Finally, chapter 3 culminates in an overarching view, where developed parts are connected. Here, an extensive survey further serves the purpose to embed the gained insights in the broader literature landscape and emphasizes the importance of a common frame of thought. The ultimately presented approach thus reflects the overall thesis’ contribution to advance neural network based machine learning towards a unified solution that ties together choice of neural architecture with the ability to learn continually and the capability to automatically separate known from unknown data.

Application of deep learning methods to predict ionosphere parameters in real time

E3S Web of Conferences ◽

10.1051/e3sconf/202019602007 ◽

2020 ◽

Vol 196 ◽

pp. 02007

Author(s):

Vladimir Mochalov ◽

Anastasia Mochalova

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Short Term Memory ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

In this paper, the previously obtained results on recognition of ionograms using deep learning are expanded to predict the parameters of the ionosphere. After the ionospheric parameters have been identified on the ionogram using deep learning in real time, we can predict the parameters for some time ahead on the basis of the new data obtained Examples of predicting the ionosphere parameters using an artificial recurrent neural network architecture long short-term memory are given. The place of the block for predicting the parameters of the ionosphere in the system for analyzing ionospheric data using deep learning methods is shown.

Deep learning based cone beam CT reconstruction framework using a cascaded neural network architecture (Conference Presentation)

Medical Imaging 2018: Physics of Medical Imaging ◽

10.1117/12.2293916 ◽

2018 ◽

Cited By ~ 1

Author(s):

Yinsheng Li ◽

Guang-Hong Chen

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Cone Beam Ct ◽

Cone Beam ◽

Ct Reconstruction ◽

2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT) ◽

Seizure and Non-Seizure EEG Signals Detection Using 1-D Convolutional Neural Network Architecture of Deep Learning Algorithm

10.1109/icasert.2019.8934564 ◽

2019 ◽

Author(s):

Tanima Tasmin Chowdhury ◽

Afrin Hossain ◽

Shaikh Anowarul Fattah ◽

Celia Shahnaz

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Network Architecture ◽

Learning Algorithm ◽

Eeg Signals ◽

Deep Learning Algorithm

Spectral deep learning for prediction and prospective validation of functional groups

Chemical Science ◽

10.1039/c9sc06240h ◽

2020 ◽

Vol 11 (18) ◽

pp. 4618-4630 ◽

Cited By ~ 5

Author(s):

Jonathan A. Fine ◽

Anand A. Rajasekar ◽

Krupal P. Jethava ◽

Gaurav Chopra

Keyword(s):

Neural Network ◽

Deep Learning ◽

Functional Groups ◽

Network Architecture ◽

Mass Spectra ◽

Deep Neural Network ◽

Complex Mixtures ◽

A new multi-label deep neural network architecture is used to combine Infrared and mass spectra, trained on single compounds to predict functional groups, and experimentally validated on complex mixtures.

Intelligent and Fuzzy Techniques for Emerging Conditions and Digital Transformation - Lecture Notes in Networks and Systems ◽

Deep Learning Neural Network Architecture for Human Facial Expression Recognition

10.1007/978-3-030-85577-2_34 ◽

2021 ◽

pp. 290-297

Author(s):

Sangaraju V. Kumar ◽

Jaeho Choi

Keyword(s):

Neural Network ◽

Deep Learning ◽

Facial Expression ◽

Network Architecture ◽

Facial Expression Recognition ◽

Expression Recognition ◽

Deep Learning Neural Network ◽

Human Facial Expression

On the Relationship between Generalization and Robustness to Adversarial Examples

Symmetry ◽

10.3390/sym13050817 ◽

2021 ◽

Vol 13 (5) ◽

pp. 817

Author(s):

Anibal Pedraza ◽

Oscar Deniz ◽

Gloria Bueno

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Trade Off ◽

Adversarial Examples ◽

Simultaneous Loss ◽

The Relationship

One of the most intriguing phenomenons related to deep learning is the so-called adversarial examples. These samples are visually equivalent to normal inputs, undetectable for humans, yet they cause the networks to output wrong results. The phenomenon can be framed as a symmetry/asymmetry problem, whereby inputs to a neural network with a similar/symmetric appearance to regular images, produce an opposite/asymmetric output. Some researchers are focused on developing methods for generating adversarial examples, while others propose defense methods. In parallel, there is a growing interest in characterizing the phenomenon, which is also the focus of this paper. From some well known datasets of common images, like CIFAR-10 and STL-10, a neural network architecture is first trained in a normal regime, where training and validation performances increase, reaching generalization. Additionally, the same architectures and datasets are trained in an overfitting regime, where there is a growing disparity in training and validation performances. The behaviour of these two regimes against adversarial examples is then compared. From the results, we observe greater robustness to adversarial examples in the overfitting regime. We explain this simultaneous loss of generalization and gain in robustness to adversarial examples as another manifestation of the well-known fitting-generalization trade-off.