Spoofing Attack Detection Using Machine Learning in Cross-Technology Communication

Cross-technology communication (CTC) technique can realize direct communication among heterogeneous wireless devices (e.g., WiFi, ZigBee, and Bluetooth in the 2.4 G ISM band) without gateway equipment for forwarding, which makes heterogeneous wireless communication more convenient and greatly reduces communication costs. However, compared with the traditional homogeneous network model, CTC technique also makes it easier to implement spoofing attacks in heterogeneous networks. WiFi devices with long communication distances and sufficient energy supply can directly launch spoofing attacks against ZigBee devices, which brings severe security concerns for heterogeneous wireless communications. In this paper, we focus on the CTC spoofing attack, especially spoofing attacks from WiFi to ZigBee and propose a machine learning-based method to detect spoofing attacks for heterogeneous wireless networks by using physical-layer information. First, we model the received signal strength (RSS) data of legitimate ZigBee devices to construct a one-class support vector machine (OSVM) classifier for detecting CTC spoofing attacks depending on the obtained training samples. Then, we simulated CTC spoofing attacks in a live testbed and evaluated the performance of our detection method. Results show that our approach is highly effective in spoofing detection. Even if the distance between the legitimate ZigBee device and WiFi attacker is near each other (i.e., less than 2 m) and does not require a large number of samples, the detection rate and precision of our method are both over 90%. Finally, we employ the OSVM classifier to obtain samples of spoofing attacks and then explore using SVM to further improve the performance of the classifier.

Download Full-text

Support Vector Machines in Big Data Classification: A Systematic Literature Review

10.21203/rs.3.rs-663359/v1 ◽

2021 ◽

Author(s):

Mohammad Hassan Almaspoor ◽

Ali Safaei ◽

Afshin Salajegheh ◽

Behrouz Minaei-Bidgoli

Keyword(s):

Machine Learning ◽

Big Data ◽

Large Scale ◽

Support Vector ◽

Research Areas ◽

Large Scale Data ◽

Training Samples ◽

Big Data Classification ◽

Scale Data

Abstract Classification is one of the most important and widely used issues in machine learning, the purpose of which is to create a rule for grouping data to sets of pre-existing categories is based on a set of training sets. Employed successfully in many scientific and engineering areas, the Support Vector Machine (SVM) is among the most promising methods of classification in machine learning. With the advent of big data, many of the machine learning methods have been challenged by big data characteristics. The standard SVM has been proposed for batch learning in which all data are available at the same time. The SVM has a high time complexity, i.e., increasing the number of training samples will intensify the need for computational resources and memory. Hence, many attempts have been made at SVM compatibility with online learning conditions and use of large-scale data. This paper focuses on the analysis, identification, and classification of existing methods for SVM compatibility with online conditions and large-scale data. These methods might be employed to classify big data and propose research areas for future studies. Considering its advantages, the SVM can be among the first options for compatibility with big data and classification of big data. For this purpose, appropriate techniques should be developed for data preprocessing in order to covert data into an appropriate form for learning. The existing frameworks should also be employed for parallel and distributed processes so that SVMs can be made scalable and properly online to be able to handle big data.

Download Full-text

A DDoS Attack Detection Method Based on Machine Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/1237/3/032040 ◽

2019 ◽

Vol 1237 ◽

pp. 032040

Author(s):

Jiangtao Pei ◽

Yunli Chen ◽

Wei Ji

Keyword(s):

Machine Learning ◽

Detection Method ◽

Attack Detection ◽

Ddos Attack ◽

Ddos Attack Detection

Download Full-text

An Object Detection Method Based on Independent Local Features

Journal of Robotics and Mechatronics ◽

10.20965/jrm.2006.p0744 ◽

2006 ◽

Vol 18 (6) ◽

pp. 744-750

Author(s):

Ryouta Nakano ◽

◽

Kazuhiro Hotta ◽

Haruhisa Takahashi

Keyword(s):

Object Detection ◽

Detection Method ◽

Principal Component ◽

Component Analysis ◽

Local Features ◽

Local Feature ◽

Superior Performance ◽

Support Vector ◽

Car Detection ◽

Training Samples

This paper presents an object detection method using independent local feature extractor. Since objects are composed of a combination of characteristic parts, a good object detector could be developed if local parts specialized for a detection target are derived automatically from training samples. To do this, we use Independent Component Analysis (ICA) which decomposes a signal into independent elementary signals. We then used the basis vectors derived by ICA as independent local feature extractors specialized for a detection target. These feature extractors are applied to a candidate area, and their outputs are used in classification. However, the number of dimension of extracted independent local features is very high. To reduce the extracted independent local features efficiently, we use Higher-order Local AutoCorrelation (HLAC) features to extract the information that relates neighboring features. This may be more effective for object detection than simple independent local features. To classify detection targets and non-targets, we use a Support Vector Machine (SVM). The proposed method is applied to a car detection problem. Superior performance is obtained by comparison with Principal Component Analysis (PCA).

Download Full-text

Attack Detection in IoT using Machine Learning

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.4202 ◽

2021 ◽

Vol 11 (3) ◽

pp. 7273-7278

Author(s):

M. Anwer ◽

M. U. Farooq ◽

S. M. Khan ◽

W. Waseemullah

Keyword(s):

Machine Learning ◽

Network Traffic ◽

Learning Algorithm ◽

Attack Detection ◽

Supervised Machine Learning ◽

Detection Methods ◽

Support Vector ◽

Web Traffic ◽

Local Networks ◽

Iot Devices

Many researchers have examined the risks imposed by the Internet of Things (IoT) devices on big companies and smart towns. Due to the high adoption of IoT, their character, inherent mobility, and standardization limitations, smart mechanisms, capable of automatically detecting suspicious movement on IoT devices connected to the local networks are needed. With the increase of IoT devices connected through internet, the capacity of web traffic increased. Due to this change, attack detection through common methods and old data processing techniques is now obsolete. Detection of attacks in IoT and detecting malicious traffic in the early stages is a very challenging problem due to the increase in the size of network traffic. In this paper, a framework is recommended for the detection of malicious network traffic. The framework uses three popular classification-based malicious network traffic detection methods, namely Support Vector Machine (SVM), Gradient Boosted Decision Trees (GBDT), and Random Forest (RF), with RF supervised machine learning algorithm achieving far better accuracy (85.34%). The dataset NSL KDD was used in the recommended framework and the performances in terms of training, predicting time, specificity, and accuracy were compared.

Download Full-text

Extraction of Sea Ice Cover by Sentinel-1 SAR Based on SVM with Unsupervised Generation of Training Data

10.20944/preprints202005.0336.v1 ◽

2020 ◽

Author(s):

Xiaoming Li ◽

Yan Sun ◽

Qiang Zhang

Keyword(s):

Machine Learning ◽

Sea Ice ◽

Learning Algorithm ◽

Texture Features ◽

Open Water ◽

Ice Cover ◽

Training Data ◽

Support Vector ◽

Training Samples

In this paper, we focus on developing a novel method to extract sea ice cover (i.e., discrimination/classification of sea ice and open water) using Sentinel-1 (S1) cross-polarization (vertical-horizontal, VH or horizontal-vertical, HV) data in extra wide (EW) swath mode based on the machine learning algorithm support vector machine (SVM). The classification basis includes the S1 radar backscatter coefficients and texture features that are calculated from S1 data using the gray level co-occurrence matrix (GLCM). Different from previous methods where appropriate samples are manually selected to train the SVM to classify sea ice and open water, we proposed a method of unsupervised generation of the training samples based on two GLCM texture features, i.e. entropy and homogeneity, that have contrasting characteristics on sea ice and open water. We eliminate the most uncertainty of selecting training samples in machine learning and achieve automatic classification of sea ice and open water by using S1 EW data. The comparison shows good agreement between the SAR-derived sea ice cover using the proposed method and a visual inspection, of which the accuracy reaches approximately 90% - 95% based on a few cases. Besides this, compared with the analyzed sea ice cover data Ice Mapping System (IMS) based on 728 S1 EW images, the accuracy of extracted sea ice cover by using S1 data is more than 80%.

Download Full-text

FEASIBILITY OF MACHINE LEARNING METHODS FOR SEPARATING WOOD AND LEAF POINTS FROM TERRESTRIAL LASER SCANNING DATA

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w4-157-2017 ◽

2017 ◽

Vol IV-2/W4 ◽

pp. 157-164 ◽

Cited By ~ 6

Author(s):

D. Wang ◽

M. Hollaus ◽

N. Pfeifer

Keyword(s):

Machine Learning ◽

Laser Scanning ◽

Learning Algorithms ◽

Terrestrial Laser Scanning ◽

Gaussian Mixture ◽

Machine Learning Algorithms ◽

Support Vector ◽

Area Index ◽

Training Samples

Classification of wood and leaf components of trees is an essential prerequisite for deriving vital tree attributes, such as wood mass, leaf area index (LAI) and woody-to-total area. Laser scanning emerges to be a promising solution for such a request. Intensity based approaches are widely proposed, as different components of a tree can feature discriminatory optical properties at the operating wavelengths of a sensor system. For geometry based methods, machine learning algorithms are often used to separate wood and leaf points, by providing proper training samples. However, it remains unclear how the chosen machine learning classifier and features used would influence classification results. To this purpose, we compare four popular machine learning classifiers, namely Support Vector Machine (SVM), Na¨ıve Bayes (NB), Random Forest (RF), and Gaussian Mixture Model (GMM), for separating wood and leaf points from terrestrial laser scanning (TLS) data. Two trees, an <i>Erytrophleum fordii</i> and a <i>Betula pendula</i> (silver birch) are used to test the impacts from classifier, feature set, and training samples. Our results showed that RF is the best model in terms of accuracy, and local density related features are important. Experimental results confirmed the feasibility of machine learning algorithms for the reliable classification of wood and leaf points. It is also noted that our studies are based on isolated trees. Further tests should be performed on more tree species and data from more complex environments.

Download Full-text

Simulation-assisted machine learning

Bioinformatics ◽

10.1093/bioinformatics/btz199 ◽

2019 ◽

Vol 35 (20) ◽

pp. 4072-4080 ◽

Cited By ~ 3

Author(s):

Timo M Deist ◽

Andrew Patti ◽

Zhaoqi Wang ◽

David Krane ◽

Taylor Sorenson ◽

...

Keyword(s):

Machine Learning ◽

Network Flow ◽

Similarity Measures ◽

Similarity Score ◽

Supplementary Information ◽

Support Vector ◽

Training Samples ◽

Wide Range ◽

Input Sample ◽

Network Flow Optimization

Abstract Motivation In a predictive modeling setting, if sufficient details of the system behavior are known, one can build and use a simulation for making predictions. When sufficient system details are not known, one typically turns to machine learning, which builds a black-box model of the system using a large dataset of input sample features and outputs. We consider a setting which is between these two extremes: some details of the system mechanics are known but not enough for creating simulations that can be used to make high quality predictions. In this context we propose using approximate simulations to build a kernel for use in kernelized machine learning methods, such as support vector machines. The results of multiple simulations (under various uncertainty scenarios) are used to compute similarity measures between every pair of samples: sample pairs are given a high similarity score if they behave similarly under a wide range of simulation parameters. These similarity values, rather than the original high dimensional feature data, are used to build the kernel. Results We demonstrate and explore the simulation-based kernel (SimKern) concept using four synthetic complex systems—three biologically inspired models and one network flow optimization model. We show that, when the number of training samples is small compared to the number of features, the SimKern approach dominates over no-prior-knowledge methods. This approach should be applicable in all disciplines where predictive models are sought and informative yet approximate simulations are available. Availability and implementation The Python SimKern software, the demonstration models (in MATLAB, R), and the datasets are available at https://github.com/davidcraft/SimKern. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Meta-analysis of Cellular Toxicity for Graphene via Data-Mining the Literature and Machine Learning

10.21203/rs.3.rs-248207/v1 ◽

2021 ◽

Author(s):

Ying Ma ◽

Jianli Wang ◽

Jingying Wu ◽

Chuxuan Tong ◽

Ting Zhang

Keyword(s):

Machine Learning ◽

Detection Method ◽

Meta Analysis ◽

Analysis Approach ◽

Support Vector ◽

Learning Tools ◽

Lasso Regression ◽

Data Set ◽

Cellular Toxicity ◽

Published Evidence

Abstract Background: Due to graphene is currently incorporated into various consumer product and numerous new applications, determining the relationships between physicochemical properties of graphene and their toxicity is a prominent concern for environmental and health risk analysis. Data from the literatures suggested that graphene exposure may resulted in cytotoxicity, however, the toxicity data of graphene is still insufficient to point out its side because of the complexity and heterogeneity of available data on potential risks of graphene. Methods and Results: Here, we developed a meta-analysis approach for assembling published evidence on cytotoxicity based on 792 related publications, 986 cell survival rate samples, 762 IC50 samples, and 100 LDH release samples. In this study, among corresponding attributes, we proved that the cytotoxicity of graphene assessed in the form of cell viability, IC50 and LDH can be primarily predicted from exposure dose and detection method, diameter and surface modification, detection method and organ source, respectively. Furthermore, this paper provides guidance regarding three optional data sets for above-mentioned three endpoints that are chiefly related to cellular toxicity for future studies and cross-validation studies based on machine learning tools including Random Forests (RFs), Support Vector Machine (SVM), LASSO regression, and Elastic Net were conducted for result verification. Conclusions: In summary, our study indicates that following rigorous methodological experimental and extract approaches accompanied with suitable machine learning tools, in parallel to continuous addition to reliable data set developed using our meta-analysis approach, will offer higher predictive power and accuracy, and also help to provide effective information on designing safe graphene.

Download Full-text

Machine Learning-Based Attack Detection Method in Hadoop

Algorithms and Architectures for Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-60248-2_13 ◽

2020 ◽

pp. 184-196

Author(s):

Ningwei Li ◽

Hang Gao ◽

Liang Liu ◽

Jianfei Peng

Keyword(s):

Machine Learning ◽

Detection Method ◽

Attack Detection

Download Full-text

Danger-Pose Detection System Using Commodity Wi-Fi for Bathroom Monitoring

Sensors ◽

10.3390/s19040884 ◽

2019 ◽

Vol 19 (4) ◽

pp. 884 ◽

Cited By ~ 9

Author(s):

Zizheng Zhang ◽

Shigemi Ishida ◽

Shigeaki Tagashira ◽

Akira Fukuda

Keyword(s):

Machine Learning ◽

Anomaly Detection ◽

Detection Method ◽

Detection System ◽

Wearable Sensors ◽

Low Frequency ◽

Training Data ◽

Support Vector ◽

Dynamic Features ◽

Pose Detection

A bathroom has higher probability of accidents than other rooms due to a slippery floor and temperature change. Because of high privacy and humidity, we face difficulties in monitoring inside a bathroom using traditional healthcare methods based on cameras and wearable sensors. In this paper, we present a danger-pose detection system using commodity Wi-Fi devices, which can be applied to bathroom monitoring, preserving privacy. A machine learning-based detection method usually requires data collected in target situations, which is difficult in detection-of-danger situations. We therefore employ a machine learning-based anomaly-detection method that requires a small amount of data in anomaly conditions, minimizing the required training data collected in dangerous conditions. We first derive the amplitude and phase shift from Wi-Fi channel state information (CSI) to extract low-frequency components that are related to human activities. We then separately extract static and dynamic features from the CSI changes in time. Finally, the static and dynamic features are fed into a one-class support vector machine (SVM), which is used as an anomaly-detection method, to classify whether a user is not in bathtub, bathing safely, or in dangerous conditions. We conducted experimental evaluations and demonstrated that our danger-pose detection system achieved a high detection performance in a non-line-of-sight (NLOS) scenario.

Download Full-text