scholarly journals Extracting Function-Driven Tracing Characteristics for Optimized SVM Classification

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Ming Wan ◽  
Xinlu Xu ◽  
Yan Song ◽  
Quanliang Li ◽  
Jiawei Li

Due to its openness and simplicity, Modbus TCP has wide applications to facilitate the actual management and control in industrial wireless fields. However, its potential security vulnerabilities can also create lots of complicated information security challenges, which are increasingly threatening the availability of industrial real-time traffic delivery. Although anomaly detection has been recognized as a workable security measure to identify attacks, the critical step to successfully extract data characteristics is an extremely difficult task. In this paper, we focus on the continuous control mode in industrial processes and propose a control tracing feature algorithm to extract the function-driven tracing characteristics from Modbus TCP data traffic. Furthermore, this algorithm can flexibly integrate the time factor with critical functional operations and adequately describe the dynamic control change of technological processes. To closely cooperate with this algorithm, one optimized SVM (support vector machine) classifier is introduced as the practicable decision engine. By designing one applicable attack mode, we develop an in-depth and meticulous analysis on the decision accuracy, and all experimental results clearly explain that the extracted features can strongly reflect the changing pattern of continuous functional operations, and the proposed algorithm can effectively cooperate with the optimized SVM classifier to distinguish abnormal Modbus TCP data traffic.

2017 ◽  
Vol 60 (5) ◽  
pp. 1489-1502 ◽  
Author(s):  
Mengyun Zhang ◽  
Changying Li ◽  
Fumiomi Takeda ◽  
Fuzeng Yang

Abstract. Internal bruise damage that occurs in blueberry fruit during harvest operations and postharvest handling lowers the overall quality and causes significant economic losses. The main goal of this study was to nondestructively detect internal bruises in blueberries after mechanical damage using hyperspectral transmittance imaging. A total of 600 hand-harvested blueberries were divided into 20 groups of four storage times (30 min, 3 h, 12 h, and 24 h), two storage temperatures (22°C and 4°C), and three treatments (stem bruise, equator bruise, and control). A near-infrared hyperspectral imaging system was used to acquire transmittance images from 970 to 1400 nm with 5 nm bandwidth. Images were acquired from three orientations (calyx-up, stem-up, and equator-up) for fruit in the control and stem bruise groups and from four orientations (calyx-up, stem-up, equator-up, and equator-down) in the equator bruise groups. Immediately after imaging, the fruit samples were sliced, and the sliced surfaces were photographed. The color images of sliced fruit were used as references. By comparing with the reference color images, the profiles of spatial and spectral intensities were evaluated to observe the effect of orientation and help extract regions of interest (ROIs) of bruised and healthy tissues. A support vector machine (SVM) classifier was trained and tested to classify pixels of bruised and healthy tissues. Classification maps were produced, and the bruise ratio was calculated to identify bruised blueberries (bruise ratio >25%). The average accuracy of blueberry identification was 94.5% with the stem-up orientation. The results indicate that detecting bruised blueberries as soon as 30 min after mechanical damage is feasible using hyperspectral transmittance imaging. Keywords: Blueberry, Bruise detection, Classification, Hyperspectral imagery, Transmittance mode.


2012 ◽  
Vol 198-199 ◽  
pp. 1280-1285 ◽  
Author(s):  
Shang Fu Gong ◽  
Juan Chen

The widely use of P2P (Peer-to-Peer) technology has caused resources take up too much, security risks and other problems, it is necessary to detect and control P2P traffic. After analyzing current P2P detection methods, a new method called TCBDM (Traffic Characters Based Detection Method) is put forward which combines P2P traffic character with support vector machine to detect P2P traffic. By choosing P2P traffic characters which differ from other network traffic, such as Round-Trip Time (RTT), the method creates a SVM classifier, uses a package named LIBSVM to classify P2P traffic in Moore_Set data sets. The result shows that TCBDM can detect P2P traffic effectively; the accuracy could reach 98%.


2020 ◽  
Author(s):  
Nalika Ulapane ◽  
Karthick Thiyagarajan ◽  
sarath kodagoda

<div>Classification has become a vital task in modern machine learning and Artificial Intelligence applications, including smart sensing. Numerous machine learning techniques are available to perform classification. Similarly, numerous practices, such as feature selection (i.e., selection of a subset of descriptor variables that optimally describe the output), are available to improve classifier performance. In this paper, we consider the case of a given supervised learning classification task that has to be performed making use of continuous-valued features. It is assumed that an optimal subset of features has already been selected. Therefore, no further feature reduction, or feature addition, is to be carried out. Then, we attempt to improve the classification performance by passing the given feature set through a transformation that produces a new feature set which we have named the “Binary Spectrum”. Via a case study example done on some Pulsed Eddy Current sensor data captured from an infrastructure monitoring task, we demonstrate how the classification accuracy of a Support Vector Machine (SVM) classifier increases through the use of this Binary Spectrum feature, indicating the feature transformation’s potential for broader usage.</div><div><br></div>


2020 ◽  
Vol 20 ◽  
Author(s):  
Hongwei Zhang ◽  
Steven Wang ◽  
Tao Huang

Aims: We would like to identify the biomarkers for chronic hypersensitivity pneumonitis (CHP) and facilitate the precise gene therapy of CHP. Background: Chronic hypersensitivity pneumonitis (CHP) is an interstitial lung disease caused by hypersensitive reactions to inhaled antigens. Clinically, the tasks of differentiating between CHP and other interstitial lungs diseases, especially idiopathic pulmonary fibrosis (IPF), were challenging. Objective: In this study, we analyzed the public available gene expression profile of 82 CHP patients, 103 IPF patients, and 103 control samples to identify the CHP biomarkers. Method: The CHP biomarkers were selected with advanced feature selection methods: Monte Carlo Feature Selection (MCFS) and Incremental Feature Selection (IFS). A Support Vector Machine (SVM) classifier was built. Then, we analyzed these CHP biomarkers through functional enrichment analysis and differential co-expression analysis. Result: There were 674 identified CHP biomarkers. The co-expression network of these biomarkers in CHP included more negative regulations and the network structure of CHP was quite different from the network of IPF and control. Conclusion: The SVM classifier may serve as an important clinical tool to address the challenging task of differentiating between CHP and IPF. Many of the biomarker genes on the differential co-expression network showed great promise in revealing the underlying mechanisms of CHP.


2019 ◽  
Vol 35 (4) ◽  
pp. 812-825 ◽  
Author(s):  
Robert Gorman

Abstract How to classify short texts effectively remains an important question in computational stylometry. This study presents the results of an experiment involving authorship attribution of ancient Greek texts. These texts were chosen to explore the effectiveness of digital methods as a supplement to the author’s work on text classification based on traditional stylometry. Here it is crucial to avoid confounding effects of shared topic, etc. Therefore, this study attempts to identify authorship using only morpho-syntactic data without regard to specific vocabulary items. The data are taken from the dependency annotations published in the Ancient Greek and Latin Dependency Treebank. The independent variables for classification are combinations generated from the dependency label and the morphology of each word in the corpus and its dependency parent. To avoid the effects of the combinatorial explosion, only the most frequent combinations are retained as input features. The authorship classification (with thirteen classes) is done with standard algorithms—logistic regression and support vector classification. During classification, the corpus is partitioned into increasingly smaller ‘texts’. To explore and control for the possible confounding effects of, e.g. different genre and annotator, three corpora were tested: a mixed corpus of several genres of both prose and verse, a corpus of prose including oratory, history, and essay, and a corpus restricted to narrative history. Results are surprisingly good as compared to those previously published. Accuracy for fifty-word inputs is 84.2–89.6%. Thus, this approach may prove an important addition to the prevailing methods for small text classification.


Author(s):  
B. Venkatesh ◽  
J. Anuradha

In Microarray Data, it is complicated to achieve more classification accuracy due to the presence of high dimensions, irrelevant and noisy data. And also It had more gene expression data and fewer samples. To increase the classification accuracy and the processing speed of the model, an optimal number of features need to extract, this can be achieved by applying the feature selection method. In this paper, we propose a hybrid ensemble feature selection method. The proposed method has two phases, filter and wrapper phase in filter phase ensemble technique is used for aggregating the feature ranks of the Relief, minimum redundancy Maximum Relevance (mRMR), and Feature Correlation (FC) filter feature selection methods. This paper uses the Fuzzy Gaussian membership function ordering for aggregating the ranks. In wrapper phase, Improved Binary Particle Swarm Optimization (IBPSO) is used for selecting the optimal features, and the RBF Kernel-based Support Vector Machine (SVM) classifier is used as an evaluator. The performance of the proposed model are compared with state of art feature selection methods using five benchmark datasets. For evaluation various performance metrics such as Accuracy, Recall, Precision, and F1-Score are used. Furthermore, the experimental results show that the performance of the proposed method outperforms the other feature selection methods.


2021 ◽  
Vol 10 (2) ◽  
pp. 30
Author(s):  
Radwan S. Abujassar ◽  
Husam Yaseen ◽  
Ahmad Samed Al-Adwan

Nowadays, networks use many different paths to exchange data. However, our research will construct a reliable path in the networks among a huge number of nodes for use in tele-surgery using medical applications such as healthcare tracking applications, including tele-surgery which lead to optimizing medical quality of service (m-QoS) during the COVID-19 situation. Many people could not travel due to the current issues, for fear of spreading the covid-19 virus. Therefore, our paper will provide a very trusted and reliable method of communication between a doctor and his patient so that the latter can do his operation even from a far distance. The communication between the doctor and his/her patient will be monitored by our proposed algorithm to make sure that the data will be received without delay. We test how we can invest buffer space that can be used efficiently to reduce delays between source and destination, avoiding loss of high-priority data packets. The results are presented in three stages. First, we show how to obtain the greatest possible reduction in rate variability when the surgeon begins an operation using live streaming. Second, the proposed algorithm reduces congestion on the determined path used for the online surgery. Third, we have evaluated the affection of optimal smoothing algorithm on the network parameters such as peak-to-mean ratio and delay to optimize m-QoS. We propose a new Smart-Rout Control algorithm (s-RCA) for creating a virtual smart path between source and destination to transfer the required data traffic between them, considering the number of hops and link delay. This provides a reliable connection that can be used in healthcare surgery to guarantee that all instructions are received without any delay, to be executed instantly. This idea can improve m-QoS in distance surgery, with trusted paths. The new s-RCA can be adapted with an existing routing protocol to track the primary path and monitor emergency packets received in node buffers, for direct forwarding via the demand path, with extended features.


Diagnostics ◽  
2021 ◽  
Vol 11 (5) ◽  
pp. 739
Author(s):  
Alessandro Bevilacqua ◽  
Margherita Mottola ◽  
Fabio Ferroni ◽  
Alice Rossi ◽  
Giampaolo Gavelli ◽  
...  

Predicting clinically significant prostate cancer (csPCa) is crucial in PCa management. 3T-magnetic resonance (MR) systems may have a novel role in quantitative imaging and early csPCa prediction, accordingly. In this study, we develop a radiomic model for predicting csPCa based solely on native b2000 diffusion weighted imaging (DWIb2000) and debate the effectiveness of apparent diffusion coefficient (ADC) in the same task. In total, 105 patients were retrospectively enrolled between January–November 2020, with confirmed csPCa or ncsPCa based on biopsy. DWIb2000 and ADC images acquired with a 3T-MRI were analyzed by computing 84 local first-order radiomic features (RFs). Two predictive models were built based on DWIb2000 and ADC, separately. Relevant RFs were selected through LASSO, a support vector machine (SVM) classifier was trained using repeated 3-fold cross validation (CV) and validated on a holdout set. The SVM models rely on a single couple of uncorrelated RFs (ρ < 0.15) selected through Wilcoxon rank-sum test (p ≤ 0.05) with Holm–Bonferroni correction. On the holdout set, while the ADC model yielded AUC = 0.76 (95% CI, 0.63–0.96), the DWIb2000 model reached AUC = 0.84 (95% CI, 0.63–0.90), with specificity = 75%, sensitivity = 90%, and informedness = 0.65. This study establishes the primary role of 3T-DWIb2000 in PCa quantitative analyses, whilst ADC can remain the leading sequence for detection.


Electronics ◽  
2021 ◽  
Vol 10 (12) ◽  
pp. 1496
Author(s):  
Hao Liang ◽  
Yiman Zhu ◽  
Dongyang Zhang ◽  
Le Chang ◽  
Yuming Lu ◽  
...  

In analog circuit, the component parameters have tolerances and the fault component parameters present a wide distribution, which brings obstacle to classification diagnosis. To tackle this problem, this article proposes a soft fault diagnosis method combining the improved barnacles mating optimizer(BMO) algorithm with the support vector machine (SVM) classifier, which can achieve the minimum redundancy and maximum relevance for feature dimension reduction with fuzzy mutual information. To be concrete, first, the improved barnacles mating optimizer algorithm is used to optimize the parameters for learning and classification. We adopt six test functions that are on three data sets from the University of California, Irvine (UCI) machine learning repository to test the performance of SVM classifier with five different optimization algorithms. The results show that the SVM classifier combined with the improved barnacles mating optimizer algorithm is characterized with high accuracy in classification. Second, fuzzy mutual information, enhanced minimum redundancy, and maximum relevance principle are applied to reduce the dimension of the feature vector. Finally, a circuit experiment is carried out to verify that the proposed method can achieve fault classification effectively when the fault parameters are both fixed and distributed. The accuracy of the proposed fault diagnosis method is 92.9% when the fault parameters are distributed, which is 1.8% higher than other classifiers on average. When the fault parameters are fixed, the accuracy rate is 99.07%, which is 0.7% higher than other classifiers on average.


2021 ◽  
Vol 11 (5) ◽  
pp. 1990
Author(s):  
Vinod Devaraj ◽  
Philipp Aichinger

The characterization of voice quality is important for the diagnosis of a voice disorder. Vocal fry is a voice quality which is traditionally characterized by a low frequency and a long closed phase of the glottis. However, we also observed amplitude modulated vocal fry glottal area waveforms (GAWs) without long closed phases (positive group) which we modelled using an analysis-by-synthesis approach. Natural and synthetic GAWs are modelled. The negative group consists of euphonic, i.e., normophonic GAWs. The analysis-by-synthesis approach fits two modelled GAWs for each of the input GAW. One modelled GAW is modulated to replicate the amplitude and frequency modulations of the input GAW and the other modelled GAW is unmodulated. The modelling errors of the two modelled GAWs are determined to classify the GAWs into the positive and the negative groups using a simple support vector machine (SVM) classifier with a linear kernel. The modelling errors of all vocal fry GAWs obtained using the modulating model are smaller than the modelling errors obtained using the unmodulated model. Using the two modelling errors as predictors for classification, no false positives or false negatives are obtained. To further distinguish the subtypes of amplitude modulated vocal fry GAWs, the entropy of the modulator’s power spectral density and the modulator-to-carrier frequency ratio are obtained.


Sign in / Sign up

Export Citation Format

Share Document