scholarly journals Mitigating Insider Threats Using Bio-Inspired Models

2020 ◽  
Vol 10 (15) ◽  
pp. 5046
Author(s):  
Andreas Nicolaou ◽  
Stavros Shiaeles ◽  
Nick Savage

Insider threats have become a considerable information security issue that governments and organizations must face. The implementation of security policies and procedures may not be enough to protect organizational assets. Even with the evolution of information and network security technology, the threat from insiders is increasing. Many researchers are approaching this issue with various methods in order to develop a model that will help organizations to reduce their exposure to the threat and prevent damage to their assets. In this paper, we approach the insider threat problem and attempt to mitigate it by developing a machine learning model based on Bio-inspired computing. The model was developed by using an existing unsupervised learning algorithm for anomaly detection and we fitted the model to a synthetic dataset to detect outliers. We explore swarm intelligence algorithms and their performance on feature selection optimization for improving the performance of the machine learning model. The results show that swarm intelligence algorithms perform well on feature selection optimization and the generated, near-optimal, subset of features has a similar performance to the original one.

Author(s):  
Hwayoung Park ◽  
Sungtae Shin ◽  
Changhong Youm ◽  
Sang-Myung Cheon ◽  
Myeounggon Lee ◽  
...  

Abstract Background Freezing of gait (FOG) is a sensitive problem, which is caused by motor control deficits and requires greater attention during postural transitions such as turning in people with Parkinson’s disease (PD). However, the turning characteristics have not yet been extensively investigated to distinguish between people with PD with and without FOG (freezers and non-freezers) based on full-body kinematic analysis during the turning task. The objectives of this study were to identify the machine learning model that best classifies people with PD and freezers and reveal the associations between clinical characteristics and turning features based on feature selection through stepwise regression. Methods The study recruited 77 people with PD (31 freezers and 46 non-freezers) and 34 age-matched older adults. The 360° turning task was performed at the preferred speed for the inner step of the more affected limb. All experiments on the people with PD were performed in the “Off” state of medication. The full-body kinematic features during the turning task were extracted using the three-dimensional motion capture system. These features were selected via stepwise regression. Results In feature selection through stepwise regression, five and six features were identified to distinguish between people with PD and controls and between freezers and non-freezers (PD and FOG classification problem), respectively. The machine learning model accuracies revealed that the random forest (RF) model had 98.1% accuracy when using all turning features and 98.0% accuracy when using the five features selected for PD classification. In addition, RF and logistic regression showed accuracies of 79.4% when using all turning features and 72.9% when using the six selected features for FOG classification. Conclusion We suggest that our study leads to understanding of the turning characteristics of people with PD and freezers during the 360° turning task for the inner step of the more affected limb and may help improve the objective classification and clinical assessment by disease progression using turning features.


Author(s):  
J. V. D. Prasad ◽  
A. Raghuvira Pratap ◽  
Babu Sallagundla

With the rapid increase in number of clinical data and hence the prediction and analysing data becomes very difficult. With the help of various machine learning models, it becomes easy to work on these huge data. A machine learning model faces lots of challenges; one among the challenge is feature selection. In this research work, we propose a novel feature selection method based on statistical procedures to increase the performance of the machine learning model. Furthermore, we have tested the feature selection algorithm in liver disease classification dataset and the results obtained shows the efficiency of the proposed method.


2020 ◽  
Vol 32 ◽  
pp. 03032
Author(s):  
Sahil Parab ◽  
Piyush Rathod ◽  
Durgesh Patil ◽  
Vishwanath Chikkareddi

Diabetes Detection has been one of the many challenges which is being faced by the medical as well as technological communities. The principles of machine learning and its algorithms is used in order to detect the possibility of a diabetic patient based on their level of glucose concentration , insulin levels and other medically point of view required test reports. The basic diabetes detection model uses Bayesian classification machine learning algorithm, but even though the model is able to detect diabetes, the efficiency is not acceptable at all times because of the drawbacks of the single algorithm of the model. A Hybrid Machine Learning Model is used to overcome the drawbacks produced by a single algorithm model. A Hybrid Model is constructed by implementing multiple applicable machine learning algorithms such as the SVM model and Bayesian’s Classification model or any other models in order to overcome drawbacks faced by each other and also provide their mutually contributed efficiency. In a perfect case scenario the new hybrid machine learning model will be able to provide more efficiency as compared to the old Bayesian’s classification model.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Fengyi Zhang ◽  
Xinyuan Cui ◽  
Renrong Gong ◽  
Chuan Zhang ◽  
Zhigao Liao

This study aimed to provide effective methods for the identification of surgeries with high cancellation risk based on machine learning models and analyze the key factors that affect the identification performance. The data covered the period from January 1, 2013, to December 31, 2014, at West China Hospital in China, which focus on elective urologic surgeries. All surgeries were scheduled one day in advance, and all cancellations were of institutional resource- and capacity-related types. Feature selection strategies, machine learning models, and sampling methods are the most discussed topic in general machine learning researches and have a direct impact on the performance of machine learning models. Hence, they were considered to systematically generate complete schemes in machine learning-based identification of surgery cancellations. The results proved the feasibility and robustness of identifying surgeries with high cancellation risk, with the considerable maximum of area under the curve (AUC) (0.7199) for random forest model with original sampling using backward selection strategy. In addition, one-side Delong test and sum of square error analysis were conducted to measure the effects of feature selection strategy, machine learning model, and sampling method on the identification of surgeries with high cancellation risk, and the selection of machine learning model was identified as the key factors that affect the identification of surgeries with high cancellation risk. This study offers methodology and insights for identifying the key experimental factors for identifying surgery cancellations, and it is helpful to further research on machine learning-based identification of surgeries with high cancellation risk.


2021 ◽  
Author(s):  
Chalachew Muluken Liyew ◽  
Haileyesus Amsaya Melese

Abstract It is crucial to predict the amount of daily rainfall to improve agricultural productivities to secure food, and water quality supply to keep the citizen healthy. To predict rainfall, various researches are conducted using data mining and machine learning techniques of different countries’ environmental datasets. The Pearson correlation technique is used to select relevant environmental variables which are used as an input for the machine learning model of this study. The main objective of this study is to identify the relevant atmospheric features that cause rainfall and predict the intensity of daily rainfall using machine learning techniques. The dataset is collected from the local meteorological office to measure the performance of three machine learning techniques as Multivariate Linear Regression, Random Forest and Extreme Gradient Boost. Root mean squared error and Mean absolute Error are used to measure the performance of the machine learning model for this study. The result of the study shows that the Extreme Gradient Boost gradient descent machine learning algorithm performs better than others.


2020 ◽  
Author(s):  
Andrea Delli Pizzi ◽  
Antonio Chiarelli ◽  
Piero Chiacchiaretta ◽  
Cristina Valdesi ◽  
Pierpaolo Croce ◽  
...  

Abstract Ground-Glass Opacities (GGOs) are a non-specific CT finding observed in the early phase of COVID-19 pneumonia. However, GGOs are also seen in other acute interstitial and alveolar lung diseases, thus making the differential diagnosis a diagnostic challenge. In this poof-of-concept study, we aimed to differentiate COVID-19 pneumonia presenting with GGOs from acute non-COVID-19 lung disease using a novel radiomic-based model in patients who underwent a high-resolution CT (HRCT) scan at hospital admission during the first pandemic peak in Italy. HRCT scans of 28 RT-PCR diagnosed COVID-19 pneumonia (COVID) and 30 acute non-COVID-lung disease (nCOVID) were retrospectively included. All patients showed GGOs as the predominant CT pattern. Two readers, blinded to the final diagnosis, independently segmented GGOs on CT scans by using a semi-automated approach, and radiomic features were extracted from segmented images. Partial least square (PLS) regression was used as the multivariate machine-learning algorithm. A leave-one-out nested cross-validation was implemented to optimize the hyperparameter of PLS and to assess the model generalization. The diagnostic performance of the radiomic model to differentiate between COVID and nCOVID lung disease was assessed through receiver operating characteristic (ROC) analysis. The radiomics-based machine learning model differentiated COVID and nCOVID with an AUC = 0.868 (p = 4.2·10− 7). After a careful prospective evaluation in larger multicentric studies, it may help radiologists to rule out COVID-19 pneumonia thus improving the COVID-19 triaging in epidemic areas.


Sign in / Sign up

Export Citation Format

Share Document