Identifying High Risk of Atherosclerosis Using Deep Learning and Ensemble Learning

Author(s):  
Hedieh Hashem Olhosseiny ◽  
Mohammadsalar Mirzaloo ◽  
Miodrag Bolic ◽  
Hilmi R. Dajani ◽  
Voicu Groza ◽  
...  
2021 ◽  
Vol 11 (5) ◽  
pp. 2164
Author(s):  
Jiaxin Li ◽  
Zhaoxin Zhang ◽  
Changyong Guo

X.509 certificates play an important role in encrypting the transmission of data on both sides under HTTPS. With the popularization of X.509 certificates, more and more criminals leverage certificates to prevent their communications from being exposed by malicious traffic analysis tools. Phishing sites and malware are good examples. Those X.509 certificates found in phishing sites or malware are called malicious X.509 certificates. This paper applies different machine learning models, including classical machine learning models, ensemble learning models, and deep learning models, to distinguish between malicious certificates and benign certificates with Verification for Extraction (VFE). The VFE is a system we design and implement for obtaining plentiful characteristics of certificates. The result shows that ensemble learning models are the most stable and efficient models with an average accuracy of 95.9%, which outperforms many previous works. In addition, we obtain an SVM-based detection model with an accuracy of 98.2%, which is the highest accuracy. The outcome indicates the VFE is capable of capturing essential and crucial characteristics of malicious X.509 certificates.


2021 ◽  
Author(s):  
Yangyang Tian ◽  
Qi Wang ◽  
Zhimin Guo ◽  
Huitong Zhao ◽  
Sulaiman Khan ◽  
...  

Circulation ◽  
2020 ◽  
Vol 142 (Suppl_3) ◽  
Author(s):  
John Pfeifer ◽  
Sushravya M Raghunath ◽  
Alvaro Ulloa ◽  
Arun Nemani ◽  
Tanner Carbonati ◽  
...  

Background: Atrial fibrillation (AF) is associated with stroke, especially when AF goes undetected. Deep neural networks (DNN) can predict incident AF from a 12-lead resting ECG. We hypothesize that use of a DNN to predict new onset AF from an ECG may identify patients at risk of sustaining a potentially preventable AF-related stroke. Methods: We trained a DNN model to predict new-onset AF using 382,604 ECGs prior to 2010. We then evaluated the model performance on a test set of ECGs from 2010 through 2014 linked to patients in an institutional stroke registry. There were 181,969 patients in the test set with at least one ECG and no prior history of AF. Of those patients 3,497 (1.9%) had a stroke following an ECG that did not show AF. Within the set of patients with stroke, 375 had the stroke within 3 years of the ECG and were diagnosed with new AF between -3 and 365 days of the stroke. We considered these potentially preventable AF-related strokes. We report the sensitivity and positive predictive value (PPV) of the model for appropriately risk stratifying these 375 patients who sustained a potentially preventable AF-related stroke. Results: We used F β scores to identify different risk prediction thresholds (operating points) for the model. Operating points chosen by F 0.5 , F 1 , and F 2 scores identified 4, 12, and 21% of the population as high risk for the development of AF within 1 year (Figure 1). Screening 1, 4, 12, and 21% of the overall population resulted in PPV of 28, 21, 15, and 12%, respectively, for identification of new onset AF in one year. Using those same thresholds yielded sensitivities of 4, 17, 45, and 62% for identifying potentially preventable AF-related strokes. The different risk prediction thresholds resulted in a low (120-162) number needed to screen to detect one potentially preventable AF-related stroke at 3 years. Conclusions: Use of a deep learning model to predict new onset AF may identify patients at high risk of sustaining a potentially preventable AF-related stroke.


2021 ◽  
Vol 13 (578) ◽  
pp. eaba4373 ◽  
Author(s):  
Adam Yala ◽  
Peter G. Mikhael ◽  
Fredrik Strand ◽  
Gigin Lin ◽  
Kevin Smith ◽  
...  

Improved breast cancer risk models enable targeted screening strategies that achieve earlier detection and less screening harm than existing guidelines. To bring deep learning risk models to clinical practice, we need to further refine their accuracy, validate them across diverse populations, and demonstrate their potential to improve clinical workflows. We developed Mirai, a mammography-based deep learning model designed to predict risk at multiple timepoints, leverage potentially missing risk factor information, and produce predictions that are consistent across mammography machines. Mirai was trained on a large dataset from Massachusetts General Hospital (MGH) in the United States and tested on held-out test sets from MGH, Karolinska University Hospital in Sweden, and Chang Gung Memorial Hospital (CGMH) in Taiwan, obtaining C-indices of 0.76 (95% confidence interval, 0.74 to 0.80), 0.81 (0.79 to 0.82), and 0.79 (0.79 to 0.83), respectively. Mirai obtained significantly higher 5-year ROC AUCs than the Tyrer-Cuzick model (P < 0.001) and prior deep learning models Hybrid DL (P < 0.001) and Image-Only DL (P < 0.001), trained on the same dataset. Mirai more accurately identified high-risk patients than prior methods across all datasets. On the MGH test set, 41.5% (34.4 to 48.5) of patients who would develop cancer within 5 years were identified as high risk, compared with 36.1% (29.1 to 42.9) by Hybrid DL (P = 0.02) and 22.9% (15.9 to 29.6) by the Tyrer-Cuzick model (P < 0.001).


Algorithms ◽  
2020 ◽  
Vol 13 (4) ◽  
pp. 83 ◽  
Author(s):  
Giannis Haralabopoulos ◽  
Ioannis Anagnostopoulos ◽  
Derek McAuley

Sentiment analysis usually refers to the analysis of human-generated content via a polarity filter. Affective computing deals with the exact emotions conveyed through information. Emotional information most frequently cannot be accurately described by a single emotion class. Multilabel classifiers can categorize human-generated content in multiple emotional classes. Ensemble learning can improve the statistical, computational and representation aspects of such classifiers. We present a baseline stacked ensemble and propose a weighted ensemble. Our proposed weighted ensemble can use multiple classifiers to improve classification results without hyperparameter tuning or data overfitting. We evaluate our ensemble models with two datasets. The first dataset is from Semeval2018-Task 1 and contains almost 7000 Tweets, labeled with 11 sentiment classes. The second dataset is the Toxic Comment Dataset with more than 150,000 comments, labeled with six different levels of abuse or harassment. Our results suggest that ensemble learning improves classification results by 1.5 % to 5.4 % .


Sign in / Sign up

Export Citation Format

Share Document