Detection of Osteosarcoma on Bone Radiographs Using Convolutional Neural Networks

Mapping Intimacies ◽

10.21528/cbic2021-16 ◽

2021 ◽

Author(s):

Larissa Asito ◽

Hélcio Pereira ◽

Marcello Nogueira-Barbosa ◽

Renato Tinós

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Selection ◽

Random Forest ◽

Decision Tree ◽

Convolutional Neural Networks ◽

Learning Models ◽

Computer Aided ◽

Aided Diagnosis ◽

Machine Learning Models

We propose a computer-aided diagnosis system based on convolutional neural networks (CNNs) for the identification of osteosarcoma on bone radiographs. The CNN should indicate regions of the image that may contain tumors. In order to indicate these regions on the image, we propose to split the image in windows and individually classify them by using a CNN. Techniques for pre-processing, such as window exclusion and labeling, are proposed. Two CNNs are compared in the proposed system. The first one is trained from scratch, while the second one is a pre-trained CNN (VGG16). The CNNs are compared to four machine learning models that use features extracted from the image windows as inputs: multilayer perceptron (MLP), decision tree, random forest, and MLP with feature selection. In the experiments, the best performance was obtained by the pre-trained CNN.

Download Full-text

A Novel Method for Performance Measurement of Public Educational Institutions Using Machine Learning Models

Applied Sciences ◽

10.3390/app11199296 ◽

2021 ◽

Vol 11 (19) ◽

pp. 9296

Author(s):

Talha Mahboob Alam ◽

Mubbashar Mushtaq ◽

Kamran Shaukat ◽

Ibrahim A. Hameed ◽

Muhammad Umer Sarwar ◽

...

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Selection ◽

Artificial Neural Networks ◽

Performance Indicators ◽

Key Performance Indicators ◽

Educational Institutions ◽

Institutional Performance ◽

Learning Models ◽

Machine Learning Models

Lack of education is a major concern in underdeveloped countries because it leads to poor human and economic development. The level of education in public institutions varies across all regions around the globe. Current disparities in access to education worldwide are mostly due to systemic regional differences and the distribution of resources. Previous research focused on evaluating students’ academic performance, but less has been done to measure the performance of educational institutions. Key performance indicators for the evaluation of institutional performance differ from student performance indicators. There is a dire need to evaluate educational institutions’ performance based on their disparities and academic results on a large scale. This study proposes a model to measure institutional performance based on key performance indicators through data mining techniques. Various feature selection methods were used to extract the key performance indicators. Several machine learning models, namely, J48 decision tree, support vector machines, random forest, rotation forest, and artificial neural networks were employed to build an efficient model. The results of the study were based on different factors, i.e., the number of schools in a specific region, teachers, school locations, enrolment, and availability of necessary facilities that contribute to school performance. It was also observed that urban regions performed well compared to rural regions due to the improved availability of educational facilities and resources. The results showed that artificial neural networks outperformed other models and achieved an accuracy of 82.9% when the relief-F based feature selection method was used. This study will help support efforts in governance for performance monitoring, policy formulation, target-setting, evaluation, and reform to address the issues and challenges in education worldwide.

Download Full-text

Train Deep Learning Models using subsurface geological images datasets

10.5194/egusphere-egu21-6385 ◽

2021 ◽

Author(s):

Ramy Abdallah ◽

Clare E. Bond ◽

Robert W.H. Butler

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Models ◽

Structural Interpretation ◽

Performance Accuracy ◽

Learning Techniques ◽

Machine Learning Models

Machine learning is being presented as a new solution for a wide range of geoscience problems. Primarily machine learning has been used for 3D seismic data processing, seismic facies analysis and well log data correlation. The rapid development in technology with open-source artificial intelligence libraries and the accessibility of affordable computer graphics processing units (GPU) makes the application of machine learning in geosciences increasingly tractable. However, the application of artificial intelligence in structural interpretation workflows of subsurface datasets is still ambiguous. This study aims to use machine learning techniques to classify images of folds and fold-thrust structures. Here we show that convolutional neural networks (CNNs) as supervised deep learning techniques provide excellent algorithms to discriminate between geological image datasets. Four different datasets of images have been used to train and test the machine learning models. These four datasets are a seismic character dataset with five classes (faults, folds, salt, flat layers and basement), folds types with three classes (buckle, chevron and conjugate), fault types with three classes (normal, reverse and thrust) and fold-thrust geometries with three classes (fault bend fold, fault propagation fold and detachment fold). These image datasets are used to investigate three machine learning models. One Feedforward linear neural network model and two convolutional neural networks models (Convolution 2d layer transforms sequential model and Residual block model (ResNet with 9, 34, and 50 layers)). Validation and testing datasets forms a critical part of testing the model&#8217;s performance accuracy. The ResNet model records the highest performance accuracy score, of the machine learning models tested. Our CNN image classification model analysis provides a framework for applying machine learning to increase structural interpretation efficiency, and shows that CNN classification models can be applied effectively to geoscience problems. The study provides a starting point to apply unsupervised machine learning approaches to sub-surface structural interpretation workflows.

Download Full-text

Neural Networks in ECG Classification

Neural Networks in Healthcare ◽

10.4018/978-1-59140-848-2.ch004 ◽

2011 ◽

pp. 81-104 ◽

Cited By ~ 2

Author(s):

G. Camps-Valls ◽

J. F. Guerrero-Martinez

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Feature Selection ◽

Adaptive Systems ◽

Clinical Environment ◽

Learning Models ◽

Cardiac Pathology ◽

Learning Tasks ◽

Beat Detection ◽

Machine Learning Models

In this chapter, we review the vast field of application of artificial neural networks in cardiac pathology discrimination based on electrocardiographic signals. We discuss advantages and drawbacks of neural and adaptive systems in cardiovascular medicine and catch a glimpse of forthcoming developments in machine learning models for the real clinical environment. Some problems are identified in the learning tasks of beat detection, feature selection/extraction, and classification, and some proposals and suggestions are given to alleviate the problems of interpretability, overfitting, and adaptation. These have become important problems in recent years and will surely constitute the basis of some investigations in the immediate future.

Download Full-text

Sa1655 UNSUPERVISED MACHINE LEARNING MODELS FOR POLYP IMAGE DATABASES USED FOR COMPUTER-AIDED DIAGNOSIS

Gastroenterology ◽

10.1016/s0016-5085(20)31620-6 ◽

2020 ◽

Vol 158 (6) ◽

pp. S-370-S-371

Author(s):

Dennis Shung ◽

John Onofrey ◽

Artur Viana ◽

Harry R. Aslanian

Keyword(s):

Machine Learning ◽

Computer Aided Diagnosis ◽

Image Databases ◽

Learning Models ◽

Unsupervised Machine Learning ◽

Computer Aided ◽

Aided Diagnosis ◽

Machine Learning Models

Download Full-text

Drug Classification using Black-box models and Interpretability

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38203 ◽

2021 ◽

Vol 9 (9) ◽

pp. 1518-1529

Author(s):

Pooja Thakkar

Keyword(s):

Machine Learning ◽

Random Forest ◽

Decision Tree ◽

Learning Models ◽

Drug Classification ◽

Box Models ◽

Machine Learning Model ◽

Black Box Models ◽

Insight Into ◽

Machine Learning Models

Abstract: The focus of this study is on drug categorization utilising Machine Learning models, as well as interpretability utilizing LIME and SHAP to get a thorough understanding of the ML models. To do this, the researchers used machine learning models such as random forest, decision tree, and logistic regression to classify drugs. Then, using LIME and SHAP, they determined if these models were interpretable, which allowed them to better understand their results. It may be stated at the conclusion of this paper that LIME and SHAP can be utilised to get insight into a Machine Learning model and determine which attribute is accountable for the divergence in the outcomes. According to the LIME and SHAP results, it is also discovered that Random Forest and Decision Tree ML models are the best models to employ for drug classification, with Na to K and BP being the most significant characteristics for drug classification. Keywords: Machine Learning, Back-box models, LIME, SHAP, Decision Tree

Download Full-text

Credit Risk Model Based on Central Bank Credit Registry Data

Journal of Risk and Financial Management ◽

10.3390/jrfm14030138 ◽

2021 ◽

Vol 14 (3) ◽

pp. 138

Author(s):

Fisnik Doko ◽

Slobodan Kalajdziski ◽

Igor Mishkovski

Keyword(s):

Machine Learning ◽

Random Forest ◽

Decision Tree ◽

Central Bank ◽

Credit Risk ◽

Commercial Banks ◽

Registry Data ◽

Learning Models ◽

Model Based ◽

Machine Learning Models

Data science and machine-learning techniques help banks to optimize enterprise operations, enhance risk analyses and gain competitive advantage. There is a vast amount of research in credit risk, but to our knowledge, none of them uses credit registry as a data source to model the probability of default for individual clients. The goal of this paper is to evaluate different machine-learning models to create accurate model for credit risk assessment using the data from the real credit registry dataset of the Central Bank of Republic of North Macedonia. We strongly believe that the model developed in this research will be an additional source of valuable information to commercial banks, by leveraging historical data for all the population of the country in all the commercial banks. Thus, in this research, we compare five machine-learning models to classify credit risk data, i.e., logistic regression, decision tree, random forest, support vector machines (SVM) and neural network. We evaluate the five models using different machine-learning metrics, and we propose a model based on credit registry data from the central bank with detailed methodology that can predict the credit risk based on credit history of the population in the country. Our results show that the best accuracy is achieved by using decision tree performing on imbalanced data with and without scaling, followed by random forest and linear regression.

Download Full-text

Machine Learning Models for Image-Based Diagnosis and Prognosis of COVID-19: Systematic Review (Preprint)

10.2196/preprints.25181 ◽

2020 ◽

Author(s):

Mahdieh Montazeri ◽

Roxana ZahediNasab ◽

Ali Farahani ◽

Hadis Mohseni ◽

Fahimeh Ghasemian

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Health Care ◽

High Risk ◽

Convolutional Neural Networks ◽

Risk Of Bias ◽

Learning Models ◽

Learning Methods ◽

Diagnosis And Prognosis ◽

Machine Learning Models

BACKGROUND Accurate and timely diagnosis and effective prognosis of the disease is important to provide the best possible care for patients with COVID-19 and reduce the burden on the health care system. Machine learning methods can play a vital role in the diagnosis of COVID-19 by processing chest x-ray images. OBJECTIVE The aim of this study is to summarize information on the use of intelligent models for the diagnosis and prognosis of COVID-19 to help with early and timely diagnosis, minimize prolonged diagnosis, and improve overall health care. METHODS A systematic search of databases, including PubMed, Web of Science, IEEE, ProQuest, Scopus, bioRxiv, and medRxiv, was performed for COVID-19–related studies published up to May 24, 2020. This study was performed in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines. All original research articles describing the application of image processing for the prediction and diagnosis of COVID-19 were considered in the analysis. Two reviewers independently assessed the published papers to determine eligibility for inclusion in the analysis. Risk of bias was evaluated using the Prediction Model Risk of Bias Assessment Tool. RESULTS Of the 629 articles retrieved, 44 articles were included. We identified 4 prognosis models for calculating prediction of disease severity and estimation of confinement time for individual patients, and 40 diagnostic models for detecting COVID-19 from normal or other pneumonias. Most included studies used deep learning methods based on convolutional neural networks, which have been widely used as a classification algorithm. The most frequently reported predictors of prognosis in patients with COVID-19 included age, computed tomography data, gender, comorbidities, symptoms, and laboratory findings. Deep convolutional neural networks obtained better results compared with non–neural network–based methods. Moreover, all of the models were found to be at high risk of bias due to the lack of information about the study population, intended groups, and inappropriate reporting. CONCLUSIONS Machine learning models used for the diagnosis and prognosis of COVID-19 showed excellent discriminative performance. However, these models were at high risk of bias, because of various reasons such as inadequate information about study participants, randomization process, and the lack of external validation, which may have resulted in the optimistic reporting of these models. Hence, our findings do not recommend any of the current models to be used in practice for the diagnosis and prognosis of COVID-19.

Download Full-text

Feature Importances: A Tool to Explain Radio Propagation and Reduce Model Complexity

Telecom ◽

10.3390/telecom1020009 ◽

2020 ◽

Vol 1 (2) ◽

pp. 114-125

Author(s):

Sotirios P. Sotiroudis ◽

Sotirios K. Goudos ◽

Katherine Siakavara

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Random Forest ◽

Response Times ◽

Path Loss ◽

Radio Propagation ◽

Model Complexity ◽

Learning Models ◽

Reduced Complexity ◽

Machine Learning Models

Machine learning models have been widely deployed to tackle the problem of radio propagation. In addition to helping in the estimation of path loss, they can also be used to better understand the details of various propagation scenarios. Our current work exploits the inherent ranking of feature importances provided by XGBoost and Random Forest as a means of indicating the contribution of the underlying propagation mechanisms. A comparison between two different transmitter antenna heights, revealing the associated propagation profiles, is made. Feature selection is then implemented, leading to models with reduced complexity, and consequently reduced training and response times, based on the previously calculated importances.

Download Full-text

Wave downscaling using machine learning

10.5194/egusphere-egu2020-553 ◽

2020 ◽

Author(s):

Sara Santamaria Aguilar ◽

Thomas Wahl

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Wave Propagation ◽

Artificial Neural Networks ◽

Random Forest ◽

Linear Regression ◽

Significant Wave Height ◽

Wave Climate ◽

Learning Models ◽

Machine Learning Models

Future changes in the wind wave climate due to atmospheric changes can intensify present erosion and flood risk. Knowledge on both mean and extreme wave climate is necessary for understanding changes in sediment dynamics and flood events at the coastline. In order to assess potential wave changes, ensemble nearshore wave projections are required for covering &#160;&#160;the entire range of wave conditions and also the large uncertainties related to future climate states. However, nearshore wave projections are not available for most coastal regions due to the excessive computational effort required for dynamically downscaling ensemble offshore wave data. As a result, the large relative contribution of waves to coastal flooding and erosion is commonly omitted in the assessment of those hazards. In this context, machine learning models can be an efficient tool for downscaling ensemble global wave projections if they are able to accurately simulate the non-linear processes of wave propagation due to their low computational requirements. Here, we analyse the performance of three machine learning methods, namely random forest, multivariate adaptive regression splines and artificial neural networks, for downscaling the wave climate along the coast of Florida. We further compare the performance of these three models to the multiple linear regression, which is a statistical model frequently used, although it does not account for the non-linearities associated with wave propagation processes. We find that the three machine learning models perform better than the multiple linear regression for all wave parameters (significant wave height, peak and mean periods, direction) along the entire coastline of Florida, which highlights the ability of these models to reproduce the non-linear wave propagation processes. Specifically, random forest shows the best performance and the lowest computational training times. In addition, this model shows a remarkably good performance in simulating the wave extreme events compared to the other models. By following a tree bagging approach, random forest can also provide confidence intervals and reduce the tuning process. The latter is one of the main disadvantages of the artificial neural networks, which also show a high performance for wave downscaling but require more training and tuning effort. Although the significant wave height and the periods can be simulated with very high accuracy (R2 higher than 0.9 and 0.8 respectively), the wave direction is poorly simulated by all models due to its circular behaviour. We find that a transformation of the direction into sine and cosine can improve the model performance. Finally, we downscale an ensemble of global wave projections along the coast of Florida and assess potential changes in the wave climate of this region. &#160;&#160;

Download Full-text

Machine Learning Models for the Prediction of Postpartum Depression: Application and Comparison Based on a Cohort Study (Preprint)

10.2196/preprints.15516 ◽

2019 ◽

Author(s):

Weina Zhang ◽

Han Liu ◽

Vincent Michael Bernard Silenzio ◽

Peiyuan Qiu ◽

Wenjie Gong

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Random Forest ◽

Postpartum Depression ◽

Support Vector ◽

Selection Methods ◽

Learning Models ◽

Expert Consultation ◽

Using Data ◽

Machine Learning Models

BACKGROUND Postpartum depression (PPD) is a serious public health problem. Building a predictive model for PPD using data during pregnancy can facilitate earlier identification and intervention. OBJECTIVE The aims of this study are to compare the effects of four different machine learning models using data during pregnancy to predict PPD and explore which factors in the model are the most important for PPD prediction. METHODS Information on the pregnancy period from a cohort of 508 women, including demographics, social environmental factors, and mental health, was used as predictors in the models. The Edinburgh Postnatal Depression Scale score within 42 days after delivery was used as the outcome indicator. Using two feature selection methods (expert consultation and random forest-based filter feature selection [FFS-RF]) and two algorithms (support vector machine [SVM] and random forest [RF]), we developed four different machine learning PPD prediction models and compared their prediction effects. RESULTS There was no significant difference in the effectiveness of the two feature selection methods in terms of model prediction performance, but 10 fewer factors were selected with the FFS-RF than with the expert consultation method. The model based on SVM and FFS-RF had the best prediction effects (sensitivity=0.69, area under the curve=0.78). In the feature importance ranking output by the RF algorithm, psychological elasticity, depression during the third trimester, and income level were the most important predictors. CONCLUSIONS In contrast to the expert consultation method, FFS-RF was important in dimension reduction. When the sample size is small, the SVM algorithm is suitable for predicting PPD. In the prevention of PPD, more attention should be paid to the psychological resilience of mothers.

Download Full-text