β-Thalassemia Knowledge Elicitation Using Data Engineering: PCA, Pearson’s Chi Square and Machine Learning

Author(s):  
P. Paokanta

COVID-19 has become a pandemic affecting the most of countries in the world. One of the most difficult decisions doctors face during the Covid-19 epidemic is determining which patients will stay in hospital, and which are safe to recover at home. In the face of overcrowded hospital capacity and an entirely new disease with little data-based evidence for diagnosis and treatment, the old rules for determining which patients should be admitted have proven ineffective. But machine learning can help make the right decision early, save lives and lower healthcare costs. So, there is therefore an urgent and imperative need to collect data describing clinical presentations, risks, epidemiology and outcomes. On the other side, artificial intelligence(AI) and machine learning(ML) are considered a strong firewall against outbreaks of diseases and epidemics due to its ability to quickly detect, examine and diagnose these diseases and epidemics.AI is being used as a tool to support the fight against the epidemic that swept the entire world since the beginning of 2020.. This paper presents the potential for using data engineering, ML and AI to confront the Coronavirus, predict the evolution of disease outbreaks, and conduct research in order to develop a vaccine or effective treatment that protects humanity from these deadly diseases.


2019 ◽  
Vol 47 (10) ◽  
pp. 1-9
Author(s):  
Eun-Young Park ◽  
Joungmin Kim

We aimed to verify the factor model and measurement invariance of the abbreviated Center for Epidemiologic Studies Depression Scale by conducting a confirmatory factor analysis using data from 761 parents of individuals with intellectual disabilities who completed the scale as part of the 2011 Survey on the Actual Conditions of Individuals with Developmental Disabilities, South Korea, and 7,301 participants from the general population who completed the scale as part of the 2011 Welfare Panel Study and Survey by the Ministry of Health and Welfare, South Korea. We used fit indices to assess data reliability and Amos 22.0 for data analysis. According to the results, the 4-factor model had an appropriate fit to the data and the regression coefficients were significant. However, the chi-square difference test result was nonsignificant; therefore, the metric invariance model was the most appropriate measurement invariance model for the data. Implications of the findings are discussed.


2021 ◽  
Vol 13 (13) ◽  
pp. 2433
Author(s):  
Shu Yang ◽  
Fengchao Peng ◽  
Sibylle von Löwis ◽  
Guðrún Nína Petersen ◽  
David Christian Finger

Doppler lidars are used worldwide for wind monitoring and recently also for the detection of aerosols. Automatic algorithms that classify the lidar signals retrieved from lidar measurements are very useful for the users. In this study, we explore the value of machine learning to classify backscattered signals from Doppler lidars using data from Iceland. We combined supervised and unsupervised machine learning algorithms with conventional lidar data processing methods and trained two models to filter noise signals and classify Doppler lidar observations into different classes, including clouds, aerosols and rain. The results reveal a high accuracy for noise identification and aerosols and clouds classification. However, precipitation detection is underestimated. The method was tested on data sets from two instruments during different weather conditions, including three dust storms during the summer of 2019. Our results reveal that this method can provide an efficient, accurate and real-time classification of lidar measurements. Accordingly, we conclude that machine learning can open new opportunities for lidar data end-users, such as aviation safety operators, to monitor dust in the vicinity of airports.


Agronomy ◽  
2021 ◽  
Vol 11 (2) ◽  
pp. 263
Author(s):  
Samreen Naeem ◽  
Aqib Ali ◽  
Christophe Chesneau ◽  
Muhammad H. Tahir ◽  
Farrukh Jamal ◽  
...  

This study proposes the machine learning based classification of medical plant leaves. The total six varieties of medicinal plant leaves-based dataset are collected from the Department of Agriculture, The Islamia University of Bahawalpur, Pakistan. These plants are commonly named in English as (herbal) Tulsi, Peppermint, Bael, Lemon balm, Catnip, and Stevia and scientifically named in Latin as Ocimum sanctum, Mentha balsamea, Aegle marmelos, Melissa officinalis, Nepeta cataria, and Stevia rebaudiana, respectively. The multispectral and digital image dataset are collected via a computer vision laboratory setup. For the preprocessing step, we crop the region of the leaf and transform it into a gray level format. Secondly, we perform a seed intensity-based edge/line detection utilizing Sobel filter and draw five regions of observations. A total of 65 fused features dataset is extracted, being a combination of texture, run-length matrix, and multi-spectral features. For the feature optimization process, we employ a chi-square feature selection approach and select 14 optimized features. Finally, five machine learning classifiers named as a multi-layer perceptron, logit-boost, bagging, random forest, and simple logistic are deployed on an optimized medicinal plant leaves dataset, and it is observed that the multi-layer perceptron classifier shows a relatively promising accuracy of 99.01% as compared to the competition. The distinct classification accuracy by the multi-layer perceptron classifier on six medicinal plant leaves are 99.10% for Tulsi, 99.80% for Peppermint, 98.40% for Bael, 99.90% for Lemon balm, 98.40% for Catnip, and 99.20% for Stevia.


2021 ◽  
Vol 503 (3) ◽  
pp. 4581-4600
Author(s):  
Orlando Luongo ◽  
Marco Muccino

ABSTRACT We alleviate the circularity problem, whereby gamma-ray bursts are not perfect distance indicators, by means of a new model-independent technique based on Bézier polynomials. We use the well consolidate Amati and Combo correlations. We consider improved calibrated catalogues of mock data from differential Hubble rate points. To get our mock data, we use those machine learning scenarios that well adapt to gamma-ray bursts, discussing in detail how we handle small amounts of data from our machine learning techniques. We explore only three machine learning treatments, i.e. linear regression, neural network, and random forest, emphasizing quantitative statistical motivations behind these choices. Our calibration strategy consists in taking Hubble’s data, creating the mock compilation using machine learning and calibrating the aforementioned correlations through Bézier polynomials with a standard chi-square analysis first and then by means of a hierarchical Bayesian regression procedure. The corresponding catalogues, built up from the two correlations, have been used to constrain dark energy scenarios. We thus employ Markov chain Monte Carlo numerical analyses based on the most recent Pantheon supernova data, baryonic acoustic oscillations, and our gamma-ray burst data. We test the standard ΛCDM model and the Chevallier–Polarski–Linder parametrization. We discuss the recent H0 tension in view of our results. Moreover, we highlight a further severe tension over Ωm and we conclude that a slight evolving dark energy model is possible.


2020 ◽  
Vol 15 (S359) ◽  
pp. 40-41
Author(s):  
L. M. Izuti Nakazono ◽  
C. Mendes de Oliveira ◽  
N. S. T. Hirata ◽  
S. Jeram ◽  
A. Gonzalez ◽  
...  

AbstractWe present a machine learning methodology to separate quasars from galaxies and stars using data from S-PLUS in the Stripe-82 region. In terms of quasar classification, we achieved 95.49% for precision and 95.26% for recall using a Random Forest algorithm. For photometric redshift estimation, we obtained a precision of 6% using k-Nearest Neighbour.


2020 ◽  
Vol 7 (Supplement_1) ◽  
pp. S17-S17
Author(s):  
Taylor Landay ◽  
Julie A Clennon ◽  
José A Ferreira ◽  
Lucia A Fraga ◽  
Maria Aparecida F Grossi ◽  
...  

Abstract Background Leprosy in children under 15 years of age, and in particular, the presence of leprosy grade 2 disability (G2D) in children, signifies ongoing transmission and the need for improved surveillance. Our objective was to describe the epidemiology of pediatric leprosy in Minas Gerais, Brazil and to explore associations with access to medical facilities. Methods A cross-sectional study was conducted using data from the Brazilian Notifiable Diseases Surveillance System (SINAN) from 2002–2017. Incident cases were included if they resided in a municipality with both adult and pediatric cases. Municipalities were divided by the number of medical facilities per municipality: < 5, 5–17, and 18 or higher. Analyses compared pediatric cases across two time periods (2002–2009 and 2010–2017) and number of medical facilities / municipality using chi-square, t-tests, and logistic regression. Results A total of 27,725 cases were reported with 1,611 under 15 years of age. Overall incidence declined from 34.8 per 100,000 to 13.6 per 100,000 during the study period with pediatric incidence declining from 2.6 per 100,000 to 0.8 per 100,000. Time period 2 (TP2) showed an increase in the proportion of pediatric G2D (2.58% vs 1.91%, p < 0.0001) when compared to time period 1 (TP1). Mean age of diagnosis in children was younger in TP2 then in TP1 (10.06 vs 10.43, p=0.02). In 2017, the pediatric incidence in municipalities with the fewest medical facilities was 0.95 per 100,000 compared to 0.23 per 100,000 in municipalities with > 5 facilities (p=0.009). There was significantly higher odds of disability at diagnosis (grades 1 and 2) in pediatric cases residing in municipalities with < 5 medical facilities (aOR 1.88; 95% CI 1.37–2.59), adjusted for age and sex. See map (Fig 1). Figure 1. Cases of Pediatric Disability By Number of Municipality Medical Facilities from 2002–2017 (White areas without reported pediatric leprosy) Conclusion The increasing proportion of G2D in children in the second half of the study period despite declining incidence suggest occult infections among children and adults alike in Minas Gerais. Furthermore, the average age of diagnosis in children should increase, not decrease, if M. leprae transmission was truly declining. Lastly, the association between fewer municipality health facilities and increased disability suggest barriers to timely diagnosis and a critical area of focus for research into access to healthcare and leprosy risk. Disclosures All Authors: No reported disclosures


2021 ◽  
Vol 21 (4) ◽  
pp. 1-10
Author(s):  
V. Gomathy ◽  
K. Janarthanan ◽  
Fadi Al-Turjman ◽  
R. Sitharthan ◽  
M. Rajesh ◽  
...  

Coronavirus Disease 19 (COVID-19) is a highly infectious viral disease affecting millions of people worldwide in 2020. Several studies have shown that COVID-19 results in a severe acute respiratory syndrome and may lead to death. In past research, a greater number of respiratory diseases has been caused by exposure to air pollution for long periods of time. This article investigates the spread of COVID-19 as a result of air pollution by applying linear regression in machine learning method based edge computing. The analysis in this investigation have been based on the death rates caused by COVID-19 as well as the region of death rates based on hazardous air pollution using data retrieved from the Copernicus Sentinel-5P satellite. The results obtained in the investigation prove that the mortality rate due to the spread of COVID-19 is 77% higher in areas with polluted air. This investigation also proves that COVID-19 severely affected 68% of the individuals who had been exposed to polluted air.


Sign in / Sign up

Export Citation Format

Share Document