scholarly journals Machine learning predictions of COVID-19 second wave end-times in Indian states

Author(s):  
Anvesh Reddy Kondapalli ◽  
Hanesh Koganti ◽  
Sai Krishna Challagundla ◽  
Chaitanya Suhaas Reddy Guntaka ◽  
Soumyajyoti Biswas
2021 ◽  
Author(s):  
Sebastian Johannes Fritsch ◽  
Konstantin Sharafutdinov ◽  
Moein Einollahzadeh Samadi ◽  
Gernot Marx ◽  
Andreas Schuppert ◽  
...  

BACKGROUND During the course of the COVID-19 pandemic, a variety of machine learning models were developed to predict different aspects of the disease, such as long-term causes, organ dysfunction or ICU mortality. The number of training datasets used has increased significantly over time. However, these data now come from different waves of the pandemic, not always addressing the same therapeutic approaches over time as well as changing outcomes between two waves. The impact of these changes on model development has not yet been studied. OBJECTIVE The aim of the investigation was to examine the predictive performance of several models trained with data from one wave predicting the second wave´s data and the impact of a pooling of these data sets. Finally, a method for comparison of different datasets for heterogeneity is introduced. METHODS We used two datasets from wave one and two to develop several predictive models for mortality of the patients. Four classification algorithms were used: logistic regression (LR), support vector machine (SVM), random forest classifier (RF) and AdaBoost classifier (ADA). We also performed a mutual prediction on the data of that wave which was not used for training. Then, we compared the performance of models when a pooled dataset from two waves was used. The populations from the different waves were checked for heterogeneity using a convex hull analysis. RESULTS 63 patients from wave one (03-06/2020) and 54 from wave two (08/2020-01/2021) were evaluated. For both waves separately, we found models reaching sufficient accuracies up to 0.79 AUROC (95%-CI 0.76-0.81) for SVM on the first wave and up 0.88 AUROC (95%-CI 0.86-0.89) for RF on the second wave. After the pooling of the data, the AUROC decreased relevantly. In the mutual prediction, models trained on second wave´s data showed, when applied on first wave´s data, a good prediction for non-survivors but an insufficient classification for survivors. The opposite situation (training: first wave, test: second wave) revealed the inverse behaviour with models correctly classifying survivors and incorrectly predicting non-survivors. The convex hull analysis for the first and second wave populations showed a more inhomogeneous distribution of underlying data when compared to randomly selected sets of patients of the same size. CONCLUSIONS Our work demonstrates that a larger dataset is not a universal solution to all machine learning problems in clinical settings. Rather, it shows that inhomogeneous data used to develop models can lead to serious problems. With the convex hull analysis, we offer a solution for this problem. The outcome of such an analysis can raise concerns if the pooling of different datasets would cause inhomogeneous patterns preventing a better predictive performance.


eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Emirena Garrafa ◽  
Marika Vezzoli ◽  
Marco Ravanelli ◽  
Davide Farina ◽  
Andrea Borghesi ◽  
...  

An early-warning model to predict in-hospital mortality on admission of COVID-19 patients at an emergency department (ED) was developed and validate using a Machine-Learning model. In total, 2782 patients were enrolled between March 2020 and December 2020, including 2106 patients (first wave) and 676 patients (second wave) in the COVID-19 outbreak in Italy. The first-wave patients were divided into two groups with 1474 patients used to train the model, and 632 to validate it. The 676 patients in the second wave were used to test the model. Age, 17 blood analytes and Brescia chest X-ray score were the variables processed using a Random Forests classification algorithm to build and validate the model. ROC analysis was used to assess the model performances. A web-based death-risk calculator was implemented and integrated within the Laboratory Information System of the hospital. The final score was constructed by age (the most powerful predictor), blood analytes (the strongest predictors were lactate dehydrogenase, D-dimer, Neutrophil/Lymphocyte ratio, C-reactive protein, Lymphocyte %, Ferritin std and Monocyte %), and Brescia chest X-ray score. The areas under the receiver operating characteristic curve obtained for the three groups (training, validating and testing) were 0.98, 0.83 and 0.78, respectively. The model predicts in-hospital mortality on the basis of data that can be obtained in a short time, directly at the ED on admission. It functions as a web-based calculator, providing a risk score which is easy to interpret. It can be used in the triage process to support the decision on patient allocation.


Author(s):  
MSLB Subrahmanyam ◽  
Vajjha Hem Kumar

— Andhra Pradesh is one of the south Indian states in India and having 13 districts. This is one of the most Covid-19 effected state in India during first and second waves. In India district is the major administrative block for implementing government policies and schemes under control of district collector. So, estimating or forecasting trends in district level more important than state wise or entire country wise. In this paper we are proposing Susceptible, Exposed, Infected and Recovered – Regression and Grid Search (SEIR-RGS) model for analyzing Covid -19 district wise trends during second wave. The SEIR-RGS, initially collects daily wise covid data for each district from Department of Health, medical and family welfare, AP and estimates the model parameters like contact rate, incubation rate and recovery rate. To calculate recovery rate the proposed model uses regression technique between daily active cases vs cumulative recoveries. The present model uses two phases for estimating contact rate and incubation rate using grid search approach. After that the proposed method calculates the infectious period, incubation period and basic reproduction of infection in all 13 districts to analyze trends in the state during second wave and also to predict possibility of third wave in each district.


2021 ◽  
Vol 10 (39) ◽  
pp. 3480-3486
Author(s):  
Bhuvaneswari Srinivasan ◽  
Charles John Paul A. ◽  
Jayanthi Malaiyandi ◽  
Thenmozhi Mani ◽  
Kannan Kilavan Packiam ◽  
...  

BACKGROUND COVID-19 is a viral pandemic disease reported from 222 different countries in the world. Although government agencies of various countries are responding to the suggestions of medical experts, public understanding of the nature of the disease is necessary to control the disease. Moreover, Covid associated mucormycosis (CAM) is found to emanate as a secondary infection in countries such as India. Therefore, this study was done to evaluate the awareness of COVID-19 and Covid associated mucormycosis. METHODS A questionnaire designed using google form was used to assess the public’s awareness about the airborne nature of the virus, Covid associated mucormycosis, and the government’s efforts in combating the disease. RESULTS About 690 people responded to the questions and among them 78 % were females and 21 % males. The age of the respondents ranged from 17 to 70 yrs. Nearly 69.5 % of the respondents believed that the virus was airborne. Although 89 % of respondents correctly stated that India was experiencing the second wave of COVID-19, yet majority of them could not make the same statement about other countries like the UK and the USA. Naming the mucormycosis as the black fungus had reached 88 % of the respondents. Nearly 60 % of the general public were satisfied with the government's initiatives in providing medical facilities. CONCLUSIONS The study provides the public's understanding of Covid-19 after the second wave of Covid-19 and Covid associated mucormycosis in India. The research provides inputs to the Indian government and the governments of Indian states to further raise public awareness on controlling the disease. KEY WORDS COVID -19; Airborne virus; Covid Associated Mucormycosis; Black fungus, India.


In the first wave of artificial intelligence (AI), rule-based expert systems were developed, with modest success, to help generalists who lacked expertise in a specific domain. The second wave of AI, originally called artificial neural networks but now described as machine learning, began to have an impact with multilayer networks in the 1980s. Deep learning, which enables automated feature discovery, has enjoyed spectacular success in several medical disciplines, including cardiology, from automated image analysis to the identification of the electrocardiographic signature of atrial fibrillation during sinus rhythm. Machine learning is now embedded within the NHS Long-Term Plan in England, but its widespread adoption may be limited by the “black-box” nature of deep neural networks.


Author(s):  
Alok Tiwari

ABSTRACTCOVID 19 entered during the last week of April 2020 in India has caused 3,546 deaths with 1,13,321 number of reported cases. Indian government has taken many proactive steps, including strict lockdown of the entire nation for more than 50 days, identification of hotspots, app-based tracking of citizens to track infected. This paper investigated the evolution of COVID 19 in five states of India (Maharashtra, UP, Gujrat, Tamil Nadu, and Delhi) from 1st April 2020 to 20th May 2020. Variation of doubling rate and reproduction number (from SIQR) with time is used to analyse the performance of the majorly affected Indian states. It has been determined that Uttar Pradesh is one of the best performers among five states with the doubling rate crossing 18 days as of 20th May. Tamil Nadu has witnessed the second wave of infections during the second week of May. Maharashtra is continuously improving at a steady rate with its doubling rate reaching to 12.67 days. Also these two states are performing below the national average in terms of infection doubling rate. Gujrat and Delhi have reported the doubling rate of 16.42 days and 15.49 days respectively. Comparison of these states has also been performed based on time-dependent reproduction number. Recovery rate of India has reached to 40 % as the day paper is written.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Soumik Purkayastha ◽  
Ritoban Kundu ◽  
Ritwik Bhaduri ◽  
Daniel Barker ◽  
Michael Kleinsasser ◽  
...  

Abstract Objective There has been much discussion and debate around the underreporting of COVID-19 infections and deaths in India. In this short report we first estimate the underreporting factor for infections from publicly available data released by the Indian Council of Medical Research on reported number of cases and national seroprevalence surveys. We then use a compartmental epidemiologic model to estimate the undetected number of infections and deaths, yielding estimates of the corresponding underreporting factors. We compare the serosurvey based ad hoc estimate of the infection fatality rate (IFR) with the model-based estimate. Since the first and second waves in India are intrinsically different in nature, we carry out this exercise in two periods: the first wave (April 1, 2020–January 31, 2021) and part of the second wave (February 1, 2021–May 15, 2021). The latest national seroprevalence estimate is from January 2021, and thus only relevant to our wave 1 calculations. Results Both wave 1 and wave 2 estimates qualitatively show that there is a large degree of “covert infections” in India, with model-based estimated underreporting factor for infections as 11.11 (95% credible interval (CrI) 10.71–11.47) and for deaths as 3.56 (95% CrI 3.48–3.64) for wave 1. For wave 2, underreporting factor for infections escalate to 26.77 (95% CrI 24.26–28.81) and to 5.77 (95% CrI 5.34–6.15) for deaths. If we rely on only reported deaths, the IFR estimate is 0.13% for wave 1 and 0.03% for part of wave 2. Taking underreporting of deaths into account, the IFR estimate is 0.46% for wave 1 and 0.18% for wave 2 (till May 15). Combining waves 1 and 2, as of May 15, while India reported a total of nearly 25 million cases and 270 thousand deaths, the estimated number of infections and deaths stand at 491 million (36% of the population) and 1.21 million respectively, yielding an estimated (combined) infection fatality rate of 0.25%. There is considerable variation in these estimates across Indian states. Up to date seroprevalence studies and mortality data are needed to validate these model-based estimates.


Entropy ◽  
2021 ◽  
Vol 23 (3) ◽  
pp. 332
Author(s):  
Davor Horvatić ◽  
Tomislav Lipic

Well-evidenced advances of data-driven complex machine learning approaches emerging within the so-called second wave of artificial intelligence (AI) fostered the exploration of possible AI applications in various domains and aspects of human life, practices, and society [...]


Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1157
Author(s):  
Luka Grbčić ◽  
Lado Kranjčević ◽  
Siniša Družeta

This paper presents and explores a novel methodology for solving the problem of a water distribution network contamination event, which includes determining the exact source of contamination, the contamination start and end times and the injected contaminant concentration. The methodology is based on coupling a machine learning algorithm for predicting the most probable contamination sources in a water distribution network with an optimization algorithm for determining the values of contamination start time, end time and injected contaminant concentration for each predicted node separately. Two slightly different algorithmic frameworks were constructed which are based on the mentioned methodology. Both algorithmic frameworks utilize the Random Forest algorithm for classification of top source contamination node candidates, with one of the frameworks directly using the stochastic fireworks optimization algorithm to determine the contamination start time, end time and injected contaminant concentration for each predicted node separately. The second framework uses the Random Forest algorithm for an additional regression prediction of each top node’s start time, end time and contaminant concentration and is then coupled with the deterministic global search optimization algorithm MADS. Both a small sized (92 potential sources) network with perfect sensor measurements and a medium sized (865 potential sources) benchmark network with fuzzy sensor measurements were used to explore the proposed frameworks. Both algorithmic frameworks perform well and show robustness in determining the true source node, start and end times and contaminant concentration, with the second framework being extremely efficient on the fuzzy sensor measurement benchmark network.


Sign in / Sign up

Export Citation Format

Share Document