Use of cluster analysis to monitor novel coronavirus-19 infections in Maharashtra, India

Objectives: A novel coronavirus disease (COVID-19) has been continuously spreading in almost all the districts of the state Maharashtra in India. As a part of the healthcare management development, it is very important to monitor districts affected due to novel coronavirus (COVID-19). The main objective of this study was to identify and classify affected districts into real clusters on the basis of observations of similarities within a cluster and dissimilarities among different clusters so that government policies, decisions, medical facilities (ventilators, testing kits, masks, treatment etc.), etc. could be improved for reducing the number of infected and deceased persons and hence cured cased could be increased. Material and Methods: In the study, we focused on COVID-19 affected districts of the state Maharashtra of India. We applied agglomerative hierarchical cluster analysis, one of data mining techniques to fulfill the objective. Elbow method was used for obtaining an optimum number of clusters for further analysis. The study of variations among various clusters for each of the variables was performed using box plots. Results: Results obtained from the Elbow method suggested three optimum numbers of clusters for each of the variables. For confirmed and cured cases, cluster I corresponded to the districts BI, GO, ND, PA, SI, WS, JN, CH, OS, HI, NB, JG, RT, LA, KO, AM, ST, BU, DH, AK, YTL, SN, AH, SO, AU, RG, NG, NS and PL. Cluster II corresponded to the districts TH and PU and cluster III corresponded to the district MC. For the death cases, cluster I corresponded to the districts BI, GO, ND, PA, SI, WS, JN, CH, OS, HI, NB, JG, RT, LA, KO, AM, ST, BU, DH, AK, YTL, SN, AH, SO, AU, RG, NG, NS, PL and TH. Cluster II corresponded to the district PU and cluster III corresponded to the district MC. Conclusions: The study showed that the district MC under cluster III was affected severely with COVID-19 which had high number of confirmed cases. A good percentage of cured cases were found in some of the districts under cluster I where six districts (GO, SI, CH, OS, SN) had 100% success rate to cure patients. It was observed that the districts TH, PU and MC under clusters II and III had severe conditions which need optimization of medical facilities and monitoring techniques like screening, closedown, curfews, lockdown, evacuations, legal actions, etc.

Download Full-text

Diversity Assessment of Indian Sunnhemp (Crotalaria juncea L.) Accessions for Enhanced Biomass and Fibre Yield using Geographic Information System Approach

Legume Research - An International Journal ◽

10.18805/lr-4510 ◽

2021 ◽

Author(s):

R.T. Maruthi ◽

A. Anil Kumar ◽

S.B. Choudhary ◽

H.K. Sharma ◽

J. Mitra

Keyword(s):

Cluster Analysis ◽

Germplasm Collection ◽

Hierarchical Cluster ◽

High Biomass ◽

Lignocellulosic Substrate ◽

Phenotypic Data ◽

Diversity Pattern ◽

Diversity Assessment ◽

Agglomerative Hierarchical Cluster Analysis ◽

Fibre Yield

Background: Sunnhemp, a rapid growing, high biomass yielding bast fibre crop has a tremendous potentiality in biofuels sector as a lignocellulosic substrate. In order to capitalize the new found area there is a need to identify high biomass and fibre yielding sunnhemp genotypes. The present study provides details of morphological diversity and geographical distribution pattern of Indian sunnhemp accessions. Methods: A total of 42 germplasm accessions collected from ten different states were evaluated for fibre yield and attributing traits in April-June cropping season. Based on phenotypic data agglomerative hierarchical cluster analysis was performed. Geographical coordinates of germplasm collection site were utilized to derive the spatial genetic diversity pattern for green biomass yield and fibre yield.Result: Phenotypic evaluation revealed significant genetic variability among the genotypes for biomass and fibre yield leading to identification of several promising accessions. Cluster analysis and PCA grouped the 42 sunnhemp accessions into three clusters. Cluster II and III are highly divergent harboring contrasting phenotypes. DIVA-GIS approach identified eastern Rajasthan, western Jharkhand and border area between Bihar and Jharkhand as sites of highest sunnhemp diversity.

Download Full-text

Development of a groundwater quality index: GWQI, for the aquifers of the state of Bahia, Brazil using multivariable analyses

Scientific Reports ◽

10.1038/s41598-021-95912-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

José Barbosa Filho ◽

Iara Brandão de Oliveira

Keyword(s):

Water Quality ◽

Cluster Analysis ◽

Groundwater Quality ◽

Hierarchical Cluster Analysis ◽

Quality Index ◽

Hierarchical Cluster ◽

The State ◽

Groundwater Quality Index ◽

Multivariable Analyses

AbstractThis work elaborated a groundwater quality index—GWQI, for the aquifers of the state of Bahia, Brazil, using multivariable analyses. Data from 600 wells located in the four hydrogeological domains: sedimentary, crystalline, karstic, and metasedimentary, were subjected to exploratory statistical analysis, and 22 out of 26 parameters were subjected to multivariable analysis using Statistica (Version 7.0). From the PCA, 5 factors were sufficient to participate in the index, due to sufficient explanation of the cumulative variance. The matrix of factorial loads (for 1–5 factors) indicated 9 parameters related to water quality and 4 hydrological, with factor loads above ± 0.50, to be part of the hierarchical cluster analysis. The dendrogram allowed to choose the 5 parameters related to groundwater quality, to participate in the GWQI (hardness, total residue, sulphate, fluoride and iron). From the multivariable analyses, three parameters from a previous index—NGWQI, were not selected for the GWQI: chloride (belongs to the hardness hierarchical group); pH (insignificant factor load); and nitrate (significant factor load only for 6 factors), also, not a regionalized variable. From the set of communality values (5 factors), the degree of relevance of each parameter was extracted. Based on these values, were determined the relative weights (wi) for the parameters. Using similar WQI-NSF formulation, a product of quality grades raised to a power, which is the weight of importance of each variable, the GWQI values were calculated. Spatialization of 1369 GWQI values, with the respective colors, on the map of the state of Bahia, revealed good correlation between the groundwater quality and the index quality classification. According to the literature on water quality indexing, the GWQI developed here, using emerging technologies, is a mathematical tool developed as specific index, as it was derived using limits for drinking water. This new index was tailored to represent the quality of the groundwater of the four hydrogeological domains of the state of Bahia. Although it has a regionalized application, its development, using, factor analysis, principal component analysis, and hierarchical cluster analysis, participates of the new trend for WQI development, which uses rational, rather than subjective assessment. The GWQI is a successful index due to its ability to represent the groundwater quality of the state of Bahia, using a single mathematical formulation, the same five parameters, and unique weight for each parameter.

Download Full-text

P053 PATTERNS OF ALLERGEN CO-SENSITIZATION: AGGLOMERATIVE HIERARCHICAL CLUSTER ANALYSIS IDENTIFIES NOVEL ASSOCIATIONS

Annals of Allergy Asthma & Immunology ◽

10.1016/j.anai.2020.08.079 ◽

2020 ◽

Vol 125 (5) ◽

pp. S19-S20

Author(s):

B. Nriagu ◽

B. Patchett ◽

G. Mavraj ◽

E. Schulman

Keyword(s):

Cluster Analysis ◽

Hierarchical Cluster Analysis ◽

Hierarchical Cluster ◽

Agglomerative Hierarchical Cluster ◽

Agglomerative Hierarchical Cluster Analysis ◽

Novel Associations

Download Full-text

Health risk assessment and source identification of groundwater arsenic contamination using agglomerative hierarchical cluster analysis in selected sites from upper Eastern parts of Punjab province, Pakistan

Human and Ecological Risk Assessment An International Journal ◽

10.1080/10807039.2020.1794787 ◽

2020 ◽

pp. 1-20 ◽

Cited By ~ 1

Author(s):

Nisbah Mushtaq ◽

Noshin Masood ◽

Junaid Ali Khattak ◽

Ishtiaque Hussain ◽

Qasim Khan ◽

...

Keyword(s):

Risk Assessment ◽

Cluster Analysis ◽

Health Risk ◽

Health Risk Assessment ◽

Hierarchical Cluster Analysis ◽

Source Identification ◽

Hierarchical Cluster ◽

Agglomerative Hierarchical Cluster ◽

Agglomerative Hierarchical Cluster Analysis ◽

Groundwater Arsenic

Download Full-text

Appropriate statistical methods for analysis of safflower genetic diversity using agglomerative hierarchical cluster analysis through combination of phenotypic traits and molecular markers

Crop Science ◽

10.1002/csc2.20598 ◽

2021 ◽

Author(s):

Karim Houmanat ◽

Ahmed Douaik ◽

Jamal Charafi ◽

Lahcen Hssaini ◽

Mohamed El Fechtali ◽

...

Keyword(s):

Genetic Diversity ◽

Cluster Analysis ◽

Molecular Markers ◽

Statistical Methods ◽

Hierarchical Cluster Analysis ◽

Hierarchical Cluster ◽

Phenotypic Traits ◽

Agglomerative Hierarchical Cluster ◽

Agglomerative Hierarchical Cluster Analysis

Download Full-text

Geostatistical Analysis of Groundwater Data

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9758.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 2734-2741

Keyword(s):

Cluster Analysis ◽

Statistical Analysis ◽

Hierarchical Cluster ◽

Geostatistical Analysis ◽

Water Requirements ◽

Groundwater Levels ◽

Groundwater Level Fluctuations ◽

Agglomerative Hierarchical Cluster Analysis ◽

Observation Wells ◽

Surface And Groundwater

Water resources are stressed because of the country's increasing population and increased water requirements. Even though a good understanding of both surface and groundwater hydrological systems make it possible to manage these resources properly. To study the main characteristics of formation of clusters of groundwater levels, statistical analysis has been used. Geostatistics is a class of statistics used to analyze and predict the values associated with spatial or spatiotemporal phenomena. It incorporates the spatial (and in some cases temporal) coordinates of the data within the analyses. The Statistical analysis is applied to monthly groundwater levels fluctuation data over a period of 2004-2017 in Mysuru, Mandya, Chamarajanagara and Hassan districts of Southern Karnataka in India. The groundwater levels data is collected from 197 Observation Wells from the districts. The Statistical methods like K-Means Clustering and Agglomerative Hierarchical Cluster Analysis is used to perform the datasets. Grouping is made using AHC method, during this process results are obtained by graph called Dendrogram. The obtained results are compared with the LULC maps of all 4 districts. Different grouping (cluster) is made for groundwater level fluctuations for proper conclusion to arrive.

Download Full-text

A COMPARISON BETWEEN SINGLE LINKAGE AND COMPLETE LINKAGE IN AGGLOMERATIVE HIERARCHICAL CLUSTER ANALYSIS FOR IDENTIFYING TOURISTS SEGMENTS

IIUM Engineering Journal ◽

10.31436/iiumej.v12i6.199 ◽

2012 ◽

Vol 12 (6) ◽

Author(s):

Noor Rashidah Rashid

Keyword(s):

Cluster Analysis ◽

Hierarchical Cluster Analysis ◽

Hierarchical Cluster ◽

Single Linkage ◽

Complete Linkage ◽

Multivariate Method ◽

Agglomerative Hierarchical Cluster ◽

Linkage Methods ◽

Agglomerative Hierarchical Cluster Analysis ◽

Analyze Data

Cluster Analysis is a multivariate method in statistics. Agglomerative Hierarchical Cluster Analysis is one of approaches in Cluster Analysis. There are two linkage methods in Agglomerative Hierarchical Cluster Analysis which are Single Linkage and Complete Linkage. The purpose of this study is to compare between Single Linkage and Complete Linkage in Agglomerative Hierarchical Cluster Analysis. The comparison of performances between these linkage methods was shown by using Kruskal-Wallis test. The result of the comparison used for segmenting tourists of Kapas Island. The statistical software SPSS has been applied to analyze data of this research. The result from Kruskal-Wallis test shows Complete Linkage is more useful in identifying tourists segments. Keywords : Agglomerative Hierarchical Cluster Analysis, Single Linkage, Complete Linkage, Kruskal-Wallis test, tourists

Download Full-text

Implications of COVID-19 vaccination and public health countermeasures on SARS-CoV-2 variants of concern in Canada: evidence from a spatial hierarchical cluster analysis

10.1101/2021.06.28.21259629 ◽

2021 ◽

Author(s):

Daniel A Adeyinka ◽

Cheryl Camillo ◽

Wendie Marks ◽

Nazeem Muhajarine

Keyword(s):

Public Health ◽

Cluster Analysis ◽

Hierarchical Cluster Analysis ◽

Vaccine Coverage ◽

Hierarchical Cluster ◽

Vaccine Uptake ◽

Mobility Index ◽

Agglomerative Hierarchical Cluster Analysis ◽

Mitigating Factors ◽

Central Canada

Background: The influence of coronavirus disease-2019 (COVID-19) containment measures on variants of concern (VOC) has been understudied in Canada. Our objective was to identify provinces with disproportionate prevalence of VOC relative to COVID-19 mitigation efforts in provinces and territories in Canada. Methods: We analyzed publicly available provincial- and territorial-level data on the prevalence of VOCs in relation to mitigating factors (summarized in three measures: 1. strength of public health countermeasures: stringency index, 2. how much people moved about outside their homes: mobility index, and 3. vaccine intervention: proportion of Canadian population fully vaccinated). Using spatial agglomerative hierarchical cluster analysis (unsupervised machine learning), the provinces and territories were grouped into clusters by stringency index, mobility index and full vaccine coverage. Kruskal-Wallis test was used to determine the differences in the prevalence of VOC (Alpha, or B.1.1.7, Beta, or B.1.351, Gamma, or P.1, and Delta, or B.1.617.2 variants) between the clusters. Results: Three clusters of vaccine uptake and countermeasures were identified. Cluster 1 consisted of the three Canadian territories, and characterized by higher degree of vaccine deployment and lesser degree of countermeasures. Cluster 2 (located in Central Canada and Atlantic region) was typified by lesser implementation of vaccine deployment and moderate countermeasures. The third cluster was formed by provinces inthe Pacific region, Central Canada, and Prairie region, with moderate vaccine deployment but stronger countermeasures. The overall and variant-specific prevalence were significantly different across the clusters. Interpretation: This study found that implementation of COVID-19 public health measures varied across the provinces and territories. Considering the high prevalence of VOCs in Canada, completing the second dose of COVID-19 vaccine in a timely manner is crucial.

Download Full-text

Distribution and Taxonomic Significance of Secondary Metabolites Occurring in the Methanol Extracts of the Stonecrops (Sedum L., Crassulaceae) from the Central Balkan Peninsula

Natural Product Communications ◽

10.1177/1934578x1501000637 ◽

2015 ◽

Vol 10 (6) ◽

pp. 1934578X1501000

Author(s):

Gordana S. Stojanović ◽

Snežana Č. Jovanović ◽

Bojan K. Zlatković

Keyword(s):

Cluster Analysis ◽

Chemical Composition ◽

Secondary Metabolites ◽

Hplc Analysis ◽

Methanol Extract ◽

Hierarchical Cluster ◽

Balkan Peninsula ◽

Taxonomic Significance ◽

Methanol Extracts ◽

Agglomerative Hierarchical Cluster Analysis

The present study is engaged in the chemical composition of methanol extracts of Sedum taxa from the central part of the Balkan Peninsula, and representatives from other genera of Crassulaceae ( Crassula, Echeveria and Kalanchoe) considered as out-groups. The chemical composition of extracts was determined by HPLC analysis, according to retention time of standards and characteristic absorption spectra of components. Identified components were considered as original variables with possible chemotaxonomic significance. Relationships of examined plant samples were investigated by agglomerative hierarchical cluster analysis (AHC). The obtained results showed how the distribution of methanol extract components (mostly phenolics) affected grouping of the examined samples. The obtained clustering showed satisfactory grouping of the examined samples, among which some representatives of the Sedum series, Rupestria and Magellensia, are the most remote. The out-group samples were not clearly singled out with regard to Sedum samples as expected; this especially applies to samples of Crassula ovata and Echeveria lilacina, while Kalanchoe daigremontiana was more separated from most of the Sedum samples.

Download Full-text

Analysis of the horizontal structure of a measurement and control geodetic network based on entropy

Geodesy and Cartography ◽

10.2478/geocart-2013-0002 ◽

2013 ◽

Vol 62 (1) ◽

pp. 23-31 ◽

Cited By ~ 6

Author(s):

Maria Mrówczyńska

Keyword(s):

Geodetic Network ◽

The State ◽

Observation System ◽

Optimum Number ◽

Optimum Structure ◽

Horizontal Structure ◽

The Difference ◽

And Control ◽

Measurement And Control ◽

Horizontal Displacements

Abstract The paper attempts to determine an optimum structure of a directional measurement and control network intended for investigating horizontal displacements. For this purpose it uses the notion of entropy as a logarithmical measure of probability of the state of a particular observation system. An optimum number of observations results from the difference of the entropy of the vector of parameters ΔHX̂ (x)corresponding to one extra observation. An increment of entropy interpreted as an increment of the amount of information about the state of the system determines the adoption or rejection of another extra observation to be carried out.

Download Full-text