scholarly journals Clustering the Concentrations of PM10 and O3: Application of Spatio-temporal Model-based Clustering

Author(s):  
parisa saeipourdizaj ◽  
saeed musavi ◽  
Akbar Gholampour ◽  
parvin sarbakhsh

Abstract Air pollution data are large-scale dataset which can be analyzed in low scales by clustering to recognize the pattern of pollution and have simpler and more comprehensible interpretation. So, this study aims to cluster the days of year 2017 according to the hourly O3 and PM10 amounts collected from four stations of Tabriz by using spatio-temporal mixture model-based clustering (STMC). Besides, mixture model-based clustering with temporal dimension (TMC) and mixture model-based clustering without considering spatio-temporal dimensions (MC) were utilized to compare with STMC. To evaluate the efficiency of these three models and obtain the optimal number of clusters in each model, BIC and ICL criteria were used. According to BIC and ICL, STMC outperforms TMC and MC. Three clusters for O3 and four clusters for PM10 were selected as the optimal number of clusters to fit STMC models. Regarding PM10, the average concentration was the highest in cluster 4. Regarding O3, all summer days were in cluster 3 and the average concentration of this cluster was the highest. Cluster 2 had the lowest concentration with a high difference from clusters 1 and 3 and its average temperature was the lowest. Autumn days make up about 84% of this cluster. The clustering of polluted and clean days into separate groups and observing the effect of meteorological factors on the amount of concentration in each cluster clearly prove the efficiency of the model. Results of STMC showed that efficiency of clustering in air pollution data increases by considering both spatio-temporal dimensions.

2018 ◽  
Vol 14 (1) ◽  
pp. 11-23 ◽  
Author(s):  
Lin Zhang ◽  
Yanling He ◽  
Huaizhi Wang ◽  
Hui Liu ◽  
Yufei Huang ◽  
...  

Background: RNA methylome has been discovered as an important layer of gene regulation and can be profiled directly with count-based measurements from high-throughput sequencing data. Although the detailed regulatory circuit of the epitranscriptome remains uncharted, clustering effect in methylation status among different RNA methylation sites can be identified from transcriptome-wide RNA methylation profiles and may reflect the epitranscriptomic regulation. Count-based RNA methylation sequencing data has unique features, such as low reads coverage, which calls for novel clustering approaches. <P><P> Objective: Besides the low reads coverage, it is also necessary to keep the integer property to approach clustering analysis of count-based RNA methylation sequencing data. <P><P> Method: We proposed a nonparametric generative model together with its Gibbs sampling solution for clustering analysis. The proposed approach implements a beta-binomial mixture model to capture the clustering effect in methylation level with the original count-based measurements rather than an estimated continuous methylation level. Besides, it adopts a nonparametric Dirichlet process to automatically determine an optimal number of clusters so as to avoid the common model selection problem in clustering analysis. <P><P> Results: When tested on the simulated system, the method demonstrated improved clustering performance over hierarchical clustering, K-means, MClust, NMF and EMclust. It also revealed on real dataset two novel RNA N6-methyladenosine (m6A) co-methylation patterns that may be induced directly by METTL14 and WTAP, which are two known regulatory components of the RNA m6A methyltransferase complex. <P><P> Conclusion: Our proposed DPBBM method not only properly handles the count-based measurements of RNA methylation data from sites of very low reads coverage, but also learns an optimal number of clusters adaptively from the data analyzed. <P><P> Availability: The source code and documents of DPBBM R package are freely available through the Comprehensive R Archive Network (CRAN): https://cran.r-project.org/web/packages/DPBBM/.


2022 ◽  
Vol 32 (1) ◽  
pp. 361-375
Author(s):  
S. Markkandan ◽  
S. Sivasubramanian ◽  
Jaison Mulerikkal ◽  
Nazeer Shaik ◽  
Beulah Jackson ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document