scholarly journals Supplementary material to "A robust clustering algorithm for analysis of composition‐dependent organic aerosol thermal desorption measurements"

Author(s):  
Ziyue Li ◽  
Emma L. D'Ambro ◽  
Siegfried Schobesberger ◽  
Cassandra J. Gaston ◽  
Felipe D. Lopez-Hilfiker ◽  
...  
2019 ◽  
Author(s):  
Ziyue Li ◽  
Emma L. D'Ambro ◽  
Siegfried Schobesberger ◽  
Cassandra J. Gaston ◽  
Felipe D. Lopez-Hilfiker ◽  
...  

Abstract. One of the challenges of understanding atmospheric organic aerosol (OA) stems from its complex composition. Mass spectrometry is commonly used to characterize the compositional variability of OA. Clustering of a mass spectral data set helps identify components that exhibit similar behavior or have similar properties, facilitating understanding of sources and processes that govern compositional variability. Here, we developed a novel clustering algorithm, Noise-Sorted Scanning Clustering (NSSC), and apply it to thermal desorption measurements from the Filter Inlet for Gases and AEROsols coupled to a chemical ionization mass spectrometer (FIGAERO CIMS). NSSC provides a robust, reproducible analysis of the FIGAERO temperature-dependent mass spectral data. The NSSC allows for determination of thermal profiles for compositionally distinct clusters, increasing the accessibility and enhancing the interpretation of FIGAERO data. Applications of NSSC to several laboratory biogenic secondary organic aerosol (BSOA) systems demonstrate the ability of NSSC to distinguish different types of thermal behaviors for the components comprising the particles along with the relative mass contributions and chemical properties (e.g. average molecular formula) of each cluster. For each of the systems examined, more than 80 % of the total mass is clustered into 9–13 clusters. Comparison of the average thermograms of the clusters between systems indicate some commonalty in terms of the thermal properties of different BSOA, although with some system-specific behavior. Application of NSSC to sets of experiments in which one experimental parameter, such as the concentration of NO, is varied demonstrates the potential for clustering to elucidate the chemical factors that drive changes in the thermal properties of OA. Further quantitative interpretation of the clustered thermograms followed by clustering will allow for more comprehensive understanding of the thermochemical properties of OA.


2020 ◽  
Vol 20 (4) ◽  
pp. 2489-2512 ◽  
Author(s):  
Ziyue Li ◽  
Emma L. D'Ambro ◽  
Siegfried Schobesberger ◽  
Cassandra J. Gaston ◽  
Felipe D. Lopez-Hilfiker ◽  
...  

Abstract. One of the challenges of understanding atmospheric organic aerosol (OA) particles stems from its complex composition. Mass spectrometry is commonly used to characterize the compositional variability of OA. Clustering of a mass spectral dataset helps identify components that exhibit similar behavior or have similar properties, facilitating understanding of sources and processes that govern compositional variability. Here, we developed an algorithm for clustering mass spectra, the noise-sorted scanning clustering (NSSC), appropriate for application to thermal desorption measurements of collected OA particles from the Filter Inlet for Gases and AEROsols coupled to a chemical ionization mass spectrometer (FIGAERO-CIMS). NSSC, which extends the common density-based special clustering of applications with noise (DBSCAN) algorithm, provides a robust, reproducible analysis of the FIGAERO temperature-dependent mass spectral data. The NSSC allows for the determination of thermal profiles for compositionally distinct clusters of mass spectra, increasing the accessibility and enhancing the interpretation of FIGAERO data. Applications of NSSC to several laboratory biogenic secondary organic aerosol (BSOA) systems demonstrate the ability of NSSC to distinguish different types of thermal behaviors for the components comprising the particles along with the relative mass contributions and chemical properties (e.g., average molecular formula) of each mass spectral cluster. For each of the systems examined, more than 80 % of the total mass is clustered into 9–13 mass spectral clusters. Comparison of the average thermograms of the mass spectral clusters between systems indicates some commonality in terms of the thermal properties of different BSOA, although with some system-specific behavior. Application of NSSC to sets of experiments in which one experimental parameter, such as the concentration of NO, is varied demonstrates the potential for mass spectral clustering to elucidate the chemical factors that drive changes in the thermal properties of OA particles. Further quantitative interpretation of the thermograms of the mass spectral clusters will allow for a more comprehensive understanding of the thermochemical properties of OA particles.


2021 ◽  
Author(s):  
Anna K. Tobler ◽  
Alicja Skiba ◽  
Francesco Canonaco ◽  
Griša Močnik ◽  
Pragati Rai ◽  
...  

Author(s):  
Muhamad Alias Md. Jedi ◽  
Robiah Adnan

TCLUST is a method in statistical clustering technique which is based on modification of trimmed k-means clustering algorithm. It is called “crisp” clustering approach because the observation is can be eliminated or assigned to a group. TCLUST strengthen the group assignment by putting constraint to the cluster scatter matrix. The emphasis in this paper is to restrict on the eigenvalues, λ of the scatter matrix. The idea of imposing constraints is to maximize the log-likelihood function of spurious-outlier model. A review of different robust clustering approach is presented as a comparison to TCLUST methods. This paper will discuss the nature of TCLUST algorithm and how to determine the number of cluster or group properly and measure the strength of group assignment. At the end of this paper, R-package on TCLUST implement the types of scatter restriction, making the algorithm to be more flexible for choosing the number of clusters and the trimming proportion.


Sign in / Sign up

Export Citation Format

Share Document