Identification of new particle formation events with deep learning

Abstract. New particle formation (NPF) in the atmosphere is globally an important source of climate-relevant aerosol particles. Occurrence of NPF events is typically analyzed manually by researchers from particle size distribution data day by day, which is time-consuming and the classification of event types may be inconsistent. To get more reliable and consistent results, the NPF event analysis should be automated. We have developed an automatic analysis method based on deep learning, a subarea of machine learning, for NPF event identification. To our knowledge, this is the first time that NPF events have been successfully classified automatically into different classes from particle size distribution images. The developed method is based on image analysis of particle size distributions using a pre-trained deep convolutional neural network (CNN), named AlexNet, which was transfer learned to recognize NPF event classes (six different types). In transfer learning, a partial set of particle size distribution images was used in the training stage of the CNN and the rest of the images for testing the success of the training. The method was utilized for a 15-year-long dataset measured at San Pietro Capofiume in Italy. We studied the performance of the training with different training and testing image number ratios as well as with different regions of interest in the images. The results show that clear event (i.e., Classes 1 and 2) and non-event days can be identified with an accuracy of ca. 80 %, when the CNN classification is compared with that of an expert, which is a good first result for automatic NPF event analysis. In the event classification, the choice between different event classes is not an easy task even for trained researchers, and thus overlapping or confusion between different classes occurs. Hence, we cross-validated the learning results of CNN with the expert-made classification. The results show that the overlapping occurs typically between the adjacent or similar type of classes, e.g., a manually classified Class 1 is categorized mainly into Classes 1 and 2 by CNN, indicating that the manual and CNN classifications are very consistent for most of the days. The classification would be more consistent, by both human and CNN, if only two different classes are used for event days instead of three classes. Thus, we recommend that in the future analysis, event days should be categorized into classes of Quantifiable (i.e. clear events, Classes 1 and 2) and Non-Quantifiable (i.e. weak events, Class 3). This would better describe the difference of those classes: both formation and growth rates can be determined for Quantifiable days but not both for Non-Quantifiable days. Furthermore, we investigated more deeply the days that are classified as clear events by experts and recognized as non-events by the CNN and vice versa. Clear misclassifications seem to occur more commonly in manual analysis than in the CNN categorization, which is mostly due to the inconsistency in the human-made classification or errors in the booking of the event class. In general, the automatic CNN classifier has a better reliability and repeatability in NPF event classification than human-made classification and, thus, the transfer learned pre-trained CNNs are powerful tools to analyze long-term datasets. The developed NPF event classifier can be easily utilized to analyze any long-term datasets more accurately and consistently, which helps us to understand in detail aerosol-climate interactions and the long-term effects of climate change on NPF in the atmosphere.


Identification of new particle formation events with deep learning

Atmospheric Chemistry and Physics ◽

10.5194/acp-18-9597-2018 ◽

2018 ◽

Vol 18 (13) ◽

pp. 9597-9615 ◽

Cited By ~ 4

Author(s):

Jorma Joutsensaari ◽

Matthew Ozon ◽

Tuomo Nieminen ◽

Santtu Mikkonen ◽

Timo Lähivaara ◽

...

Keyword(s):

Particle Size ◽

Deep Learning ◽

Particle Size Distribution ◽

Size Distribution ◽

Transfer Learning ◽

Particle Formation ◽

Event Analysis ◽

New Particle Formation ◽

Event Classification

Abstract. New particle formation (NPF) in the atmosphere is globally an important source of climate-relevant aerosol particles. Occurrence of NPF events is typically analyzed by researchers manually from particle size distribution data day by day, which is time-consuming and the classification of event types may be inconsistent. To get more reliable and consistent results, the NPF event analysis should be automated. We have developed an automatic analysis method based on deep learning, a subarea of machine learning, for NPF event identification. To our knowledge, this is the first time that a deep learning method, i.e., transfer learning of a convolutional neural network (CNN), has successfully been used to automatically classify NPF events into different classes directly from particle size distribution images, similarly to how the researchers carry out the manual classification. The developed method is based on image analysis of particle size distributions using a pretrained deep CNN, named AlexNet, which was transfer learned to recognize NPF event classes (six different types). In transfer learning, a partial set of particle size distribution images was used in the training stage of the CNN and the rest of the images for testing the success of the training. The method was utilized for a 15-year-long dataset measured at San Pietro Capofiume (SPC) in Italy. We studied the performance of the training with different ratios of training to testing image numbers as well as with different regions of interest in the images. The results show that clear event (i.e., classes 1 and 2) and nonevent days can be identified with an accuracy of ca. 80 %, when the CNN classification is compared with that of an expert, which is a good first result for automatic NPF event analysis. In the event classification, the choice between different event classes is not an easy task even for trained researchers, and thus overlapping or confusion between different classes occurs. Hence, we cross-validated the learning results of CNN with the expert-made classification. The results show that the overlapping occurs, typically between the adjacent or similar type of classes, e.g., a manually classified Class 1 is categorized mainly into classes 1 and 2 by CNN, indicating that the manual and CNN classifications are very consistent for most of the days. The classification would be more consistent, by both human and CNN, if only two different classes are used for event days instead of three classes. Thus, we recommend that in the future analysis, event days should be categorized into classes of “quantifiable” (i.e., clear events, classes 1 and 2) and “nonquantifiable” (i.e., weak events, Class 3). This would better describe the difference of those classes: both formation and growth rates can be determined for quantifiable days but not both for nonquantifiable days. Furthermore, we investigated more deeply the days that are classified as clear events by experts and recognized as nonevents by the CNN and vice versa. Clear misclassifications seem to occur more commonly in manual analysis than in the CNN categorization, which is mostly due to the inconsistency in the human-made classification or errors in the booking of the event class. In general, the automatic CNN classifier has a better reliability and repeatability in NPF event classification than human-made classification and, thus, the transfer-learned pretrained CNNs are powerful tools to analyze long-term datasets. The developed NPF event classifier can be easily utilized to analyze any long-term datasets more accurately and consistently, which helps us to understand in detail aerosol–climate interactions and the long-term effects of climate change on NPF in the atmosphere. We encourage researchers to use the model at other sites. However, we suggest that the CNN should be transfer learned again for new site data with a minimum of ca. 150 figures per class to obtain good enough classification results, especially if the size distribution evolution differs from training data. In the future, we will utilize the method for data from other sites, develop it to analyze more parameters and evaluate how successfully CNN could be trained with synthetic NPF event data.
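The recommendation to merge classes 1 and 2 into a single quantifiable class can be illustrated with a small Python sketch. It is not the authors' code: the function names, the label strings ("1", "2", "3", "NE") and the eight-day toy label lists are all assumptions made for illustration, and the numbers it produces say nothing about the paper's results. It only shows how agreement between an expert and a classifier is computed before and after merging adjacent classes.

```python
from collections import Counter


def merge_event_classes(label):
    """Map classes 1 and 2 to 'quantifiable' and Class 3 to 'nonquantifiable'.

    Any other label (e.g. the hypothetical 'NE' for nonevent) is returned unchanged.
    """
    if label in ("1", "2"):
        return "quantifiable"
    if label == "3":
        return "nonquantifiable"
    return label


def agreement(expert, cnn):
    """Fraction of days on which the two label sequences agree."""
    if len(expert) != len(cnn) or not expert:
        raise ValueError("label sequences must be non-empty and of equal length")
    return sum(e == c for e, c in zip(expert, cnn)) / len(expert)


def confusion_counts(expert, cnn):
    """Count (expert label, CNN label) pairs, i.e. a sparse confusion matrix."""
    return Counter(zip(expert, cnn))


# Toy labels invented for this example, NOT data from the paper.
expert_labels = ["1", "2", "2", "3", "NE", "1", "NE", "3"]
cnn_labels = ["2", "2", "1", "3", "NE", "1", "NE", "2"]

raw_agreement = agreement(expert_labels, cnn_labels)
merged_agreement = agreement(
    [merge_event_classes(x) for x in expert_labels],
    [merge_event_classes(x) for x in cnn_labels],
)
print(f"agreement with separate classes: {raw_agreement:.3f}")
print(f"agreement with merged classes:   {merged_agreement:.3f}")
```

In this toy case the confusions between classes 1 and 2 disappear after merging, so the agreement rises; only the confusion between Class 3 and the merged class remains.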


Evidence of an elevated source of nucleation based on model simulations and data from the NIFTy experiment

Atmospheric Chemistry and Physics ◽

10.5194/acp-12-8021-2012 ◽

2012 ◽

Vol 12 (17) ◽

pp. 8021-8036 ◽

Cited By ~ 11

Author(s):

P. Crippa ◽

T. Petäjä ◽

H. Korhonen ◽

G. S. El Afandi ◽

S. C. Pryor

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Ultrafine Particle ◽

Ground Level ◽

Particle Formation ◽

Atmospheric Conditions ◽

New Particle Formation ◽

Near Surface ◽

Nocturnal Inversion

Abstract. New particle formation has been observed at a number of ground-based measurement sites. Prior research has provided evidence that this new particle formation, while observed in the near-surface layer, is actually occurring in atmospheric layers above the surface and appears to be focused in or close to the residual layer formed by the nocturnal inversion. Here, we present both observations and modeling for southern Indiana which support this postulate. Based on simulations with a detailed aerosol dynamics model and the Weather Research and Forecasting model, along with data from ground-based remote sensing instruments and detailed gas and particle phase measurements, we show evidence that (i) the maximum rate change of ultrafine particle concentrations as observed close to the surface is always preceded by breakdown of the nocturnal inversion and enhancement of vertical mixing and (ii) simulated particle size distributions exhibit greatest accord with surface observations during and subsequent to nucleation only when initialized with a particle size distribution representative of clear atmospheric conditions, rather than the in situ (ground-level) particle size distribution.


Airborne observations of the effect of a cold front on the aerosol particle size distribution and new particle formation

Quarterly Journal of the Royal Meteorological Society ◽

10.1002/qj.515 ◽

2010 ◽

Vol 136 (649) ◽

pp. 944-961 ◽

Cited By ~ 4

Author(s):

Justin R. Peter ◽

Steven T. Siems ◽

Jørgen B. Jensen ◽

John L. Gras ◽

Yutaka Ishizaka ◽

...

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Aerosol Particle ◽

Cold Front ◽

Particle Formation ◽

New Particle Formation ◽

Aerosol Particle Size


Contributions of traffic emissions and new particle formation to the ultrafine particle size distribution in the megacity of Beijing

Atmospheric Environment ◽

10.1016/j.atmosenv.2021.118652 ◽

2021 ◽

pp. 118652

Author(s):

Hao Wu ◽

Zhanqing Li ◽

Mengjiao Jiang ◽

Chunsheng Liang ◽

Dongmei Zhang ◽

...

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Ultrafine Particle ◽

Particle Formation ◽

Traffic Emissions ◽

New Particle Formation


Evidence of an elevated source of nucleation based on model simulations and data from the NIFTy experiment

Atmospheric Chemistry and Physics Discussions ◽

10.5194/acpd-12-11979-2012 ◽

2012 ◽

Vol 12 (5) ◽

pp. 11979-12021

Author(s):

P. Crippa ◽

T. Petäjä ◽

H. Korhonen ◽

G. S. El Afandi ◽

S. C. Pryor

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Ultrafine Particle ◽

Ground Level ◽

Particle Formation ◽

Atmospheric Conditions ◽

New Particle Formation ◽

Near Surface ◽

Nocturnal Inversion

Abstract. New particle formation has been observed at a number of ground-based measurement sites. Prior research predominantly from Europe has provided evidence that this new particle formation, while observed in the near-surface layer, is actually occurring in atmospheric layers above the surface and appears to be focused in or close to the residual layer formed by the nocturnal inversion. Here, we present both observations and modeling for southern Indiana, which support this postulate. Based on simulations with a detailed aerosol dynamics model and the Weather Research and Forecasting model, along with data from ground-based remote sensing instruments and detailed gas and particle phase measurements, we show evidence that (i) the maximum rate change of ultrafine particle concentrations as observed close to the surface is always preceded by breakdown of the nocturnal inversion and enhancement of vertical mixing and (ii) simulated particle size distributions exhibit greatest accord with surface observations during and subsequent to nucleation only when initialized with a particle size distribution representative of clear atmospheric conditions, rather than the in situ (ground-level) particle size distribution.


Supplementary material to "Particle size distribution and new particle formation under influence of biomass burning at a high altitude background site of Mt. Yulong (3410 m) in China"

10.5194/acp-2018-300-supplement ◽

2018 ◽

Author(s):

Dongjie Shang ◽

Min Hu ◽

Jing Zheng ◽

Yanhong Qin ◽

Zhuofei Du ◽

...

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

High Altitude ◽

Size Distribution ◽

Biomass Burning ◽

Particle Formation ◽

New Particle Formation ◽

Background Site ◽

Supplementary Material


Characteristics of Aerosol Size Distributions and New Particle Formation Events at Delhi: An Urban Location in the Indo-Gangetic Plains

Frontiers in Earth Science ◽

10.3389/feart.2021.750111 ◽

2021 ◽

Vol 9 ◽

Author(s):

Sandhya Jose ◽

Amit Kumar Mishra ◽

Neelesh K. Lodhi ◽

Sudhir Kumar Sharma ◽

Sachchidanand Singh

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Aerosol Particle ◽

Particle Number ◽

Particle Formation ◽

Number Concentration ◽

New Particle Formation ◽

Nucleation Mode ◽

Aerosol Particle Size

Accurate information about aerosol particle size distribution and its variation under different meteorological conditions is essential for reducing uncertainties related to aerosol-cloud-climate interaction processes. New particle formation (NPF) and coagulation significantly affect the aerosol size distribution. Here we study the monthly and seasonal variability of aerosol particle size distribution at Delhi from December 2011 to January 2013. Analysis of aerosol particle size distribution using WRAS-GRIMM reveals that aerosol particle number concentration is highest during the post-monsoon season owing to the effect of transported crop residue and biomass burning aerosols. Diurnal variations in number concentration show a bimodal pattern with two Aitken mode peaks in all the seasons. Monthly volume size distribution also shows a bimodal distribution with distinct coarse and fine modes. NPF events are observed infrequently in Delhi. Out of 222 days of WRAS data, only 17 NPF events have been observed, with higher NPF frequency during the summer season. Growth rates of the nucleation mode during NPF events vary in the range 1.88–21.66 nm/h with a mean value of ∼8.45 ± 5.73 nm/h. It is found that during NPF events the Aitken and nucleation mode particles contribute more to the number concentration. Simultaneous measurements of UV flux and particulate matter (PM10 and PM2.5) were also made along with the particle number size distribution measurements to understand the possible mechanisms for NPF events over the study location.
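A growth rate in nm/h, as quoted above, is a change in particle diameter per unit time. The abstract does not say how the growth rates were derived, so the following Python sketch is only one common way to estimate such a value: a least-squares straight line through the nucleation mode diameter as a function of time. The function name and the four sample points are hypothetical and are not measurements from the study.

```python
def growth_rate_nm_per_h(times_h, diameters_nm):
    """Least-squares slope of mode diameter (nm) against time (h)."""
    n = len(times_h)
    if n != len(diameters_nm) or n < 2:
        raise ValueError("need at least two (time, diameter) pairs of equal length")
    mean_t = sum(times_h) / n
    mean_d = sum(diameters_nm) / n
    # Sum of squared time deviations; zero means all times are identical.
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    if sxx == 0:
        raise ValueError("times must not all be identical")
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in zip(times_h, diameters_nm))
    return sxy / sxx


# Hypothetical mode diameters during one event, invented for illustration.
times_h = [0.0, 1.0, 2.0, 3.0]
diameters_nm = [5.0, 13.0, 21.0, 29.0]

gr = growth_rate_nm_per_h(times_h, diameters_nm)
print(f"growth rate: {gr:.2f} nm/h")

# Event frequency from the counts stated in the abstract: 17 events in 222 days.
npf_fraction = 17 / 222
print(f"NPF event frequency: {npf_fraction:.1%}")
```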


Multicomponent aerosol dynamics model UHMA: model development and validation

Atmospheric Chemistry and Physics Discussions ◽

10.5194/acpd-4-471-2004 ◽

2004 ◽

Vol 4 (1) ◽

pp. 471-506 ◽

Cited By ~ 14

Author(s):

H. Korhonen ◽

K. E. J. Lehtinen ◽

M. Kulmala

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Volume Fraction ◽

Particle Formation ◽

Time Interval ◽

New Particle Formation ◽

Dynamics Model ◽

Aerosol Dynamics ◽

Tropospheric Aerosol

Abstract. A size-segregated aerosol dynamics model UHMA (University of Helsinki Multicomponent Aerosol model) was developed for studies of multicomponent tropospheric aerosol particles. The model includes major aerosol microphysical processes in the atmosphere with a focus on new particle formation and growth; thus it incorporates particle coagulation and multicomponent condensation, applying a revised treatment of condensation flux onto free molecular regime particles and the activation of nanosized clusters by organic vapours (Nano-Köhler theory), as well as recent parameterizations for binary H2SO4–H2O and ternary H2SO4–NH3-H2O homogeneous nucleation and dry deposition. The representation of particle size distribution can be chosen from three sectional methods: the hybrid method, the moving center method, and the retracking method in which moving sections are retracked to a fixed grid after a certain time interval. All these methods can treat particle emissions and transport consistently, and are therefore suitable for use in large scale atmospheric models. In a test simulation against an accurate high resolution solution, all the methods showed reasonable treatment of new particle formation with 20 size sections although the hybrid and the retracking methods suffered from artificial widening of the distribution. The moving center approach, on the other hand, showed extra dents in the particle size distribution and failed to predict the onset of detectable particle formation. In a separate test simulation of an observed nucleation event, the model captured the key qualitative behaviour of the system well. Furthermore, its prediction of the organic volume fraction in newly formed particles, suggesting values as high as 0.5 for 3–4 nm particles and approximately 0.8 for 10 nm particles, agrees with recent indirect composition measurements.


Multicomponent aerosol dynamics model UHMA: model development and validation

Atmospheric Chemistry and Physics ◽

10.5194/acp-4-757-2004 ◽

2004 ◽

Vol 4 (3) ◽

pp. 757-771 ◽

Cited By ~ 112

Author(s):

H. Korhonen ◽

K. E. J. Lehtinen ◽

M. Kulmala

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Volume Fraction ◽

Particle Formation ◽

Time Interval ◽

New Particle Formation ◽

Dynamics Model ◽

Aerosol Dynamics ◽

Tropospheric Aerosol

Abstract. A size-segregated aerosol dynamics model UHMA (University of Helsinki Multicomponent Aerosol model) was developed for studies of multicomponent tropospheric aerosol particles. The model includes major aerosol microphysical processes in the atmosphere with a focus on new particle formation and growth; thus it incorporates particle coagulation and multicomponent condensation, applying a revised treatment of condensation flux onto free molecular regime particles and the activation of nanosized clusters by organic vapours (Nano-Köhler theory), as well as recent parameterizations for binary H2SO4-H2O and ternary H2SO4-NH3-H2O homogeneous nucleation and dry deposition. The representation of particle size distribution can be chosen from three sectional methods: the hybrid method, the moving center method, and the retracking method in which moving sections are retracked to a fixed grid after a certain time interval. All these methods can treat particle emissions and atmospheric transport consistently, and are therefore suitable for use in large scale atmospheric models. In a test simulation against an accurate high resolution solution, all the methods showed reasonable treatment of new particle formation with 20 size sections although the hybrid and the retracking methods suffered from artificial widening of the distribution. The moving center approach, on the other hand, showed extra dents in the particle size distribution and failed to predict the onset of detectable particle formation. In a separate test simulation of an observed nucleation event, the model captured the key qualitative behaviour of the system well. Furthermore, its prediction of the organic volume fraction in newly formed particles, suggesting values as high as 0.5 for 3–4 nm particles and approximately 0.8 for 10 nm particles, agrees with recent indirect composition measurements.


Investigation of new particle formation at the summit of Mt. Tai, China

Atmospheric Chemistry and Physics ◽

10.5194/acp-18-2243-2018 ◽

2018 ◽

Vol 18 (3) ◽

pp. 2243-2258 ◽

Cited By ~ 9

Author(s):

Ganglin Lv ◽

Xiao Sui ◽

Jianmin Chen ◽

Rohan Jayaratne ◽

Abdelwahid Mellouki

Keyword(s):

Particle Size ◽

Particle Size Distribution ◽

Size Distribution ◽

Particle Formation ◽

Phase III ◽

Type I ◽

New Particle Formation ◽

Air Mass ◽

Depth Analysis ◽

SO2 Concentration

Abstract. To date, few comprehensive field observations of new particle formation (NPF) have been carried out at mountaintop sites in China. In this study, simultaneous measurements of particle size distribution, trace gases, meteorological parameters, and mass concentration and chemical composition of PM2.5 were performed at the summit of Mt. Tai (1534 m a.s.l.) from 25 July to 24 August 2014 (Phase I), 21 September to 9 December 2014 (Phase II), and 16 June to 7 August 2015 (Phase III) to investigate characteristics and favorable conditions of NPF in a relatively clean mountaintop environment. The NPF events were identified based on particle size distribution measured by the neutral cluster and air ion spectrometer (NAIS), and 66 such events were observed during a period of 164 days – corresponding to an occurrence frequency of 40 %. The formation rates of 3 nm particles (J3) and growth rates were in the ranges of 0.82–25.04 cm−3 s−1 and 0.58–7.76 nm h−1, respectively. On average, the condensation sink (CS), O3 concentration, air temperature, and relative humidity were lower, whereas the SO2 concentration was higher on NPF days than that on non-NPF days. The CS on Mt. Tai was at a low level and lower CS was critical for NPF. NPF events were common when wind came from the east-southeast and west-southwest, which was probably associated with relatively lower CS in the east-southeast and higher SO2 concentration in the west-southwest. O3 was not a governing factor for NPF in this study, and a high level of NOx concentration might be responsible for the decreased O3 concentration on NPF days. Three categories of backward trajectories were classified, among which the continental air mass was the majority. The continental air mass passing through more polluted areas (denoted as Type I) favored NPF because of enhanced SO2 concentration and potential ammonia with it. An in-depth analysis of SO2 indicated that sulfuric acid was a dominant precursor on Mt. Tai; meanwhile, biogenic organics released from ambient forests in warm seasons and anthropogenic volatile organic compounds emitted from domestic heating in cold seasons also promoted NPF.
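The occurrence frequency quoted above follows directly from the two counts in the abstract (66 event days out of 164 observation days). A minimal Python check, with a function name chosen here purely for illustration:

```python
def occurrence_frequency(event_days, observed_days):
    """Fraction of observation days on which an NPF event was identified."""
    if observed_days <= 0:
        raise ValueError("observed_days must be positive")
    if not 0 <= event_days <= observed_days:
        raise ValueError("event_days must lie between 0 and observed_days")
    return event_days / observed_days


# Counts taken from the abstract: 66 events during 164 days.
freq = occurrence_frequency(66, 164)
print(f"occurrence frequency: {freq:.1%}")
```

The unrounded value is slightly above 40 %, which is consistent with the rounded figure given in the abstract.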
