A workflow for characterizing nanoparticle monolayers for biosensors: Machine learning on real and artificial SEM images
A persistent challenge in materials science is the characterization of a large ensemble of heterogeneous nanostructures in a set of images. This often leads to practices such as manual particle counting, and sampling bias of a favorable region of the “best” image. Herein, we present the open-source software, imaging criteria and workflow necessary to fully characterize an ensemble of SEM nanoparticle images. Such characterization is critical to nanoparticle biosensors, whose performance and characteristics are determined by the distribution of the underlying nanoparticle film. We utilize novel artificial SEM images to objectively compare commonly-found image processing methods through each stage of the workflow: acquistion, preprocessing, segmentation, labeling and object classification. Using the semi- supervised machine learning application, Ilastik, we demonstrate the decomposition of a nanoparticle image into particle subtypes relevant to our application: singles, dimers, flat aggregates and piles. We outline a workflow for characterizing and classifying nanoscale features on low-magnification images with thousands of nanoparticles. This work is accompanied by a repository of supplementary materials, including videos, a bank of real and artificial SEM images, and ten IPython Notebook tutorials to reproduce and extend the presented results.