Generalising from Conventional Pipelines: A Case Study in Deep Learning-Based for High-Throughput Screening

Abstract The study of complex diseases relies on large amounts of data to build models toward precision medicine. Such data acquisition is feasible in the context of high-throughput screening, in which the quality of the results relies on the accuracy of the image analysis. Although state-of-the-art solutions for image segmentation employ deep learning approaches, the high cost of manually generating ground truth labels for model training hampers the day-to-day application in experimental laboratories. Alternatively, traditional computer vision-based solutions do not need expensive labels for their implementation. Our work combines both approaches by training a deep learning network using weak training labels automatically generated with conventional computer vision methods. Our network surpasses the conventional segmentation quality by generalising beyond noisy labels, providing a 25 % increase of mean intersection over union, and simultaneously reducing the development and inference times. Our solution was embedded into an easy-to-use graphical user interface that allows researchers to assess the predictions and correct potential inaccuracies with minimal human input. To demonstrate the feasibility of training a deep learning solution on a large dataset of noisy labels automatically generated by a conventional pipeline, we compared our solution against the common approach of training a model from a small manually curated dataset by several experts. Our work suggests that humans perform better in context interpretation, such as error assessment, while computers outperform in pixel-by-pixel fine segmentation. Such pipelines are illustrated with a case study on image segmentation for autophagy events. This work aims for better translation of new technologies to real-world settings in microscopy-image analysis.
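
The abstract reports its segmentation gain as mean intersection over union (IoU). As a minimal sketch of how that metric is commonly computed, assuming integer class-label masks of equal shape (the function name `mean_iou` and the toy masks are illustrative assumptions, not taken from the paper):

```python
def mean_iou(pred, target, num_classes):
    """Average the per-class IoU over classes present in either mask."""
    ious = []
    for c in range(num_classes):
        inter = union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred, in_target = p == c, t == c
                inter += in_pred and in_target  # pixel labelled c in both masks
                union += in_pred or in_target   # pixel labelled c in at least one
        if union:  # skip classes absent from both masks
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else float("nan")


# Hypothetical 2x2 masks: 0 = background, 1 = object
pred = [[0, 1], [1, 1]]
target = [[0, 1], [0, 1]]
print(mean_iou(pred, target, num_classes=2))
```

Here the background IoU is 1/2 and the object IoU is 2/3, so the mean is 7/12. Whether the paper averages over classes, images, or both is not stated in the abstract.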

Maize-IAS: a maize image analysis software using deep learning for high-throughput plant phenotyping

Plant Methods, 10.1186/s13007-021-00747-0, 2021, Vol 17 (1)

Author(s): Shuo Zhou, Xiujuan Chai, Zixuan Yang, Hongwu Wang, Chenxue Yang, ...

Keyword(s): Computer Vision, Image Analysis, Deep Learning, High Throughput, Batch Processing, Plant Phenotyping, Plant Science, Analysis Software, Image Analysis Software, Maize Growth

Background: Maize (Zea mays L.) is one of the most important food sources in the world and has been one of the main targets of plant genetics and phenotypic research for centuries. Observation and analysis of various morphological phenotypic traits during maize growth are essential for genetic and breeding studies. The typically large number of samples produces an enormous amount of high-resolution image data. While high-throughput plant phenotyping platforms are increasingly used in maize breeding trials, there is a reasonable need for software tools that can automatically identify visual phenotypic features of maize plants and implement batch processing on image datasets. Results: On the boundary between computer vision and plant science, we utilize advanced deep learning methods based on convolutional neural networks to empower the workflow of maize phenotyping analysis. This paper presents Maize-IAS (Maize Image Analysis Software), an integrated application supporting one-click analysis of maize phenotype, embedding multiple functions: (I) Projection, (II) Color Analysis, (III) Internode Length, (IV) Height, (V) Stem Diameter and (VI) Leaves Counting. Taking the RGB image of maize as input, the software provides a user-friendly graphical interaction interface and rapid calculation of multiple important phenotypic characteristics, including leaf sheath point detection and leaf segmentation. For the Leaves Counting function, the mean and standard deviation of the difference between prediction and ground truth are 1.60 and 1.625. Conclusion: Maize-IAS is easy to use and demands neither professional knowledge of computer vision nor deep learning. All functions for batch processing are incorporated, enabling automated and labor-reduced tasks of recording, measurement and quantitative analysis of maize growth traits on a large dataset. We prove the efficiency and potential capability of our techniques and software for image-based plant research, which also demonstrates the feasibility and capability of AI technology implemented in agriculture and plant science.
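
The leaf-counting result above is summarized by the mean and standard deviation of the difference between predicted and true counts. A minimal sketch of that summary follows, assuming absolute differences and a population standard deviation (the abstract specifies neither, and `count_error_stats` and the sample counts are hypothetical):

```python
from statistics import mean, pstdev


def count_error_stats(predicted, actual):
    """Return (mean, population std) of absolute count differences."""
    diffs = [abs(p - a) for p, a in zip(predicted, actual)]
    return mean(diffs), pstdev(diffs)


# Hypothetical per-plant leaf counts, not data from the paper
predicted = [10, 12, 9, 11]
actual = [11, 12, 11, 10]
error_mean, error_std = count_error_stats(predicted, actual)
print(error_mean, error_std)
```

Using signed differences instead would expose systematic over- or under-counting, at the cost of letting opposite errors cancel in the mean.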

DeepCob: Precise and high-throughput analysis of maize cob geometry using deep learning with an application in genebank phenomics

10.1101/2021.03.16.435660, 2021

Author(s): Lydia Kienbaum, Miguel Correa Abondano, Raul H. Blas Sevillano, Karl J Schmid

Keyword(s): Image Analysis, Image Segmentation, Deep Learning, High Throughput, Plant Phenotyping, Phenotypic Traits, Learning Methods, Maize Cob, Classical Image, Maize Cobs

Background: Maize cobs are an important component of crop yield that exhibit a high diversity in size, shape and color in native landraces and modern varieties. Various phenotyping approaches were developed to measure maize cob parameters in a high throughput fashion. More recently, deep learning methods like convolutional neural networks (CNN) became available and were shown to be highly useful for high-throughput plant phenotyping. We aimed at comparing classical image segmentation with deep learning methods for maize cob image segmentation and phenotyping using a large image dataset of native maize landrace diversity from Peru. Results: Comparison of three image analysis methods showed that a Mask R-CNN trained on a diverse set of maize cob images was highly superior to classical image analysis using the Felzenszwalb-Huttenlocher algorithm and a Window-based CNN due to its robustness to image quality and object segmentation accuracy (r=0.99). We integrated Mask R-CNN into a high-throughput pipeline to segment both maize cobs and rulers in images and perform an automated quantitative analysis of eight phenotypic traits, including diameter, length, ellipticity, asymmetry, aspect ratio and average RGB values for cob color. Statistical analysis identified key training parameters for efficient iterative model updating. We also show that a small number of 10-20 images is sufficient to update the initial Mask R-CNN model to process new types of cob images. To demonstrate an application of the pipeline we analyzed phenotypic variation in 19,867 maize cobs extracted from 3,449 images of 2,484 accessions from the maize genebank of Peru to identify phenotypically homogeneous and heterogeneous genebank accessions using multivariate clustering. Conclusions: Single Mask R-CNN model and associated analysis pipeline are widely applicable tools for maize cob phenotyping in contexts like genebank phenomics or plant breeding.

DeepCob: precise and high-throughput analysis of maize cob geometry using deep learning with an application in genebank phenomics

Plant Methods, 10.1186/s13007-021-00787-6, 2021, Vol 17 (1)

Author(s): Lydia Kienbaum, Miguel Correa Abondano, Raul Blas, Karl Schmid

Keyword(s): Image Analysis, Image Segmentation, Deep Learning, High Throughput, Plant Phenotyping, Phenotypic Traits, Learning Methods, Maize Cob, Classical Image, Maize Cobs

Background: Maize cobs are an important component of crop yield that exhibit a high diversity in size, shape and color in native landraces and modern varieties. Various phenotyping approaches were developed to measure maize cob parameters in a high throughput fashion. More recently, deep learning methods like convolutional neural networks (CNNs) became available and were shown to be highly useful for high-throughput plant phenotyping. We aimed at comparing classical image segmentation with deep learning methods for maize cob image segmentation and phenotyping using a large image dataset of native maize landrace diversity from Peru. Results: Comparison of three image analysis methods showed that a Mask R-CNN trained on a diverse set of maize cob images was highly superior to classical image analysis using the Felzenszwalb-Huttenlocher algorithm and a Window-based CNN due to its robustness to image quality and object segmentation accuracy (r = 0.99). We integrated Mask R-CNN into a high-throughput pipeline to segment both maize cobs and rulers in images and perform an automated quantitative analysis of eight phenotypic traits, including diameter, length, ellipticity, asymmetry, aspect ratio and average values of red, green and blue color channels for cob color. Statistical analysis identified key training parameters for efficient iterative model updating. We also show that a small number of 10–20 images is sufficient to update the initial Mask R-CNN model to process new types of cob images. To demonstrate an application of the pipeline we analyzed phenotypic variation in 19,867 maize cobs extracted from 3449 images of 2484 accessions from the maize genebank of Peru to identify phenotypically homogeneous and heterogeneous genebank accessions using multivariate clustering. Conclusions: Single Mask R-CNN model and associated analysis pipeline are widely applicable tools for maize cob phenotyping in contexts like genebank phenomics or plant breeding.

Convolutional Neural Network of Atomic Surface Structures to Predict Binding Energies for High-Throughput Screening of Catalysts

10.26434/chemrxiv.8150666.v1, 2019

Author(s): Seoin Back, Junwoong Yoon, Nianhan Tian, Wen Zhong, Kevin Tran, ...

Keyword(s): Neural Network, Deep Learning, Convolutional Neural Network, High Throughput, High Throughput Screening, Binding Energies, Surface Structures, Voronoi Polyhedra, Atomic Surface

We present an application of deep-learning convolutional neural network of atomic surface structures using atomic and Voronoi polyhedra-based neighbor information to predict adsorbate binding energies for the application in catalysis.

Fully Automated Cultivation of Adipose-Derived Stem Cells in the StemCellDiscovery—A Robotic Laboratory for Small-Scale, High-Throughput Cell Production Including Deep Learning-Based Confluence Estimation

Processes, 10.3390/pr9040575, 2021, Vol 9 (4), pp. 575

Author(s): Jelena Ochs, Ferdinand Biermann, Tobias Piotrowski, Frederik Erkens, Bastian Nießing, ...

Keyword(s): Stem Cells, Deep Learning, High Throughput, High Speed, New Technologies, Human Mesenchymal Stem Cells, Simulation Software, System Capacity, Small Scale, Production Environment

Laboratory automation is a key driver in biotechnology and an enabler for powerful new technologies and applications. In particular, in the field of personalized therapies, automation in research and production is a prerequisite for achieving cost efficiency and broad availability of tailored treatments. For this reason, we present the StemCellDiscovery, a fully automated robotic laboratory for the cultivation of human mesenchymal stem cells (hMSCs) in small scale and in parallel. While the system can handle different kinds of adherent cells, here, we focus on the cultivation of adipose-derived hMSCs. The StemCellDiscovery provides an in-line visual quality control for automated confluence estimation, which is realized by combining high-speed microscopy with deep learning-based image processing. We demonstrate the feasibility of the algorithm to detect hMSCs in culture at different densities and calculate confluences based on the resulting image. Furthermore, we show that the StemCellDiscovery is capable of expanding adipose-derived hMSCs in a fully automated manner using the confluence estimation algorithm. In order to estimate the system capacity under high-throughput conditions, we modeled the production environment in a simulation software. The simulations of the production process indicate that the robotic laboratory is capable of handling more than 95 cell culture plates per day.
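
The confluence estimation described above derives a confluence value from the image produced by the cell-detection step. One common definition, assumed here because the abstract does not give the formula, is the fraction of image pixels classified as cells. A minimal sketch with a hypothetical binary mask and function name:

```python
def confluence(mask):
    """Fraction of pixels marked as cell (truthy) in a 2D binary mask."""
    total = sum(len(row) for row in mask)
    covered = sum(1 for row in mask for pixel in row if pixel)
    return covered / total if total else 0.0


# Hypothetical 2x4 segmentation output: 1 = cell, 0 = background
mask = [[1, 0, 0, 1],
        [1, 1, 0, 0]]
print(confluence(mask))  # 4 of 8 pixels covered -> 0.5
```

In an automated culture loop, a value like this would typically be compared against a passaging threshold. Any such threshold would be a process parameter, and none is stated in the abstract.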

Action plan for hit identification (APHID): KAT6A as a case study

Future Medicinal Chemistry, 10.4155/fmc-2019-0212, 2020, Vol 12 (5), pp. 423-437, Cited By ~ 2

Author(s): Xiangyan Yi, Lian Xue, Tim Thomas, Jonathan B Baell

Keyword(s): High Throughput, High Throughput Screening, Action Plan, Hit Identification

Here, we describe our action plan for hit identification (APHID) that guides the process of hit triage, with elimination of less tractable hits and retention of more tractable hits. We exemplify the process with reference to our high-throughput screening (HTS) campaign against the enzyme, KAT6A, that resulted in successful identification of a tractable hit. We hope that APHID could serve as a useful, concise and digestible guide for those involved in HTS and hit triage, especially those that are relatively new to this exciting and continually evolving technology.

Image Segmentation and Object-Based Image Analysis for Environmental Monitoring: Recent Areas of Interest, Researchers’ Views on the Future Priorities

Remote Sensing, 10.3390/rs12111772, 2020, Vol 12 (11), pp. 1772

Author(s): Brian Alan Johnson, Lei Ma

Keyword(s): Remote Sensing, Image Analysis, Image Segmentation, Online Survey, Accuracy Assessment, Learning Approaches, Special Issue, Object Based Image Analysis, Object Based, Wide Range

Image segmentation and geographic object-based image analysis (GEOBIA) were proposed around the turn of the century as a means to analyze high-spatial-resolution remote sensing images. Since then, object-based approaches have been used to analyze a wide range of images for numerous applications. In this Editorial, we present some highlights of image segmentation and GEOBIA research from the last two years (2018–2019), including a Special Issue published in the journal Remote Sensing. As a final contribution of this special issue, we have shared the views of 45 other researchers (corresponding authors of published papers on GEOBIA in 2018–2019) on the current state and future priorities of this field, gathered through an online survey. Most researchers surveyed acknowledged that image segmentation/GEOBIA approaches have achieved a high level of maturity, although the need for more free user-friendly software and tools, further automation, better integration with new machine-learning approaches (including deep learning), and more suitable accuracy assessment methods was frequently pointed out.

High-Throughput Screening of the Influence of Thermal Treatment on the Mechanical Properties of Semicrystalline Polymers: A Case Study for iPP

Macromolecular Rapid Communications, 10.1002/marc.200300172, 2004, Vol 25 (1), pp. 355-359, Cited By ~ 5

Author(s): Konrad Schneider, Nikolaos Evangelos Zafeiropoulos, Liane Häußler, Manfred Stamm

Keyword(s): Mechanical Properties, Thermal Treatment, High Throughput, High Throughput Screening, Semicrystalline Polymers

Deep Learning in Data-Driven Pavement Image Analysis and Automated Distress Detection: A Review

Data, 10.3390/data3030028, 2018, Vol 3 (3), pp. 28, Cited By ~ 23

Author(s): Kasthurirangan Gopalakrishnan

Keyword(s): Computer Vision, Image Analysis, Deep Learning, Asset Management, Network Architecture, Crack Detection, Future Research, Deep Convolutional Neural Networks, Pavement Distress, Learning Software

Deep learning, more specifically deep convolutional neural networks, is fast becoming a popular choice for computer vision-based automated pavement distress detection. While pavement image analysis has been extensively researched over the past three decades or so, recent ground-breaking achievements of deep learning algorithms in the areas of machine translation, speech recognition, and computer vision have sparked interest in the application of deep learning to automated detection of distresses in pavement images. This paper provides a narrative review of recently published studies in this field, highlighting the current achievements and challenges. A comparison of the deep learning software frameworks, network architecture, hyper-parameters employed by each study, and crack detection performance is provided, which is expected to provide a good foundation for driving further research on this important topic in the context of smart pavement or asset management systems. The review concludes with potential avenues for future research, especially in the application of deep learning to not only detect, but also characterize the type, extent, and severity of distresses from 2D and 3D pavement images.

Sashimi: A toolkit for facilitating high‐throughput organismal image segmentation using deep learning

Methods in Ecology and Evolution, 10.1111/2041-210x.13712, 2021, Cited By ~ 1

Author(s): Shawn T. Schwartz, Michael E. Alfaro

Keyword(s): Image Segmentation, Deep Learning, High Throughput
