easyXpress: An R package to analyze and visualize high-throughput C. elegans microscopy data generated using CellProfiler

High-throughput imaging techniques have become widespread in many fields of biology. These powerful platforms generate large quantities of data that can be difficult to process and visualize efficiently using existing tools. We developed easyXpress to process and review C. elegans high-throughput microscopy data in the R environment. The package provides a logical workflow for the reading, analysis, and visualization of data generated using CellProfiler’s WormToolbox. We equipped easyXpress with powerful functions to customize the filtering of noise in data, specifically by identifying and removing objects that deviate from expected animal measurements. This flexibility in data filtering allows users to optimize their analysis pipeline to match their needs. In addition, easyXpress includes tools for generating detailed visualizations, allowing the user to interactively compare summary statistics across wells and plates with ease. Researchers studying C. elegans benefit from this streamlined and extensible package as it is complementary to CellProfiler and leverages the R environment to rapidly process and analyze large high-throughput imaging datasets.

Download Full-text

easyXpress: An R package to analyze and visualize high-throughput C. elegans microscopy data generated using CellProfiler

PLoS ONE ◽

10.1371/journal.pone.0252000 ◽

2021 ◽

Vol 16 (8) ◽

pp. e0252000

Author(s):

Joy Nyaanga ◽

Timothy A. Crombie ◽

Samuel J. Widmayer ◽

Erik C. Andersen

Keyword(s):

High Throughput ◽

Imaging Techniques ◽

R Package ◽

Summary Statistics ◽

Data Filtering ◽

Analysis Pipeline ◽

C Elegans ◽

Visualization Of Data ◽

Microscopy Data ◽

High Throughput Imaging

Download Full-text

Behavioral fingerprints predict insecticide and anthelmintic mode of action

10.1101/2021.01.27.428391 ◽

2021 ◽

Author(s):

Adam McDermott-Rouse ◽

Eleni Minga ◽

Ida Barlow ◽

Luigi Feriani ◽

Philippa H Harlow ◽

...

Keyword(s):

High Throughput ◽

Mode Of Action ◽

Behavioral Responses ◽

Physiological Data ◽

Discovery Process ◽

Crucial Step ◽

C Elegans ◽

Modes Of Action ◽

Automated Phenotyping ◽

High Throughput Imaging

AbstractNovel invertebrate-killing compounds are required in agriculture and medicine to overcome resistance to existing treatments. Because insecticides and anthelmintics are discovered in phenotypic screens, a crucial step in the discovery process is determining the mode of action of hits. Visible whole-organism symptoms are combined with molecular and physiological data to determine mode of action. However, manual symptomology is laborious and requires symptoms that are strong enough to see by eye. Here we use high-throughput imaging and quantitative phenotyping to measure C. elegans behavioral responses to compounds and train a classifier that predicts mode of action with an accuracy of 88% for a set of ten common modes of action. We also classify compounds within each mode of action to discover pharmacological relationships that are not captured in broad mode of action labels. High-throughput imaging and automated phenotyping could therefore accelerate mode of action discovery in invertebrate-targeting compound development and help to refine mode of action categories.

Download Full-text

Estimation of the Chlorophyll Concentration in Sorghum Using Three High Throughput Phenotyping Imaging Techniques

10.21203/rs.3.rs-407791/v1 ◽

2021 ◽

Author(s):

Huichun Zhang ◽

Yufeng Ge ◽

Xinyan Xie ◽

Abbas Atefi ◽

Nuwan Wijewardane ◽

...

Keyword(s):

Chlorophyll Content ◽

High Throughput ◽

Vegetation Index ◽

Linear Models ◽

Imaging Techniques ◽

Chlorophyll Concentration ◽

Imaging Data ◽

Leaf Chlorophyll ◽

High Throughput Phenotyping ◽

High Throughput Imaging

Abstract BackgroundLeaf chlorophyll content plays an important role in indicating plant stresses and nutrient status. Traditional approaches for the quantification of chlorophyll content mainly include acetone ethanol extraction, spectrophotometry and high-performance liquid chromatography. Such destructive methods based on laboratory procedures are time consuming, expensive, and not suitable for high-throughput phenotyping. High throughput imaging techniques are now widely used for nondestructive analysis of plant phenotypic traits. In this study three imaging modules, namely, RGB, hyperspectral, and fluorescence imaging, were used to estimate chlorophyll content of sorghum plants in a greenhouse environment. Color features, spectral indices, and chlorophyll fluorescence intensity were extracted from these three types of images, and regression models were built to predict leaf chlorophyll content (measured by a handheld leaf chlorophyll meter) from the image features. ResultsModels that included two additional variables, DAS (day after sowing) and SLW (specific leaf weight), were also investigated to improve the prediction of chlorophyll. R2 for chlorophyll concentration for multiple linear models at various color components were 0.77 for R, 0.79 for G, 0.70 for B. To obtain additional spectral information, color component H, S, and I were calculated after color spaces being transformed. The result of HSI space showed that R2 for chlorophyll concentration for multiple linear models were 0.67 for H, 0.88 for S, 0.77 for I. The R2 values for different hyperspectral index like the ratio vegetation index (RVI), the normalized difference vegetation index (NDVI), modified chlorophyll absorption ratio index (MCARI) between 0.77 and 0.78. R2=0.79 was obtained with fluorescence image. Partial least squares regression (PLSR) was employed to using the selected vegetation indices computed from different imaging data to estimate the chlorophyll concentration for sorghum plants. Among all the imaging data, chlorophyll content was predicted with high accuracy (R2 from 0.84 to 2.92, RPD from 2.49 to 3.58). ConclusionAccording to the Akaike's Information Criterion (AIC) error function, the model was better fitted based on images, DAS and SLW than that based on images and DAS. This study indicated that the accuracy for chlorophyll estimation was increased by the image traits combined with DAS and SLW. High throughput imaging provides a simple, rapid, and nondestructive method to estimate the leaf chlorophyll concentration.

Download Full-text

Paper-Supported High-Throughput 3D Culturing, Trapping, and Monitoring of Caenorhabditis Elegans

Micromachines ◽

10.3390/mi11010099 ◽

2020 ◽

Vol 11 (1) ◽

pp. 99 ◽

Cited By ~ 2

Author(s):

Mehdi Tahernia ◽

Maedeh Mohammadifar ◽

Seokheun Choi

Keyword(s):

High Throughput ◽

Developmental Stages ◽

3D Culture ◽

Natural Habitats ◽

Paper Substrate ◽

C Elegans ◽

Cultivation Technique ◽

Plastic Frame ◽

Pass Through ◽

High Throughput Imaging

We developed an innovative paper-based platform for high-throughput culturing, trapping, and monitoring of C. elegans. A 96-well array was readily fabricated by placing a nutrient-replenished paper substrate on a micromachined 96-well plastic frame, providing high-throughput 3D culturing environments and in situ analysis of the worms. The paper allows C. elegans to pass through the porous and aquatic paper matrix until the worms grow and reach the next developmental stages with the increased body size comparable to the paper pores. When the diameter of C. elegans becomes larger than the pore size of the paper substrate, the worms are trapped and immobilized for further high-throughput imaging and analysis. This work will offer a simple yet powerful technique for high-throughput sorting and monitoring of C. elegans at a different larval stage by controlling and choosing different pore sizes of paper. Furthermore, we developed another type of 3D culturing system by using paper-like transparent polycarbonate substrates for higher resolution imaging. The device used the multi-laminate structure of the polycarbonate layers as a scaffold to mimic the worm’s 3D natural habitats. Since the substrate is thin, mechanically strong, and largely porous, the layered structure allowed C. elegans to move and behave freely in 3D and promoted the efficient growth of both C. elegans and their primary food, E. coli. The transparency of the structure facilitated visualization of the worms under a microscope. Development, fertility, and dynamic behavior of C. elegans in the 3D culture platform outperformed those of the standard 2D cultivation technique.

Download Full-text

A microfluidic platform for lifelong high-resolution and high throughput imaging of subtle aging phenotypes in C. elegans

Lab on a Chip ◽

10.1039/c8lc00655e ◽

2018 ◽

Vol 18 (20) ◽

pp. 3090-3100 ◽

Cited By ~ 10

Author(s):

Sahand Saberi-Bosari ◽

Javier Huayta ◽

Adriana San-Miguel

Keyword(s):

High Resolution ◽

High Throughput ◽

Structure And Function ◽

Neuronal Structure ◽

Microfluidic Platform ◽

C Elegans ◽

And Function ◽

High Throughput Imaging

Aging produces a number of changes in the neuronal structure and function throughout a variety of organisms.

Download Full-text

A Semi-high-throughput Imaging Method and Data Visualization Toolkit to Analyze C. elegans Embryonic Development

Journal of Visualized Experiments ◽

10.3791/60362 ◽

2019 ◽

Author(s):

Renat N. Khaliullin ◽

Jeffrey M. Hendel ◽

Adina Gerson-Gurwitz ◽

Shaohe Wang ◽

Stacy D. Ochoa ◽

...

Keyword(s):

Embryonic Development ◽

Data Visualization ◽

High Throughput ◽

Imaging Method ◽

C Elegans ◽

Visualization Toolkit ◽

High Throughput Imaging

Download Full-text

Faculty Opinions recommendation of High-throughput, motility-based sorter for microswimmers such as C. elegans.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.725515725.793516986 ◽

2016 ◽

Author(s):

Simon Melov

Keyword(s):

High Throughput ◽

C Elegans

Download Full-text

A generalised approach for high-throughput instance segmentation of stomata in microscope images

Plant Methods ◽

10.1186/s13007-021-00727-4 ◽

2021 ◽

Vol 17 (1) ◽

Author(s):

Hiranya Jayakody ◽

Paul Petrie ◽

Hugo Jan de Boer ◽

Mark Whitty

Keyword(s):

High Throughput ◽

Imaging Techniques ◽

Detection Algorithm ◽

Input Image ◽

Detection Methods ◽

Sample Collection ◽

High Throughput Analysis ◽

General Applicability ◽

Additional Image ◽

Bounding Boxes

Abstract Background Stomata analysis using microscope imagery provides important insight into plant physiology, health and the surrounding environmental conditions. Plant scientists are now able to conduct automated high-throughput analysis of stomata in microscope data, however, existing detection methods are sensitive to the appearance of stomata in the training images, thereby limiting general applicability. In addition, existing methods only generate bounding-boxes around detected stomata, which require users to implement additional image processing steps to study stomata morphology. In this paper, we develop a fully automated, robust stomata detection algorithm which can also identify individual stomata boundaries regardless of the plant species, sample collection method, imaging technique and magnification level. Results The proposed solution consists of three stages. First, the input image is pre-processed to remove any colour space biases occurring from different sample collection and imaging techniques. Then, a Mask R-CNN is applied to estimate individual stomata boundaries. The feature pyramid network embedded in the Mask R-CNN is utilised to identify stomata at different scales. Finally, a statistical filter is implemented at the Mask R-CNN output to reduce the number of false positive generated by the network. The algorithm was tested using 16 datasets from 12 sources, containing over 60,000 stomata. For the first time in this domain, the proposed solution was tested against 7 microscope datasets never seen by the algorithm to show the generalisability of the solution. Results indicated that the proposed approach can detect stomata with a precision, recall, and F-score of 95.10%, 83.34%, and 88.61%, respectively. A separate test conducted by comparing estimated stomata boundary values with manually measured data showed that the proposed method has an IoU score of 0.70; a 7% improvement over the bounding-box approach. Conclusions The proposed method shows robust performance across multiple microscope image datasets of different quality and scale. This generalised stomata detection algorithm allows plant scientists to conduct stomata analysis whilst eliminating the need to re-label and re-train for each new dataset. The open-source code shared with this project can be directly deployed in Google Colab or any other Tensorflow environment.

Download Full-text

FRI0585 HIGH-THROUGHPUT METHODOLOGY FOR EMR-BASED IDENTIFICATION OF CLINICAL SUB-PHENOTYPES IN COMPLEX PATIENT POPULATIONS

Annals of the Rheumatic Diseases ◽

10.1136/annrheumdis-2020-eular.3489 ◽

2020 ◽

Vol 79 (Suppl 1) ◽

pp. 897.2-897

Author(s):

M. Maurits ◽

T. Huizinga ◽

M. Reinders ◽

S. Raychaudhuri ◽

E. Karlson ◽

...

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Dimensionality Reduction ◽

High Throughput ◽

Brain Cancer ◽

Machine Learning Techniques ◽

Summary Statistics ◽

Medical Problems ◽

Learning Techniques ◽

Icd Codes

Background:Heterogeneity in disease populations complicates discovery of risk factors. To identify risk factors for subpopulations of diseases, we need analytical methods that can deal with unidentified disease subgroups.Objectives:Inspired by successful approaches from the Big Data field, we developed a high-throughput approach to identify subpopulations within patients with heterogeneous, complex diseases using the wealth of information available in Electronic Medical Records (EMRs).Methods:We extracted longitudinal healthcare-interaction records coded by 1,853 PheCodes[1] of the 64,819 patients from the Boston’s Partners-Biobank. Through dimensionality reduction using t-SNE[2] we created a 2D embedding of 32,424 of these patients (set A). We then identified distinct clusters post-t-SNE using DBscan[3] and visualized the relative importance of individual PheCodes within them using specialized spectrographs. We replicated this procedure in the remaining 32,395 records (set B).Results:Summary statistics of both sets were comparable (Table 1).Table 1.Summary statistics of the total Partners Biobank dataset and the 2 partitions.Set-Aset-BTotalEntries12,200,31112,177,13124,377,442Patients32,42432,39564,819Patientyears369,546.33368,597.92738,144.2unique ICD codes25,05624,95326,305unique Phecodes1,8511,8531,853We found 284 clusters in set A and 295 in set B, of which 63.4% from set A could be mapped to a cluster in set B with a median (range) correlation of 0.24 (0.03 – 0.58).Clusters represented similar yet distinct clinical phenotypes; e.g. patients diagnosed with “other headache syndrome” were separated into four distinct clusters characterized by migraines, neurofibromatosis, epilepsy or brain cancer, all resulting in patients presenting with headaches (Fig. 1 & 2). Though EMR databases tend to be noisy, our method was also able to differentiate misclassification from true cases; SLE patients with RA codes clustered separately from true RA cases.Figure 1.Two dimensional representation of Set A generated using dimensionality reduction (tSNE) and clustering (DBScan).Figure 2.Phenotype Spectrographs (PheSpecs) of four clusters characterized by “Other headache syndromes”, driven by codes relating to migraine, epilepsy, neurofibromatosis or brain cancer.Conclusion:We have shown that EMR data can be used to identify and visualize latent structure in patient categorizations, using an approach based on dimension reduction and clustering machine learning techniques. Our method can identify misclassified patients as well as separate patients with similar problems into subsets with different associated medical problems. Our approach adds a new and powerful tool to aid in the discovery of novel risk factors in complex, heterogeneous diseases.References:[1] Denny, J.C. et al. Bioinformatics (2010)[2]van der Maaten et al. Journal of Machine Learning Research (2008)[3] Ester, M. et al. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. (1996)Disclosure of Interests:Marc Maurits: None declared, Thomas Huizinga Grant/research support from: Ablynx, Bristol-Myers Squibb, Roche, Sanofi, Consultant of: Ablynx, Bristol-Myers Squibb, Roche, Sanofi, Marcel Reinders: None declared, Soumya Raychaudhuri: None declared, Elizabeth Karlson: None declared, Erik van den Akker: None declared, Rachel Knevel: None declared

Download Full-text

High-throughput behavioral screen in C. elegans reveals Parkinson’s disease drug candidates

Communications Biology ◽

10.1038/s42003-021-01731-z ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Salman Sohrabi ◽

Danielle E. Mor ◽

Rachel Kaletsky ◽

William Keyes ◽

Coleen T. Murphy

Keyword(s):

Neural Network ◽

Parkinson’S Disease ◽

Parkinson's Disease ◽

High Throughput ◽

Potential Candidate ◽

Automated Scoring ◽

C Elegans ◽

Drug Candidates ◽

Branched Chain ◽

Approved Drugs

AbstractWe recently linked branched-chain amino acid transferase 1 (BCAT1) dysfunction with the movement disorder Parkinson’s disease (PD), and found that RNAi-mediated knockdown of neuronal bcat-1 in C. elegans causes abnormal spasm-like ‘curling’ behavior with age. Here we report the development of a machine learning-based workflow and its application to the discovery of potentially new therapeutics for PD. In addition to simplifying quantification and maintaining a low data overhead, our simple segment-train-quantify platform enables fully automated scoring of image stills upon training of a convolutional neural network. We have trained a highly reliable neural network for the detection and classification of worm postures in order to carry out high-throughput curling analysis without the need for user intervention or post-inspection. In a proof-of-concept screen of 50 FDA-approved drugs, enasidenib, ethosuximide, metformin, and nitisinone were identified as candidates for potential late-in-life intervention in PD. These findings point to the utility of our high-throughput platform for automated scoring of worm postures and in particular, the discovery of potential candidate treatments for PD.

Download Full-text