Linear Discriminant Analysis of Single-Cell Fluorescence Excitation Spectra of Five Phytoplankton Species

2012 ◽  
Vol 66 (1) ◽  
pp. 60-65 ◽  
Author(s):  
Laura S. Bruckman ◽  
Tammi L. Richardson ◽  
Joseph A. Swanstrom ◽  
Kathleen A. Donaldson ◽  
Michael Allora ◽  
...  
2018 ◽  
Vol 73 (3) ◽  
pp. 304-312 ◽  
Author(s):  
Stefan T. Faulkner ◽  
Cameron M. Rekully ◽  
Eric M. Lachenmyer ◽  
Ergun Kara ◽  
Tammi L. Richardson ◽  
...  

Phytoplankton play a vital role as primary producers in aquatic ecosystems. One common approach to classifying phytoplankton is fluorescence excitation spectroscopy, which leverages the variation in types and concentrations of pigments among different phytoplankton taxonomic groups. Here, we used a fluorescence imaging photometer to measure excitation ratios (“signatures”) of single cells and bulk cultures of seven differently pigmented phytoplankton species as they progressed from nitrogen N-replete to N-depleted conditions. Our objective was to determine whether N depletion alters the fluorescence excitation signature of each species and, if so, how quickly they recover when N (as nitrate) was resupplied, because these factors affect our ability to classify the species correctly. Of the seven species studied, only Proteomonas sulcata, a marine cryptophyte, showed measurable changes in single-cell fluorescence excitation ratios and bulk fluorescence excitation spectra. These changes were likely due to decreases in the cellular concentration of phycoerythrin, a N-rich pigment, as N became scarce. Within 3 h of resupply of N, fluorescence signatures began returning to pre-depletion values and were indistinguishable from N-replete cells by 80 h after resupply. These data suggest that our classification approach is robust for non-PE containing phytoplankton. PE-containing phytoplankton might exhibit systematic changes in their signatures depending on their level of N depletion, but this could be detected and the phytoplankton re-classified following a few hours of incubation in N replete conditions.


2022 ◽  
Author(s):  
Meelad Amouzgar ◽  
David R Glass ◽  
Reema Baskar ◽  
Inna Averbukh ◽  
Samuel C Kimmey ◽  
...  

Single-cell technologies generate large, high-dimensional datasets encompassing a diversity of omics. Dimensionality reduction enables visualization of data by representing cells in two-dimensional plots that capture the structure and heterogeneity of the original dataset. Visualizations contribute to human understanding of data and are useful for guiding both quantitative and qualitative analysis of cellular relationships. Existing algorithms are typically unsupervised, utilizing only measured features to generate manifolds, disregarding known biological labels such as cell type or experimental timepoint. Here, we repurpose the classification algorithm, linear discriminant analysis (LDA), for supervised dimensionality reduction of single-cell data. LDA identifies linear combinations of predictors that optimally separate a priori classes, enabling users to tailor visualizations to separate specific aspects of cellular heterogeneity. We implement feature selection by hybrid subset selection (HSS) and demonstrate that this flexible, computationally-efficient approach generates non-stochastic, interpretable axes amenable to diverse biological processes, such as differentiation over time and cell cycle. We benchmark HSS-LDA against several popular dimensionality reduction algorithms and illustrate its utility and versatility for exploration of single-cell mass cytometry, transcriptomics and chromatin accessibility data.


2017 ◽  
Vol 72 (3) ◽  
pp. 442-462 ◽  
Author(s):  
Cameron M. Rekully ◽  
Stefan T. Faulkner ◽  
Eric M. Lachenmyer ◽  
Brady R. Cunningham ◽  
Timothy J. Shaw ◽  
...  

An all-pairs method is used to analyze phytoplankton fluorescence excitation spectra. An initial set of nine phytoplankton species is analyzed in pairwise fashion to select two optical filter sets, and then the two filter sets are used to explore variations among a total of 31 species in a single-cell fluorescence imaging photometer. Results are presented in terms of pair analyses; we report that 411 of the 465 possible pairings of the larger group of 31 species can be distinguished using the initial nine-species-based selection of optical filters. A bootstrap analysis based on the larger data set shows that the distribution of possible pair separation results based on a randomly selected nine-species initial calibration set is strongly peaked in the 410–415 pair separation range, consistent with our experimental result. Further, the result for filter selection using all 31 species is also 411 pair separations; The set of phytoplankton fluorescence excitation spectra is intuitively high in rank due to the number and variety of pigments that contribute to the spectrum. However, the results in this report are consistent with an effective rank as determined by a variety of heuristic and statistical methods in the range of 2–3. These results are reviewed in consideration of how consistent the filter selections are from model to model for the data presented here. We discuss the common observation that rank is generally found to be relatively low even in many seemingly complex circumstances, so that it may be productive to assume a low rank from the beginning. If a low-rank hypothesis is valid, then relatively few samples are needed to explore an experimental space. Under very restricted circumstances for uniformly distributed samples, the minimum number for an initial analysis might be as low as 8–11 random samples for 1–3 factors.


2020 ◽  
Vol 16 (8) ◽  
pp. 1079-1087
Author(s):  
Jorgelina Z. Heredia ◽  
Carlos A. Moldes ◽  
Raúl A. Gil ◽  
José M. Camiña

Background: The elemental composition of maize grains depends on the soil, land and environment characteristics where the crop grows. These effects are important to evaluate the availability of nutrients with complex dynamics, such as the concentration of macro and micronutrients in soils, which can vary according to different topographies. There is available scarce information about the influence of topographic characteristics (upland and lowland) where culture is developed with the mineral composition of crop products, in the present case, maize seeds. On the other hand, the study of the topographic effect on crops using multivariate analysis tools has not been reported. Objective: This paper assesses the effect of topographic conditions on plants, analyzing the mineral profiles in maize seeds obtained in two land conditions: uplands and lowlands. Materials and Methods: The mineral profile was studied by microwave plasma atomic emission spectrometry. Samples were collected from lowlands and uplands of cultivable lands of the north-east of La Pampa province, Argentina. Results: Differentiation of maize seeds collected from both topographical areas was achieved by principal components analysis (PCA), cluster analysis (CA) and linear discriminant analysis (LDA). PCA model based on mineral profile allowed to differentiate seeds from upland and lowlands by the influence of Cr and Mg variables. A significant accumulation of Cr and Mg in seeds from lowlands was observed. Cluster analysis confirmed such grouping but also, linear discriminant analysis achieved a correct classification of both the crops, showing the effect of topography on elemental profile. Conclusions: Multi-elemental analysis combined with chemometric tools proved useful to assess the effect of topographic characteristics on crops.


2020 ◽  
Vol 15 ◽  
Author(s):  
Mohanad Mohammed ◽  
Henry Mwambi ◽  
Bernard Omolo

Background: Colorectal cancer (CRC) is the third most common cancer among women and men in the USA, and recent studies have shown an increasing incidence in less developed regions, including Sub-Saharan Africa (SSA). We developed a hybrid (DNA mutation and RNA expression) signature and assessed its predictive properties for the mutation status and survival of CRC patients. Methods: Publicly-available microarray and RNASeq data from 54 matched formalin-fixed paraffin-embedded (FFPE) samples from the Affymetrix GeneChip and RNASeq platforms, were used to obtain differentially expressed genes between mutant and wild-type samples. We applied the support-vector machines, artificial neural networks, random forests, k-nearest neighbor, naïve Bayes, negative binomial linear discriminant analysis, and the Poisson linear discriminant analysis algorithms for classification. Cox proportional hazards model was used for survival analysis. Results: Compared to the genelist from each of the individual platforms, the hybrid genelist had the highest accuracy, sensitivity, specificity, and AUC for mutation status, across all the classifiers and is prognostic for survival in patients with CRC. NBLDA method was the best performer on the RNASeq data while the SVM method was the most suitable classifier for CRC across the two data types. Nine genes were found to be predictive of survival. Conclusion: This signature could be useful in clinical practice, especially for colorectal cancer diagnosis and therapy. Future studies should determine the effectiveness of integration in cancer survival analysis and the application on unbalanced data, where the classes are of different sizes, as well as on data with multiple classes.


Sign in / Sign up

Export Citation Format

Share Document