Constraining PERMANOVA and LDM to Within-Set Comparisons by Projection Improves the Efficiency of Analyses of Matched sets of Microbiome Data

Abstract Background: Matched-set data arise frequently in microbiome studies. For example, we may collect pre- and post-treatment samples from a set of individuals, or use important confounding variables to match data from case participants to one or more control participants. Thus, there is a need for statistical methods for data comprised of matched sets, to test hypotheses against traits of interest (e.g., clinical outcomes or environmental factors) at the community level and/or the OTU (operational taxonomic unit) level. Optimally, these methods should accommodate complex data such as those with unequal sample sizes across sets, confounders varying within sets, and continuous traits of interest. Methods: PERMANOVA is a commonly used distance-based method for testing hypotheses at the community level. We have also developed the linear decomposition model (LDM) that unifies the community-level and OTU-level tests into one framework. Here we present a new strategy that can be used with both PERMANOVA and the LDM for analyzing matched-set data. We propose to include an indicator variable for each set as covariates, so as to constrain comparisons between samples within a set, and also permute traits within each set, which can account for exchangeable sample correlations. The flexible nature of PERMANOVA and the LDM allows discrete or continuous traits or interactions to be tested, within-set confounders to be adjusted, and unbalanced data to be fully exploited. Results: Our simulations indicate that our proposed strategy outperformed alternative strategies, including the commonly used one that utilizes restricted permutation only, in a wide range of scenarios. Using simulation, we also explored optimal designs for matched-set studies. The flexibility of PERMANOVA and the LDM for a variety of matched-set microbiome data is illustrated by the analysis of data from two real studies. Conclusions: Including set indicator variables and permuting within sets when analyzing matched-set data with PERMANOVA or the LDM is a strategy that performs well and is capable of handling the complex data structures that frequently occur in microbiome studies.

Constraining PERMANOVA and LDM to within-set comparisons by projection improves the efficiency of analyses of matched sets of microbiome data

Microbiome ◽

10.1186/s40168-021-01034-9 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Zhengyi Zhu ◽

Glen A. Satten ◽

Caroline Mitchell ◽

Yi-Juan Hu

Keyword(s):

Operational Taxonomic Unit ◽

Community Level ◽

Complex Data ◽

Indicator Variable ◽

Wide Range ◽

Indicator Variables ◽

New Strategy ◽

Restricted Permutation ◽

Microbiome Data ◽

Continuous Traits

Abstract Background: Matched-set data arise frequently in microbiome studies. For example, we may collect pre- and post-treatment samples from a set of individuals, or use important confounding variables to match data from case participants to one or more control participants. Thus, there is a need for statistical methods for data comprised of matched sets, to test hypotheses against traits of interest (e.g., clinical outcomes or environmental factors) at the community level and/or the operational taxonomic unit (OTU) level. Optimally, these methods should accommodate complex data such as those with unequal sample sizes across sets, confounders varying within sets, and continuous traits of interest. Methods: PERMANOVA is a commonly used distance-based method for testing hypotheses at the community level. We have also developed the linear decomposition model (LDM) that unifies the community-level and OTU-level tests into one framework. Here we present a new strategy that can be used with both PERMANOVA and the LDM for analyzing matched-set data. We propose to include an indicator variable for each set as covariates, so as to constrain comparisons between samples within a set, and also permute traits within each set, which can account for exchangeable sample correlations. The flexible nature of PERMANOVA and the LDM allows discrete or continuous traits or interactions to be tested, within-set confounders to be adjusted, and unbalanced data to be fully exploited. Results: Our simulations indicate that our proposed strategy outperformed alternative strategies, including the commonly used one that utilizes restricted permutation only, in a wide range of scenarios. Using simulation, we also explored optimal designs for matched-set studies. The flexibility of PERMANOVA and the LDM for a variety of matched-set microbiome data is illustrated by the analysis of data from two real studies. Conclusions: Including set indicator variables and permuting within sets when analyzing matched-set data with PERMANOVA or the LDM is a strategy that performs well and is capable of handling the complex data structures that frequently occur in microbiome studies.
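
The strategy described in this abstract (set indicators as covariates, plus permutation of the trait within each set) can be illustrated with a minimal NumPy sketch. This is not the authors' software. The function name `within_set_permanova`, its parameters, and the use of an unscaled pseudo-F ratio are assumptions made for illustration. The sketch projects the set indicators out of both the Gower-centered distance matrix and the trait, then compares the observed statistic with statistics from within-set permutations.

```python
import numpy as np


def within_set_permanova(dist, trait, set_id, n_perm=999, seed=0):
    """Illustrative PERMANOVA-style test constrained to within-set comparisons.

    dist   : (n, n) symmetric distance matrix
    trait  : (n,) numeric trait of interest
    set_id : (n,) matched-set label for each sample
    Returns (statistic, permutation p-value).
    """
    rng = np.random.default_rng(seed)
    dist = np.asarray(dist, dtype=float)
    trait = np.asarray(trait, dtype=float)
    n = trait.shape[0]

    # Gower-centered matrix: G = -1/2 * J D^2 J
    J = np.eye(n) - np.ones((n, n)) / n
    G = -0.5 * J @ (dist ** 2) @ J

    # One indicator column per set; R projects onto their orthogonal complement,
    # which amounts to centering within each set.
    sets, idx = np.unique(np.asarray(set_id), return_inverse=True)
    Z = np.eye(len(sets))[idx]
    R = np.eye(n) - Z @ np.linalg.pinv(Z)
    G_r = R @ G @ R
    total = np.trace(G_r)

    def stat(x):
        x_r = R @ x                       # trait with set effects removed
        ss = x_r @ x_r
        if ss < 1e-12:                    # trait constant within every set
            return 0.0
        explained = (x_r @ G_r @ x_r) / ss
        # Degrees-of-freedom factors are constant across permutations, so omitted.
        return explained / (total - explained)

    observed = stat(trait)
    exceed = 0
    for _ in range(n_perm):
        perm = trait.copy()
        for s in range(len(sets)):        # shuffle the trait only within each set
            members = np.flatnonzero(idx == s)
            perm[members] = rng.permutation(perm[members])
        exceed += int(stat(perm) >= observed)
    return observed, (exceed + 1) / (n_perm + 1)
```

Because the trait is shuffled only among samples in the same set, sets in which the trait does not vary contribute nothing to the test, which matches the idea of constraining comparisons to within-set contrasts.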

Analyzing matched sets of microbiome data using the LDM and PERMANOVA

10.21203/rs.3.rs-17148/v1 ◽

2020 ◽

Author(s):

Zhengyi Zhu ◽

Glen Satten ◽

Caroline Mitchell ◽

Yi-Juan Hu

Keyword(s):

Operational Taxonomic Unit ◽

Community Level ◽

Optimal Designs ◽

Complex Data ◽

Indicator Variable ◽

Confounding Variables ◽

Wide Range ◽

Indicator Variables ◽

Microbiome Data ◽

Continuous Traits

Abstract Background: Matched-set data arise frequently in microbiome studies. For example, we may collect pre- and post-treatment samples from a set of individuals, or use important confounding variables to match data from case participants to one or more control participants. Thus, there is a need for statistical methods for data comprised of matched sets, to test hypotheses against traits of interest (e.g., clinical outcomes or environmental factors) at the community level and/or the OTU (operational taxonomic unit) level. Optimally, these methods should accommodate complex data such as those with unequal sample sizes across sets, confounders varying within sets, and continuous traits of interest. Methods: PERMANOVA is a commonly used distance-based method for testing hypotheses at the community level. We have also developed the linear decomposition model (LDM) that unifies the community-level and OTU-level tests into one framework. Here we present a strategy that can be used with both PERMANOVA and the LDM for analyzing matched-set data. We propose to include an indicator variable for each set as covariates, so as to constrain comparisons between samples within a set, and also permute traits within each set, which can account for exchangeable sample correlations. The flexible nature of PERMANOVA and the LDM allows discrete or continuous traits or interactions to be tested, within-set confounders to be adjusted, and unbalanced data to be fully exploited. Results: Our simulations indicate that our proposed strategy outperformed alternative strategies in a wide range of scenarios. Using simulation, we also explored optimal designs for matched-set studies. The flexibility of PERMANOVA and the LDM for a variety of matched-set microbiome data is illustrated by the analysis of data from two real studies. Conclusions: Including set indicator variables and permuting within sets when analyzing matched-set data with PERMANOVA or the LDM is a strategy that performs well and is capable of handling the complex data structures that frequently occur in microbiome studies.

Analyzing matched sets of microbiome data using the LDM and PERMANOVA

10.1101/2020.03.06.980367 ◽

2020 ◽

Author(s):

Zhengyi Zhu ◽

Glen A. Satten ◽

Caroline Mitchell ◽

Yi-Juan Hu

Keyword(s):

Operational Taxonomic Unit ◽

Community Level ◽

Optimal Designs ◽

Complex Data ◽

Indicator Variable ◽

Confounding Variables ◽

Wide Range ◽

Indicator Variables ◽

Microbiome Data ◽

Continuous Traits

Abstract Background: Matched-set data arise frequently in microbiome studies. For example, we may collect pre- and post-treatment samples from a set of individuals, or use important confounding variables to match data from case participants to one or more control participants. Thus, there is a need for statistical methods for data comprised of matched sets, to test hypotheses against traits of interest (e.g., clinical outcomes or environmental factors) at the community level and/or the OTU (operational taxonomic unit) level. Optimally, these methods should accommodate complex data such as those with unequal sample sizes across sets, confounders varying within sets, and continuous traits of interest. Methods: PERMANOVA is a commonly used distance-based method for testing hypotheses at the community level. We have also developed the linear decomposition model (LDM) that unifies the community-level and OTU-level tests into one framework. Here we present a strategy that can be used with both PERMANOVA and the LDM for analyzing matched-set data. We propose to include an indicator variable for each set as covariates, so as to constrain comparisons between samples within a set, and also permute traits within each set, which can account for exchangeable sample correlations. The flexible nature of PERMANOVA and the LDM allows discrete or continuous traits or interactions to be tested, within-set confounders to be adjusted, and unbalanced data to be fully exploited. Results: Our simulations indicate that our proposed strategy outperformed alternative strategies in a wide range of scenarios. Using simulation, we also explored optimal designs for matched-set studies. The flexibility of PERMANOVA and the LDM for a variety of matched-set microbiome data is illustrated by the analysis of data from two real studies. Conclusions: Including set indicator variables and permuting within sets when analyzing matched-set data with PERMANOVA or the LDM is a strategy that performs well and is capable of handling the complex data structures that frequently occur in microbiome studies.

Diaminomaleonitrile as a versatile building block for the synthesis of 4,4′-biimidazolidinylidenes and 4,4′-bithiazolidinylidenes

Heterocyclic Communications ◽

10.1515/hc-2018-0127 ◽

2018 ◽

Vol 24 (6) ◽

pp. 303-306

Author(s):

Mahsa Doomanlou ◽

Hassan Kabirifard ◽

Mehdi Asadi ◽

Maryam Moloudi ◽

Seyedeh Sara Mirfazli

Keyword(s):

Building Block ◽

Ring Closure ◽

Wide Range ◽

New Strategy ◽

Aryl Isocyanates ◽

Aryl Isothiocyanates

Abstract Ring closure reactions of diaminomaleonitrile (DAMN) with electrophilic aryl isocyanates and aryl isothiocyanates lead to the formation of the target 5,5′-diimino-1,1′-diaryl-4,4′-biimidazolidinylidene-2,2′-diones 2a,b and 2,2′-diarylimino-4,4′-bithiazolidinylidenes 4a–e, respectively. The protocol provides a new strategy for the synthesis of a wide range of alkenes with two electron-donating and two withdrawing substituents of DAMN in moderate to good yields.

Exploring the Association Between the “Big Five” Personality Traits and Fatal Opioid Overdose: County-Level Empirical Analysis (Preprint)

10.2196/preprints.24939 ◽

2020 ◽

Author(s):

Zhasmina Tacheva ◽

Anton Ivanov

Keyword(s):

United States ◽

Big Five ◽

Psychological Factors ◽

The United States ◽

Community Level ◽

Opioid Overdose ◽

County Level ◽

Psychological Traits ◽

Wide Range ◽

The Relationship

BACKGROUND Opioid-related deaths constitute a problem of pandemic proportions in the United States, with no clear solution in sight. Although addressing addiction, the heart of this problem, ought to remain a priority for health practitioners, examining the community-level psychological factors with a known impact on health behaviors may provide valuable insights for attenuating this health crisis by curbing risky behaviors before they evolve into addiction. OBJECTIVE The goal of this study is twofold: to demonstrate the relationship between community-level psychological traits and fatal opioid overdose both theoretically and empirically, and to provide a blueprint for using social media data to glean these psychological factors in a real-time, reliable, and scalable manner. METHODS We collected annual panel data from Twitter for 2891 counties in the United States from 2014 to 2016 and used a novel data mining technique to obtain average county-level “Big Five” psychological trait scores. We then performed interval regression, using a control function to alleviate omitted variable bias, to empirically test the relationship between county-level psychological traits and the prevalence of fatal opioid overdoses in each county. RESULTS After controlling for a wide range of community-level biopsychosocial factors related to health outcomes, we found that three of the operationalizations of the five psychological traits examined at the community level in the study were significantly associated with fatal opioid overdoses: extraversion (β=.308, P<.001), neuroticism (β=.248, P<.001), and conscientiousness (β=.229, P<.001). CONCLUSIONS Analyzing the psychological characteristics of a community can be a valuable tool in the local, state, and national fight against the opioid pandemic. Health providers and community health organizations can benefit from this research by evaluating the psychological profile of the communities they serve and assessing the projected risk of fatal opioid overdose based on the relationships our study predicts when making decisions for the allocation of overdose-reversal medication and other vital resources.

The Influence of Individual Characteristics toward Benefit Recipients’ Participation of Program Keluarga Harapan

INTERNATIONAL JOURNAL OF EDUCATIONAL REVIEW ◽

10.33369/ijer.v3i1.11347 ◽

2020 ◽

Vol 3 (1) ◽

pp. 29-37

Author(s):

Tryas Wardani Nurwan ◽

Helmi Hasan

Keyword(s):

Data Analysis ◽

Quantitative Method ◽

Individual Characteristics ◽

Individual Characteristic ◽

Indicator Variable ◽

Level Of Education ◽

Independent Variables ◽

The Family ◽

Indicator Variables

The purpose of the study was to determine the effect of individual characteristics on benefit recipients’ participation in Program Keluarga Harapan (PKH) in Nagari Pematang Panjang, Sijunjung District, West Sumatera. The study used a quantitative method with a questionnaire, and the data were analyzed using SPSS 21. Based on Slovin’s formula, 131 respondents were drawn from the 194 benefit recipients. The indicator variables for participation, the dependent variable, are participation in the implementation of P2K2 and participation in taking PKH fund benefits. The indicator variables for individual characteristics, the independent variables, are the level of education (X1), age (X2), and number of family dependents (X3). The results showed that the three individual characteristic variables influence recipients’ participation.
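
The reported sample size can be checked against Slovin's formula, n = N / (1 + N e^2). The abstract does not state the margin of error e; the worked example below assumes e = 0.05, which is a common choice and reproduces the figure of 131 respondents from a population of N = 194.

```latex
n = \frac{N}{1 + N e^{2}}
  = \frac{194}{1 + 194 \times 0.05^{2}}
  = \frac{194}{1.485}
  \approx 130.6
  \;\Rightarrow\; n = 131 \text{ (rounded up)}
```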

Characterization of Metabolites of α-mangostin in Bio-samples from SD Rats by UHPLC-Q-Exactive Orbitrap MS

Current Drug Metabolism ◽

10.2174/1389200222666211126093124 ◽

2021 ◽

Vol 22 ◽

Author(s):

Fan Dong ◽

Shaoping Wang ◽

Ailin Yang ◽

Haoran Li ◽

Pingping Dong ◽

...

Keyword(s):

Metabolic Pathways ◽

Garcinia Mangostana ◽

SD Rats ◽

Metabolic Route ◽

Wide Range ◽

New Strategy ◽

Q Exactive ◽

Orbitrap MS

Background: α-mangostin, a typical xanthone, occurs in Garcinia mangostana L. (Clusiaceae) and has been found to have a wide range of pharmacological properties. However, its specific metabolic route in vivo remains unclear, although its metabolites may also accumulate and exert pharmacological effects. Objective: This study aimed to clarify the metabolic pathways of α-mangostin after oral administration to rats. Methods: A UHPLC-Q-Exactive Orbitrap MS was used to detect potential metabolites formed in vivo. A new strategy for the identification of unknown metabolites based on typical fragmentation routes was implemented. Results: A total of 42 metabolites were detected, and their structures were tentatively identified in this study. The results showed that major in vivo metabolic pathways of α-mangostin in rats included methylation, demethylation, methoxylation, hydrogenation, dehydrogenation, hydroxylation, dehydroxylation, glucuronidation, and sulfation. Conclusions: This study expands our knowledge of the in vivo metabolism of α-mangostin and helps to explain the mechanism of action of α-mangostin in rats in vivo.

Genome- and Community-Level Interaction Insights into Carbon Utilization and Element Cycling Functions of Hydrothermarchaeota in Hydrothermal Sediment

mSystems ◽

10.1128/msystems.00795-19 ◽

2020 ◽

Vol 5 (1) ◽

Cited By ~ 10

Author(s):

Zhichao Zhou ◽

Yang Liu ◽

Wei Xu ◽

Jie Pan ◽

Zhu-Hua Luo ◽

...

Keyword(s):

Community Level ◽

Data Sets ◽

Carbon Utilization ◽

Black Smoker ◽

Community Interactions ◽

Element Cycling ◽

Bisphosphate Carboxylase ◽

Hydrothermal Sediment ◽

Wide Range ◽

Functional Components

ABSTRACT Hydrothermal vents release reduced compounds and small organic carbon compounds into the surrounding seawater, providing essential substrates for microbial growth and bioenergy transformations. Despite the wide distribution of the marine benthic group E archaea (referred to as Hydrothermarchaeota) in the hydrothermal environment, little is known about their genomic repertoires and biogeochemical significance. Here, we studied four highly complete (>80%) metagenome-assembled genomes (MAGs) from a black smoker chimney and the surrounding sulfur-rich sediments on the South Atlantic Mid-Ocean Ridge and publicly available data sets (the Integrated Microbial Genomes system of the U.S. Department of Energy-Joint Genome Institute and NCBI SRA data sets). Genomic analysis suggested a wide carbon metabolic diversity of Hydrothermarchaeota members, including the utilization of proteins, lactate, and acetate; the anaerobic degradation of aromatics; the oxidation of C1 compounds (CO, formate, and formaldehyde); the utilization of methyl compounds; CO2 incorporation by the tetrahydromethanopterin-based Wood-Ljungdahl pathway; and participation in the type III ribulose-1,5-bisphosphate carboxylase/oxygenase-based Calvin-Benson-Bassham cycle. These microbes also potentially oxidize sulfur, arsenic, and hydrogen and engage in anaerobic respiration based on sulfate reduction and denitrification. Among the 140 MAGs reconstructed from the black smoker chimney microbial community (including Hydrothermarchaeota MAGs), community-level metabolic predictions suggested a redundancy of carbon utilization and element cycling functions and interactive syntrophic and sequential utilization of substrates. These processes might make various carbon and energy sources widely accessible to the microorganisms. 
Further, the analysis suggested that Hydrothermarchaeota members contained important functional components obtained from the community via lateral gene transfer, becoming a distinctive clade. This might serve as a niche-adaptive strategy for metabolizing heavy metals, C1 compounds, and reduced sulfur compounds. Collectively, the analysis provides comprehensive metabolic insights into the Hydrothermarchaeota. IMPORTANCE This study provides comprehensive metabolic insights into the Hydrothermarchaeota from comparative genomics, evolution, and community-level perspectives. Members of the Hydrothermarchaeota synergistically participate in a wide range of carbon-utilizing and element cycling processes with other microorganisms in the community. We expand the current understanding of community interactions within the hydrothermal sediment and chimney, suggesting that microbial interactions based on sequential substrate metabolism are essential to nutrient and element cycling.

NEW CHALLENGES FACING INTEGRATIVE BIOLOGICAL SCIENCE IN THE POST-GENOMIC ERA

Journal of Biological Systems ◽

10.1142/s0218339006001805 ◽

2006 ◽

Vol 14 (02) ◽

pp. 275-293 ◽

Cited By ~ 2

Author(s):

CHRISTOPHER S. OEHMEN ◽

TJERK P. STRAATSMA ◽

GORDON A. ANDERSON ◽

GALYA ORR ◽

BOBBIE-JO M. WEBB-ROBERTSON ◽

...

Keyword(s):

Paradigm Shift ◽

Large Scale ◽

Experimental Testing ◽

Spatial Scales ◽

Geographical Area ◽

Biological Data ◽

Biological Research ◽

Complex Data ◽

Discovery Research ◽

Wide Range

The future of biology will be increasingly driven by the fundamental paradigm shift from hypothesis-driven research to data-driven discovery research employing the growing volume of biological data coupled to experimental testing of new discoveries. But hardware and software limitations in the current workflow infrastructure make it impossible or intractable to use real data from disparate sources for large-scale biological research. We identify key technological developments needed to enable this paradigm shift involving (1) the ability to store and manage extremely large datasets which are dispersed over a wide geographical area, (2) development of novel analysis and visualization tools which are capable of operating on enormous data resources without overwhelming researchers with unusable information, and (3) formalisms for integrating mathematical models of biosystems from the molecular level to the organism population level. This will require the development of algorithms and tools which efficiently utilize high-performance compute power and large storage infrastructures. The end result will be the ability of a researcher to integrate complex data from many different sources with simulations to analyze a given system at a wide range of temporal and spatial scales in a single conceptual model.

Soil microbial biomass, community level physiological profiles relate to tree species and its state in urban environment

10.5194/egusphere-egu2020-1064 ◽

2020 ◽

Author(s):

Alexandra Seleznyova ◽

Alexey Yaroslavtcev ◽

Olga Gavrichkova ◽

Alexey Ryazanov ◽

Julia Kovaleva ◽

...

Keyword(s):

Microbial Biomass ◽

Microbial Diversity ◽

Carboxylic Acids ◽

Tree Species ◽

Diversity Index ◽

Soil Microbial Communities ◽

Community Level ◽

Soil Microbial ◽

Vertical Stability ◽

Wide Range

Urban trees and soil microbial communities are key ecosystem components that provide the supporting, provisioning, and regulating services that define citizens’ well-being. Understanding the relationships between the physiological state, age, and species of trees and microbial functional properties is needed for the management of urban areas and for landscape engineering. The research focuses on finding linkages between a wide range of tree properties monitored by smart TreeTalker technology and soil functional microbial indexes in the Moscow megapolis. The study was carried out on the RUDN University campus (Moscow, Russia), where six tree species were selected (Pinus sylvestris, Populus tremula, Acer platanoides, Tilia cordata, Picea abies, Betula pendula). A TreeTalker device was installed on five preselected trees of each species to monitor sap flux, vertical stability (measured with a digital accelerometer), canopy reflectance spectra, and trunk and canopy air temperature and humidity. Monitoring started in May 2019. Composite soil samples (0-10) were taken by augering under each tree, at a distance of 0.5 m from the tree, in October 2019. In the samples, microbial biomass carbon (MBC, SIR method), basal respiration (BR), community level physiological profile (CLPP, MicroResp), and the Shannon microbial diversity index (H’) based on CLPP were determined. Soil MBC content depended significantly on tree species, increasing from A. platanoides to T. cordata (from 538 to 1445 µg C g-1). The microbial diversity index was lowest in soil under A. platanoides (H’=2.1) and highest under B. pendula (H’=2.4). The soil CLPP for A. platanoides was shifted mainly toward microbial response to carboxylic acids, with a low response to amino and phenolic acids compared to other tree species (e.g. B. pendula). Soil qCO2 (BR/MBC ratio) was positively related to tree age (r=0.8). Response to carboxylic acids (especially oxalic) had the highest correlation with physiological properties of the trees: trunk moisture, photochemical reflectance index and vertical stability (r > -0.5). Current research was financially supported by Russian Science Foundation [No 19-77-30012].
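
The two derived indices named in this abstract, the Shannon index based on CLPP and the metabolic quotient qCO2 (BR/MBC), can be sketched as follows. The function names and the use of the natural logarithm are assumptions for illustration; the abstract does not state the logarithm base or how substrate responses were normalized.

```python
import math


def shannon_index(responses):
    """Shannon index H' = -sum(p_i * ln p_i) over substrate responses.

    `responses` holds non-negative responses to each substrate in a
    community level physiological profile; zero responses contribute nothing.
    """
    total = sum(responses)
    if total <= 0:
        raise ValueError("at least one response must be positive")
    proportions = [r / total for r in responses if r > 0]
    return -sum(p * math.log(p) for p in proportions)


def metabolic_quotient(basal_respiration, microbial_biomass_carbon):
    """qCO2 as the ratio of basal respiration (BR) to microbial biomass carbon (MBC)."""
    return basal_respiration / microbial_biomass_carbon
```

Under these assumptions, a profile with equal responses to k substrates gives the maximum value ln(k), so the index rises as responses are spread more evenly across substrates.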
