scholarly journals Reference genome and transcriptome informed by the sex chromosome complement of the sample increases ability to detect sex differences in gene expression from RNA-Seq data

2019 ◽  
Author(s):  
Kimberly C. Olney ◽  
Sarah M. Brotman ◽  
Jocelyn P. Andrews ◽  
Valeria A. Valverde-Vesling ◽  
Melissa A. Wilson

AbstractBackgroundHuman X and Y chromosomes share an evolutionary origin and, as a consequence, sequence similarity. We investigated whether sequence homology between the X and Y chromosomes affects alignment of RNA-Seq reads and estimates of differential expression. We tested the effects of using reference genomes and reference transcriptomes informed by the sex chromosome complement of the sample’s genome on measurements of RNA-Seq abundance and sex differences in expression.ResultsThe default genome includes the entire human reference genome (GRCh38), including the entire sequence of the X and Y chromosomes. We created two sex chromosome complement informed reference genomes. One sex chromosome complement informed reference genome was used for samples that lacked a Y chromosome; for this reference genome version, we hard-masked the entire Y chromosome. For the other sex chromosome complement informed reference genome, to be used for samples with a Y chromosome, we hard-masked only the pseudoautosomal regions of the Y chromosome, because these regions are duplicated identically in the reference genome on the X chromosome. We analyzed transcript abundance in the whole blood, brain cortex, breast, liver, and thyroid tissues from 20 genetic female (46, XX) and 20 genetic male (46, XY) samples. Each sample was aligned twice; once to the default reference genome and then independently aligned to a reference genome informed by the sex chromosome complement of the sample, repeated using two different read aligners, HISAT and STAR. We then quantified sex differences in gene expression using featureCounts to get the raw count estimates followed by Limma/Voom for normalization and differential expression. We additionally created sex chromosome complement informed transcriptome references for use in pseudo-alignment using Salmon. Transcript abundance was quantified twice for each sample; once to the default target transcripts and then independently to target transcripts informed by the sex chromosome complement of the sample.ConclusionsWe show that regardless of the choice of read aligner, using an alignment protocol informed by the sex chromosome complement of the sample results in higher expression estimates on the pseudoautosomal regions of the X chromosome in both genetic male and genetic female samples, as well as an increased number of unique genes being called as differentially expressed between the sexes. We additionally show that using a pseudo-alignment approach informed on the sex chromosome complement of the sample eliminates Y-linked expression in female XX samples.Author summaryThe human X and Y chromosomes share an evolutionary origin and sequence homology, including regions of 100% identity; this sequence homology can result in reads misaligning between the sex chromosomes, X and Y. We hypothesized that misalignment of reads on the sex chromosomes would confound estimates of transcript abundance if the sex chromosome complement of the sample is not accounted for during the alignment step. For example, because of shared sequence similarity, X-linked reads could misalign to the Y chromosome. This is expected to result in reduced expression for regions between X and Y that share high levels of homology. For this reason, we tested the effect of using a default reference genome versus a reference genome informed by the sex chromosome complement of the sample on estimates of transcript abundance in human RNA-Seq samples from whole blood, brain cortex, breast, liver, and thyroid tissues of 20 genetic female (46, XX) and 20 genetic male (46, XY) samples. We found that using a reference genome with the sex chromosome complement of the sample resulted in higher measurements of X-linked gene transcription for both male and female samples and more differentially expressed genes on the X and Y chromosomes. We additionally investigated the use of a sex chromosome complement informed transcriptome reference index for alignment free quantification protocols. We observed no Y-linked expression in female XX samples only when the transcript quantification was performed using a transcriptome reference index informed on the sex chromosome complement of the sample. We recommend that future studies requiring aligning RNA-Seq reads to a reference genome or pseudo-alignment with a transcriptome reference should consider the sex chromosome complement of their samples prior to running default pipelines.

Endocrinology ◽  
2013 ◽  
Vol 154 (3) ◽  
pp. 1092-1104 ◽  
Author(s):  
Xuqi Chen ◽  
Rebecca McClusky ◽  
Yuichiro Itoh ◽  
Karen Reue ◽  
Arthur P. Arnold

Abstract Three different models of MF1 strain mice were studied to measure the effects of gonadal secretions and sex chromosome type and number on body weight and composition, and on related metabolic variables such as glucose homeostasis, feeding, and activity. The 3 genetic models varied sex chromosome complement in different ways, as follows: 1) “four core genotypes” mice, comprising XX and XY gonadal males, and XX and XY gonadal females; 2) the XY* model comprising groups similar to XO, XX, XY, and XXY; and 3) a novel model comprising 6 groups having XO, XX, and XY chromosomes with either testes or ovaries. In gonadally intact mice, gonadal males were heavier than gonadal females, but sex chromosome complement also influenced weight. The male/female difference was abolished by adult gonadectomy, after which mice with 2 sex chromosomes (XX or XY) had greater body weight and percentage of body fat than mice with 1 X chromosome. A second sex chromosome of either type, X or Y, had similar effects, indicating that the 2 sex chromosomes each possess factors that influence body weight and composition in the MF1 genetic background. Sex chromosome complement also influenced metabolic variables such as food intake and glucose tolerance. The results reveal a role for the Y chromosome in metabolism independent of testes and gonadal hormones and point to a small number of X–Y gene pairs with similar coding sequences as candidates for causing these effects.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Lingzhan Xue ◽  
Yu Gao ◽  
Meiying Wu ◽  
Tian Tian ◽  
Haiping Fan ◽  
...  

Abstract Background The origin of sex chromosomes requires the establishment of recombination suppression between the proto-sex chromosomes. In many fish species, the sex chromosome pair is homomorphic with a recent origin, providing species for studying how and why recombination suppression evolved in the initial stages of sex chromosome differentiation, but this requires accurate sequence assembly of the X and Y (or Z and W) chromosomes, which may be difficult if they are recently diverged. Results Here we produce a haplotype-resolved genome assembly of zig-zag eel (Mastacembelus armatus), an aquaculture fish, at the chromosomal scale. The diploid assembly is nearly gap-free, and in most chromosomes, we resolve the centromeric and subtelomeric heterochromatic sequences. In particular, the Y chromosome, including its highly repetitive short arm, has zero gaps. Using resequencing data, we identify a ~7 Mb fully sex-linked region (SLR), spanning the sex chromosome centromere and almost entirely embedded in the pericentromeric heterochromatin. The SLRs on the X and Y chromosomes are almost identical in sequence and gene content, but both are repetitive and heterochromatic, consistent with zero or low recombination. We further identify an HMG-domain containing gene HMGN6 in the SLR as a candidate sex-determining gene that is expressed at the onset of testis development. Conclusions Our study supports the idea that preexisting regions of low recombination, such as pericentromeric regions, can give rise to SLR in the absence of structural variations between the proto-sex chromosomes.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Shuhua Zhan ◽  
Cortland Griswold ◽  
Lewis Lukens

Abstract Background Genetic variation for gene expression is a source of phenotypic variation for natural and agricultural species. The common approach to map and to quantify gene expression from genetically distinct individuals is to assign their RNA-seq reads to a single reference genome. However, RNA-seq reads from alleles dissimilar to this reference genome may fail to map correctly, causing transcript levels to be underestimated. Presently, the extent of this mapping problem is not clear, particularly in highly diverse species. We investigated if mapping bias occurred and if chromosomal features associated with mapping bias. Zea mays presents a model species to assess these questions, given it has genotypically distinct and well-studied genetic lines. Results In Zea mays, the inbred B73 genome is the standard reference genome and template for RNA-seq read assignments. In the absence of mapping bias, B73 and a second inbred line, Mo17, would each have an approximately equal number of regulatory alleles that increase gene expression. Remarkably, Mo17 had 2–4 times fewer such positively acting alleles than did B73 when RNA-seq reads were aligned to the B73 reference genome. Reciprocally, over one-half of the B73 alleles that increased gene expression were not detected when reads were aligned to the Mo17 genome template. Genes at dissimilar chromosomal ends were strongly affected by mapping bias, and genes at more similar pericentromeric regions were less affected. Biased transcript estimates were higher in untranslated regions and lower in splice junctions. Bias occurred across software and alignment parameters. Conclusions Mapping bias very strongly affects gene transcript abundance estimates in maize, and bias varies across chromosomal features. Individual genome or transcriptome templates are likely necessary for accurate transcript estimation across genetically variable individuals in maize and other species.


Genome ◽  
2004 ◽  
Vol 47 (6) ◽  
pp. 1105-1113 ◽  
Author(s):  
Alicia Felip ◽  
Atushi Fujiwara ◽  
William P Young ◽  
Paul A Wheeler ◽  
Marc Noakes ◽  
...  

Most fish species show little morphological differentiation in the sex chromosomes. We have coupled molecular and cytogenetic analyses to characterize the male-determining region of the rainbow trout (Oncorhynchus mykiss) Y chromosome. Four genetically diverse male clonal lines of this species were used for genetic and physical mapping of regions in the vicinity of the sex locus. Five markers were genetically mapped to the Y chromosome in these male lines, indicating that the sex locus was located on the same linkage group in each of the lines. We also confirmed the presence of a Y chromosome morphological polymorphism among these lines, with the Y chromosomes from two of the lines having the more common heteromorphic Y chromosome and two of the lines having Y chromosomes morphologically similar to the X chromosome. The fluorescence in situ hybridization (FISH) pattern of two probes linked to sex suggested that the sex locus is physically located on the long arm of the Y chromosome. Fishes appear to be an excellent group of organisms for studying sex chromosome evolution and differentiation in vertebrates because they show considerable variability in the mechanisms and (or) patterns involved in sex determination.Key words: sex chromosomes, sex markers, cytogenetics, rainbow trout, fish.


2014 ◽  
Vol 35 (2) ◽  
pp. 221-229 ◽  
Author(s):  
Bharti Manwani ◽  
Kathryn Bentivegna ◽  
Sharon E Benashski ◽  
Venugopal Reddy Venna ◽  
Yan Xu ◽  
...  

Epidemiologic studies have shown sex differences in ischemic stroke. The four core genotype (FCG) mouse model, in which the testes determining gene, Sry, has been moved from Y chromosome to an autosome, was used to dissociate the effects of sex hormones from sex chromosome in ischemic stroke outcome. Middle cerebral artery occlusion (MCAO) in gonad intact FCG mice revealed that gonadal males (XXM and XYM) had significantly higher infarct volumes as compared with gonadal females (XXF and XYF). Serum testosterone levels were equivalent in adult XXM and XYM, as was serum estrogen in XXF and XYF mice. To remove the effects of gonadal hormones, gonadectomized FCG mice were subjected to MCAO. Gonadectomy significantly increased infarct volumes in females, while no change was seen in gonadectomized males, indicating that estrogen loss increases ischemic sensitivity. Estradiol supplementation in gonadectomized FCG mice rescued this phenotype. Interestingly, FCG male mice were less sensitive to effects of hormones. This may be due to enhanced expression of the transgene Sry in brains of FCG male mice. Sex differences in ischemic stroke sensitivity appear to be shaped by organizational and activational effects of sex hormones, rather than sex chromosomal complement.


2019 ◽  
Vol 116 (38) ◽  
pp. 19031-19036 ◽  
Author(s):  
Iulia Darolti ◽  
Alison E. Wright ◽  
Benjamin A. Sandkam ◽  
Jake Morris ◽  
Natasha I. Bloch ◽  
...  

Once recombination is halted between the X and Y chromosomes, sex chromosomes begin to differentiate and transition to heteromorphism. While there is a remarkable variation across clades in the degree of sex chromosome divergence, far less is known about the variation in sex chromosome differentiation within clades. Here, we combined whole-genome and transcriptome sequencing data to characterize the structure and conservation of sex chromosome systems across Poeciliidae, the livebearing clade that includes guppies. We found that the Poecilia reticulata XY system is much older than previously thought, being shared not only with its sister species, Poecilia wingei, but also with Poecilia picta, which diverged roughly 20 million years ago. Despite the shared ancestry, we uncovered an extreme heterogeneity across these species in the proportion of the sex chromosome with suppressed recombination, and the degree of Y chromosome decay. The sex chromosomes in P. reticulata and P. wingei are largely homomorphic, with recombination in the former persisting over a substantial fraction. However, the sex chromosomes in P. picta are completely nonrecombining and strikingly heteromorphic. Remarkably, the profound degradation of the ancestral Y chromosome in P. picta is counterbalanced by the evolution of functional chromosome-wide dosage compensation in this species, which has not been previously observed in teleost fish. Our results offer important insight into the initial stages of sex chromosome evolution and dosage compensation.


2004 ◽  
Vol 16 (5) ◽  
pp. 527 ◽  
Author(s):  
Jennifer A. Marshall Graves

The human Y chromosome is running out of time. In the last 300 million years, it has lost 1393 of its original 1438 genes, and at this rate it will lose the last 45 in a mere 10 million years. But there has been a proposal that perhaps rescue is at hand in the form of recently discovered gene conversion within palindromes. However, I argue here that although conversion will increase the frequency of variation of the Y (particularly amplification) between Y chromosomes in a population, it will not lead to a drive towards a more functional Y. The forces of evolution have made the Y a genetically isolated, non-recombining entity, vulnerable to genetic drift and selection for favourable new variants sharing the Y with damaging mutations. Perhaps it will even speed up the decline of the Y chromosome and the onset of a new round of sex-chromosome differentiation. The struggle to preserve males may perhaps lead to hominid speciation.


2019 ◽  
Author(s):  
Pui-Pik Law ◽  
Ping-Kei Chan ◽  
Kirsten McEwen ◽  
Huihan Zhi ◽  
Bing Liang ◽  
...  

SummarySex differences in growth rate in very early embryos have been recognized in a variety of mammals and attributed to sex-chromosome complement effects as they occur before overt sexual differentiation. We previously found that sex-chromosome complement, rather than sex hormones regulates heterochromatin-mediated silencing of a transgene and autosomal gene expression in mice. Here, sex dimorphism in proliferation was investigated. We confirm that male embryonic fibroblasts proliferate faster than female fibroblasts and show that this proliferation advantage is completely dependent upon heterochromatin protein 1 gamma (HP1γ). To determine whether this sex-regulatory effect of HP1γ was a more general phenomenon, we performed RNA sequencing on MEFs derived from males and females, with or without HP1γ. Strikingly, HP1γ was found to be crucial for regulating nearly all sexually dimorphic autosomal gene expression because deletion of the HP1γ gene in males abolished sex differences in autosomal gene expression. The identification of a key epigenetic modifier as central in defining gene expression differences between males and females has important implications for understanding physiological sex differences and sex bias in disease.


Sign in / Sign up

Export Citation Format

Share Document