Transposable Element Evolution in the Allotetraploid Capsella bursa-pastoris

Mapping Intimacies ◽

10.1101/042325 ◽

2016 ◽

Author(s):

J. Arvid Ågren ◽

Hui-Run Huang ◽

Stephen I. Wright

Keyword(s):

Transposable Element ◽

Evolutionary Dynamics ◽

Sequence Data ◽

Diploid Species ◽

Whole Genome Sequence ◽

Hybrid Breakdown ◽

Relaxed Selection ◽

High Coverage ◽

Genome Wide ◽

A Genome

AbstractPremise of the studyShifts in ploidy affect the evolutionary dynamics of genomes in a myriad of ways. Population genetic theory predicts that transposable element (TE) proliferation may follow because the genome wide efficacy of selection should be reduced and the increase in gene copies may mask the deleterious effects of TE insertions. Moreover, in allopolyploids TEs may further accumulate because of hybrid breakdown of TE silencing. However, to date the evidence of TE proliferation following an increase in ploidy is mixed, and the relative importance of relaxed selection vs. silencing breakdown remains unclear.MethodsWe used high-coverage whole genome sequence data to evaluate the abundance, genomic distribution, and population frequencies of TEs in the self-fertilizing recent allotetraploid Capsella bursa-pastoris (Brassicaceae). We then compared the C. bursa-pastoris TE profile with that of its two parental diploid species, outcrossing C. grandiflora and self-fertilizing C. orientalis.Key resultsWe found no evidence that C. bursa-pastoris has experienced a large genome wide proliferation of TEs relative to its parental species. However, when centromeric regions are excluded, we find evidence of significantly higher abundance of retrotransposons in C. bursa-pastoris along the gene-rich chromosome arms, compared to C.grandiflora and C. orientalis.ConclusionsThe lack of a genome-wide effect of allopolyploidy on TE abundance, combined with the increases TE abundance in gene-rich regions suggest that relaxed selection rather than hybrid breakdown of host silencing explains the TE accumulation in C. bursa-pastoris

Mapping the genetic basis of diabetes mellitus in the Australian Burmese cat (Felis catus)

Scientific Reports ◽

10.1038/s41598-020-76166-3 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Georgina Samaha ◽

Claire M. Wade ◽

Julia Beatty ◽

Leslie A. Lyons ◽

Linda M. Fleeman ◽

...

Keyword(s):

Diabetes Mellitus ◽

Genome Wide Association Study ◽

Genetic Model ◽

Sequence Data ◽

Pancreatic Beta Cell ◽

Whole Genome Sequence ◽

Cell Dysfunction ◽

Genome Wide ◽

A Genome ◽

Pathologic Features

Abstract Diabetes mellitus, a common endocrinopathy affecting domestic cats, shares many clinical and pathologic features with type 2 diabetes in humans. In Australia and Europe, diabetes mellitus is almost four times more common among Burmese cats than in other breeds. As a genetically isolated population, the diabetic Australian Burmese cat provides a spontaneous genetic model for studying diabetes mellitus in humans. Studying complex diseases in pedigreed breeds facilitates tighter control of confounding factors including population stratification, allelic frequencies and environmental heterogeneity. We used the feline SNV array and whole genome sequence data to undertake a genome wide-association study and runs of homozygosity analysis, of a case–control cohort of Australian and European Burmese cats. Our results identified diabetes-associated haplotypes across chromosomes A3, B1 and E1 and selective sweeps across the Burmese breed on chromosomes B1, B3, D1 and D4. The locus on chromosome B1, common to both analyses, revealed coding and splice region variants in candidate genes, ANK1, EPHX2 and LOX2, implicated in diabetes mellitus and lipid dysregulation. Mapping this condition in Burmese cats has revealed a polygenic spectrum, implicating loci linked to pancreatic beta cell dysfunction, lipid dysregulation and insulin resistance in the pathogenesis of diabetes mellitus in the Burmese cat.

A genome-wide scan statistic framework for whole-genome sequence data analysis

Nature Communications ◽

10.1038/s41467-019-11023-0 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 3

Author(s):

Zihuai He ◽

Bin Xu ◽

Joseph Buxbaum ◽

Iuliana Ionita-Laza

Keyword(s):

Data Analysis ◽

Genome Sequence ◽

Sequence Data ◽

Whole Genome Sequence ◽

Whole Genome ◽

Scan Statistic ◽

Genome Sequence Data ◽

Genome Wide ◽

A Genome ◽

Genome Wide Scan

Genome-wide association analysis reveals QTL and candidate mutations involved in white spotting in cattle

Genetics Selection Evolution ◽

10.1186/s12711-019-0506-2 ◽

2019 ◽

Vol 51 (1) ◽

Cited By ~ 4

Author(s):

Swati Jivanji ◽

Gemma Worth ◽

Thomas J. Lopdell ◽

Anna Yeates ◽

Christine Couldrey ◽

...

Keyword(s):

Association Analysis ◽

Molecular Mechanisms ◽

Coat Color ◽

Sequence Data ◽

Genome Wide Association ◽

Whole Genome Sequence ◽

Genome Wide Association Analysis ◽

Genome Wide ◽

Holstein Friesian ◽

A Genome

Abstract Background White spotting of the coat is a characteristic trait of various domestic species including cattle and other mammals. It is a hallmark of Holstein–Friesian cattle, and several previous studies have detected genetic loci with major effects for white spotting in animals with Holstein–Friesian ancestry. Here, our aim was to better understand the underlying genetic and molecular mechanisms of white spotting, by conducting the largest mapping study for this trait in cattle, to date. Results Using imputed whole-genome sequence data, we conducted a genome-wide association analysis in 2973 mixed-breed cows and bulls. Highly significant quantitative trait loci (QTL) were found on chromosomes 6 and 22, highlighting the well-established coat color genes KIT and MITF as likely responsible for these effects. These results are in broad agreement with previous studies, although we also report a third significant QTL on chromosome 2 that appears to be novel. This signal maps immediately adjacent to the PAX3 gene, which encodes a known transcription factor that controls MITF expression and is the causal locus for white spotting in horses. More detailed examination of these loci revealed a candidate causal mutation in PAX3 (p.Thr424Met), and another candidate mutation (rs209784468) within a conserved element in intron 2 of MITF transcripts expressed in the skin. These analyses also revealed a mechanistic ambiguity at the chromosome 6 locus, where highly dispersed association signals suggested multiple or multiallelic QTL involving KIT and/or other genes in this region. Conclusions Our findings extend those of previous studies that reported KIT as a likely causal gene for white spotting, and report novel associations between candidate causal mutations in both the MITF and PAX3 genes. The sizes of the effects of these QTL are substantial, and could be used to select animals with darker, or conversely whiter, coats depending on the desired characteristics.

Genome-wide identification of lineage and locus specific variation associated with pneumococcal carriage duration

10.1101/107086 ◽

2017 ◽

Cited By ~ 1

Author(s):

John A. Lees ◽

Nicholas J. Croucher ◽

Goldblatt David ◽

Nosten Francois ◽

Parkhill Julian ◽

...

Keyword(s):

Environmental Factors ◽

Low Income ◽

Genome Wide Association Study ◽

Sequence Data ◽

Genomic Variation ◽

Whole Genome Sequence ◽

Vaccine Response ◽

Pneumococcal Carriage ◽

Genome Wide ◽

A Genome

AbstractStreptococcus pneumoniae is a leading cause of invasive disease in infants, especially in low-income settings. Asymptomatic carriage in the nasopharynx is a prerequisite for disease, and the duration of carriage is an important consideration in modelling transmission dynamics and vaccine response. Existing studies of carriage duration variability are based at the serotype level only, and do not probe variation within lineages or fully quantify interactions with other environmental factors.Here we developed a model to calculate the duration of carriage episodes from longitudinal swab data. By combining these results with whole genome sequence data we estimate that pneumococcal genomic variation accounted for 63% of the phenotype variation, whereas host traits accounted for less than 5%. We further partitioned this heritability into both lineage and locus effects, and quantified the amount attributable to the largest sources of variation in carriage duration: serotype (17%), drug-resistance (9%) and other significant locus effects (7%). For the locus effects, a genome-wide association study identified 16 loci which may have an effect on carriage duration independent of serotype. Hits at a genome-wide level of significance were to prophage sequences, suggesting infection by such viruses substantially affects carriage duration.These results show that both serotype and non-serotype specific effects alter carriage duration in infants and young children and are more important than other environmental factors such as host genetics. This has implications for models of pneumococcal competition and antibiotic resistance, and leads the way for the analysis of heritability of complex bacterial traits.Significance statementOther than serotype, the genetic determinants of pneumococcal carriage duration are unknown. In this study we used longitudinal sampling to measure the duration of carriage in infants, and searched for any associated variation in the pan-genome. While we found that the pathogen genome explains most of the variability in duration, serotype did not fully account for this. Recent theoretical work has proposed the existence of alleles which alter carriage duration to explain the puzzle of continued coexistence of antibiotic-resistant and sensitive strains. Here we have shown that these alleles do exist in a natural population, and also identified candidates for the loci which fulfil this role. Together these findings have implications for future modelling of pneumococcal epidemiology and resistance.

Genome diversity in Ukraine

GigaScience ◽

10.1093/gigascience/giaa159 ◽

2021 ◽

Vol 10 (1) ◽

Author(s):

Taras K Oleksyk ◽

Walter W Wolfsberger ◽

Alexandra M Weber ◽

Khrystyna Shchubelka ◽

Olga T Oleksyk ◽

...

Keyword(s):

Sequence Data ◽

Copy Number Variations ◽

Genomic Variation ◽

High Coverage ◽

Genome Data ◽

New Information ◽

Genome Wide ◽

Public Data ◽

Genome Wide Data ◽

Multiple Samples

Abstract Background The main goal of this collaborative effort is to provide genome-wide data for the previously underrepresented population in Eastern Europe, and to provide cross-validation of the data from genome sequences and genotypes of the same individuals acquired by different technologies. We collected 97 genome-grade DNA samples from consented individuals representing major regions of Ukraine that were consented for public data release. BGISEQ-500 sequence data and genotypes by an Illumina GWAS chip were cross-validated on multiple samples and additionally referenced to 1 sample that has been resequenced by Illumina NovaSeq6000 S4 at high coverage. Results The genome data have been searched for genomic variation represented in this population, and a number of variants have been reported: large structural variants, indels, copy number variations, single-nucletide polymorphisms, and microsatellites. To our knowledge, this study provides the largest to-date survey of genetic variation in Ukraine, creating a public reference resource aiming to provide data for medical research in a large understudied population. Conclusions Our results indicate that the genetic diversity of the Ukrainian population is uniquely shaped by evolutionary and demographic forces and cannot be ignored in future genetic and biomedical studies. These data will contribute a wealth of new information bringing forth a wealth of novel, endemic and medically related alleles.

SNP discovery by amplicon sequencing and multiplex SNP genotyping in the allopolyploid species Brassica napusThis article is one of a selection of papers from the conference “Exploiting Genome-wide Association in Oilseed Brassicas: a model for genetic improvement of major OECD crops for sustainable farming”.

Genome ◽

10.1139/g10-079 ◽

2010 ◽

Vol 53 (11) ◽

pp. 948-956 ◽

Cited By ~ 35

Author(s):

G. Durstewitz ◽

A. Polley ◽

J. Plieske ◽

H. Luerssen ◽

E. M. Graner ◽

...

Keyword(s):

Genetic Analysis ◽

Oilseed Rape ◽

Expressed Sequence Tag ◽

Diploid Species ◽

Amplicon Sequencing ◽

Snp Markers ◽

Snp Analysis ◽

Genome Wide ◽

A Genome ◽

Illumina Goldengate

Oilseed rape ( Brassica napus ) is an allotetraploid species consisting of two genomes, derived from B. rapa (A genome) and B. oleracea (C genome). The presence of these two genomes makes single nucleotide polymorphism (SNP) marker identification and SNP analysis more challenging than in diploid species, as for a given locus usually two versions of a DNA sequence (based on the two ancestral genomes) have to be analyzed simultaneously during SNP identification and analysis. One hundred amplicons derived from expressed sequence tag (ESTs) were analyzed to identify SNPs in a panel of oilseed rape varieties and within two sister species representing the ancestral genomes. A total of 604 SNPs were identified, averaging one SNP in every 42 bp. It was possible to clearly discriminate SNPs that are polymorphic between different plant varieties from SNPs differentiating the two ancestral genomes. To validate the identified SNPs for their use in genetic analysis, we have developed Illumina GoldenGate assays for some of the identified SNPs. Through the analysis of a number of oilseed rape varieties and mapping populations with GoldenGate assays, we were able to identify a number of different segregation patterns in allotetraploid oilseed rape. The majority of the identified SNP markers can be readily used for genetic mapping, showing that amplicon sequencing and Illumina GoldenGate assays can be used to reliably identify SNP markers in tetraploid oilseed rape and to convert them into successful SNP assays that can be used for genetic analysis.

A curated dataset of modern and ancient high-coverage shotgun human genomes

Scientific Data ◽

10.1038/s41597-021-00980-1 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Pierpaolo Maisano Delser ◽

Eppie R. Jones ◽

Anahit Hovhannisyan ◽

Lara Cassidy ◽

Ron Pinhasi ◽

...

Keyword(s):

Sequence Data ◽

Whole Genome ◽

Reference Dataset ◽

High Coverage ◽

Sample Distribution ◽

Human Samples ◽

Human Genomes ◽

Genome Wide ◽

Genome Wide Data ◽

Computationally Intensive

AbstractOver the last few years, genome-wide data for a large number of ancient human samples have been collected. Whilst datasets of captured SNPs have been collated, high coverage shotgun genomes (which are relatively few but allow certain types of analyses not possible with ascertained captured SNPs) have to be reprocessed by individual groups from raw reads. This task is computationally intensive. Here, we release a dataset including 35 whole-genome sequenced samples, previously published and distributed worldwide, together with the genetic pipeline used to process them. The dataset contains 72,041,355 sites called across 19 ancient and 16 modern individuals and includes sequence data from four previously published ancient samples which we sequenced to higher coverage (10–18x). Such a resource will allow researchers to analyse their new samples with the same genetic pipeline and directly compare them to the reference dataset without re-processing published samples. Moreover, this dataset can be easily expanded to increase the sample distribution both across time and space.

Transcriptome analysis ofSchistosoma mansonilarval development using serial analysis of gene expression (SAGE)

Parasitology ◽

10.1017/s0031182009005733 ◽

2009 ◽

Vol 136 (5) ◽

pp. 469-485 ◽

Cited By ~ 22

Author(s):

A. S. TAFT ◽

J. J. VERMEIRE ◽

J. BERNIER ◽

S. R. BIRKELAND ◽

M. J. CIPRIANO ◽

...

Keyword(s):

Gene Expression ◽

Sequence Data ◽

Subsequent Development ◽

Differentially Expressed ◽

Cdna Libraries ◽

Genome Wide ◽

A Genome ◽

Genome Wide Expression ◽

Cell Conditioned Medium

SUMMARYInfection of the snail,Biomphalaria glabrata, by the free-swimming miracidial stage of the human blood fluke,Schistosoma mansoni, and its subsequent development to the parasitic sporocyst stage is critical to establishment of viable infections and continued human transmission. We performed a genome-wide expression analysis of theS. mansonimiracidia and developing sporocyst using Long Serial Analysis of Gene Expression (LongSAGE). Five cDNA libraries were constructed from miracidia andin vitrocultured 6- and 20-day-old sporocysts maintained in sporocyst medium (SM) or in SM conditioned by previous cultivation with cells of theB. glabrataembryonic (Bge) cell line. We generated 21 440 SAGE tags and mapped 13 381 to theS. mansonigene predictions (v4.0e) either by estimating theoretical 3′ UTR lengths or using existing 3′ EST sequence data. Overall, 432 transcripts were found to be differentially expressed amongst all 5 libraries. In total, 172 tags were differentially expressed between miracidia and 6-day conditioned sporocysts and 152 were differentially expressed between miracidia and 6-day unconditioned sporocysts. In addition, 53 and 45 tags, respectively, were differentially expressed in 6-day and 20-day cultured sporocysts, due to the effects of exposure to Bge cell-conditioned medium.

Detection and application of genome-wide variations in peach for association and genetic relationship analysis

BMC Genetics ◽

10.1186/s12863-019-0799-8 ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 2

Author(s):

Liping Guan ◽

Ke Cao ◽

Yong Li ◽

Jian Guo ◽

Qiang Xu ◽

...

Keyword(s):

Genetic Relationship ◽

Dna Markers ◽

High Throughput Sequencing ◽

Prunus Persica ◽

Genetic Research ◽

Diploid Species ◽

Sequencing Data ◽

Relationship Analysis ◽

Genome Wide ◽

A Genome

Abstract Background Peach (Prunus persica L.) is a diploid species and model plant of the Rosaceae family. In the past decade, significant progress has been made in peach genetic research via DNA markers, but the number of these markers remains limited. Results In this study, we performed a genome-wide DNA markers detection based on sequencing data of six distantly related peach accessions. A total of 650,693~1,053,547 single nucleotide polymorphisms (SNPs), 114,227~178,968 small insertion/deletions (InDels), 8386~12,298 structure variants (SVs), 2111~2581 copy number variants (CNVs) and 229,357~346,940 simple sequence repeats (SSRs) were detected and annotated. To demonstrate the application of DNA markers, 944 SNPs were filtered for association study of fruit ripening time and 15 highly polymorphic SSRs were selected to analyze the genetic relationship among 221 accessions. Conclusions The results showed that the use of high-throughput sequencing to develop DNA markers is fast and effective. Comprehensive identification of DNA markers, including SVs and SSRs, would be of benefit to genetic diversity evaluation, genetic mapping, and molecular breeding of peach.

Genetic integrity of the Dark European honey bee (Apis mellifera mellifera) from protected populations: a genome-wide assessment using SNPs and mtDNA sequence data

Journal of Apicultural Research ◽

10.3896/ibra.1.53.2.08 ◽

2014 ◽

Vol 53 (2) ◽

pp. 269-278 ◽

Cited By ~ 56

Author(s):

M Alice Pinto ◽

Dora Henriques ◽

Julio Chávez-Galarza ◽

Per Kryger ◽

Lionel Garnery ◽

...

Keyword(s):

Apis Mellifera ◽

Honey Bee ◽

Sequence Data ◽

Genetic Integrity ◽

Apis Mellifera Mellifera ◽

Genome Wide ◽

A Genome ◽

Mtdna Sequence