genomic length
Recently Published Documents


TOTAL DOCUMENTS

24
(FIVE YEARS 7)

H-INDEX

13
(FIVE YEARS 2)

2021 ◽  
Author(s):  
Sara Victoria Good ◽  
Ryan Gotesman ◽  
Ilya Kisselev ◽  
Andrew D. Paterson

Abstract GWAS have identified thousands of loci associated with human complex diseases and traits. How these loci are distributed through the genome has not been systematically evaluated. We hypothesised that the location of GWAS loci differ between ancestral linkage groups (ALGs) related to the paralogy and function of genes. We used data from the NHGRI-EBI GWAS catalog to determine whether the density of GWAS loci relative to HapMap variants in each ALG differed, and whether ALG’s were enriched for experimental factor ontological (EFO) terms assigned to the GWAS traits. In a gene-level analyses we explored the characteristics of genes linked to GWAS loci and those mapping to the ALG’s. We find that GWAS loci were enriched or deficient in 9 and 7 of the 17 ALG’s respectively, while there was no difference in the number of GWAS loci in regions of the human genome unassigned to an ALG. All but 2 ALG’s were significantly enriched or deficient for one or more EFO terms. Lastly, we find that genes assigned to an ALG are under higher levels of selective constraint, have longer coding sequences and higher median expression in the tissue of highest expression than genes not mapping to an ALG. On the other hand, genes associated with GWAS loci have longer genomic length and exhibit higher levels of selective constraint relative to non-GWAS genes.Collectively, this suggests that understanding the location and ancestral origins of GWAS signals may be informative for the development of tools for variant prioritization and interpretation.


2021 ◽  
Author(s):  
Anthony Chamings ◽  
Tarka Raj Bhatta ◽  
Soren Alexandersen

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has spread rapidly in the global population since its emergence in humans in late 2019. Replication of SARS-CoV-2 is characterised by transcription and replication of genomic length RNA and shorter subgenomic RNAs to produce virus proteins and ultimately progeny virions. Here we explore the pattern of both genome-length and subgenomic RNAs and positive and negative strand SARS-CoV-2 RNAs in diagnostic nasopharyngeal swabs using sensitive probe based PCR assays as well as Ampliseq panels designed to target subgenomic RNAs. Using these assays, we measured the ratios of genomic to subgenomic RNAs as well as the ratios of positive to negative strand RNAs in SARS-CoV-2 positive nasopharyngeal swab samples. We found that while subgenomic RNAs and negative strand RNA can be readily detected in swab samples taken up to 19 and 17 days post symptom onset respectively, and therefore their detection alone is not likely an indicator of active SARS-CoV-2 replication. However, the ratios of genomic-length to subgenomic RNA and also of positive to negative strand RNA were elevated in some swabs, particularly those collected around the onset of clinical symptoms or in an individual with decreasing PCR Cts in successive swab samples. We tentatively conclude that it may be possible to refine such molecular assays to help determine if active replication of virus is occurring and progeny virions likely present in a SARS-CoV-2 positive individual. Assays targeting subgenomic N or ORF7a RNAs as well as strand specific ORF7a total genome-length and subgenomic RNAs may be the most sensitive for this purpose as these targets were consistently the most abundant in the swab samples.


2021 ◽  
Vol 4 (8) ◽  
pp. e202101028
Author(s):  
Zhicheng Cai ◽  
Yueying He ◽  
Sirui Liu ◽  
Yue Xue ◽  
Hui Quan ◽  
...  

Dinucleotide densities and their distribution patterns vary significantly among species. Previous studies revealed that CpG is susceptible to methylation, enriched at topologically associating domain boundaries and its distribution along the genome correlates with chromatin compartmentalization. However, the multi-scale organizations of CpG in the linear genome, their role in chromatin organization, and how they change along the evolution are only partially understood. By comparing the CpG distribution at different genomic length scales, we quantify the difference between the CpG distributions of different species and evaluate how the hierarchical uneven CpG distribution appears in evolution. The clustering of species based on the CpG distribution is consistent with the phylogenetic tree. Interestingly, we found the CpG distribution and chromatin structure to be correlated in many different length scales, especially for mammals and avians, consistent with the mosaic CpG distribution in the genomes of these species.


Genes ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 809
Author(s):  
Angelo Pavesi

During their long evolutionary history viruses generated many proteins de novo by a mechanism called “overprinting”. Overprinting is a process in which critical nucleotide substitutions in a pre-existing gene can induce the expression of a novel protein by translation of an alternative open reading frame (ORF). Overlapping genes represent an intriguing example of adaptive conflict, because they simultaneously encode two proteins whose freedom to change is constrained by each other. However, overlapping genes are also a source of genetic novelties, as the constraints under which alternative ORFs evolve can give rise to proteins with unusual sequence properties, most importantly the potential for novel functions. Starting with the discovery of overlapping genes in phages infecting Escherichia coli, this review covers a range of studies dealing with detection of overlapping genes in small eukaryotic viruses (genomic length below 30 kb) and recognition of their critical role in the evolution of pathogenicity. Origin of overlapping genes, what factors favor their birth and retention, and how they manage their inherent adaptive conflict are extensively reviewed. Special attention is paid to the assembly of overlapping genes into ad hoc databases, suitable for future studies, and to the development of statistical methods for exploring viral genome sequences in search of undiscovered overlaps.


2020 ◽  
Vol 101 (3) ◽  
pp. 271-283 ◽  
Author(s):  
Anna L. McNaughton ◽  
Peter A. Revill ◽  
Margaret Littlejohn ◽  
Philippa C. Matthews ◽  
M. Azim Ansari

2019 ◽  
Author(s):  
Anna L McNaughton ◽  
Peter Revill ◽  
Margaret Littlejohn ◽  
Philippa C Matthews ◽  
M Azim Ansari

ABSTRACTHepatitis B virus (HBV) is a diverse, partially double-stranded DNA virus, with 9 genotypes (A-I), and a putative 10th genotype (J), thus far characterised. Given the broadening interest in HBV sequencing, there is an increasing requirement for a consistent, unified approach to HBV genotype and subgenotype classification. We set out to generate an updated resource of reference sequences using the diversity of all genomic-length HBV sequences available in public databases. We collated and aligned genomic-length HBV sequences from public databases and used maximum-likelihood phylogenetic analysis to identify genotype clusters. Within each genotype, we examined the phylogenetic support for currently defined subgenotypes, as well as identifying well-supported clades and deriving reference sequences for them. An alignment of these reference sequences and maximum-likelihood phylogenetic trees of the sequences are provided to simplify classification. Based on the phylogenies generated, we present a comprehensive set of HBV reference sequences at the genotype and subgenotype level.


Phytotaxa ◽  
2019 ◽  
Vol 408 (3) ◽  
pp. 178-194
Author(s):  
RANIA A. HASSAN ◽  
WAFAA M. AMER

Datura stramonium (Solanaceae) is an annual weed found in most temperate and subtropical regions of the world. The taxonomic identity of this species is still under debate since Linnaeus first published the name. Early on, many varieties and forms were recognized. In Egypt, D. stramonium presents in two forms: the white-flowered ‘stramonium’ form and violet-flowered ‘tatula’ form. Some authors treated D. stramonium and D. tatula as two distinct species, while others included D. tatula within D. stramonium as either a variety or forma. The present study aimed to elucidate the taxonomic identity of both the white ‘stramonium’ and the violet ‘tatula’ forms based on morphological, palynological and cytogenetic studies extended to karyotyping. A taxonomic study of these forms was carried out using 75 morphological and pollen characters. Significant morphological differences were observed; the most important ones were flower and stem colour, in addition to flower and fruit dimensions. The anatomical examination of juvenile-fruit in acropetal transverse sections, revealed the gradual displacement of parietal placentation at the fruit base to axile at the fruit apex. Pollen of both forms showed no significant differences. Cytogenetic results revealed the presence of a diploid chromosome number (2n=2x=24) in both forms, with minor aneuploidy in the ‘tatula’ form. According to the karyotyping, notable differences were found between the two studied forms, including the centromeric index, total genomic length, and intrachromosomal asymmetry index. Morphological and cytogenetic data revealed that the two forms are different enough to be treated taxonomically as two distinct varieties, namely D. stramonium var. stramonium and D. stramonium var. tatula.


eLife ◽  
2017 ◽  
Vol 6 ◽  
Author(s):  
Daniel B Weissman ◽  
Oskar Hallatschek

Samples of multiple complete genome sequences contain vast amounts of information about the evolutionary history of populations, much of it in the associations among polymorphisms at different loci. We introduce a method, Minimal-Assumption Genomic Inference of Coalescence (MAGIC), that reconstructs key features of the evolutionary history, including the distribution of coalescence times, by integrating information across genomic length scales without using an explicit model of coalescence or recombination, allowing it to analyze arbitrarily large samples without phasing while making no assumptions about ancestral structure, linked selection, or gene conversion. Using simulated data, we show that the performance of MAGIC is comparable to that of PSMC’ even on single diploid samples generated with standard coalescent and recombination models. Applying MAGIC to a sample of human genomes reveals evidence of non-demographic factors driving coalescence.


Soft Matter ◽  
2016 ◽  
Vol 12 (33) ◽  
pp. 6975-6984 ◽  
Author(s):  
Mert Arca ◽  
Anthony J. C. Ladd ◽  
Jason E. Butler
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document