genomic signals
Recently Published Documents


TOTAL DOCUMENTS

87
(FIVE YEARS 27)

H-INDEX

16
(FIVE YEARS 3)

2022 ◽  
Author(s):  
Anna Grandchamp ◽  
Katrin Berk ◽  
Elias Dohmen ◽  
Erich Bornberg-Bauer

De novo genes are novel genes which emerge from non-coding DNA. Until now, little is known about de novo genes properties, correlated to their age and mechanisms of emergence. In this study, we investigate four properties: introns, upstream regulatory motifs, 5 prime UTRs and protein domains, in 23135 human proto-genes. We found that proto-genes contain introns, whose number and position correlates with the genomic position of proto-gene emergence. The origin of these introns is debated, as our result suggest that 41% proto-genes might have captured existing introns, as well as the fact that 13.7% of them do not splice the ORF. We show that proto-genes which emerged via overprinting tend to be more enriched in core promotor motifs, while intergenic and intronic ones are more enriched in enhancers, even if the motif TATA is most expressed upstream these genes. Intergenic and intronic 5 prime UTRs of proto-genes have a lower potential to stabilise mRNA structures than exonic proto-genes and established human genes. Finally, we confirm that proto-genes gain new putative domains with age. Overall, we find that regulatory motifs inducing transcription and translation of previously non-coding sequences may facilitate proto-gene emergence. Our paper demonstrates that introns, 5 prime UTRs, and domains have specific properties in proto-genes. We also show the importance of studying proto-genes in relation to their genomic position, as it strongly impacts these properties.


Author(s):  
Fang Li ◽  
Rahul V. Rane ◽  
Victor Luria ◽  
Zijun Xiong ◽  
Jiawei Chen ◽  
...  

2021 ◽  
Author(s):  
Qian Zhao ◽  
Longqing Shi ◽  
Weiyi He ◽  
Jinyu Li ◽  
Shijun You ◽  
...  

The tea green leafhopper (TGL), Empoasca onukii, is of biological and economic interest. Despite numerous studies, the mechanisms underlying its adaptation and evolution remain enigmatic. Here, we used previously untapped genome and population genetics approaches to examine how this pest so rapidly has adapted to different environmental variables and thus has expanded geographically. We complete a chromosome-level assembly and annotation of the E. onukii genome, showing notable expansions of gene families associated with adaptation to chemoreception and detoxification. Genomic signals indicating balancing selection highlight metabolic pathways involved in adaptation to a wide range of tea varieties grown across ecologically diverse regions. Patterns of genetic variation among 54 E. onukii samples unveil the population structure and evolutionary history across different tea-growing regions in China. Our results demonstrate that the genomic change in key pathways, including those linked to metabolism, circadian rhythms and immune system function, may underlie the successful spread and adaptation of E. onukii. This work highlights the genetic and molecular bases underlying the evolutionary success of a species with broad economic impact, and provides insight into insect adaptation to host plants, which will ultimately facilitate more sustainable pest management.


2021 ◽  
Author(s):  
Jose M. G. Vilar ◽  
Leonor Saiz

The prevalent one-dimensional alignment of genomic signals to a reference landmark is a cornerstone of current methods to study transcription and its DNA-dependent processes but it is prone to mask potential relations among multiple DNA elements. We developed a systematic approach to align genomic signals to multiple locations simultaneously by expanding the dimensionality of the genomic-coordinate space. We analyzed transcription in human and uncovered a complex dependence on the relative position of neighboring transcription start sites (TSSs) that is consistently conserved among cell types. The dependence ranges from enhancement to suppression of transcription depending on the relative distances to the TSSs, their intragenic position, and the transcriptional activity of the gene. Our results reveal a conserved hierarchy of alternative TSS usage within a previously unrecognized level of genomic organization and provide a general methodology to analyze complex functional relationships among multiple types of DNA elements.


2021 ◽  
Author(s):  
Shannon J. O'Leary ◽  
Christopher M. Hollenbeck ◽  
Robert R. Vega ◽  
David S. Portnoy

2021 ◽  
Author(s):  
Shohre Masoumi ◽  
Maxwell W. Libbrecht ◽  
Kay C. Wiese

Motivation: With the advancement of sequencing technologies, genomic data sets are constantly being expanded by high volumes of different data types. One recently introduced data type in genomic science is genomic signals, which are usually short-read coverage measurements over the genome. An example of genomic signals is Epigenomic marks which are utilized to locate functional and nonfunctional elements in genome annotation studies. To understand and evaluate the results of such studies, one needs to understand and analyze the characteristics of the input data. Results: SigTools is an R-based genomic signals visualization package developed with two objectives: 1) to facilitate genomic signals exploration in order to uncover insights for later model training, refinement, and development by including distribution and autocorrelation plots. 2) to enable genomic signals interpretation by including correlation, and aggregation plots. Moreover, Sigtools also provides text-based descriptive statistics of the given signals which can be practical when developing and evaluating learning models. We also include results from 2 case studies. The first examines several previously studied genomic signals called histone modifications. This use case demonstrates how SigTools can be beneficial for satisfying scientists curiosity in exploring and establishing recognized datasets. The second use case examines a dataset of novel chromatin state features which are novel genomic signals generated by a learning model. This use case demonstrates how SigTools can assist in exploring the characteristics and behavior of novel signals towards their interpretation. In addition, our corresponding web application, SigTools-Shiny, extends the accessibility scope of these modules to people who are more comfortable working with graphical user interfaces instead of command-line tools.


2021 ◽  
Author(s):  
Laura L Colbran ◽  
Maya R Johnson ◽  
Iain Mathieson ◽  
John A Capra

As humans spread throughout the world, they adapted to variation in many environmental factors, including climate, diet, and pathogens. Because many of these adaptations were likely mediated by multiple non-coding variants with small effects on gene regulation, it has been difficult to link genomic signals of selection to specific genes, and to describe the regulatory response to selection. To overcome this challenge, we adapted PrediXcan, a machine learning method for imputing gene regulation from genotype data, to analyze low-coverage ancient human DNA (aDNA). First, we used simulated genomes to benchmark strategies for adapting gene regulatory prediction to increase robustness to incomplete aDNA data. Applying the resulting models to 490 ancient Eurasians, we found that genes with the strongest divergent regulation among ancient populations with hunter-gatherer, pastoralist, and agricultural lifestyles are enriched for metabolic and immune functions. Next, we explored the contribution of divergent gene regulation to two traits with strong evidence of recent adaptation: dietary metabolism and skin pigmentation. We found enrichment for divergent regulation among genes previously proposed to be involved in diet-related local adaptation, and in many cases, the predicted effects on regulation provide explanations for previously observed signals of selection, e.g., at FADS1, GPX1, and LEPR. For skin pigmentation, we applied new models trained in melanocytes to a time series of 2999 ancient Europeans spanning ~38,000 years BP. In contrast to diet, skin pigmentation genes show little regulatory change over time, suggesting that adaptation mainly involved large-effect coding variants. This work demonstrates how aDNA can be combined with present-day genomes to shed light on the biological differences among ancient populations, the role of gene regulation in adaptation, and the relationship between ancient genetic diversity and the present-day distribution of complex traits.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Therese H. Nøst ◽  
Marit Holden ◽  
Tom Dønnem ◽  
Hege Bøvelstad ◽  
Charlotta Rylander ◽  
...  

AbstractRecent studies have indicated that there are functional genomic signals that can be detected in blood years before cancer diagnosis. This study aimed to assess gene expression in prospective blood samples from the Norwegian Women and Cancer cohort focusing on time to lung cancer diagnosis and metastatic cancer using a nested case–control design. We employed several approaches to statistically analyze the data and the methods indicated that the case–control differences were subtle but most distinguishable in metastatic case–control pairs in the period 0–3 years prior to diagnosis. The genes of interest along with estimated blood cell populations could indicate disruption of immunological processes in blood. The genes identified from approaches focusing on alterations with time to diagnosis were distinct from those focusing on the case–control differences. Our results support that explorative analyses of prospective blood samples could indicate circulating signals of disease-related processes.


2021 ◽  
Vol 49 (5) ◽  
pp. 2674-2683
Author(s):  
Akhilesh Mishra ◽  
Priyanka Siwach ◽  
Pallavi Misra ◽  
Simran Dhiman ◽  
Ashutosh Kumar Pandey ◽  
...  

Abstract Precise identification of correct exon–intron boundaries is a prerequisite to analyze the location and structure of genes. The existing framework for genomic signals, delineating exon and introns in a genomic segment, seems insufficient, predominantly due to poor sequence consensus as well as limitations of training on available experimental data sets. We present here a novel concept for characterizing exon–intron boundaries in genomic segments on the basis of structural and energetic properties. We analyzed boundary junctions on both sides of all the exons (3 28 368) of protein coding genes from human genome (GENCODE database) using 28 structural and three energy parameters. Study of sequence conservation at these sites shows very poor consensus. It is observed that DNA adopts a unique structural and energy state at the boundary junctions. Also, signals are somewhat different for housekeeping and tissue specific genes. Clustering of 31 parameters into four derived vectors gives some additional insights into the physical mechanisms involved in this biological process. Sites of structural and energy signals correlate well to the positions playing important roles in pre-mRNA splicing.


Sign in / Sign up

Export Citation Format

Share Document