scholarly journals Multiple Alignment of Promoter Sequences from the Arabidopsis thaliana L. Genome

Genes ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 135
Author(s):  
Eugene V. Korotkov ◽  
Yulia M. Suvorova ◽  
Dmitrii O. Kostenko ◽  
Maria A. Korotkova

In this study, we developed a new mathematical method for performing multiple alignment of highly divergent sequences (MAHDS), i.e., sequences that have on average more than 2.5 substitutions per position (x). We generated sets of artificial DNA sequences with x ranging from 0 to 4.4 and applied MAHDS as well as currently used multiple sequence alignment algorithms, including ClustalW, MAFFT, T-Coffee, Kalign, and Muscle to these sets. The results indicated that most of the existing methods could produce statistically significant alignments only for the sets with x < 2.5, whereas MAHDS could operate on sequences with x = 4.4. We also used MAHDS to analyze a set of promoter sequences from the Arabidopsis thaliana genome and discovered many conserved regions upstream of the transcription initiation site (from −499 to +1 bp); a part of the downstream region (from +1 to +70 bp) also significantly contributed to the obtained alignments. The possibilities of applying the newly developed method for the identification of promoter sequences in any genome are discussed. A server for multiple alignment of nucleotide sequences has been created.

1986 ◽  
Vol 6 (10) ◽  
pp. 3329-3340 ◽  
Author(s):  
H L Sive ◽  
N Heintz ◽  
R G Roeder

As part of our studies on the cell cycle regulation of human histone gene expression, we examined the elements governing transcription of a human histone H2B gene in nuclear extracts derived from human HeLa cells. Circular templates were transcribed at 5- to 10-fold higher levels than were linear templates. A series of deletion, linker-substitution, and point mutants defined cis-acting promoter sequences that were recognized in nuclear extracts. These sequences extended from 118 to 21 base pairs 5' to the transcription initiation site. Elements recognized included (from 5' to 3') a series of direct repeats, a CCAAT homology, a human histone-specific hexamer, an H2B consensus element, and a TATA box. Sequence elements 5' to the hexamer were required for its function. In contrast, the H2B consensus element could function independently of more-5' promoter elements and in turn was essential for the function of upstream elements. An interesting feature of this consensus is that its core octanucleotide (ATTTGCAT) is found in several nonhistone genes. By comparison with functional elements in an H4 promoter, we infer that a combinatorial interaction of general and gene-specific factors may contribute to the S-phase elevation of H2B transcription.


1986 ◽  
Vol 6 (10) ◽  
pp. 3329-3340
Author(s):  
H L Sive ◽  
N Heintz ◽  
R G Roeder

As part of our studies on the cell cycle regulation of human histone gene expression, we examined the elements governing transcription of a human histone H2B gene in nuclear extracts derived from human HeLa cells. Circular templates were transcribed at 5- to 10-fold higher levels than were linear templates. A series of deletion, linker-substitution, and point mutants defined cis-acting promoter sequences that were recognized in nuclear extracts. These sequences extended from 118 to 21 base pairs 5' to the transcription initiation site. Elements recognized included (from 5' to 3') a series of direct repeats, a CCAAT homology, a human histone-specific hexamer, an H2B consensus element, and a TATA box. Sequence elements 5' to the hexamer were required for its function. In contrast, the H2B consensus element could function independently of more-5' promoter elements and in turn was essential for the function of upstream elements. An interesting feature of this consensus is that its core octanucleotide (ATTTGCAT) is found in several nonhistone genes. By comparison with functional elements in an H4 promoter, we infer that a combinatorial interaction of general and gene-specific factors may contribute to the S-phase elevation of H2B transcription.


1991 ◽  
Vol 11 (1) ◽  
pp. 533-543
Author(s):  
R M Mulligan ◽  
P Leon ◽  
V Walbot

Lysed maize mitochondria synthesize RNA in the presence of radioactive nucleoside triphosphates, and this assay was utilized to compare the rates of transcription of seven genes. The rates of incorporation varied over a 14-fold range, with the following rank order: 18S rRNA greater than 26S rRNA greater than atp1 greater than atp6 greater than atp9 greater than cob greater than cox3. The products of run-on transcription hybridized specifically to known transcribed regions and selectively to the antisense DNA strand; thus, the isolated run-on transcription system appears to be an accurate representation of endogenous transcription. Although there were small differences in gene copy abundance, these differences cannot account for the differences in apparent transcription rates; we conclude that promoter strength is the main determinant. Among the protein coding genes, incorporation was greatest for atp1. The most active transcription initiation site of this gene was characterized by hybridization with in vitro-capped RNA and by primer extension analyses. The DNA sequences at this and other transcription initiation sites that we have previously mapped were analyzed with respect to the apparent promoter strengths. We propose that two short sequence elements just upstream of initiation sites form at least a portion of the sequence requirements for a maize mitochondrial promoter. In addition to modulation at the level of transcription, steady-state abundance of protein-coding mRNAs varied over a 20-fold range and did not correlate with transcriptional activity. These observations suggest that posttranscriptional processes are important in the modulation of mRNA abundance.


1984 ◽  
Vol 4 (5) ◽  
pp. 875-882
Author(s):  
M J Imperiale ◽  
J R Nevins

Utilizing deletion mutants of a plasmid containing the adenovirus E2 gene, an E1A-inducible transcription unit, we determined the promoter sequences required for full expression in transient transfection assays. Wild-type expression was obtained from plasmids containing only 79 nucleotides of upstream sequence relative to the transcription initiation site. Removal of an additional nine nucleotides lowered expression 10-fold, and deletion to -59 resulted in near total loss of transcription. Wild-type levels of expression were restored to a -28 deletion mutant by insertion of the sequence from -21 to -262 from the wild-type promoter at the -28 position, in either orientation, even though when inserted in the opposite orientation the relevant sequences were ca. 270 nucleotides upstream from their normal position. Finally, this sequence could be placed at a distance of 4,000 nucleotides from the E2 cap site and still retain near total function. Thus, the E2 promoter element can function independent of orientation and position, properties characteristic of enhancer elements.


1987 ◽  
Vol 7 (12) ◽  
pp. 4185-4193
Author(s):  
K A Roebuck ◽  
R J Walker ◽  
W E Stumph

The DNA sequence requirements of chicken U1 RNA gene expression have been examined in an oocyte transcription system. An enhancer region, which was required for efficient U1 RNA gene expression, is contained within a region of conserved DNA sequences spanning nucleotide positions -230 to -183, upstream of the transcriptional initiation site. These DNA sequences can be divided into at least two distinct subregions or domains that acted synergistically to provide a greater than 20-fold stimulation of U1 RNA synthesis. The first domain contains the octamer sequence ATGCAAAT and was recognized by a DNA-binding factor present in HeLa cell extracts. The second domain (the SPH domain) consists of conserved sequences immediately downstream of the octamer and is an essential component of the enhancer. In the oocyte, the DNA sequences of the SPH domain were able to enhance gene expression at least 10-fold in the absence of the octamer domain. In contrast, the octamer domain, although required for full U1 RNA gene activity, was unable to stimulate expression in the absence of the adjacent downstream DNA sequences. These findings imply that sequences 3' of the octamer play a major role in the function of the chicken U1 RNA gene enhancer. This concept was supported by transcriptional competition studies in which a cloned chicken U4B RNA gene was used to compete for limiting transcription factors in oocytes. Multiple sequence motifs that can function in a variety of cis-linked configurations may be a general feature of vertebrate small nuclear RNA gene enhancers.


1984 ◽  
Vol 4 (5) ◽  
pp. 875-882 ◽  
Author(s):  
M J Imperiale ◽  
J R Nevins

Utilizing deletion mutants of a plasmid containing the adenovirus E2 gene, an E1A-inducible transcription unit, we determined the promoter sequences required for full expression in transient transfection assays. Wild-type expression was obtained from plasmids containing only 79 nucleotides of upstream sequence relative to the transcription initiation site. Removal of an additional nine nucleotides lowered expression 10-fold, and deletion to -59 resulted in near total loss of transcription. Wild-type levels of expression were restored to a -28 deletion mutant by insertion of the sequence from -21 to -262 from the wild-type promoter at the -28 position, in either orientation, even though when inserted in the opposite orientation the relevant sequences were ca. 270 nucleotides upstream from their normal position. Finally, this sequence could be placed at a distance of 4,000 nucleotides from the E2 cap site and still retain near total function. Thus, the E2 promoter element can function independent of orientation and position, properties characteristic of enhancer elements.


1990 ◽  
Vol 10 (4) ◽  
pp. 1390-1398
Author(s):  
P J Farnham ◽  
A L Means

The murine dihydrofolate reductase gene is regulated by a bidirectional promoter that lacks a TATA box. To identify the DNA sequences required for dihydrofolate reductase transcription, the activities of various templates were determined by in vitro transcription analysis. Our data indicate that sequences both upstream and downstream of the transcription initiation site modulate the activity of the dihydrofolate reductase promoter. We have focused on two regions downstream of the transcription initiation site that are important in determining the overall efficiency of the promoter. Region 1, which included exon 1 and part of intron 1, could stimulate transcription when placed in either orientation in the normal downstream position and when inserted upstream of the transcription start site. This region could also stimulate transcription in trans when the enhancer was physically separate from the promoter. Deletion of region 2, spanning 46 nucleotides of the 5' untranslated region, reduced transcriptional activity by fivefold. DNase I footprinting reactions identified protein-binding sites in both downstream stimulatory regions. Protein bound to two sites in region 1, both of which contain an inverted CCAAT box. The protein-binding site in the 5' untranslated region has extensive homology to binding sites in promoters that both lack (simian virus 40 late) and contain (adenovirus type 2 major late promoter and c-myc) TATA boxes.


1990 ◽  
Vol 172 (2) ◽  
pp. 609-620 ◽  
Author(s):  
T J Vasicek ◽  
P Leder

We determined the DNA sequence of two large regions of chromosome 22: 33.7 kb containing the C lambda complex; and 5.2 kb 5' of the functionally rearranged lambda gene from the human myeloma, U266. Analysis of these sequences reveals the complete structure of the human C lambda complex and a previously undescribed seventh C lambda region that may encode the Ke+Oz- lambda protein. The seven constant regions are organized in a tandem array, and each is preceded by a single J lambda region. lambda 1, lambda 2, lambda 3, and lambda 7 are apparently active genes, while lambda 4, lambda 5, and lambda 6 are pseudogenes. There are no other J lambda or C lambda regions within a 60-kb region surrounding the C lambda complex; however, there are at least four other lambda-like genes and lambda pseudogenes in the human genome. The lambda genes appear to have evolved via a series of gene duplication events resulting from unequal crossing over or gene conversion between the highly conserved C lambda regions on mispaired chromosomes. The lack of Alu sequences in this large segment of DNA suggests that the C lambda complex resulted from a recent amplification of a smaller Alu-free segment of DNA. Illegitimate recombination between repeated sequences containing lambda 2 and lambda 3 may be responsible for variable amplification of the lambda genes. We also found a 1,377-bp open reading frame (ORF) located on the opposite strand in the region containing lambda 7. While this ORF is flanked by potential RNA splicing signals, we have no evidence that it is part of a functional gene. We also discovered a V lambda pseudogene, called psi V lambda 1, 3 kb upstream of the U266 lambda gene. Using primer extension analysis to map the transcription start in the human lambda gene, we have identified its initiation point 41 bp upstream of the initiation codon. Analysis of the lambda promoter reveals that it contains a TATAA box at position -29 relative to the transcription initiation site and an octamer sequence at -67. Computer analysis of 40 kb of DNA sequences surrounding the human lambda locus has revealed no sequences resembling the kappa or IgH transcriptional enhancers, nor have in vitro analyses for function revealed enhancer activity. A comparison of these results with those obtained in separate studies with transgenic mice point to a complex, developmentally linked mechanism of transcriptional activation.


1991 ◽  
Vol 11 (11) ◽  
pp. 5756-5766 ◽  
Author(s):  
A Travis ◽  
J Hagman ◽  
R Grosschedl

The mb-1 gene, encoding a membrane immunoglobulin-associated protein, is developmentally regulated and expressed specifically in pre-B and mature B lymphocytes. Analysis of the TATA-less mb-1 promoter indicated that it directs initiation of transcription from multiple sites. Promoter sequences between -68 and +70 conferred the correct pattern of cell type-specific transcription upon a heterologous gene. Two nuclear factor-binding sites that are important for promoter function were identified between -59 and -38. Both sites interacted with ubiquitous nuclear factors in vitro. One of these factors was identified as Sp1. Multimerized copies of both factor-binding sites augmented expression from a heterologous minimal promoter in both lymphoid and nonlymphoid cells, suggesting that additional mb-1 promoter sequences are involved in determining the correct cell type specificity. Analysis of the heterogeneity of transcription initiation indicated that a mutation which increased the distance between upstream sequences and the region of initiation resulted in the utilization of a novel set of initiation sites. Moreover, an insertion of a TATA element into the mb-1 promoter at -30 biased initiation of transcription to +1 but did not abolish the use of the other sites. Mutation of an initiator sequence homology encompassing one of the major initiation sites had only a minor effect on its utilization. From these data, we conclude that upstream factor-binding sites in the TATA-less mb-1 promoter define a region in which initiation of transcription occurs at multiple sites.


Sign in / Sign up

Export Citation Format

Share Document