Human DNA sequences homologous to a protein coding region conserved between homeotic genes of Drosophila

Cell ◽  
1984 ◽  
Vol 38 (3) ◽  
pp. 667-673 ◽  
Author(s):  
Michael Levine ◽  
Gerald M. Rubin ◽  
Robert Tjian
1987 ◽  
Vol 7 (8) ◽  
pp. 2933-2940
Author(s):  
H Honkawa ◽  
W Masahashi ◽  
S Hashimoto ◽  
T Hashimoto-Gotoh

A number of deletion mutants were isolated, including 5', 3', and internal deletions in the 5'-flanking region of the human cellular oncogene related to the Harvey sarcoma virus (c-H-ras), and their transforming activities were examined in NIH 3T3 cells. DNA sequences which could not be detected without losing transforming activity were localized to a relatively short stretch upstream of the region which showed homology to the 5'-flanking region of v-H-ras oncogene. S1 nuclease analysis indicated that there were two clusters of mRNA start sites at positions that were about 1,371 and 1,298 base pairs upstream of the first coding ATG. The minimum region required for promoter function was estimated to be a 51-base-pair-long (or less) DNA segment. The promoter was GC rich (78%) and did not contain the consensus sequences that are usually observed in PolII-directed promoters but contained a GC box within which one of the mRNA start sites was included. In addition, two sets of positive and negative elements seemed to be located between the promoter and the protein-coding region, which appeared to influence positively and negatively, respectively, the efficiency of transformation with the c-H-ras oncogene.


2018 ◽  
Vol 7 (1.9) ◽  
pp. 167
Author(s):  
Bipin Nair B J ◽  
Rahul Reghunath

The protein coding and functional regions in DNA sequences has become an exciting task in bioinformatics. In particular, the coding region has a 3-base periodicity, which helps for exon identification. Many signal processing tools and techniques have been successfully applied to identify tasks, but still need to be improved in this direction. In our work, we employ ANN classifier to predict coding and functional region of proteinin human embryo cell protein in first trimester, and evaluate their performances according to the comparison energy levels of coding region. The obtained from the threshold energy level, results show that in a box plot finally predict the mutation.


1986 ◽  
Vol 6 (12) ◽  
pp. 4161-4167 ◽  
Author(s):  
M K Dush ◽  
J A Tischfield ◽  
S A Khan ◽  
E Feliciano ◽  
J M Sikela ◽  
...  

A mouse adenine phosphoribosyltransferase (aprt) pseudogene that had previously been recovered from a BALB/c sperm DNA library possessed several unusual features. Its nucleotide sequence, like that of other processed pseudogenes, was colinear with its corresponding mRNA, but it was truncated at its 3' end and lacked a poly(A) tail. The pseudogene was 82% homologous with corresponding regions of the functional gene and had incurred mutations that included transitions, transversions, deletions, and a point insertion. Even though the pseudogene was truncated within the protein-coding region of the corresponding functional gene, it was flanked at both ends by 13-base-pair direct repeats. Curiously, the direct repeats exhibited homology to APRT mRNA at the site of pseudogene divergence. The pseudogene appeared to be common to BALB/c and A/J mice, but it was contained on a 3-kilobase EcoRI fragment in the former strain and a 4.5-kilobase EcoRI fragment in the latter. The BALB/c and apparently the A/J pseudogene both mapped to chromosome 8, which also contains the functional aprt gene. The DNA sequences immediately surrounding the pseudogene in the two strains appeared to be similar, suggesting that the BALB/c and A/J pseudogenes are allelic. However, DNA sequences more distal to the pseudogene in the two strains appeared to vary. Thus, the EcoRI polymorphism was not due to simple loss of an EcoRI site, but was more complex. The pattern of flanking restriction sites was different for each of several enzymes, consistent with extensive DNA rearrangement. Double digests of BALB/c and A/J genomic DNAs revealed complex polymorphisms on both sides of the pseudogene. The results were consistent with insertion, deletion, or other rearrangement of DNA sequences that flank the pseudogene and suggest that this region of mouse chromosome 8 may be a region active for mutation or recombination.


1984 ◽  
Vol 4 (3) ◽  
pp. 507-513
Author(s):  
Y H Chien ◽  
I B Dawid

Two cDNAs derived from Xenopus laevis calmodulin mRNA have been cloned. Both cDNAs contain the complete protein-coding region and various lengths of untranslated segments. The two cDNAs encode an identical protein but differ from each other by 5% nucleotide substitutions. The 5' and 3' untranslated regions, to the extent available, are highly homologous between the two cDNAs. The predicted sequence of X. laevis calmodulin is identical to that of vertebrate calmodulins from mammals and chickens and shows one substitution compared with electric eel calmodulin. Genomic DNA sequences homologous to each of the two cDNA clones have been isolated and were shown to account for the major calmodulin-coding DNA sequences in X. laevis. These data suggest that X. laevis carries two active, nonallelic calmodulin genes. Although no complete analysis has been carried out, it appears that the X. laevis calmodulin genes are interrupted by at least four introns. The relative concentrations of calmodulin mRNA have been estimated in different embryonic stages and adult tissues and found to vary by up to a factor of 10. The highest levels of calmodulin mRNA were found in ovaries, testes, and brains. In these three tissues, the two calmodulin genes appear to be expressed at approximately equal levels.


2014 ◽  
Vol 2014 ◽  
pp. 1-7
Author(s):  
Kiran Sree Pokkuluri ◽  
Ramesh Babu Inampudi ◽  
S. S. S. N. Usha Devi Nedunuri

Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006) dataset and nonpromoters from EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively.


2017 ◽  
Author(s):  
Anuj Kumar ◽  
Aditi Chauhan ◽  
Mansi Sharma ◽  
Sai Kumar Kompelli ◽  
Vijay Gahlaut ◽  
...  

AbstractSimple Sequence Repeats (SSRs), also known as microsatellites are short tandem repeats of DNA sequences that are 1-6 bp long. In plants, SSRs serve as a source of important class of molecular markers because of their hypervariabile and co-dominant nature, making them useful both for the genetic studies and marker-assisted breeding. The SSRs are widespread throughout the genome of an organism, so that a large number of SSR datasets are available, most of them from either protein-coding regions or untranslated regions. It is only recently, that their occurrence within microRNAs (miRNA) genes has received attention. As is widely known, miRNA themselves are a class of non-coding RNAs (ncRNAs) with varying length of 19-22 nucleotides (nts), which play an important role in regulating gene expression in plants under different biotic and abiotic stresses. In this communication, we describe the results of a study, where miRNA-SSRs in full length pre-miRNA sequences of Arabidopsis thaliana were mined. The sequences were retrieved by annotations available at EnsemblPlants using BatchPrimer3 server with miRNA-SSR flanking primers found to be well distributed. Our analysis shows that miRNA-SSRs are relatively rare in protein-coding regions but abundant in non-coding region. All the observed 147 di-, tri-, tetra-, penta- and hexanucleotide SSRs were located in non-coding regions of all the 5 chromosomes of A. thaliana. While we confirm that miRNA-SSRs were commonly spread across the full length pre-miRNAs, we envisage that such studies would allow us to identify newly discovered markers for breeding studies.


1986 ◽  
Vol 6 (12) ◽  
pp. 4161-4167
Author(s):  
M K Dush ◽  
J A Tischfield ◽  
S A Khan ◽  
E Feliciano ◽  
J M Sikela ◽  
...  

A mouse adenine phosphoribosyltransferase (aprt) pseudogene that had previously been recovered from a BALB/c sperm DNA library possessed several unusual features. Its nucleotide sequence, like that of other processed pseudogenes, was colinear with its corresponding mRNA, but it was truncated at its 3' end and lacked a poly(A) tail. The pseudogene was 82% homologous with corresponding regions of the functional gene and had incurred mutations that included transitions, transversions, deletions, and a point insertion. Even though the pseudogene was truncated within the protein-coding region of the corresponding functional gene, it was flanked at both ends by 13-base-pair direct repeats. Curiously, the direct repeats exhibited homology to APRT mRNA at the site of pseudogene divergence. The pseudogene appeared to be common to BALB/c and A/J mice, but it was contained on a 3-kilobase EcoRI fragment in the former strain and a 4.5-kilobase EcoRI fragment in the latter. The BALB/c and apparently the A/J pseudogene both mapped to chromosome 8, which also contains the functional aprt gene. The DNA sequences immediately surrounding the pseudogene in the two strains appeared to be similar, suggesting that the BALB/c and A/J pseudogenes are allelic. However, DNA sequences more distal to the pseudogene in the two strains appeared to vary. Thus, the EcoRI polymorphism was not due to simple loss of an EcoRI site, but was more complex. The pattern of flanking restriction sites was different for each of several enzymes, consistent with extensive DNA rearrangement. Double digests of BALB/c and A/J genomic DNAs revealed complex polymorphisms on both sides of the pseudogene. The results were consistent with insertion, deletion, or other rearrangement of DNA sequences that flank the pseudogene and suggest that this region of mouse chromosome 8 may be a region active for mutation or recombination.


1987 ◽  
Vol 7 (8) ◽  
pp. 2933-2940 ◽  
Author(s):  
H Honkawa ◽  
W Masahashi ◽  
S Hashimoto ◽  
T Hashimoto-Gotoh

A number of deletion mutants were isolated, including 5', 3', and internal deletions in the 5'-flanking region of the human cellular oncogene related to the Harvey sarcoma virus (c-H-ras), and their transforming activities were examined in NIH 3T3 cells. DNA sequences which could not be detected without losing transforming activity were localized to a relatively short stretch upstream of the region which showed homology to the 5'-flanking region of v-H-ras oncogene. S1 nuclease analysis indicated that there were two clusters of mRNA start sites at positions that were about 1,371 and 1,298 base pairs upstream of the first coding ATG. The minimum region required for promoter function was estimated to be a 51-base-pair-long (or less) DNA segment. The promoter was GC rich (78%) and did not contain the consensus sequences that are usually observed in PolII-directed promoters but contained a GC box within which one of the mRNA start sites was included. In addition, two sets of positive and negative elements seemed to be located between the promoter and the protein-coding region, which appeared to influence positively and negatively, respectively, the efficiency of transformation with the c-H-ras oncogene.


Genome ◽  
2011 ◽  
Vol 54 (5) ◽  
pp. 368-376 ◽  
Author(s):  
Andrew T Beckenbach

The complete mitochondrial DNA sequences of a hangingfly, Bittacus pilicornis (Mecoptera: Bittacidae), a snow scorpion fly, Boreus elegans (Mecoptera: Boreidae), and a nearly complete sequence from another scorpionfly species, Microchorista philpotti (Mecoptera: Nannochoristidae) were determined. The coding sequence of all three genomes includes the 37 genes normally found in insect mtDNAs, in the same gene order as first described in Drosophila. In addition to the standard set of genes, the Microchorista sequence includes a large duplication of the coding region. The duplication is at least 4 kb (and may be much larger) and includes the remnants of three protein-coding genes and seven tRNA genes. The duplication evidently arose as a single event, and the duplicated region can be aligned in its entirety with the corresponding region of the functional genome. Although most of the genes contain defects that render them nonfunctional, analysis shows that the protein-coding genes in the duplicated region evolved for a considerable period under constraints expected of functional protein-coding genes. It is evident, therefore, that for a period two copies of some of the mitochondrial genes were functional in this species, including genes coding for proteins.


Sign in / Sign up

Export Citation Format

Share Document