Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M
2018-06-01
Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Floquet, Célia; Hatin, Isabelle; Rousset, Jean-Pierre; Bidou, Laure
2012-01-01
The efficiency of translation termination depends on the nature of the stop codon and the surrounding nucleotides. Some molecules, such as aminoglycoside antibiotics (gentamicin), decrease termination efficiency and are currently being evaluated for diseases caused by premature termination codons. However, the readthrough response to treatment is highly variable and little is known about the rules governing readthrough level and response to aminoglycosides. In this study, we carried out in-depth statistical analysis on a very large set of nonsense mutations to decipher the elements of nucleotide context responsible for modulating readthrough levels and gentamicin response. We quantified readthrough for 66 sequences containing a stop codon, in the presence and absence of gentamicin, in cultured mammalian cells. We demonstrated that the efficiency of readthrough after treatment is determined by the complex interplay between the stop codon and a larger sequence context. There was a strong positive correlation between basal and induced readthrough levels, and a weak negative correlation between basal readthrough level and gentamicin response (i.e. the factor of increase from basal to induced readthrough levels). The identity of the stop codon did not affect the response to gentamicin treatment. In agreement with a previous report, we confirm that the presence of a cytosine in +4 position promotes higher basal and gentamicin-induced readthrough than other nucleotides. We highlight for the first time that the presence of a uracil residue immediately upstream from the stop codon is a major determinant of the response to gentamicin. Moreover, this effect was mediated by the nucleotide itself, rather than by the amino-acid or tRNA corresponding to the −1 codon. Finally, we point out that a uracil at this position associated with a cytosine at +4 results in an optimal gentamicin-induced readthrough, which is the therapeutically relevant variable. PMID:22479203
Quach, Tommy; Brooks, Daniel M; Miranda, Hector C
2016-01-01
The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
Trotta, Edoardo
2016-05-17
The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
CRISPR-STOP: gene silencing through base-editing-induced nonsense mutations.
Kuscu, Cem; Parlak, Mahmut; Tufan, Turan; Yang, Jiekun; Szlachta, Karol; Wei, Xiaolong; Mammadov, Rashad; Adli, Mazhar
2017-07-01
CRISPR-Cas9-induced DNA damage may have deleterious effects at high-copy-number genomic regions. Here, we use CRISPR base editors to knock out genes by changing single nucleotides to create stop codons. We show that the CRISPR-STOP method is an efficient and less deleterious alternative to wild-type Cas9 for gene-knockout studies. Early stop codons can be introduced in ∼17,000 human genes. CRISPR-STOP-mediated targeted screening demonstrates comparable efficiency to WT Cas9, which indicates the suitability of our approach for genome-wide functional screenings.
Seligmann, Hervé
2013-05-07
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2016-01-01
The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
Seligmann, Hervé
2013-03-01
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Control of total GFP expression by alterations to the 3′ region nucleotide sequence
2013-01-01
Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827
Single nucleotide polymorphisms of Helicobacter pylori dupA that lead to premature stop codons.
Moura, Sílvia B; Costa, Rafaella F A; Anacleto, Charles; Rocha, Gifone A; Rocha, Andreia M C; Queiroz, Dulciene M M
2012-06-01
The detection of the putative disease-specific Helicobacter pylori marker duodenal ulcer promoting gene A (dupA) is currently based on PCR detection of jhp0917 and jhp0918 that form the gene. However, mutations that lead to premature stop codons that split off the dupA leading to truncated products cannot be evaluated by PCR. We directly sequence the complete dupA of 75 dupA-positive strains of H. pylori isolated from patients with gastritis (n = 26), duodenal ulcer (n = 29), and gastric carcinoma (n = 20), to search for frame-shifting mutations that lead to stop codon. Thirty-four strains had single nucleotide mutations in dupA that lead to premature stop codon creating smaller products than the predicted 1839 bp product and, for this reason, were considered as dupA-negative. Intact dupA was more frequently observed in strains isolated from duodenal ulcer patients (65.5%) than in patients with gastritis only (46.2%) or with gastric carcinoma (50%). In logistic analysis, the presence of the intact dupA independently associated with duodenal ulcer (OR = 5.06; 95% CI = 1.22-20.96, p = .02). We propose the primer walking methodology as a simple technique to sequence the gene. When we considered as dupA-positive only those strains that carry dupA gene without premature stop codons, the gene was associated with duodenal ulcer and, therefore, can be used as a marker for this disease in our population. © 2012 Blackwell Publishing Ltd.
A common periodic table of codons and amino acids.
Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z
2003-06-27
A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Hofhuis, Julia; Schueren, Fabian; Nötzel, Christopher; Lingner, Thomas; Gärtner, Jutta; Jahn, Olaf
2016-01-01
Translational readthrough gives rise to C-terminally extended proteins, thereby providing the cell with new protein isoforms. These may have different properties from the parental proteins if the extensions contain functional domains. While for most genes amino acid incorporation at the stop codon is far lower than 0.1%, about 4% of malate dehydrogenase (MDH1) is physiologically extended by translational readthrough and the actual ratio of MDH1x (extended protein) to ‘normal' MDH1 is dependent on the cell type. In human cells, arginine and tryptophan are co-encoded by the MDH1x UGA stop codon. Readthrough is controlled by the 7-nucleotide high-readthrough stop codon context without contribution of the subsequent 50 nucleotides encoding the extension. All vertebrate MDH1x is directed to peroxisomes via a hidden peroxisomal targeting signal (PTS) in the readthrough extension, which is more highly conserved than the extension of lactate dehydrogenase B. The hidden PTS of non-mammalian MDH1x evolved to be more efficient than the PTS of mammalian MDH1x. These results provide insight into the genetic and functional co-evolution of these dually localized dehydrogenases. PMID:27881739
The Role of +4U as an Extended Translation Termination Signal in Bacteria
Wei, Yulong; Xia, Xuhua
2017-01-01
Termination efficiency of stop codons depends on the first 3′ flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteriodetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria. PMID:27903612
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.
Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R
1982-01-01
The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.
Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier
2004-08-01
Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals.
El Houmami, Nawal; Seligmann, Hervé
2017-01-01
We present an evolutionary hypothesis assuming that signals marking nucleotide synthesis (DNA replication and RNA transcription) evolved from multi- to unidimensional structures, and were carried over from transcription to translation. This evolutionary scenario presumes that signals combining secondary and primary nucleotide structures are evolutionary transitions. Mitochondrial replication initiation fits this scenario. Some observations reported in the literature corroborate that several signals for nucleotide synthesis function in translation, and vice versa. (a) Polymerase-induced frameshift mutations occur preferentially at translational termination signals (nucleotide deletion is interpreted as termination of nucleotide polymerization, paralleling the role of stop codons in translation). (b) Stem-loop hairpin presence/absence modulates codon-amino acid assignments, showing that translational signals sometimes combine primary and secondary nucleotide structures (here codon and stem-loop). (c) Homopolymer nucleotide triplets (AAA, CCC, GGG, TTT) cause transcriptional and ribosomal frameshifts. Here we find in recently described human mitochondrial RNAs that systematically lack mono-, dinucleotides after each trinucleotide (delRNAs) that delRNA triplets include 2x more homopolymers than mitogenome regions not covered by delRNA. Further analyses of delRNAs show that the natural circular code X (a little-known group of 20 translational signals enabling ribosomal frame retrieval consisting of 20 codons {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} universally overrepresented in coding versus other frames of gene sequences), regulates frameshift in transcription and translation. This dual transcription and translation role confirms for X the hypothesis that translational signals were carried over from transcriptional signals.
The complete mitochondrial genome of the Longnose skate: Raja rhina (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2015-02-01
The complete sequence of mitochondrial DNA of a longnose skate, Raja rhina was determined for the first time. It is 16,910 bp in length containing 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of 30.1% A, 27.2% C, 28.5% T and 14.2% G, showing a slight A + T bias. The G is the least used base and markedly lower at the third codon position (5.4%). Twelve of the 13 protein coding genes use ATG as their start codon while the COX1 starts with GTG. As for stop codon, only ND4 shows incomplete stop codon TA. This mitogenome is the first report for a species of the genus Raja, and providing a valuable resource of genetic information for understanding the phylogenetic relationship and the evolution of the genus Raja as well as the family, Rajidae.
The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho
2014-12-01
The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome consists of 16,906 bp in length including 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure of the genome as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon while the COX1 gene starts with GTG. For stop codon, ND3 and ND4 genes show incomplete stop codon T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.
Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing
2016-12-01
Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Dong, Jia-Jia; Guan, De-Long; Xu, Sheng-Quan
2016-09-01
The complete mitogenome of Oxya intricate (Stål.) has been reconstructed from whole-genome Illumina sequencing data with an average coverage of 294×. The circular genome is 15,466 bp in length, and consists of 22 transfer RNAs (tRNAs), 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs) and 1 D-loop region. All PCGs are initiated with ATN codons, and are terminated with TAR codons except for ND5 with the incomplete stop codon T. The nucleotide composition is asymmetric (42.5%A, 14.6%C, 10.6%G, 32.3%T) with an overall GC content of 25.2%. These data would contribute to the design of novel molecular markers for population and evolutionary studies of this and related orthopteran species.
Peroxisomal lactate dehydrogenase is generated by translational readthrough in mammals
Schueren, Fabian; Lingner, Thomas; George, Rosemol; Hofhuis, Julia; Dickel, Corinna; Gärtner, Jutta; Thoms, Sven
2014-01-01
Translational readthrough gives rise to low abundance proteins with C-terminal extensions beyond the stop codon. To identify functional translational readthrough, we estimated the readthrough propensity (RTP) of all stop codon contexts of the human genome by a new regression model in silico, identified a nucleotide consensus motif for high RTP by using this model, and analyzed all readthrough extensions in silico with a new predictor for peroxisomal targeting signal type 1 (PTS1). Lactate dehydrogenase B (LDHB) showed the highest combined RTP and PTS1 probability. Experimentally we show that at least 1.6% of the total cellular LDHB is targeted to the peroxisome by a conserved hidden PTS1. The readthrough-extended lactate dehydrogenase subunit LDHBx can also co-import LDHA, the other LDH subunit, into peroxisomes. Peroxisomal LDH is conserved in mammals and likely contributes to redox equivalent regeneration in peroxisomes. DOI: http://dx.doi.org/10.7554/eLife.03640.001 PMID:25247702
Paulish-Miller, Teresa E.; Augostini, Peter; Schuyler, Jessica A.; Smith, William L.; Mordechai, Eli; Adelson, Martin E.; Gygax, Scott E.; Secor, William E.
2014-01-01
Metronidazole resistance in the sexually transmitted parasite Trichomonas vaginalis is a problematic public health issue. We have identified single nucleotide polymorphisms (SNPs) in two nitroreductase genes (ntr4Tv and ntr6Tv) associated with resistance. These SNPs were associated with one of two distinct T. vaginalis populations identified by multilocus sequence typing, yet one SNP (ntr6Tv A238T), which results in a premature stop codon, was associated with resistance independent of population structure and may be of diagnostic value. PMID:24550324
Novel mutations responsible for α-thalassemia in Iranian families.
Bayat, Nooshin; Farashi, Samaneh; Hafezi-Nejad, Nima; Faramarzi, Negin; Ashki, Mehri; Vakili, Shadi; Imanian, Hashem; Khosravi, Mohsen; Azar-Keivan, Azita; Najmabadi, Hossein
2013-01-01
α-Thalassemia (α-thal) is usually caused by deletions on the α-globin gene cluster and the role of point mutations is less well investigated. In the present study, a total of 1048 individuals with hypochromic microcytic anemia, who did not present the most common α-thal deletions, were referred for α-globin gene DNA sequencing. The nucleotide changes were studied and a total of five new mutations was identified, of which three were located on the α2 gene [codon7 (Lys→Stop), codon 34 (Leu→Pro) and codon 83 (Leu→Arg)] and two on the α1 gene [IVS-I-116 (A>G) and codon 44 (+C)]. These novel mutations not only explain new findings by molecular analysis of the α-globin gene but also have clinical importance due to their changes in α-globin production in means of decreased hemoglobin (Hb) related values. Moreover, considerations of its role in combination with other mutations, and the possibility of causing Hb H (β4) are yet to be studied.
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)9, (AT)8 elements. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)(3). The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)(9), (AT)(8) elements.
Seligmann, Hervé
2018-05-01
Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.
2018-01-01
Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
DOE Office of Scientific and Technical Information (OSTI.GOV)
Colledge, Danielle; Soppe, Sally; Yuen, Lilly
Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion.more » Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. - Highlights: •Under normal circumstances, HBV replication is not cytopathic. •Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. •Replication of these variants can be cytopathic to the cell and promote apoptosis. •Inadequate antiviral therapy may actually promote disease progression.« less
Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang
2015-01-01
Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Okumiya, T; Takenaka, T; Ishii, S; Kase, R; Kamei, S; Sakuraba, H
1996-09-01
Four alpha-galactosidase gene mutations were identified in Japanese male patients with Fabry disease who had no detectable alpha-galactosidase activity. Two of them were novel mutations, an 11-bp deletion in exon 2 and a g-1 to t substitution at the 3' end of the splice acceptor site in intron 1. The former caused a frameshift and led to the creation of a new stop codon at codon 118. The latter was predicted to provoke aberrant mRNA splicing followed by accelerated degradation of the mRNA. A nonsense mutation, R301X, and a 2-bp deletion starting at nucleotide position 718, which were reported previously, were also identified in unrelated patients.
Wada, Takahito; Haddad, Marie Reine; Yi, Ling; Murakami, Tomomi; Sasaki, Akiko; Shimbo, Hiroko; Kodama, Hiroko; Osaka, Hitoshi; Kaler, Stephen G
2014-04-01
Determining the relationship between clinical phenotype and genotype in genetic diseases is important in clinical practice. In general, frameshift mutations are expected to produce premature termination codons, leading to production of mutant transcripts destined for degradation by nonsense-mediated decay. In X-linked recessive diseases, male patients with frameshift mutations typically have a severe or even lethal phenotype. We report a case of a 17-month-old boy with Menkes disease (NIM #309400), an X-linked recessive copper metabolism disorder caused by mutations in the ATP7A copper transporter gene. He exhibited an unexpectedly late onset and experienced milder symptoms. His genomic DNA showed a de novo two-nucleotide deletion in exon 4 of ATP7A, predicting a translational frameshift and premature stop codon, and a classic severe phenotype. Characterization of his ATP7A mRNA showed no abnormal splicing. We speculate that translation reinitiation could occur downstream to the premature termination codon and produce a partially functional ATP7A protein. Study of the child's fibroblasts found no evidence of translation reinitiation; however, the possibility remains that this phenomenon occurred in neural tissues and influenced the clinical phenotype. Copyright © 2014 Elsevier Inc. All rights reserved.
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.
Alkalaeva, Elena; Mikhailova, Tatiana
2017-03-01
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Billon, Pierre; Bryant, Eric E; Joseph, Sarah A; Nambiar, Tarun S; Hayward, Samuel B; Rothstein, Rodney; Ciccia, Alberto
2017-09-21
Standard CRISPR-mediated gene disruption strategies rely on Cas9-induced DNA double-strand breaks (DSBs). Here, we show that CRISPR-dependent base editing efficiently inactivates genes by precisely converting four codons (CAA, CAG, CGA, and TGG) into STOP codons without DSB formation. To facilitate gene inactivation by induction of STOP codons (iSTOP), we provide access to a database of over 3.4 million single guide RNAs (sgRNAs) for iSTOP (sgSTOPs) targeting 97%-99% of genes in eight eukaryotic species, and we describe a restriction fragment length polymorphism (RFLP) assay that allows the rapid detection of iSTOP-mediated editing in cell populations and clones. To simplify the selection of sgSTOPs, our resource includes annotations for off-target propensity, percentage of isoforms targeted, prediction of nonsense-mediated decay, and restriction enzymes for RFLP analysis. Additionally, our database includes sgSTOPs that could be employed to precisely model over 32,000 cancer-associated nonsense mutations. Altogether, this work provides a comprehensive resource for DSB-free gene disruption by iSTOP. Copyright © 2017 Elsevier Inc. All rights reserved.
Global analysis of translation termination in E. coli.
Baggett, Natalie E; Zhang, Yan; Gross, Carol A
2017-03-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins.
Mutations in the glucose-6-phosphatase gene that cause glycogen storage disease type 1a
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chou, J.Y.; Lei, K.J.; Shelly, L.L.
1994-09-01
Glycogen storage disease (GSD) type la (von Gierke disease) is caused by the deficiency of glucose-6-phosphatase (G6Pase), the key enzyme in glucose homeostasis. The disease presents with clinical manifestations of severe hypoglycemia, hepatomegaly, growth retardation, lactic acidemia, hyperlipidemia, and hyperuricemia. We have succeeded in isolating a murine G6Pase cDNA from a normal mouse liver cDNA library by differentially screening method. We then isolated the human G6Pase cDNA and gene. To date, we have characterized the G6Pase genes of twelve GSD type la patients and uncovered a total of six different mutations. The mutations are comprised of R83C (an Arg atmore » codon 83 to a Cys), Q347X (a Gly at codon 347 to a stop codon), 459insTA (a two basepair insertion at nucleotide 459 yielding a truncated G6Pase of 129 residues), R295C (an Arg at codon 295 to a Cys), G222R (a Gly at codon 222 to an Arg) and {delta}F327 (a codon deletion for Phe-327 at nucleotides 1058 to 1060). The relative incidences of these mutations are 37.5% (R83C), 33.3% (Q347X), 16.6% (459insTA), 4.2% (G222R), 4.2% (R295C) and 4.2% ({delta}F327). Site-directed mutagenesis and transient expression assays demonstrated that the R83C, Q347X, R295C, and {delta}F327 mutations abolished whereas the G222R mutation greatly reduced G6Pase activity. We further characterized the structure-function requirements of amino acids 83, 222, and 295 in G6Pase catalysis. The identification of mutations in GSD type la patients has unequivocally established the molecular basis of the type la disorder. Knowledge of the mutations may be applied to prenatal diagnosis and opens the way for developing and evaluating new therapeutic approaches.« less
Recent evidence for evolution of the genetic code
NASA Technical Reports Server (NTRS)
Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.
1992-01-01
The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
Ocular phenotypes associated with two mutations (R121W, C126X) in the Norrie disease gene.
Kellner, U; Fuchs, S; Bornfeld, N; Foerster, M H; Gal, A
1996-06-01
To describe the ocular phenotypes associated with 2 mutations in the Norrie disease gene including a manifesting carrier. Ophthalmological examinations were performed in 2 affected males and one manifesting carrier. Genomic DNA was analyzed by direct sequencing of the Norrie disease gene. Family I: A 29-year-old male had the right eye enucleated at the age of 3 years. His left eye showed severe temporal dragging of the retina and central scars. Visual acuity was 20/300. DNA analysis revealed a C-to-T transition of the first nucleotide in codon 121 predicting the replacement of arginine-121 by tryptophan (R121W). Both the mother and maternal grandmother carry the same mutation in heterozygous form. Family 2: A 3-month-old boy presented with severe temporal dragging of the retina on both eyes and subsequently developed retinal detachment. Visual acuity was limited to light perception. His mother's left eye was amaurotic and phthitic. Her right eye showed severe retinal dragging, visual acuity was reduced to 20/60. DNA analysis revealed a T-to-A transversion of the third nucleotide in codon 126 creating a stop codon (C126X). The mother and maternal grandmother were carriers. Mutations in the Norrie disease gene can lead to retinal malformations of variable severity both in hemizygous males and manifesting carriers.
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lobanov, Alexei V.; Heaphy, Stephen M.; Turanov, Anton A.
2016-11-21
The ribosome can change its reading frame during translation in a process known as programmed ribosomal frameshifting. These rare events are supported by complex mRNA signals. However, we found that the ciliates Euplotes crassus and Euplotes focardii exhibit widespread frameshifting at stop codons. 47 different codons preceding stop signals resulted in either +1 or +2 frameshifts, and +1 frameshifting at AAA was the most frequent. The frameshifts showed unusual plasticity and rapid evolution, and had little influence on translation rates. The proximity of a stop codon to the 3' mRNA end, rather than its occurrence or sequence context, appeared tomore » designate termination. Thus, a ‘stop codon’ is not a sufficient signal for translation termination, and the default function of stop codons in Euplotes is frameshifting, whereas termination is specific to certain mRNA positions and probably requires additional factors.« less
Schuster, W; Brennicke, A
1991-01-01
An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
Global analysis of translation termination in E. coli
Baggett, Natalie E.
2017-01-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins. PMID:28301469
Culture adaptation of malaria parasites selects for convergent loss-of-function mutants.
Claessens, Antoine; Affara, Muna; Assefa, Samuel A; Kwiatkowski, Dominic P; Conway, David J
2017-01-24
Cultured human pathogens may differ significantly from source populations. To investigate the genetic basis of laboratory adaptation in malaria parasites, clinical Plasmodium falciparum isolates were sampled from patients and cultured in vitro for up to three months. Genome sequence analysis was performed on multiple culture time point samples from six monoclonal isolates, and single nucleotide polymorphism (SNP) variants emerging over time were detected. Out of a total of five positively selected SNPs, four represented nonsense mutations resulting in stop codons, three of these in a single ApiAP2 transcription factor gene, and one in SRPK1. To survey further for nonsense mutants associated with culture, genome sequences of eleven long-term laboratory-adapted parasite strains were examined, revealing four independently acquired nonsense mutations in two other ApiAP2 genes, and five in Epac. No mutants of these genes exist in a large database of parasite sequences from uncultured clinical samples. This implicates putative master regulator genes in which multiple independent stop codon mutations have convergently led to culture adaptation, affecting most laboratory lines of P. falciparum. Understanding the adaptive processes should guide development of experimental models, which could include targeted gene disruption to adapt fastidious malaria parasite species to culture.
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage
Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent
2016-01-01
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
The mitochondrial genome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae).
Xin, Tianrong; Li, Lei; Yao, Chengyi; Wang, Yayu; Zou, Zhiwen; Wang, Jing; Xia, Bin
2016-07-01
We present the complete mitogenome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae) in this article. The mitogenome was a circle molecular consisting of 15,286 nucleotides, 37 genes, and an A + T-rich region. The order of 37 genes was typical of insect mitochondrial DNA sequences described to date. The overall base composition of the genome is A (37.41%), T (42.80%), C (11.87%), and G (7.91%) with an A + T-rich hallmark as that of other invertebrate mitochondrial genomes. The start codon was mainly ATA in most of the mitochondrial protein-coding genes such as ND2, COI, ATP8, ND3, ND5, ND4, ND6, and ND1, but COII, ATP6, COIII, ND4L, and Cob genes employing ATG. The stop codon was TAA in all the protein-coding genes. The A + T region is located between 12S rRNA and tRNA(M)(et). The phylogenetic relationships of Lepidoptera species were constructed based on the nucleotides sequences of 13 PCGs of mitogenomes using the neighbor-joining method. The molecular-based phylogeny supported the traditional morphological classification on relationships within Lepidoptera species.
Truncated variants of apolipoprotein B cause hypobetalipoproteinaemia.
Collins, D R; Knott, T J; Pease, R J; Powell, L M; Wallis, S C; Robertson, S; Pullinger, C R; Milne, R W; Marcel, Y L; Humphries, S E
1988-01-01
Familial hypobetalipoproteinaemia is a rare autosomal dominant disorder in which levels of apo-B-containing plasma lipoproteins are approximately half-normal in heterozygotes and virtually absent in homozygotes. Here we describe mutations of the apo-B gene that cause two different truncated variants of apo-B in unrelated individuals with hypobetalipoproteinaemia. One variant, apo-B(His1795----Met-Trp-Leu-Val-Thr-Term) is predicted to be 1799 amino acids long and arises from deletion of a single nucleotide (G) from leucine codon 1794. This protein was found at low levels in very low density and low density lipoprotein fractions in the blood. The second, shorter variant, apo-B(Arg1306----Term), is caused by mutation of a CpG dinucleotide in arginine codon 1306 converting it to a stop codon and predicting a protein of 1305 residues. The product of this allele could not be detected in the circulation. The differences in size and behaviour of these two variants compared to apo-B100 or apo-B48 point to domains that may be important for the assembly, secretion or stability of apo-B-containing lipoproteins. Images PMID:2843815
Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R
2016-06-01
As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Dass, J Febin Prabhu; Sudandiradoss, C
2012-07-15
5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Seligmann, Hervé; Warthi, Ganesh
2017-01-01
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Identification of the initiation site of poliovirus polyprotein synthesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dorner, A.J.; Dorner, L.F.; Larsen, G.R.
1982-06-01
The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVPOO. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VPO, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of themore » tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.« less
Mutations in eukaryotic release factors 1 and 3 act as general nonsense suppressors in Drosophila.
Chao, Anna T; Dierick, Herman A; Addy, Tracie M; Bejsovec, Amy
2003-01-01
In a screen for suppressors of the Drosophila wingless(PE4) nonsense allele, we isolated mutations in the two components that form eukaryotic release factor. eRF1 and eRF3 comprise the translation termination complex that recognizes stop codons and catalyzes the release of nascent polypeptide chains from ribosomes. Mutations disrupting the Drosophila eRF1 and eRF3 show a strong maternal-effect nonsense suppression due to readthrough of stop codons and are zygotically lethal during larval stages. We tested nonsense mutations in wg and in other embryonically acting genes and found that different stop codons can be suppressed but only a subset of nonsense alleles are subject to suppression. We suspect that the context of the stop codon is significant: nonsense alleles sensitive to suppression by eRF1 and eRF3 encode stop codons that are immediately followed by a cytidine. Such suppressible alleles appear to be intrinsically weak, with a low level of readthrough that is enhanced when translation termination is disrupted. Thus the eRF1 and eRF3 mutations provide a tool for identifying nonsense alleles that are leaky. Our findings have important implications for assigning null mutant phenotypes and for selecting appropriate alleles to use in suppressor screens. PMID:14573473
L-MPZ, a Novel Isoform of Myelin P0, Is Produced by Stop Codon Readthrough*
Yamaguchi, Yoshihide; Hayashi, Akiko; Campagnoni, Celia W.; Kimura, Akio; Inuzuka, Takashi; Baba, Hiroko
2012-01-01
Myelin protein zero (P0 or MPZ) is a major myelin protein (∼30 kDa) expressed in the peripheral nervous system (PNS) in terrestrial vertebrates. Several groups have detected a P0-related 36-kDa (or 35-kDa) protein that is expressed in the PNS as an antigen for the serum IgG of patients with neuropathy. The molecular structure and function of this 36-kDa protein are, however, still unknown. We hypothesized that the 36-kDa protein may be derived from P0 mRNA by stop codon readthrough. We found a highly conserved region after the regular stop codon in predicted sequences from the 3′-UTR of P0 in higher animals. MS of the 36-kDa protein revealed that both P0 peptides and peptides deduced from the P0 3′-UTR sequence were found among the tryptic fragments. In transfected cells and in an in vitro transcription/translation system, the 36-kDa molecule was also produced from the identical mRNA that produced P0. We designated this 36-kDa molecule as large myelin protein zero (L-MPZ), a novel isoform of P0 that contains an additional domain at the C terminus. In the PNS, L-MPZ was localized in compact myelin. In transfected cells, just like P0, L-MPZ was localized at cell-cell adhesion sites in the plasma membrane. These results suggest that L-MPZ produced by the stop codon readthrough mechanism is potentially involved in myelination. Since this is the first finding of stop codon readthrough in a common mammalian protein, detailed analysis of L-MPZ expression will help to understand the mechanism of stop codon readthrough in mammals. PMID:22457349
A detailed analysis of codon usage patterns and influencing factors in Zika virus.
Singh, Niraj K; Tyagi, Anuj
2017-07-01
Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
ERIC Educational Resources Information Center
Prevost, Luanna B.; Smith, Michelle K.; Knight, Jennifer K.
2016-01-01
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon…
Castro-Chavez, Fernando
2011-01-01
My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Datta, Sibnarayan; Banerjee, Arup; Chandra, Partha K; Chakraborty, Subhasis; Basu, Subir Kumar; Chakravarty, Runu
2007-11-01
In blood donors, HBV infection is detected by the presence of serum hepatitis B surface antigen (HBsAg). However, some mutations in the surface gene region may result in altered or truncated HBsAg that can escape from immunoassay-based diagnosis. Such diagnostic escape mutants pose a potential risk for blood transfusion services. In the present study, we report a blood donor seronegative for HBsAg and antiHBc, but positive for antiHBs who was HBV DNA positive by PCR. Sequencing of the HBsAg gene revealed presence of a point mutation (T-A) at 207th nucleotide of the HBsAg ORF, which resulted in a premature stop codon at position 69. This results in a truncated HBsAg gene lacking the entire 'a' determinant region. However, follow-up of the donor after 2 years revealed clearance of HBV DNA from the serum. The case illustrates an unusual mutation, which causes HBsAg negativity. The finding emphasizes the importance of molecular assays in reducing the possibility of HBV transmission through blood transfusion. However, developing more sensitive serological assays, capable of detecting HBV mutants, is an alternative to expensive and complex amplification-based assays for developing countries.
RNA Editing in Plant Mitochondria
NASA Astrophysics Data System (ADS)
Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel
1989-12-01
Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
Cosenza, G; Iannaccone, M; Pico, B A; Ramunno, L; Capparelli, R
2016-10-01
Quantitative individual differences in the amount of β-casein in goat milk are determined by at least nine alleles. In particular, two alleles (CSN2(0) and CSN2(01) ) are associated with an undetectable amount of this protein in milk. The CSN2(01) allele is characterized by a single nucleotide substitution at position 373 of the seventh exon (AJ011018:g.8915C>T), responsible for the formation of a premature stop codon at the 182 position. Herein, we report the contribution of the SNP g.1311T>C, which demonstrates a linkage with the SNP AJ011018:g.8915C>T, to the promoter transcriptional activity. Particularly, we indicate that the nucleotide C at position 1311 negatively affects the promoter activity of the CSN2 gene. © 2016 Stichting International Foundation for Animal Genetics.
Geyer, David D.; Spence, M. Anne; Johannes, Meriam; Flodman, Pamela; Clancy, Kevin P.; Berry, Rebecca; Sparkes, Robert S.; Jonsen, Matthew D.; Isenberg, Sherwin J.; Bateman, J. Bronwyn
2006-01-01
PURPOSE To further elucidate the cataract phenotype, and identify the gene and mutation for autosomal dominant cataract (ADC) in an American family of European descent (ADC2) by sequencing the major intrinsic protein gene (MIP), a candidate based on linkage to chromosome 12q13. DESIGN Observational case series and laboratory experimental study. METHODS We examined two at-risk individuals in ADC2. We PCR-amplified and sequenced all four exons and all intron-exon boundaries of the MIP gene from genomic and cloned DNA in affected members to confirm one variant as the putative mutation. RESULTS We found a novel single deletion of nucleotide (nt) 3223 (within codon 235) in exon four, causing a frameshift that alters 41 of 45 subsequent amino acids and creates a premature stop codon. CONCLUSIONS We identified a novel single base pair deletion in the MIP gene and conclude that it is a pathogenic sequence alteration. PMID:16564824
José, Marco V; Morgado, Eberto R; Govezensky, Tzipe
2011-07-01
Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
CodonLogo: a sequence logo-based viewer for codon patterns.
Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V
2012-07-15
Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Major histocompatibility complex variation in the endangered Przewalski's horse.
Hedrick, P W; Parker, K M; Miller, E L; Miller, P S
1999-01-01
The major histocompatibility complex (MHC) is a fundamental part of the vertebrate immune system, and the high variability in many MHC genes is thought to play an essential role in recognition of parasites. The Przewalski's horse is extinct in the wild and all the living individuals descend from 13 founders, most of whom were captured around the turn of the century. One of the primary genetic concerns in endangered species is whether they have ample adaptive variation to respond to novel selective factors. In examining 14 Przewalski's horses that are broadly representative of the living animals, we found six different class II DRB major histocompatibility sequences. The sequences showed extensive nonsynonymous variation, concentrated in the putative antigen-binding sites, and little synonymous variation. Individuals had from two to four sequences as determined by single-stranded conformation polymorphism (SSCP) analysis. On the basis of the SSCP data, phylogenetic analysis of the nucleotide sequences, and segregation in a family group, we conclude that four of these sequences are from one gene (although one sequence codes for a nonfunctional allele because it contains a stop codon) and two other sequences are from another gene. The position of the stop codon is at the same amino-acid position as in a closely related sequence from the domestic horse. Because other organisms have extensive variation at homologous loci, the Przewalski's horse may have quite low variation in this important adaptive region. PMID:10430594
Kulkarni, N; Lakshmikumaran, M; Rao, M
1999-10-05
A 1.0 kilobase gene fragment from the genomic DNA of an alkaliphilic thermophilic Bacillus was found to code for a functional xylanase (XynII). The complete nucleotide sequence including the structural gene and the 5' and 3' flanking sequences of the xylanase gene have been determined. An open reading frame starting from ATG initiator codon comprising 402 nucleotides gave a preprotein of 133 amino acids of calculated molecular mass 14.090 kDa. The occurrence of three potential N-glycosylation sites in XynII gene is a unique feature for a gene of bacterial origin. The stop codon was followed by hairpin loop structures indicating the presence of transcription termination signals. The secondary structure analysis of XynII predicted that the polypeptide was primarily formed of beta-sheets. XynII appeared to be a member of family G/11 of xylanases based on its molecular weight and basic pI (8.0). However, sequence homology revealed similar identity with families 10 and 11 of xylanases. The conserved triad (Val-Val-Xaa, where Xaa is Asn or Asp) was identified only in the xylanases from alkaliphilic organisms. Our results implicate for the first time the concept of convergent evolution for XynII and provide a basis for research in evolutionary relationship among the xylanases from alkaliphilic and neutrophilic organisms. Copyright 1999 Academic Press.
Pyviko: an automated Python tool to design gene knockouts in complex viruses with overlapping genes.
Taylor, Louis J; Strebel, Klaus
2017-01-07
Gene knockouts are a common tool used to study gene function in various organisms. However, designing gene knockouts is complicated in viruses, which frequently contain sequences that code for multiple overlapping genes. Designing mutants that can be traced by the creation of new or elimination of existing restriction sites further compounds the difficulty in experimental design of knockouts of overlapping genes. While software is available to rapidly identify restriction sites in a given nucleotide sequence, no existing software addresses experimental design of mutations involving multiple overlapping amino acid sequences in generating gene knockouts. Pyviko performed well on a test set of over 240,000 gene pairs collected from viral genomes deposited in the National Center for Biotechnology Information Nucleotide database, identifying a point mutation which added a premature stop codon within the first 20 codons of the target gene in 93.2% of all tested gene-overprinted gene pairs. This shows that Pyviko can be used successfully in a wide variety of contexts to facilitate the molecular cloning and study of viral overprinted genes. Pyviko is an extensible and intuitive Python tool for designing knockouts of overlapping genes. Freely available as both a Python package and a web-based interface ( http://louiejtaylor.github.io/pyViKO/ ), Pyviko simplifies the experimental design of gene knockouts in complex viruses with overlapping genes.
Harun, Fatimah; Jalaludin, Muhammad Yazid; Lim, Chor Yin; Ng, Khoon Leong
2014-01-01
The c.2268dup mutation in thyroid peroxidase (TPO) gene was reported to be a founder mutation in Taiwanese patients with dyshormonogenetic congenital hypothyroidism (CH). The functional impact of the mutation is not well documented. In this study, homozygous c.2268dup mutation was detected in two Malaysian-Chinese sisters with goitrous CH. Normal and alternatively spliced TPO mRNA transcripts were present in thyroid tissues of the two sisters. The abnormal transcript contained 34 nucleotides originating from intron 12. The c.2268dup is predicted to generate a premature termination codon (PTC) at position 757 (p.Glu757X). Instead of restoring the normal reading frame, the alternatively spliced transcript has led to another stop codon at position 740 (p.Asp739ValfsX740). The two PTCs are located at 116 and 201 nucleotides upstream of the exons 13/14 junction fulfilling the requirement for a nonsense-mediated mRNA decay (NMD). Quantitative RT-PCR revealed an abundance of unidentified transcripts believed to be associated with the NMD. TPO enzyme activity was not detected in both patients, even though a faint TPO band of about 80 kD was present. In conclusion, the c.2268dup mutation leads to the formation of normal and alternatively spliced TPO mRNA transcripts with a consequential loss of TPO enzymatic activity in Malaysian-Chinese patients with goitrous CH. PMID:24745015
Lalaouna, David; Morissette, Audrey; Carrier, Marie-Claude; Massé, Eric
2015-10-01
The 87 nucleotide long DsrA sRNA has been mostly studied for its translational activation of the transcriptional regulator RpoS. However, it also represses hns mRNA, which encodes H-NS, a major regulator that affects expression of nearly 5% of Escherichia coli genes. A speculative model previously suggested that DsrA would block hns mRNA translation by binding simultaneously to start and stop codon regions of hns mRNA (coaxial model). Here, we show that DsrA efficiently blocked translation of hns mRNA by base-pairing immediately downstream of the start codon. In addition, DsrA induced hns mRNA degradation by actively recruiting the RNA degradosome complex. Data presented here led to a model of DsrA action on hns mRNA, which supports a canonical mechanism of sRNA-induced mRNA degradation by binding to the translation initiation region. Furthermore, using MS2-affinity purification coupled with RNA sequencing technology (MAPS), we also demonstrated that DsrA targets rbsD mRNA, involved in ribose utilization. Surprisingly, DsrA base pairs far downstream of rbsD start codon and induces rapid degradation of the transcript. Thus, our study enables us to draw an extended DsrA targetome. © 2015 John Wiley & Sons Ltd.
On fuzzy semantic similarity measure for DNA coding.
Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin
2016-02-01
A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
Optimizing doped libraries by using genetic algorithms
NASA Astrophysics Data System (ADS)
Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas
1997-01-01
The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
Jeong, Hyun-Jeong; Lee, Joong-Bok; Park, Seung-Yong; Song, Chang-Seon; Kim, Bo-Sook; Rho, Jung-Rae; Yoo, Mi-Hyun; Jeong, Byung-Hoon; Kim, Yong-Sun
2007-01-01
Polymorphisms of the prion protein gene (PRNP) have been detected in several cervid species. In order to confirm the genetic variations, this study examined the DNA sequences of the PRNP obtained from 33 captive sika deer (Cervus nippon laiouanus) in Korea. A total of three single-nucleotide polymorphisms (SNPs) at codons 100, 136 and 226 in the PRNP of the sika deer were identified. The polymorphic site located at codon 100 has not been reported. The SNPs detected at codons 100 and 226 induced amino acid substitutions. The SNP at codon 136 was a silent mutation that does not induce any amino acid change. The genotype and allele frequencies were determined for each of the SNPs. PMID:17679779
The complete genome sequence of freesia mosaic virus and its relationship to other potyviruses.
Choi, H I; Lim, H R; Song, Y S; Kim, M J; Choi, S H; Song, Y S; Bae, S C; Ryu, K H
2010-07-01
We have completed the genomic sequence of a potyvirus, freesia mosaic virus (FreMV), and compared it to those of other known potyviruses. The full-length genome sequence of FreMV consists of 9,489 nucleotides. The large protein contains 3,077 amino acids, with an AUG start codon and UAA stop codon, containing one open reading frame typical of a potyvirus polyprotein. The polyprotein of FreMV-Kr gives rise to eleven proteins (P1, HC-pro, P3, PIPO, 6K1, CI, 6K2, VPg, NIa, NIb and CP), and putative cleavage sites of each protein were identified by sequence comparison to those of other known potyviruses. Phylogenetic analysis of the polyprotein revealed that FreMV-Kr was most closely related to PeMoV and was related to BtMV, BaRMV and PeLMV, which belong to the BCMV subgroup. This is the first information on the complete genome structure of FreMV, and the sequence information clearly supports the status of FreMV as a member of a distinct species in the genus Potyvirus.
Qian, Chaoju; Wang, Yuanxiu; Guo, Zhichun; Yang, Jianke; Kan, Xianzhao
2013-06-01
The circular mitochondrial genome of Alauda arvensis is 17,018 bp in length, containing 13 protein-coding genes (PCGs), 2 ribosomal RNA genes, 22 transfer RNA (tRNA) genes, and 2 extensive heteroplasmic control regions. All of the genes encoded on the H-strand, with the exceptions of one PCG (nad6) and eight tRNA genes (tRNA(Gln), tRNA(Ala), tRNA(Asn), tRNA(Cys), tRNA(Tyr), tRNA(Ser(UCN)), tRNA(Pro), and tRNA(Glu)), as found in other birds' mitochondrial genomes. All of these PCGs are initiated with ATG, while stopped by six types of stop codons. All tRNA genes have the potential to fold into typical clover-leaf structure. Two extensive heteroplasmic control regions were found, and more interestingly, a minisatellite of 37 nucleotides (5'-TCAATCCCATTGATTTCATTATATTAGTATAAAGAAA-3') with 6 tandem repeats was detected at the end of CR2.
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.
Seward, Emily A; Kelly, Steven
2016-11-15
Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
Akhmaloka; Susilowati, Prima Endang; Subandi; Madayanti, Fida
2008-01-01
Termination translation in Saccharomyces cerevisiae is controlled by two interacting polypeptide chain release factors, eRF1 and eRF3. Two regions in human eRF1, position at 281-305 and position at 411-415, were proposed to be involved on the interaction to eRF3. In this study we have constructed and characterized yeast eRF1 mutant at position 410 (correspond to 415 human eRF1) from tyrosine to serine residue resulting eRF1(Y410S). The mutations did not affect the viability and temperature sensitivity of the cell. The stop codons suppression of the mutant was analyzed in vivo using PGK-stop codon-LACZ gene fusion and showed that the suppression of the mutant was significantly increased in all of codon terminations. The suppression on UAG codon was the highest increased among the stop codons by comparing the suppression of the wild type respectively. In vitro interaction between eRF1 (mutant and wild type) to eRF3 were carried out using eRF1-(His)6 and eRF1(Y410S)-(His)6 expressed in Escherichia coli and indigenous Saccharomyces cerevisiae eRF3. The results showed that the binding affinity of eRF1(Y410S) to eRF3 was decreased up to 20% of the wild type binding affinity. Computer modeling analysis using Swiss-Prot and Amber version 9.0 programs revealed that the overall structure of eRF1(Y410S) has no significant different with the wild type. However, substitution of tyrosine to serine triggered the structural change on the other motif of C-terminal domain of eRF1. The data suggested that increasing stop codon suppression and decreasing of the binding affinity of eRF1(Y410S) were probably due to the slight modification on the structure of the C-terminal domain. PMID:18463713
MicroRNA Targeting Specificity in Mammals: Determinants Beyond Seed Pairing
Grimson, Andrew; Farh, Kyle Kai-How; Johnston, Wendy K.; Garrett-Engele, Philip; Lim, Lee P.; Bartel, David P.
2013-01-01
Summary Mammalian microRNAs (miRNAs) pair to 3'UTRs of mRNAs to direct their posttranscriptional repression. Important for target recognition are ~7-nt sites that match the seed region of the miRNA. However, these seed matches are not always sufficient for repression, indicating that other characteristics help specify targeting. By combining computational and experimental approaches, we uncovered five general features of site context that boost site efficacy: AU-rich nucleotide composition near the site, proximity to sites for co-expressed miRNAs (which leads to cooperative action), proximity to residues pairing to miRNA nucleotides 13–16, and positioning within the 3'UTR at least 15 nt from the stop codon and away from the center of long UTRs. A model combining these context determinants quantitatively predicts site performance both for exogenously added miRNAs and for endogenous miRNA-message interactions. Because it predicts site efficacy without recourse to evolutionary conservation, the model also identifies effective nonconserved sites and siRNA off-targets. PMID:17612493
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
A free VP3 C-terminus is essential for the replication of infectious bursal disease virus.
Mosley, Yung-Yi C; Wu, Ching Ching; Lin, Tsang Long
2017-03-15
Green fluorescent protein (GFP) has been successfully incorporated into the viral-like particles of infectious bursal disease virus (IBDV) with a linker at the C-terminus of VP3 in a baculovirus system. However, when the same locus in segment A was used to express GFP by a reverse genetic (RG) system, no viable GFP-expressing IBDV was recovered. To elucidate the underlying mechanism, cDNA construct of segment A with only the linker sequence (9 amino acids) was applied to generate RG IBDV virus (rIBDV). Similarly, no rIBDV was recovered. Moreover, when the incubation after transfection was extended, wildtype rIBDV without the linker was recovered suggesting a free C-terminus of VP3 might be necessary for IBDV replication. On the other hand, rIBDV could be recovered when additional sequence (up to 40 nucleotides) were inserted at the 3' noncoding region (NCR) adjacent to the stop codon of VP3, suggesting that the burden of the linker sequence was not in the stretched genome size but the disruption of the VP3 function. Finally, when the stop codon of VP3 was deleted in segment A to extend the translation into the 3' NCR without introducing additional genomic sequence, no rIBDV was recovered. Our data suggest that a free VP3 C-terminus is essential for IBDV replication. Copyright © 2017 Elsevier B.V. All rights reserved.
A Novel Loss-of-Sclerostin Function Mutation in a First Egyptian Family with Sclerosteosis
Fayez, Alaaeldin; Aglan, Mona; Esmaiel, Nora; El Zanaty, Taher; Abdel Kader, Mohamed; El Ruby, Mona
2015-01-01
Sclerosteosis is a rare autosomal recessive condition characterized by increased bone density. Mutations in SOST gene coding for sclerostin are linked to sclerosteosis. Two Egyptian brothers with sclerosteosis and their apparently normal consanguineous parents were included in this study. Clinical evaluation and genomic sequencing of the SOST gene were performed followed by in silico analysis of the resulting variation. A novel homozygous frameshift mutation in the SOST gene, characterized as one nucleotide cytosine insertion that led to premature stop codon and loss of functional sclerostin, was identified in the two affected brothers. Their parents were heterozygous for the same mutation. To our knowledge this is the first Egyptian study of sclerosteosis and SOST gene causing mutation. PMID:25984533
[Positioning of mRNA 3' of the a site bound codon on the human 80S ribosome].
Molotkov, M V; Graĭfer, D M; Demeshkina, N A; Repkova, M N; Ven'iaminova, A G; Karpova, G G
2005-01-01
Short mRNA analogues carrying a UUU triplet at the 5'-termini and a perfluorophenylazide group at either the N7 atom of the guanosine or the C5 atom of the uridine 3' of the triplet were applied to study positioning of mRNA 3' of the A site codon. Complexes of 80S ribosomes with the mRNA analogues were obtained in the presence of tRNAPhe that directed UUU codon to the P site and consequently provided placement of the nucleotide with cross-linker in positions +9 or +12 with respect to the first nucleotide of the P site bound codon. Both types mRNA analogues cross-linked to the 18S rRNA and 40S proteins under mild UV-irradiation. Cross-linking patterns in the complexes where modified nucleotides of the mRNA analogues were in position +7 were analyzed for comparison (cross-linking to the 18S rRNA in such complexes has been studied previously). The efficiency of cross-linking to the ribosomal components depended on the nature of the modified nucleotide in the mRNA analogue and its position on the ribosome, extent of cross-linking to the 18S rRNA being decreased drastically when the modified nucleotide was moved from position +7 to position +12. The nucleotides of 18S rRNA cross-linked to mRNA analogues were determined. Modified nucleotides in positions +9 and +12 cross-linked to the invariant dinucleotide A1824/A1825 and to variable A1823 in the 3'-minidomain of 18S rRNA as well as to protein S15. The same ribosomal components have been found earlier to cross-link to modified mRNA nucleotides in positions from +4 to +7. Besides, all mRNA analogues cross-linked to the invariant nucleotide c1698 in the 3'-minidomain and to and the conserved region 605-620 closing helix 18 in the 5'-domain.
The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.
Kim, K S; Lee, S E; Jeong, H W; Ha, J H
1998-10-01
The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.
Prevost, Luanna B.; Smith, Michelle K.; Knight, Jennifer K.
2016-01-01
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon mutation potentially impacts replication, transcription, and translation. We then used computer-assisted lexical analysis combined with human scoring to categorize student responses. The lexical analysis models showed high agreement with human scoring, demonstrating that this approach can be successfully used to analyze large numbers of student written responses. The results of this analysis show that students’ ideas about one process in the central dogma can affect their thinking about subsequent and previous processes, leading to mixed models of conceptual understanding. PMID:27909016
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.
Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter
2016-02-19
Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S
2017-08-29
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.
Gan, Qinglei; Fan, Chenguang
2017-11-01
Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA Lys , tRNA Tyr , and tRNA Gln (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.
Origin of noncoding DNA sequences: molecular fossils of genome evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Naora, H.; Miyahara, K.; Curnow, R.N.
The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. The authors propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approx. 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approx. 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stopmore » codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. They attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.« less
Salvatori, Francesca; Pappadà, Mariangela; Breveglieri, Giulia; D'Aversa, Elisabetta; Finotti, Alessia; Lampronti, Ilaria; Gambari, Roberto; Borgatti, Monica
2018-05-15
Nonsense mutations promote premature translational termination, introducing stop codons within the coding region of mRNAs and causing inherited diseases, including thalassemia. For instance, in β 0 39 thalassemia the CAG (glutamine) codon is mutated to the UAG stop codon, leading to premature translation termination and to mRNA destabilization through the well described NMD (nonsense-mediated mRNA decay). In order to develop an approach facilitating translation and, therefore, protection from NMD, ribosomal read-through molecules, such as aminoglycoside antibiotics, have been tested on mRNAs carrying premature stop codons. These findings have introduced new hopes for the development of a pharmacological approach to the β 0 39 thalassemia therapy. While several strategies, designed to enhance translational read-through, have been reported to inhibit NMD efficiency concomitantly, experimental tools for systematic analysis of mammalian NMD inhibition by translational read-through are lacking. We developed a human cellular model of the β 0 39 thalassemia mutation with UPF-1 suppressed and showing a partial NMD suppression. This novel cellular model could be used for the screening of molecules exhibiting preferential read-through activity allowing a great rescue of the mutated transcripts.
Rules of UGA-N decoding by near-cognate tRNAs and analysis of readthrough on short uORFs in yeast.
Beznosková, Petra; Gunišová, Stanislava; Valášek, Leoš Shivaya
2016-03-01
The molecular mechanism of stop codon recognition by the release factor eRF1 in complex with eRF3 has been described in great detail; however, our understanding of what determines the difference in termination efficiencies among various stop codon tetranucleotides and how near-cognate (nc) tRNAs recode stop codons during programmed readthrough in Saccharomyces cerevisiae is still poor. Here, we show that UGA-C as the only tetranucleotide of all four possible combinations dramatically exacerbated the readthrough phenotype of the stop codon recognition-deficient mutants in eRF1. Since the same is true also for UAA-C and UAG-C, we propose that the exceptionally high readthrough levels that all three stop codons display when followed by cytosine are partially caused by the compromised sampling ability of eRF1, which specifically senses cytosine at the +4 position. The difference in termination efficiencies among the remaining three UGA-N tetranucleotides is then given by their varying preferences for nc-tRNAs. In particular, UGA-A allows increased incorporation of Trp-tRNA whereas UGA-G and UGA-C favor Cys-tRNA. Our findings thus expand the repertoire of general decoding rules by showing that the +4 base determines the preferred selection of nc-tRNAs and, in the case of cytosine, it also genetically interacts with eRF1. Finally, using an example of the GCN4 translational control governed by four short uORFs, we also show how the evolution of this mechanism dealt with undesirable readthrough on those uORFs that serve as the key translation reinitiation promoting features of the GCN4 regulation, as both of these otherwise counteracting activities, readthrough versus reinitiation, are mediated by eIF3. © 2016 Beznosková et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology.
Jones, Jennifer E; Long, Kristin M; Whitmore, Alan C; Sanders, Wes; Thurlow, Lance R; Brown, Julia A; Morrison, Clayton R; Vincent, Heather; Peck, Kayla M; Browning, Christian; Moorman, Nathaniel; Lim, Jean K; Heise, Mark T
2017-11-14
Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4 + T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. IMPORTANCE Chikungunya virus (CHIKV) is a mosquito-borne alphavirus that causes significant outbreaks of viral arthralgia. Studies with CHIKV and other alphaviruses demonstrated that the opal termination codon within nsP3 is highly conserved. However, some strains of CHIKV and other alphaviruses contain mutations in the opal termination codon. These mutations alter the virulence of related alphaviruses in mammalian and mosquito hosts. Here, we report that a clinical isolate of a CHIKV strain from the recent outbreak in the Caribbean islands contains a mixture of viruses encoding either the opal termination codon or an arginine mutation. Mutating the opal stop codon to an arginine residue attenuates CHIKV-induced disease in a mouse model. Compared to infection with the opal-containing parental virus, infection with the arginine mutant causes limited swelling and inflammation, as well as dampened recruitment of immune mediators of pathology, including CD4 + T cells and NK cells. We propose that the opal termination codon plays an essential role in the induction of severe CHIKV disease. Copyright © 2017 Jones et al.
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology
Jones, Jennifer E.; Long, Kristin M.; Whitmore, Alan C.; Sanders, Wes; Thurlow, Lance R.; Brown, Julia A.; Morrison, Clayton R.; Vincent, Heather; Browning, Christian; Moorman, Nathaniel; Lim, Jean K.
2017-01-01
ABSTRACT Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo. Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4+ T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. PMID:29138302
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.
2017-01-01
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
Engqvist, Martin K M; Nielsen, Jens
2015-08-21
The Ambiguous Nucleotide Tool (ANT) is a desktop application that generates and evaluates degenerate codons. Degenerate codons are used to represent DNA positions that have multiple possible nucleotide alternatives. This is useful for protein engineering and directed evolution, where primers specified with degenerate codons are used as a basis for generating libraries of protein sequences. ANT is intuitive and can be used in a graphical user interface or by interacting with the code through a defined application programming interface. ANT comes with full support for nonstandard, user-defined, or expanded genetic codes (translation tables), which is important because synthetic biology is being applied to an ever widening range of natural and engineered organisms. The Python source code for ANT is freely distributed so that it may be used without restriction, modified, and incorporated in other software or custom data pipelines.
Jiang, Fan; Huang, Lv-Yin; Chen, Gui-Lan; Zhou, Jian-Ying; Xie, Xing-Mei; Li, Dong-Zhi
2017-01-01
We describe a new β-thalassemic mutation in a Chinese subject. This allele develops by insertion of one nucleotide (+T) between codons 138 and 139 in the third exon of the β-globin gene. The mutation causes a frameshift that leads to a termination codon at codon 139. In the heterozygote, this allele has the phenotype of classical β-thalassemia (β-thal) minor.
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.
Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S
2011-03-01
The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
Ribosome profiling reveals pervasive and regulated stop codon readthrough in Drosophila melanogaster
Dunn, Joshua G; Foo, Catherine K; Belletier, Nicolette G; Gavis, Elizabeth R; Weissman, Jonathan S
2013-01-01
Ribosomes can read through stop codons in a regulated manner, elongating rather than terminating the nascent peptide. Stop codon readthrough is essential to diverse viruses, and phylogenetically predicted to occur in a few hundred genes in Drosophila melanogaster, but the importance of regulated readthrough in eukaryotes remains largely unexplored. Here, we present a ribosome profiling assay (deep sequencing of ribosome-protected mRNA fragments) for Drosophila melanogaster, and provide the first genome-wide experimental analysis of readthrough. Readthrough is far more pervasive than expected: the vast majority of readthrough events evolved within D. melanogaster and were not predicted phylogenetically. The resulting C-terminal protein extensions show evidence of selection, contain functional subcellular localization signals, and their readthrough is regulated, arguing for their importance. We further demonstrate that readthrough occurs in yeast and humans. Readthrough thus provides general mechanisms both to regulate gene expression and function, and to add plasticity to the proteome during evolution. DOI: http://dx.doi.org/10.7554/eLife.01179.001 PMID:24302569
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Garver, K.A.; Conway, C.M.; Kurath, G.
2006-01-01
A highly efficacious DNA vaccine against a fish rhabdovirus, infectious hematopoietic necrosis virus (IHNV), was mutated to introduce two stop codons to prevent glycoprotein translation while maintaining the plasmid DNA integrity and RNA transcription ability. The mutated plasmid vaccine, denoted pIHNw-G2stop, when injected intramuscularly into fish at high doses, lacked detectable glycoprotein expression in the injection site muscle, and did not provide protection against lethal virus challenge 7 days post-vaccination. These results suggest that the G-protein itself is required to stimulate the early protective antiviral response observed after vaccination with the nonmutated parental DNA vaccine. ?? Springer Science+Business Media, Inc. 2006.
Castro-Chavez, Fernando
2012-01-01
Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Vasconcelos, O; Sivakumar, K; Dalakas, M C; Quezado, M; Nagle, J; Leon-Monzon, M; Dubnick, M; Gajdusek, D C; Goldfarb, L G
1995-01-01
Mutations in the human phosphofructokinase muscle subunit gene (PFKM) are known to cause myopathy classified as glycogenosis type VII (Tarui disease). Previously described molecular defects include base substitutions altering encoded amino acids or resulting in abnormal splicing. We report a mutation resulting in phosphofructokinase deficiency in three patients from an Ashkenazi Jewish family. Using a reverse transcription PCR assay, PFKM subunit transcripts differing by length were detected in skeletal muscle tissue of all three affected subjects. In the longer transcript, an insertion of 252 nucleotides totally homologous to the structure of the 10th intron of the PFKM gene was found separating exon 10 from exon 11. In addition, two single base transitions were identified by direct sequencing: [exon 6; codon 95; CGA (Arg) to TGA (stop)] and [exon 7; codon 172; ACC (Thr) to ACT (Thr)] in either transcript. Single-stranded conformational polymorphism and restriction enzyme analyses confirmed the presence of these point substitutions in genomic DNA and strongly suggested homozygosity for the pathogenic allele. The nonsense mutation at codon 95 appeared solely responsible for the phenotype in these patients, further expanding genetic heterogeneity of Tarui disease. Transcripts with and without intron 10 arising from identical mutant alleles probably resulted from differential pre-mRNA processing and may represent a novel message from the PFKM gene. Images Fig. 2 Fig. 4 Fig. 5 PMID:7479776
Methylation of class I translation termination factors: structural and functional aspects.
Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie
2012-07-01
During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Ndhlovu, Andrew; Durand, Pierre M.; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. Database URL: http://www.bioinf.wits.ac.za/software/fire/evodb PMID:26140928
Ndhlovu, Andrew; Durand, Pierre M; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. © The Author(s) 2015. Published by Oxford University Press.
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.
Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo
2018-01-01
The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
Novel insertion mutation in a non-Jewish Caucasian type 1 Gaucher disease patient
DOE Office of Scientific and Technical Information (OSTI.GOV)
Choy, F.Y.M.; Humphries, M.L.; Ferreira, P.
1997-01-20
Gaucher disease is the most prevalent lysosomal storage disorder. It is autosomal recessive, resulting in lysosomal glucocerebrosidase deficiency. Three clinical forms of Gaucher disease have been described: type 1 (nonneuronopathic), type 2 (acute neuronopathic), and type 3 (subacute neuronopathic). We performed PCR-thermal cycle sequence analysis of glucocerebrosidase genomic DNA and identified a novel mutation in a non-Jewish type 1 Gaucher disease patient. It is a C insertion in exon 3 at cDNA nucleotide position 122 and genomic nucleotide position 1626. This mutation causes a frameshift and, subsequently, four of the five codons immediately downstream of the insertion were changed whilemore » the sixth was converted to a stop codon, resulting in premature termination of protein translation. The 122CC insertion abolishes a Cac81 restriction endonuclease cleavage site, allowing a convenient and reliable method for detection using RFLP analysis of PCR-amplified glucocerebrosidase genomic DNA. The mutation in the other Gaucher allele was found to be an A{r_arrow}G substitution at glucocerebrosidase cDNA nucleotide position 1226 that so far has only been reported among type 1 Gaucher disease patients. Since mutation 122CC causes a frameshift and early termination of protein translation, it most likely results in a meaningless transcript and subsequently no residual glucocerebrosidase enzyme activity. We speculate that mutation 122CC may result in a worse prognosis than mutations associated with partial activity. When present in the homozygous form, it could be a lethal allele similar to what has been postulated for the other known insertion mutation, 84GG. Our patient, who is a compound heterozygote 122CC/1226G, has moderately severe type 1 Gaucher disease. Her clinical response to Ceredase{reg_sign} therapy that began 31 months ago has been favorable, though incomplete. 30 refs., 3 figs., 2 tabs.« less
Ovine Reference Materials and Assays for Prion Genetic Testing
USDA-ARS?s Scientific Manuscript database
Codon variants implicated in scrapie susceptibility or disease progression include those at amino acid positions 112, 136, 141, 154, and 171. Nine single nucleotide polymorphisms (SNPs) determine which residues are encoded by the five implicated codons and accurately scoring these SNPs is essential...
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.
Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen
2015-05-06
The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
Neves, Fabiana; Abrantes, Joana; Esteves, Pedro J
2016-07-01
The interactions between chemokines and their receptors are crucial for differentiation and activation of inflammatory cells. CC chemokine ligand 11 (CCL11) binds to CCR3 and to CCR5 that in leporids underwent gene conversion with CCR2. Here, we genetically characterized CCL11 in lagomorphs (leporids and pikas). All lagomorphs have a potentially functional CCL11, and the Pygmy rabbit has a mutation in the stop codon that leads to a longer protein. Other mammals also have mutations at the stop codon that result in proteins with different lengths. By employing maximum likelihood methods, we observed that, in mammals, CCL11 exhibits both signatures of purifying and positive selection. Signatures of purifying selection were detected in sites important for receptor binding and activation. Of the three sites detected as under positive selection, two were located close to the stop codon. Our results suggest that CCL11 is functional in all lagomorphs, and that the signatures of purifying and positive selection in mammalian CCL11 probably reflect the protein's biological roles. © The Author(s) 2016.
Colagrossi, Luna; Hermans, Lucas E; Salpini, Romina; Di Carlo, Domenico; Pas, Suzan D; Alvarez, Marta; Ben-Ari, Ziv; Boland, Greet; Bruzzone, Bianca; Coppola, Nicola; Seguin-Devaux, Carole; Dyda, Tomasz; Garcia, Federico; Kaiser, Rolf; Köse, Sukran; Krarup, Henrik; Lazarevic, Ivana; Lunar, Maja M; Maylin, Sarah; Micheli, Valeria; Mor, Orna; Paraschiv, Simona; Paraskevis, Dimitros; Poljak, Mario; Puchhammer-Stöckl, Elisabeth; Simon, François; Stanojevic, Maja; Stene-Johansen, Kathrine; Tihic, Nijaz; Trimoulet, Pascale; Verheyen, Jens; Vince, Adriana; Lepej, Snjezana Zidovec; Weis, Nina; Yalcinkaya, Tülay; Boucher, Charles A B; Wensing, Annemarie M J; Perno, Carlo F; Svicher, Valentina
2018-06-01
HBsAg immune-escape mutations can favor HBV-transmission also in vaccinated individuals, promote immunosuppression-driven HBV-reactivation, and increase fitness of drug-resistant strains. Stop-codons can enhance HBV oncogenic-properties. Furthermore, as a consequence of the overlapping structure of HBV genome, some immune-escape mutations or stop-codons in HBsAg can derive from drug-resistance mutations in RT. This study is aimed at gaining insight in prevalence and characteristics of immune-associated escape mutations, and stop-codons in HBsAg in chronically HBV-infected patients experiencing nucleos(t)ide analogues (NA) in Europe. This study analyzed 828 chronically HBV-infected European patients exposed to ≥ 1 NA, with detectable HBV-DNA and with an available HBsAg-sequence. The immune-associated escape mutations and the NA-induced immune-escape mutations sI195M, sI196S, and sE164D (resulting from drug-resistance mutation rtM204 V, rtM204I, and rtV173L) were retrieved from literature and examined. Mutations were defined as an aminoacid substitution with respect to a genotype A or D reference sequence. At least one immune-associated escape mutation was detected in 22.1% of patients with rising temporal-trend. By multivariable-analysis, genotype-D correlated with higher selection of ≥ 1 immune-associated escape mutation (OR[95%CI]:2.20[1.32-3.67], P = 0.002). In genotype-D, the presence of ≥ 1 immune-associated escape mutations was significantly higher in drug-exposed patients with drug-resistant strains than with wild-type virus (29.5% vs 20.3% P = 0.012). Result confirmed by analysing drug-naïve patients (29.5% vs 21.2%, P = 0.032). Strong correlation was observed between sP120T and rtM204I/V (P < 0.001), and their co-presence determined an increased HBV-DNA. At least one NA-induced immune-escape mutation occurred in 28.6% of patients, and their selection correlated with genotype-A (OR[95%CI]:2.03[1.32-3.10],P = 0.001). Finally, stop-codons are present in 8.4% of patients also at HBsAg-positions 172 and 182, described to enhance viral oncogenic-properties. Immune-escape mutations and stop-codons develop in a large fraction of NA-exposed patients from Europe. This may represent a potential threat for horizontal and vertical HBV transmission also to vaccinated persons, and fuel drug-resistance emergence.
Gubbens, Jacob; Kim, Soo Jung; Yang, Zhongying; Johnson, Arthur E.; Skach, William R.
2010-01-01
Amber suppressor tRNAs are widely used to incorporate nonnatural amino acids into proteins to serve as probes of structure, environment, and function. The utility of this approach would be greatly enhanced if multiple probes could be simultaneously incorporated at different locations in the same protein without other modifications. Toward this end, we have developed amber, opal, and ochre suppressor tRNAs derived from Escherichia coli, and yeast tRNACys that incorporate a chemically modified cysteine residue with high selectivity at the cognate UAG, UGA, and UAA stop codons in an in vitro translation system. These synthetic tRNAs were aminoacylated in vitro, and the labile aminoacyl bond was stabilized by covalently attaching a fluorescent dye to the cysteine sulfhydryl group. Readthrough efficiency (amber > opal > ochre) was substantially improved by eRF1/eRF3 inhibition with an RNA aptamer, thus overcoming an intrinsic hierarchy in stop codon selection that limits UGA and UAA termination suppression in higher eukaryotic translation systems. This approach now allows concurrent incorporation of two different modified amino acids at amber and opal codons with a combined apparent readthrough efficiency of up to 25% when compared with the parent protein lacking a stop codon. As such, it significantly expands the possibilities for incorporating nonnative amino acids for protein structure/function studies. PMID:20581130
Tay, Wee Tek; Elfekih, Samia; Court, Leon N; Gordon, Karl H J; Delatte, Hélène; De Barro, Paul J
2017-10-01
Molecular species identification using suboptimal PCR primers can over-estimate species diversity due to coamplification of nuclear mitochondrial (NUMT) DNA/pseudogenes. For the agriculturally important whitefly Bemisia tabaci cryptic pest species complex, species identification depends primarily on characterization of the mitochondrial DNA cytochrome oxidase I (mtDNA COI) gene. The lack of robust PCR primers for the mtDNA COI gene can undermine correct species identification which in turn compromises management strategies. This problem is identified in the B. tabaci Africa/Middle East/Asia Minor clade which comprises the globally invasive Mediterranean (MED) and Middle East Asia Minor I (MEAM1) species, Middle East Asia Minor 2 (MEAM2), and the Indian Ocean (IO) species. Initially identified from the Indian Ocean island of Réunion, MEAM2 has since been reported from Japan, Peru, Turkey and Iraq. We identified MEAM2 individuals from a Peruvian population via Sanger sequencing of the mtDNA COI gene. In attempting to characterize the MEAM2 mitogenome, we instead characterized mitogenomes of MEAM1. We also report on the mitogenomes of MED, AUS, and IO thereby increasing genomic resources for members of this complex. Gene synteny (i.e., same gene composition and orientation) was observed with published B. tabaci cryptic species mitogenomes. Pseudogene fragments matching MEAM2 partial mtDNA COI gene exhibited low frequency single nucleotide polymorphisms that matched low copy number DNA fragments (<3%) of MEAM1 genomes, whereas presence of internal stop codons, loss of expected stop codons and poor primer annealing sites, all suggested MEAM2 as a pseudogene artifact and so not a real species. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Nicholas, Frank W; Hobbs, Matthew
2014-01-01
Within two years of the re-discovery of Mendelism, Bateson and Saunders had described six traits in non-laboratory animals (five in chickens and one in cattle) that show single-locus (Mendelian) inheritance. In the ensuing decades, much progress was made in documenting an ever-increasing number of such traits. In 1987 came the first discovery of a causal mutation for a Mendelian trait in non-laboratory animals: a non-sense mutation in the thyroglobulin gene (TG), causing familial goitre in cattle. In the years that followed, the rate of discovery of causal mutations increased, aided mightily by the creation of genome-wide microsatellite maps in the 1990s and even more mightily by genome assemblies and single-nucleotide polymorphism (SNP) chips in the 2000s. With sequencing costs decreasing rapidly, by 2012 causal mutations were being discovered in non-laboratory animals at a rate of more than one per week. By the end of 2012, the total number of Mendelian traits in non-laboratory animals with known causal mutations had reached 499, which was half the number of published single-locus (Mendelian) traits in those species. The distribution of types of mutations documented in non-laboratory animals is fairly similar to that in humans, with almost half being missense or non-sense mutations. The ratio of missense to non-sense mutations in non-laboratory animals to the end of 2012 was 193:78. The fraction of non-sense mutations (78/271 = 0.29) was not very different from the fraction of non-stop codons that are just one base substitution away from a stop codon (21/61 = 0.34). PMID:24372556
Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson
2016-06-14
In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
Efficient initiation of mammalian mRNA translation at a CUG codon.
Dasso, M C; Jackson, R J
1989-01-01
Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
Somatic mutations in cancer: Stochastic versus predictable.
Gold, Barry
2017-02-01
The origins of human cancers remain unclear except for a limited number of potent environmental mutagens, such as tobacco and UV light, and in rare cases, familial germ line mutations that affect tumor suppressor genes or oncogenes. A significant component of cancer etiology has been deemed stochastic and correlated with the number of stem cells in a tissue, the number of times the stem cells divide and a low incidence of random DNA polymerase errors that occur during each cell division. While somatic mutations occur during each round of DNA replication, mutations in cancer driver genes are not stochastic. Out of a total of 2843 codons, 1031 can be changed to stop codons by a single base substitution in the tumor suppressor APC gene, which is mutated in 76% of colorectal cancers (CRC). However, the nonsense mutations, which comprise 65% of all the APC driver mutations in CRC, are not random: 43% occur at Arg CGA codons, although they represent <3% of the codons. In TP53, CGA codons comprise <3% of the total 393 codons but they account for 72% and 39% of the mutations in CRC and ovarian cancer OVC, respectively. This mutation pattern is consistent with the kinetically slow, but not stochastic, hydrolytic deamination of 5-methylcytosine residues at specific methylated CpG sites to afford T·G mismatches that lead to C→T transitions and stop codons at CGA. Analysis of nonsense mutations in CRC, OVC and a number of other cancers indicates the need to expand the predictable risk factors for cancer to include, in addition to random polymerase errors, the methylation status of gene body CGA codons in tumor suppressor genes. Copyright © 2017. Published by Elsevier B.V.
Khaĭrulina, Iu S; Molotkov, M V; Bulygin, K N; Graĭfer, D M; Ven'yaminova, A G; Frolova, L Iu; Stahl, J; Karpova, G G
2008-01-01
Protein S3 fragments were determined that crosslink to modified mRNA analogues in positions +5 to +12 relative to the first nucleotide in the P-site binding codon in model complexes mimicking states of ribosomes at the elongation and translation termination steps. The mRNA analogues contained a Phe codon UUU/UUC at the 5'-termini that could predetermine the position of the tRNA(Phe) on the ribosome by the location of P-site binding and perfluorophenylazidobenzoyl group at a nucleotide in various positions 3' of the UUU/UUC codon. The crosslinked S3 protein was isolated from 80S ribosomal complexes irradiated with mild UV light and subjected to cyanogen bromide-induced cleavage at methionine residues with subsequent identification of the crosslinked oligopeptides. An analysis of the positions of modified oligopeptides resulting from the cleavage showed that, in dependence on the positions of modified nucleotides in the mRNA analogue, the crosslinking sites were found in the N-terminal half of the protein (fragment 2-127) and/or in the C-terminal fragment 190-236; the latter reflects a new peculiarity in the structure of the mRNA binding center in the ribosome, unknown to date. The results of crosslinking did not depend on the type of A-site codon or on the presence of translation termination factor eRF1.
Orellana-Muñoz, Sara; Gutiérrez-Escribano, Pilar; Arnáiz-Pita, Yolanda; Dueñas-Santero, Encarnación; Suárez, M. Belén; Bougnoux, Marie-Elisabeth; del Rey, Francisco; Sherlock, Gavin; d’Enfert, Christophe; Correa-Bordes, Jaime; de Aldana, Carlos R. Vázquez
2015-01-01
Candida albicans is a major invasive fungal pathogen in humans. An important virulence factor is its ability to switch between the yeast and hyphal forms, and these filamentous forms are important in tissue penetration and invasion. A common feature for filamentous growth is the ability to inhibit cell separation after cytokinesis, although it is poorly understood how this process is regulated developmentally. In C. albicans, the formation of filaments during hyphal growth requires changes in septin ring dynamics. In this work, we studied the functional relationship between septins and the transcription factor Ace2, which controls the expression of enzymes that catalyze septum degradation. We found that alternative translation initiation produces two Ace2 isoforms. While full-length Ace2, Ace2L, influences septin dynamics in a transcription-independent manner in hyphal cells but not in yeast cells, the use of methionine-55 as the initiation codon gives rise to Ace2S, which functions as the nuclear transcription factor required for the expression of cell separation genes. Genetic evidence indicates that Ace2L influences the incorporation of the Sep7 septin to hyphal septin rings in order to avoid inappropriate activation of cell separation during filamentous growth. Interestingly, a natural single nucleotide polymorphism (SNP) present in the C. albicans WO-1 background and other C. albicans commensal and clinical isolates generates a stop codon in the ninth codon of Ace2L that mimics the phenotype of cells lacking Ace2L. Finally, we report that Ace2L and Ace2S interact with the NDR kinase Cbk1 and that impairing activity of this kinase results in a defect in septin dynamics similar to that of hyphal cells lacking Ace2L. Together, our findings identify Ace2L and the NDR kinase Cbk1 as new elements of the signaling system that modify septin ring dynamics in hyphae to allow cell-chain formation, a feature that appears to have evolved in specific C. albicans lineages. PMID:25875512
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.
Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B
1990-01-01
Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. Images PMID:2376563
Stop Codon Reassignment in the Wild
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James
Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter
2012-01-01
Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Salmon, Jérôme; Nonnenmacher, Mathieu; Cazé, Sandrine; Flamant, Patricia; Croissant, Odile; Orth, Gérard; Breitburd, Françoise
2000-01-01
We previously reported the partial characterization of two cottontail rabbit papillomavirus (CRPV) subtypes with strikingly divergent E6 and E7 oncoproteins. We report now the complete nucleotide sequences of these subtypes, referred to as CRPVa4 (7,868 nucleotides) and CRPVb (7,867 nucleotides). The CRPVa4 and CRPVb genomes differed at 238 (3%) nucleotide positions, whereas CRPVa4 and the prototype CRPV differed by only 5 nucleotides. The most variable region (7% nucleotide divergence) included the long regulatory region (LRR) and the E6 and E7 genes. A mutation in the stop codon resulted in an 8-amino-acid-longer CRPVb E4 protein, and a nucleotide deletion reduced the coding capacity of the E5 gene from 101 to 25 amino acids. In domestic rabbits homozygous for a specific haplotype of the DRA and DQA genes of the major histocompatibility complex, warts induced by CRPVb DNA or a chimeric genome containing the CRPVb LRR/E6/E7 region showed an early regression, whereas warts induced by CRPVa4 or a chimeric genome containing the CRPVa4 LRR/E6/E7 region persisted and evolved into carcinomas. In contrast, most CRPVa, CRPVb, and chimeric CRPV DNA-induced warts showed no early regression in rabbits homozygous for another DRA-DQA haplotype. Little, if any, viral replication is usually observed in domestic rabbit warts. When warts induced by CRPVa and CRPVb virions and DNA were compared, the number of cells positive for viral DNA or capsid antigens was found to be greater by 1 order of magnitude for specimens induced by CRPVb. Thus, both sequence variation in the LRR/E6/E7 region and the genetic constitution of the host influence the expression of the oncogenic potential of CRPV. Furthermore, intratype variation may overcome to some extent the host restriction of CRPV replication in domestic rabbits. PMID:11044121
KBG syndrome involving a single-nucleotide duplication in ANKRD11
Kleyner, Robert; Malcolmson, Janet; Tegay, David; Ward, Kenneth; Maughan, Annette; Maughan, Glenn; Nelson, Lesa; Wang, Kai; Robison, Reid; Lyon, Gholson J.
2016-01-01
KBG syndrome is a rare autosomal dominant genetic condition characterized by neurological involvement and distinct facial, hand, and skeletal features. More than 70 cases have been reported; however, it is likely that KBG syndrome is underdiagnosed because of lack of comprehensive characterization of the heterogeneous phenotypic features. We describe the clinical manifestations in a male currently 13 years of age, who exhibited symptoms including epilepsy, severe developmental delay, distinct facial features, and hand anomalies, without a positive genetic diagnosis. Subsequent exome sequencing identified a novel de novo heterozygous single base pair duplication (c.6015dupA) in ANKRD11, which was validated by Sanger sequencing. This single-nucleotide duplication is predicted to lead to a premature stop codon and loss of function in ANKRD11, thereby implicating it as contributing to the proband's symptoms and yielding a molecular diagnosis of KBG syndrome. Before molecular diagnosis, this syndrome was not recognized in the proband, as several key features of the disorder were mild and were not recognized by clinicians, further supporting the concept of variable expressivity in many disorders. Although a diagnosis of cerebral folate deficiency has also been given, its significance for the proband's condition remains uncertain. PMID:27900361
Mata López, Sara; Hammond, James J; Rigsby, Madison B; Balog-Alvarez, Cynthia J; Kornegay, Joe N; Nghiem, Peter P
2018-05-29
Boys with Duchenne muscular dystrophy (DMD) have DMD gene mutations, with associated loss of the dystrophin protein and progressive muscle degeneration and weakness. Corticosteroids and palliative support are currently the best treatment options. The long-term benefits of recently approved compounds such as eteplirsen and ataluren remain to be seen. Dogs with naturally occurring dystrophinopathies show progressive disease akin to that of DMD. Accordingly, canine DMD models are useful for studies of pathogenesis and preclinical therapy development. A dystrophin-deficient, male border collie dog was evaluated at the age of 5 months for progressive muscle weakness and dysphagia. Dramatically increased serum creatine kinase levels (41,520 U/L; normal range 59-895 U/L) were seen on a biochemistry panel. Histopathologic changes characteristic of dystrophinopathy were seen. Dystrophin was absent in the skeletal muscle on immunofluorescence microscopy and western blot. Whole genome sequencing, polymerase chain reaction, and Sanger sequencing revealed a frameshift, single nucleotide deletion in canine DMD exon 20, position 27,626,466 (c.2841delT mRNA), resulting in a stop codon six nucleotides downstream. Semen was archived for future line perpetuation. This spontaneous canine dystrophinopathy occurred due to a novel mutation in the minor DMD mutation hotspot (between exons 2 through 20). Perpetuating this line could allow for preclinical testing of genetic therapies targeted to this area of the DMD gene.
Ge, Zhiyun; Quek, Bao Lin; Beemon, Karen L; Hogg, J Robert
2016-01-01
The nonsense-mediated mRNA decay (NMD) pathway degrades mRNAs containing long 3'UTRs to perform dual roles in mRNA quality control and gene expression regulation. However, expansion of vertebrate 3'UTR functions has required a physical expansion of 3'UTR lengths, complicating the process of detecting nonsense mutations. We show that the polypyrimidine tract binding protein 1 (PTBP1) shields specific retroviral and cellular transcripts from NMD. When bound near a stop codon, PTBP1 blocks the NMD protein UPF1 from binding 3'UTRs. PTBP1 can thus mark specific stop codons as genuine, preserving both the ability of NMD to accurately detect aberrant mRNAs and the capacity of long 3'UTRs to regulate gene expression. Illustrating the wide scope of this mechanism, we use RNA-seq and transcriptome-wide analysis of PTBP1 binding sites to show that many human mRNAs are protected by PTBP1 and that PTBP1 enrichment near stop codons correlates with 3'UTR length and resistance to NMD. DOI: http://dx.doi.org/10.7554/eLife.11155.001 PMID:26744779
Structural insights into eRF3 and stop codon recognition by eRF1
Cheng, Zhihong; Saito, Kazuki; Pisarev, Andrey V.; Wada, Miki; Pisareva, Vera P.; Pestova, Tatyana V.; Gajda, Michal; Round, Adam; Kong, Chunguang; Lim, Mengkiat; Nakamura, Yoshikazu; Svergun, Dmitri I.; Ito, Koichi; Song, Haiwei
2009-01-01
Eukaryotic translation termination is mediated by two interacting release factors, eRF1 and eRF3, which act cooperatively to ensure efficient stop codon recognition and fast polypeptide release. The crystal structures of human and Schizosaccharomyces pombe full-length eRF1 in complex with eRF3 lacking the GTPase domain revealed details of the interaction between these two factors and marked conformational changes in eRF1 that occur upon binding to eRF3, leading eRF1 to resemble a tRNA molecule. Small-angle X-ray scattering analysis of the eRF1/eRF3/GTP complex suggested that eRF1's M domain contacts eRF3's GTPase domain. Consistently, mutation of Arg192, which is predicted to come in close contact with the switch regions of eRF3, revealed its important role for eRF1's stimulatory effect on eRF3's GTPase activity. An ATP molecule used as a crystallization additive was bound in eRF1's putative decoding area. Mutational analysis of the ATP-binding site shed light on the mechanism of stop codon recognition by eRF1. PMID:19417105
Lozano, Roberto; Ponce, Olga; Ramirez, Manuel; Mostajo, Nelly; Orjeda, Gisella
2012-01-01
The majority of disease resistance (R) genes identified to date in plants encode a nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domain containing protein. Additional domains such as coiled-coil (CC) and TOLL/interleukin-1 receptor (TIR) domains can also be present. In the recently sequenced Solanum tuberosum group phureja genome we used HMM models and manual curation to annotate 435 NBS-encoding R gene homologs and 142 NBS-derived genes that lack the NBS domain. Highly similar homologs for most previously documented Solanaceae R genes were identified. A surprising ∼41% (179) of the 435 NBS-encoding genes are pseudogenes primarily caused by premature stop codons or frameshift mutations. Alignment of 81.80% of the 577 homologs to S. tuberosum group phureja pseudomolecules revealed non-random distribution of the R-genes; 362 of 470 genes were found in high density clusters on 11 chromosomes. PMID:22493716
Martí, Ramon; Nascimento, Andrés; Colomer, Jaume; Lara, Mari C; López-Gallardo, Ester; Ruiz-Pesini, Eduardo; Montoya, Julio; Andreu, Antoni L; Briones, Paz; Pineda, Mercè
2010-08-01
Mitochondrial DNA (mtDNA) depletion syndrome (MDS) is a devastating disorder of infancy caused by a significant reduction of the number of copies of mitochondrial DNA in one or more tissues. We report a Spanish patient with the myopathic form of MDS, harboring two mutations in the thymidine kinase 2 gene (TK2): a previously reported deletion (p.K244del) and a novel nucleotide duplication in the exon 2, generating a frameshift and premature stop codon. Sensorineural hearing loss was a predominant symptom in the patient and a novel feature of MDS due to TK2 mutations. The patient survived up to the age of 8.5 y, which confirms that survival above the age of 5 y is not infrequent in patients with MDS due to TK2 deficiency.
Deep sequencing approaches for the analysis of prokaryotic transcriptional boundaries and dynamics.
James, Katherine; Cockell, Simon J; Zenkin, Nikolay
2017-05-01
The identification of the protein-coding regions of a genome is straightforward due to the universality of start and stop codons. However, the boundaries of the transcribed regions, conditional operon structures, non-coding RNAs and the dynamics of transcription, such as pausing of elongation, are non-trivial to identify, even in the comparatively simple genomes of prokaryotes. Traditional methods for the study of these areas, such as tiling arrays, are noisy, labour-intensive and lack the resolution required for densely-packed bacterial genomes. Recently, deep sequencing has become increasingly popular for the study of the transcriptome due to its lower costs, higher accuracy and single nucleotide resolution. These methods have revolutionised our understanding of prokaryotic transcriptional dynamics. Here, we review the deep sequencing and data analysis techniques that are available for the study of transcription in prokaryotes, and discuss the bioinformatic considerations of these analyses. Copyright © 2017 Elsevier Inc. All rights reserved.
Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L
1986-01-01
Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. Images PMID:3458221
The complete mitochondrial genome of Cricetulus kamensis (Rodentia: Cricetidae).
Kang, Chunlan; Yue, Hao; Liu, Mengyao; Huang, Ting; Liu, Yang; Zhang, Xiuyue; Yue, Bisong; Zeng, Tao; Liu, Shaoying
2016-01-01
The Cricetulus kamensis is endemic to China and is popular as pet. In the present study, the complete mitogenome of C. kamensis was first determined. It was 16,270 bp in length and the composition and arrangement of its genes are analogous to most other mammals. The overall base composition of heavy strand is 33.2% A, 26.8% T, 27.2% C and 12.7% G. The sequence is highly G-C poor (∼40%) and A is the most numerous nucleotide followed by T >C >G, which is similar to other mammalian mitochondrial genomes. It is notable that three extra bases "CAT" were inserted in cytb at the 3' end position and no stop codon was found for this coding region. The mitogenome sequence of C. kamensis could contribute to a better solution of its phylogenetic position and phylogenetic relationship within Cricetinae in the future.
mRNA 3' of the A site bound codon is located close to protein S3 on the human 80S ribosome.
Molotkov, Maxim V; Graifer, Dmitri M; Popugaeva, Elena A; Bulygin, Konstantin N; Meschaninova, Maria I; Ven'yaminova, Aliya G; Karpova, Galina G
2006-07-01
Ribosomal proteins neighboring the mRNA downstream of the codon bound at the decoding site of human 80S ribosomes were identified using three sets of mRNA analogues that contained a UUU triplet at the 5' terminus and a perfluorophenylazide cross-linker at guanosine, adenosine or uridine residues placed at various locations 3' of this triplet. The positions of modified mRNA nucleotides on the ribosome were governed by tRNA(Phe) cognate to the UUU triplet targeted to the P site. Upon mild UV-irradiation, the mRNA analogues cross-linked preferentially to the 40S subunit, to the proteins and to a lesser extent to the 18S rRNA. Cross-linked nucleotides of 18S rRNA were identified previously. In the present study, it is shown that among the proteins the main target for cross-linking with all the mRNA analogues tested was protein S3 (homologous to prokaryotic S3, S3p); minor cross-linking to protein S2 (S5p) was also detected. Both proteins cross-linked to mRNA analogues in the ternary complexes as well as in the binary complexes (without tRNA). In the ternary complexes protein S15 (S19p) also cross-linked, the yield of the cross-link decreased significantly when the modified nucleotide moved from position +5 to position +12 with respect to the first nucleotide of the P site bound codon. In several ternary complexes minor cross-linking to protein S30 was likewise detected. The results of this study indicate that S3 is a key protein at the mRNA binding site neighboring mRNA downstream of the codon at the decoding site in the human ribosome.
Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.
Schuster, W; Unseld, M; Wissinger, B; Brennicke, A
1990-01-01
The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162
[Protein S3 in the human 80S ribosome adjoins mRNA from 3'-side of the A-site codon].
Molotkov, M V; Graĭfer, D M; Popugaeva, E A; Bulygin, K N; Meshchaninova, M I; Ven'iaminova, A G; Karpova, G G
2007-01-01
The protein environment of mRNA 3' of the A-site codon (the decoding site) in the human 80S ribosome was studied using a set of oligoribonucleotide derivatives bearing a UUU triplet at the 5'-end and a perfluoroarylazide group at one of the nucleotide residues at the 3'-end of this triplet. Analogues of mRNA were phased into the ribosome using binding at the tRNAPhe P-site, which recognizes the UUU codon. Mild UV irradiation of ribosome complexes with tRNAPhe and mRNA analogues resulted in the predominant crosslinking of the analogues with the 40S subunit components, mainly with proteins and, to a lesser extent, with rRNA. Among the 40S subunit ribosomal proteins, the S3 protein was the main target for modification in all cases. In addition, minor crosslinking with the S2 protein was observed. The crosslinking with the S3 and S2 proteins occurred both in triple complexes and in the absence of tRNA. Within triple complexes, crosslinking with S15 protein was also found, its efficiency considerably falling when the modified nucleotide was moved from positions +5 to +12 relative to the first codon nucleotide in the P-site. In some cases, crosslinking with the S30 protein was observed, it was most efficient for the derivative containing a photoreactive group at the +7 adenosine residue. The results indicate that the S3 protein in the human ribosome plays a key role in the formation of the mRNA binding site 3' of the codon in the decoding site.
Defining the mRNA recognition signature of a bacterial toxin protein
Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya; ...
2015-10-27
Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop tomore » recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.« less
Defining the mRNA recognition signature of a bacterial toxin protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya
Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop tomore » recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.« less
Selenocysteine incorporation: A trump card in the game of mRNA decay
Shetty, Sumangala P.; Copeland, Paul R.
2015-01-01
The incorporation of the 21st amino acid, selenocysteine (Sec), occurs on mRNAs that harbor in-frame stop codons because the Sec-tRNASec recognizes a UGA codon. This sets up an intriguing interplay between translation elongation, translation termination and the complex machinery that marks mRNAs that contain premature termination codons for degradation, leading to nonsense mediated mRNA decay (NMD). In this review we discuss the intricate and complex relationship between this key quality control mechanism and the process of Sec incorporation in mammals. PMID:25622574
Vidal, Ruben; Révész, Tamas; Rostagno, Agueda; Kim, Eugene; Holton, Janice L.; Bek, Toke; Bojsen-Møller, Marie; Braendgaard, Hans; Plant, Gordon; Ghiso, Jorge; Frangione, Blas
2000-01-01
Familial Danish dementia (FDD), also known as heredopathia ophthalmo-oto-encephalica, is an autosomal dominant disorder characterized by cataracts, deafness, progressive ataxia, and dementia. Neuropathological findings include severe widespread cerebral amyloid angiopathy, hippocampal plaques, and neurofibrillary tangles, similar to Alzheimer's disease. N-terminal sequence analysis of isolated leptomeningeal amyloid fibrils revealed homology to ABri, the peptide originated by a point mutation at the stop codon of gene BRI in familial British dementia. Molecular genetic analysis of the BRI gene in the Danish kindred showed a different defect, namely the presence of a 10-nt duplication (795–796insTTTAATTTGT) between codons 265 and 266, one codon before the normal stop codon 267. The decamer duplication mutation produces a frame-shift in the BRI sequence generating a larger-than-normal precursor protein, of which the amyloid subunit (designated ADan) comprises the last 34 C-terminal amino acids. This de novo-created amyloidogenic peptide, associated with a genetic defect in the Danish kindred, stresses the importance of amyloid formation as a causative factor in neurodegeneration and dementia. PMID:10781099
Sato, Yoshiharu; Shoji, Tatsuma; Yamamoto, Tomoko
2013-01-01
Several posttranscriptional modifications of bacterial rRNAs are important in determining antibiotic resistance or sensitivity. In all Gram-positive bacteria, dimethylation of nucleotide A2058, located in domain V of 23S rRNA, by the dimethyltransferase Erm(B) results in low susceptibility and resistance to telithromycin (TEL). However, this is insufficient to produce high-level resistance to TEL in Streptococcus pneumoniae. Inactivation of the methyltransferase RlmAII, which methylates the N-1 position of nucleotide G748, located in hairpin 35 of domain II of 23S rRNA, results in increased resistance to TEL in erm(B)-carrying S. pneumoniae. Sixteen TEL-resistant mutants (MICs, 16 to 32 μg/ml) were obtained from a clinically isolated S. pneumoniae strain showing low TEL susceptibility (MIC, 2 μg/ml), with mutation resulting in constitutive dimethylation of A2058 because of nucleotide differences in the regulatory region of erm(B) mRNA. Primer extension analysis showed that the degree of methylation at G748 in all TEL-resistant mutants was significantly reduced by a mutation in the gene encoding RlmAII to create a stop codon or change an amino acid residue. Furthermore, RNA footprinting with dimethyl sulfate and a molecular modeling study suggested that methylation of G748 may contribute to the stable interaction of TEL with domain II of 23S rRNA, even after dimethylation of A2058 by Erm(B). This novel finding shows that methylation of G748 by RlmAII renders S. pneumoniae TEL susceptible. PMID:23716046
Takaya, Akiko; Sato, Yoshiharu; Shoji, Tatsuma; Yamamoto, Tomoko
2013-08-01
Several posttranscriptional modifications of bacterial rRNAs are important in determining antibiotic resistance or sensitivity. In all Gram-positive bacteria, dimethylation of nucleotide A2058, located in domain V of 23S rRNA, by the dimethyltransferase Erm(B) results in low susceptibility and resistance to telithromycin (TEL). However, this is insufficient to produce high-level resistance to TEL in Streptococcus pneumoniae. Inactivation of the methyltransferase RlmA(II), which methylates the N-1 position of nucleotide G748, located in hairpin 35 of domain II of 23S rRNA, results in increased resistance to TEL in erm(B)-carrying S. pneumoniae. Sixteen TEL-resistant mutants (MICs, 16 to 32 μg/ml) were obtained from a clinically isolated S. pneumoniae strain showing low TEL susceptibility (MIC, 2 μg/ml), with mutation resulting in constitutive dimethylation of A2058 because of nucleotide differences in the regulatory region of erm(B) mRNA. Primer extension analysis showed that the degree of methylation at G748 in all TEL-resistant mutants was significantly reduced by a mutation in the gene encoding RlmA(II) to create a stop codon or change an amino acid residue. Furthermore, RNA footprinting with dimethyl sulfate and a molecular modeling study suggested that methylation of G748 may contribute to the stable interaction of TEL with domain II of 23S rRNA, even after dimethylation of A2058 by Erm(B). This novel finding shows that methylation of G748 by RlmA(II) renders S. pneumoniae TEL susceptible.
Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A
1997-05-01
The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
Ribosomes slide on lysine-encoding homopolymeric A stretches
Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel
2015-01-01
Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Qian, Chaoju; Yan, Xia; Guo, Zhichun; Wang, Yuanxiu; Li, Xixi; Yang, Jianke; Kan, Xianzhao
2013-08-01
The complete Grey-backed Shrike mitochondrial genome has been sequenced to be 16,820 bp in length, consisting of 37 encode genes: 13 protein-coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes. In addition, a single control region was also observed. Compared with other reported Passeriformes mtgenome sequences, three bases CAA were detected at the end of Lanius tephronotus cox2 gene with the downstream adjacent base T. The first base of CAA probably occurred C to U transcript editing event resulting in a normal stop codon UAA.
Conserved small mRNA with an unique, extended Shine-Dalgarno sequence
Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald
2017-01-01
ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614
Blanca, Giuseppina; Baldanti, Fausto; Paolucci, Stefania; Skoblov, Alexander Yu; Victorova, Lyubov; Hübscher, Ulrich; Gerna, Giuseppe; Spadari, Silvio; Maga, Giovanni
2003-05-02
Recombinant HIV-1 reverse transcriptase (RT) carrying non-nucleoside inhibitors (NNRTIs) resistance mutation at codon 181 showed reduced incorporation and high efficiency of phosphorolytic removal of stavudine, a nucleoside RT inhibitor. These results reveal a new mechanism for cross-resistance between different classes of HIV-1 RT inhibitors.
Cárdenas-Ramos, Susana G; Alcázar-González, Gregorio; Reyes-Cortés, Luisa M; Torres-Grimaldo, Abdiel A; Calderón-Garcidueñas, Ana L; Morales-Casas, José; Flores-Sánchez, Patricia; De León-Escobedo, Raúl; Gómez-Díaz, Antonio; Moreno-Bringas, Carmen; Sánchez-Guillén, Jorge; Ramos-Salazar, Pedro; González-de León, César; Barrera-Saldaña, Hugo A
2017-06-01
Current metastatic colorectal cancer (mCRC) therapy uses monoclonal antibodies against the epidermal growth factor receptor. This treatment is only useful in the absence of K-RAS gene mutations; therefore the study of such mutations is part of a personalized treatment. The aim of this work is to determine the frequency and type of the most common K-RAS mutations in Mexican patients with metastatic disease by nucleotide sequencing. We studied 888 patients with mCRC from different regions of Mexico. The presence of mutations in exon 2, codons 12 and 13, of the K-RAS gene was determined by nucleotide sequencing. Patients exhibited K-RAS gene mutations in 35% (310/888) of cases. Mutation frequency of codons 12 and 13 was 71% (221/310) and 29% (89/310), respectively. The most common mutation (45.7%) in codon 12 was c.35G>A (p.G12D), whereas the one in codon 13 was c.38G>A (p.G13D) (78.7%). Given the frequency of K-RAS mutations in Mexicans, making a genetic study before deciding to treat mCRC patients with monoclonal antibodies is indispensable.
Prevost, Luanna B; Smith, Michelle K; Knight, Jennifer K
2016-01-01
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon mutation potentially impacts replication, transcription, and translation. We then used computer-assisted lexical analysis combined with human scoring to categorize student responses. The lexical analysis models showed high agreement with human scoring, demonstrating that this approach can be successfully used to analyze large numbers of student written responses. The results of this analysis show that students' ideas about one process in the central dogma can affect their thinking about subsequent and previous processes, leading to mixed models of conceptual understanding. © 2016 L. B. Prevost et al. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.
Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y
2013-02-27
We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon
Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier
2008-01-01
Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Gornik, S. G.; Waller, R. F.
2012-01-01
The sister phyla dinoflagellates and apicomplexans inherited a drastically reduced mitochondrial genome (mitochondrial DNA, mtDNA) containing only three protein-coding (cob, cox1, and cox3) genes and two ribosomal RNA (rRNA) genes. In apicomplexans, single copies of these genes are encoded on the smallest known mtDNA chromosome (6 kb). In dinoflagellates, however, the genome has undergone further substantial modifications, including massive genome amplification and recombination resulting in multiple copies of each gene and gene fragments linked in numerous combinations. Furthermore, protein-encoding genes have lost standard stop codons, trans-splicing of messenger RNAs (mRNAs) is required to generate complete cox3 transcripts, and extensive RNA editing recodes most genes. From taxa investigated to date, it is unclear when many of these unusual dinoflagellate mtDNA characters evolved. To address this question, we investigated the mitochondrial genome and transcriptome character states of the deep branching dinoflagellate Hematodinium sp. Genomic data show that like later-branching dinoflagellates Hematodinium sp. also contains an inflated, heavily recombined genome of multicopy genes and gene fragments. Although stop codons are also lacking for cox1 and cob, cox3 still encodes a conventional stop codon. Extensive editing of mRNAs also occurs in Hematodinium sp. The mtDNA of basal dinoflagellate Hematodinium sp. indicates that much of the mtDNA modification in dinoflagellates occurred early in this lineage, including genome amplification and recombination, and decreased use of standard stop codons. Trans-splicing, on the other hand, occurred after Hematodinium sp. diverged. Only RNA editing presents a nonlinear pattern of evolution in dinoflagellates as this process occurs in Hematodinium sp. but is absent in some later-branching taxa indicating that this process was either lost in some lineages or developed more than once during the evolution of the highly unusual dinoflagellate mtDNA. PMID:22113794
Jackson, C J; Gornik, S G; Waller, R F
2012-01-01
The sister phyla dinoflagellates and apicomplexans inherited a drastically reduced mitochondrial genome (mitochondrial DNA, mtDNA) containing only three protein-coding (cob, cox1, and cox3) genes and two ribosomal RNA (rRNA) genes. In apicomplexans, single copies of these genes are encoded on the smallest known mtDNA chromosome (6 kb). In dinoflagellates, however, the genome has undergone further substantial modifications, including massive genome amplification and recombination resulting in multiple copies of each gene and gene fragments linked in numerous combinations. Furthermore, protein-encoding genes have lost standard stop codons, trans-splicing of messenger RNAs (mRNAs) is required to generate complete cox3 transcripts, and extensive RNA editing recodes most genes. From taxa investigated to date, it is unclear when many of these unusual dinoflagellate mtDNA characters evolved. To address this question, we investigated the mitochondrial genome and transcriptome character states of the deep branching dinoflagellate Hematodinium sp. Genomic data show that like later-branching dinoflagellates Hematodinium sp. also contains an inflated, heavily recombined genome of multicopy genes and gene fragments. Although stop codons are also lacking for cox1 and cob, cox3 still encodes a conventional stop codon. Extensive editing of mRNAs also occurs in Hematodinium sp. The mtDNA of basal dinoflagellate Hematodinium sp. indicates that much of the mtDNA modification in dinoflagellates occurred early in this lineage, including genome amplification and recombination, and decreased use of standard stop codons. Trans-splicing, on the other hand, occurred after Hematodinium sp. diverged. Only RNA editing presents a nonlinear pattern of evolution in dinoflagellates as this process occurs in Hematodinium sp. but is absent in some later-branching taxa indicating that this process was either lost in some lineages or developed more than once during the evolution of the highly unusual dinoflagellate mtDNA.
Site-specific creation of uridine from cytidine in apolipoprotein B mRNA editing.
Hodges, P E; Navaratnam, N; Greeve, J C; Scott, J
1991-01-01
Human apolipoprotein (apo) B mRNA is edited in a tissue specific reaction, to convert glutamine codon 2153 (CAA) to a stop translation codon. The RNA editing product templates and hybridises as uridine, but the chemical nature of this reaction and the physical identity of the product are unknown. After editing in vitro of [32P] labelled RNA, we are able to demonstrate the production of uridine from cytidine; [alpha 32P] cytidine triphosphate incorporated into RNA gave rise to [32P] uridine monophosphate after editing in vitro, hydrolysis with nuclease P1 and thin layer chromatography using two separation systems. By cleaving the RNA into ribonuclease T1 fragments, we show that uridine is produced only at the authentic editing site and is produced in quantities that parallel an independent primer extension assay for editing. We conclude that apo B mRNA editing specifically creates a uridine from a cytidine. These observations are inconsistent with the incorporation of a uridine nucleotide by any polymerase, which would replace the alpha-phosphate and so rule out a model of endonucleolytic excision and repair as the mechanism for the production of uridine. Although transamination and transglycosylation remain to be formally excluded as reaction mechanisms our results argue strongly in favour of the apo B mRNA editing enzyme as a site-specific cytidine deaminase. Images PMID:2030940
Successful COG8 and PDF overlap is mediated by alterations in splicing and polyadenylation signals.
Pereira-Castro, Isabel; Quental, Rita; da Costa, Luís T; Amorim, António; Azevedo, Luisa
2012-02-01
Although gene-free areas compose the great majority of eukaryotic genomes, a significant fraction of genes overlaps, i.e., unique nucleotide sequences are part of more than one transcription unit. In this work, the evolutionary history and origin of a same-strand gene overlap is dissected through the analysis of COG8 (component of oligomeric Golgi complex 8) and PDF (peptide deformylase). Comparative genomic surveys reveal that the relative locations of these two genes have been changing over the last 445 million years from distinct chromosomal locations in fish to overlapping in rodents and primates, indicating that the overlap between these genes precedes their divergence. The overlap between the two genes was initiated by the gain of a novel splice donor site between the COG8 stop codon and PDF initiation codon. Splicing is accomplished by the use of the PDF acceptor, leading COG8 to share the 3'end with PDF. In primates, loss of the ancestral polyadenylation signal for COG8 makes the overlap between COG8 and PDF mandatory, while in mouse and rat concurrent overlapping and non-overlapping Cog8 transcripts exist. Altogether, we demonstrate that the origin, evolution and preservation of the COG8/PDF same-strand overlap follow similar mechanistic steps as those documented for antisense overlaps where gain and/or loss of splice sites and polyadenylation signals seems to drive the process.
Primary hyperoxaluria type 1: a cluster of new mutations in exon 7 of the AGXT gene.
von Schnakenburg, C; Rumsby, G
1997-06-01
Primary hyperoxaluria type 1 (PH1) is a severe autosomal recessive inborn error of glyoxylate metabolism caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase. This enzyme is encoded by the AGXT gene on chromosome 2q37.3. DNA samples from 79 PH1 patients were studied using single strand conformation polymorphism analysis to detect sequence variants, which were then characterised by direct sequencing and confirmed by restriction enzyme digestion. Four novel mutations were identified in exon 7 of AGXT: a point mutation T853C, which leads to a predicted Ile244Thr amino acid substitution, occurred in nine patients. Two other mutations in adjacent nucleotides, C819T and G820A, mutated the same codon at residue 233 from arginine to cysteine and histidine, respectively. The fourth mutation, G860A, introduced a stop codon at amino acid residue 246. Enzyme studies in these patients showed that AGT catalytic activity was either very low or absent and that little or no immunoreactive protein was present. Together with a new polymorphism in exon 11 (C1342A) these findings underline the genetic heterogeneity of the AGXT gene. The novel mutation T853C is the second most common mutation found to date with an allelic frequency of 9% and will therefore be of clinical importance for the diagnosis of PH1.
Primary hyperoxaluria type 1: a cluster of new mutations in exon 7 of the AGXT gene.
von Schnakenburg, C; Rumsby, G
1997-01-01
Primary hyperoxaluria type 1 (PH1) is a severe autosomal recessive inborn error of glyoxylate metabolism caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase. This enzyme is encoded by the AGXT gene on chromosome 2q37.3. DNA samples from 79 PH1 patients were studied using single strand conformation polymorphism analysis to detect sequence variants, which were then characterised by direct sequencing and confirmed by restriction enzyme digestion. Four novel mutations were identified in exon 7 of AGXT: a point mutation T853C, which leads to a predicted Ile244Thr amino acid substitution, occurred in nine patients. Two other mutations in adjacent nucleotides, C819T and G820A, mutated the same codon at residue 233 from arginine to cysteine and histidine, respectively. The fourth mutation, G860A, introduced a stop codon at amino acid residue 246. Enzyme studies in these patients showed that AGT catalytic activity was either very low or absent and that little or no immunoreactive protein was present. Together with a new polymorphism in exon 11 (C1342A) these findings underline the genetic heterogeneity of the AGXT gene. The novel mutation T853C is the second most common mutation found to date with an allelic frequency of 9% and will therefore be of clinical importance for the diagnosis of PH1. Images PMID:9192270
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
2012-01-01
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.
2009-01-01
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A
2007-07-01
A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.
2012-01-01
Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225
Characterization of the porcine epidemic diarrhea virus codon usage bias.
Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong
2014-12-01
Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
Maurer, B; Bannert, H; Darai, G; Flügel, R M
1988-01-01
The nucleotide sequence of the human spumaretrovirus (HSRV) genome was determined. The 5' long terminal repeat region was analyzed by strong stop cDNA synthesis and S1 nuclease mapping. The length of the RU5 region was determined and found to be 346 nucleotides long. The 5' long terminal repeat is 1,123 base pairs long and is bound by an 18-base-pair primer-binding site complementary to the 3' end of mammalian lysine-1,2-specific tRNA. Open reading frames for gag and pol genes were identified. Surprisingly, the HSRV gag protein does not contain the cysteine motif of the nucleic acid-binding proteins found in and typical of all other retroviral gag proteins; instead the HSRV gag gene encodes a strongly basic protein reminiscent of those of hepatitis B virus and retrotransposons. The carboxy-terminal part of the HSRV gag gene products encodes a protease domain. The pol gene overlaps the gag gene and is postulated to be synthesized as a gag/pol precursor via translational frameshifting analogous to that of Rous sarcoma virus, with 7 nucleotides immediately upstream of the termination codons of gag conserved between the two viral genomes. The HSRV pol gene is 2,730 nucleotides long, and its deduced protein sequence is readily subdivided into three well-conserved domains, the reverse transcriptase, the RNase H, and the integrase. Although the degree of homology of the HSRV reverse transcriptase domain is highest to that of murine leukemia virus, the HSRV genomic organization is more similar to that of human and simian immunodeficiency viruses. The data justify classifying the spumaretroviruses as a third subfamily of Retroviridae. Images PMID:2451755
NASA Technical Reports Server (NTRS)
Holmquist, R.; Pearl, D.
1980-01-01
Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Dansault, Anouk; David, Gabriel; Schwartz, Claire; Jaliffa, Carolina; Vieira, Véronique; de la Houssaye, Guillaume; Bigot, Karine; Catin, Françise; Tattu, Laurent; Chopin, Catherine; Halimi, Philippe; Roche, Olivier; Van Regemorter, Nicole; Munier, Francis; Schorderet, Daniel; Dufier, Jean-Louis; Marsac, Cécile; Ricquier, Daniel; Menasche, Maurice; Penfornis, Alfred; Abitbol, Marc
2007-04-02
The PAX6 gene was first described as a candidate for human aniridia. However, PAX6 expression is not restricted to the eye and it appears to be crucial for brain development. We studied PAX6 mutations in a large spectrum of patients who presented with aniridia phenotypes, Peters' anomaly, and anterior segment malformations associated or not with neurological anomalies. Patients and related families were ophthalmologically phenotyped, and in some cases neurologically and endocrinologically examined. We screened the PAX6 gene by direct sequencing in three groups of patients: those affected by aniridia; those with diverse ocular manifestations; and those with Peters' anomaly. Two mutations were investigated by generating crystallographic representations of the amino acid changes. Three novel heterozygous mutations affecting three unrelated families were identified: the g.572T>C nucleotide change, located in exon 5, and corresponding to the Leucine 46 Proline amino-acid mutation (L46P); the g.655A>G nucleotide change, located in exon 6, and corresponding to the Serine 74 Glycine amino-acid mutation (S74G); and the nucleotide deletion 579delG del, located in exon 6, which induces a frameshift mutation leading to a stop codon (V48fsX53). The L46P mutation was identified in affected patients presenting bilateral microphthalmia, cataracts, and nystagmus. The S74G mutation was found in a large family that had congenital ocular abnormalities, diverse neurological manifestations, and variable cognitive impairments. The 579delG deletion (V48fsX53) caused in the affected members of the same family bilateral aniridia associated with congenital cataract, foveal hypolasia, and nystagmus. We also detected a novel intronic nucleotide change, IVS2+9G>A (very likely a mutation) in an apparently isolated patient affected by a complex ocular phenotype, characterized primarily by a bilateral microphthalmia. Whether this nucleotide change is indeed pathogenic remains to be demonstrated. Two previously known heterozygous mutations of the PAX6 gene sequence were also detected in patients affected by aniridia: a de novo previously known nucleotide change, g.972C>T (Q179X), in exon 8, leading to a stop codon and a heterozygous g.555C>A (C40X) recurrent nonsense mutation in exon 5. No mutations were found in patients with Peters' anomaly. We identified three mutations associated with aniridia phenotypes (Q179X, C40X, and V48fsX53). The three other mutations reported here cause non-aniridia ocular phenotypes associated in some cases with neurological anomalies. The IVS2+9G>A nucleotide change was detected in a patient with a microphthalmia phenotype. The L46P mutation was detected in a family with microphthalmia, cataract, and nystagmus. This mutation is located in the DNA-binding paired-domain and the crystallographic representations of this mutation show that this mutation may affect the helix-turn-helix motif, and as a consequence the DNA-binding properties of the resulting mutated protein. Ser74 is located in the PAX6 PD linker region, essential for DNA recognition and DNA binding, and the side chain of the Ser74 contributes to DNA recognition by the linker domain through direct contacts. Crystallographic representations show that the S74G mutation results in no side chain and therefore perturbs the DNA-binding properties of PAX6. This study highlights the severity and diversity of the consequences of PAX6 mutations that appeared to result from the complexity of the PAX6 gene structure, and the numerous possibilities for DNA binding. This study emphasizes the fact that neurodevelopmental abnormalities may be caused by PAX6 mutations. The neuro-developmental abnormalities caused by PAX6 mutations are probably still overlooked in the current clinical examinations performed throughout the world in patients affected by PAX6 mutations.
Dansault, Anouk; David, Gabriel; Schwartz, Claire; Jaliffa, Carolina; Vieira, Véronique; de la Houssaye, Guillaume; Bigot, Karine; Catin, Françise; Tattu, Laurent; Chopin, Catherine; Halimi, Philippe; Roche, Olivier; Van Regemorter, Nicole; Munier, Francis; Schorderet, Daniel; Dufier, Jean-Louis; Marsac, Cécile; Ricquier, Daniel; Menasche, Maurice; Penfornis, Alfred
2007-01-01
Purpose The PAX6 gene was first described as a candidate for human aniridia. However, PAX6 expression is not restricted to the eye and it appears to be crucial for brain development. We studied PAX6 mutations in a large spectrum of patients who presented with aniridia phenotypes, Peters' anomaly, and anterior segment malformations associated or not with neurological anomalies. Methods Patients and related families were ophthalmologically phenotyped, and in some cases neurologically and endocrinologically examined. We screened the PAX6 gene by direct sequencing in three groups of patients: those affected by aniridia; those with diverse ocular manifestations; and those with Peters' anomaly. Two mutations were investigated by generating crystallographic representations of the amino acid changes. Results Three novel heterozygous mutations affecting three unrelated families were identified: the g.572T>C nucleotide change, located in exon 5, and corresponding to the Leucine 46 Proline amino-acid mutation (L46P); the g.655A>G nucleotide change, located in exon 6, and corresponding to the Serine 74 Glycine amino-acid mutation (S74G); and the nucleotide deletion 579delG del, located in exon 6, which induces a frameshift mutation leading to a stop codon (V48fsX53). The L46P mutation was identified in affected patients presenting bilateral microphthalmia, cataracts, and nystagmus. The S74G mutation was found in a large family that had congenital ocular abnormalities, diverse neurological manifestations, and variable cognitive impairments. The 579delG deletion (V48fsX53) caused in the affected members of the same family bilateral aniridia associated with congenital cataract, foveal hypolasia, and nystagmus. We also detected a novel intronic nucleotide change, IVS2+9G>A (very likely a mutation) in an apparently isolated patient affected by a complex ocular phenotype, characterized primarily by a bilateral microphthalmia. Whether this nucleotide change is indeed pathogenic remains to be demonstrated. Two previously known heterozygous mutations of the PAX6 gene sequence were also detected in patients affected by aniridia: a de novo previously known nucleotide change, g.972C>T (Q179X), in exon 8, leading to a stop codon and a heterozygous g.555C>A (C40X) recurrent nonsense mutation in exon 5. No mutations were found in patients with Peters' anomaly. Conclusions We identified three mutations associated with aniridia phenotypes (Q179X, C40X, and V48fsX53). The three other mutations reported here cause non-aniridia ocular phenotypes associated in some cases with neurological anomalies. The IVS2+9G>A nucleotide change was detected in a patient with a microphthalmia phenotype. The L46P mutation was detected in a family with microphthalmia, cataract, and nystagmus. This mutation is located in the DNA-binding paired-domain and the crystallographic representations of this mutation show that this mutation may affect the helix-turn-helix motif, and as a consequence the DNA-binding properties of the resulting mutated protein. Ser74 is located in the PAX6 PD linker region, essential for DNA recognition and DNA binding, and the side chain of the Ser74 contributes to DNA recognition by the linker domain through direct contacts. Crystallographic representations show that the S74G mutation results in no side chain and therefore perturbs the DNA-binding properties of PAX6. This study highlights the severity and diversity of the consequences of PAX6 mutations that appeared to result from the complexity of the PAX6 gene structure, and the numerous possibilities for DNA binding. This study emphasizes the fact that neurodevelopmental abnormalities may be caused by PAX6 mutations. The neuro-developmental abnormalities caused by PAX6 mutations are probably still overlooked in the current clinical examinations performed throughout the world in patients affected by PAX6 mutations. PMID:17417613
Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun
2017-12-20
Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL -1 and 1204 U·mL -1 , respectively.
Vanlalruati, Catherine; Mandal, Surajit De; Gurusubramanian, Guruswami; Senthil Kumar, Nachimuthu
2016-07-01
The complete mitochondrial genome of Junonia iphita was determined to be 15,433 bp in length, including 37 typical mitochondrial genes and an AT-rich region. All the protein coding genes (PCGs) are initiated by typical ATN codons, except cox1 gene that is by CGA codon. Eight genes use complete termination codon (TAA), whereas the cox1, cox2 and nad5 genes end with single T; nad4 and nad1 ends with stop codon TA. All the tRNA show secondary cloverleaf structures except trnS1 (AGN). The A + T rich region is 546 bp in length containing ATAGA motif followed by a 18 bp poly-T stretch, two microsatellite-like (TA)9 elements and 8 bp poly-A stretch immediately upstream of trnM gene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Benyo, B; Biro, J C; Benyo, Z
2004-01-01
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C
2015-04-01
The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
New Universal Rules of Eukaryotic Translation Initiation Fidelity
Zur, Hadas; Tuller, Tamir
2013-01-01
The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.
1992-01-01
The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
Meggouh, F; Benomar, A; Rouger, H; Tardieu, S; Birouk, N; Tassin, J; Barhoumi, C; Yahyaoui, M; Chkili, T; Brice, A; LeGuern, E
1998-01-01
X linked Charcot-Marie-Tooth disease (CMTX) is a hereditary motor and sensory neuropathy caused by mutations in the connexin 32 gene (Cx32). Using the SSCP technique and direct sequencing of PCR amplified genomic DNA fragments of the Cx32 gene from a Moroccan patient and her relatives, we identified the first de novo mutation of the Cx32 gene, consisting of a deletion of a G residue at position 499 in the Cx32 open reading frame. This previously unreported mutation produces a frameshift at position 147 in the protein and introduces a premature stop codon (TAG) at nucleotide 643, which results in the production of a truncated Cx32 molecule. This mutation illustrates the risk of an erroneous diagnosis of autosomal recessive CMT, especially in populations where consanguineous unions are frequent, and its consequences for genetic counselling, which can be avoided by molecular analysis. Images PMID:9541114
Sun, Yu; Tamarit, Daniel
2017-01-01
Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Lim, Chee Kent; Tan, Joanne Tsui Ming; Khoo, Jason Boo Siang; Ravichandran, Aarthi; Low, Hsin Mei; Chan, Yin Chyi; Ton, So Har
2006-01-01
This study was carried out to determine the effects of hepatitis B virus genotypes, core promoter mutations (A1762G1764→T1762A1764) as well as precore stop codon mutations (TGG→TAG) on HBeAg expression and HBeAg/ anti-HBe status. Study was also performed on the effects of codon 15 variants (C1858/ T1858) on the predisposition of precore stop codon mutations (TGG→TAG). A total of 77 sera samples were analyzed. Fifty one samples were successfully genotyped of which the predominant genotype was genotype B (29/ 51, 56.9 %), followed by genotype C (16/ 51, 31.4 %). Co-infections by genotypes B and C were observed in four samples (7.8 %). To a lesser degree, genotypes D and E (2.0 % each) were also observed. For core promoter mutations, the prevalence was 68.8 % (53/ 77) for A1762G1764 wild-type and 14.3 % (11/ 77) for T1762A1764 mutant while 9.1 % (7/ 77) was co-infected by both strains. The prevalence of codon 15 variants was found to be 42.9 % (33/ 77) for T1858 variant and 16.9 % (13/ 77) for C1858 variant. No TAG mutation was found. In our study, no associations were found between genotypes (B and C) and core promoter mutations as well as codon 15 variants. Also no correlation was observed between HBeAg/ anti-HBe status with genotypes (B and C) and core promoter mutations. PMID:16421626
Drosophila Melanogaster Mitochondrial DNA: Gene Organization and Evolutionary Considerations
Garesse, R.
1988-01-01
The sequence of a 8351-nucleotide mitochondrial DNA (mtDNA) fragment has been obtained extending the knowledge of the Drosophila melanogaster mitochondrial genome to 90% of its coding region. The sequence encodes seven polypeptides, 12 tRNAs and the 3' end of the 16S rRNA and CO III genes. The gene organization is strictly conserved with respect to the Drosophila yakuba mitochondrial genome, and different from that found in mammals and Xenopus. The high A + T content of D. melanogaster mitochondrial DNA is reflected in a reiterative codon usage, with more than 90% of the codons ending in T or A, G + C rich codons being practically absent. The average level of homology between the D. melanogaster and D. yakuba sequences is very high (roughly 94%), although insertion and deletions have been detected in protein, tRNA and large ribosomal genes. The analysis of nucleotide changes reveals a similar frequency for transitions and transversions, and reflects a strong bias against G+C on both strands. The predominant type of transition is strand specific. PMID:3130291
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.
Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou
2017-04-20
Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species
2006-01-01
Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L
1988-01-01
Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Analysis of the cbhE' plasmid gene from acute disease-causing isolates of Coxiella burnetii.
Minnick, M F; Small, C L; Frazier, M E; Mallavia, L P
1991-07-15
A gene termed cbhE' was cloned from the QpH1 plasmid of Coxiella burnetii. Expression of recombinants containing cbhE' in vitro and in Escherichia coli maxicells, produced an insert-encoded polypeptide of approx. 42 kDa. The CbhE protein was not cleaved when intact maxicells were treated with trypsin. Hybridizations of total DNA isolated from the six strains of C. burnetii indicate that this gene is unique to C. burnetii strains associated with acute disease, i.e., Hamilton[I], Vacca[II], and Rasche[III]. The cbhE' gene was not detected in strains associated with chronic disease (Biotzere[IV] and Corazon[V]) or the Dod[VI] strain. The cbhE' open reading frame (ORF) is 1022 bp in length and is preceded by a predicted promoter/Shine-Dalgarno (SD) region of TCAACT(-35)-N16-TAAAAT(-10)-N14-AGAAGGA (SD) located 10 nucleotides (nt) before the presumed AUG start codon. The ORF ends with a single UAA stop codon and has no apparent Rho-factor-independent terminator following it. The cbhE' gene codes for the CbhE protein of 341 amino acid (aa) residues with a deduced Mr of 39,442. CbhE is predominantly hydrophilic with a predicted pI of 4.43. The function of CbhE is unknown. No nt or aa sequences with homology to cbhE' or CbhE, respectively, were found in searches of a number of data bases.
Jailani, A Abdul Kader; Solanki, Vikas; Roy, Anirban; Sivasudha, T; Mandal, Bikash
2017-04-02
A highly infectious clone of Cucumber green mottle mosaic virus (CGMMV), a cucurbit-infecting tobamovirus was utilized for designing of gene expression vectors. Two versions of vector were examined for their efficacy in expressing the green fluorescent protein (GFP) in Nicotiana benthamiana. When the GFP gene was inserted at the stop codon of coat protein (CP) gene of the CGMMV genome without any read-through codon, systemic expression of GFP, as well as virion formation and systemic symptoms expression were obtained in N. benthamiana. The qRT-PCR analysis showed 23 fold increase of GFP over actin at 10days post inoculation (dpi), which increased to 45 fold at 14dpi and thereafter the GFP expression was significantly declined. Further, we show that when the most of the CP sequence is deleted retaining only the first 105 nucleotides, the shortened vector containing GFP in frame of original CP open reading frame (ORF) resulted in 234 fold increase of GFP expression over actin at 5dpi in N. benthamiana without the formation of virions and disease symptoms. Our study demonstrated that a simple manipulation of CP gene in the CGMMV genome while preserving the translational frame of CP resulted in developing a virus-free, rapid and efficient foreign protein expression system in the plant. The CGMMV based vectors developed in this study may be potentially useful for the production of edible vaccines in cucurbits. Copyright © 2017 Elsevier B.V. All rights reserved.
Two cloned β thalassemia genes are associated with amber mutations at codon 39
Pergolizzi, Robert; Spritz, Richard A.; Spence, Sally; Goossens, Michel; Kan, Yuet Wai; Bank, Arthur
1981-01-01
Two β globin genes from patients with the β+ thalassemia phenotype have been cloned and sequenced. A single nucleotide change from CAG to TAG (an amber mutation) at codon 39 is the only difference from normal in both genes analyzed. The results are consistent with the assumption that both patients are doubly heterozygous for β+ and β° thalassemia, and that we have isolated and analyzed the β° thalassemia gene. Images PMID:6278453
Energetics of codon-anticodon recognition on the small ribosomal subunit.
Almlöf, Martin; Andér, Martin; Aqvist, Johan
2007-01-09
Recent crystal structures of the small ribosomal subunit have made it possible to examine the detailed energetics of codon recognition on the ribosome by computational methods. The binding of cognate and near-cognate anticodon stem loops to the ribosome decoding center, with mRNA containing the Phe UUU and UUC codons, are analyzed here using explicit solvent molecular dynamics simulations together with the linear interaction energy (LIE) method. The calculated binding free energies are in excellent agreement with experimental binding constants and reproduce the relative effects of mismatches in the first and second codon position versus a mismatch at the wobble position. The simulations further predict that the Leu2 anticodon stem loop is about 10 times more stable than the Ser stem loop in complex with the Phe UUU codon. It is also found that the ribosome significantly enhances the intrinsic stability differences of codon-anticodon complexes in aqueous solution. Structural analysis of the simulations confirms the previously suggested importance of the universally conserved nucleotides A1492, A1493, and G530 in the decoding process.
Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron
Hiesel, Rudolf; Brennicke, Axel
1983-01-01
The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484
Model for Codon Position Bias in RNA Editing
NASA Astrophysics Data System (ADS)
Liu, Tsunglin; Bundschuh, Ralf
2005-08-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
A model for codon position bias in RNA editing
NASA Astrophysics Data System (ADS)
Bundschuh, Ralf; Liu, Tsunglin
2006-03-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
Jia, Wan-Zhong; Yan, Hong-Bin; Guo, Ai-Jiang; Zhu, Xing-Quan; Wang, Yu-Chao; Shi, Wan-Gui; Chen, Hao-Tai; Zhan, Fang; Zhang, Shao-Hua; Fu, Bao-Quan; Littlewood, D Timothy J; Cai, Xue-Peng
2010-07-22
Mitochondrial genomes provide a rich source of molecular variation of proven and widespread utility in molecular ecology, population genetics and evolutionary biology. The tapeworm genus Taenia includes a diversity of tapeworm parasites of significant human and veterinary importance. Here we add complete sequences of the mt genomes of T. multiceps, T. hydatigena and T. pisiformis, to a data set of 4 published mtDNAs in the same genus. Seven complete mt genomes of Taenia species are used to compare and contrast variation within and between genomes in the genus, to estimate a phylogeny for the genus, and to develop novel molecular markers as part of an extended mitochondrial toolkit. The complete circular mtDNAs of T. multiceps, T. hydatigena and T. pisiformis were 13,693, 13,492 and 13,387 bp in size respectively, comprising the usual complement of flatworm genes. Start and stop codons of protein coding genes included those found commonly amongst other platyhelminth mt genomes, but the much rarer initiation codon GTT was inferred for the gene atp6 in T. pisiformis. Phylogenetic analysis of mtDNAs offered novel estimates of the interrelationships of Taenia. Sliding window analyses showed nad6, nad5, atp6, nad3 and nad2 are amongst the most variable of genes per unit length, with the highest peaks in nucleotide diversity found in nad5. New primer pairs capable of amplifying fragments of variable DNA in nad1, rrnS and nad5 genes were designed in silico and tested as possible alternatives to existing mitochondrial markers for Taenia. With the availability of complete mtDNAs of 7 Taenia species, we have shown that analysis of amino acids provides a robust estimate of phylogeny for the genus that differs markedly from morphological estimates or those using partial genes; with implications for understanding the evolutionary radiation of important Taenia. Full alignment of the nucleotides of Taenia mtDNAs and sliding window analysis suggests numerous alternative gene regions are likely to capture greater nucleotide variation than those currently pursued as molecular markers. New PCR primers developed from a comparative mitogenomic analysis of Taenia species, extend the use of mitochondrial markers for molecular ecology, population genetics and diagnostics.
2010-01-01
Background Mitochondrial genomes provide a rich source of molecular variation of proven and widespread utility in molecular ecology, population genetics and evolutionary biology. The tapeworm genus Taenia includes a diversity of tapeworm parasites of significant human and veterinary importance. Here we add complete sequences of the mt genomes of T. multiceps, T. hydatigena and T. pisiformis, to a data set of 4 published mtDNAs in the same genus. Seven complete mt genomes of Taenia species are used to compare and contrast variation within and between genomes in the genus, to estimate a phylogeny for the genus, and to develop novel molecular markers as part of an extended mitochondrial toolkit. Results The complete circular mtDNAs of T. multiceps, T. hydatigena and T. pisiformis were 13,693, 13,492 and 13,387 bp in size respectively, comprising the usual complement of flatworm genes. Start and stop codons of protein coding genes included those found commonly amongst other platyhelminth mt genomes, but the much rarer initiation codon GTT was inferred for the gene atp6 in T. pisiformis. Phylogenetic analysis of mtDNAs offered novel estimates of the interrelationships of Taenia. Sliding window analyses showed nad6, nad5, atp6, nad3 and nad2 are amongst the most variable of genes per unit length, with the highest peaks in nucleotide diversity found in nad5. New primer pairs capable of amplifying fragments of variable DNA in nad1, rrnS and nad5 genes were designed in silico and tested as possible alternatives to existing mitochondrial markers for Taenia. Conclusions With the availability of complete mtDNAs of 7 Taenia species, we have shown that analysis of amino acids provides a robust estimate of phylogeny for the genus that differs markedly from morphological estimates or those using partial genes; with implications for understanding the evolutionary radiation of important Taenia. Full alignment of the nucleotides of Taenia mtDNAs and sliding window analysis suggests numerous alternative gene regions are likely to capture greater nucleotide variation than those currently pursued as molecular markers. New PCR primers developed from a comparative mitogenomic analysis of Taenia species, extend the use of mitochondrial markers for molecular ecology, population genetics and diagnostics. PMID:20649981
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses
Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj
2016-01-01
Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.
Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K
2016-08-02
Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Marchi, Rita; Brennan, Stephen; Meyer, Michael; Rojas, Héctor; Kanzler, Daniela; De Agrela, Marisela; Ruiz-Saez, Arlette
2013-03-01
Routine coagulation tests on a 14year-old male with frequent epistaxis showed a prolonged thrombin time together with diminished functional (162mg/dl) and gravimetric (122mg/dl) fibrinogen concentrations. His father showed similar aberrant results and sequencing of the three fibrinogen genes revealed a novel heterozygous nonsense mutation in the FGB gene c.1105C>T, which converts the codon for residue Bβ 339Q to stop, causing deletion of Bβ chain residues 339-461. Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and RP-HPLC (reverse-phase high-pressure liquid chromatography) of purified fibrinogen showed only normal Aα, Bβ, and γ chains, indicating that molecules with the truncated 37,990Da β chain were not secreted into plasma. Functional analysis showed impaired fibrin polymerization, fibrin porosity, and elasticity compared to controls. By laser scanning confocal microscopy the patient's fibers were slightly thinner than normal. Electrospray ionization mass spectrometry (ESI MS) presented normal sialylation of the oligosaccharide chains, and liver function tests showed no evidence of liver dysfunction that might explain the functional abnormalities. Copyright © 2012 Elsevier Inc. All rights reserved.
Representation mutations from standard genetic codes
NASA Astrophysics Data System (ADS)
Aisah, I.; Suyudi, M.; Carnia, E.; Suhendi; Supriatna, A. K.
2018-03-01
Graph is widely used in everyday life especially to describe model problem and describe it concretely and clearly. In addition graph is also used to facilitate solve various kinds of problems that are difficult to be solved by calculation. In Biology, graph can be used to describe the process of protein synthesis in DNA. Protein has an important role for DNA (deoxyribonucleic acid) or RNA (ribonucleic acid). Proteins are composed of amino acids. In this study, amino acids are related to genetics, especially the genetic code. The genetic code is also known as the triplet or codon code which is a three-letter arrangement of DNA nitrogen base. The bases are adenine (A), thymine (T), guanine (G) and cytosine (C). While on RNA thymine (T) is replaced with Urasil (U). The set of all Nitrogen bases in RNA is denoted by N = {C U, A, G}. This codon works at the time of protein synthesis inside the cell. This codon also encodes the stop signal as a sign of the stop of protein synthesis process. This paper will examine the process of protein synthesis through mathematical studies and present it in three-dimensional space or graph. The study begins by analysing the set of all codons denoted by NNN such that to obtain geometric representations. At this stage there is a matching between the sets of all nitrogen bases N with Z 2 × Z 2; C=(\\overline{0},\\overline{0}),{{U}}=(\\overline{0},\\overline{1}),{{A}}=(\\overline{1},\\overline{0}),{{G}}=(\\overline{1},\\overline{1}). By matching the algebraic structure will be obtained such as group, group Klein-4,Quotien group etc. With the help of Geogebra software, the set of all codons denoted by NNN can be presented in a three-dimensional space as a multicube NNN and also can be represented as a graph, so that can easily see relationship between the codon.
Domenice, S; Latronico, A C; Brito, V N; Arnhold, I J; Kok, F; Mendonca, B B
2001-09-01
Primary adrenal insufficiency is a rare condition in pediatric age, and its association with precocious sexual development is very uncommon. We report a 2-yr-old Brazilian boy with DAX1 gene mutation whose first clinical manifestation was isosexual gonadotropin-independent precocious puberty. He presented with pubic hair, enlarged penis and testes, and advanced bone age. T levels were elevated, whereas basal and GnRH-stimulated LH levels were compatible with a prepubertal pattern. Chronic GnRH agonist therapy did not reduce T levels, supporting the diagnosis of gonadotropin-independent precocious puberty. Testotoxicosis was ruled out after normal sequencing of exon 11 of the LH receptor gene. At age 3 yr he developed clinical and hormonal features of severe primary adrenal insufficiency. The entire coding region of the DAX1 gene was analyzed through direct sequencing. A nucleotide G insertion between nucleotides 430 and 431 in exon 1, resulting in a novel frameshift mutation and a premature stop codon at position 71 of DAX-1, was identified. Surprisingly, steroid replacement therapy induced a clear decrease in testicular size and T levels to the prepubertal range. These findings suggest that chronic excessive ACTH levels resulting from adrenal insufficiency may stimulate Leydig cells and lead to gonadotropin-independent precocious puberty in some boys with DAX1 gene mutations.
Brimacombe, M.; Hazbon, M.; Motiwala, A. S.; Alland, D.
2007-01-01
A single-nucleotide polymorphism-based cluster grouping (SCG) classification system for Mycobacterium tuberculosis was used to examine antibiotic resistance type and resistance mutations in relationship to specific evolutionary lineages. Drug resistance and resistance mutations were seen across all SCGs. SCG-2 had higher proportions of katG codon 315 mutations and resistance to four drugs. PMID:17846140
Zhao, Yongzhong; Epstein, Richard J
2013-01-01
Methylation-prone CpG dinucleotides are strongly conserved in the germline, yet are also predisposed to somatic mutation. Here we quantify the relationship between germline codon mutability and somatic carcinogenesis by comparing usage of the nonsense-prone CGA (→TGA) codons in gene groups that differ in apoptotic function; to this end, suppressor genes were subclassified as either apoptotic (gatekeepers) or repair (caretakers). Mutations affecting CGA codons in sporadic tumors proved to be highly asymmetric. Moreover, nonsense mutations were 3-fold more likely to affect gatekeepers than caretakers. In addition, intragenic CGA clustering nonrandomly affected functionally critical regions of gatekeepers. We conclude that human gatekeeper suppressor genes are enriched for nonsense-prone codons, and submit that this germline vulnerability to tumors could reflect in utero selection for a methylation-dependent capability to short-circuit environmental insults that otherwise trigger apoptosis and fetal loss.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing
2011-05-01
The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
Schuster, W; Wissinger, B; Unseld, M; Brennicke, A
1990-01-01
A number of cytosines are altered to be recognized as uridines in transcripts of the nad3 locus in mitochondria of the higher plant Oenothera. Such nucleotide modifications can be found at 16 different sites within the nad3 coding region. Most of these alterations in the mRNA sequence change codon identities to specify amino acids better conserved in evolution. Individual cDNA clones differ in their degree of editing at five nucleotide positions, three of which are silent, while two lead to codon alterations specifying different amino acids. None of the cDNA clones analysed is maximally edited at all possible sites, suggesting slow processing or lowered stringency of editing at these nucleotides. Differentially edited transcripts could be editing intermediates or could code for differing polypeptides. Two edited nucleotides in an open reading frame located upstream of nad3 change two amino acids in the deduced polypeptide. Part of the well-conserved ribosomal protein gene rps12 also encoded downstream of nad3 in other plants, is lost in Oenothera mitochondria by recombination events. The functional rps12 protein must be imported from the cytoplasm since the deleted sequences of this gene are not found in the Oenothera mitochondrial genome. The pseudogene sequence is not edited at any nucleotide position. Images Fig. 3. Fig. 4. Fig. 7. PMID:1688531
Kim, Younghyun; Lee, Goeun; Jeon, Eunhyun; Sohn, Eun ju; Lee, Yongjik; Kang, Hyangju; Lee, Dong wook; Kim, Dae Heon; Hwang, Inhwan
2014-01-01
The nucleotide sequence around the translational initiation site is an important cis-acting element for post-transcriptional regulation. However, it has not been fully understood how the sequence context at the 5′-untranslated region (5′-UTR) affects the translational efficiency of individual mRNAs. In this study, we provide evidence that the 5′-UTRs of Arabidopsis genes showing a great difference in the nucleotide sequence vary greatly in translational efficiency with more than a 200-fold difference. Of the four types of nucleotides, the A residue was the most favourable nucleotide from positions −1 to −21 of the 5′-UTRs in Arabidopsis genes. In particular, the A residue in the 5′-UTR from positions −1 to −5 was required for a high-level translational efficiency. In contrast, the T residue in the 5′-UTR from positions −1 to −5 was the least favourable nucleotide in translational efficiency. Furthermore, the effect of the sequence context in the −1 to −21 region of the 5′-UTR was conserved in different plant species. Based on these observations, we propose that the sequence context immediately upstream of the AUG initiation codon plays a crucial role in determining the translational efficiency of plant genes. PMID:24084084
Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes
NASA Astrophysics Data System (ADS)
Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.
2012-02-01
Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bonaventure, J.; Lasselin, C.; Toutain, A.
1994-09-01
The Stickler syndrome is an arthro-ophthalmopathy which associates progressive myopia with vitreal degeneration and retinal detachment. Cleft palate, cranio-facial abnormalities, deafness and osteoarthritis are often associated symptoms. Genetic heterogeneity of this autosomal dominant disease was consistent with its large clinical variability. Linkage studies have provided evidence for cosegregation of the disease with COL2A1, the gene coding for type II collagen, in about 50% of the families. Four additional families are reported here. Linkage analyses by using a VNTR located in the 3{prime} region of the gene were achieved. In three families, positive lod scores were obtained with a cumulative maximalmore » value of 3.5 at a recombination fraction of 0. In one of these families, single strand conformation analysis of 25 exons disclosed a new mutation in exon 42. Codon for glutamic acid at position a1-803 was converted into a stop codon. The mutation was detected in DNA samples from all the affected members of the family but not in the unaffected. This result confirms that most of the Stickler syndromes linked to COL2A1 are due to premature stop codons. In a second family, an abnormal SSCP pattern of exon 34 was detected in all the affected individuals. The mutation is likely to correspond to a splicing defect in the acceptor site of intron 33. In one family the disease did not segregate with the COL2A1 locus. Further linkage studies with intragenic dimorphic sites in the COL10A1 gene and highly polymorphic markers close to the COL9A1 locus indicated that this disorder did not result from defects in these two genes.« less
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.
Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P
2017-07-01
Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Substitution rate and natural selection in parvovirus B19
Stamenković, Gorana G.; Ćirković, Valentina S.; Šiljić, Marina M.; Blagojević, Jelena V.; Knežević, Aleksandra M.; Joksić, Ivana D.; Stanojević, Maja P.
2016-01-01
The aim of this study was to estimate substitution rate and imprints of natural selection on parvovirus B19 genotype 1. Studied datasets included 137 near complete coding B19 genomes (positions 665 to 4851) for phylogenetic and substitution rate analysis and 146 and 214 partial genomes for selection analyses in open reading frames ORF1 and ORF2, respectively, collected 1973–2012 and including 9 newly sequenced isolates from Serbia. Phylogenetic clustering assigned majority of studied isolates to G1A. Nucleotide substitution rate for total coding DNA was 1.03 (0.6–1.27) x 10−4 substitutions/site/year, with higher values for analyzed genome partitions. In spite of the highest evolutionary rate, VP2 codons were found to be under purifying selection with rare episodic positive selection, whereas codons under diversifying selection were found in the unique part of VP1, known to contain B19 immune epitopes important in persistent infection. Analyses of overlapping gene regions identified nucleotide positions under opposite selective pressure in different ORFs, suggesting complex evolutionary mechanisms of nucleotide changes in B19 viral genomes. PMID:27775080
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, L.; Sakkal-Alkaddour, S.; Chang, Ying T.
1996-01-01
We report a new compound heterozygous frameshift mutation in the type II 3{Beta}-hydroxysteroid dehydrogenase (3{beta}-HSD) gene in a Pakistanian female child with the salt-wasting form of 3{Beta}-HSD deficiency congenital adrenal hyperplasia. The etiology for her congenital adrenal hyperplasia was not defined. Although the family history suggested possible 3{beta}-HSd deficiency disorder, suppressed adrenal function caused by excess glucocorticoid therapy in this child at 7 yr of age did not allow hormonal diagnosis. To confirm 3{beta}-HSD deficiency, we sequenced the type II 3{beta}-HSD gene in the patient, her family, and the parents of her deceased paternal cousins. The type II 3{beta}-HSD genemore » region of a putative promotor, exons I, II, III, and IV, and exon-intron boundaries were amplified by PCR and sequenced in all subjects. The DNA sequence of the child revealed a single nucleotide deletion at codon 318 [ACA(Thr){r_arrow}AA] in exon IV in one allele, and two nucleotide deletions at codon 273 [AAA(Lys){r_arrow}A] in exon IV in the other allele. The remaining gene sequences were normal. The codon 318 mutation was found in one allele from the father, brother, and parents of the deceased paternal cousins. The codon 273 mutation was found in one allele of the mother and a sister. These findings confirmed inherited 3{beta}-HSD deficiency in the child caused by the compound heterozygous type II 3{beta}-HSD gene mutation. Both codons at codons 279 and 367, respectively, are predicted to result in an altered and truncated type II 3{beta}-HSD protein, thereby causing salt-wasting 3{beta}-HSD deficiency in the patient. 21 refs., 2 figs., 1 tab.« less
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.
Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas
2014-01-01
For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
Françoso, Elaine; Gomes, Fernando; Arias, Maria Cristina
2016-07-01
Nuclear mitochondrial DNA insertions (NUMTs) are mitochondrial DNA sequences that have been transferred into the nucleus and are recognized by the presence of indels and stop codons. Although NUMTs have been identified in a diverse range of species, their discovery was frequently accidental. Here, our initial goal was to develop and standardize a simple method for isolating NUMTs from the nuclear genome of a single bee. Subsequently, we tested our new protocol by determining whether the indels and stop codons of the cytochrome c oxidase subunit I (COI) sequence of Melipona flavolineata are of nuclear origin. The new protocol successfully demonstrated the presence of a COI NUMT. In addition to NUMT investigations, the protocol described here will also be very useful for studying mitochondrial mutations related to diseases and for sequencing complete mitochondrial genomes with high read coverage by Next-Generation technology.
Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C
1996-06-15
We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus
Kumar, Chandra Shekhar; Kumar, Sachin
2014-01-01
Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
Lozier, Jay N; Kloos, Mark T; Merricks, Elizabeth P; Lemoine, Nathaly; Whitford, Margaret H; Raymer, Robin A; Bellinger, Dwight A; Nichols, Timothy C
2016-01-01
Animals with hemophilia are models for gene therapy, factor replacement, and inhibitor development in humans. We have actively sought dogs with severe hemophilia A that have novel factor VIII mutations unlike the previously described factor VIII intron 22 inversion. A male Old English Sheepdog with recurrent soft-tissue hemorrhage and hemarthrosis was diagnosed with severe hemophilia A (factor VIII activity less than 1% of normal). We purified genomic DNA from this dog and ruled out the common intron 22 inversion; we then sequenced all 26 exons. Comparing the results with the normal canine factor VIII sequence revealed a C→T transition in exon 12 of the factor VIII gene that created a premature stop codon at amino acid 577 in the A2 domain of the protein. In addition, 2 previously described polymorphisms that do not cause hemophilia were present at amino acids 909 and 1184. The hemophilia mutation creates a new TaqI site that facilitates rapid genotyping of affected offspring by PCR and restriction endonuclease analyses. This mutation is analogous to the previously described human factor VIII mutation at Arg583, which likewise is a CpG dinucleotide transition causing a premature stop codon in exon 12. Thus far, despite extensive treatment with factor VIII, this dog has not developed neutralizing antibodies (‘inhibitors’) to the protein. This novel mutation in a dog gives rise to severe hemophilia A analogous to a mutation seen in humans. This model will be useful for studies of the treatment of hemophilia. PMID:27780008
Numerical classification of coding sequences
NASA Technical Reports Server (NTRS)
Collins, D. W.; Liu, C. C.; Jukes, T. H.
1992-01-01
DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9 ... (TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
Novel BRCA1 splice-site mutation in ovarian cancer patients of Slavic origin.
Krivokuca, Ana; Dragos, Vita Setrajcic; Stamatovic, Ljiljana; Blatnik, Ana; Boljevic, Ivana; Stegel, Vida; Rakobradovic, Jelena; Skerl, Petra; Jovandic, Stevo; Krajc, Mateja; Magic, Mirjana Brankovic; Novakovic, Srdjan
2018-04-01
Mutations in breast cancer susceptibility gene 1 (BRCA1) lead to defects in a number of cellular pathways including DNA damage repair and transcriptional regulation, resulting in the elevated genome instability and predisposing to breast and ovarian cancers. We report a novel mutation LRG_292t1:c.4356delA,p.(Ala1453Glnfs*3) in the 12th exon of BRCA1, in the splice site region near the donor site of intron 12. It is a frameshift mutation with the termination codon generated on the third amino acid position from the site of deletion. Human Splice Finder 3.0 and MutationTaster have assessed this variation as disease causing, based on the alteration of splicing, creation of premature stop codon and other potential alterations initiated by nucleotide deletion. Among the most important alterations are frameshift and splice site changes (score of the newly created donor splice site: 0.82). c.4356delA was associated with two ovarian cancer cases in two families of Slavic origin. It was detected by next generation sequencing, and confirmed with Sanger sequencing in both cases. Because of the fact that it changes the reading frame of the protein, novel mutation c.4356delA p.(Ala1453Glnfs*3) in BRCA1 gene might be of clinical significance for hereditary ovarian cancer. Further functional as well as segregation analyses within the families are necessary for appropriate clinical classification of this variant. Since it has been detected in two ovarian cancer patients of Slavic origin, it is worth investigating founder effect of this mutation in Slavic populations.
Mutation of the PAX6 gene in a sporadic patient with atypical aniridia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, D.; Li, Y.; Traboulsi, E.I.
1994-09-01
A 28 year-old man presented with poor vision since childhood and gradual further decline of several years duration. His visual acuity measures 20/200 OD with -11.50 + 0.50 x 150 and 20/100 OS with -12.25 + 0.25 x 35. He had a fine nystagmus. His visual fields were full. There was a circumferential pannus with areas of corneal stromal opacification. The iris was hypoplastic with atypical colobomatous defects. The lenses had scattered cortical opacities. The intraocular pressures were normal. The optic nerves had cup disk ratios of 0.6 OU. The family history was negative for similar defects. A diagnosis ofmore » aniridia was made and blood was drawn for analysis of the PAX6 gene. PCR amplification of exon 5 showed heterozygous fragments with one allele being larger than normal. Direct DNA sequencing of the individual heterozygous allele showed a 41 base pair insertion at nucleotide 483 in exon 5 of the paired domain. This frameshift mutation changed codon 71 to a stop codon. The diagnosis of aniridia was confirmed in this atypical patient, who will need to be monitored for his high risk of glaucoma. The risk of developing Wilms` tumor in patients with mutations within the aniridia gene is presumably negligible since the neighboring Wilms` tumor gene is unaffected. The identification of intragenic mutations of the PAX6 gene in patients with sporadic aniridia modifies the management of such patients because of recognition of the increased risk of glaucoma and by reducing the necessity for frequent monitoring for the presence of Wilms` tumor.« less
Lapteva, Y. S.; Zolova, O. E.; Shlyapnikov, M. G.; Tsfasman, I. M.; Muranova, T. A.; Stepnaya, O. A.; Kulaev, I. S.
2012-01-01
Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5. PMID:22865082
Lapteva, Y S; Zolova, O E; Shlyapnikov, M G; Tsfasman, I M; Muranova, T A; Stepnaya, O A; Kulaev, I S; Granovsky, I E
2012-10-01
Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5.
A novel nonsense mutation in CRYBB1 associated with autosomal dominant congenital cataract
Yang, Juhua; Zhu, Yihua; Gu, Feng; He, Xiang; Cao, Zongfu; Li, Xuexi; Tong, Yi
2008-01-01
Purpose To identify the molecular defect underlying an autosomal dominant congenital nuclear cataract in a Chinese family. Methods Twenty-two members of a three-generation pedigree were recruited, clinical examinations were performed, and genomic DNA was extracted from peripheral blood leukocytes. All members were genotyped with polymorphic microsatellite markers adjacent to each of the known cataract-related genes. Linkage analysis was performed after genotyping. Candidate genes were screened for mutation using direct sequencing. Individuals were screened for presence of a mutation by restriction fragment length polymorphism (RFLP) analysis. Results Linkage analysis identified a maximum LOD score of 3.31 (recombination fraction [θ]=0.0) with marker D22S1167 on chromosome 22, which flanks the β-crystallin gene cluster (CRYBB3, CRYBB2, CRYBB1, and CRYBA4). Sequencing the coding regions and the flanking intronic sequences of these four candidate genes identified a novel, heterozygous C→T transition in exon 6 of CRYBB1 in the affected individuals of the family. This single nucleotide change introduced a novel BfaI site and was predicted to result in a nonsense mutation at codon 223 that changed a phylogenetically conserved amino acid to a stop codon (p.Q223X). RFLP analysis confirmed that this mutation co-segregated with the disease phenotype in all available family members and was not found in 100 normal unrelated individuals from the same ethnic background. Conclusions This study has identified a novel nonsense mutation in CRYBB1 (p.Q223X) associated with autosomal dominant congenital nuclear cataract. PMID:18432316
Regulatory Role of N6 -methyladenosine (m6 A) Methylation in RNA Processing and Human Diseases.
Wei, Wenqiang; Ji, Xinying; Guo, Xiangqian; Ji, Shaoping
2017-09-01
N 6 -methyladenosine (m 6 A) modification is an abundant and conservative RNA modification in bacterial and eukaryotic cells. m 6 A modification mainly occurs in the 3' untranslated regions (UTRs) and near the stop codons of mRNA. Diverse strategies have been developed for identifying m 6 A sites in single nucleotide resolution. Dynamic regulation of m 6 A is found in metabolism, embryogenesis, and developmental processes, indicating a possible epigenetic regulation role along RNA processing and exerting biological functions. It has been known that m 6 A editing involves in nuclear RNA export, mRNA degradation, protein translation, and RNA splicing. Deficiency of m 6 A modification will lead to kinds of diseases, such as obesity, cancer, type 2 diabetes mellitus (T2DM), infertility, and developmental arrest. Some specific inhibitors against methyltransferase and demethylase have been developed to selectively regulate m 6 A modification, which may be advantageous for treatment of m 6 A related diseases. J. Cell. Biochem. 118: 2534-2543, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Escherichia coli ArgR mutants defective in cer/Xer recombination, but not in DNA binding.
Sénéchal, Hélène; Delesques, Jérémy; Szatmari, George
2010-04-01
The Escherichia coli arginine repressor (ArgR) is an L-arginine-dependent DNA-binding protein that controls the expression of the arginine biosynthetic genes and is required as an accessory factor for Xer site-specific recombination at cer and related recombination sites in plasmids. We used the technique of pentapeptide scanning mutagenesis to isolate a series of ArgR mutants that were considerably reduced in cer recombination, but were still able to repress an argA::lacZ fusion. DNA sequence analysis showed that all of the mutants mapped to the same nucleotide, resulting in a five amino acid insertion between residues 149 and 150 of ArgR, corresponding to the end of the alpha6 helix. A truncated ArgR containing a stop codon at residue 150 displayed the same phenotype as the protein with the five amino acid insertion, and both mutants displayed sequence-specific DNA-binding activity that was L-arginine dependent. These results show that the C-terminus of ArgR is more important in cer/Xer site-specific recombination than in DNA binding.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nishimuri, Gen; Fukushima, Yoshimitsu; Ohashi, Hirofumi
1995-11-20
The recent discovery of mutations in the FGFR-3 (fibroblast growth factor receptor-3) gene (FGFR3) as the cause of achondroplasia has provided new insight into understanding genetic diseases. It was surprising from the viewpoint of molecular genetics that most patients with achondroplasia showed the same mutation at nucleotide 1138, leading to a single amino acid substitution from glycine to arginine at codon 380 (Gly380Arg). All 39 patients examined by two groups had the Gly380Arg; 38 patients and the other demonstrated a G to A and a G to C transition at nucleotide 1138, respectively. Subsequently another group disclosed a G tomore » A transition at the same nucleotide 1138 in 21/23 patients of diverse ethnic origin, although mutations were not identified in two patients. To date, a total of 193 patients with the mutation of the G380Arg have been reported; a single patient with another mutation resulting in a substitution from glycine to cysteine at codon 375 (Gly375Cys) has been described. The presence of this common mutation is consistent with the clinical fact that achondroplastic individuals show less phenotypic variability than is unusual for autosomal dominant diseases. We encountered a Japanese boy with the Gly375Cys. His mother with achondroplasia has the same mutation. The molecular investigation of these patients was reported elsewhere. Here we report the clinical and radiological findings in this boy who demonstrated some atypical manifestations from those of typical achondroplasia. 8 refs., 1 fig.« less
Family acholeplasmataceae (including phytoplasmas)
USDA-ARS?s Scientific Manuscript database
The family Acholeplasmataceae was originally established to accommodate the genus Acholeplasma, comprising the mollicutes that could be cultivated without the supplement of cholesterol and that use UGA as a stop codon instead of coding for tryptophan. It was later shown that the phytoplasmas, a larg...
Neutral changes during divergent evolution of hemoglobins
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1978-01-01
A comparison of the mRNAs for rabbit and human beta-hemoglobins shows that synonymous changes in codons have accumulated three times as rapidly as nucleotide replacements that produced changes in amino acids. This agrees with predictions based on the so-called neutral theory. In addition, seven codon changes that appear to be single-base changes (according to maximum parsimony) are actually two-base changes. This indicates that the construction of primordial sequences is of limited significance when based on inferences that assume minimum base changes for amino acid replacements.
Chaube, R; Rawat, A; Inbaraj, R M; Bobe, J; Guiguen, Y; Fostier, A; Joy, K P
2017-05-15
Catechol-O-methyltransferase (COMT) is involved in the methylation and inactivation of endogenous and xenobiotic catechol compounds, and serves as a common biochemical link in the catecholamine and catecholestrogen metabolism. Studies on cloning, sequencing and function characterization comt gene in lower vertebrates like fish are fewer. In the present study, a full-length comt cDNA of 1442bp with an open-reading frame (ORF) of 792bp, and start codon (ATG) at nucleotide 162 and stop codon (TAG) at nucleotide 953 was isolated and characterized in the stinging catfish Heteropneustes fossilis (accession No. KT597925). The ORF codes for a protein of 263 amino acid residues, which is also validated by the catfish transcriptome data analysis. The catfish Comt shared conserved putative structural regions important for S-adenosyl methionine (AdoMet)- and catechol-binding, transmembrane regions, two glycosylation sites (N-65 and N-91) at the N-terminus and two phosphorylation sites (Ser-235 and Thr-240) at the C-terminus. The gene was expressed in all tissues examined and the expression showed significant sex dimorphic distribution with high levels in females. The transcript was abundant in the liver, brain and gonads and low in muscles. The transcripts showed significant seasonal variations in the brain and ovary, increased progressively to the peak levels in spawning phase and then declined. The brain and ovarian comt mRNA levels showed periovulatory changes after in vivo and in vitro human chorionic gonadotropin (hCG) treatments with high fold increases at 16 and 24h in the brain and at 16h in the ovary. The catecholestrogen 2-hydroxyE 2 up regulated ovarian comt expression in vitro with the highest fold increase at 16h. The mRNA and protein was localized in the follicular layer of the vitellogenic follicles and in the cytoplasm of primary follicles. The data were discussed in relation to catecholamine and catecholestrogen-mediated functions in the brain and ovary of the stinging catfish. Copyright © 2016 Elsevier Inc. All rights reserved.
Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.
2015-01-01
ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
The complete mitochondrial genome of the stomatopod crustacean Squilla mantis
Cook, Charles E
2005-01-01
Background Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum. Results I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness. Conclusion The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon. PMID:16091132
Hansen, Tina V A; Thamsborg, Stig M; Olsen, Annette; Prichard, Roger K; Nejsum, Peter
2013-08-12
The whipworm Trichuris trichiura has been estimated to infect 604 - 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 - 99%. Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species.
2013-01-01
Background The whipworm Trichuris trichiura has been estimated to infect 604 – 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Methods Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. Results No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 – 99%. Conclusions Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species. PMID:23938038
CCC CGA is a weak translational recoding site in Escherichia coli.
Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel
2004-12-08
Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
Inoue, Takahiko; Yuo, Takahisa; Ohta, Takeshi; Hitomi, Eriko; Ichitani, Katsuyuki; Kawase, Makoto; Taketa, Shin; Fukunaga, Kenji
2015-08-01
Foxtail millet shows variation in positive phenol color reaction (Phr) and negative Phr in grains, but predominant accessions of this crop are negative reaction type, and the molecular genetic basis of the Phr reaction remains unresolved. In this article, we isolated polyphenol oxidase (PPO) gene responsible for Phr using genome sequence information and investigated molecular genetic basis of negative Phr and crop evolution of foxtail millet. First of all, we searched for PPO gene homologs in a foxtail millet genome database using a rice PPO gene as a query and successfully found three copies of the PPO gene. One of the PPO gene homologs on chromosome 7 showed the highest similarity with PPO genes expressed in hulls (grains) of other cereal species including rice, wheat, and barley and was designated as Si7PPO. Phr phenotypes and Si7PPO genotypes completely co-segregated in a segregating population. We also analyzed the genetic variation conferring negative Phr reaction. Of 480 accessions of the landraces investigated, 87 (18.1 %) showed positive Phr and 393 (81.9 %) showed negative Phr. In the 393 Phr negative accessions, three types of loss-of-function Si7PPO gene were predominant and independently found in various locations. One of them has an SNP in exon 1 resulting in a premature stop codon and was designated as stop codon type, another has an insertion of a transposon (Si7PPO-TE1) in intron 2 and was designated as TE1-insertion type, and the other has a 6-bp duplication in exon 3 resulting in the duplication of 2 amino acids and was designated as 6-bp duplication type. As a rare variant of the stop codon type, one accession additionally has an insertion of a transposon, Si7PPO-TE2, in intron 2 and was designated as "stop codon +TE2 insertion type". The geographical distribution of accessions with positive Phr and those with three major types of negative Phr was also investigated. Accessions with positive Phr were found in subtropical and tropical regions at frequencies of ca. 25-67 % and those with negative Phr were broadly found in Europe and Asia. The stop codon type was found in 285 accessions and was broadly distributed in Europe and Asia, whereas the TE-1 insertion type was found in 99 accessions from Europe and Asia but was not found in India. The 6-bp duplication type was found in only 8 accessions from Nansei Islands (Okinawa Prefecture) of Japan. We also analyzed Phr in the wild ancestor and concluded that the negative Phr type was likely to have originated after domestication of foxtail millet. It was also implied that negative Phr of foxtail millet arose by multiple independent loss of function of PPO gene through dispersal because of some advantages under some environmental conditions and human selection as in rice and barley.
Berro, Mariano; Mayor, Neema P.; Maldonado-Torres, Hazael; Cooke, Louise; Kusminsky, Gustavo; Marsh, Steven G.E.; Madrigal, J. Alejandro; Shaw, Bronwen E.
2010-01-01
Background Many genetic factors play major roles in the outcome of hematopoietic stem cell transplants from unrelated donors. Transforming growth factor β1 is a member of a highly pleiotrophic family of growth factors involved in the regulation of numerous immunomodulatory processes. Design and Methods We investigated the impact of single nucleotide polymorphisms at codons 10 and 25 of TGFB1, the gene encoding for transforming growth factor β1, on outcomes in 427 mye-loablative-conditioned transplanted patients. In addition, transforming growth factor β1 plasma levels were measured in 263 patients and 327 donors. Results Patients homozygous for the single nucleotide polymorphism at codon 10 had increased non-relapse mortality (at 3 years: 46.8% versus 29.4%, P=0.014) and reduced overall survival (at 5 years 29.3% versus 42.2%, P=0.013); the differences remained statistically significant in multivariate analysis. Donor genotype alone had no impact, although multiple single nucleotide polymorphisms within the pair were significantly associated with higher non-relapse mortality (at 3 years: 44% versus 29%, P=0.021) and decreased overall survival (at 5 years: 33.8% versus 41.9%, P=0.033). In the 10/10 HLA matched transplants (n=280), recipients of non-wild type grafts tended to have a higher incidence of acute graft-versus-host disease grades II-IV (P=0.052). In multivariate analysis, when analyzed with patients’ genotype, the incidences of both overall and grades II-IV acute graft-versus-host disease were increased (P=0.025 and P=0.009, respectively) in non-wild-type pairs. Conclusions We conclude that increasing numbers of single nucleotide polymorphisms in codon 10 of TGFB1 in patients and donors are associated with a worse outcome following hematopoietic stem cell transplantation from unrelated donors. PMID:19713222
Properties and determinants of codon decoding time distributions
2014-01-01
Background Codon decoding time is a fundamental property of mRNA translation believed to affect the abundance, function, and properties of proteins. Recently, a novel experimental technology--ribosome profiling--was developed to measure the density, and thus the speed, of ribosomes at codon resolution. Specifically, this method is based on next-generation sequencing, which theoretically can provide footprint counts that correspond to the probability of observing a ribosome in this position for each nucleotide in each transcript. Results In this study, we report for the first time various novel properties of the distribution of codon footprint counts in five organisms, based on large-scale analysis of ribosomal profiling data. We show that codons have distinctive footprint count distributions. These tend to be preserved along the inner part of the ORF, but differ at the 5' and 3' ends of the ORF, suggesting that the translation-elongation stage actually includes three biophysical sub-steps. In addition, we study various basic properties of the codon footprint count distributions and show that some of them correlate with the abundance of the tRNA molecule types recognizing them. Conclusions Our approach emphasizes the advantages of analyzing ribosome profiling and similar types of data via a comparative genomic codon-distribution-centric view. Thus, our methods can be used in future studies related to translation and even transcription elongation. PMID:25572668
Analysis of base and codon usage by rubella virus.
Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K
2012-05-01
Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.
Eriani, G; Dirheimer, G; Gangloff, J
1989-01-01
The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
On origin of genetic code and tRNA before translation
2011-01-01
Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520
Energy efficiency trade-offs drive nucleotide usage in transcribed regions
Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.
2016-01-01
Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
A method for multi-codon scanning mutagenesis of proteins based on asymmetric transposons.
Liu, Jia; Cropp, T Ashton
2012-02-01
Random mutagenesis followed by selection or screening is a commonly used strategy to improve protein function. Despite many available methods for random mutagenesis, nearly all generate mutations at the nucleotide level. An ideal mutagenesis method would allow for the generation of 'codon mutations' to change protein sequence with defined or mixed amino acids of choice. Herein we report a method that allows for mutations of one, two or three consecutive codons. Key to this method is the development of a Mu transposon variant with asymmetric terminal sequences. As a demonstration of the method, we performed multi-codon scanning on the gene encoding superfolder GFP (sfGFP). Characterization of 50 randomly chosen clones from each library showed that more than 40% of the mutants in these three libraries contained seamless, in-frame mutations with low site preference. By screening only 500 colonies from each library, we successfully identified several spectra-shift mutations, including a S205D variant that was found to bear a single excitation peak in the UV region.
Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang
2016-01-01
The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan
2016-01-01
The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for the nad2 started with a typical ATN codon. The usual termination codons (TAA) and incomplete stop codons (T) were found from 13 protein-coding genes. All tRNA genes were folded into the typical cloverleaf secondary structure, except trnS(AGN) lacking the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to the G. orientalis and G. pluvialis, there was no phenomenon of "DK rearrangement" which has been widely reported in Caelifera.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).
Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai
2014-12-01
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi
2010-09-01
In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Binding constants of phenylalanine for the four mononucleotides
NASA Technical Reports Server (NTRS)
Khaled, M. A.; Mullins, D. W., Jr.; Lacey, J. C., Jr.
1984-01-01
Earlier work has shown that several properties of amino acids correlate directly with properties of their anticodonic nucleotides. Furthermore, in precipitation studies with thermal proteinoids and homopolyribonucleotides, an anticodonic preference was displayed between Lys-rich, Pro-rich and Gly-rich thermal proteinoids and their anticodonic polyribonucleotides. However, Phe-rich thermal proteinoid displayed a preference for its codonic nucleotide, poly U. This inconsistency seemed to be explained by a folding in of the hydrophobic residues of Phe causing the proteinoid to appear more hydrophilic. The present work used nuclear magnetic resonance techniques to resolve a limited question: to which of the four nucleotides does Phe bind most strongly? The results show quite clearly that Phe binds most strongly to its anticodonic nucleotide, AMP.
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H
2004-08-16
This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F
2007-01-01
Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
The augmentation algorithm and molecular phylogenetic trees
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
Moore's (1977) augmentation procedure is discussed, and it is concluded that the procedure is valid for obtaining estimates of the total number of fixed nucleotide substitutions both theoretically and in practice, for both simulated and real data, and in agreement, for experimentally dense data sets, with stochastic estimates of the divergence, provided the restrictions on codon mutability resulting from natural selection are explicitly allowed for. Tateno and Nei's (1978) critique that the augmentation procedure has a systematic bias toward overestimation of the total number of nucleotide replacements is disputed, and a data analysis suggests that ancestral sequences inferred by the method of parsimony contain a large number of incorrectly assigned nucleotides.
[Association between polymorphisms of XPD gene and susceptibility to chronic benzene poisoning].
Huang, Hui-long; Xu, Jian-ning; Wang, Quan-kai; Wang, Ya-wen; Yang, Min; Chen, Yan; Li, Gui-lan
2006-07-01
To explore the relationship between genetic polymorphisms of XPD gene and susceptibility to chronic benzene poisoning. A case control study was conducted. Eighty patients diagnosed with chronic benzene poisoning and 62 workers occupationally exposed to benzene who were engaged in the same working time and job title as patients were investigated. PCR-RFLP was used for detecting the single nucleotide polymorphisms (SNPs) on codon156, codon312 and codon751 of XPD gene. There was a 2.903 times (95% CI: 1.054 - 7.959, P = 0.039 2) increased risk of chronic benzene poisoning in the subjects carrying XPD 751Gln variant allele compared with those carrying XPD 751Lys/Lys genotype, after adjusted for sex, length of service, smoking and drinking status. The subjects with XPD 751Gln variant allele are more susceptive to benzene.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chou, J.; Roizman, B.; Kern, E.R.
1990-11-30
The gene designated {gamma}{sub 1}34.5 maps in the inverted repeats flanking the long unique sequence of herpes simplex virus-1 (HSV-1) DNA, and therefore it is present in two copies per genome. This gene is not essential for viral growth in cell culture. Four recombinant viruses were genetically engineered to test the function of this gene. These were (i) a virus from which both copies of the gene were deleted, (ii) a virus containing a stop codon in both copies of the gene, (iii) a virus containing after the first codon an insert encoding a 16-amino acid epitope known to reactmore » with a specific monoclonal antibody, and (iv) a virus in which the deleted sequences were restored. The viruses from which the gene was deleted or which carried stop codons were avirulent on intracerebral inoculation of mice. The virus with the gene tagged by the sequence encoding the epitope was moderately virulent, whereas the restored virus reacquired the phenotype of the parent virus. Significant amounts of virus were recovered only from brains of animals inoculated with virulent viruses. Inasmuch as the product of the {gamma}{sub 1}34.5 gene extended the host range of the virus by enabling it to replicate and destroy brain cells, it is a viral neurovirulence factor.« less
Salvatori, Francesca; Breveglieri, Giulia; Zuccato, Cristina; Finotti, Alessia; Bianchi, Nicoletta; Borgatti, Monica; Feriotto, Giordana; Destro, Federica; Canella, Alessandro; Brognara, Eleonora; Lampronti, Ilaria; Breda, Laura; Rivella, Stefano; Gambari, Roberto
2013-01-01
In several types of thalassemia (including β039-thalassemia), stop codon mutations lead to premature translation termination and to mRNA destabilization through nonsense-mediated decay. Drugs (for instance aminoglycosides) can be designed to suppress premature termination, inducing a ribosomal readthrough. These findings have introduced new hopes for the development of a pharmacologic approach to the cure of this disease. However, the effects of aminoglycosides on globin mRNA carrying β-thalassemia stop mutations have not yet been investigated. In this study, we have used a lentiviral construct containing the β039- thalassemia globin gene under control of the β-globin promoter and a LCR cassette. We demonstrated by fluorescence-activated cell sorting (FACS) analysis the production of β-globin by K562 cell clones expressing the β039-thalassemia globin gene and treated with G418. More importantly, after FACS and high-performance liquid chromatography (HPLC) analyses, erythroid precursor cells from β039-thalassemia patients were demonstrated to be able to produce β-globin and adult hemoglobin after treatment with G418. This study strongly suggests that ribosomal readthrough should be considered a strategy for developing experimental strategies for the treatment of β0-thalassemia caused by stop codon mutations. PMID:19810011
Polymorphism of prion protein gene in Arctic fox (Vulpes lagopus).
Wan, Jiayu; Bai, Xue; Liu, Wensen; Xu, Jing; Xu, Ming; Gao, Hongwei
2009-07-01
Prion diseases are fatal neurodegenerative disorders of humans and certain other mammals. Prion protein gene (Prnp) is associated with susceptibility and species barrier to prion diseases. No natural and experimental prion diseases have been documented to date in Arctic fox. In the present study, coding region of Prnp from 135 Arctic foxes were cloned and screened for polymorphisms. Our results indicated that the Arctic fox Prnp open reading frame (ORF) contains 771 nucleotides encoding 257 amino acids. Four single nucleotide polymorphisms (SNPs) (G312C, A337G, C541T, and A723G) were identified. SNPs G312C and A723G produced silent mutations, but SNPs A337G and C541T resulted in a M-V change at codon 113 and R-C at codon 181, respectively. The Arctic fox Prnp amino acid sequence was similar to that of the dog (XM 542906). In short, this study provides preliminary information about genotypes of Prnp in Arctic fox.
Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.
Bae, Young-An
2017-04-01
Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC 12 and GC 3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC 3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
n-Nucleotide circular codes in graph theory.
Fimmel, Elena; Michel, Christian J; Strüngmann, Lutz
2016-03-13
The circular code theory proposes that genes are constituted of two trinucleotide codes: the classical genetic code with 61 trinucleotides for coding the 20 amino acids (except the three stop codons {TAA,TAG,TGA}) and a circular code based on 20 trinucleotides for retrieving, maintaining and synchronizing the reading frame. It relies on two main results: the identification of a maximal C(3) self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses (Michel 2015 J. Theor. Biol. 380, 156-177. (doi:10.1016/j.jtbi.2015.04.009); Arquès & Michel 1996 J. Theor. Biol. 182, 45-58. (doi:10.1006/jtbi.1996.0142)) and the finding of X circular code motifs in tRNAs and rRNAs, in particular in the ribosome decoding centre (Michel 2012 Comput. Biol. Chem. 37, 24-37. (doi:10.1016/j.compbiolchem.2011.10.002); El Soufi & Michel 2014 Comput. Biol. Chem. 52, 9-17. (doi:10.1016/j.compbiolchem.2014.08.001)). The univerally conserved nucleotides A1492 and A1493 and the conserved nucleotide G530 are included in X circular code motifs. Recently, dinucleotide circular codes were also investigated (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631); Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)). As the genetic motifs of different lengths are ubiquitous in genes and genomes, we introduce a new approach based on graph theory to study in full generality n-nucleotide circular codes X, i.e. of length 2 (dinucleotide), 3 (trinucleotide), 4 (tetranucleotide), etc. Indeed, we prove that an n-nucleotide code X is circular if and only if the corresponding graph [Formula: see text] is acyclic. Moreover, the maximal length of a path in [Formula: see text] corresponds to the window of nucleotides in a sequence for detecting the correct reading frame. Finally, the graph theory of tournaments is applied to the study of dinucleotide circular codes. It has full equivalence between the combinatorics theory (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631)) and the group theory (Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)) of dinucleotide circular codes while its mathematical approach is simpler. © 2016 The Author(s).
Garver, Kyle A.; Conway, Carla M.; Kurath, Gael
2006-01-01
A highly efficacious DNA vaccine against a fish rhabdovirus, infectious hematopoietic necrosis virus (IHNV), was mutated to introduce two stop codons to prevent glycoprotein translation while maintaining the plasmid DNA integrity and RNA transcription ability. The mutated plasmid vaccine, denoted pIHNw-G2stop, when injected intramuscularly into fish at high doses, lacked detectable glycoprotein expression in the injection site muscle, and did not provide protection against lethal virus challenge 7 days post-vaccination. These results suggest that the G-protein itself is required to stimulate the early protective antiviral response observed after vaccination with the nonmutated parental DNA vaccine.
Yom Din, S; Hurvitz, A; Goldberg, D; Jackson, K; Levavi-Sivan, B; Degani, G
2008-03-01
In this study, the GH and IGF-I of the Russian sturgeon (rs), Acipenser gueldenstaedtii, were cloned and sequenced, and their mRNA gene expression determined. In addition, to improve our understanding of the GH function, the expression of this hormone was assessed in young males and females. Moreover, IGF-I expression was quantified in young males and compared to that in older ones. The nucleotide sequence of the rsGH cDNA was 980 bp long and had an open reading frame of 642 bp, beginning with the first ATG codon at position 39 and ending with the stop codon at position 683. A putative polyadenylation signal, AATAAA, was recognized 42 bp upstream of the poly (A) tail. The position of the signal- peptide cleavage site was predicted to be at position 111, yielding a signal peptide of 24 amino-acids (aa) and a mature peptide of 190 aa. When the rsGH aa sequence was compared with other species, the highest degree of identity was found to be with mammalians (66-70% identity), followed by anguilliformes and amphibia (61%) and other fish (39-47%). The level of rsGH mRNA was discovered to be similar in pituitaries of females and males of 5 age groups (1, 2, 3, 4, and 5- yr-old). In females and males, the levels did not change dramatically during the first 5 yr of growth. The partial nucleotide sequence of the rsIGF-I was 445 bp long and had an open reading frame of 396 bp, beginning with the ATG codon at position 50. The position of the signal-peptide cleavage site was predicted to be at position 187, yielding a signal peptide of 44 aa. The highest level of IGF-I mRNA expression was recorded in the kidney of adult sturgeons. The IGF-I mRNA expression levels in the intestine, pituitary gland, and liver were not significantly different. Low levels of expression were found in the brain, heart, and muscle. In most tissues, there was no significant difference between mRNA levels of one and 5-yr-old fish. In conclusion, based on the GH-sequence analysis, A. gueldenstaedtii is genetically distant from other teleosts. The expression of the GH mRNA was similar in males and females, and its level remained constant during the first 5 yr of growth. While the IGF-I mRNA expression differed amongst various tissues, the level in each tissue was similar in 1 and 5-yr-old fish.
Mathew, Suneeth F; Crowe-McAuliffe, Caillan; Graves, Ryan; Cardno, Tony S; McKinney, Cushla; Poole, Elizabeth S; Tate, Warren P
2015-01-01
HIV-1 utilises -1 programmed ribosomal frameshifting to translate structural and enzymatic domains in a defined proportion required for replication. A slippery sequence, U UUU UUA, and a stem-loop are well-defined RNA features modulating -1 frameshifting in HIV-1. The GGG glycine codon immediately following the slippery sequence (the 'intercodon') contributes structurally to the start of the stem-loop but has no defined role in current models of the frameshift mechanism, as slippage is inferred to occur before the intercodon has reached the ribosomal decoding site. This GGG codon is highly conserved in natural isolates of HIV. When the natural intercodon was replaced with a stop codon two different decoding molecules-eRF1 protein or a cognate suppressor tRNA-were able to access and decode the intercodon prior to -1 frameshifting. This implies significant slippage occurs when the intercodon is in the (perhaps distorted) ribosomal A site. We accommodate the influence of the intercodon in a model of frame maintenance versus frameshifting in HIV-1.
Khrustalev, Vladislav Victorovich
2009-01-01
Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae).
Pan, Hong-Chun; Qian, Xiao-Cheng; Li, Ping; Li, Xiao-Fei; Wang, An-Tai
2014-02-01
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae) is a linear molecule of 16,189 bp in length, containing 13 protein-coding genes, small and large subunit ribosomal RNAs, methionine and tryptophan transfer RNAs, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mitochondrial DNA. The A + T content of the overall base composition of H-strand is 77.2% (T: 41.7%; C: 10.9%; A: 35.5%; and G: 11.9%). COI and ND1 genes begin with GTG as start codon, while other 11 protein-coding genes start with a typical ATG initiation codon. COII, ATP8, ATP6, COIII, ND5, ND6, ND3, ND1, ND4 and COI genes are terminated with TAA as stop codon, ND4L ends with TAG, ND2 ends with TA and Cyt b ends with T.
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele
2015-01-01
We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n = 15) and food, feed, animal, and environmental sources (n = 24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. PMID:25653407
Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele; Pongolini, Stefano
2015-04-01
We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n=15) and food, feed, animal, and environmental sources (n=24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Khrustalev, Vladislav Victorovich
2009-01-01
We showed that GC-content of nucleotide sequences coding for linear B-cell epitopes of herpes simplex virus type 1 (HSV1) glycoprotein B (gB) is higher than GC-content of sequences coding for epitope-free regions of this glycoprotein (G + C = 73 and 64%, respectively). Linear B-cell epitopes have been predicted in HSV1 gB by BepiPred algorithm ( www.cbs.dtu.dk/services/BepiPred ). Proline is an acrophilic amino acid residue (it is usually situated on the surface of protein globules, and so included in linear B-cell epitopes). Indeed, the level of proline is much higher in predicted epitopes of gB than in epitope-free regions (17.8% versus 1.8%). This amino acid is coded by GC-rich codons (CCX) that can be produced due to nucleotide substitutions caused by mutational GC-pressure. GC-pressure will also lead to disappearance of acrophobic phenylalanine, isoleucine, methionine and tyrosine coded by GC-poor codons. Results of our "in-silico directed mutagenesis" showed that single nonsynonymous substitutions in AT to GC direction in two long epitope-free regions of gB will cause formation of new linear epitopes or elongation of previously existing epitopes flanking these regions in 25% of 539 possible cases. The calculations of GC-content and amino acid content have been performed by CodonChanges algorithm ( www.barkovsky.hotmail.ru ).
Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto
2015-01-01
Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
Investigations with methanobacteria and with evolution of the genetic code
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1986-01-01
Mycoplasma capricolum was found by Osawa et al. to use UGA as the code of tryptophan and to contain 75% A + T in its DNA. This change could have been from evolutionary pressure to replace C + G by A + T. Numerous studies have been reported of evolution of proteins as measured by amino acid replacements that are observed when homologus proteins, such as hemoglobins from various vertebrates, are compared. These replacements result from nucleotide substitutions in amino acid codons in the corresponding genes. Simultaneously, silent nucleotide substitutions take place that can be studied when sequences of the genes are compared. These silent evolutionary changes take place mostly in third positions of codons. Two types of nucleotide substitutions are recognized: pyrimidine-pyrimidine and purine-purine interchanges (transitions) and pyriidine-purine interchanges (transversions). Silent transitions are favored when a corresponding transversion would produce an amino acid replacement. Conversely, silent transversions are favored by probability when transitions and transversions will both be silent. Extensive examples of these situations have been found in protein genes, and it is evident that transversions in silent positions predominate in family boxes in most of the examples studied. In associated research a streptomycete from cow manure was found to produce an extracellular enzyme capable of lysing the pseudomurein-contining methanogen Methanobacterium formicicum.
Li, Guohui; Hu, Zhaoyang; Guo, Xuli; Li, Guangtian; Tang, Qi; Wang, Peng; Chen, Keping; Yao, Qin
2013-06-01
Bombyx mori bidensovirus (BmBDV) VD1-ORF4 (open reading frame 4, ORF4) consists of 3,318 nucleotides, which codes for a predicted 1,105-amino acid protein containing a conserved DNA polymerase motif. However, its functions in viral propagation remain unknown. In the current study, the transcription of VD1-ORF4 was examined from 6 to 96 h postinfection (p.i.) by RT-PCR, 5'-RACE revealed the transcription initiation site of BmBDV ORF4 to be -16 nucleotides upstream from the start codon, and 3'-RACE revealed the transcription termination site of VD1-ORF4 to be +7 nucleotides downstream from termination codon. Three different proteins were examined in the extracts of BmBDV-infected silkworms midguts by Western blot using raised antibodies against VD1-ORF4 deduced amino acid, and a specific protein band about 53 kDa was further detected in purified virions using the same antibodies. Taken together, BmBDV VD1-ORF4 codes for three or more proteins during the viral life cycle, one of which is a 53 kDa protein and confirmed to be a component of BmBDV virion.
Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre
2018-05-03
Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
Aydin, A Fatih; Aydıngöz, İkbal Esen; Doğru-Abbasoğlu, Semra; Vural, Pervin; Uysal, Müjdat
2017-01-01
Oxidative stress and increased DNA damage have been implicated in the etiopathogenesis of vitiligo. Oxidative DNA damage is mainly repaired by the base excision repair (BER) pathway. We sought to determine whether polymorphisms in DNA repair genes may have a role in the pathogenesis of vitiligo. We conducted a study including 100 patients with vitiligo and age- and sex-matched 193 control subjects to examine the role of single-nucleotide polymorphisms of BER genes, human 8-oxoG DNA N-glycosylase 1 (codon 326), apurinic/apyrimidinic endonuclease 1 (APE1) (codon 148), and X-ray repair cross-complementing group 1 (codon 399) as risk factors for vitiligo. These polymorphisms were determined by quantitative real-time polymerase chain reaction and melting curve analysis. No significant association was observed between the variant alleles of studied genes and vitiligo. However, we showed that the presence of APE1 148Glu variant allele is associated with leukotrichia. This preliminary study suggests that APE1 (codon 148) polymorphism may play a role in vitiligo pathogenesis.
Butler, J S; Springer, M; Grunberg-Manago, M
1987-01-01
We previously showed that Escherichia coli translation initiation factor IF3 regulates the expression of its own gene infC at the translational level in vivo. Here we create two alterations in the infC gene and test their effects on translational autocontrol of infC expression in vivo by measuring beta-galactosidase activity expressed from infC-lacZ gene fusions under conditions of up to 4-fold derepression or 3-fold repression of infC expression. Replacement of the infC promoter with the trp promoter deletes 120 nucleotides of the infC mRNA 5' to the translation initiation site without affecting autogenous translational control. Mutation of the unusual AUU initiator codon of infC to the more common AUG initiator codon abolishes translation initiation factor IF3-dependent repression and derepression of infC expression in vivo. These results establish the AUU initiator codon of infC as an essential cis-acting element in autogenous translational control of translation initiation factor IF3 expression in vivo. PMID:2954162
Butler, J S; Springer, M; Grunberg-Manago, M
1987-06-01
We previously showed that Escherichia coli translation initiation factor IF3 regulates the expression of its own gene infC at the translational level in vivo. Here we create two alterations in the infC gene and test their effects on translational autocontrol of infC expression in vivo by measuring beta-galactosidase activity expressed from infC-lacZ gene fusions under conditions of up to 4-fold derepression or 3-fold repression of infC expression. Replacement of the infC promoter with the trp promoter deletes 120 nucleotides of the infC mRNA 5' to the translation initiation site without affecting autogenous translational control. Mutation of the unusual AUU initiator codon of infC to the more common AUG initiator codon abolishes translation initiation factor IF3-dependent repression and derepression of infC expression in vivo. These results establish the AUU initiator codon of infC as an essential cis-acting element in autogenous translational control of translation initiation factor IF3 expression in vivo.
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.
Swire, Jonathan; Judson, Olivia P; Burt, Austin
2005-01-01
Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
Rhnull syndrome: identification of a novel mutation in RHce.
Rosa, K A; Reid, M E; Lomas-Francis, C; Powell, V I; Costa, F F; Stinghen, S T; Watanabe, A M; Carboni, E K; Baldon, J P; Jucksch, M M F; Castilho, L
2005-11-01
The deficiency of Rh proteins on red blood cells (RBCs) from individuals of the Rh(null) amorph type are the result of homozygosity for a silent RHCE in cis with a deleted RHD. A novel mutation in RHce was identified in two Caucasian Brazilian girls with the amorph type of Rh(null) who were born to parents who were first cousins. RBCs from the Rh(null) sisters and from family members were analyzed by serology and flow cytometry with specific antibodies. Genomic DNA and transcripts were tested by polymerase chain reaction and sequence analysis. Rh(null) RBCs were nonreactive with anti-Rh and anti-LW. Molecular analyses showed a deletion of RHD and of one nucleotide (960/963; GGGG-->GGG) in exon 7 of the RHce. This deletion introduced a frameshift after Gly321, a new C-terminal sequence, and a premature stop codon, resulting in a shorter predicted protein with 357 amino acids. The detection of a unique RHce transcript indicated that the two sisters were homozygous, whereas the other family members were heterozygous for the mutation. A novel mutation resulting in the amorph Rh(null) with loss of Rh antigen expression is described.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cool, D.E.; Tonks, N.K.; Charbonneau, H.
1989-07-01
A human peripheral T-cell cDNA library was screened with two labeled synthetic oligonucleotides encoding regions of a human placenta protein-tyrosine-phosphatase. One positive clone was isolated and the nucleotide sequence was determined. It contained 1,305 base pairs of open reading frame followed by a TAA stop codon and 978 base pairs of 3{prime} untranslated end, although a poly(A){sup +} tail was not found. An initiator methionine residue was predicted at position 61, which would result in a protein of 415 amino acid residues. This was supported by the synthesis of a M{sub r} 48,000 protein in an in vitro reticulocyte lysatemore » translation system using RNA transcribed from the cloned cDNA and T7 RNA polymerase. The deduced amino acid sequence was compared to other known proteins revealing 65% identity to the low M{sub r} PTPase 1B isolated from placenta. In view of the high degree of similarity, the T-cell cDNA likely encodes a newly discovered protein-tyrosine-phosphatase, thus expanding this family of genes.« less
Craig, Scott; Thu, Hlaing Myat; Lowry, Kym; Wang, Xiao-fang; Holmes, Edward C.; Aaskov, John
2003-01-01
Envelope (E) protein genes sampled from populations of dengue 2 (DEN-2) virus in individual Aedes aegypti mosquitoes and in serum from dengue patients were copied to cDNA, cloned, and sequenced. The nucleotide sequences of the E genes in more than 70% of the clones differed from the consensus sequence for the corresponding virus population at up to 11 sites, and 24 of the 94 clones contained at least one stop codon. Virus populations recovered up to 2 years apart yielded clones with similar polymorphisms in the E gene. For one mosquito, the clones obtained fell into two genotypes. One group of sequences was closely related to those of viruses recovered from dengue patients in the same locality (Yangon, Myanmar) since 1995 and were classified as Asian 1 genotype. The second group were Cosmopolitan genotype viruses which were also circulating in Yangon in 2000 and which were related to DEN-2 viruses sampled from southern China in 1999. Finally, one clone was identified as a recombinant genome composed of portions of these two “parental” genotypes. This is the first report of recombinant and parental dengue viruses in a single host. PMID:12634407
Mutation in the Auxiliary Calcium-Channel Subunit CACNA2D4 Causes Autosomal Recessive Cone Dystrophy
Wycisk, Katharina Agnes; Zeitz, Christina; Feil, Silke; Wittmer, Mariana; Forster, Ursula; Neidhardt, John; Wissinger, Bernd; Zrenner, Eberhart; Wilke, Robert; Kohl, Susanne; Berger, Wolfgang
2006-01-01
Retinal signal transmission depends on the activity of high voltage–gated l-type calcium channels in photoreceptor ribbon synapses. We recently identified a truncating frameshift mutation in the Cacna2d4 gene in a spontaneous mouse mutant with profound loss of retinal signaling and an abnormal morphology of ribbon synapses in rods and cones. The Cacna2d4 gene encodes an l-type calcium-channel auxiliary subunit of the α2δ type. Mutations in its human orthologue, CACNA2D4, were not yet known to be associated with a disease. We performed mutation analyses of 34 patients who received an initial diagnosis of night blindness, and, in two affected siblings, we detected a homozygous nucleotide substitution (c.2406C→A) in CACNA2D4. The mutation introduces a premature stop codon that truncates one-third of the corresponding open reading frame. Both patients share symptoms of slowly progressing cone dystrophy. These findings represent the first report of a mutation in the human CACNA2D4 gene and define a novel gene defect that causes autosomal recessive cone dystrophy. PMID:17033974
Novel mutation at the initiation codon in the Norrie disease gene in two Japanese families.
Isashiki, Y; Ohba, N; Yanagita, T; Hokita, N; Doi, N; Nakagawa, M; Ozawa, M; Kuroda, N
1995-01-01
We have identified a new mutation of Norrie disease (ND) gene in two Japanese males from unrelated families; they showed typical ocular features of ND but no mental retardation or hearing impairment. A mutation was found in both patients at the initiation codon of exon 2 of the ND gene (ATG to GTG), with otherwise normal nucleotide sequences. Their mothers had the normal and mutant types of the gene, which was expected for heterozygotes of the disease. The mutation of the initiation codon would cause the failure of ND gene expression or a defect in translation thereby truncating the amino terminus of ND protein. In view of the rarity and marked heterogeneity of mutations in the ND gene, the present apparently unrelated Japanese families who have lived in the same area for over two centuries presumably share the origin of the mutation.
The Role of ABC Proteins in Drug-Resistant Breast Cancer Cells
2007-04-01
and a biotin acceptor domain) under control of the alcohol oxidase promoter (Figure 2). Upon methanol induction, the yeast expressed high levels of...as native cDNA. Therefore, we backtranslated the protein into a nucleotide sequence codon-optimized for expression in Pichia pastoris yeast. Yeast
Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi
2010-10-01
Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
Presence of tannins in sorghum grains is conditioned by different natural alleles of Tannin1
Wu, Yuye; Li, Xianran; Xiang, Wenwen; Zhu, Chengsong; Lin, Zhongwei; Wu, Yun; Li, Jiarui; Pandravada, Satchidanand; Ridder, Dustan D.; Bai, Guihua; Wang, Ming L.; Trick, Harold N.; Bean, Scott R.; Tuinstra, Mitchell R.; Tesso, Tesfaye T.; Yu, Jianming
2012-01-01
Sorghum, an ancient old-world cereal grass, is the dietary staple of over 500 million people in more than 30 countries in the tropics and semitropics. Its C4 photosynthesis, drought resistance, wide adaptation, and high nutritional value hold the promise to alleviate hunger in Africa. Not present in other major cereals, such as rice, wheat, and maize, condensed tannins (proanthocyanidins) in the pigmented testa of some sorghum cultivars have been implicated in reducing protein digestibility but recently have been shown to promote human health because of their high antioxidant capacity and ability to fight obesity through reduced digestion. Combining quantitative trait locus mapping, meta-quantitative trait locus fine-mapping, and association mapping, we showed that the nucleotide polymorphisms in the Tan1 gene, coding a WD40 protein, control the tannin biosynthesis in sorghum. A 1-bp G deletion in the coding region, causing a frame shift and a premature stop codon, led to a nonfunctional allele, tan1-a. Likewise, a different 10-bp insertion resulted in a second nonfunctional allele, tan1-b. Transforming the sorghum Tan1 ORF into a nontannin Arabidopsis mutant restored the tannin phenotype. In addition, reduction in nucleotide diversity from wild sorghum accessions to landraces and cultivars was found at the region that codes the highly conserved WD40 repeat domains and the C-terminal region of the protein. Genetic research in crops, coupled with nutritional and medical research, could open the possibility of producing different levels and combinations of phenolic compounds to promote human health. PMID:22699509
Lehman, Donna M; Fu, Dong-Jing; Freeman, Angela B; Hunt, Kelly J; Leach, Robin J; Johnson-Pais, Teresa; Hamlington, Jeanette; Dyer, Thomas D; Arya, Rector; Abboud, Hanna; Göring, Harald H H; Duggirala, Ravindranath; Blangero, John; Konrad, Robert J; Stern, Michael P
2005-04-01
Excess O-glycosylation of proteins by O-linked beta-N-acetylglucosamine (O-GlcNAc) may be involved in the pathogenesis of type 2 diabetes. The enzyme O-GlcNAc-selective N-acetyl-beta-d glucosaminidase (O-GlcNAcase) encoded by MGEA5 on 10q24.1-q24.3 reverses this modification by catalyzing the removal of O-GlcNAc. We have previously reported the linkage of type 2 diabetes and age at diabetes onset to an overlapping region on chromosome 10q in the San Antonio Family Diabetes Study (SAFADS). In this study, we investigated menangioma-expressed antigen-5 (MGEA5) as a positional candidate gene. Twenty-four single nucleotide polymorphisms (SNPs), identified by sequencing 44 SAFADS subjects, were genotyped in 436 individuals from 27 families whose data were used in the original linkage report. Association tests indicated significant association of a novel SNP with the traits diabetes (P = 0.0128, relative risk = 2.77) and age at diabetes onset (P = 0.0017). The associated SNP is located in intron 10, which contains an alternate stop codon and may lead to decreased expression of the 130-kDa isoform, the isoform predicted to contain the O-GlcNAcase activity. We investigated whether this variant was responsible for the original linkage signal. The variance attributed to this SNP accounted for approximately 25% of the logarithm of odds. These results suggest that this variant within the MGEA5 gene may increase diabetes risk in Mexican Americans.
Frequency of a natural truncated allele of MdMLO19 in the germplasm of Malus domestica.
Pessina, Stefano; Palmieri, Luisa; Bianco, Luca; Gassmann, Jennifer; van de Weg, Eric; Visser, Richard G F; Magnago, Pierluigi; Schouten, Henk J; Bai, Yuling; Riccardo Velasco, R; Malnoy, Mickael
2017-01-01
Podosphaera leucotricha is the causal agent of powdery mildew (PM) in apple. To reduce the amount of fungicides required to control this pathogen, the development of resistant apple cultivars should become a priority. Resistance to PM was achieved in various crops by knocking out specific members of the MLO gene family that are responsible for PM susceptibility (S-genes). In apple, the knockdown of MdMLO19 resulted in PM resistance. However, since gene silencing technologies such as RNAi are perceived unfavorably in Europe, a different approach that exploits this type of resistance is needed. This work evaluates the presence of non-functional naturally occurring alleles of MdMLO19 in apple germplasm. The screening of the re-sequencing data of 63 apple individuals led to the identification of 627 single nucleotide polymorphisms (SNPs) in five MLO genes ( MdMLO5, MdMLO7, MdMLO11, MdMLO18 , and MdMLO19 ), 127 of which were located in exons. The T-1201 insertion of a single nucleotide in MdMLO19 caused the formation of an early stop codon, resulting in a truncated protein lacking 185 amino acids, including the calmodulin-binding domain. The presence of the insertion was evaluated in 115 individuals. It was heterozygous in 64 and homozygous in 25. Twelve of the 25 individuals carrying the insertion in homozygosity were susceptible to PM. After barley, pea, cucumber, and tomato, apple would be the fifth species for which a natural non-functional mlo allele has been found.
Zúñiga, M C; Steitz, J A
1977-01-01
The nucleotide sequence of tRNA1Gly isolated from the posterior silk gland of Bombyx mori has been determined. This transfer RNA is present in high amounts in the posterior silk gland during the fifth larval instar. It has a GCC anticodon, capable of decoding a major glycine codon in the fibroin messenger RNA, GGU. Structural features of Bombyx tRNA1Gly and its homology to other eukaryotic glycine tRNAs are discussed. Images PMID:414206
USDA-ARS?s Scientific Manuscript database
Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a non-incorporated protein in concert with numerous insect and plant proteins to regulate virus movem...
Jiang, Fan; Pan, Xubin; Li, Xuankun; Yu, Yanxue; Zhang, Junhua; Jiang, Hongshan; Dou, Liduo; Zhu, Shuifang
2016-01-01
The genus Dacus is one of the most economically important tephritid fruit flies. The first complete mitochondrial genome (mitogenome) of Dacus species – D. longicornis was sequenced by next-generation sequencing in order to develop the mitogenome data for this genus. The circular 16,253 bp mitogenome is the typical set and arrangement of 37 genes present in the ancestral insect. The mitogenome data of D. longicornis was compared to all the published homologous sequences of other tephritid species. We discovered the subgenera Bactrocera, Daculus and Tetradacus differed from the subgenus Zeugodacus, the genera Dacus, Ceratitis and Procecidochares in the possession of TA instead of TAA stop codon for COI gene. There is a possibility that the TA stop codon in COI is the synapomorphy in Bactrocera group in the genus Bactrocera comparing with other Tephritidae species. Phylogenetic analyses based on the mitogenome data from Tephritidae were inferred by Bayesian and Maximum-likelihood methods, strongly supported the sister relationship between Zeugodacus and Dacus. PMID:27812024
Unusual AIP mutation and phenocopy in the family of a young patient with acromegalic gigantism.
Imran, Syed Ali; Aldahmani, Khaled A; Penney, Lynette; Croul, Sidney E; Clarke, David B; Collier, David M; Iacovazzo, Donato; Korbonits, Márta
2018-01-01
Early-onset acromegaly causing gigantism is often associated with aryl-hydrocarbon-interacting receptor protein ( AIP ) mutation, especially if there is a positive family history. A15y male presented with tiredness and visual problems. He was 201 cm tall with a span of 217 cm. He had typical facial features of acromegaly, elevated IGF-1, secondary hypogonadism and a large macroadenoma. His paternal aunt had a history of acromegaly presenting at the age of 35 years. Following transsphenoidal surgery, his IGF-1 normalized and clinical symptoms improved. He was found to have a novel AIP mutation destroying the stop codon c.991T>C; p.*331R. Unexpectedly, his father and paternal aunt were negative for this mutation while his mother and older sister were unaffected carriers, suggesting that his aunt represents a phenocopy. Typical presentation for a patient with AIP mutation with excess growth and eunuchoid proportions.Unusual, previously not described AIP variant with loss of the stop codon.Phenocopy may occur in families with a disease-causing germline mutation.
Unusual AIP mutation and phenocopy in the family of a young patient with acromegalic gigantism
Aldahmani, Khaled A; Penney, Lynette; Croul, Sidney E; Clarke, David B; Collier, David M; Iacovazzo, Donato; Korbonits, Márta
2018-01-01
Summary Early-onset acromegaly causing gigantism is often associated with aryl-hydrocarbon-interacting receptor protein (AIP) mutation, especially if there is a positive family history. A15y male presented with tiredness and visual problems. He was 201 cm tall with a span of 217 cm. He had typical facial features of acromegaly, elevated IGF-1, secondary hypogonadism and a large macroadenoma. His paternal aunt had a history of acromegaly presenting at the age of 35 years. Following transsphenoidal surgery, his IGF-1 normalized and clinical symptoms improved. He was found to have a novel AIP mutation destroying the stop codon c.991T>C; p.*331R. Unexpectedly, his father and paternal aunt were negative for this mutation while his mother and older sister were unaffected carriers, suggesting that his aunt represents a phenocopy. Learning points: Typical presentation for a patient with AIP mutation with excess growth and eunuchoid proportions. Unusual, previously not described AIP variant with loss of the stop codon. Phenocopy may occur in families with a disease-causing germline mutation. PMID:29472986
Molecular Genetic Analysis and Evolution of Segment 7 in Rice Black-Streaked Dwarf Virus in China
Chen, Yanping; Wu, Jirong; Meng, Qingchang; Han, Xiaohua; Hao, Zhuanfang; Li, Mingshun; Yong, Hongjun; Zhang, Degui; Zhang, Shihuang; Li, Xinhai
2015-01-01
Rice black-streaked dwarf virus (RBSDV) causes maize rough dwarf disease or rice black-streaked dwarf disease and can lead to severe yield losses in maize and rice. To analyse RBSDV evolution, codon usage bias and genetic structure were investigated in 111 maize and rice RBSDV isolates from eight geographic locations in 2013 and 2014. The linear dsRNA S7 is A+U rich, with overall codon usage biased toward codons ending with A (A3s, S7-1: 32.64%, S7-2: 29.95%) or U (U3s, S7-1: 44.18%, S7-2: 46.06%). Effective number of codons (Nc) values of 45.63 in S7-1 (the first open reading frame of S7) and 39.96 in S7-2 (the second open reading frame of S7) indicate low degrees of RBSDV-S7 codon usage bias, likely driven by mutational bias regardless of year, host, or geographical origin. Twelve optimal codons were detected in S7. The nucleotide diversity (π) of S7 sequences in 2013 isolates (0.0307) was significantly higher than in 2014 isolates (0.0244, P = 0.0226). The nucleotide diversity (π) of S7 sequences in isolates from Jinan (0.0391) was higher than that from the other seven locations (P < 0.01). Only one S7 recombinant was detected in Baoding. RBSDV isolates could be phylogenetically classified into two groups according to S7 sequences, and further classified into two subgroups. S7-1 and S7-2 were under negative and purifying selection, with respective Ka/Ks ratios of 0.0179 and 0.0537. These RBSDV populations were expanding (P < 0.01) as indicated by negative values for Tajima's D, Fu and Li's D, and Fu and Li's F. Genetic differentiation was detected in six RBSDV subpopulations (P < 0.05). Absolute Fst (0.0790) and Nm (65.12) between 2013 and 2014, absolute Fst (0.1720) and Nm (38.49) between maize and rice, and absolute Fst values of 0.0085-0.3069 and Nm values of 0.56-29.61 among these eight geographic locations revealed frequent gene flow between subpopulations. Gene flow between 2013 and 2014 was the most frequent. PMID:26121638
Anwar, Munir A; Kralj, Slavko; Piqué, Anna Villar; Leemhuis, Hans; van der Maarel, Marc J E C; Dijkhuizen, Lubbert
2010-04-01
Fructansucrase enzymes polymerize the fructose moiety of sucrose into levan or inulin fructans, with beta(2-6) and beta(2-1) linkages, respectively. Here, we report an evaluation of fructan synthesis in three Lactobacillus gasseri strains, identification of the fructansucrase-encoding genes and characterization of the recombinant proteins and fructan (oligosaccharide) products. High-performance anion-exchange chromatography and nuclear magnetic resonance analysis of the fructo-oligosaccharides (FOS) and polymers produced by the L. gasseri strains and the recombinant enzymes revealed that, in situ, L. gasseri strains DSM 20604 and 20077 synthesize inulin (and oligosaccharides) and levan products, respectively. L. gasseri DSM 20604 is only the second Lactobacillus strain shown to produce inulin polymer and FOS in situ, and is unique in its distribution of FOS synthesized, ranging from DP2 to DP13. The probiotic bacterium L. gasseri DSM 20243 did not produce any fructan, although we identified a fructansucrase-encoding gene in its genome sequence. Further studies showed that this L. gasseri DSM 20243 gene was prematurely terminated by a stop codon. Exchanging the stop codon for a glutamine codon resulted in a recombinant enzyme producing inulin and FOS. The three recombinant fructansucrase enzymes characterized from three different L. gasseri strains have very similar primary protein structures, yet synthesize different fructan products. An interesting feature of the L. gasseri strains is that they were unable to ferment raffinose, whereas their respective recombinant enzymes converted raffinose into fructan and FOS.
Villadsen, I S; Michelsen, O
1977-01-01
The ribonucleoside triphosphate, deoxyribonucleoside triphosphate, 3' -diphosphate guanosine 5' -diphosphate (ppGpp), and 5-phosphoribosyl 1-pyrophosphate (PRPP) pools in Escherichia coli B were determined by thin-layer chromatography during changing conditions to ammonium starvation. The intracellular concentrations of all nucleotides were found to change in a well-defined order several minutes before andy observed change in the optical density of the culture. The levels of purine nucleoside triphosphates (adenosine 5' -triphosphate [CTP], dCTP) and uridine nucleotides (uridine 5' -triphosphate, deoxythymidine 5'-triphosphate). The deoxyribonucleotides thus behaved as the ribonucleotides. The levels of ppGpp increased 11-fold after the decrease in uridine nucleotides, when the accumulation of stable ribonucleic acid (RNA) stopped. The level of the nucleotide pool did not stabilize until 30 min after the change in optical density. The pool of dGTP dropped concomitantly with the pool of CTP. The nucleotide precursor PRPP exhibited a transient increase, wtih maximum value of four times the exponential levels at the onset of starvation. Apparently the cell adjusts early to starvation by reducing either the phosphorylating activity or the nucleotide biosynthetic activity. As in other downshift systems, the accumulation of stable RNA stopped before the break in optical density and before the stop in protein accumulation. Cell divisions were quite insensitive to the control mechanisms operating on RNA and protein accumulation under ammonium starvation, since the cells continued to divide for 21 min without any net accumulation of RNA. Images PMID:323222
Dynamics of actin evolution in dinoflagellates.
Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F
2011-04-01
Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.
Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A.; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M.
2015-01-01
Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2′-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications. PMID:25611746
Rivera-Hoyos, Claudia M; Morales-Álvarez, Edwin David; Poveda-Cuevas, Sergio Alejandro; Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M
2015-01-01
Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications.
Beyond the Triplet Code: Context Cues Transform Translation.
Brar, Gloria A
2016-12-15
The elucidation of the genetic code remains among the most influential discoveries in biology. While innumerable studies have validated the general universality of the code and its value in predicting and analyzing protein coding sequences, established and emerging work has also suggested that full genome decryption may benefit from a greater consideration of a codon's neighborhood within an mRNA than has been broadly applied. This Review examines the evidence for context cues in translation, with a focus on several recent studies that reveal broad roles for mRNA context in programming translation start sites, the rate of translation elongation, and stop codon identity. Copyright © 2016 Elsevier Inc. All rights reserved.
Perrotta, Silverio; Di Iorgi, Natascia; Ragione, Fulvio Della; Scianguetta, Saverio; Borriello, Adriana; Allegri, Anna Elsa Maria; Ferraro, Marcella; Santoro, Claudia; Napoli, Flavia; Calcagno, Annalisa; Giaccardi, Marta; Cappa, Marco; Salerno, Maria Carolina; Cozzolino, Domenico; Maghnie, Mohamad
2015-04-01
Idiopathic early-onset central diabetes insipidus (CDI) might be due to mutations of arginine vasopressin-neurophysin II (AVP-NPII (AVP)) or wolframin (WFS1) genes. Sequencing of AVP and WFS1 genes was performed in nine children with CDI, aged between 9 and 68 months, and negative family history for polyuria and polydipsia. Two patients carried a mutation in the AVP gene: a heterozygous G-to-T transition at nucleotide position 322 of exon 2 (c.322G>T) resulting in a stop codon at position 108 (p.Glu108X), and a novel deletion from nucleotide 52 to 54 (c.52_54delTCC) producing a deletion of a serine at position 18 (p.Ser18del) of the AVP pre-prohormone signal peptide. A third patient carried two heterozygous mutations in the WFS1 gene localized on different alleles. The first change was A-to-G transition at nucleotide 997 in exon 8 (c.997A>G), resulting in a valine residue at position 333 in place of isoleucine (p.Ile333Val). The second novel mutation was a 3 bp insertion in exon 8, c.2392_2393insACG causing the addition of an aspartate residue at position 797 and the maintenance of the correct open reading frame (p. Asp797_Val798insAsp). While similar WFS1 protein levels were detected in fibroblasts from healthy subjects and from the patient and his parents, a major sensitivity to staurosporine-induced apoptosis was observed in the patient fibroblasts as well as in patients with Wolfram syndrome. Early-onset CDI is associated with de novo mutations of the AVP gene and with hereditary WFS1 gene changes. These findings have valuable implications for management and genetic counseling. © 2015 European Society of Endocrinology.
Mutations in the ADAR1 gene in Chinese families with dyschromatosis symmetrica hereditaria.
Zhang, G L; Shi, H J; Shao, M H; Li, M; Mu, H J; Gu, Y; Du, X F; Xie, P
2013-01-04
We investigated 2 Chinese families with dyschromatosis symmetrica hereditaria (DSH) and search for mutations in the adenosine deaminase acting on RNA1 (ADAR1) gene in these 2 pedigrees. We performed a mutation analysis of the ADAR1 gene in 2 Chinese families with DSH and reviewed all articles published regarding ADAR1 mutations reported since 2003 by using PubMed. By direct sequencing, a 2-nucleotide AG deletion, 2099-2100delAG, was found in family 1, and a C→T mutation was identified at nucleotide 1420 that changed codon 474 from arginine to a translational termination codon in family 2. Two different pathogenic mutations were identified, c.2099-2100delAG and c.1420C>T, the former being a novel mutation, and the latter previously reported in 3 other families with DSH. To date, a total of 110 mutations in the ADAR1 gene have been reported, and 10 of them were recurrent; the mutations R474X, R1083C, R1096X, and R1155W might be the DSH-related hotspots.
Parvari, R; Moses, S; Hershkovitz, E; Carmi, R; Bashan, N
1995-01-01
Glycogen storage disease type 1a (GSD 1a), an autosomal recessive disease, is caused by the inactivity of glucose-6-phosphatase, the gene of which has been recently cloned. We report on the missense mutation C-->T at nucleotide 326 of the G6Pase gene, causing the change of the Arg codon at position 83 into a Cys codon, as the single mutation detected in six Jewish patients. This finding suggests that this mutation might be prevalent among the Jewish population. A new missense mutation T-->G at nucleotide 576 resulting in V166G was found in an Arab Muslim patient. These families may benefit now from pre- and postnatal diagnosis by analysis of DNA from blood and amniotic fluid or chorionic villus cells rather than liver biopsy. No mutations in the G6Pase gene were detected in two GSD 1b patients.
Enamelin/ameloblastin gene polymorphisms in autosomal amelogenesis imperfecta among Syrian families.
Dashash, Mayssoon; Bazrafshani, Mohamed Riza; Poulton, Kay; Jaber, Saaed; Naeem, Emad; Blinkhorn, Anthony Stevenson
2011-02-01
This study was undertaken to investigate whether a single G deletion within a series of seven G residues (codon 196) at the exon 9-intron 9 boundary of the enamelin gene ENAM and a tri-nucleotide deletion at codon 180 in exon 7 (GGA vs deletion) of ameloblastin gene AMBN could have a role in autosomal amelogenesis imperfecta among affected Syrian families. A new technique - size-dependent, deletion screening - was developed to detect nucleotide deletion in ENAM and AMBN genes. Twelve Syrian families with autosomal-dominant or -recessive amelogenesis imperfecta were included. A homozygous/heterozygous mutation in the ENAM gene (152/152, 152/153) was identified in affected members of three families with autosomal-dominant amelogenesis imperfecta and one family with autosomal-recessive amelogenesis imperfecta. A heterozygous mutation (222/225) in the AMBN gene was identified. However, no disease causing mutations was found. The present findings provide useful information for the implication of ENAM gene polymorphism in autosomal-dominant/-recessive amelogenesis imperfecta. Further investigations are required to identify other genes responsible for the various clinical phenotypes. © 2010 Blackwell Publishing Asia Pty Ltd.
Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François
2000-01-01
The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Positive selection in the SLC11A1 gene in the family Equidae.
Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr
2016-05-01
Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.
Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong
2012-08-01
To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
Genome sequences of a mouse-avirulent and a mouse-virulent strain of Ross River virus.
Faragher, S G; Meek, A D; Rice, C M; Dalgarno, L
1988-04-01
The nucleotide sequence of the genomic RNA of a mouse-avirulent strain of Ross River virus, RRV NB5092 (isolated in 1969), has been determined and the corresponding sequence for the prototype mouse-virulent strain, RRV T48 (isolated in 1959), has been completed. The RRV NB5092 genome is approximately 11,674 nucleotides in length, compared with 11,853 nucleotides for RRV T48. RRV NB5092 and RRV T48 have the same genome organization. For both viruses an untranslated region of 80 nucleotides at the 5' end of the genome is followed by a 7440-nucleotide open reading frame which is interrupted after 5586 nucleotides by a single opal termination codon. By homology with other alphaviruses, the 5586-nucleotide open reading frame encodes the nonstructural proteins nsP1, nsP2, and nsP3; a fourth nonstructural protein, nsP4, is produced by read-through of the opal codon. The RRV nonstructural proteins show strong homology with the corresponding proteins of Sindbis virus and Semliki Forest virus in terms of size, net charge, and hydropathy characteristics. However, homology is not uniform between or within the proteins; nsP1, nsP2, and nsP4 contain extended domains which are highly conserved between alphaviruses, while the C-terminal region of nsP3 shows little conservation in sequence or length between alphaviruses. An untranslated "junction" region of 44 nucleotides (for RRV NB5092) or 47 nucleotides (for RRV T48) separates the nonstructural and structural protein coding regions. The structural proteins (capsid-E3-E2-6K-E1) are translated from an open reading frame of 3762 nucleotides which is followed by a 3'-untranslated region of approximately 348 nucleotides (for RRV NB5092) or 524 nucleotides (for RRV T48). Excluding deletions and insertions, the genomes of RRV NB5092 and RRV T48 differ at 284 nucleotides, representing a sequence divergence of 2.38%. Sequence deletions or insertions were found only in the noncoding regions and include a 173-nucleotide deletion in the 3'-untranslated region of RRV NB5092, compared with RRV T48. In the coding regions, most of the nucleotide differences are silent; there are 36 amino acid differences in the nonstructural proteins and 12 in the structural proteins. The distribution of amino acid differences between the two RRV strains correlates with the location of domains which are poorly conserved in sequence between alphaviruses. The possible role of amino acid differences in envelope glycoproteins E1 and E2 in determining the different antigenic and biological properties of RRV NB5092 and RRV T48 is discussed.
Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.
2014-01-01
Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster
Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.
1988-01-01
We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
Wu, C J; Janssen, G R
1996-10-01
The Streptomyces vinaceus viomycin phosphotransferase (vph) mRNA contains an untranslated leader with a conventional Shine-Dalgarno homology. The vph leader was removed by ligation of the vph coding sequence to the transcriptional start site of a Streptomyces or an Escherichia coli promoter, such that transcription would initiate at the first position of the vph start codon. Analysis of mRNA demonstrated that transcription initiated primarily at the A of the vph AUG translational start codon in both Streptomyces lividans and E. coli; cells expressing the unleadered vph mRNA were resistant to viomycin indicating that the Shine-Dalgarno sequence, or other features contained within the leader, was not necessary for vph translation. Addition of four nucleotides (5'-AUGC-3') onto the 5' end of the unleadered vph mRNA resulted in translation initiation from the vph start codon and the AUG triplet contained within the added sequence. Translational fusions of vph sequence to a Tn5 neo reporter gene indicated that the first 16 codons of vph coding sequence were sufficient to specify the translational start site and reading frame for expression of neomycin resistance in both E. coli and S. lividans.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N
2014-03-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.
2014-01-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Bohlke, Nina; Budisa, Nediljko
2014-01-01
One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function – a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. PMID:24433543
Whole exome sequencing in recurrent early pregnancy loss.
Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C K; Stephenson, Mary D; Rajcan-Separovic, Evica
2016-05-01
Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. Whole exome sequencing (WES) was performed using Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental, dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in 'complement and coagulation cascades pathway', and 'ciliary motility disorders'. We conclude that CNVs, individual SNVs and pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that collective effect of mutations in relevant biological pathways could be implicated in RPL. The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Whole exome sequencing in recurrent early pregnancy loss
Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C.K.; Stephenson, Mary D.; Rajcan-Separovic, Evica
2016-01-01
STUDY HYPOTHESIS Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). STUDY FINDING We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. WHAT IS KNOWN ALREADY Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. STUDY DESIGN, SAMPLES/MATERIALS, METHODS Whole exome sequencing (WES) was performed using Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. MAIN RESULTS AND THE ROLE OF CHANCE Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental, dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in ‘complement and coagulation cascades pathway’, and ‘ciliary motility disorders’. We conclude that CNVs, individual SNVs and pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. LIMITATIONS, REASONS FOR CAUTION The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. WIDER IMPLICATIONS OF THE FINDINGS This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that collective effect of mutations in relevant biological pathways could be implicated in RPL. STUDY FUNDING AND COMPETING INTEREST(S) The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. PMID:26826164
Divergence and codon usage bias of Betanodavirus, a neurotropic pathogen in fish.
He, Mei; Teng, Chun-Bo
2015-02-01
Betanodavirus is a small bipartite RNA virus of global economical significance that can cause severe neurological disorders to an increasing number of marine fish species. Herein, to further the understanding of the evolution of betanodavirus, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of their RNA polymerase and coat protein genes. Similar moderate nucleotide substitution rates were then estimated for the two genes. According to age calculations, the divergence of the two genes into the four genotypes initiated nearly simultaneously at ∼700 years ago, despite the different scenarios, whereas the seven analyzed chimeric isolates might be the outcomes of a single genetic reassortment event taking place in the early 1980s in Southern Europe. Furthermore, codon usage bias analyses indicated that each gene had influences in addition to mutational bias and codon choice of betanodavirus was not completely complied with that of fish host. Copyright © 2014 Elsevier Inc. All rights reserved.
2'-O-methylation in mRNA disrupts tRNA decoding during translation elongation.
Choi, Junhong; Indrisiunaite, Gabriele; DeMirci, Hasan; Ieong, Ka-Weng; Wang, Jinfan; Petrov, Alexey; Prabhakar, Arjun; Rechavi, Gideon; Dominissini, Dan; He, Chuan; Ehrenberg, Måns; Puglisi, Joseph D
2018-03-01
Chemical modifications of mRNA may regulate many aspects of mRNA processing and protein synthesis. Recently, 2'-O-methylation of nucleotides was identified as a frequent modification in translated regions of human mRNA, showing enrichment in codons for certain amino acids. Here, using single-molecule, bulk kinetics and structural methods, we show that 2'-O-methylation within coding regions of mRNA disrupts key steps in codon reading during cognate tRNA selection. Our results suggest that 2'-O-methylation sterically perturbs interactions of ribosomal-monitoring bases (G530, A1492 and A1493) with cognate codon-anticodon helices, thereby inhibiting downstream GTP hydrolysis by elongation factor Tu (EF-Tu) and A-site tRNA accommodation, leading to excessive rejection of cognate aminoacylated tRNAs in initial selection and proofreading. Our current and prior findings highlight how chemical modifications of mRNA tune the dynamics of protein synthesis at different steps of translation elongation.
PRIMARY STRUCTURE OF THE CYTOCHROME P450 LANOSTEROL 14A-DEMETHYLASE GENE FROM CANDIDA TROPICALIS
We report the nucleotide sequence of the gene and flanking DNA for the cytochrome P450 lanosterol 14 alpha-demethylase (14DM) from the yeast Candida tropicalis ATCC750. An open reading frame (ORF) of 528 codons encoding a 60.9-kD protein is identified. This ORF includes a charact...
Dar, Saira; Shuja, Rukhsana N; Shakoori, Abdul Rauf
2013-02-01
Metallothioneins (MTs) are metal binding proteins that are rich in cysteine residues constituting 10-30 % of the total protein, and in which the thiol groups bind to the metal ions. The increasing amount of metal ions in the medium have shown increased production of MTs by different organisms such as bacteria, protozoa and mammals like humans. PMCd1 is the first gene ever discovered in Paramecium, a ciliated protozoan, that could produce this MT in response to cadmium. In this study the PMCd1syn gene has been cloned in pET41a expression vector and expressed in an Escherichia coli BL21-codonplus strain for the first time. Since the gene PMCd1 amplified from Paramecium contained 10 codons, which could act as stop codons during expression in E. coli, this gene of 612 bps was synthesized to substitute these (stop) codons for the Paramecium sp. specific amino acids. For stability of the expressed protein, glutathione-S-transferase gene was fused with PMCd1syn gene and coexpressed. The cells expressing PMCd1syn demonstrated increased accumulation of cadmium. This is the first report of cadmium MT protein expressed from Paramecium species, particularly from synthetic MT gene (PMCd1syn). This fusion protein, the molecular weight of which has been confirmed to be 53.03 kDa with MALDI analysis, is rich in cysteine residues, and has been shown for the first time in this ciliate to bind to and sequester Cd(2+)-ions.
Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath
2017-08-23
Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group - specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systemically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions observed in all polymerase genes of EIVs and especially, clade 2 EIVs underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. The codon usage bias was low in all the polymerase genes of EIVs that was influenced by the multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
alpha-Tubulin of Histriculus cavicola (Ciliophora; Hypotrichea).
Pérez-Romero, P; Villalobo, E; Díaz-Ramos, C; Calvo, P; Santos-Rosa, F; Torres, A
1997-03-01
An alpha-tubulin gene fragment amplified by PCR from the hypotrichous ciliate Histriculus cavicola has been sequenced. This fragment, 1,182 bp long, contains an in-frame "stop" codon (UAA), which in other hypotrichous species codes for a glutamine residue. The comparison of the alpha-tubulin genes from several ciliates classes have revealed amino acid positions which could serve to distinguish these taxonomic groups.
Ancient nature of alternative splicing and functions of introns
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Kemin; Salamov, Asaf; Kuo, Alan
Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but intronsmore » retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.« less
Rapid Y degeneration and dosage compensation in plant sex chromosomes
Papadopulos, Alexander S. T.; Chester, Michael; Ridout, Kate; Filatov, Dmitry A.
2015-01-01
The nonrecombining regions of animal Y chromosomes are known to undergo genetic degeneration, but previous work has failed to reveal large-scale gene degeneration on plant Y chromosomes. Here, we uncover rapid and extensive degeneration of Y-linked genes in a plant species, Silene latifolia, that evolved sex chromosomes de novo in the last 10 million years. Previous transcriptome-based studies of this species missed unexpressed, degenerate Y-linked genes. To identify sex-linked genes, regardless of their expression, we sequenced male and female genomes of S. latifolia and integrated the genomic contigs with a high-density genetic map. This revealed that 45% of Y-linked genes are not expressed, and 23% are interrupted by premature stop codons. This contrasts with X-linked genes, in which only 1.3% of genes contained stop codons and 4.3% of genes were not expressed in males. Loss of functional Y-linked genes is partly compensated for by gene-specific up-regulation of X-linked genes. Our results demonstrate that the rate of genetic degeneration of Y-linked genes in S. latifolia is as fast as in animals, and that the evolutionary trajectories of sex chromosomes are similar in the two kingdoms. PMID:26438872
Identification of ALK germline mutation (3605delG) in pediatric anaplastic medulloblastoma.
Coco, Simona; De Mariano, Marilena; Valdora, Francesca; Servidei, Tiziana; Ridola, Vita; Andolfo, Immacolata; Oberthuer, André; Tonini, Gian Paolo; Longo, Luca
2012-10-01
The anaplastic lymphoma kinase (ALK) gene has been found either rearranged or mutated in several neoplasms such as anaplastic large-cell lymphoma, non-small-cell lung cancer, neuroblastoma and anaplastic thyroid cancer. Medulloblastoma (MB) is an embryonic pediatric cancer arising from nervous system, a tissue in which ALK is expressed during embryonic development. We performed an ALK mutation screening in 52 MBs and we found a novel heterozygous germline deletion of a single base in exon 23 (3605delG) in a case with marked anaplasia. This G deletion results in a frameshift mutation producing a premature stop codon in exon 25 of ALK tyrosine kinase domain. We also screened three human MB cell lines without finding any mutation of ALK gene. Quantitative expression analysis of 16 out of 52 samples showed overexpression of ALK mRNA in three MBs. In the present study, we report the first mutation of ALK found in MB. Moreover, a deletion of ALK gene producing a stop codon has not been detected in human tumors up to now. Further investigations are now required to elucidate whether the truncated form of ALK may have a role in signal transduction.
OprD mutations and inactivation in imipenem-resistant Pseudomonas aeruginosa isolates from China.
Fang, Zhi-Li; Zhang, Li-Yan; Huang, Ying-Min; Qing, Yun; Cao, Kai-Yuan; Tian, Guo-Bao; Huang, Xi
2014-01-01
To investigate the mechanisms involved in imipenem resistance of Pseudomonas aeruginosa in southern China, 61 imipenem-resistant P. aeruginosa clinical isolates were collected from 4 hospitals between October 2011 and June 2012. All isolates were resistant to imipenem, whereas 21.3% were susceptible or intermediate to meropenem. Variable degrees of resistance to other β-lactam and non-β-lactam antimicrobials were observed. PFGE revealed high-level of clonal diversity. Among the 61 isolates, 50 isolates had OprD loss by disrupted oprD mutations, including 43 with frameshift mutations of oprD and 7 with a premature stop codon by single point mutation. Six isolates were oprD-negative by PCR, suggestive of a major disruption of oprD genes. Five isolates had intact oprD but had reduced expression of oprD genes. In addition, only one isolate with disrupted oprD mutation by a premature stop codon was confirmed to be a metallo-β-lactamase producer (IMP-9). Our results show that the loss of OprD, as well as reduced expression of oprD and MBL production, were the predominant mechanisms of imipenem resistance in P. aeruginosa in southern China. Copyright © 2013 Elsevier B.V. All rights reserved.
Ni, Julie Z.; Grate, Leslie; Donohue, John Paul; Preston, Christine; Nobida, Naomi; O’Brien, Georgeann; Shiue, Lily; Clark, Tyson A.; Blume, John E.; Ares, Manuel
2007-01-01
Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression. We suggest that the extreme genomic conservation surrounding these regulatory splicing events within splicing factor genes demonstrates the evolutionary importance of maintaining tightly tuned homeostasis of RNA-binding protein levels in the vertebrate cell. PMID:17369403
Automated sequence analysis and editing software for HIV drug resistance testing.
Struck, Daniel; Wallis, Carole L; Denisov, Gennady; Lambert, Christine; Servais, Jean-Yves; Viana, Raquel V; Letsoalo, Esrom; Bronze, Michelle; Aitken, Sue C; Schuurman, Rob; Stevens, Wendy; Schmit, Jean Claude; Rinke de Wit, Tobias; Perez Bercoff, Danielle
2012-05-01
Access to antiretroviral treatment in resource-limited-settings is inevitably paralleled by the emergence of HIV drug resistance. Monitoring treatment efficacy and HIV drugs resistance testing are therefore of increasing importance in resource-limited settings. Yet low-cost technologies and procedures suited to the particular context and constraints of such settings are still lacking. The ART-A (Affordable Resistance Testing for Africa) consortium brought together public and private partners to address this issue. To develop an automated sequence analysis and editing software to support high throughput automated sequencing. The ART-A Software was designed to automatically process and edit ABI chromatograms or FASTA files from HIV-1 isolates. The ART-A Software performs the basecalling, assigns quality values, aligns query sequences against a set reference, infers a consensus sequence, identifies the HIV type and subtype, translates the nucleotide sequence to amino acids and reports insertions/deletions, premature stop codons, ambiguities and mixed calls. The results can be automatically exported to Excel to identify mutations. Automated analysis was compared to manual analysis using a panel of 1624 PR-RT sequences generated in 3 different laboratories. Discrepancies between manual and automated sequence analysis were 0.69% at the nucleotide level and 0.57% at the amino acid level (668,047 AA analyzed), and discordances at major resistance mutations were recorded in 62 cases (4.83% of differences, 0.04% of all AA) for PR and 171 (6.18% of differences, 0.03% of all AA) cases for RT. The ART-A Software is a time-sparing tool for pre-analyzing HIV and viral quasispecies sequences in high throughput laboratories and highlighting positions requiring attention. Copyright © 2012 Elsevier B.V. All rights reserved.
Minor, Katie M.; Shelton, G. Diane; Patterson, Edward E.; Bley, Tim; Oevermann, Anna; Bilzer, Thomas; Leeb, Tosso
2014-01-01
An inherited polyneuropathy (PN) observed in Leonberger dogs has clinical similarities to a genetically heterogeneous group of peripheral neuropathies termed Charcot-Marie-Tooth (CMT) disease in humans. The Leonberger disorder is a severe, juvenile-onset, chronic, progressive, and mixed PN, characterized by exercise intolerance, gait abnormalities and muscle atrophy of the pelvic limbs, as well as inspiratory stridor and dyspnea. We mapped a PN locus in Leonbergers to a 250 kb region on canine chromosome 16 (Praw = 1.16×10−10, Pgenome, corrected = 0.006) utilizing a high-density SNP array. Within this interval is the ARHGEF10 gene, a member of the rho family of GTPases known to be involved in neuronal growth and axonal migration, and implicated in human hypomyelination. ARHGEF10 sequencing identified a 10 bp deletion in affected dogs that removes four nucleotides from the 3′-end of exon 17 and six nucleotides from the 5′-end of intron 17 (c.1955_1958+6delCACGGTGAGC). This eliminates the 3′-splice junction of exon 17, creates an alternate splice site immediately downstream in which the processed mRNA contains a frame shift, and generates a premature stop codon predicted to truncate approximately 50% of the protein. Homozygosity for the deletion was highly associated with the severe juvenile-onset PN phenotype in both Leonberger and Saint Bernard dogs. The overall clinical picture of PN in these breeds, and the effects of sex and heterozygosity of the ARHGEF10 deletion, are less clear due to the likely presence of other forms of PN with variable ages of onset and severity of clinical signs. This is the first documented severe polyneuropathy associated with a mutation in ARHGEF10 in any species. PMID:25275565
Garuti, R; Lelli, N; Barozzini, M; Tiozzo, R; Ghisellini, M; Simone, M L; Li Volti, S; Garozzo, R; Mollica, F; Vergoni, W; Bertolini, S; Calandra, S
1996-03-01
In the present study we report two novel partial deletions of the LDL-R gene. The first (FH Siracusa), found in an FH-heterozygote, consists of a 20 kb deletion spanning from the 5' flanking region to the intron 2 of the LDL-receptor gene. The elimination of the promoter and the first two exons prevents the transcription of the deleted allele, as shown by Northern blot analysis of LDL-R mRNA isolated from the proband's fibroblasts. The second deletion (FH Reggio Emilia), which eliminates 11 nucleotides of exon 10, was also found in an FH heterozygote. The characterization of this deletion was made possible by a combination of techniques such as single strand conformation polymorphism (SSCP) analysis, direct sequence of exon 10 and cloning of the normal and deleted exon 10 from the proband's DNA. The 11 nt deletion occurs in a region of exon 10 which contains three triplets (CTG) and two four-nucleotides (CTGG) direct repeats. This structural feature might render this region more susceptible to a slipped mispairing during DNA duplication. Since this deletion causes a shift of the BamHI site at the 5' end of exon 10, a method has been devised for its rapid screening which is based on the PCR amplification of exon 10 followed by BamHI digestion. FH Reggio Emilia deletion produces a shift in the reading frame downstream from Lys458, leading to a sequence of 51 novel amino acids before the occurrence of a premature stop codon (truncated receptor). However, since RT-PCR failed to demonstrate the presence of the mutant LDL-R mRNA in proband fibroblasts, it is likely that the amount of truncated receptor produced in these cells is negligible.
GeneBuilder: interactive in silico prediction of gene structure.
Milanesi, L; D'Angelo, D; Rogozin, I B
1999-01-01
Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Rios, Jonathan J; Perelygin, Andrey A; Long, Maureen T; Lear, Teri L; Zharkikh, Andrey A; Brinton, Margo A; Adelson, David L
2007-01-01
Background The mammalian OAS/RNASEL pathway plays an important role in antiviral host defense. A premature stop-codon within the murine Oas1b gene results in the increased susceptibility of mice to a number of flaviviruses, including West Nile virus (WNV). Mutations in either the OAS1 or RNASEL genes may also modulate the outcome of WNV-induced disease or other viral infections in horses. Polymorphisms in the human OAS gene cluster have been previously utilized for case-control analysis of virus-induced disease in humans. No polymorphisms have yet been identified in either the equine OAS1 or RNASEL genes for use in similar case-control studies. Results Genomic sequence for equine OAS1 was obtained from a contig assembly generated from a shotgun subclone library of CHORI-241 BAC 100I10. Specific amplification of regions of the OAS1 gene from 13 horses of various breeds identified 33 single nucleotide polymorphisms (SNP) and two microsatellites. RNASEL cDNA sequences were determined for 8 mammals and utilized in a phylogenetic analysis. The chromosomal location of the RNASEL gene was assigned by FISH to ECA5p17-p16 using two selected CHORI-241 BAC clones. The horse genomic RNASEL sequence was assembled. Specific amplification of regions of the RNASEL gene from 13 horses identified 31 SNPs. Conclusion In this report, two dinucleotide microsatellites and 64 single nucleotide polymorphisms within the equine OAS1 and RNASEL genes were identified. These polymorphisms are the first to be reported for these genes and will facilitate future case-control studies of horse susceptibility to infectious diseases. PMID:17822564
Fernández-Guerra, Paula; Navarrete, Rosa; Weisiger, Kara; Desviat, Lourdes R; Packman, Seymour; Ugarte, Magdalena; Rodríguez-Pombo, Pilar
2010-12-01
Mutations in any of the three different genes--BCKDHA, BCKDHB, and DBT--encoding for the E1α, E1β, and E2 catalytic components of the branched-chain α-ketoacid dehydrogenase complex can cause maple syrup urine disease (MSUD). Disease severity ranges from the classic to the mildest variant types and precise genotypes, mostly based on missense mutations, have been associated to the less severe presentations of the disease. Herein, we examine the consequences at the messenger RNA (mRNA) level of the novel intronic alteration c.288+9C>T found in heterozygous fashion in a BCKDHA variant MSUD patient who also carries the nucleotide change c.745G>A (p.Gly249Ser), previously described as a severe change. Direct analysis of the processed transcripts from the patient showed--in addition to a low but measurable level of normal mRNA product--an aberrantly spliced mRNA containing a 7-bp fragment of intron 2, which could be rescued when the patient's cells were treated with emetine. This aberrant transcript with a premature stop codon would be unstable, supporting the possible activation of nonsense-mediated mRNA decay pathway. Consistent with this finding, minigene splicing assays demonstrated that the point mutation c.288+9C>T is sufficient to create a cryptic splice site and cause the observed 7-bp insertion. Furthermore, our results strongly suggest that the c.288+9C>T allele in the patient generates both normal and aberrant transcripts that could sustain the variant presentation of the disease, highlighting the importance of correct genotyping to establish genotype-phenotype correlations and as basis for the development of therapeutic interventions.
Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H
2013-01-01
Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.
Refactoring the Genetic Code for Increased Evolvability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pines, Gur; Winkler, James D.; Pines, Assaf
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Refactoring the Genetic Code for Increased Evolvability
Pines, Gur; Winkler, James D.; Pines, Assaf; ...
2017-11-14
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Transfer RNAs with novel cloverleaf structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus
We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
Transfer RNAs with novel cloverleaf structures
Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus; ...
2016-10-05
We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
Speed, Haley E.; Kouser, Mehreen; Xuan, Zhong; Reimers, Jeremy M.; Ochoa, Christine F.; Gupta, Natasha; Liu, Shunan
2015-01-01
SHANK3 (also known as PROSAP2) is a postsynaptic scaffolding protein at excitatory synapses in which mutations and deletions have been implicated in patients with idiopathic autism, Phelan–McDermid (aka 22q13 microdeletion) syndrome, and other neuropsychiatric disorders. In this study, we have created a novel mouse model of human autism caused by the insertion of a single guanine nucleotide into exon 21 (Shank3G). The resulting frameshift causes a premature STOP codon and loss of major higher molecular weight Shank3 isoforms at the synapse. Shank3G/G mice exhibit deficits in hippocampus-dependent spatial learning, impaired motor coordination, altered response to novelty, and sensory processing deficits. At the cellular level, Shank3G/G mice also exhibit impaired hippocampal excitatory transmission and plasticity as well as changes in baseline NMDA receptor-mediated synaptic responses. This work identifies clear alterations in synaptic function and behavior in a novel, genetically accurate mouse model of autism mimicking an autism-associated insertion mutation. Furthermore, these findings lay the foundation for future studies aimed to validate and study region-selective and temporally selective genetic reversal studies in the Shank3G/G mouse that was engineered with such future experiments in mind. PMID:26134648
Fontana, Célia; Lambert, Ambroise; Benaroudj, Nadia; Gasparini, David; Gorgette, Olivier; Cachet, Nathalie; Bomchil, Natalia; Picardeau, Mathieu
2016-01-01
Pathogenic Leptospira strains are responsible for leptospirosis, a worldwide emerging zoonotic disease. These spirochetes are unique amongst bacteria because of their corkscrew-like cell morphology and their periplasmic flagella. Motility is reported as an important virulence determinant, probably favoring entry and dissemination of pathogenic Leptospira in the host. However, proteins constituting the periplasmic flagella and their role in cell shape, motility and virulence remain poorly described. In this study, we characterized a spontaneous L. interrogans mutant strain lacking motility, correlated with the loss of the characteristic hook-shaped ends, and virulence in the animal model. Whole genome sequencing allowed the identification of one nucleotide deletion in the fliM gene resulting in a premature stop codon, thereby preventing the production of flagellar motor switch protein FliM. Genetic complementation restored cell morphology, motility and virulence comparable to those of wild type cells. Analyses of purified periplasmic flagella revealed a defect in flagella assembly, resulting in shortened flagella compared to the wild type strain. This also correlated with a lower amount of major filament proteins FlaA and FlaB. Altogether, these findings demonstrate that FliM is required for full and correct assembly of the flagella which is essential for motility and virulence.
A nonsense mutation of PEPD in four Amish children with prolidase deficiency.
Wang, Heng; Kurien, Biji T; Lundgren, David; Patel, Nisha C; Kaufman, K M; Miller, David L; Porter, Andrew C; D'Souza, Anil; Nye, Leah; Tumbush, John; Hupertz, Vera; Kerr, Douglas S; Kurono, S; Matsumoto, H; Scofield, R Hal
2006-03-15
Encoded by the peptidase D (PEPD) gene located at 19q12-q13.11, prolidase is a ubiquitous cytosolic enzyme that catalyzes hydrolysis of oligopeptides with a C-terminal proline or hydroxyproline. We describe here four Amish children with a severe phenotype of prolidase deficiency in the Geauga settlements of Ohio as the first report of prolidase deficiency in the Amish population as well as in the United States. The patients presented with infection, hepatosplenomegaly, or thrombocytopenia, in contrast to most cases previously reported in the literature, presenting with skin ulcers. All four patients had typical facial features, classic skin ulcers, and multisystem involvement. Recurrent infections, asthma-like chronic reactive airway disease, hyperimmunoglobulins, hepatosplenomegaly with mildly elevated aspartate transaminase (AST), anemia, and thrombocytopenia were common and massive imidodipeptiduria was universal. Prolidase activity in our patients is nearly undetectable. Direct sequencing of PCR-amplified genomic DNA for all of the exons from the four patients revealed the same homozygous single nucleotide mutation c.793 T > C in exon 11, resulting in a premature stop-codon at amino acid residue 265 (p.R265X). It is speculated that the severe phenotype in these patients might be associated with the type of the PEPD gene mutation. 2006 Wiley-Liss, Inc.
Suzuki, Tomonori; Nagano, Thomas; Niwa, Koichi; Uchino, Masataka; Tomizawa, Motohiro; Sagane, Yoshimasa; Watanabe, Toshihiro
2017-01-01
A non-toxigenic mutant of the toxigenic serotype C Clostridium botulinum strain Stockholm (C-St), C-N71, does not produce the botulinum neurotoxin (BoNT). However, the original strain C-St produces botulinum toxin complex, in which BoNT is associated with non-toxic non-hemagglutinin (NTNHA) and three hemagglutinin proteins (HA-70, HA-33, and HA-17). Therefore, in this study, we aimed to elucidate the effects of bont gene knockout on the formation of the "toxin complex." Nucleotide sequence analysis revealed that a premature stop codon was introduced in the bont gene, whereas other genes were not affected by this mutation. Moreover, we successfully purified the "toxin complex" produced by C-N71. The "toxin complex" was identified as a mixture of NTNHA/HA-70/HA-17/HA-33 complexes with intact NTNHA or C-terminally truncated NTNHA, without BoNT. These results indicated that knockout of the bont gene does not affect the formation of the "toxin complex." Since the botulinum toxin complex has been shown to play an important role in oral toxin transport in the human and animal body, a non-neurotoxic "toxin complex" of C-N71 may be valuable for the development of an oral drug delivery system.
Dean, Kimberly M; Grayhack, Elizabeth J
2012-12-01
We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tomatsu, Shunji; Fukuda, Seiji; Yamagishi, Atsushi
1996-05-01
We report four new mutations in Japanese patients with mucopolysaccharidosis IVA (MPSIVA) who were heterozygous for a common double gene deletion. A nonsense mutation of CAG to TAG at codon 148 in exon 4 was identified, resulting in a change of Q to a stop codon and three missense mutations: V (GTC) to A (GCC) at codon 138 in exon 4, P (CCC) to S (TCC) at codon 151 in exon 5, and P (CCC) to L (CTC) at codon 151 in exon 5. Introduction of these mutations into the normal GALNS cDNA and transient expression in cultured fibroblasts resultedmore » in a significant decrease in the enzyme activity. V138A and Q148X mutations result in changes of restriction site, which were analyzed by restriction-enzyme assay. P151S and P151L mutations that did not alter the restriction site were detected by direct sequencing or allele specific oligohybridization. Detection of the double gene deletion was initially done using Southern blots and was confirmed by PCR. Haplotypes were determined using seven polymorphisms to the GALNS locus in families with the double gene deletion. Haplotype analysis showed that the common double gene deletion occurred on a single haplotype, except for some variation in a VNTR-like polymorphism. This finding is consistent with a common founder for all individuals with this mutation. 48 refs., 5 figs., 1 tab.« less
Gennero, Isabelle; Edouard, Thomas; Rashad, Mona; Bieth, Eric; Conte-Aurio, Françoise; Marin, Françoise; Tauber, Maithé; Salles, Jean Pierre; El Kholy, Mohamed
2007-07-01
Deletions and mutations in the growth hormone receptor (GHR) gene are the underlying etiology of Laron syndrome (LS) or growth hormone (GH) insensitivity syndrome (GHIS), an autosomal recessive disease. Most patients are distributed in or originate from Mediterranean and Middle-Eastern countries. Sixty mutations have been described so far. We report a novel mutation in the GHR gene in a patient with LS. Genomic DNA sequencing of exon 5 revealed a TT insertion at nucleotide 422 after codon 122. The insertion resulted in a frameshift introducing a premature termination codon that led to a truncated receptor. We present clinical, biochemical and molecular evidence of LS as the result of this homozygous insertion.
Sun, Yu; Chen, Chen; Gao, Jin; Abbas, Muhammad Nadeem; Kausar, Saima; Qian, Cen; Wang, Lei; Wei, Guoqing; Zhu, Bao-Jian
2017-01-01
In the present study, the complete sequence of the mitochondrial genome (mitogenome) of Daphnis nerii (Lepidoptera: Sphingidae) is described. The mitogenome (15,247 bp) of D.nerii encodes13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and an adenine (A) + thymine (T)-rich region. Its gene complement and order is similar to that of other sequenced lepidopterans. The 12 PCGs initiated by ATN codons except for cytochrome c oxidase subunit 1 (cox1) gene that is seemingly initiated by the CGA codon as documented in other insect mitogenomes. Four of the 13 PCGs have the incomplete termination codon T, while the remainder terminated with the canonical stop codon. This mitogenome has six major intergenic spacers, with the exception of A+T-rich region, spanning at least 10 bp. The A+T-rich region is 351 bp long, and contains some conserved regions, including ‘ATAGA’ motif followed by a 17 bp poly-T stretch, a microsatellite-like element (AT)9 and also a poly-A element. Phylogenetic analyses based on 13 PCGs using maximum likelihood (ML) and Bayesian inference (BI) revealed that D. nerii resides in the Sphingidae family. PMID:28598968
Changes in mitochondrial genetic codes as phylogenetic characters: Two examples from the flatworms
Telford, Maximilian J.; Herniou, Elisabeth A.; Russell, Robert B.; Littlewood, D. Timothy J.
2000-01-01
Shared molecular genetic characteristics other than DNA and protein sequences can provide excellent sources of phylogenetic information, particularly if they are complex and rare and are consequently unlikely to have arisen by chance convergence. We have used two such characters, arising from changes in mitochondrial genetic code, to define a clade within the Platyhelminthes (flatworms), the Rhabditophora. We have sampled 10 distinct classes within the Rhabditophora and find that all have the codon AAA coding for the amino acid Asn rather than the usual Lys and AUA for Ile rather than the usual Met. We find no evidence to support claims that the codon UAA codes for Tyr in the Platyhelminthes rather than the standard stop codon. The Rhabditophora are a very diverse group comprising the majority of the free-living turbellarian taxa and the parasitic Neodermata. In contrast, three other classes of turbellarian flatworm, the Acoela, Nemertodermatida, and Catenulida, have the standard invertebrate assignments for these codons and so are convincingly excluded from the rhabditophoran clade. We have developed a rapid computerized method for analyzing genetic codes and demonstrate the wide phylogenetic distribution of the standard invertebrate code as well as confirming already known metazoan deviations from it (ascidian, vertebrate, echinoderm/hemichordate). PMID:11027335
DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins
Min, Xiang Jia; Hickey, Donal A.
2007-01-01
Abstract Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins. PMID:17974594
Earl, P L; Jones, E V; Moss, B
1986-01-01
A 5400-base-pair segment of the vaccinia virus genome was sequenced and an open reading frame of 938 codons was found precisely where the DNA polymerase had been mapped by transfer of a phosphonoacetate-resistance marker. A single nucleotide substitution changing glycine at position 347 to aspartic acid accounts for the drug resistance of the mutant vaccinia virus. The 5' end of the DNA polymerase mRNA was located 80 base pairs before the methionine codon initiating the open reading frame. Correspondence between the predicted Mr 108,577 polypeptide and the 110,000 purified enzyme indicates that little or no proteolytic processing occurs. Extensive homology, extending over 435 amino acids, was found upon comparing the DNA polymerase of vaccinia virus and DNA polymerase of Epstein-Barr virus. A highly conserved sequence of 14 amino acids in the carboxyl-terminal regions of the above DNA polymerases is also present at a similar location in adenovirus DNA polymerase. This structure, which is predicted to form a turn flanked by beta-pleated sheets, may form part of an essential binding or catalytic site that accounts for its presence in DNA polymerases of poxviruses, herpesviruses, and adenoviruses. Images PMID:3012524
Bohlke, Nina; Budisa, Nediljko
2014-02-01
One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function - a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. © 2014 The Authors. FEMS Microbiology Letters published by John Wiley & Sons Ltd on behalf of the Federation of European Microbiological Societies.
Groth-Malonek, Milena; Wahrmund, Ute; Polsakiewicz, Monika; Knoop, Volker
2007-04-01
Gene transfer from the mitochondrion into the nucleus is a corollary of the endosymbiont hypothesis. The frequent and independent transfer of genes for mitochondrial ribosomal proteins is well documented with many examples in angiosperms, whereas transfer of genes for components of the respiratory chain is a rarity. A notable exception is the nad7 gene, encoding subunit 7 of complex I, in the liverwort Marchantia polymorpha, which resides as a full-length, intron-carrying and transcribed, but nonspliced pseudogene in the chondriome, whereas its functional counterpart is nuclear encoded. To elucidate the patterns of pseudogene degeneration, we have investigated the mitochondrial nad7 locus in 12 other liverworts of broad phylogenetic distribution. We find that the mitochondrial nad7 gene is nonfunctional in 11 of them. However, the modes of pseudogene degeneration vary: whereas point mutations, accompanied by single-nucleotide indels, predominantly introduce stop codons into the reading frame in marchantiid liverworts, larger indels introduce frameshifts in the simple thalloid and leafy jungermanniid taxa. Most notably, however, the mitochondrial nad7 reading frame appears to be intact in the isolated liverwort genus Haplomitrium. Its functional expression is shown by cDNA analysis identifying typical RNA-editing events to reconstitute conserved codon identities and also confirming functional splicing of the 2 liverwort-specific group II introns. We interpret our results 1) to indicate the presence of a functional mitochondrial nad7 gene in the earliest land plants and strongly supporting a basal placement of Haplomitrium among the liverworts, 2) to indicate different modes of pseudogene degeneration and chondriome evolution in the later branching liverwort clades, 3) to suggest a surprisingly long maintenance of a nonfunctional gene in the presumed oldest group of land plants, and 4) to support the model of a secondary loss of RNA-editing activity in marchantiid liverworts.
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.
Zhang, Chun-Ting; Wang, Ju; Zhang, Ren
2002-02-01
The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius
2013-05-01
Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Au, Hilda H T; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES
Au, Hilda H. T.; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Gutiérrez, Verónica; Rego, Natalia; Naya, Hugo; García, Graciela
2015-10-28
Among teleosts, the South American genus Austrolebias (Cyprinodontiformes: Rivulidae) includes 42 taxa of annual fishes divided into five different species groups. It is a monophyletic genus, but morphological and molecular data do not resolve the relationship among intrageneric clades and high rates of substitution have been previously described in some mitochondrial genes. In this work, the complete mitogenome of a species of the genus was determined for the first time. We determined its structure, gene order and evolutionary peculiar features, which will allow us to evaluate the performance of mitochondrial genes in the phylogenetic resolution at different taxonomic levels. Regarding gene content and order, the circular mitogenome of A. charrua (17,271 pb) presents the typical pattern of vertebrate mitogenomes. It contains the full complement of 13 proteins-coding genes, 22 tRNA, 2 rRNA and one non-coding control region. Notably, the tRNA-Cys was only 57 bp in length and lacks the D-loop arm. In three full sibling individuals, heteroplasmatic condition was detected due to a total of 12 variable sites in seven protein-coding genes. Among cyprinodontiforms, the mitogenome of A. charrua exhibits the lowest G+C content (37 %) and GCskew, as well as the highest strand asymmetry with a net difference of T over A at 1st and 3rd codon positions. Considering the 12 coding-genes of the H strand, correspondence analyses of nucleotide composition and codon usage show that A and T at 1st and 3rd codon positions have the highest weight in the first axis, and segregate annual species from the other cyprinodontiforms analyzed. Given the annual life-style, their mitogenomes could be under different selective pressures. All 13 protein-coding genes are under strong purifying selection and we did not find any significant evidence of nucleotide sites showing episodic selection (dN >dS) at annual lineages. When fast evolving third codon positions were removed from alignments, the "supergene" tree recovers our reference species phylogeny as well as the Cytb, ND4L and ND6 genes. Therefore, third codon positions seem to be saturated in the aforementioned coding regions at intergeneric Cyprinodontiformes comparisons. The complete mitogenome obtained in present work, offers relevant data for further comparative studies on molecular phylogeny and systematics of this taxonomic controversial endemic genus of annual fishes.
USDA-ARS?s Scientific Manuscript database
The latency-related (LR)-RNA encoded by bovine herpes virus 1 (BoHV-1) is abundantly expressed in latently infected sensory neurons. Although the LR gene encodes several products, ORF2 appears to play a dominant role during the latency-reactivation cycle because a mutant virus containing stop codons...
Shariati, Gholamreza; Hamid, Mohammad; Saberi, Alihossein; Andashti, Behnaz; Galehdari, Hamid
2015-02-01
Megalencephalic leukoencephalopathy (MLC) is a rare neurological disorder with an autosomal recessive pattern. Clinical diagnosis was based on macrocephaly, recurrent seizure, and magnetic resonance imaging (MRI). Here we report first finding of a novel homozygous single base deletion in the MLC1 gene in an affected Iranian child causing a premature stop codon (p.L150fs.160X).
USDA-ARS?s Scientific Manuscript database
Avian leukosis virus (ALV) is an oncogenic virus causing a variety of neoplasms in chickens. The group of avian leukosis virus in chickens contains six closely related subgroups, A to E and J. The prevalence of ALVs in hosts may have imposed strong selection pressure toward resistance to ALV infecti...
Unit-length line-1 transcripts in human teratocarcinoma cells.
Skowronski, J; Fanning, T G; Singer, M F
1988-01-01
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons. Images PMID:2454389
Koutsoudakis, George; Urbanowicz, Richard A.; Mirza, Deeman; Ginkel, Corinne; Riebesehl, Nina; Calland, Noémie; Albecka, Anna; Price, Louisa; Hudson, Natalia; Descamps, Véronique; Backx, Matthijs; McClure, C. Patrick; Duverlie, Gilles; Pecheur, Eve-Isabelle; Dubuisson, Jean; Perez-del-Pulgar, Sofia; Forns, Xavier; Steinmann, Eike; Tarr, Alexander W.; Pietschmann, Thomas
2014-01-01
Serine is encoded by two divergent codon types, UCN and AGY, which are not interchangeable by a single nucleotide substitution. Switching between codon types therefore occurs via intermediates (threonine or cysteine) or via simultaneous tandem substitutions. Hepatitis C virus (HCV) chronically infects 2 to 3% of the global population. The highly variable glycoproteins E1 and E2 decorate the surface of the viral envelope, facilitate cellular entry, and are targets for host immunity. Comparative sequence analysis of globally sampled E1E2 genes, coupled with phylogenetic analysis, reveals the signatures of multiple archaic codon-switching events at seven highly conserved serine residues. Limited detection of intermediate phenotypes indicates that associated fitness costs restrict their fixation in divergent HCV lineages. Mutational pathways underlying codon switching were probed via reverse genetics, assessing glycoprotein functionality using multiple in vitro systems. These data demonstrate selection against intermediate phenotypes can act at the structural/functional level, with some intermediates displaying impaired virion assembly and/or decreased capacity for target cell entry. These effects act in residue/isolate-specific manner. Selection against intermediates is also provided by humoral targeting, with some intermediates exhibiting increased epitope exposure and enhanced neutralization sensitivity, despite maintaining a capacity for target cell entry. Thus, purifying selection against intermediates limits their frequencies in globally sampled strains, with divergent functional constraints at the protein level restricting the fixation of deleterious mutations. Overall our study provides an experimental framework for identification of barriers limiting viral substitutional evolution and indicates that serine codon-switching represents a genomic “fossil record” of historical purifying selection against E1E2 intermediate phenotypes. PMID:24173227
Stachyra, Anna; Redkiewicz, Patrycja; Kosson, Piotr; Protasiuk, Anna; Góra-Sochacka, Anna; Kudla, Grzegorz; Sirko, Agnieszka
2016-08-26
Highly pathogenic avian influenza viruses are a serious threat to domestic poultry and can be a source of new human pandemic and annual influenza strains. Vaccination is the main strategy of protection against influenza, thus new generation vaccines, including DNA vaccines, are needed. One promising approach for enhancing the immunogenicity of a DNA vaccine is to maximize its expression in the immunized host. The immunogenicity of three variants of a DNA vaccine encoding hemagglutinin (HA) from the avian influenza virus A/swan/Poland/305-135V08/2006 (H5N1) was compared in two animal models, mice (BALB/c) and chickens (broilers and layers). One variant encoded the wild type HA while the other two encoded HA without proteolytic site between HA1 and HA2 subunits and differed in usage of synonymous codons. One of them was enriched for codons preferentially used in chicken genes, while in the other modified variant the third position of codons was occupied in almost 100 % by G or C nucleotides. The variant of the DNA vaccine containing almost 100 % of the GC content in the third position of codons stimulated strongest immune response in two animal models, mice and chickens. These results indicate that such modification can improve not only gene expression but also immunogenicity of DNA vaccine. Enhancement of the GC content in the third position of the codon might be a good strategy for development of a variant of a DNA vaccine against influenza that could be highly effective in distant hosts, such as birds and mammals, including humans.
Das Bhowmik, Aneek; Gupta, Neerja; Dalal, Ashwin; Kabra, Madhulika
In the present study we report on genetic analysis in a patient with developmental delay, truncal obesity and vision problem, to find the causative mutation. Whole exome sequencing was performed on genomic DNA extracted from whole blood of the patient which revealed a homozygous nonsense variant (c.2816T>A) in exon 8 of ALMS1 gene that results in a stop codon and premature truncation at codon 939 (p.L939Ter) of the protein. The mutation was confirmed by Sanger sequencing. Exome sequencing was helpful in establishing diagnosis of Alstrom syndrome in this patient. This case highlights the utility of exome sequencing in clinical practice. Copyright © 2016 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Zika Virus Attenuation by Codon Pair Deoptimization Induces Sterilizing Immunity in Mouse Models.
Li, Penghui; Ke, Xianliang; Wang, Ting; Tan, Zhongyuan; Luo, Dan; Miao, Yuanjiu; Sun, Jianhong; Zhang, Yuan; Liu, Yan; Hu, Qinxue; Xu, Fuqiang; Wang, Hanzhong; Zheng, Zhenhua
2018-06-20
Zika virus (ZIKV) infection during the large epidemics in the Americas is related to congenital abnormities or fetal demise. To date, there is no vaccine, antiviral drug, or other modality available to prevent or treat Zika virus infection. Here we designed novel live attenuated ZIKV vaccine candidates using a codon pair deoptimization strategy. Three codon pair-deoptimized ZIKVs (Min E, Min NS1, and Min E+NS1) were de novo synthesized, and recovered by reverse genetics, containing large amounts of underrepresented codon pairs in E gene and/or NS1 gene. Amino acid sequence was 100% unchanged. The codon pair-deoptimized variants had decreased replication fitness in Vero cells (Min NS1 ≫ Min E > Min E+NS1), replicated more efficiently in insect cells than in mammalian cells, and demonstrated diminished virulence in a mouse model. In particular, Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and a single immunization achieved complete protection against lethal challenge and vertical ZIKV transmission during pregnancy. More importantly, due to the numerous synonymous substitutions in the codon pair-deoptimized strains, reversion to wild-type virulence through gradual nucleotide sequence mutations is unlikely. Our results collectively demonstrate that ZIKV can be effectively attenuated by codon pair deoptimization, highlighting the potential of Min E+NS1 as a safe vaccine candidate to prevent ZIKV infections. IMPORTANCE Due to unprecedented epidemics of Zika virus (ZIKV) across the Americas and the unexpected clinical symptoms including Guillain-Barré syndrome, microcephaly and other birth defects in human, there is an urgent need for ZIKV vaccine development. Here, we provided the first attenuated versions of ZIKV with two important genes (E and/or NS1) that were subjected to codon pair deoptimization. Compared to parental ZIKV, the codon pair-deoptimized ZIKVs were mammalian-attenuated, and preferred insect to mammalian Cells. Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and achieved complete protection against lethal challenge and vertical virus transmission during pregnancy. More importantly, the massive synonymous mutational approach made it impossible to revert to wild-type virulence. Our results have proven the feasibility of codon pair deoptimization as a strategy to develop live-attenuated vaccine candidates against flavivirues like ZIKV, Japanese encephalitis virus and West Nile virus. Copyright © 2018 American Society for Microbiology.
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine–pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism. PMID:26791911
Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun
2016-07-01
Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with sequenced mitochondrial genome from the genus Harmonia. The current length with partitial A + T-rich region of this mitochondrial genome is 16,387 bp. All the typical genes were sequenced except the trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the pupative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.
Ko, Jae-hyeong; Llopis, Paula Montero; Heinritz, Jennifer; Jacobs-Wagner, Christine; Söll, Dieter
2013-01-01
While translational read-through of stop codons by suppressor tRNAs is common in many bacteria, archaea and eukaryotes, this phenomenon has not yet been observed in the α-proteobacterium Caulobacter crescentus. Based on a previous report that C. crescentus and Escherichia coli tRNAHis have distinctive identity elements, we constructed E. coli tRNAHis CUA, a UAG suppressor tRNA for C. crescentus. By examining the expression of three UAG codon- containing reporter genes (encoding a β-lactamase, the fluorescent mCherry protein, or the C. crescentus xylonate dehydratase), we demonstrated that the E. coli histidyl-tRNA synthetase/tRNAHis CUA pair enables in vivo UAG suppression in C. crescentus. E. coli histidyl-tRNA synthetase (HisRS) or tRNAHis CUA alone did not achieve suppression; this indicates that the E. coli HisRS/tRNAHis CUA pair is orthogonal in C. crescentus. These results illustrate that UAG suppression can be achieved in C. crescentus with an orthogonal aminoacyl-tRNA synthetase/suppressor tRNA pair. PMID:24386240
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-21
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNA(Lys)(UUU) with hypermodified 5-methylaminomethyl-2-thiouridine (mnm(5)s(2)U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm(5)s(2)U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
NASA Astrophysics Data System (ADS)
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
Lee, Yee-Ki; Lau, Yee-Man; Cai, Zhu-Jun; Lai, Wing-Hon; Wong, Lai-Yung; Tse, Hung-Fat; Ng, Kwong-Man; Siu, Chung-Wah
2017-07-28
Precision medicine is an emerging approach to disease treatment and prevention that takes into account individual variability in the environment, lifestyle, and genetic makeup of patients. Patient-specific human induced pluripotent stem cells hold promise to transform precision medicine into real-life clinical practice. Lamin A/C (LMNA)-related cardiomyopathy is the most common inherited cardiomyopathy in which a substantial proportion of mutations in the LMNA gene are of nonsense mutation. PTC124 induces translational read-through over the premature stop codon and restores production of the full-length proteins from the affected genes. In this study we generated human induced pluripotent stem cells-derived cardiomyocytes from patients who harbored different LMNA mutations (nonsense and frameshift) to evaluate the potential therapeutic effects of PTC124 in LMNA -related cardiomyopathy. We generated human induced pluripotent stem cells lines from 3 patients who carried distinctive mutations (R225X, Q354X, and T518fs) in the LMNA gene. The cardiomyocytes derived from these human induced pluripotent stem cells lines reproduced the pathophysiological hallmarks of LMNA -related cardiomyopathy. Interestingly, PTC124 treatment increased the production of full-length LMNA proteins in only the R225X mutant, not in other mutations. Functional evaluation experiments on the R225X mutant further demonstrated that PTC124 treatment not only reduced nuclear blebbing and electrical stress-induced apoptosis but also improved the excitation-contraction coupling of the affected cardiomyocytes. Using cardiomyocytes derived from human induced pluripotent stem cells carrying different LMNA mutations, we demonstrated that the effect of PTC124 is codon selective. A premature stop codon UGA appeared to be most responsive to PTC124 treatment. © 2017 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley.
Origin of the polymorphism of the involucrin gene in Asians.
Djian, P; Delhomme, B; Green, H
1995-01-01
The involucrin gene, encoding a protein of the terminally differentiated keratinocyte, is polymorphic in the human. There is polymorphism of marker nucleotides a two positions in the coding region, and there are over eight polymorphic forms based on the number and kind of 10-codon tandem repeats in that part of the coding region most recently added in the human lineage. The involucrin alleles of Caucasians and Africans differ in both nucleotides and repeat patterns. We show that the involucrin alleles of East Asians (Chinese and Japanese) can be divided into two populations according to whether they possess the two marker nucleotides typical of Africans or Caucasians. The Asian population bearing Caucasian-type marker nucleotides has repeat patterns similar to those of Caucasians, whereas Asians bearing African-type marker nucleotides have repeat patterns that resemble those of Africans more than those of Caucasians. The existence of two populations of East Asian involucrin alleles gives support for the existence of a Eurasian stem lineage from which Caucasians and a part of the Asian population originated. PMID:7762559
2013-01-01
Background Dengue virus (DENV) infection represents a significant public health problem in many subtropical and tropical countries. Although genetically closely related, the four serotypes of DENV differ in antigenicity for which cross protection among serotypes is limited. It is also believed that both multi-serotype infection as well as the evolution of viral antigenicity may have confounding effects in increased dengue epidemics. Numerous studies have been performed that investigated genetic diversity of DENV, but the precise mechanism(s) of dengue virus evolution are not well understood. Results We investigated genome-wide genetic diversity and nucleotide substitution patterns in the four serotypes among samples collected from different countries in Asia and Central and South America and sequenced as part of the Genome Sequencing Center for Infectious Diseases at the Broad Institute. We applied bioinformatics, statistical and coalescent simulation methods to investigate diversity of codon sequences of DENV samples representing the four serotypes. We show that fixation of nucleotide substitutions is more prominent among the inter-continental isolates (Asian and American) of serotypes 1, 2 and 3 compared to serotype 4 isolates (South and Central America) and are distributed in a non-random manner among the genes encoded by the virus. Nearly one third of the negatively selected sites are associated with fixed mutation sites within serotypes. Our results further show that of all the sites showing evidence of recombination, the majority (~84%) correspond to sites under purifying selection in the four serotypes. The analysis further shows that genetic recombination occurs within specific codons, albeit with low frequency (< 5% of all recombination sites) throughout the DENV genome of the four serotypes and reveals significant enrichment (p < 0.05) among sites under purifying selection in the virus. Conclusion The study provides the first evidence for intracodon recombination in DENV and suggests that within codons, genetic recombination has a significant role in maintaining extensive purifying selection of DENV in natural populations. Our study also suggests that fixation of beneficial mutations may lead to virus evolution via translational selection of specific sites in the DENV genome. PMID:23410119
Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele
2008-01-01
Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated with microphthalmia/anophthalmia in our patient cohort. PMID:18385794
Novel mutations of endothelin-B receptor gene in Pakistani patients with Waardenburg syndrome.
Jabeen, Raheela; Babar, Masroor Ellahi; Ahmad, Jamil; Awan, Ali Raza
2012-01-01
Mutations in EDNRB gene have been reported to cause Waardenburg-Shah syndrome (WS4) in humans. We investigated 17 patients with WS4 for identification of mutations in EDNRB gene using PCR and direct sequencing technique. Four genomic mutations were detected in four patients; a G to C transversion in codon 335 (S335C) in exon 5 and a transition of T to C in codon (S361L) in exon 5, a transition of A to G in codon 277 (L277L) in exon 4, a non coding transversion of T to A at -30 nucleotide position of exon 5. None of these mutations were found in controls. One of the patients harbored two novel mutations (S335C, S361L) in exon 5 and one in Intronic region (-30exon5 A>G). All of the mutations were homozygous and novel except the mutation observed in exon 4. In this study, we have identified 3 novel mutations in EDNRB gene associated with WS4 in Pakistani patients.
Comparative Mitogenomic Analysis of Species Representing Six Subfamilies in the Family Tenebrionidae
Zhang, Hong-Li; Liu, Bing-Bing; Wang, Xiao-Yang; Han, Zhi-Ping; Zhang, Dong-Xu; Su, Cai-Na
2016-01-01
To better understand the architecture and evolution of the mitochondrial genome (mitogenome), mitogenomes of ten specimens representing six subfamilies in Tenebrionidae were selected, and comparative analysis of these mitogenomes was carried out in this study. Ten mitogenomes in this family share a similar gene composition, gene order, nucleotide composition, and codon usage. In addition, our results show that nucleotide bias was strongly influenced by the preference of codon usage for A/T rich codons which significantly correlated with the G + C content of protein coding genes (PCGs). Evolutionary rate analyses reveal that all PCGs have been subjected to a purifying selection, whereas 13 PCGs displayed different evolution rates, among which ATPase subunit 8 (ATP8) showed the highest evolutionary rate. We inferred the secondary structure for all RNA genes of Tenebrio molitor (Te2) and used this as the basis for comparison with the same genes from other Tenebrionidae mitogenomes. Some conserved helices (stems) and loops of RNA structures were found in different domains of ribosomal RNAs (rRNAs) and the cloverleaf structure of transfer RNAs (tRNAs). With regard to the AT-rich region, we analyzed tandem repeat sequences located in this region and identified some essential elements including T stretches, the consensus motif at the flanking regions of T stretch, and the secondary structure formed by the motif at the 3′ end of T stretch in major strand, which are highly conserved in these species. Furthermore, phylogenetic analyses using mitogenomic data strongly support the relationships among six subfamilies: ((Tenebrionidae incertae sedis + (Diaperinae + Tenebrioninae)) + (Pimeliinae + Lagriinae)), which is consistent with phylogenetic results based on morphological traits. PMID:27258256
Therapy for Duchenne muscular dystrophy: renewed optimism from genetic approaches.
Fairclough, Rebecca J; Wood, Matthew J; Davies, Kay E
2013-06-01
Duchenne muscular dystrophy (DMD) is a devastating progressive disease for which there is currently no effective treatment except palliative therapy. There are several promising genetic approaches, including viral delivery of the missing dystrophin gene, read-through of translation stop codons, exon skipping to restore the reading frame and increased expression of the compensatory utrophin gene. The lessons learned from these approaches will be applicable to many other disorders.
Multiple conversion between the genes encoding bacterial class-I release factors
Ishikawa, Sohta A.; Kamikawa, Ryoma; Inagaki, Yuji
2015-01-01
Bacteria require two class-I release factors, RF1 and RF2, that recognize stop codons and promote peptide release from the ribosome. RF1 and RF2 were most likely established through gene duplication followed by altering their stop codon specificities in the common ancestor of extant bacteria. This scenario expects that the two RF gene families have taken independent evolutionary trajectories after the ancestral gene duplication event. However, we here report two independent cases of conversion between RF1 and RF2 genes (RF1-RF2 gene conversion), which were severely examined by procedures incorporating the maximum-likelihood phylogenetic method. In both cases, RF1-RF2 gene conversion was predicted to occur in the region encoding nearly entire domain 3, of which functions are common between RF paralogues. Nevertheless, the ‘direction’ of gene conversion appeared to be opposite from one another—from RF2 gene to RF1 gene in one case, while from RF1 gene to RF2 gene in the other. The two cases of RF1-RF2 gene conversion prompt us to propose two novel aspects in the evolution of bacterial class-I release factors: (i) domain 3 is interchangeable between RF paralogues, and (ii) RF1-RF2 gene conversion have occurred frequently in bacterial genome evolution. PMID:26257102
Truncated ORF1 proteins can suppress LINE-1 retrotransposition in trans
Sokolowski, Mark; Chynces, May; deHaro, Dawn; Christian, Claiborne M.
2017-01-01
Abstract Long interspersed element 1 (L1) is an autonomous non-LTR retroelement that is active in mammalian genomes. Although retrotranspositionally incompetent and functional L1 loci are present in the same genomes, it remains unknown whether non-functional L1s have any trans effect on mobilization of active elements. Using bioinformatic analysis, we identified over a thousand of human L1 loci containing at least one stop codon in their ORF1 sequence. RNAseq analysis confirmed that many of these loci are expressed. We demonstrate that introduction of equivalent stop codons in the full-length human L1 sequence leads to the expression of truncated ORF1 proteins. When supplied in trans some truncated human ORF1 proteins suppress human L1 retrotransposition. This effect requires the N-terminus and coiled-coil domain (C-C) as mutations within the ORF1p C-C domain abolish the suppressive effect of truncated proteins on L1 retrotransposition. We demonstrate that the expression levels and length of truncated ORF1 proteins influence their ability to suppress L1 retrotransposition. Taken together these findings suggest that L1 retrotransposition may be influenced by coexpression of defective L1 loci and that these L1 loci may reduce accumulation of de novo L1 integration events. PMID:28431148
Massive programmed translational jumping in mitochondria
Lang, B. Franz; Jakubkova, Michaela; Hegedusova, Eva; Daoud, Rachid; Forget, Lise; Brejova, Brona; Vinar, Tomas; Kosa, Peter; Fricova, Dominika; Nebohacova, Martina; Griac, Peter; Tomaska, Lubomir; Burger, Gertraud; Nosek, Jozef
2014-01-01
Programmed translational bypassing is a process whereby ribosomes “ignore” a substantial interval of mRNA sequence. Although discovered 25 y ago, the only experimentally confirmed example of this puzzling phenomenon is expression of the bacteriophage T4 gene 60. Bypassing requires translational blockage at a “takeoff codon” immediately upstream of a stop codon followed by a hairpin, which causes peptidyl-tRNA dissociation and reassociation with a matching “landing triplet” 50 nt downstream, where translation resumes. Here, we report 81 translational bypassing elements (byps) in mitochondria of the yeast Magnusiomyces capitatus and demonstrate in three cases, by transcript analysis and proteomics, that byps are retained in mitochondrial mRNAs but not translated. Although mitochondrial byps resemble the bypass sequence in the T4 gene 60, they utilize unused codons instead of stops for translational blockage and have relaxed matching rules for takeoff/landing sites. We detected byp-like sequences also in mtDNAs of several Saccharomycetales, indicating that byps are mobile genetic elements. These byp-like sequences lack bypassing activity and are tolerated when inserted in-frame in variable protein regions. We hypothesize that byp-like elements have the potential to contribute to evolutionary diversification of proteins by adding new domains that allow exploration of new structures and functions. PMID:24711422
[Novel CHST6 compound heterozygous mutations cause macular corneal dystrophy in a Chinese family].
Qi, Yan-hua; Dang, Xiu-hong; Su, Hong; Zhou, Nan; Liang, Ting; Wang, Zheng; Huang, Shang-zhi
2010-02-01
The aim of this study was to identify mutations of CHST6 gene in a Chinese family with macular corneal dystrophy (MCD) and to investigate the histopathological changes of MCD. Corneal button of the proband was obtained from penetrating keratoplasty for the treatment of severe corneal dystrophy. The sections and ultrathin sections of this specimen were examined under light microscope and transmission electron microscope (TEM). Genomic DNA was extracted from leukocytes in peripheral blood from the family members. The coding region of CHST6 was amplified by polymerase chain reaction (PCR). The PCR products were analyzed by direct sequencing and restriction enzyme digestion. Histochemical study revealed positive results of colloidal iron stain. TEM revealed enlargement of smooth endoplasmic reticulum and the presence of intracytoplasmic vacuoles. Two mutations, Q298X Y358H, were identified in exon 3 of CHST6. Three patients were compound heterozygotes of these two mutations. The C892T transversion occurred at codon 298 turned the codon of glutamine to a stop codon; the T1072C transversion occurred at codon 358 caused a missense mutation, tyrosine to histidine. All six unaffected family members were heterozygotes. These two mutations were not detected in any of the 100 control subjects. The novel compound heterozygous mutation results in loss of CHST6 function and causes the occurrence of MCD. This is the first report of this gene mutation.
A genomic scale map of genetic diversity in Trypanosoma cruzi
2012-01-01
Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the population, providing an essential resource for future studies on the development of new drugs and diagnostics, for Chagas Disease. These data is available through the TcSNP database (http://snps.tcruzi.org). PMID:23270511
Cefalù, A B; Barbagallo, C M; Sesti, E; Caldarella, R; Polizzi, F; Marino, G; Noto, D; Rolleri, M; Travali, S; Scalisi, G; Notarbartolo, A; Corsini, A; Bertolini, S; Averna, M R
2001-09-01
Familial defective apolipoprotein (apo) B-100 together with familial hypercholesterolemia are the two common genetic conditions that cause hypercholesterolemia. Familial defective apolipoprotein B-100 is due to mutations around codon 3500 of the apo B gene. The most-characterized mutation is a G>A transition at nucleotide 10,708 that results in the substitution of arginine by glutamine at codon 3500 (Apo B Arg3500Gln). Two other mutations are caused by a C>T transition, one at nucleotide 10,800 (Apo B Arg3531Cys) and the other at nucleotide 10,707 (apo B Arg3500Trp). In the present study we describe three new Italian cases of familial defective apolipoprotein B-100 (Apo B Arg3500Gln), one from the Liguria region and two from Sicily, and the haplotype of the apo B gene co-segregating with the mutation. By screening two groups of probands, clinically diagnosed as having Familial Hypercholesterolemia (700 from mainland Italy and 305 from Sicily), the prevalence of familial defective apolipoprotein B-100 due to Arg3500Gln was found to be very low (0.28% and 0.65%, respectively). The Arg3531Cys mutation was not detected in any proband. In the three new families with Arg3500Gln mutation in the present study and in one previously described in Italy, the mutation was associated with a unique apo B haplotype, which is consistent with data previously reported for Caucasian patients [XbaI-, MspI+, EcoRI-, presence of the 5' signal peptide insertion (Ins) allele, and the 49-repeat allele of the 3'-VNTR].
Characterization of c-Ki-ras and N-ras oncogenes in aflatoxin B sub 1 -induced rat liver tumors
DOE Office of Scientific and Technical Information (OSTI.GOV)
McMahon, G.; Davis, E.F.; Huber, L.J.
c-Ki-ras and N-ras oncogenes have been characterized in aflatoxin B{sub 1}-induced hepatocellular carcinomas. Detection of different protooncogene and oncogene sequences and estimation of their frequency distribution were accomplished by polymerase chain reaction, cloning, and plaque screening methods. Two c-Ki-ras oncogene sequences were identified in DNA from liver tumors that contained nucleotide changes absent in DNA from livers of untreated control rats. Sequence changes involving G{center dot}C to T{center dot}A or G{center dot}C to A{center dot}T nucleotide substitutions in codon 12 were scored in three of eight tumor-bearing animals. Distributions of c-Ki-ras sequences in tumors and normal liver DNA indicated thatmore » the observed nucleotide changes were consistent with those expected to result from direct mutagenesis of the germ-line protooncogene by aflatoxin B{sub 1}. N-ras oncogene sequences were identified in DNA from two of eight tumors. Three N-ras gene regions were identified, one of which was shown to be associated with an oncogene containing a putative activating amino acid residing at codon 13. All three N-ras sequences, including the region detected in N-ras oncogenes, were present at similar frequencies in DNA samples from control livers as well as liver tumors. The presence of a potential germ-line oncogene may be related to the sensitivity of the Fischer rat strain to liver carcinogenesis by aflatoxin B{sub 1} and other chemical carcinogens.« less
Long, Xi-Dai; Ma, Yun; Zhou, Yuan-Feng; Ma, Ai-Min; Fu, Guo-Hui
2010-10-01
Genetic polymorphisms in DNA repair genes may influence individual variations in DNA repair capacity, and this may be associated with the risk and outcome of hepatocellular carcinoma (HCC) related to aflatoxin B1 (AFB1) exposure. In this study, we focused on the polymorphism of xeroderma pigmentosum complementation group C (XPC) codon 939 (rs#2228001), which is involved in nucleotide excision repair. We conducted a case-control study including 1156 HCC cases and 1402 controls without any evidence of hepatic disease to evaluate the associations between this polymorphism and HCC risk and prognosis in the Guangxi population. AFB1 DNA adduct levels, XPC genotypes, and XPC protein levels were tested with a comparative enzyme-linked immunosorbent assay, TaqMan polymerase chain reaction for XPC genotypes, and immunohistochemistry, respectively. Higher AFB1 exposure was observed among HCC patients versus the control group [odds ratio (OR) = 9.88 for AFB1 exposure years and OR = 6.58 for AFB1 exposure levels]. The XPC codon 939 Gln alleles significantly increased HCC risk [OR = 1.25 (95% confidence interval = 1.03-1.52) for heterozygotes of the XPC codon 939 Lys and Gln alleles (XPC-LG) and OR = 1.81 (95% confidence interval = 1.36-2.40) for homozygotes of the XPC codon 939 Gln alleles (XPC-GG)]. Significant interactive effects between genotypes and AFB1 exposure status were also observed in the joint-effects analysis. This polymorphism, moreover, was correlated with XPC expression levels in cancerous tissues (r = -0.369, P < 0.001) and with the overall survival of HCC patients (the median survival times were 30, 25, and 19 months for patients with homozygotes of the XPC codon 939 Lys alleles, XPC-LG, and XPC-GG, respectively), especially under high AFB1 exposure conditions. Like AFB1 exposure, the XPC codon 939 polymorphism was an independent prognostic factor influencing the survival of HCC. Additionally, this polymorphism multiplicatively interacted with the xeroderma pigmentosum complementation group D codon 751 polymorphism with respect to HCC risk (OR(interaction) = 1.71). These results suggest that the XPC codon 939 polymorphism may be associated with the risk and outcome of AFB1-related HCC in the Guangxi population and may interact with AFB1 exposure in the process of HCC induction by AFB1.
Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J
1996-04-30
A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms.
Thiamine-responsive megaloblastic anemia: early diagnosis may be effective in preventing deafness.
Onal, Hasan; Bariş, Safa; Ozdil, Mine; Yeşil, Gözde; Altun, Gürkan; Ozyilmaz, Isa; Aydin, Ahmet; Celkan, Tiraje
2009-01-01
Thiamine-responsive megaloblastic anemia syndrome is an autosomal recessive disorder characterized by diabetes mellitus, megaloblastic anemia and sensorineural hearing loss. Mutations in the SLC19A2 gene, encoding a high-affinity thiamine transporter protein, THTR-1, are responsible for the clinical features associated with thiamine-responsive megaloblastic anemia syndrome in which treatment with pharmacological doses of thiamine correct the megaloblastic anemia and diabetes mellitus. The anemia can recur when thiamine is withdrawn. Thiamine may be effective in preventing deafness if started before two months. Our patient was found homozygous for a mutation, 242insA, in the nucleic acid sequence of exon B, with insertion of an adenine introducing a stop codon at codon 52 in the high-affinity thiamine transporter gene, SLC19A2, on chromosome 1q23.3.
Single-cell analysis of intercellular heteroplasmy of mtDNA in Leber hereditary optic neuropathy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kobayashi, Y.; Sharpe, H.; Brown, N.
1994-07-01
The authors have investigated the distribution of mutant mtDNA molecules in single cells from a patient with Leber hereditary optic neuropathy (LHON). LHON is a maternally inherited disease that is characterized by a sudden-onset bilateral loss of central vision, which typically occurs in early adulthood. More than 50% of all LHON patients carry an mtDNA mutation at nucleotide position 11778. This nucleotide change converts a highly conserved arginine residue to histidine at codon 340 in the NADH-ubiquinone oxidoreductase subunit 4 (ND4) gene of mtDNA. In the present study, the authors used PCR amplification of mtDNA from lymphocytes to investigate mtDNAmore » heteroplasmy at the single-cell level in a LHON patient. They found that most cells were either homoplasmic normal or homoplasmic mutant at nucleotide position 11778. Some (16%) cells contained both mutant and normal mtDNA.« less
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-05-22
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group.
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-01-01
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group. PMID:19471586
The scanning model for translation: an update
1989-01-01
The small (40S) subunit of eukaryotic ribosomes is believed to bind initially at the capped 5'-end of messenger RNA and then migrate, stopping at the first AUG codon in a favorable context for initiating translation. The first-AUG rule is not absolute, but there are rules for breaking the rule. Some anomalous observations that seemed to contradict the scanning mechanism now appear to be artifacts. A few genuine anomalies remain unexplained. PMID:2645293
tRNAs as Therapeutic Agents of Breast Cancer
2013-07-01
their anticodon sequence. Wild-type tRNA reads codon for serine, Suppressor (Sup) tRNA for amber stop, and killer tRNA for isoleucine . Figure 6...endoplasmic reticulum (ER) is a eukaryotic organelle that performs the major functions of synthesizing and packaging pro- teins. Overloading of...anticodons tested in HeLa, tRNASer with the AAU anticodon (tRNASer(AAU)) leads to the substitution of isoleucine with serine within the proteome and
Fujiki, H; Suganuma, M; Yoshizawa, S; Kanazawa, H; Sugimura, T; Manam, S; Kahn, S M; Jiang, W; Hoshina, S; Weinstein, I B
1989-01-01
Three okadaic acid class tumor promoters, okadaic acid, dinophysistoxin-1, and calyculin A, have potent tumor-promoting activity in two-stage carcinogenesis experiments on mouse skin. DNA isolated from tumors induced by 7,12-dimethylbenz[a]anthracene (DMBA) and each of these tumor promoters revealed the same mutation at the second nucleotide of codon 61 (CAA----CTA) in the c-Ha-ras gene, determined by the polymerase chain reaction procedure and DNA sequencing. Three potent 12-O-tetradecanoylphorbol-13-acetate (TPA)-type tumor promoters, TPA, teleocidin, and aplysiatoxin, showed the same effects. These results provide strong evidence that this mutation in the c-Ha-ras gene is due to a direct effect of DMBA rather than a selective effect of specific tumor promoters.
The fuzzy polynucleotide space: basic properties.
Torres, Angela; Nieto, Juan J
2003-03-22
Any triplet codon may be regarded as a 12-dimensional fuzzy code. Sufficient information about a particular sequence may not be available in certain situations. The investigator will be confronted with imprecise sequences, yet want to make comparisons of sequences. Fuzzy polynucleotides can be compared by using geometrical interpretation of fuzzy sets as points in a hypercube. We introduce the space of fuzzy polynucleotides and a means of measuring dissimilitudes between them. We establish mathematical principles to measure dissimilarities between fuzzy polynucleotides and present several examples in this metric space. We calculate the frequencies of the nucleotides at the three base sites of a codon in the coding sequences of Escherichia coli K-12 and Mycobacterium tuberculosis H37Rv, and consider them as points in that fuzzy space. We compute the distance between the genomes of E.coli and M.tuberculosis.
Deciphering mRNA Sequence Determinants of Protein Production Rate
NASA Astrophysics Data System (ADS)
Szavits-Nossan, Juraj; Ciandrini, Luca; Romano, M. Carmen
2018-03-01
One of the greatest challenges in biophysical models of translation is to identify coding sequence features that affect the rate of translation and therefore the overall protein production in the cell. We propose an analytic method to solve a translation model based on the inhomogeneous totally asymmetric simple exclusion process, which allows us to unveil simple design principles of nucleotide sequences determining protein production rates. Our solution shows an excellent agreement when compared to numerical genome-wide simulations of S. cerevisiae transcript sequences and predicts that the first 10 codons, which is the ribosome footprint length on the mRNA, together with the value of the initiation rate, are the main determinants of protein production rate under physiological conditions. Finally, we interpret the obtained analytic results based on the evolutionary role of the codons' choice for regulating translation rates and ribosome densities.
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.
Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F
1984-01-01
The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
Karlgren, Maria; Simoff, Ivailo; Backlund, Maria; Wegler, Christine; Keiser, Markus; Handin, Niklas; Müller, Janett; Lundquist, Patrik; Jareborg, Anne-Christine; Oswald, Stefan; Artursson, Per
2017-09-01
Madin-Darby canine kidney (MDCK) II cells stably transfected with transport proteins are commonly used models for drug transport studies. However, endogenous expression of especially canine MDR1 (cMDR1) confounds the interpretation of such studies. Here we have established an MDCK cell line stably overexpressing the human MDR1 transporter (hMDR1; P-glycoprotein), and used CRISPR-Cas9 gene editing to knockout the endogenous cMDR1. Genomic screening revealed the generation of a clonal cell line homozygous for a 4-nucleotide deletion in the canine ABCB1 gene leading to a frameshift and a premature stop codon. Knockout of cMDR1 expression was verified by quantitative protein analysis and functional studies showing retained activity of the human MDR1 transporter. Application of this cell line allowed unbiased reclassification of drugs previously defined as both substrates and non-substrates in different studies using commonly used MDCK-MDR1 clones. Our new MDCK-hMDR1 cell line, together with a previously developed control cell line, both with identical deletions in the canine ABCB1 gene and lack of cMDR1 expression represent excellent in vitro tools for use in drug discovery. Copyright © 2017 American Pharmacists Association®. Published by Elsevier Inc. All rights reserved.
Speed, Haley E; Kouser, Mehreen; Xuan, Zhong; Reimers, Jeremy M; Ochoa, Christine F; Gupta, Natasha; Liu, Shunan; Powell, Craig M
2015-07-01
SHANK3 (also known as PROSAP2) is a postsynaptic scaffolding protein at excitatory synapses in which mutations and deletions have been implicated in patients with idiopathic autism, Phelan-McDermid (aka 22q13 microdeletion) syndrome, and other neuropsychiatric disorders. In this study, we have created a novel mouse model of human autism caused by the insertion of a single guanine nucleotide into exon 21 (Shank3(G)). The resulting frameshift causes a premature STOP codon and loss of major higher molecular weight Shank3 isoforms at the synapse. Shank3(G/G) mice exhibit deficits in hippocampus-dependent spatial learning, impaired motor coordination, altered response to novelty, and sensory processing deficits. At the cellular level, Shank3(G/G) mice also exhibit impaired hippocampal excitatory transmission and plasticity as well as changes in baseline NMDA receptor-mediated synaptic responses. This work identifies clear alterations in synaptic function and behavior in a novel, genetically accurate mouse model of autism mimicking an autism-associated insertion mutation. Furthermore, these findings lay the foundation for future studies aimed to validate and study region-selective and temporally selective genetic reversal studies in the Shank3(G/G) mouse that was engineered with such future experiments in mind. Copyright © 2015 the authors 0270-6474/15/359648-18$15.00/0.
Brunelle, Marie-Noëlle; Brakier-Gingras, Léa; Lemay, Guy
2003-01-01
Retroviruses use unusual recoding strategies to synthesize the Gag-Pol polyprotein precursor of viral enzymes. In human immunodeficiency virus, ribosomes translating full-length viral RNA can shift back by 1 nucleotide at a specific site defined by the presence of both a slippery sequence and a downstream stimulatory element made of an extensive secondary structure. This so-called frameshift mechanism could become a target for the development of novel antiviral strategies. A different recoding strategy is used by other retroviruses, such as murine leukemia viruses, to synthesize the Gag-Pol precursor; in this case, a stop codon is suppressed in a readthrough process, again due to the presence of a specific structure adopted by the mRNA. Development of antiframeshift agents will greatly benefit from the availability of a simple animal and virus model. For this purpose, the murine leukemia virus readthrough region was rendered inactive by mutagenesis and the frameshift region of human immunodeficiency virus was inserted to generate a chimeric provirus. This substitution of readthrough by frameshift allows the synthesis of viral proteins, and the chimeric provirus sequence was found to generate infectious viruses. This system could be a most interesting alternative to study ribosomal frameshift in the context of a virus amenable to the use of a simple animal model. PMID:12584361
A Novel Nonsense Mutation in Exon 5 of KIND1 Gene in an Iranian Family with Kindler Syndrome.
Heidari, Mohammad Mehdi; Khatami, Mehri; Kargar, Saeed; Azari, Mojdeh; Hoseinzadeh, Hassan; Fallah, Hamedeh
2016-06-01
Kindler syndrome (KS) is an autosomal recessive skin disease characterized by actual blistering, photosensitivity and a progressive poikiloderma. The disorder results from rare mutations in the KIND1 gene. This gene contains 15 exons and expresses two kindlin-1 isoforms. The aim of this investigation was to analyze mutations in the exons 1 to 15 of KIND1 gene in an Iranian family clinically affected with Kindler syndrome. The mutations analysis of 15 coding exons of KIND1 gene was performed with PCR-SSCP and direct sequencing in 14 subjects from one Iranian family clinically affected with Kindler syndrome. We identified eight new nucleotide changes in KIND1 in this family. These changes were found in g.3892delA, g.3951T>C, g.3962T>G, g.4190G>T, g.7497G>A, g.11076T>C, g.11102C>T and g.13177C>T positions. Among them, the g.13177C>T mutation resulting in the formation of a premature stop codon (Q226X) was detected only in seven affected family individuals as homozygous but was not present in 100 unrelated healthy controls. This study suggests that nonsense mutation may lead to incomplete and non-functional protein products and is pathogenic and has meaningful implications for the diagnosis of patients with Kindler syndrome.
Behnke, Michael S; Khan, Asis; Sibley, L David
2015-02-01
Quantitative trait locus (QTL) mapping studies have been integral in identifying and understanding virulence mechanisms in the parasite Toxoplasma gondii. In this study, we interrogated a different phenotype by mapping sinefungin (SNF) resistance in the genetic cross between type 2 ME49-FUDR(r) and type 10 VAND-SNF(r). The genetic map of this cross was generated by whole-genome sequencing of the progeny and subsequent identification of single nucleotide polymorphisms (SNPs) inherited from the parents. Based on this high-density genetic map, we were able to pinpoint the sinefungin resistance phenotype to one significant locus on chromosome IX. Within this locus, a single nonsynonymous SNP (nsSNP) resulting in an early stop codon in the TGVAND_290860 gene was identified, occurring only in the sinefungin-resistant progeny. Using CRISPR/CAS9, we were able to confirm that targeted disruption of TGVAND_290860 renders parasites sinefungin resistant. Because disruption of the SNR1 gene confers resistance, we also show that it can be used as a negative selectable marker to insert either a positive drug selection cassette or a heterologous reporter. These data demonstrate the power of combining classical genetic mapping, whole-genome sequencing, and CRISPR-mediated gene disruption for combined forward and reverse genetic strategies in T. gondii. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Molecular genetic analysis of macular corneal dystrophy patients from North India.
Paliwal, Preeti; Sharma, Arundhati; Tandon, Radhika; Sharma, Namrata; Titiyal, Jeevan S; Sen, Seema; Vajpayee, Rasik B
2012-01-01
To identify underlying genetic defects in the carbohydrate sulfotransferase-6 (CHST6) gene in North Indian patients with macular corneal dystrophy (MCD). 30 clinically diagnosed MCD patients from 21 families and 50 healthy normal controls were recruited in the study. Detailed clinical evaluation in the patients was undertaken followed by histopathology and ultrastructural studies in corneal tissues. DNA from blood samples was amplified for the CHST6 coding and upstream region followed by direct sequencing and in silico analysis. We identified pathogenic mutations in 17 patients from 11 families. Of these 4 were novel (p.Ser54Tyr, p.Gln58Arg, p.Leu59His and p.Leu293Phe), 2 were previously reported (Arg93His and Glu274Lys) homozygous, 1 heterozygous stop codon (p.Trp123X) and 2 compound heterozygous (p.Arg93His + p.Arg97Pro; p.Leu22Arg + p.Gln58X) mutations. A missense single-nucleotide polymorphism was also identified in 11 patients. The novel mutations were conserved as shown by in silico analysis. Thirteen patients did not show any pathogenic CHST6 changes. This is the first report on molecular analysis of MCD in North Indian patients. All cases could not be explained by mutations in CHST6, suggesting that MCD may result from other changes in the regulatory elements of CHST6 or from genetic heterogeneity. Copyright © 2012 S. Karger AG, Basel.
Identification of Candidate Gene Variants in Korean MODY Families by Whole-Exome Sequencing.
Shim, Ye Jee; Kim, Jung Eun; Hwang, Su-Kyeong; Choi, Bong Seok; Choi, Byung Ho; Cho, Eun-Mi; Jang, Kyoung Mi; Ko, Cheol Woo
2015-01-01
To date, 13 genes causing maturity-onset diabetes of the young (MODY) have been identified. However, there is a big discrepancy in the genetic locus between Asian and Caucasian patients with MODY. Thus, we conducted whole-exome sequencing in Korean MODY families to identify causative gene variants. Six MODY probands and their family members were included. Variants in the dbSNP135 and TIARA databases for Koreans and the variants with minor allele frequencies >0.5% of the 1000 Genomes database were excluded. We selected only the functional variants (gain of stop codon, frameshifts and nonsynonymous single-nucleotide variants) and conducted a case-control comparison in the family members. The selected variants were scanned for the previously introduced gene set implicated in glucose metabolism. Three variants c.620C>T:p.Thr207Ile in PTPRD, c.559C>G:p.Gln187Glu in SYT9, and c.1526T>G:p.Val509Gly in WFS1 were respectively identified in 3 families. We could not find any disease-causative alleles of known MODY 1-13 genes. Based on the predictive program, Thr207Ile in PTPRD was considered pathogenic. Whole-exome sequencing is a valuable method for the genetic diagnosis of MODY. Further evaluation is necessary about the role of PTPRD, SYT9 and WFS1 in normal insulin release from pancreatic beta cells. © 2015 S. Karger AG, Basel.
Genetic features of Mycobacterium tuberculosis modern Beijing sublineage
Liu, Qingyun; Luo, Tao; Dong, Xinran; Sun, Gang; Liu, Zhu; Gan, Mingyun; Wu, Jie; Shen, Xin; Gao, Qian
2016-01-01
Mycobacterium tuberculosis (MTB) Beijing strains have caused a great concern because of their rapid emergence and increasing prevalence in worldwide regions. Great efforts have been made to investigate the pathogenic characteristics of Beijing strains such as hypervirulence, drug resistance and favoring transmission. Phylogenetically, MTB Beijing family was divided into modern and ancient sublineages. Modern Beijing strains displayed enhanced virulence and higher prevalence when compared with ancient Beijing strains, but the genetic basis for this difference remains unclear. In this study, by analyzing previously published sequencing data of 1082 MTB Beijing isolates, we determined the genetic changes that were commonly present in modern Beijing strains but absent in ancient Beijing strains. These changes include 44 single-nucleotide polymorphisms (SNPs) and two short genomic deletions. Through bioinformatics analysis, we demonstrated that these genetic changes had high probability of functional effects. For example, 4 genes were frameshifted due to premature stop mutation or genomic deletions, 19 nonsynonymous SNPs located in conservative codons, and there is a significant enrichment in regulatory network for all nonsynonymous mutations. Besides, three SNPs located in promoter regions were verified to alter downstream gene expressions. Our study precisely defined the genetic features of modern Beijing strains and provided interesting clues for future researches to elucidate the mechanisms that underlie this sublineage's successful expansion. These findings from the analysis of the modern Beijing sublineage could provide us a model to understand the dynamics of pathogenicity of MTB. PMID:26905026
Perrotta, Silverio; Cucciolla, Valeria; Ferraro, Marcella; Ronzoni, Luisa; Tramontano, Annunziata; Rossi, Francesca; Scudieri, Anna Chiara; Borriello, Adriana; Roberti, Domenico; Nobili, Bruno; Cappellini, Maria Domenica; Oliva, Adriana; Amendola, Giovanni; Migliaccio, Anna Rita; Mancuso, Patrizia; Martin-Padura, Ines; Bertolini, Francesco; Yoon, Donghoon; Prchal, Josef T.; Della Ragione, Fulvio
2010-01-01
Background Gain-of-function of erythropoietin receptor (EPOR) mutations represent the major cause of primary hereditary polycythemia. EPOR is also found in non-erythroid tissues, although its physiological role is still undefined. Methodology/Principal Findings We describe a family with polycythemia due to a heterozygous mutation of the EPOR gene that causes a G→T change at nucleotide 1251 of exon 8. The novel EPOR G1251T mutation results in the replacement of a glutamate residue by a stop codon at amino acid 393. Differently from polycythemia vera, EPOR G1251T CD34+ cells proliferate and differentiate towards the erythroid phenotype in the presence of minimal amounts of EPO. Moreover, the affected individuals show a 20-fold increase of circulating endothelial precursors. The analysis of erythroid precursor membranes demonstrates a heretofore undescribed accumulation of the truncated EPOR, probably due to the absence of residues involved in the EPO-dependent receptor internalization and degradation. Mutated receptor expression in EPOR-negative cells results in EPOR and Stat5 phosphorylation. Moreover, patient erythroid precursors present an increased activation of EPOR and its effectors, including Stat5 and Erk1/2 pathway. Conclusions/Significance Our data provide an unanticipated mechanism for autosomal dominant inherited polycythemia due to a heterozygous EPOR mutation and suggest a regulatory role of EPO/EPOR pathway in human circulating endothelial precursors homeostasis. PMID:20700488
López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel
2017-02-01
We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin
2016-07-01
The mitochondrial genome sequence of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length with partial A + T-rich region of this mitochondrial genome is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the pupative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of tnnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven, one, and another one protein-coding genes stop with termination codon TAA, TA, and T, respectively. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
Hunt, C; Morimoto, R I
1985-01-01
We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
1996-01-01
An increasing amount of evidence has shown that epitopes restricted to MHC class I molecules and recognized by CTL need not be encoded in a primary open reading frame (ORF). Such epitopes have been demonstrated after stop codons, in alternative reading frames (RF) and within introns. We have used a series of frameshifts (FS) introduced into the Influenza A/PR/8 /34 nucleoprotein (NP) gene to confirm the previous in vitro observations of cryptic epitope expression, and show that they are sufficiently expressed to prime immune responses in vivo. This presentation is not due to sub-dominant epitopes, transcription from cryptic promoters beyond the point of the FS, or internal initiation of translation. By introducing additional mutations to the construct exhibiting the most potent presentation, we have identified initiation codon readthrough (termed scanthrough here, where the scanning ribosome bypasses the conventional initiation codon, initiating translation further downstream) as the likely mechanism of epitope production. Further mutational analysis demonstrated that, while it should operate during the expression of wild-type (WT) protein, scanthrough does not provide a major source of processing substrate in our system. These findings suggest (i) that the full array of self- and pathogen-derived epitopes available during thymic selection and infection has not been fully appreciated and (ii) that cryptic epitope expression should be considered when the specificity of a CTL response cannot be identified or in therapeutic situations when conventional CTL targets are limited, as may be the case with latent viral infections and transformed cells. Finally, initiation codon readthrough provides a plausible explanation for the presentation of exocytic proteins by MHC class I molecules. PMID:8879204
The Quantum Workings of the Rotating 64-Grid Genetic Code
Castro-Chavez, Fernando
2011-01-01
In this article, the pattern learned from the classic or conventional rotating circular genetic code is transferred to a 64-grid model. In this non-static representation, the codons for the same amino acid within each quadrant could be exchanged, wobbling or rotating in a quantic way similar to the electrons within an atomic orbit. Represented in this 64-grid format are the three rules of variation encompassing 4, 2, or 1 quadrant, respectively: 1) same position in four quadrants for the essential hydrophobic amino acids that have U at the center, 2) same or contiguous position for the same or related amino acids in two quadrants, and 3) equivalent amino acids within one quadrant. Also represented is the mathematical balance of the odd and even codons, and the most used codons per amino acid in humans compared to one diametrically opposed organism: the plant Arabidopsis thaliana, a comparison that depicts the difference in third nucleotide preferences: a C/U exchange for 11 amino acids, a G/A and a G/U exchange for 2 amino acids, respectively, and a C/A exchange for one amino acid; by studying these codon usage preferences per amino acid we present our two hypotheses: 1) A slower translation in vertebrates and 2) a faster translation in invertebrates, possibly due to the aqueous environments where they live. These codon usage preferences may also be able to determine genomic compatibility by comparing individual mRNAs and their functional third dimensional structure, transport and translation within cells and organisms. These observations are aimed to the design of bioinformatics computational tools to compare human genomes and to determine the exchange between compatible codons and amino acids, to preserve and/or to bring back extinct biodiversity, and for the early detection of incompatible changes that lead to genetic diseases. PMID:22308074
Structure of the c-Ki-ras gene in a rat fibrosarcoma induced by 1,8-dinitropyrene.
Tahira, T; Hayashi, K; Ochiai, M; Tsuchida, N; Nagao, M; Sugimura, T
1986-01-01
Restriction enzyme maps were made of the region around exons 1 and 2 of activated c-Ki-ras of a fibrosarcoma (1,8-DNP2) induced in a rat by 1,8-dinitropyrene. Nucleotide sequence analysis revealed that activated c-Ki-ras shows a G----T transversion in codon 12 and consequently encodes cysteine instead of glycine in normal rat c-Ki-ras. PMID:3023884
Lasota, Jerzy; Felisiak-Golabek, Anna; Aly, F Zahra; Wang, Zeng-Feng; Thompson, Lester D R; Miettinen, Markku
2015-05-01
Glomangiopericytoma (sinonasal-type hemangiopericytoma) is a rare mesenchymal neoplasm with myoid phenotype (smooth muscle actin-positive), which distinguishes this tumor from soft tissue hemangiopericytoma/solitary fibrous tumor. Molecular genetic changes underlying the pathogenesis of glomangiopericytoma are not known. In this study, 13 well-characterized glomangiopericytomas were immunohistochemically evaluated for β-catenin expression. All analyzed tumors showed strong expression and nuclear accumulation of β-catenin. Following this observation, β-catenin glycogen serine kinase-3 beta phosphorylation region, encoded by exon 3, was PCR amplified in all cases and evaluated for mutations using Sanger sequencing. Heterozygous mutations were identified in 12 of 13 tumors. All mutations consisted of single-nucleotide substitutions: three in codon 32 (c.94G>C (n=2) and c.95A>T), four in codon 33 (two each c.98C>G and c.98C>T), two in codon 37 (c.109T>G), one in codon 41 (c.121A>G), and two in codon 45 (c.133T>C). At the protein level, these substitutions would lead to p.D32H, p.D32V, p.S33C, p.S33F, p.S37A, p.T41A, and p.S45L mutations, respectively. Previously, similar mutations have been reported in different types of cancers and shown to trigger activation of β-catenin signaling. All analyzed glomangiopericytomas showed prominent nuclear expression of cyclin D1, as previously shown for tumors with nuclear expression of β-catenin as a sign of oncogenic activation. These results demonstrate that mutational activation of β-catenin and associated cyclin D1 overexpression may be central events in the pathogenesis of glomangiopericytoma. In additon, nuclear accumulation of β-catenin is a diagnostic marker for glomangiopericytoma.
2012-01-01
Background Malaria is still a public health problem in Malaysia with chloroquine (CQ) being the first-line drug in the treatment policy of uncomplicated malaria. There is a scarcity in information about the magnitude of Plasmodium falciparum CQ resistance. This study aims to investigate the presence of single point mutations in the P. falciparum chloroquine-resistance transporter gene (pfcrt) at codons 76, 271, 326, 356 and 371 and in P. falciparum multi-drug resistance-1 gene (pfmdr1) at codons 86 and 1246, as molecular markers of CQ resistance. Methods A total of 75 P. falciparum blood samples were collected from different districts of Pahang state, Malaysia. Single nucleotide polymorphisms in pfcrt gene (codons 76, 271, 326, 356 and 371) and pfmdr1 gene (codons 86 and 1246) were analysed by using mutation-specific nested PCR and restriction fragment length polymorphism (PCR-RFLP) methods. Results Mutations of pfcrt K76T and pfcrt R371I were the most prevalent among pfcrt gene mutations reported by this study; 52% and 77%, respectively. Other codons of the pfcrt gene and the positions 86 and 1246 of the pfmdr1 gene were found mostly of wild type. Significant associations of pfcrt K76T, pfcrt N326S and pfcrt I356T mutations with parasitaemia were also reported. Conclusion The high existence of mutant pfcrt T76 may indicate the low susceptibility of P. falciparum isolates to CQ in Peninsular Malaysia. The findings of this study establish baseline data on the molecular markers of P. falciparum CQ resistance, which may help in the surveillance of drug resistance in Peninsular Malaysia. PMID:22853645
Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J
1996-01-01
A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms. Images Fig. 1 Fig. 2 PMID:8633004
Congenital deficiency of alpha feto-protein.
Sharony, Reuven; Zadik, Idit; Parvari, Ruti
2004-10-01
Alpha-fetoprotein (AFP) is the main fetus serum glycoprotein with a very low concentration in the adult. AFP deficiency is a rare phenomenon. We studied two families with congenital AFP deficiency and searched for mutations in the AFP gene. We identified one mutation of 2 base deletion in exon 8, in both families, that leads to the congenital deficiency of AFP. The mutation nt930-931delCT (T294fs25X) creates a frameshift after codon 294 that leads to a stop codon after 24 amino acids, thus truncating the normal length of AFP of 609 amino acids. All the affected children were found to be homozygous for the mutation as was one of the fathers. The affected individuals were asymptomatic and presented normal development. This first identification of a mutation in the AFP gene demonstrates for the first time that deficiency of AFP is compatible with human normal fetal development and further reproduction in males.
Complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis.
Feutry, Pierre; Kyne, Peter M; Peng, Zaiqing; Pan, Lianghao; Chen, Xiao
2016-05-01
The complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis is presented in this study. It is 17,693 bp in length and contains 37 genes in typical gene order and transcriptional orientation observed in vertebrates. There were a total of 86 bp short intergenic spacers and 22 bp overlaps in the genome. The overall base composition was 31.4% A, 25.5% C, 13.2% G and 29.9% T. Two start codons (GTG and ATG) and two stop codons (TAG and TAA/T) were found in 13 protein-coding genes. The length of 22 tRNA genes ranged from 68 (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1). The origin of L-strand replication (OL) was found between the tRNA-Asn and tRNA-Cys genes. The base composition of the control region (1940 bp) was similar to the whole mitogenome.
Dubey, Bhawna; Meganathan, P R; Haque, Ikramul
2012-07-01
This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Machlin, S.M.; Hanson, R.S.
The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one stand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in lengthmore » and therefore appeared to encode only MDH.« less
Saito, Yuki; Mao, Han; Sekimizu, Kazuhisa; Kaito, Chikara
2014-01-01
Staphylococcal species acquire antibiotic resistance by incorporating the mobile-genetic element SCCmec. We previously found that SCCmec-encoded psm-mec RNA suppresses exotoxin production as a regulatory RNA, and the psm-mec translation product increases biofilm formation in Staphylococcus aureus. Here, we examined whether the regulatory role of psm-mec on host bacterial virulence properties is conserved among other staphylococcal species, S. epidermidis and S. haemolyticus, both of which are important causes of nosocomial infections. In S. epidermidis, introduction of psm-mec decreased the production of cytolytic toxins called phenol-soluble modulins (PSMs) and increased biofilm formation. Introduction of psm-mec with a stop-codon mutation that did not express PSM-mec protein but did express psm-mec RNA also decreased PSM production, but did not increase biofilm formation. Thus, the psm-mec RNA inhibits PSM production, whereas the PSM-mec protein increases biofilm formation in S. epidermidis. In S. haemolyticus, introduction of psm-mec decreased PSM production, but did not affect biofilm formation. The mutated psm-mec with a stop-codon also caused the same effect. Thus, the psm-mec RNA also inhibits PSM production in S. haemolyticus. These findings suggest that the inhibitory role of psm-mec RNA on exotoxin production is conserved among staphylococcal species, although the stimulating effect of the psm-mec gene on biofilm formation is not conserved. PMID:24926994
Constructing high complexity synthetic libraries of long ORFs using in vitro selection
NASA Technical Reports Server (NTRS)
Cho, G.; Keefe, A. D.; Liu, R.; Wilson, D. S.; Szostak, J. W.
2000-01-01
We present a method that can significantly increase the complexity of protein libraries used for in vitro or in vivo protein selection experiments. Protein libraries are often encoded by chemically synthesized DNA, in which part of the open reading frame is randomized. There are, however, major obstacles associated with the chemical synthesis of long open reading frames, especially those containing random segments. Insertions and deletions that occur during chemical synthesis cause frameshifts, and stop codons in the random region will cause premature termination. These problems can together greatly reduce the number of full-length synthetic genes in the library. We describe a strategy in which smaller segments of the synthetic open reading frame are selected in vitro using mRNA display for the absence of frameshifts and stop codons. These smaller segments are then ligated together to form combinatorial libraries of long uninterrupted open reading frames. This process can increase the number of full-length open reading frames in libraries by up to two orders of magnitude, resulting in protein libraries with complexities of greater than 10(13). We have used this methodology to generate three types of displayed protein library: a completely random sequence library, a library of concatemerized oligopeptide cassettes with a propensity for forming amphipathic alpha-helical or beta-strand structures, and a library based on one of the most common enzymatic scaffolds, the alpha/beta (TIM) barrel. Copyright 2000 Academic Press.
Mitrovich, Quinn M.; Anderson, Philip
2000-01-01
Messenger RNA surveillance, the selective and rapid degradation of mRNAs containing premature stop codons, occurs in all eukaryotes tested. The biological role of this decay pathway, however, is not well understood. To identify natural substrates of mRNA surveillance, we used a cDNA-based representational difference analysis to identify mRNAs whose abundance increases in Caenorhabditis elegans smg(−) mutants, which are deficient for mRNA surveillance. Alternatively spliced mRNAs of genes encoding ribosomal proteins L3, L7a, L10a, and L12 are abundant natural targets of mRNA surveillance. Each of these genes expresses two distinct mRNAs. A productively spliced mRNA, whose abundance does not change in smg(−) mutants, encodes a normal, full-length, ribosomal protein. An unproductively spliced mRNA, whose abundance increases dramatically in smg(−) mutants, contains premature stop codons because of incomplete removal of an alternatively spliced intron. In transgenic animals expressing elevated quantities of RPL-12, a greater proportion of endogenous rpl-12 transcript is spliced unproductively. Thus, RPL-12 appears to autoregulate its own splicing, with unproductively spliced mRNAs being degraded by mRNA surveillance. We demonstrate further that alternative splicing of rpl introns is conserved among widely diverged nematodes. Our results suggest that one important role of mRNA surveillance is to eliminate unproductive by-products of gene regulation. PMID:10970881
Kuschal, Christiane; Khan, Sikandar G; Enk, Benedikt; DiGiovanna, John J; Kraemer, Kenneth H
2015-04-01
Readthrough of premature termination (stop) codons (PTC) is a new approach to treatment of genetic diseases. We recently reported that readthrough of PTC in cells from some xeroderma pigmentosum complementation group C (XP-C) patients could be achieved with the aminoglycosides geneticin or gentamicin. We found that the response depended on several factors including the PTC sequence, its location within the gene and the aminoglycoside used. Here, we extended these studies to investigate the effects of other aminoglycosides that are already on the market. We reasoned that topical treatment could deliver much higher concentrations of drug to the skin, the therapeutic target, and thus increase the therapeutic effect while reducing renal or ototoxicity in comparison with systemic treatment. Our prior clinical studies indicated that only a few percent of normal XPC expression was associated with mild clinical disease. We found minimal cell toxicity in the XP-C cells with several aminoglycosides. We found increased XPC mRNA expression in PTC-containing XP-C cells with G418, paromomycin, neomycin and kanamycin and increased XPC protein expression with G418. We conclude that in selected patients with XP, topical PTC therapy can be investigated as a method of personalized medicine to alleviate their cutaneous symptoms. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Pseudogenization of the tooth gene enamelysin (MMP20) in the common ancestor of extant baleen whales
Meredith, Robert W.; Gatesy, John; Cheng, Joyce; Springer, Mark S.
2011-01-01
Whales in the suborder Mysticeti are filter feeders that use baleen to sift zooplankton and small fish from ocean waters. Adult mysticetes lack teeth, although tooth buds are present in foetal stages. Cladistic analyses suggest that functional teeth were lost in the common ancestor of crown-group Mysticeti. DNA sequences for the tooth-specific genes, ameloblastin (AMBN), enamelin (ENAM) and amelogenin (AMEL), have frameshift mutations and/or stop codons in this taxon, but none of these molecular cavities are shared by all extant mysticetes. Here, we provide the first evidence for pseudogenization of a tooth gene, enamelysin (MMP20), in the common ancestor of living baleen whales. Specifically, pseudogenization resulted from the insertion of a CHR-2 SINE retroposon in exon 2 of MMP20. Genomic and palaeontological data now provide congruent support for the loss of enamel-capped teeth on the common ancestral branch of crown-group mysticetes. The new data for MMP20 also document a polymorphic stop codon in exon 2 of the pygmy sperm whale (Kogia breviceps), which has enamel-less teeth. These results, in conjunction with the evidence for pseudogenization of MMP20 in Hoffmann's two-toed sloth (Choloepus hoffmanni), another enamel-less species, support the hypothesis that the only unique, non-overlapping function of the MMP20 gene is in enamel formation. PMID:20861053
Menzies, Georgina E.; Reed, Simon H.; Brancale, Andrea; Lewis, Paul D.
2015-01-01
The mutational pattern for the TP53 tumour suppressor gene in lung tumours differs to other cancer types by having a higher frequency of G:C>T:A transversions. The aetiology of this differing mutation pattern is still unknown. Benzo[a]pyrene,diol epoxide (BPDE) is a potent cigarette smoke carcinogen that forms guanine adducts at TP53 CpG mutation hotspot sites including codons 157, 158, 245, 248 and 273. We performed molecular modelling of BPDE-adducted TP53 duplex sequences to determine the degree of local distortion caused by adducts which could influence the ability of nucleotide excision repair. We show that BPDE adducted codon 157 has greater structural distortion than other TP53 G:C>T:A hotspot sites and that sequence context more distal to adjacent bases must influence local distortion. Using TP53 trinucleotide mutation signatures for lung cancer in smokers and non-smokers we further show that codons 157 and 273 have the highest mutation probability in smokers. Combining this information with adduct structural data we predict that G:C>T:A mutations at codon 157 in lung tumours of smokers are predominantly caused by BPDE. Our results provide insight into how different DNA sequence contexts show variability in DNA distortion at mutagen adduct sites that could compromise DNA repair at well characterized cancer related mutation hotspots. PMID:26400171
Kjær, Jonas; Belsham, Graham J
2018-01-01
Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Codon-Anticodon Recognition in the Bacillus subtilis glyQS T Box Riboswitch
Caserta, Enrico; Liu, Liang-Chun; Grundy, Frank J.; Henkin, Tina M.
2015-01-01
Many amino acid-related genes in Gram-positive bacteria are regulated by the T box riboswitch. The leader RNA of genes in the T box family controls the expression of downstream genes by monitoring the aminoacylation status of the cognate tRNA. Previous studies identified a three-nucleotide codon, termed the “Specifier Sequence,” in the riboswitch that corresponds to the amino acid identity of the downstream genes. Pairing of the Specifier Sequence with the anticodon of the cognate tRNA is the primary determinant of specific tRNA recognition. This interaction mimics codon-anticodon pairing in translation but occurs in the absence of the ribosome. The goal of the current study was to determine the effect of a full range of mismatches for comparison with codon recognition in translation. Mutations were individually introduced into the Specifier Sequence of the glyQS leader RNA and tRNAGly anticodon to test the effect of all possible pairing combinations on tRNA binding affinity and antitermination efficiency. The functional role of the conserved purine 3′ of the Specifier Sequence was also verifiedin this study. We found that substitutions at the Specifier Sequence resulted in reduced binding, the magnitude of which correlates well with the predicted stability of the RNA-RNA pairing. However, the tolerance for specific mismatches in antitermination was generally different from that during decoding, which reveals a unique tRNA recognition pattern in the T box antitermination system. PMID:26229106
3-base periodicity in coding DNA is affected by intercodon dinucleotides
Sánchez, Joaquín
2011-01-01
All coding DNAs exhibit 3-base periodicity (TBP), which may be defined as the tendency of nucleotides and higher order n-tuples, e.g. trinucleotides (triplets), to be preferentially spaced by 3, 6, 9 etc, bases, and we have proposed an association between TBP and clustering of same-phase triplets. We here investigated if TBP was affected by intercodon dinucleotide tendencies and whether clustering of same-phase triplets was involved. Under constant protein sequence intercodon dinucleotide frequencies depend on the distribution of synonymous codons. So, possible effects were revealed by randomly exchanging synonymous codons without altering protein sequences to subsequently document changes in TBP via frequency distribution of distances (FDD) of DNA triplets. A tripartite positive correlation was found between intercodon dinucleotide frequencies, clustering of same-phase triplets and TBP. So, intercodon C|A (where “|” indicates the boundary between codons) was more frequent in native human DNA than in the codon-shuffled sequences; higher C|A frequency occurred along with more frequent clustering of C|AN triplets (where N jointly represents A, C, G and T) and with intense CAN TBP. The opposite was found for C|G, which was less frequent in native than in shuffled sequences; lower C|G frequency occurred together with reduced clustering of C|GN triplets and with less intense CGN TBP. We hence propose that intercodon dinucleotides affect TBP via same-phase triplet clustering. A possible biological relevance of our findings is briefly discussed. PMID:21814388
Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates
Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas
2015-01-01
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834
Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong
2007-08-01
The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Gao, Zhaowei; Li, Zhuofu; Zhang, Yuhong; Huang, Huoqing; Li, Mu; Zhou, Liwei; Tang, Yunming; Yao, Bin; Zhang, Wei
2012-03-01
The glucose oxidase (GOD) gene from Penicillium notatum was expressed in Pichia pastoris. The 1,815 bp gene, god-w, encodes 604 amino acids. Recombinant GOD-w had optimal activity at 35-40°C and pH 6.2 and was stable, from pH 3 to 7 maintaining >75% maximum activity after incubation at 50°C for 1 h. GOD-w worked as well as commercial GODs to improve bread making. To achieve high-level expression of recombinant GOD in P. pastoris, 272 nucleotides involving 228 residues were mutated, consistent with the codon bias of P. pastoris. The optimized recombinant GOD-m yielded 615 U ml(-1) (2.5 g protein l(-1)) in a 3 l fermentor--410% higher than GOD-w (148 U ml(-1)), and thus is a low-cost alternative for the bread baking industry.
Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro
1985-01-01
The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Sugita, M; Shinozaki, K; Sugiura, M
1985-06-01
The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Young, Gregory J; Zhang, Shiping; Mirsky, Henry P; Cressman, Robert F; Cong, Bin; Ladics, Gregory S; Zhong, Cathy X
2012-10-01
Before a genetically modified (GM) crop can be commercialized it must pass through a rigorous regulatory process to verify that it is safe for human and animal consumption, and to the environment. One particular area of focus is the potential introduction of a known or cross-reactive allergen not previously present within the crop. The assessment of possible allergenicity uses the guidelines outlined by the Food and Agriculture Organization (FAO) and World Health Organization's (WHO) Codex Alimentarius Commission (Codex) to evaluate all newly expressed proteins. Some regulatory authorities have broadened the scope of the assessment to include all DNA reading frames between stop codons across the insert and spanning the insert/genomic DNA junctions. To investigate the utility of this bioinformatic assessment, all naturally occurring stop-to-stop frames in the non-transgenic genomes of maize, rice, and soybean, as well as the human genome, were compared against the AllergenOnline (www.allergenonline.org) database using the Codex criteria. We discovered thousands of frames that exceeded the Codex defined threshold for potential cross-reactivity suggesting that evaluating hypothetical ORFs (stop-to-stop frames) has questionable value for making decisions on the safety of GM crops. Copyright © 2012 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leighton, J.K.; Joyner, J.; Zamarripa, J.
Two different molecular weight forms of apoB are produced from a common initial transcript via editing of a Gln codon (CAA) to a stop codon (UAA), leading to a truncated translation product (apo BS) that consists of the amino terminal half of the larger form (apoBL). Previous studies have shown that fasting coordinately decreases lipogenesis and the secretion of very low density lipoprotein (VLDL) lipids and apoBS. Secretion of the apoBL is unaffected by fasting. We studied whether editing of apoB RNA is repressed by fasting, thus accounting for the selective decreased secretion of apoBS. Column chromatography of (35S)methionine-labeled lipoproteinsmore » secreted by hepatocytes from fed rats showed that essentially all of apoBL is secreted in the VLDL fraction, whereas a significant amount (15%) of apoBS is secreted associated as lipoproteins eluting in the HDL fractions. Fasting decreased the relative amount of apoBS that eluted in the VLDL fractions and increased the amount secreted in the HDL fractions. Consistent with previous results, hepatocytes from fasted rats show a selective twofold decrease in apoBS secretion. Fasting did not affect the relative abundance of apoB RNA, determined by slot blot hybridization assays using two different 32P-labeled cDNA probes coding either for both molecular weight forms or for only the large molecular weight form. However, quantitative of the editing of apoB RNA showed that fasting caused a 60% decrease in the amount of apoB RNA possessing the stop codon. These data show that the editing of apoB RNA is sensitive to metabolic state (i.e., fasting) resulting in a selective decrease in the secretion of apoBS. However, since the total secretion of apoB was decreased by fasting, while apoB mRNA levels remained constant, additional (post-transcriptional) mechanisms play a role in regulating apoB secretion.« less
Dolores, Jazel; Satchell, Karla J. F.
2013-01-01
ABSTRACT Vibrio cholerae genome sequences were analyzed for variation in the rtxA gene that encodes the multifunctional autoprocessing RTX (MARTX) toxin. To accommodate genomic analysis, a discrepancy in the annotated rtxA start site was resolved experimentally. The correct start site is an ATG downstream from rtxC resulting in a gene of 13,638 bp and deduced protein of 4,545 amino acids. Among the El Tor O1 and closely related O139 and O37 genomes, rtxA was highly conserved, with nine alleles differing by only 1 to 6 nucleotides in 100 years. In contrast, 12 alleles from environment-associated isolates are highly variable, at 1 to 3% by nucleotide and 3 to 7% by amino acid. The difference in variation rates did not represent a bias for conservation of the El Tor rtxA compared to that of other strains but rather reflected the lack of gene variation in overall genomes. Three alleles were identified that would affect the function of the MARTX toxin. Two environmental isolates carry novel arrangements of effector domains. These include a variant from RC385 that would suggest an adenylate cyclase toxin and from HE-09 that may have actin ADP-ribosylating activity. Within the recently emerged altered El Tor strains that have a classical ctxB gene, a mutation arose in rtxA that introduces a premature stop codon that disabled toxin function. This null mutant is the genetic background for subsequent emergence of the ctxB7 allele resulting in the strain that spread into Haiti in 2010. Thus, similar to classical strains, the altered El Tor pandemic strains eliminated rtxA after acquiring a classical ctxB. PMID:23592265
Evolutionary blueprint for host- and niche-adaptation in Staphylococcus aureus clonal complex CC30
McGavin, Martin J.; Arsic, Benjamin; Nickerson, Nicholas N.
2012-01-01
Staphylococcus aureus clonal complex CC30 has caused infectious epidemics for more than 60 years, and, therefore, provides a model system to evaluate how evolution has influenced the disease potential of closely related strains. In previous multiple genome comparisons, phylogenetic analyses established three major branches that evolved from a common ancestor. Clade 1, comprised of historic pandemic phage type 80/81 methicillin susceptible S. aureus (MSSA), and Clade 2 comprised of contemporary community acquired methicillin resistant S. aureus (CA-MRSA) were hyper-virulent in murine infection models. Conversely, Clade 3 strains comprised of contemporary hospital associated MRSA (HA-MRSA) and clinical MSSA exhibited attenuated virulence, due to common single nucleotide polymorphisms (SNP's) that abrogate production of α-hemolysin Hla, and interfere with signaling of the accessory gene regulator agr. We have now completed additional in silico genome comparisons of 15 additional CC30 genomes in the public domain, to assess the hypothesis that Clade 3 has evolved to favor niche adaptation. In addition to SNP's that influence agr and hla, other common traits of Clade 3 include tryptophan auxotrophy due to a di-nucleotide deletion within trpD, a premature stop codon within isdH encoding an immunogenic cell surface protein involved in iron acquisition, loss of a genomic toxin–antitoxin (TA) addiction module, acquisition of S. aureus pathogenicity islands SaPI4, and SaPI2 encoding toxic shock syndrome toxin tst, and increased copy number of insertion sequence ISSau2, which appears to target transcription terminators. Compared to other Clade 3 MSSA, S. aureus MN8, which is associated with Staphylococcal toxic shock syndrome, exhibited a unique ISSau2 insertion, and enhanced production of toxic shock syndrome toxin encoded by SaPI2. Cumulatively, our data support the notion that Clade 3 strains are following an evolutionary blueprint toward niche-adaptation. PMID:22919639
Cingolani, Pablo; Patel, Viral M.; Coon, Melissa; Nguyen, Tung; Land, Susan J.; Ruden, Douglas M.; Lu, Xiangyi
2012-01-01
This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertions, and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of polymerase chain reaction-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate pre-mature stop codon mutation in each of the two allelic mutants whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic chemicals. PMID:22435069
Palanisamy, Navaneethan; Akaberi, Dario; Lennerstrand, Johan; Lundkvist, Åke
2018-05-10
Alkhumra hemorrhagic fever virus (AHFV), a relatively new member of the Flaviviruses, was discovered in Saudi Arabia 23 years ago. AHFV is classified in the tick-borne encephalitis virus serocomplex, along with the Kyasanur forest disease virus (KFDV) and tick-borne encephalitis virus (TBEV). Currently, very little is known about the pathologies of AHFV. In this study, using the available genome information of AHFV, KFDV and TBEV, we have predicted and compared the following aspects of these viruses: evolution, nucleotide and protein compositions, recombination, codon frequency, substitution rate, N- and O-glycosylation sites, signal peptide and cleavage site, transmembrane region, secondary structure of 5' and 3' UTRs and RNA-RNA interactions. Additionally, we have modeled the 3D protease and RNA-dependent RNA polymerase structures for AHFV, KFDV and TBEV. Recombination analysis showed no evidence of recombination in the AHFV genome with that of either KFDV or TBEV, although single break point analysis showed that nucleotide position 7399 (in the NS4B) is a breakpoint location. AHFV, KFDV and TBEV are very similar in terms of codon frequency, the number of transmembrane regions, properties of the polyprotein, RNA-RNA interaction sequences, NS3 protease and NS5 polymerase structures and 5' UTR structure. Using genome sequences, we showed the similarities between these closely- related viruses on several different areas.
XPD polymorphisms: effects on DNA repair proficiency.
Lunn, R M; Helzlsouer, K J; Parshad, R; Umbach, D M; Harris, E L; Sanford, K K; Bell, D A
2000-04-01
XPD codes for a DNA helicase involved in transcription and nucleotide excision repair. Rare XPD mutations diminish nucleotide excision repair resulting in hypersensitivity to UV light and increased risk of skin cancer. Several polymorphisms in this gene have been identified but their impact on DNA repair is not known. We compared XPD genotypes at codons 312 and 751 with DNA repair proficiency in 31 women. XPD genotypes were measured by PCR-RFLP. DNA repair proficiency was assessed using a cytogenetic assay that detects X-ray induced chromatid aberrations (breaks and gaps). Chromatid aberrations were scored per 100 metaphase cells following incubation at 37 degrees C (1.5 h after irradiation) to allow for repair of DNA damage. Individuals with the Lys/Lys codon 751 XPD genotype had a higher number of chromatid aberrations (132/100 metaphase cells) than those having a 751Gln allele (34/100 metaphase cells). Individuals having greater than 60 chromatid breaks plus gaps were categorized as having sub-optimal repair. Possessing a Lys/Lys751 genotype increased the risk of sub-optimal DNA repair (odds ratio = 7.2, 95% confidence interval = 1.01-87.7). The Asp312Asn XPD polymorphism did not appear to affect DNA repair proficiency. These results suggest that the Lys751 (common) allele may alter the XPD protein product resulting in sub-optimal repair of X-ray-induced DNA damage.
Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Farrell, John J; Davis, Lance; Théberge, Roger; Benson, Katherine A; Riolueang, Suchada; Viprakasit, Vip; Al-Allawi, Nasir A S; Ünal, Sule; Gümrük, Fatma; Akar, Nejat; Başak, A Nazli; Osorio, Leonor; Badens, Catherine; Pissard, Serge; Joly, Philippe; Campbell, Andrew D; Gallagher, Patrick G; Steinberg, Martin H; Forget, Bernard G; Chui, David H K
2016-03-01
Two 21-year old dizygotic twin men of Iraqi descent were homozygous for HBB codon 8, deletion of two nucleotides (-AA) frame-shift β(0) -thalassaemia mutation (FSC8; HBB:c25_26delAA). Both were clinically well, had splenomegaly, and were never transfused. They had mild microcytic anaemia (Hb 120-130 g/l) and 98% of their haemoglobin was fetal haemoglobin (HbF). Both were carriers of Hph α-thalassaemia mutation. On the three major HbF quantitative trait loci (QTL), the twins were homozygous for G>A HBG2 Xmn1 site at single nucleotide polymorphism (SNP) rs7482144, homozygous for 3-bp deletion HBS1L-MYB intergenic polymorphism (HMIP) at rs66650371, and heterozygous for the A>C BCL11A intron 2 polymorphism at rs766432. These findings were compared with those found in 22 other FSC8 homozygote patients: four presented with thalassaemia intermedia phenotype, and 18 were transfusion dependent. The inheritance of homozygosity for HMIP 3-bp deletion at rs66650371 and heterozygosity for Hph α-thalassaemia mutation was found in the twins and not found in any of the other 22 patients. Further studies are needed to uncover likely additional genetic variants that could contribute to the exceptionally high HbF levels and mild phenotype in these twins. © 2016 John Wiley & Sons Ltd.
Role of the Integrin-Linked Kinase, ILK, in Mammary Carcinogensis
2000-08-01
have been implicated in environmental stress clonei 6-10 responses in yeasts, plants and mammals, as well as regulating abscisic acid signal transduction...phosphatase 2C involved in abscisic acid signal transduction in higher plants. Proc. Natl Acad. Sci. USA, 95, 975-980. Strovel,E.T., Wu,D. and Sussman,D.J...contain a 450bp open reading frame, coding for 149 amino acids and a poly A tail 245bp downstream of the stop codon, although no polyadenylation site
Cui, Peng; Ji, Rimutu; Ding, Feng; Qi, Dan; Gao, Hongwei; Meng, He; Yu, Jun; Hu, Songnian; Zhang, Heping
2007-01-01
Background The family Camelidae that evolved in North America during the Eocene survived with two distinct tribes, Camelini and Lamini. To investigate the evolutionary relationship between them and to further understand the evolutionary history of this family, we determined the complete mitochondrial genome sequence of the wild two-humped camel (Camelus bactrianus ferus), the only wild survivor of the Old World camel. Results The mitochondrial genome sequence (16,680 bp) from C. bactrianus ferus contains 13 protein-coding, two rRNA, and 22 tRNA genes as well as a typical control region; this basic structure is shared by all metazoan mitochondrial genomes. Its protein-coding region exhibits codon usage common to all mammals and possesses the three cryptic stop codons shared by all vertebrates. C. bactrianus ferus together with the rest of mammalian species do not share a triplet nucleotide insertion (GCC) that encodes a proline residue found only in the nd1 gene of the New World camelid Lama pacos. This lineage-specific insertion in the L. pacos mtDNA occurred after the split between the Old and New World camelids suggests that it may have functional implication since a proline insertion in a protein backbone usually alters protein conformation significantly, and nd1 gene has not been seen as polymorphic as the rest of ND family genes among camelids. Our phylogenetic study based on complete mitochondrial genomes excluding the control region suggested that the divergence of the two tribes may occur in the early Miocene; it is much earlier than what was deduced from the fossil record (11 million years). An evolutionary history reconstructed for the family Camelidae based on cytb sequences suggested that the split of bactrian camel and dromedary may have occurred in North America before the tribe Camelini migrated from North America to Asia. Conclusion Molecular clock analysis of complete mitochondrial genomes from C. bactrianus ferus and L. pacos suggested that the two tribes diverged from their common ancestor about 25 million years ago, much earlier than what was predicted based on fossil records. PMID:17640355
Gallo, O; Sardi, I; Pepe, G; Franchi, A; Attanasio, M; Giusti, B; Bocciolini, C; Abbate, R
1999-07-19
Head-and-neck cancer (HNC) patients have a high risk of developing second primary tumors of the upper aerodigestive tract, the main cause of death. Although the roles of tobacco and diet in multiple head-and-neck carcinogenesis have been thoroughly investigated, little is known about individual genetic susceptibility factors involved in this process. Genomic instability, reflecting the propensity and the susceptibility of the genome to acquire multiple alterations, could be considered a driving force behind multiple carcinogenesis. Mutation of the p53 tumor-suppressor gene has been proposed to play an important role in this process. Therefore, we evaluated the incidence of inherited p53 germ-line alteration(s) in a population of 24 consecutive HNC patients and their first-degree relatives affected by multiple malignancies as well as the occurrence of p53 somatic acquired mutation(s) in 16 cancers, including first and second primaries from 5 HNCs of the same group. Mutations in exons 4-11 of the p53 gene were investigated using SSCP-PCR analysis and DNA sequencing. Analysis was extended to the peripheral blood and cancer biopsies available from first-degree relatives of cancer-prone families with p53 germ-line mutations. p53 germ-line mutations were identified in the peripheral blood and corresponding cancers of 3 HNC patients who had multiple malignancies. The only missense mutation detected was mapped in exon 6; it is a GTG to GAG substitution with an amino acid change from Val to Glu at codon 197. The remaining 2 p53 germ-line mutations were single-nucleotide substitutions without amino acid change in exon 6 (codon 213, CGA to CGG) and in exon 8 (codon 295, CCT to CCC), respectively. These mutations were found in HNC patients with a family history of cancer. Abnormal expression of wild-type p53 protein in normal and pathological tissues from patients with the same sense single-nucleotide substitutions was detected by immuno-histochemistry.
Hao, Juan-Juan; Hao, Jia-Sheng; Sun, Xiao-Yan; Zhang, Lan-Lan; Yang, Qun
2014-01-01
Abstract The complete mitochondrial genomes of Leptidea morsei Fenton (Lepidoptera: Pieridae: Dis-morphiinae) and Catopsilia pomona (F.) (Lepidoptera: Pieridae: Coliadinae) were determined to be 15,122 and 15,142 bp in length, respectively, with that of L . morsei being the smallest among all known butterflies. Both mitogenomes contained 37 genes and an A+T-rich region, with the gene order identical to those of other butterflies, except for the presence of a tRNA-like insertion, tRNA Leu (UUR), in C . pomona . The nucleotide compositions of both genomes were higher in A and T (80.2% for L . morsei and 81.3% for C . pomona ) than C and G; the A+T bias had a significant effect on the codon usage and the amino acid composition. The protein-coding genes utilized the standard mitochondrial start codon ATN, except the COI gene using CGA as the initiation codon, as reported in other butterflies. The intergenic spacer sequence between the tRNA Ser (UCN) and ND1 genes contained the ATACTAA motif. The A+T-rich region harbored a poly-T stretch and a conserved ATAGA motif located at the end of the region. In addition, there was a triplicated 23 bp repeat and a microsatellite-like (TA) 9 (AT) 3 element in the A+T-rich region of the L. morsei mitogenome , while in C . pomona, there was a duplicated 24 bp repeat element and a microsatellite-like (TA) 9 element. The phylogenetic trees of the main butterfly lineages (Hesperiidae, Papilionidae, Pieridae, Nymphalidae, Lycaenidae, and Riodinidae) were reconstructed with maximum likelihood and Bayesian inference methods based on the 13 concatenated nucleotide sequences of protein-coding genes, and both trees showed that the Pieridae family is sister to Lycaenidae. Although this result contradicts the traditional morphologically based views, it agrees with other recent studies based on mitochondrial genomic data. PMID:25368074
Discovery and biological characterization of geranylated RNA in bacteria.
Dumelin, Christoph E; Chen, Yiyun; Leconte, Aaron M; Chen, Y Grace; Liu, David R
2012-11-01
A general MS-based screen for unusually hydrophobic cellular small molecule-RNA conjugates revealed geranylated RNA in Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa and Salmonella enterica var. Typhimurium. The geranyl group is conjugated to the sulfur atom in two 5-methylaminomethyl-2-thiouridine nucleotides. These geranylated nucleotides occur in the first anticodon position of tRNA(Glu)(UUC), tRNA(Lys)(UUU) and tRNA(Gln)(UUG) at a frequency of up to 6.7% (~400 geranylated nucleotides per cell). RNA geranylation can be increased or abolished by mutation or deletion of the selU (ybbB) gene in E. coli, and purified SelU protein in the presence of geranyl pyrophosphate and tRNA can produce geranylated tRNA. The presence or absence of the geranyl group in tRNA(Glu)(UUC), tRNA(Lys)(UUU) and tRNA(Gln)(UUG) affects codon bias and frameshifting during translation. These RNAs represent the first reported examples of oligoisoprenylated cellular nucleic acids.
Burstyn, J N; Heiger-Bernays, W J; Cohen, S M; Lippard, S J
2000-11-01
Mapping of cis-diamminedichloroplatinum(II) (cis-DDP, cisplatin) DNA adducts over >3000 nucleotides was carried out using a replication blockage assay. The sites of inhibition of modified T4 DNA polymerase, also referred to as stop sites, were analyzed to determine the effects of local sequence context on the distribution of intrastrand cisplatin cross-links. In a 3120 base fragment from replicative form M13mp18 DNA containing 24.6% guanine, 25.5% thymine, 26.9% adenine and 23.0% cytosine, 166 individual stop sites were observed at a bound platinum/nucleotide ratio of 1-2 per thousand. The majority of stop sites (90%) occurred at G(n>2) sequences and the remainder were located at sites containing an AG dinucleotide. For all of the GG sites present in the mapped sequences, including those with Gn(>)2, 89% blocked replication, whereas for the AG sites only 17% blocked replication. These blockage sites were independent of flanking nucleotides in a sequence of N(1)G*G*N(2) where N(1), N(2) = A, C, G, T and G*G* indicates a 1,2-intrastrand platinum cross-link. The absence of long-range sequence dependence was confirmed by monitoring the reaction of cisplatin with a plasmid containing an 800 bp insert of the human telomere repeat sequence (TTAGGG)(n). Platination reactions monitored at several formal platinum/nucleotide ratios or as a function of time reveal that the telomere insert was not preferentially damaged by cisplatin. Both replication blockage and telomere-insert plasmid platination experiments indicate that cisplatin 1,2-intrastrand adducts do not form preferentially at G-rich sequences in vitro.
Translation regulation of mammalian selenoproteins.
Vindry, Caroline; Ohlmann, Théophile; Chavatte, Laurent
2018-05-09
Interest in selenium research has considerably grown over the last decades owing to the association of selenium deficiencies with an increased risk of several human diseases, including cancers, cardiovascular disorders and infectious diseases. The discovery of a genetically encoded 21 st amino acid, selenocysteine, is a fascinating breakthrough in molecular biology as it is the first addition to the genetic code deciphered in the 1960s. Selenocysteine is a structural and functional analog of cysteine, where selenium replaces sulfur, and its presence is critical for the catalytic activity of selenoproteins. The insertion of selenocysteine is a non-canonical translational event, based on the recoding of a UGA codon in selenoprotein mRNAs, normally used as a stop codon in other cellular mRNAs. Two RNA molecules and associated partners are crucial components of the selenocysteine insertion machinery, the Sec-tRNA [Ser]Sec devoted to UGA codon recognition and the SECIS elements located in the 3'UTR of selenoprotein mRNAs. The translational UGA recoding event is a limiting stage of selenoprotein expression and its efficiency is regulated by several factors. The control of selenoproteome expression is crucial for redox homeostasis and antioxidant defense of mammalian organisms. In this review, we summarize current knowledge on the co-translational insertion of selenocysteine into selenoproteins, and its layers of regulation. Copyright © 2018. Published by Elsevier B.V.
Selenium. Role of the Essential Metalloid in Health
Kurokawa, Suguru; Berry, Marla J.
2015-01-01
Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102
Mitochondrial DNA Mutation Associated with Leber's Hereditary Optic Neuropathy
NASA Astrophysics Data System (ADS)
Wallace, Douglas C.; Singh, Gurparkash; Lott, Marie T.; Hodge, Judy A.; Schurr, Theodore G.; Lezza, Angela M. S.; Elsas, Louis J.; Nikoskelainen, Eeva K.
1988-12-01
Leber's hereditary optic neuropathy is a maternally inherited disease resulting in optic nerve degeneration and cardiac dysrhythmia. A mitochondrial DNA replacement mutation was identified that correlated with this disease in multiple families. This mutation converted a highly conserved arginine to a histidine at codon 340 in the NADH dehydrogenase subunit 4 gene and eliminated an Sfa NI site, thus providing a simple diagnostic test. This finding demonstrated that a nucleotide change in a mitochondrial DNA energy production gene can result in a neurological disease.
Role of Human DNA Polymerase and Its Accessory Proteins in Breast Cancer
2000-09-01
10, 13, 15, and 19 are abnormal and indicate mutants in POLD1 gene . Determination of NIRCA detected mutations by DNA sequencing NIRCA detected...CAGCAA; GnGln) in codon 461. Table III. Summary of mutation identified in the Exo motif of POLD1 Gene from breast cancer. Patient/Cell line Nucleotide...the gene for human DNA polymerase 8 catalytic p125 (POLDI) and p50 ( POLD2 ) subunits (Chang et al., 1995, Perez et al., 2000).. Normal and breast
Mutation-Specific RAS Oncogenicity Explains N-RAS Codon 61 Selection in Melanoma
Burd, Christin E.; Liu, Wenjin; Huynh, Minh V.; Waqas, Meriam A.; Gillahan, James E.; Clark, Kelly S.; Fu, Kailing; Martin, Brit L.; Jeck, William R.; Souroullas, George P.; Darr, David B.; Zedek, Daniel C.; Miley, Michael J.; Baguley, Bruce C.; Campbell, Sharon L.
2014-01-01
N-RAS mutation at codon 12, 13 or 61 is associated with transformation; yet, in melanoma, such alterations are nearly exclusive to codon 61. Here, we compared the melanoma susceptibility of an N-RasQ61R knock-in allele to similarly designed K-RasG12D and N-RasG12D alleles. With concomitant p16INK4a inactivation, K-RasG12D or N-RasQ61R expression efficiently promoted melanoma in vivo, whereas N-RasG12D did not. Additionally, N-RasQ61R mutation potently cooperated with Lkb1/Stk11 loss to induce highly metastatic disease. Functional comparisons of N-RasQ61R and N-RasG12D revealed little difference in the ability of these proteins to engage PI3K or RAF. Instead, N-RasQ61R showed enhanced nucleotide binding, decreased intrinsic GTPase activity and increased stability when compared to N-RasG12D. This work identifies a faithful model of human N-RAS mutant melanoma, and suggests that the increased melanomagenecity of N-RasQ61R over N-RasG12D is due to heightened abundance of the active, GTP-bound form rather than differences in the engagement of downstream effector pathways. PMID:25252692
Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A
2012-01-15
Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.
Sun, Liying; Andika, Ida Bagus; Shen, Jiangfeng; Yang, Di; Ratti, Claudio; Chen, Jianping
2013-10-01
Some viruses use alternative translation initiation at non-AUG codons as a strategy to produce multiple proteins during gene expression. Here we show that, using this strategy, Chinese wheat mosaic virus (CWMV; Furovirus) expresses a larger form of coat protein (N-ext/CP) in infected plants. Site-directed mutagenesis and transient expression analysis confirmed that CWMV N-ext/CP is initiated at an upstream in-frame CUG codon at nucleotide position 207-209 of RNA 2, which adds a 39 amino acid (aa) N-terminal extension to the major CP. Interestingly, in planta and in vitro analyses indicated that CWMV N-ext/CP but not CP interacts with the CWMV cysteine-rich protein (CRP), an RNA silencing suppressor. We further determined that the N-terminal 39 aa extension, particularly the 10 aa region immediately upstream of the major CP coding region is responsible for the interaction of N-ext/CP with CRP. In an Agrobacterium co-infiltration assay, co-expression with N-ext/CP did not affect CRP silencing suppression activity. Thus the alternative translation initiation at a CUG codon provides the CWMV N-ext/CP with the ability to bind to the viral silencing suppressor. Copyright © 2013 Elsevier B.V. All rights reserved.
Lathe, R
1985-05-05
Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.
Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S
2005-01-01
Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Hueso, Miguel; Navarro, Estanis; Moreso, Francesc; Beltrán-Sastre, Violeta; Ventura, Francesc; Grinyó, Josep M; Serón, Daniel
2006-05-27
Transforming growth factor (TGF)-beta(1) is increased in allograft rejection and its production is associated with single nucleotide polymorphisms (SNPs). The contribution of SNPs at codons 10 and 25 of the TGF-beta(1) gene to renal allograft damage was assessed in 6-month protocol biopsies and their association with TGF-beta(1) production. TGF-beta(1) genotypes were evaluated by polymerase chain reaction (PCR)/restriction fragment length polymorphism. Intragraft TGF-beta(1) messenger RNA (mRNA) was measured by real-time PCR and TGF-beta(1) plasma levels were assessed by enzyme-linked immunosorbent assay. Eighty consecutive patients were included. Allele T at codon 10 (risk ratio, 6.7; P = 0.02) and an episode of acute rejection before protocol biopsy (risk ratio, 6.2; P = 0.01) were independent predictors of subclinical rejection (SCR). TGF-beta(1) plasma levels, but not those of TGF-beta(1) mRNA, were increased in patients with SCR (2.59 ng/mL +/- 0.91 [n = 22] vs. 2.05 ng/mL +/- 0.76 [n = 43]; P = 0.01). There was no association between allele T and TGF-beta(1) plasma or intragraft levels. Allele T at codon 10 of the TGF-beta(1) gene is associated with a higher incidence of SCR.
Genes encoding intrinsic disorder in Eukaryota have high GC content
Peng, Zhenling; Uversky, Vladimir N.
2016-01-01
ABSTRACT We analyze a correlation between the GC content in genes of 12 eukaryotic species and the level of intrinsic disorder in their corresponding proteins. Comprehensive computational analysis has revealed that the disordered regions in eukaryotes are encoded by the GC-enriched gene regions and that this enrichment is correlated with the amount of disorder and is present across proteins and species characterized by varying amounts of disorder. The GC enrichment is a result of higher rate of amino acid coded by GC-rich codons in the disordered regions. Individual amino acids have the same GC-content profile between different species. Eukaryotic proteins with the disordered regions encoded by the GC-enriched gene segments carry out important biological functions including interactions with RNAs, DNAs, nucleotides, binding of calcium and metal ions, are involved in transcription, transport, cell division and certain signaling pathways, and are localized primarily in nucleus, cytosol and cytoplasm. We also investigate a possible relationship between GC content, intrinsic disorder and protein evolution. Analysis of a devised “age” of amino acids, their disorder-promoting capacity and the GC-enrichment of their codons suggests that the early amino acids are mostly disorder-promoting and their codons are GC-rich while most of late amino acids are mostly order-promoting. PMID:28232902
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.
Höps, Wolfram; Jeffryes, Matt; Bateman, Alex
2018-01-01
We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation. Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
A novel frameshift deletion in the albumin gene causes analbuminemia in a young Turkish woman.
Dagnino, Monica; Caridi, Gianluca; Aydin, Zeki; Ozturk, Savas; Karaali, Zeynep; Kazancioglu, Rumeyza; Cefle, Kivanc; Gursu, Meltem; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2010-11-11
Analbuminemia is a rare autosomal recessive disorder manifested by the absence, or severe reduction, of circulating serum albumin. The analbuminemic trait was diagnosed in a young Turkish woman on the basis of her clinical symptoms (bilateral lower limb edema) and biochemical findings (minimal albumin amount and variable increases in other protein fractions). Total DNA from the analbuminemic proband and her parents was PCR-amplified using oligonucleotide primers designed to amplify the 14 exons of the albumin gene (ALB) and the flanking intron regions. The products were screened for mutations by single-strand conformation polymorphism (SSCP) and heteroduplex analyses (HA). HA allowed the identification of the mutation site in exon 12. Direct DNA sequencing of this abnormal fragment revealed that the analbuminemic trait was caused by a homozygous CA deletion at nucleotide positions c. 1614-1615 in the codons for Cys538 and Thr539. The subsequent frameshift should give rise to a putative truncated albumin variant in which the sequence Cys(538)-Thr-Leu-Ser has been changed to Cys(538)-Thr-Phe-Stop. The parents were heterozygous for the same mutation. Gel-based mutation detection and DNA sequencing substantiate the clinical diagnosis of congenital analbuminemia in our patient and show that the condition is caused by a novel mutation within the ALB gene. These results contribute to shed light on the molecular basis of this rare condition. 2010 Elsevier B.V. All rights reserved.
Flachsová, E; Verma, I C; Ulbrichová, D; Saxena, R; Zeman, J; Saudek, V; Raman, C S; Martásek, P
2007-01-01
Based on Internet search, we were contacted by a 50-year-old man suffering from severe abdominal pain. Acute hepatic porphyria was considered from positive Watson-Schwartz test. He, not being a health professional, searched for centres with ability to do molecular diagnosis and for information about therapeutic possibilities. He asked his physician for haem-arginate (Normosang, Orphan Europe, Paris) treatment, arranged sending his blood to our laboratory and mediated genetic counselling for him and his family. Molecular analyses of the PBGD gene revealed a novel mutation in exon 15, the 973insG. Subsequently, genetic analysis was performed in 18 members of the proband's extensive family. In 12 members of the family, the same mutation was found. The mutation, which consisted of one nucleotide insertion, resulted in addition of four different amino acids leading to a protein that is prematurely truncated by the stop codon. The effect of this mutation was investigated by expression of the wildtype and mutated PBGD in a prokaryotic expression system. The mutation resulted in instability of the protein and loss of enzymatic function. The increasing access to a number of disease- and symptom-oriented web pages presents a new and unusual venue for gaining knowledge and enabling self-diagnosis and self-help. It is, therefore, important that diseaseoriented Internet pages for public use should be designed with clarity and accurate current knowledge based background.
Ding, Qiu-lan; Wang, Hong-li; Wang, Xue-feng; Wang, Ming-shan; Fu, Qi-hua; Wu, Wen-man; Hu, Yi-qun; Wang, Zhen-yi
2003-10-01
To identify the genetic mutations of a severe inherited coagulation factor VII (FVII) deficiency pedigree. The diagnosis was validated by coagulant and haemostatic parameters. FVII gene mutations were screened in the propositus and his family members by DNA direct sequencing and confirmed by digestions of the restriction enzymes of the PCR production. Two heterozygous missense mutations were found in the propositus of the pedigree: a G to T transversion at position 9482 in exon 6 and a C to T mutation at position 11348 in exon 8 resulting in the amino acid substitution of Arg152 with Leu and Arg304 with Trp, respectively. A heterozygous single nucleotide deletion (C) at position 11487-11489(CCC) within exon 8 was identified, which predicted the frameshift mutation at position His351 followed by the changes of six corresponding amino acids and appearance of a premature protein caused by stop codon. The heterozygous mutations identified in the proband were derived from his father (Arg152 to Leu) and his mother (Arg304 to Trp mutation) and a heterozygous deletion (C) at position 11487-9(CCC). By tracing the other pedigree members, it was found that his grandmother had a heterozygous mutation of Arg304Trp and a heterozygous polymorphism of Arg353Gln and his grandfather had a heterozygous Arg152Leu mutation. Three heterozygous mutations were found in a pedigree with hereditary coagulation factor VII deficiency. Arg152Leu and deletion C at position 11487-9(CCC) were novel mutations.
Xie, Jingli; Pabón, Dina; Jayo, Asier; Butta, Nora; González-Manchón, Consuelo
2005-05-01
We report a novel genetic defect in a patient with type I Glanzmann thrombasthenia. Flow cytometry analysis revealed undetectable levels of platelet glycoproteins alphaIIb and beta3, although residual amounts of both proteins were detectable in immunoblotting analysis. Sequence analysis of reversely transcribed platelet beta3 mRNA showed a 100-base pair deletion in the 3'-boundary of exon 11, that results in a frame shift and appearance of a premature STOP codon. Analysis of the corresponding genomic DNA fragment revealed the presence of a homozygous C1815T transition in exon 11. The mutation does not change the amino acid residue but it creates an ectopic consensus splice donor site that is used preferentially, causing splicing out of part of exon 11. The parents of the proband, heterozygous for this mutation, were asymptomatic and had reduced platelet content of alphaIIbbeta3. PCR-based relative quantification of beta3 mRNA failed to detect the mutant transcript in the parents and showed a marked reduction in the patient. The results suggest that the thrombasthenic phenotype is, mainly, the result of the reduced availability of beta3-mRNA, most probably due to activation of the nonsense-mediated mRNA decay mechanism. They also show the convenience of analyzing both genomic DNA and mRNA, in order to ascertain the functional consequences of single nucleotide substitutions.
Fravalo, Philippe; Cherifi, Tamazight; Neira Feliciano, Kersti Dina; Letellier, Ann; Fairbrother, Julie-Hélène; Bekal, Sadjia
2017-01-01
The introduction of Listeria monocytogenes into the food production chain is a concern, with numerous grouped cases of listeriosis associated with milk-derived or pork-derived products have been documented. Management of this zoonotic pathogen considers all strains as an equal risk. Recently, a new perspective for characterisation of strain virulence was introduced with the discovery of the unaltered sequence of InlA as a determinant of strain virulence; this has also been reported as an infrequent finding among so-called environmental strains, that is, strains isolated from food or from surfaces in food industries. The aim of this study was to differentiate L monocytogenes strains isolated from animal cases versus those from human cases and to differentiate clinical strains from environmental ones using a Caenorhabditis elegans virulence testing model. In Quebec in 2013/2014, the surveillance of L monocytogenes clinical isolates registered a total of 20 strains of animal origin and 16 pulsed-field gel electrophoresis types isolated from human cases. The mixed PCR multiplex agglutination protocol used for geno-serotyping clearly discriminated genogroup IVB strains from bovine and human origins. The presence of a premature stop codon single nucleotide polymorphism in the inlA gene sequence in clinical strains and the identical behaviour of particular strains in the C elegans model are discussed in this paper from the perspective of industrial management of L monocytogenes risk. PMID:28761668
Braakhuis, B J M; Rietbergen, M M; Buijze, M; Snijders, P J F; Bloemena, E; Brakenhoff, R H; Leemans, C R
2014-09-01
Little is known about the molecular carcinogenesis of oral squamous cell carcinoma (OSCC) in young adult patients. The aim of this study was to investigate the detailed TP53 mutation and human papilloma virus (HPV) status of OSCC in patients, younger than 45 years. TP53 mutations were determined with direct sequencing on paraffin-embedded carcinoma tissue from 31 young patients and compared with two older age OSCC reference groups: one from the same institute (N = 87) and an independent one (N = 675). Biologically active tumour HPV was detected by p16-immunohistochemistry followed by a HPV-DNA GP5 + /6 + -PCR. HPV16 was present in one OSCC (3%). TP53 mutations were found in 14 (45%) OSCC: five were missense and nine resulted in a truncated protein. Six of these latter were insertions or deletions of one or more nucleotides leading to frameshift, one was at a splice site and two resulted in a stop codon. The percentage of truncating mutations (64% of all mutations) was higher than that observed in the institute's reference group (44%, P = 0.23) and in the independent reference group (24%, P = 0.002). This study shows that TP53 mutations are common in OSCC of young adult patients; infection with biologically active HPV is rare. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
A High-Definition View of Functional Genetic Variation from Natural Yeast Genomes
Bergström, Anders; Simpson, Jared T.; Salinas, Francisco; Barré, Benjamin; Parts, Leopold; Zia, Amin; Nguyen Ba, Alex N.; Moses, Alan M.; Louis, Edward J.; Mustonen, Ville; Warringer, Jonas; Durbin, Richard; Liti, Gianni
2014-01-01
The question of how genetic variation in a population influences phenotypic variation and evolution is of major importance in modern biology. Yet much is still unknown about the relative functional importance of different forms of genome variation and how they are shaped by evolutionary processes. Here we address these questions by population level sequencing of 42 strains from the budding yeast Saccharomyces cerevisiae and its closest relative S. paradoxus. We find that genome content variation, in the form of presence or absence as well as copy number of genetic material, is higher within S. cerevisiae than within S. paradoxus, despite genetic distances as measured in single-nucleotide polymorphisms being vastly smaller within the former species. This genome content variation, as well as loss-of-function variation in the form of premature stop codons and frameshifting indels, is heavily enriched in the subtelomeres, strongly reinforcing the relevance of these regions to functional evolution. Genes affected by these likely functional forms of variation are enriched for functions mediating interaction with the external environment (sugar transport and metabolism, flocculation, metal transport, and metabolism). Our results and analyses provide a comprehensive view of genomic diversity in budding yeast and expose surprising and pronounced differences between the variation within S. cerevisiae and that within S. paradoxus. We also believe that the sequence data and de novo assemblies will constitute a useful resource for further evolutionary and population genomics studies. PMID:24425782
Wang, Pei; Song, Fan; Cai, Wanzhi
2014-01-01
Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.
Gustafson, G; Armour, S L
1986-01-01
The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. Images PMID:3754962
Jewell, Brittany E.; Versalovic, Erika M.; Olsen, Randall J.; Bachert, Beth A.; Lukomski, Slawomir; Musser, James M.
2015-01-01
Group A Streptococcus (GAS) predominantly exists as a colonizer of the human oropharynx that occasionally breaches epithelial barriers to cause invasive diseases. Despite the frequency of GAS carriage, few investigations into the contributory molecular mechanisms exist. To this end, we identified a naturally occurring polymorphism in the gene encoding the streptococcal collagen-like protein A (SclA) in GAS carrier strains. All previously sequenced invasive serotype M3 GAS possess a premature stop codon in the sclA gene truncating the protein. The carrier polymorphism is predicted to restore SclA function and was infrequently identified by targeted DNA sequencing in invasive strains of the same serotype. We demonstrate that a strain with the carrier sclA allele expressed a full-length SclA protein, while the strain with the invasive sclA allele expressed a truncated variant. An isoallelic mutant invasive strain with the carrier sclA allele exhibited decreased virulence in a mouse model of invasive disease and decreased multiplication in human blood. Further, the isoallelic invasive strain with the carrier sclA allele persisted in the mouse nasopharynx and had increased adherence to cultured epithelial cells. Repair of the premature stop codon in the invasive sclA allele restored the ability to bind the extracellular matrix proteins laminin and cellular fibronectin. These data demonstrate that a mutation in GAS carrier strains increases adherence and decreases virulence and suggest selection against increased adherence in GAS invasive isolates. PMID:25561712
A novel homozygous stop-codon mutation in human HFE responsible for nonsense-mediated mRNA decay.
Padula, Maria Carmela; Martelli, Giuseppe; Larocca, Marilena; Rossano, Rocco; Olivieri, Attilio
2014-09-01
HFE-hemochromatosis (HH) is an autosomal disease characterized by excessive iron absorption. Homozygotes for H63D variant, and still less H63D heterozygotes, generally do not express HH phenotype. The data collected in our previous study in the province of Matera (Basilicata, Italy) underlined that some H63D carriers showed altered iron metabolism, without additional factors. In this study, we selected a cohort of 10/22 H63D carriers with severe biochemical iron overload (BIO). Additional analysis was performed for studying HFE exons, exon-intron boundaries, and untranslated regions (UTRs) by performing DNA extraction, PCR amplification and sequencing. The results showed a novel substitution (NM_000410.3:c.847C>T) in a patient exon 4 (GenBankJQ478433); it introduces a premature stop-codon (PTC). RNA extraction and reverse-transcription were also performed. Quantitative real-time PCR was carried out for verifying if our aberrant mRNA is targeted for nonsense-mediated mRNA decay (NMD); we observed that patient HFE mRNA was expressed much less than calibrator, suggesting that the mutated HFE protein cannot play its role in iron metabolism regulation, resulting in proband BIO. Our finding is the first evidence of a variation responsible for a PTC in iron cycle genes. The genotype-phenotype correlation observed in our cases could be related to the additional mutation. Copyright © 2014 Elsevier Inc. All rights reserved.
Cryptic tRNAs in chaetognath mitochondrial genomes.
Barthélémy, Roxane-Marie; Seligmann, Hervé
2016-06-01
The chaetognaths constitute a small and enigmatic phylum of little marine invertebrates. Both nuclear and mitochondrial genomes have numerous originalities, some phylum-specific. Until recently, their mitogenomes seemed containing only one tRNA gene (trnMet), but a recent study found in two chaetognath mitogenomes two and four tRNA genes. Moreover, apparently two conspecific mitogenomes have different tRNA gene numbers (one and two). Reanalyses by tRNAscan-SE and ARWEN softwares of the five available complete chaetognath mitogenomes suggest numerous additional tRNA genes from different types. Their total number never reaches the 22 found in most other invertebrates using that genetic code. Predicted error compensation between codon-anticodon mismatch and tRNA misacylation suggests translational activity by tRNAs predicted solely according to secondary structure for tRNAs predicted by tRNAscan-SE, not ARWEN. Numbers of predicted stop-suppressor (antitermination) tRNAs coevolve with predicted overlapping, frameshifted protein coding genes including stop codons. Sequence alignments in secondary structure prediction with non-chaetognath tRNAs suggest that the most likely functional tRNAs are in intergenic regions, as regular mt-tRNAs. Due to usually short intergenic regions, generally tRNA sequences partially overlap with flanking genes. Some tRNA pairs seem templated by sense-antisense strands. Moreover, 16S rRNA genes, but not 12S rRNAs, appear as tRNA nurseries, as previously suggested for multifunctional ribosomal-like protogenomes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Wu, Xiufeng; Wan, Shengqin; Pujar, Shashikant; Haskins, Mark E.; Schlafer, Donald H.; Lee, Mary M.; Meyers-Wallen, Vicki N.
2008-01-01
Müllerian Inhibiting Substance (MIS), a secreted glycoprotein in the Transforming Growth Factor-beta (TGF-beta) family of growth factors, mediates regression of the Müllerian ducts during embryonic sex differentiation in males. In Persistent Müllerian Duct Syndrome (PMDS), rather than undergoing involution, the Müllerian ducts persist in males, giving rise to the uterus, Fallopian tubes, and upper vagina. Genetic defects in MIS or its receptor (MISRII) have been identified in patients with PMDS. The phenotype in the canine model of PMDS derived from the miniature schnauzer breed is strikingly similar to that of human patients. In this model, PMDS is inherited as a sex-limited autosomal recessive trait. Previous studies indicated that a defect in the MIS receptor or its downstream signaling pathway was likely to be causative of the canine syndrome. In this study the canine PMDS phenotype and clinical sequelae are described in detail. Affected and unaffected members of this pedigree are genotyped, identifying a single base pair substitution in MISRII that introduces a stop codon in exon 3. The homozygous mutation terminates translation at 80 amino acids, eliminating much of the extracellular domain and the entire transmembrane and intracellular signaling domains. Findings in this model may enable insights to be garnered from correlation of detailed clinical descriptions with molecular defects, which are not otherwise possible in the human syndrome. PMID:18723470
Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya
2018-05-01
Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 GDAP1-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yanase, Toshihiko; Takayanagi, Ryoichi; Oba, Koichi
Congenital adrenal hypoplasia, an X-linked disorder, is characterized by primary adrenal insufficiency and frequent association with hypogonadotropic hypogonadism. The X-chromosome gene DAX-1 has been most recently identified and shown to be responsible for this disorder. We analyzed the DAX-1 genes of two unrelated Japanese patients with congenital adrenal hypoplasia and hypogonadotropic hypogonadism by using PCR amplification of genomic DNA and its complete exonic sequencing. In a family containing several affected individuals, the proband male patient had a stop codon (TGA) in place of tryptophan (TGG) at amino acid position 171. As expected, his mother was a heterozygous carrier for themore » mutation, whereas his father and unaffected brother did not carry this mutation. In another male patient with noncontributory family history, sequencing revealed a 1-bp (T) deletion at amino acid position 280, leading to a frame shift and, subsequently a premature stop codon at amino acid position 371. The presence of this mutation in the patients` genome was further confirmed by digestion of genomic PCR product with MspI created by this mutation. Family studies using MspI digestion of genomic PCR products revealed that neither parent of this individual carried the mutation. These results clearly indicate that congenital adrenal hypoplasia and hypogonadotropic hypogonadism result from not only inherited but also de novo mutation in the DAX-1 gene. 31 refs., 4 figs., 2 tabs.« less
The complete mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae).
Zhou, Xuming; Chen, Yu; Zhu, Shanliang; Xu, Haigen; Liu, Yan; Chen, Lian
2016-01-01
The mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae) is the first complete mtDNA sequence reported in the genus Pomacea. The total length of mtDNA is 15,707 bp, which containing 13 protein-coding genes, 2 ribosomal RNAs, 22 transfer RNAs, and a 359 bp non-coding region. The A + T content of the overall base composition of H-strand is 71.7% (T: 41%, C: 12.7%, A: 30.7%, G: 15.6%). ATP6, ATP8, CO1, CO2, ND1-3, ND5, ND6, ND4L and Cyt b genes begin with ATG as start codon, CO3 and ND4 begin with ATA. ATP8, CO2-3, ND4L, ND2-6 and Cyt b genes are terminated with TAA as stop codon, ATP6, ND1, and CO1 end with TAG. A long non-coding region is found and a 23 bp repeat unit repeat 11 times in this region.
Characterization of six mutations in Exon 37 of neurofibromatosis type 1 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Upadhyaya, M.; Osborn, M.; Maynard, J.
Neurofibromatosis type 1 (NF1) is one of the most common inherited disorders, with an incidence of 1 in 3,000. We screened a total of 320 unrelated NF1 patients for mutations in exon 37 of the NF1 gene. Six independent mutations were identified, of which three are novel, and these include a recurrent nonsense mutation identified in 2 unrelated patients at codon 2281 (G2281X), a 1-bp insertion (6791 ins A) resulting in a change of TAG (tyrosine) to a TAA (stop codon), and a 3-bp deletion (6839 del TAC) which generated a frameshift. Another recurrent nonsense mutation, Y2264X, which was detectedmore » in 2 unrelated patients in this study, was also previously reported in 2 NF1 individuals. All the mutations were identified within a contiguous 49-bp sequence. Further studies are warranted to support the notion that this region of the gene contains highly mutable sequences. 17 refs., 2 figs., 1 tab.« less
Miura, Y; Hershkovitz, E; Inagaki, A; Parvari, R; Oiso, Y; Phillip, M
2000-10-01
T4-binding globulin (TBG) is the major thyroid hormone transport protein in human serum. Inherited TBG abnormalities do not usually alter the metabolic status and are transmitted in X-linked inheritance. A high prevalence of complete TBG deficiency (TBG-CD) has been reported among the Bedouin population in the Negev (southern Israel). In this study we report a novel single mutation causing complete TBG deficiency due to a deletion of the last base of codon 38 (exon 1), which led to a frame shift resulting in a premature stop at codon 51 and a presumed truncated peptide of 50 residues. This new variant of TBG (TBG-CD-Negev) was found among all of the patients studied. We conclude that a single mutation may account for TBG deficiency among the Bedouins in the Negev. This report is the first to describe a mutation in a population with an unusually high prevalence of TBG-CD.
The complete mitochondrial genome of the Aluterus monoceros.
Li, Wenshen; Zhang, Guoqing; Wen, Xin; Wang, Qian; Chen, Guohua
2016-07-01
The complete mitochondrial genome of Aluterus monoceros (A. monoceros) has been sequenced. The mitochondrial genome of A. monoceros is 16,429 bp in length, consisting of 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes and a D-loop region (Gen Bank accession number KP637022). The base A + T of the mitochondrial genome is 63.25%, including 33.16% of A, 30.09% of T and 20.74% of C. Twelve protein-coding genes start with a standard ATG as the initiation codon, expect for the COXI, which begins with GTG. Some of the termination codons are incomplete T or TA, except for the ND1, COXI, ATP8, ND4L1, ND5 and ND6, which stop with TAA. Construction of phylogenetic trees based on the entire mitochondrial genome sequence of 14 Tetrodontiformes species constructed has suggested that A. monoceros has closer relationship with Acreichthys tomentosus and Monacanthus chinensis, and they constitute a sister group.
Odilorhabdins, Antibacterial Agents that Cause Miscoding by Binding at a New Ribosomal Site.
Pantel, Lucile; Florin, Tanja; Dobosz-Bartoszek, Malgorzata; Racine, Emilie; Sarciaux, Matthieu; Serri, Marine; Houard, Jessica; Campagne, Jean-Marc; de Figueiredo, Renata Marcia; Midrier, Camille; Gaudriault, Sophie; Givaudan, Alain; Lanois, Anne; Forst, Steve; Aumelas, André; Cotteaux-Lautard, Christelle; Bolla, Jean-Michel; Vingsbo Lundberg, Carina; Huseby, Douglas L; Hughes, Diarmaid; Villain-Guillot, Philippe; Mankin, Alexander S; Polikanov, Yury S; Gualtieri, Maxime
2018-04-05
Growing resistance of pathogenic bacteria and shortage of antibiotic discovery platforms challenge the use of antibiotics in the clinic. This threat calls for exploration of unconventional sources of antibiotics and identification of inhibitors able to eradicate resistant bacteria. Here we describe a different class of antibiotics, odilorhabdins (ODLs), produced by the enzymes of the non-ribosomal peptide synthetase gene cluster of the nematode-symbiotic bacterium Xenorhabdus nematophila. ODLs show activity against Gram-positive and Gram-negative pathogens, including carbapenem-resistant Enterobacteriaceae, and can eradicate infections in animal models. We demonstrate that the bactericidal ODLs interfere with protein synthesis. Genetic and structural analyses reveal that ODLs bind to the small ribosomal subunit at a site not exploited by current antibiotics. ODLs induce miscoding and promote hungry codon readthrough, amino acid misincorporation, and premature stop codon bypass. We propose that ODLs' miscoding activity reflects their ability to increase the affinity of non-cognate aminoacyl-tRNAs to the ribosome. Copyright © 2018 Elsevier Inc. All rights reserved.
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).
Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang
2016-07-01
The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Reynolds, Sara E; Earl, Patricia L; Minai, Mahnaz; Moore, Ian; Moss, Bernard
2017-01-15
Most poxviruses encode a homolog of a ~200,000-kDa membrane protein originally identified in variola virus. We investigated the importance of the ectromelia virus (ECTV) homolog C15 in a natural infection model. In cultured mouse cells, the replication of a mutant virus with stop codons near the N-terminus (ECTV-C15Stop) was indistinguishable from a control virus (ECTV-C15Rev). However, for a range of doses injected into the footpads of BALB/c mice there was less mortality with the mutant. Similar virus loads were present at the site of infection with mutant or control virus whereas there was less ECTV-C15Stop in popliteal and inguinal lymph nodes, spleen and liver indicating decreased virus spread and replication. The latter results were supported by immunohistochemical analyses. Decreased spread was evidently due to immune modulatory activity of C15, rather than to an intrinsic viral function, as the survival of infected mice depended on CD4+ and CD8+ T cells. Published by Elsevier Inc.
Wen, Wanqing; Cai, Qiuyin; Shu, Xiao-Ou; Cheng, Jia-Rong; Parl, Fritz; Pierce, Larry; Gao, Yu-Tang; Zheng, Wei
2005-02-01
Cytochrome P450 1B1 (CYP1B1) and catechol-O-methyltransferase (COMT) are important estrogen-metabolizing enzymes and, thus, genetic polymorphisms of these enzymes may affect breast cancer risk. A population-based case-control study was conducted to assess the association of breast cancer risk with CYP1B1 and COMT polymorphisms. A meta-analysis was done to summarize the findings from this and previous studies. Included in this study were 1,135 incident breast cancer cases diagnosed from August 1996 through March 1998 among female residents of Shanghai and 1,235 randomly selected, age frequency-matched controls from the same general population. The common alleles of the CYP1B1 gene were Arg (79.97%) in codon 48, Ala (80.53%) in codon 119, and Leu (86.57%) in codon 432. The Val allele accounted for 72.46% of the total alleles identified in codon 108/158 of the COMT gene. No overall associations of breast cancer risk were found with any of the single nucleotide polymorphisms described above. This finding was supported by a meta-analysis of all previous published studies. No gene-gene interactions were observed between CYP1B1 and COMT genotypes. The associations of breast cancer risk with factors related to endogenous estrogen exposure, such as years of menstruation and body mass index, were not significantly modified by the CYP1B1 and COMT genotypes. We observed, however, that women who carried one copy of the variant allele in CYP1B1 codons 48 or 119 were less likely to have estrogen receptor-positive breast cancer than those who carried two copies of the corresponding wild-type alleles. The results from this study were consistent with those from most previous studies, indicating no major associations of breast cancer risk with CYP1B1 and COMT polymorphisms.
Contribution of DNA Repair Xeroderma Pigmentosum Group D Genotype to Gastric Cancer Risk in Taiwan.
Ji, Hong-Xue; Chang, Wen-Shin; Tsai, Chia-Wen; Wang, Ju-Yu; Huang, Nai-Kuei; Lee, An-Sheng; Shen, Ming-Yi; Chen, Wei-Yu; Chiang, Yao-Chang; Shih, Tzu-Ching; Hsu, Chin-Mu; Bau, Da-Tian
2015-09-01
It has been proposed that genetic variations of DNA repair genes confer susceptibility to cancer, and the DNA repair gene xeroderma pigmentosum group D (XPD), the caretaker of genome stability, is thought to play a major role in the nucleotide excision repair system. We investigated three genotypes of XPD, at promoter -114 (rs3810366), and codon 312 (rs1799793), 751 (rs13181), and their associated with gastric cancer susceptibility in a Taiwanese population. In the present study, 121 patients with gastric cancer and 363 gender- and age-matched healthy controls were recruited and genotyped for XPD by polymerase chain reaction-based restriction fragment length polymorphism (PCR-RFLP) methodology, and the association of XPD genotype with gastric cancer risk was investigated. We found a significant difference in the distribution of A allele-bearing XPD codon 312 genotypes [odds ratio (OR)=1.64, 95% confidence interval (CI)=1.20-2.25, p=0.0019], but not in XPD codon 751 or promoter -114 sites, between the gastric cancer and control groups. Those who had G/A or A/A at XPD codon 312 had a 1.83-fold (95% CI=1.14-2.95, p=0.0159) and 1.87-fold (95% CI=1.04-3.34, p=0.0378) increased risk of gastric cancer compared to those with G/G. The risk for G/A and A/A genotypes had synergistic effects with alcohol drinking (OR=11.27, 95% CI=3.72-34.17, p=0.0001), cigarette smoking (OR=23.20, 95% CI=6.24-86.23, p=0.0001) and Helicobacter pylori infection (OR=5.38, 95% CI=2.76-10.52, p=0.0001) on gastric cancer susceptibility. Our findings suggest that the A allele of XPD codon 312 may contribute to gastric carcinogenesis and may be useful for early detection and prevention of gastric cancer. Copyright© 2015 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Ali, Asho; Hasan, Zahra; McNerney, Ruth; Mallard, Kim; Hill-Cawthorne, Grant; Coll, Francesc; Nair, Mridul; Pain, Arnab; Clark, Taane G; Hasan, Rumina
2015-01-01
Improved molecular diagnostic methods for detection drug resistance in Mycobacterium tuberculosis (MTB) strains are required. Resistance to first- and second- line anti-tuberculous drugs has been associated with single nucleotide polymorphisms (SNPs) in particular genes. However, these SNPs can vary between MTB lineages therefore local data is required to describe different strain populations. We used whole genome sequencing (WGS) to characterize 37 extensively drug-resistant (XDR) MTB isolates from Pakistan and investigated 40 genes associated with drug resistance. Rifampicin resistance was attributable to SNPs in the rpoB hot-spot region. Isoniazid resistance was most commonly associated with the katG codon 315 (92%) mutation followed by inhA S94A (8%) however, one strain did not have SNPs in katG, inhA or oxyR-ahpC. All strains were pyrazimamide resistant but only 43% had pncA SNPs. Ethambutol resistant strains predominantly had embB codon 306 (62%) mutations, but additional SNPs at embB codons 406, 378 and 328 were also present. Fluoroquinolone resistance was associated with gyrA 91-94 codons in 81% of strains; four strains had only gyrB mutations, while others did not have SNPs in either gyrA or gyrB. Streptomycin resistant strains had mutations in ribosomal RNA genes; rpsL codon 43 (42%); rrs 500 region (16%), and gidB (34%) while six strains did not have mutations in any of these genes. Amikacin/kanamycin/capreomycin resistance was associated with SNPs in rrs at nt1401 (78%) and nt1484 (3%), except in seven (19%) strains. We estimate that if only the common hot-spot region targets of current commercial assays were used, the concordance between phenotypic and genotypic testing for these XDR strains would vary between rifampicin (100%), isoniazid (92%), flouroquinolones (81%), aminoglycoside (78%) and ethambutol (62%); while pncA sequencing would provide genotypic resistance in less than half the isolates. This work highlights the importance of expanded targets for drug resistance detection in MTB isolates.
Pi, J; Wookey, P J; Pittard, A J
1991-01-01
The phenylalanine-specific permease gene (pheP) of Escherichia coli has been cloned and sequenced. The gene was isolated on a 6-kb Sau3AI fragment from a chromosomal library, and its presence was verified by complementation of a mutant lacking the functional phenylalanine-specific permease. Subcloning from this fragment localized the pheP gene on a 2.7-kb HindIII-HindII fragment. The nucleotide sequence of this 2.7-kb region was determined. An open reading frame was identified which extends from a putative start point of translation (GTG at position 636) to a termination signal (TAA at position 2010). The assignment of the GTG as the initiation codon was verified by site-directed mutagenesis of the initiation codon and by introducing a chain termination mutation into the pheP-lacZ fusion construct. A single initiation site of transcription 30 bp upstream of the start point of translation was identified by the primer extension analysis. The pheP structural gene consists of 1,374 nucleotides specifying a protein of 458 amino acid residues. The PheP protein is very hydrophobic (71% nonpolar residues). A topological model predicted from the sequence analysis defines 12 transmembrane segments. This protein is highly homologous with the AroP (general aromatic transport) system of E. coli (59.6% identity) and to a lesser extent with the yeast permeases CAN1 (arginine), PUT4 (proline), and HIP1 (histidine) of Saccharomyces cerevisiae. Images PMID:1711024
Bergeron, Danny; Lapointe, Catherine; Bissonnette, Cyntia; Tremblay, Guillaume; Motard, Julie; Roucou, Xavier
2013-01-01
Spinocerebellar ataxia type 1 is an autosomal dominant cerebellar ataxia associated with the expansion of a polyglutamine tract within the ataxin-1 (ATXN1) protein. Recent studies suggest that understanding the normal function of ATXN1 in cellular processes is essential to decipher the pathogenesis mechanisms in spinocerebellar ataxia type 1. We found an alternative translation initiation ATG codon in the +3 reading frame of human ATXN1 starting 30 nucleotides downstream of the initiation codon for ATXN1 and ending at nucleotide 587. This novel overlapping open reading frame (ORF) encodes a 21-kDa polypeptide termed Alt-ATXN1 (Alternative ATXN1) with a completely different amino acid sequence from ATXN1. We introduced a hemagglutinin tag in-frame with Alt-ATXN1 in ATXN1 cDNA and showed in cell culture the co-expression of both ATXN1 and Alt-ATXN1. Remarkably, Alt-ATXN1 colocalized and interacted with ATXN1 in nuclear inclusions. In contrast, in the absence of ATXN1 expression, Alt-ATXN1 displays a homogenous nucleoplasmic distribution. Alt-ATXN1 interacts with poly(A)+ RNA, and its nuclear localization is dependent on RNA transcription. Polyclonal antibodies raised against Alt-ATXN1 confirmed the expression of Alt-ATXN1 in human cerebellum expressing ATXN1. These results demonstrate that human ATXN1 gene is a dual coding sequence and that ATXN1 interacts with and controls the subcellular distribution of Alt-ATXN1. PMID:23760502
Linder, P; Dölz, R; Mossé, M O; Lazowska, J; Slonimski, P P
1993-01-01
The amount of nucleotide sequence data is increasing exponentially. We therefore made an effort to make a comprehensive database (LISTA) for the yeast Saccharomyces cerevisiae. Each sequence has been attributed a single genetic name and in the case of allelic duplicated sequences, synonyms are given, if necessary. For the nomenclature we have introduced a standard principle for naming gene sequences based on priority rules. We have also applied a simple method to distinguish duplicated sequences of one and the same gene from non-allelic sequences of duplicated genes. By using these principles we have sorted out a lot of confusion in the literature and databanks. Along with the genetic name, the mnemonic from the EMBL databank, the codon bias, reference of the publication of the sequence and the EMBL accession numbers are included in each entry. PMID:8332521
Chiusano, M L; D'Onofrio, G; Alvarez-Valin, F; Jabbari, K; Colonna, G; Bernardi, G
1999-09-30
We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.
Ancient DNA sequence revealed by error-correcting codes.
Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo
2015-07-10
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes
Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo
2015-01-01
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Modeling discrete combinatorial systems as alphabetic bipartite networks: theory and applications.
Choudhury, Monojit; Ganguly, Niloy; Maiti, Abyayananda; Mukherjee, Animesh; Brusch, Lutz; Deutsch, Andreas; Peruani, Fernando
2010-03-01
Genes and human languages are discrete combinatorial systems (DCSs), in which the basic building blocks are finite sets of elementary units: nucleotides or codons in a DNA sequence, and letters or words in a language. Different combinations of these finite units give rise to potentially infinite numbers of genes or sentences. This type of DCSs can be represented as an alphabetic bipartite network (ABN) where there are two kinds of nodes, one type represents the elementary units while the other type represents their combinations. Here, we extend and generalize recent analytical findings for ABNs derived in [Peruani, Europhys. Lett. 79, 28001 (2007)] and empirically investigate two real world systems in terms of ABNs, the codon gene and the phoneme-language network. The one-mode projections onto the elementary basic units are also studied theoretically as well as in real world ABNs. We propose the use of ABNs as a means for inferring the mechanisms underlying the growth of real world DCSs.
Markov-modulated Markov chains and the covarion process of molecular evolution.
Galtier, N; Jean-Marie, A
2004-01-01
The covarion (or site specific rate variation, SSRV) process of biological sequence evolution is a process by which the evolutionary rate of a nucleotide/amino acid/codon position can change in time. In this paper, we introduce time-continuous, space-discrete, Markov-modulated Markov chains as a model for representing SSRV processes, generalizing existing theory to any model of rate change. We propose a fast algorithm for diagonalizing the generator matrix of relevant Markov-modulated Markov processes. This algorithm makes phylogeny likelihood calculation tractable even for a large number of rate classes and a large number of states, so that SSRV models become applicable to amino acid or codon sequence datasets. Using this algorithm, we investigate the accuracy of the discrete approximation to the Gamma distribution of evolutionary rates, widely used in molecular phylogeny. We show that a relatively large number of classes is required to achieve accurate approximation of the exact likelihood when the number of analyzed sequences exceeds 20, both under the SSRV and among site rate variation (ASRV) models.
Shannon Entropy of the Canonical Genetic Code
NASA Astrophysics Data System (ADS)
Nemzer, Louis
The probability that a non-synonymous point mutation in DNA will adversely affect the functionality of the resultant protein is greatly reduced if the substitution is conservative. In that case, the amino acid coded by the mutated codon has similar physico-chemical properties to the original. Many simplified alphabets, which group the 20 common amino acids into families, have been proposed. To evaluate these schema objectively, we introduce a novel, quantitative method based on the inherent redundancy in the canonical genetic code. By calculating the Shannon information entropy carried by 1- or 2-bit messages, groupings that best leverage the robustness of the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. In addition, this approach allows us to quantify the average information value of nucleotide codon positions, and explore the physiological basis for distinguishing between transition and transversion mutations. Supported by NSU PFRDG Grant #335347.
Complete mitochondrial genome of the mottled skate: Raja pulchra (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Myoung, Jung-Goo; Lee, Youn-Ho
2016-05-01
The complete sequence of mitochondrial DNA of a mottled skate, Raja pulchra was sequenced as being circular molecules of 16,907 bp including 2 rRNA, 22 tRNA, 13 protein-coding genes (PCGs), and an AT-rich control region. The organization of the PCGs is the same as those found in other Rajidae species. The nucleotide of L-strand is composed of 29.8% A, 28.0% C, 27.9% T, and 14.3% G with a bias toward A + T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of [Formula: see text] which has a reduced DHU arm. This mitogenome will provide essential information for better phylogenetic resolution and precision of the family Rajidae and the genus Raja as well as for establishment of a fish stock recovery plan of the species.
Nucleotide sequence and genetic organization of barley stripe mosaic virus RNA gamma.
Gustafson, G; Hunter, B; Hanau, R; Armour, S L; Jackson, A O
1987-06-01
The complete nucleotide sequences of RNA gamma from the Type and ND18 strains of barley stripe mosaic virus (BSMV) have been determined. The sequences are 3164 (Type) and 2791 (ND18) nucleotides in length. Both sequences contain a 5'-noncoding region (87 or 88 nucleotides) which is followed by a long open reading frame (ORF1). A 42-nucleotide intercistronic region separates ORF1 from a second, shorter open reading frame (ORF2) located near the 3'-end of the RNA. There is a high degree of homology between the Type and ND18 strains in the nucleotide sequence of ORF1. However, the Type strain contains a 366 nucleotide direct tandem repeat within ORF1 which is absent in the ND18 strain. Consequently, the predicted translation product of Type RNA gamma ORF1 (mol wt 87,312) is significantly larger than that of ND18 RNA gamma ORF1 (mol wt 74,011). The amino acid sequence of the ORF1 polypeptide contains homologies with putative RNA polymerases from other RNA viruses, suggesting that this protein may function in replication of the BSMV genome. The nucleotide sequence of RNA gamma ORF2 is nearly identical in the Type and ND18 strains. ORF2 codes for a polypeptide with a predicted molecular weight of 17,209 (Type) or 17,074 (ND18) which is known to be translated from a subgenomic (sg) RNA. The initiation point of this sgRNA has been mapped to a location 27 nucleotides upstream of the ORF2 initiation codon in the intercistronic region between ORF1 and ORF2. The sgRNA is not coterminal with the 3'-end of the genomic RNA, but instead contains heterogeneous poly(A) termini up to 150 nucleotides long (J. Stanley, R. Hanau, and A. O. Jackson, 1984, Virology 139, 375-383). In the genomic RNA gamma, ORF2 is followed by a short poly(A) tract and a 238-nucleotide tRNA-like structure.
Two alternative ways of start site selection in human norovirus reinitiation of translation.
Luttermann, Christine; Meyers, Gregor
2014-04-25
The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.
NASA Technical Reports Server (NTRS)
Lacey, J. C., Jr.; Mullins, D. W., Jr.; Watkins, C. L.; Hall, L. M.
1986-01-01
Cellular organisms store information as sequences of nucleotides in double stranded DNA. This information is useless unless it can be converted into the active molecular species, protein. This is done in contemporary creatures first by transcription of one strand to give a complementary strand of mRNA. The sequence of nucleotides is then translated into a specific sequence of amino acids in a protein. Translation is made possible by a genetic coding system in which a sequence of three nucleotides codes for a specific amino acid. The origin and evolution of any chemical system can be understood through elucidation of the properties of the chemical entities which make up the system. There is an underlying logic to the coding system revealed by a correlation of the hydrophobicities of amino acids and their anticodonic nucleotides (i.e., the complement of the codon). Its importance lies in the fact that every amino acid going into protein synthesis must first be activated. This is universally accomplished with ATP. Past studies have concentrated on the chemistry of the adenylates, but more recently we have found, through the use of NMR, that we can observe intramolecular interactions even at low concentrations, between amino acid side chains and nucleotide base rings in these adenylates. The use of this type of compound thus affords a novel way of elucidating the manner in which amino acids and nucleotides interact with each other. In aqueous solution, when a hydrophobic amino acid is attached to the most hydrophobic nucleotide, AMP, a hydrophobic interaction takes place between the amino acid side chain and the adenine ring. The studies to be reported concern these hydrophobic interactions.
Grossman, M J; Lampen, J O
1987-01-01
The location of the repressor gene, blaI, for the beta-lactamase gene blaP of Bacillus licheniformis 749, on the 5' side of blaP, was confirmed by sequencing the bla region of the constitutive mutant 749/C. An amber stop codon, likely to result in a nonfunctional truncated repressor, was found at codon 32 of the 128 codon blaI open reading frame (ORF) located 5' to blaP. In order to study the DNA binding activity of the repressor, the structural gene for blaI, from strain 749, with its ribosome binding site was expressed using a two plasmid T7 RNA polymerase/promotor system (S. Tabor and C. C. Richardson. Proc. Natl. Acad. Sci. 82, 1074-1078 (1985). Heat induction of this system in Escherichia coli K38 resulted in the production of BlaI as 5-10% of the soluble cell protein. Repressor protein was then purified by ammonium sulfate fractionation and cation exchange chromatography. The sequence of the N-terminal 28 amino acid residues was determined and was as predicted from the DNA. Binding of BlaI to DNA was detected by the slower migration of protein DNA complexes during polyacrylamide gel electrophoresis. BlaI was shown to selectively bind DNA fragments carrying the promoter regions of blaI and blaP. Images PMID:3498148
Gai, Nan; Jiang, Chen; Zou, Yong-Yi; Zheng, Yu; Liang, De-Sheng; Wu, Ling-Qian
2016-07-01
Marinesco-Sjögren syndrome (MSS) is a rare autosomal recessive disorder, which is characterized by congenital cataracts, cerebellar ataxia, progressive muscle weakness, and delayed psychomotor development. SIL1, which is located at 5q31.2, is the only gene known to cause MSS. Dandy-Walker syndrome (DWS) is defined by hypoplasia, upward rotation of the cerebellar vermis, and cystic dilation of the fourth ventricle; however, its genetic pathogeny remains unclear. Here, we report a Chinese consanguineous family with MSS and DWS. Whole exome sequencing identified a novel nonstop mutation in SIL1. Sanger sequencing revealed that the mutation was segregated in this family according to a recessive mode of inheritance. We found that the mutation changed a stop codon (TGA) to an arginine codon (CGA), and no in-frame termination codon in the 3' untranslated region (UTR) of SIL1 could be found. The mRNA levels of SIL1 were decreased by 56.6% and 37.5% in immortalized lymphoblasts of the patients respectively; the protein levels of SIL1 were substantially decreased. This case study is the first report on Chinese MSS patients, MSS complicated by DWS, and a nonstop mutation in SIL1. Our findings imply the pathogenetic association between DWS and MSS. Copyright © 2016 Elsevier B.V. All rights reserved.
Holland, M J; Holland, J P; Thill, G P; Jackson, K A
1981-02-10
Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.
Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J
2008-01-28
Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of K-strategists, perhaps because domestication increases reproductive output.
Song, Fan; Shi, Aimin; Zhou, Xuguo; Cai, Wanzhi
2012-01-01
Background Nabidae, a family of predatory heteropterans, includes two subfamilies and five tribes. We previously reported the complete mitogenome of Alloeorhynchus bakeri, a representative of the tribe Prostemmatini in the subfamily Prostemmatinae. To gain a better understanding of architecture and evolution of mitogenome in Nabidae, mitogenomes of five species representing two tribes (Gorpini and Nabini) in the subfamily Nabinae were sequenced, and a comparative mitogenomic analysis of three nabid tribes in two subfamilies was carried out. Methodology/Principal Findings Nabid mitogenomes share a similar nucleotide composition and base bias, except for the control region, where differences are observed at the subfamily level. In addition, the pattern of codon usage is influenced by the GC content and consistent with the standard invertebrate mitochondrial genetic code and the preference for A+T-rich codons. The comparison among orthologous protein-coding genes shows that different genes have been subject to different rates of molecular evolution correlated with the GC content. The stems and anticodon loops of tRNAs are extremely conserved, and the nucleotide substitutions are largely restricted to TψC and DHU loops and extra arms, with insertion-deletion polymorphisms. Comparative analysis shows similar rates of substitution between the two rRNAs. Long non-coding regions are observed in most Gorpini and Nabini mtDNAs in-between trnI-trnQ and/or trnS2-nad1. The lone exception, Nabis apicalis, however, has lost three tRNAs. Overall, phylogenetic analysis using mitogenomic data is consistent with phylogenies constructed mainly form morphological traits. Conclusions/Significance This comparative mitogenomic analysis sheds light on the architecture and evolution of mitogenomes in the family Nabidae. Nucleotide diversity and mitogenomic traits are phylogenetically informative at subfamily level. Furthermore, inclusion of a broader range of samples representing various taxonomic levels is critical for the understanding of mitogenomic evolution in damsel bugs. PMID:23029320
BE-PLUS: a new base editing tool with broadened editing window and enhanced fidelity.
Jiang, Wen; Feng, Songjie; Huang, Shisheng; Yu, Wenxia; Li, Guanglei; Yang, Guang; Liu, Yajing; Zhang, Yu; Zhang, Lei; Hou, Yu; Chen, Jia; Chen, Jieping; Huang, Xingxu
2018-06-06
Base editor (BE), containing a cytidine deaminase and catalytically defective Cas9, has been widely used to perform base editing. However, the narrow editing window of BE limits its utility. Here, we developed a new editing technology named as base editor for programming larger C to U (T) scope (BE-PLUS) by fusing 10 copies of GCN4 peptide to nCas9(D10A) for recruiting scFv-APOBEC-UGI-GB1 to the target sites. The new system achieves base editing with a broadened window, resulting in an increased genome-targeting scope. Interestingly, the new system yielded much fewer unwanted indels and non-C-to-T conversions. We also demonstrated its potential use in gene disruption across the whole genome through induction of stop codons (iSTOP). Taken together, the BE-PLUS system offers a new editing tool with increased editing window and enhanced fidelity.
NASA Astrophysics Data System (ADS)
Mackiewicz, P.; Gierlik, A.; Kowalczuk, M.; Szczepanik, D.; Dudek, M. R.; Cebrat, S.
1999-12-01
We have analysed protein coding and intergenic sequences in the Borrelia burgdorferi (the Lyme disease bacterium) genome using different kinds of DNA walks. Genes occupying the leading strand of DNA have significantly different nucleotide composition from genes occupying the lagging strand. Nucleotide compositional bias of the two DNA strands reflects the aminoacid composition of proteins. 96% of genes coding for ribosomal proteins lie on the leading DNA strand, which suggests that the positions of these as well as other genes are non-random. In the B. burgdorferi genome, the asymmetry in intergenic DNA sequences is lower than the asymmetry in the third positions in codons. All these characters of the B. burgdorferi genome suggest that both replication-associated mutational pressure and recombination mechanisms have established the specific structure of the genome and now any recombination leading to inversion of a gene in respect to the direction of replication is forbidden. This property of the genome allows us to assume that it is in a steady state, which enables us to fix some parameters for simulations of DNA evolution.
Novel APC gene mutations associated with protein alteration in diffuse type gastric cancer.
Ghatak, Souvik; Chakraborty, Payel; Sarkar, Sandeep Roy; Chowdhury, Biswajit; Bhaumik, Arup; Kumar, Nachimuthu Senthil
2017-06-02
The role of adenomatous polyposis coli (APC) gene in mitosis might be critical for regulation of genomic stability and chromosome segregation. APC gene mutations have been associated to have a role in colon cancer and since gastric and colon tumors share some common genetic lesions, it is relevant to investigate the role of APC tumor suppressor gene in gastric cancer. We investigated for somatic mutations in the Exons 14 and 15 of APC gene from 40 diffuse type gastric cancersamples. Rabbit polyclonal anti-APC antibody was used, which detects the wild-type APC protein and was recommended for detection of the respective protein in human tissues. Cell cycle analysis was done from tumor and adjacent normal tissue. APC immunoreactivity showed positive expression of the protein in stages I, II, III and negative expression in Stages III and IV. Two novel deleterious variations (g.127576C > A, g.127583C > T) in exon 14 sequence were found to generate stop codon (Y622* and Q625*)in the tumor samples. Due to the generation of stop codon, the APC protein might be truncated and all the regulatory features could be lost which has led to the down-regulation of protein expression. Our results indicate that aneuploidy might occurdue to the codon 622 and 625 APC-driven gastric tumorigenesis, in agreement with our cell cycle analysis. The APC gene function in mitosis and chromosomal stability might be lost and G1 might be arrested with high quantity of DNA in the S phase. Six missense somatic mutations in tumor samples were detected in exon 15 A-B, twoof which showed pathological and disease causing effects based on SIFT, Polyphen2 and SNPs & GO score and were not previously reported in the literature or the public mutation databases. The two novel pathological somatic mutations (g.127576C > A, g.127583C > T) in exon 14 might be altering the protein expression leading to development of gastric cancer in the study population. Our study showed that mutations in the APC gene alter the protein expression and cell cycle regulation in diffuse type gastric adenocarcinoma.
Yokoyama, S; Watanabe, T; Murao, K; Ishikura, H; Yamaizumi, Z; Nishimura, S; Miyazawa, T
1985-01-01
Proton NMR analyses have been made to elucidate the conformational characteristics of modified nucleotides as found in the first position of the anticodon of tRNA [derivatives of 5-methyl-2-thiouridine 5'-monophosphate (pxm5s2U) and derivatives of 5-hydroxyuridine 5'-monophosphate (pxo5U)]. In pxm5s2U, the C3'-endo form is extraordinarily more stable than the C2'-endo form for the ribose ring, because of the combined effects of the 2-thiocarbonyl group and the 5-substituent. By contrast, in pxo5U, the C2'-endo form is much more stable than the C3'-endo form, because of the interaction between the 5-substituent and the 5'-phosphate group. The enthalpy differences between the C2'-endo form and the C3'-endo form have been obtained as 1.1, -0.7, and 0.1 kcal/mol (1 cal = 4.184 J) for pxm5s2U, pxo5U, and unmodified uridine 5'-monophosphate, respectively. These findings lead to the conclusion that xm5s2U in the first position of the anticodon exclusively takes the C3'-endo form to recognize adenosine (but not uridine) as the third letter of the codon, whereas xo5U takes the C2'-endo form as well as the C3'-endo form to recognize adenosine, guanosine, and uridine as the third letter of the codon on ribosome. Accordingly, the biological significance of such modifications of uridine to xm5s2U/xo5U is in the regulation of the conformational rigidity/flexibility in the first position of the anticodon so as to guarantee the correct and efficient translation of codons in protein biosynthesis. PMID:3860833
Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard
2017-10-10
Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.