Sample records for untranslated region introns

  1. Insertion of part of an intron into the 5[prime] untranslated region of a Caenorhabditis elegans gene converts it into a trans-spliced gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Conrad, R.; Thomas, J.; Spieth, J.

    In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less

  2. Linkage disequilibrium between polymorphisms at the 5{prime} untranslated region and intron 5 (Dde I) of the antithrombin III (ATIII) gene in the Chinese

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tay, J.S.H.; Liu, Y.; Low, P.S.

    A length polymorphism at the 5{prime} untranslated region of exon 1 and an RFLP (Dde I) in intron 5 (nt 160) of the ATIII gene were amplified by polymerase chain reaction with primers of published sequences. DNA fragments were size-fractionated by agarose gel electrophoresis (3% NuSieve and 1% Seakem GTG) and photographed over a UV transilluminator. A strong linkage disequilibrium was observed between these two polymorphisms of the ATIII gene in the Chinese ({chi}{sup 2} = 63.7; {triangle} 0.42, P < 0.001). The estimated frequencies of the three haplotypes were found to be 0.37 for SD+, 0.40 for LD+ andmore » 0.23 for LD-.« less

  3. Evaluation of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy.

    PubMed

    Stabej, Polona; Leegwater, Peter A; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; van Oost, Bernard A

    2005-03-01

    To evaluate the role of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy (DCM). 6 dogs with DCM, including 2 Doberman Pinschers, 2 Newfoundlands, and 2 Great Danes. All dogs had clinical signs of congestive heart failure, and a diagnosis of DCM was made on the basis of echocardiographic findings. Blood samples were collected from each dog, and genomic DNA was isolated by a salt extraction method. Specific oligonucleotides were designed to amplify the promoter, exon 1, the 5'-part of exon 2 including the complete coding region, and part of intron 1 of the canine phospholamban gene via polymerase chain reaction procedures. These regions were screened for mutations in DNA obtained from the 6 dogs with DCM. No mutations were identified in the promoter, 5' untranslated region, part of intron 1, part of the 3' untranslated region, and the complete coding region of the phospholamban gene in dogs with DCM. Results indicate that mutations in the phospholamban gene are not a frequent cause of DCM in Doberman Pinschers, Newfoundlands, and Great Danes.

  4. RRE: a tool for the extraction of non-coding regions surrounding annotated genes from genomic datasets.

    PubMed

    Lazzarato, F; Franceschinis, G; Botta, M; Cordero, F; Calogero, R A

    2004-11-01

    RRE allows the extraction of non-coding regions surrounding a coding sequence [i.e. gene upstream region, 5'-untranslated region (5'-UTR), introns, 3'-UTR, downstream region] from annotated genomic datasets available at NCBI. RRE parser and web-based interface are accessible at http://www.bioinformatica.unito.it/bioinformatics/rre/rre.html

  5. A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

    PubMed

    Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

    2017-03-01

    Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  6. Genome Analysis Reveals Interplay between 5′UTR Introns and Nuclear mRNA Export for Secretory and Mitochondrial Genes

    PubMed Central

    Cenik, Can; Chua, Hon Nian; Zhang, Hui; Tarnawsky, Stefan P.; Akef, Abdalla; Derti, Adnan; Tasan, Murat; Moore, Melissa J.; Palazzo, Alexander F.; Roth, Frederick P.

    2011-01-01

    In higher eukaryotes, messenger RNAs (mRNAs) are exported from the nucleus to the cytoplasm via factors deposited near the 5′ end of the transcript during splicing. The signal sequence coding region (SSCR) can support an alternative mRNA export (ALREX) pathway that does not require splicing. However, most SSCR–containing genes also have introns, so the interplay between these export mechanisms remains unclear. Here we support a model in which the furthest upstream element in a given transcript, be it an intron or an ALREX–promoting SSCR, dictates the mRNA export pathway used. We also experimentally demonstrate that nuclear-encoded mitochondrial genes can use the ALREX pathway. Thus, ALREX can also be supported by nucleotide signals within mitochondrial-targeting sequence coding regions (MSCRs). Finally, we identified and experimentally verified novel motifs associated with the ALREX pathway that are shared by both SSCRs and MSCRs. Our results show strong correlation between 5′ untranslated region (5′UTR) intron presence/absence and sequence features at the beginning of the coding region. They also suggest that genes encoding secretory and mitochondrial proteins share a common regulatory mechanism at the level of mRNA export. PMID:21533221

  7. TLR7 single-nucleotide polymorphisms in the 3' untranslated region and intron 2 independently contribute to systemic lupus erythematosus in Japanese women: a case-control association study

    PubMed Central

    2011-01-01

    Introduction The Toll-like receptor 7 (TLR7) gene, encoded on human chromosome Xp22.3, is crucial for type I interferon production. A recent multicenter study in East Asian populations, comprising Chinese, Korean and Japanese participants, identified an association of a TLR7 single-nucleotide polymorphism (SNP) located in the 3' untranslated region (3' UTR), rs3853839, with systemic lupus erythematosus (SLE), especially in males, although some difference was observed among the tested populations. To test whether additional polymorphisms contribute to SLE in Japanese, we systematically analyzed the association of TLR7 with SLE in a Japanese female population. Methods A case-control association study was conducted on eight tag SNPs in the TLR7 region, including rs3853839, in 344 Japanese females with SLE and 274 healthy female controls. Results In addition to rs3853839, two SNPs in intron 2, rs179019 and rs179010, which were in moderate linkage disequilibrium with each other (r2 = 0.53), showed an association with SLE (rs179019: P = 0.016, odds ratio (OR) 2.02, 95% confidence interval (95% CI) 1.15 to 3.54; rs179010: P = 0.018, OR 1.75, 95% CI 1.10 to 2.80 (both under the recessive model)). Conditional logistic regression analysis revealed that the association of the intronic SNPs and the 3' UTR SNP remained significant after we adjusted them for each other. When only the patients and controls carrying the risk genotypes at the 3' UTR SNPpositionwere analyzed, the risk of SLE was significantly increased when the individuals also carried the risk genotypes at both of the intronic SNPs (P = 0.0043, OR 2.45, 95% CI 1.31 to 4.60). Furthermore, the haplotype containing the intronic risk alleles in addition to the 3' UTR risk allele was associated with SLE under the recessive model (P = 0.016, OR 2.37, 95% CI 1.17 to 4.80), but other haplotypes were not associated with SLE. Conclusions The TLR7 intronic SNPs rs179019 and rs179010 are associated with SLE independently of the 3' UTR SNP rs3853839 in Japanese women. Our findings support a role of TLR7 in predisposition for SLE in Asian populations. PMID:21396113

  8. A var gene promoter implicated in severe malaria nucleates silencing and is regulated by 3’ untranslated region and intronic cis-elements

    PubMed Central

    Muhle, Rebecca A.; Adjalley, Sophie; Falkard, Brie; Nkrumah, Louis J.; Muhle, Michael E.; Fidock, David A.

    2009-01-01

    Questions surround the mechanism of mutually exclusive expression by which Plasmodium falciparum mediates activation and silencing of var genes. These encode PfEMP1 proteins, which function as cytoadherent and immunomodulatory molecules at the surface of parasitized erythrocytes. Current evidence suggests that promoter silencing by var introns might play a key role in var gene regulation. To evaluate the impact of cis-acting regulatory regions on var silencing, we generated P. falciparum lines in which luciferase was placed under the control of an UpsA var promoter. By utilizing the Bxb1 integrase system, these reporter cassettes were targeted to a genomic region that was not in apposition to var sub-telomeric domains. This eliminated possible effects from surrounding telomeric elements and removed the variability inherent in episomal systems. Studies with highly synchronized parasites revealed that the UpsA element possessed minimal activity in comparison with a heterologous (hrp3) promoter. This may well result from the integrated UpsA promoter being largely silenced by the neighboring cg6 promoter. Our analyses also revealed that the DownsA 3’ untranslated region further decreased the luciferase activity from both cassettes, whereas the var A intron repressed the UpsA promoter specifically. By applying multivariate analysis over the entire cell cycle, we confirmed the significance of these cis-elements and found the parasite stage to be the major factor regulating UpsA promoter activity. Additionally, we observed that the UpsA promoter was capable of nucleating reversible silencing that spread to a downstream promoter. We believe these studies are the first to analyze promoter activity of Group A var genes which have been implicated in severe malaria, and support the model that var introns can further suppress var expression. These data also suggest an important suppressive role for the DownsA terminator. Our findings imply the existence of multiple levels of var gene regulation in addition to intrinsic promoter-dependent silencing. PMID:19463825

  9. Interactions among catechol-O-methyltransferase genotype, parenting, and sex predict children’s internalizing symptoms and inhibitory control: Evidence for differential susceptibility

    PubMed Central

    SULIK, MICHAEL J.; EISENBERG, NANCY; SPINRAD, TRACY L.; LEMERY-CHALFANT, KATHRYN; SWANN, GREGORY; SILVA, KASSONDRA M.; REISER, MARK; STOVER, DARYN A.; VERRELLI, BRIAN C.

    2015-01-01

    We used sex, observed parenting quality at 18 months, and three variants of the catechol-O-methyltransferase gene (Val158Met [rs4680], intron1 [rs737865], and 3′-untranslated region [rs165599]) to predict mothers’ reports of inhibitory and attentional control (assessed at 42, 54, 72, and 84 months) and internalizing symptoms (assessed at 24, 30, 42, 48, and 54 months) in a sample of 146 children (79 male). Although the pattern for all three variants was very similar, Val158Met explained more variance in both outcomes than did intron1, the 3′-untranslated region, or a haplotype that combined all three catechol-O-methyltransferase variants. In separate models, there were significant three-way interactions among each of the variants, parenting, and sex, predicting the intercepts of inhibitory control and internalizing symptoms. Results suggested that Val158Met indexes plasticity, although this effect was moderated by sex. Parenting was positively associated with inhibitory control for methionine–methionine boys and for valine–valine/valine–methionine girls, and was negatively associated with internalizing symptoms for methionine–methionine boys. Using the “regions of significance” technique, genetic differences in inhibitory control were found for children exposed to high-quality parenting, whereas genetic differences in internalizing were found for children exposed to low-quality parenting. These findings provide evidence in support of testing for differential susceptibility across multiple outcomes. PMID:25159270

  10. The human serotonin 5-HT{sub 2C} receptor: Complete cDNA, genomic structure, and alternatively spliced variant

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun

    1996-08-01

    The complete 4775-nt cDNA encoding the human serotonin 5-HT{sub 2C} receptor (5-HT{sub 2C}R), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5{prime}-untranslated region and a 2670-nt 3{prime}-untranslated region. By using the cloned 5-HT{sub 2C}R cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT{sub 2C}R gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes.more » In addition, an alternatively spliced 5-HT{sub 2C}R RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio for the short isoform over the 5-HT{sub 2C}R RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT{sub 2C}R gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT{sub 2C}R gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5{prime}-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5{prime}-flanking 5-HT{sub 2C}R DNA directed the efficient expression of a luciferase reported gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that is contains a functional promoter. 69 refs., 8 figs., 1 tab.« less

  11. MAPT as a predisposing gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population

    PubMed Central

    Fang, Pu; Xu, Wenyuan; Wu, Chengsi; Zhu, Min; Li, Xiaobing; Hong, Daojun

    2013-01-01

    A previous study of European Caucasian patients with sporadic amyotrophic lateral sclerosis demonstrated that a polymorphism in the microtubule-associated protein Tau (MAPT) gene was significantly associated with sporadic amyotrophic lateral sclerosis pathogenesis. Here, we tested this association in 107 sporadic amyotrophic lateral sclerosis patients and 100 healthy controls from the Chinese Han population. We screened the mutation-susceptible regions of MAPT – the 3' and 5' untranslated regions as well as introns 9, 10, 11, and 12 – by direct sequencing, and identified 33 genetic variations. Two of these, 105788 A > G in intron 9 and 123972 T > A in intron 11, were not present in the control group. The age of onset in patients with the 105788 A > G and/or the 123972 T > A variant was younger than that in patients without either genetic variation. Moreover, the pa-tients with a genetic variation were more prone to bulbar palsy and breathing difficulties than those with the wild-type genotype. This led to a shorter survival period in patients with a MAPT genetic variant. Our study suggests that the MAPT gene is a potential risk gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population. PMID:25206632

  12. Extreme heterogeneity of polyadenylation sites in mRNAs encoding chloroplast RNA-binding proteins in Nicotiana plumbaginifolia.

    PubMed

    Klahre, U; Hemmings-Mieszczak, M; Filipowicz, W

    1995-06-01

    We have previously characterized nuclear cDNA clones encoding two RNA binding proteins, CP-RBP30 and CP-RBP-31, which are targeted to chloroplasts in Nicotiana plumbaginifolia. In this report we describe the analysis of the 3'-untranslated regions (3'-UTRs) in 22 CP-RBP30 and 8 CP-RBP31 clones which reveals that mRNAs encoding both proteins have a very complex polyadenylation pattern. Fourteen distinct poly(A) sites were identified among CP-RBP30 clones and four sites among the CP-RBP31 clones. The authenticity of the sites was confirmed by RNase A/T1 mapping of N. plumbaginifolia RNA. CP-RBP30 provides an extreme example of the heterogeneity known to be a feature of mRNA polyadenylation in higher plants. Using PCR we have demonstrated that CP-RBP genes in N. plumbaginifolia and N. sylvestris, in addition to the previously described introns interrupting the coding region, contain an intron located in the 3' non-coding part of the gene. In the case of the CP-RBP31, we have identified one polyadenylation event occurring in this intron.

  13. The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Paule Roth, M.; Malfroy, L.; Offer, C.

    1995-07-20

    Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequencesmore » within 3 introns. Another Alu element is located in the 3{prime}-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5{prime} of the transcription start and 1214 nucleotides 3{prime} of the poly(A) addition sites were also sequenced. The 5{prime}-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA){sub n} and tetranucleotide (TAAA){sub n} repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.« less

  14. Structure and polymorphism of the mouse prion protein gene.

    PubMed Central

    Westaway, D; Cooper, C; Turner, S; Da Costa, M; Carlson, G A; Prusiner, S B

    1994-01-01

    Missense mutations in the prion protein (PrP) gene, overexpression of the cellular isoform of PrP (PrPC), and infection with prions containing the scrapie isoform of PrP (PrPSc) all cause neurodegenerative disease. To understand better the physiology and expression of PrPC, we retrieved mouse PrP gene (Prn-p) yeast artificial chromosome (YAC), cosmid, phage, and cDNA clones. Physical mapping positions Prn-p approximately 300 kb from ecotropic virus integration site number 4 (Evi-4), compatible with failure to detect recombination between Prn-p and Evi-4 in genetic crosses. The Prn-pa allele encompasses three exons, with exons 1 and 2 encoding the mRNA 5' untranslated region. Exon 2 has no equivalent in the Syrian hamster and human PrP genes. The Prn-pb gene shares this intron/exon structure but harbors an approximately 6-kb deletion within intron 2. While the Prn-pb open reading frame encodes two amino acid substitutions linked to prolonged scrapie incubation periods, a deletion of intron 2 sequences also characterizes inbred strains such as RIII/S and MOLF/Ei with shorter incubation periods, making a relationship between intron 2 size and scrapie pathogenesis unlikely. The promoter regions of a and b Prn-p alleles include consensus Sp1 and AP-1 sites, as well as other conserved motifs which may represent binding sites for as yet unidentified transcription factors. Images PMID:7912827

  15. Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.

    PubMed

    Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi

    2007-12-01

    The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.

  16. Genetic variations in the MCT1 (SLC16A1) gene in the Chinese population of Singapore.

    PubMed

    Lean, Choo Bee; Lee, Edmund Jon Deoon

    2009-01-01

    MCT1(SLC16A1) is the first member of the monocarboxylate transporter (MCT) and its family is involved in the transportation of metabolically important monocarboxylates such as lactate, pyruvate, acetate and ketone bodies. This study identifies genetic variations in SLC16A1 in the ethnic Chinese group of the Singaporean population (n=95). The promoter, coding region and exon-intron junctions of the SLC16A1 gene encoding the MCT1 transporter were screened for genetic variation in the study population by DNA sequencing. Seven genetic variations of SLC16A1, including 4 novel ones, were found: 2 in the promoter region, 2 in the coding exons (both nonsynonymous variations), 2 in the 3' untranslated region (3'UTR) and 1 in the intron. Of the two mutations detected in the promoter region, the -363-855T>C is a novel mutation. The 1282G>A (Val(428)Ile) is a novel SNP and was found as heterozygotic in 4 subjects. The 1470T>A (Asp(490)Glu) was found to be a common polymorphism in this study. Lastly, IVS3-17A>C in intron 3 and 2258 (755)A>G in 3'UTR are novel mutations found to be common polymorphisms in the local Chinese population. To our knowledge, this is the first report of a comprehensive analysis on the MCT1 gene in any population.

  17. Structure of the 5' region of the Hst70 gene transcription unit: presence of an intron and multiple transcription initiation sites.

    PubMed Central

    Scieglinska, D; Widłak, W; Konopka, W; Poutanen, M; Rahman, N; Huhtaniemi, I; Krawczyk, Z

    2001-01-01

    The rat Hst70 gene and its mouse counterpart Hsp70.2 belong to the family of Hsp70 heat shock genes and are specifically expressed in male germ cells. Previous studies regarding the structure of the 5' region of the transcription unit of these genes as well as localization of the 'cis' elements conferring their testis-specific expression gave contradictory results [Widlak, Markkula, Krawczyk, Kananen and Huhtaniemi (1995) Biochim. Biophys. Acta 1264, 191-200; Dix, Rosario-Herrle, Gotoh, Mori, Goulding, Barret and Eddy (1996) Dev. Biol. 174, 310-321]. In the present paper we solve these controversies and show that the 5' untranslated region (UTR) of the Hst70 gene contains an intron which is localized similar to that of the mouse Hsp70.2 gene. Reverse transcriptase-mediated PCR, Northern blotting and RNase protection analysis revealed that the transcription initiation of both genes starts at two main distant sites, and one of them is localized within the intron. As a result two populations of Hst70 gene transcripts with similar sizes but different 5' UTR structures can be detected in total testicular RNA. Functional analysis of the Hst70 gene promoter in transgenic mice and transient transfection assays proved that the DNA fragment of approx. 360 bp localized upstream of the ATG transcription start codon is the minimal promoter required for testis-specific expression of the HST70/chloramphenicol acetyltransferase transgene. These experiments also suggest that the expression of the gene may depend on 'cis' regulatory elements localized within exon 1 and the intron sequences. PMID:11563976

  18. Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.

    PubMed

    Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L

    2011-08-01

    In the present study, we have determined the complete genomic sequence and analysed the intron polymorphism of partial HLA-B and HLA-C alleles in the Chinese Han population. Over 3.0 kb DNA fragments of HLA-B and HLA-C loci were amplified by polymerase chain reaction from partial 5' untranslated region to 3' noncoding region respectively, and then the amplified products were sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and IMGT/HLA database. Two novel alleles of HLA-B*52:01:01:02 and HLA-B*59:01:01:02 were identified, and the complete genomic sequence of HLA-B*52:01:01:01 was firstly reported. Totally 157 and 167 polymorphism positions were found in the full-length genomic sequence of HLA-B and HLA-C loci respectively. Our results suggested that many single nucleotide polymorphisms existed in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.

  19. Structural characterization of the FKHR gene and its rearrangement in alveolar rhabdomyosarcoma.

    PubMed

    Davis, R J; Bennicelli, J L; Macina, R A; Nycum, L M; Biegel, J A; Barr, F G

    1995-12-01

    The FKHR gene, which contains a forkhead DNA-binding motif, is fused to either PAX3 or PAX7 by the t(2;13) or t(1;13) translocation in alveolar rhabdomyosarcoma,respectively. These tumors express chimeric transcripts encoding the N-terminal portion of either PAX protein fused to the C-terminal portion of FKHR. To understand the structural basis and functional consequences of these translocations, we characterized the wild-type FKHR gene and its rearrangement in alveolar rhabdomyosarcomas. By isolating and analyzing phage, cosmid and YAC clones, we determined that FKHR consists of three exons spanning 140 kb and that several highly similar loci are present in other genomic regions. Exon 1 encodes the N-terminus of the forkhead domain and is embedded within demethylated CpG island. RNA analyses reveal FKHR transcripts initiate from a TATA-less promoter within this island. Exon 2 encodes the C-terminus of the forkhead domain and a transcription activation domain, whereas exon 3 encodes a large 3' untranslated region. The intron 1-exon 2 boundary precisely matches the FHKR fusion point in the chimeric transcripts found in alveolar rhabdomyosarcomas. Using pulsed-field and fluorescence in situ hybridization analyses, we demonstrate that the 130kb FKHR intron 1 is rearranged in t(2;13)-containing alveolar rhabdomyosarcomas. Our findings indicate that FKHR intron 1 provides a large target for DNA rearrangemnt. Rearrangement of this intron with PAX3 produces two important functional consequences: in-frame fusion of N-terminal PAX3 sequences to the FKHR transcriptional activation domain and disruption of the FKHR DNA binding domain.

  20. Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae).

    PubMed

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-09-19

    To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Mitochondrion-to-Chloroplast DNA Transfers and Intragenomic Proliferation of Chloroplast Group II Introns in Gloeotilopsis Green Algae (Ulotrichales, Ulvophyceae)

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-01-01

    Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298

  2. Isolation, expression, and chromosomal localization of the human mitochondrial capsule selenoprotein gene (MCSP)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aho, Hanne; Schwemmer, M.; Tessmann, D.

    1996-03-01

    The mitochondrial capsule selenoprotein (MCS) (HGMW-approved symbol MCSP) is one of three proteins that are important for the maintenance and stabilization of the crescent structure of the sperm mitochondria. We describe here the isolation of a cDNA, the exon-intron organization, the expression, and the chromosomal localization of the human MCS gene. Nucleotide sequence analysis of the human and mouse MCS cDNAs reveals that the 5{prime}- and 3{prime}-untranslated sequences are more conserved (71%) than the coding sequences (59%). The open reading frame encodes a 116-amino-acid protein and lacks the UGA codons, which have been reported to encode the selenocysteines in themore » N-terminal of the deduced mouse protein. The deduced human protein shows a low degree of amino acid sequence identity to the mouse protein. The deduced human protein shows a low degree of amino acid sequence identity to the mouse protein (39%). The most striking homology lies in the dicysteine motifs. Northern and Southern zooblot analyses reveal that the MCS gene in human, baboon, and bovine is more conserved than its counterparts in mouse and rat. The single intron in the human MCS gene is approximately 6 kb and interrupts the 5{prime}-untranslated region at a position equivalent to that in the mouse and rat genes. Northern blot and in situ hybridization experiments demonstrate that the expression of the human MCS gene is restricted to haploid spermatids. The human gene was assigned to q21 of chromosome 1. 30 refs., 9 figs.« less

  3. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kerr, J.M.; Fisher, L.W.; Termine, J.D.

    The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less

  4. Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

    NASA Astrophysics Data System (ADS)

    Hamid, Nur Athirah Abd; Ismail, Ismanizan

    2013-11-01

    Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.

  5. An engineered Streptomyces hygroscopicus aph 7" gene mediates dominant resistance against hygromycin B in Chlamydomonas reinhardtii.

    PubMed

    Berthold, Peter; Schmitt, Rüdiger; Mages, Wolfgang

    2002-12-01

    We have developed a positively selectable marker for the green alga Chlamydomonas reinhardtii using the Streptomyces hygroscopicus aminoglycoside phosphotransferase gene (aph7"). Its expression is controlled by C. reinhardtii regulatory elements, namely, the beta2-tubulin gene promoter in combination with the first intron and the 3' untranslated region of the small subunit of ribulose bisphosphate carboxylase, rbcS2. C. reinhardtii cell-wall deficient and wild-type strains were transformed at rates up to 5 x 10(-5) with two constructs, pHyg3 and pHyg4 (intron-less). Transformants selected on plates with 10 microg/ml hygromycin B exhibited diverse levels of resistance of up to 200 microg/ml that were stably maintained for at least seven months; they contained two to five copies of the construct integrated in their genomes. Transcription of the chimeric aph7" gene, correct splicing of the rbcS2 intron, and polyadenylation of the transcripts have been verified by sequencing of RT-PCR products. Average co-transformation rates using pHyg3 and a second selectable plasmid were about 11%. This advocates the hygromycin-resistance plasmid, pHyg3, as a new versatile tool for the transformation of a broad range of C. reinhardtii strains without the sustained need for using auxotrophic mutants as recipients.

  6. Identification of miR-2400 gene as a novel regulator in skeletal muscle satellite cells proliferation by targeting MYOG gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Wei Wei; College of Life Sciences and Agriculture & Forestry, Qiqihar University, Qiqihar, Heilongjiang 161006; Tong, Hui Li

    MicroRNAs play critical roles in skeletal muscle development as well as in regulation of muscle cell proliferation and differentiation. Previous study in our laboratory showed that the expression level of miR-2400, a novel and unique miRNA from bovine, had significantly changed in skeletal muscle-derived satellite cells (MDSCs) during differentiation, however, the function and expression pattern for miR-2400 in MDSCs has not been fully understood. In this report, we firstly identified that the expression levels of miR-2400 were down-regulated during MDSCs differentiation by stem-loop RT-PCR. Over-expression and inhibition studies demonstrated that miR-2400 promoted MDSCs proliferation by EdU (5-ethynyl-2′ deoxyuridine) incorporation assaymore » and immunofluorescence staining of Proliferating cell nuclear antigen (PCNA). Luciferase reporter assays showed that miR-2400 directly targeted the 3′ untranslated regions (UTRs) of myogenin (MYOG) mRNA. These data suggested that miR-2400 could promote MDSCs proliferation through targeting MYOG. Furthermore, we found that miR-2400, which was located within the eighth intron of the Wolf-Hirschhorn syndrome candidate 1-like 1 (WHSC1L1) gene, was down-regulated in MDSCs in a direct correlation with the WHSC1L1 transcript by Clustered regularly interspaced palindromic repeats interference (CRISPRi). In addition, these observations not only provided supporting evidence for the codependent expression of intronic miRNAs and their host genes in vitro, but also gave insight into the role of miR-2400 in MDSCs proliferation. - Highlights: • miR-2400 is a novel and unique miRNA from bovine. • miR-2400 could promote skeletal muscle satellite cells proliferation. • miR-2400 directly targeted the 3′ untranslated regions of MYOG mRNA. • miR-2400 could be coexpressed together with its host gene WHSC1L1.« less

  7. Genomic organization of the human gene (CA5) and pseudogene for mitochondrial carbonic anhydrase V and their localization to chromosomes 16q and 16p

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nagao, Yoshiro; Sly, W.S.; Batanian, J.R.

    1995-08-10

    Carbonic anhydrase V (CA V) is expressed in mitochondrial matrix in liver and several other tissues. It is of interest for its putative roles in providing bicarbonate to carbamoyl phosphate synthetase for ureagenesis and to pyruvate carboxylase for gluconeogenesis and its possible importance in explaining certain inherited metabolic disorders with hyperammonemia and hypoglycemia. Following the recent characterization of the cDNA for human CA V, we report the isolation of the human gene from two {lambda} genomic libraries and its characterization. The CA V gene (CA5) is approximately 50 kb long and contains 7 exons and 6 introns. The exon-intron boundariesmore » are found in positions identical to those determined for the previously described CA II, CA III, and CA VII genes. Like the CA VII gene, CA5 does not contain typical TATA and CAAT promoter elements in the 5{prime} flanking region but does contain a TTTAA sequence 147 nucleotides upstream of the initiation codon. CA5 also contains a 12-bp GT-rich segment beginning 13 bp downstream of the polyadenylation signal in the 3{prime} untranslated region of exon 7. FISH analysis allowed CA5 to be assigned to chromosome 16q24.3. An unprocessed pseudogene containing sequence homologous to exons 3-7 and introns 3-6 was also isolated and was assigned by FISH analysis to chromosome 16p11.2-p12. 22 refs., 4 figs., 1 tab.« less

  8. Widespread antisense transcription of Populus genome under drought.

    PubMed

    Yuan, Yinan; Chen, Su

    2018-06-06

    Antisense transcription is widespread in many genomes and plays important regulatory roles in gene expression. The objective of our study was to investigate the extent and functional relevance of antisense transcription in forest trees. We employed Populus, a model tree species, to probe the antisense transcriptional response of tree genome under drought, through stranded RNA-seq analysis. We detected nearly 48% of annotated Populus gene loci with antisense transcripts and 44% of them with co-transcription from both DNA strands. Global distribution of reads pattern across annotated gene regions uncovered that antisense transcription was enriched in untranslated regions while sense reads were predominantly mapped in coding exons. We further detected 1185 drought-responsive sense and antisense gene loci and identified a strong positive correlation between the expression of antisense and sense transcripts. Additionally, we assessed the antisense expression in introns and found a strong correlation between intronic expression and exonic expression, confirming antisense transcription of introns contributes to transcriptional activity of Populus genome under drought. Finally, we functionally characterized drought-responsive sense-antisense transcript pairs through gene ontology analysis and discovered that functional groups including transcription factors and histones were concordantly regulated at both sense and antisense transcriptional level. Overall, our study demonstrated the extensive occurrence of antisense transcripts of Populus genes under drought and provided insights into genome structure, regulation pattern and functional significance of drought-responsive antisense genes in forest trees. Datasets generated in this study serve as a foundation for future genetic analysis to improve our understanding of gene regulation by antisense transcription.

  9. Exon-Specific QTLs Skew the Inferred Distribution of Expression QTLs Detected Using Gene Expression Array Data

    PubMed Central

    Veyrieras, Jean-Baptiste; Gaffney, Daniel J.; Pickrell, Joseph K.; Gilad, Yoav; Stephens, Matthew; Pritchard, Jonathan K.

    2012-01-01

    Mapping of expression quantitative trait loci (eQTLs) is an important technique for studying how genetic variation affects gene regulation in natural populations. In a previous study using Illumina expression data from human lymphoblastoid cell lines, we reported that cis-eQTLs are especially enriched around transcription start sites (TSSs) and immediately upstream of transcription end sites (TESs). In this paper, we revisit the distribution of eQTLs using additional data from Affymetrix exon arrays and from RNA sequencing. We confirm that most eQTLs lie close to the target genes; that transcribed regions are generally enriched for eQTLs; that eQTLs are more abundant in exons than introns; and that the peak density of eQTLs occurs at the TSS. However, we find that the intriguing TES peak is greatly reduced or absent in the Affymetrix and RNA-seq data. Instead our data suggest that the TES peak observed in the Illumina data is mainly due to exon-specific QTLs that affect 3′ untranslated regions, where most of the Illumina probes are positioned. Nonetheless, we do observe an overall enrichment of eQTLs in exons versus introns in all three data sets, consistent with an important role for exonic sequences in gene regulation. PMID:22359548

  10. Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA

    PubMed Central

    Eden, E.; Brunak, S.

    2004-01-01

    Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723

  11. Physical structure and chromosomal localization of a gene encoding human p58[sup clk-1], a cell division control related protein kinase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eipers, P.G.

    1992-01-01

    The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less

  12. Three Drought-Responsive Members of the Nonspecific Lipid-Transfer Protein Gene Family in Lycopersicon pennellii Show Different Developmental Patterns of Expression1

    PubMed Central

    Treviño, Marcela B.; Connell, Mary A. O'

    1998-01-01

    Genomic clones of two nonspecific lipid-transfer protein genes from a drought-tolerant wild species of tomato (Lycopersicon pennellii Corr.) were isolated using as a probe a drought- and abscisic acid (ABA)-induced cDNA clone (pLE16) from cultivated tomato (Lycopersicon esculentum Mill.). Both genes (LpLtp1 and LpLtp2) were sequenced and their corresponding mRNAs were characterized; they are both interrupted by a single intron at identical positions and predict basic proteins of 114 amino acid residues. Genomic Southern data indicated that these genes are members of a small gene family in Lycopersicon spp. The 3′-untranslated regions from LpLtp1 and LpLtp2, as well as a polymerase chain reaction-amplified 3′-untranslated region from pLE16 (cross-hybridizing to a third gene in L. pennellii, namely LpLtp3), were used as gene-specific probes to describe expression in L. pennellii through northern-blot analyses. All LpLtp genes were exclusively expressed in the aerial tissues of the plant and all were drought and ABA inducible. Each gene had a different pattern of expression in fruit, and LpLtp1 and LpLtp2, unlike LpLtp3, were both primarily developmentally regulated in leaf tissue. Putative ABA-responsive elements were found in the proximal promoter regions of LpLtp1 and LpLtp2. PMID:9536064

  13. Structural features of diverse Pin-II proteinase inhibitor genes from Capsicum annuum.

    PubMed

    Mahajan, Neha S; Dewangan, Veena; Lomate, Purushottam R; Joshi, Rakesh S; Mishra, Manasi; Gupta, Vidya S; Giri, Ashok P

    2015-02-01

    The proteinase inhibitor (PI) genes from Capsicum annuum were characterized with respect to their UTR, introns and promoter elements. The occurrence of PIs with circularly permuted domain organization was evident. Several potato inhibitor II (Pin-II) type proteinase inhibitor (PI) genes have been analyzed from Capsicum annuum (L.) with respect to their differential expression during plant defense response. However, complete gene characterization of any of these C. annuum PIs (CanPIs) has not been carried out so far. Complete gene architectures of a previously identified CanPI-7 (Beads-on-string, Type A) and a member of newly isolated Bracelet type B, CanPI-69 are reported in this study. The 5' UTR (untranslated region), 3'UTR, and intronic sequences of both the CanPI genes were obtained. The genomic sequence of CanPI-7 exhibited, exon 1 (49 base pair, bp) and exon 2 (740 bp) interrupted by a 294-bp long type I intron. We noted the occurrence of three multi-domain PIs (CanPI-69, 70, 71) with circularly permuted domain organization. CanPI-69 was found to possess exon 1 (49 bp), exon 2 (551 bp) and a 584-bp long type I intron. The upstream sequence analysis of CanPI-7 and CanPI-69 predicted various transcription factor-binding sites including TATA and CAAT boxes, hormone-responsive elements (ABRELATERD1, DOFCOREZM, ERELEE4), and a defense-responsive element (WRKY71OS). Binding of transcription factors such as zinc finger motif MADS-box and MYB to the promoter regions was confirmed using electrophoretic mobility shift assay followed by mass spectrometric identification. The 3' UTR analysis for 25 CanPI genes revealed unique/distinct 3' UTR sequence for each gene. Structures of three domain CanPIs of type A and B were predicted and further analyzed for their attributes. This investigation of CanPI gene architecture will enable the better understanding of the genetic elements present in CanPIs.

  14. Molecular Dissection of a Major Gene Effect on a Quantitative Trait: The Level of Alcohol Dehydrogenase Expression in Drosophila Melanogaster

    PubMed Central

    Stam, L. F.; Laurie, C. C.

    1996-01-01

    A molecular mapping experiment shows that a major gene effect on a quantitative trait, the level of alcohol dehydrogenase expression in Drosophila melanogaster, is due to multiple polymorphisms within the Adh gene. These polymorphisms are located in an intron, the coding sequence, and the 3' untranslated region. Because of nonrandom associations among polymorphisms at different sites, the individual effects combine (in some cases epistatically) to produce ``superalleles'' with large effect. These results have implications for the interpretation of major gene effects detected by quantitative trait locus mapping methods. They show that large effects due to a single locus may be due to multiple associated polymorphisms (or sequential fixations in isolated populations) rather than individual mutations of large effect. PMID:8978044

  15. Multi-step splicing of sphingomyelin synthase linear and circular RNAs.

    PubMed

    Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V

    2018-05-15

    The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

    PubMed

    Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

    2015-05-15

    The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

  17. The Epstein–Barr virus nuclear protein SM is both a post-transcriptional inhibitor and activator of gene expression

    PubMed Central

    Ruvolo, Vivian; Wang, Eryu; Boyle, Sarah; Swaminathan, Sankar

    1998-01-01

    The Epstein–Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3′-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport. PMID:9671768

  18. The Epstein-Barr virus nuclear protein SM is both a post-transcriptional inhibitor and activator of gene expression.

    PubMed

    Ruvolo, V; Wang, E; Boyle, S; Swaminathan, S

    1998-07-21

    The Epstein-Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3'-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport.

  19. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    PubMed

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  20. RNomics in Drosophila melanogaster: identification of 66 candidates for novel non-messenger RNAs

    PubMed Central

    Yuan, Guozhong; Klämbt, Christian; Bachellerie, Jean-Pierre; Brosius, Jürgen; Hüttenhofer, Alexander

    2003-01-01

    By generating a specialised cDNA library from four different developmental stages of Drosophila melanogaster, we have identified 66 candidates for small non-messenger RNAs (snmRNAs) and have confirmed their expression by northern blot analysis. Thirteen of them were expressed at certain stages of D.melanogaster development, only. Thirty-five species belong to the class of small nucleolar RNAs (snoRNAs), divided into 15 members from the C/D subclass and 20 members from the H/ACA subclass, which mostly guide 2′-O-methylation and pseudouridylation, respectively, of rRNA and snRNAs. These also include two outstanding C/D snoRNAs, U3 and U14, both functioning as pre-rRNA chaperones. Surprisingly, the sequence of the Drosophila U14 snoRNA reflects a major change of function of this snoRNA in Diptera relative to yeast and vertebrates. Among the 22 snmRNAs lacking known sequence and structure motifs, five were located in intergenic regions, two in introns, five in untranslated regions of mRNAs, eight were derived from open reading frames, and two were transcribed opposite to an intron. Interestingly, detection of two RNA species from this group implies that certain snmRNA species are processed from alternatively spliced pre-mRNAs. Surprisingly, a few snmRNA sequences could not be found on the published D.melanogaster genome, which might suggest that more snmRNA genes (as well as mRNAs) are hidden in unsequenced regions of the genome. PMID:12736298

  1. Functional Genomics of Attention-Deficit/ Hyperactivity Disorder (ADHD) Risk Alleles on Dopamine Transporter Binding in ADHD and Healthy Control Subjects

    PubMed Central

    Spencer, Thomas J.; Biederman, Joseph; Faraone, Stephen V.; Madras, Bertha K.; Bonab, Ali A.; Dougherty, Darin D.; Batchelder, Holly; Clarke, Allison; Fischman, Alan J.

    2013-01-01

    Background The main aim of this study was to examine the relationship between dopamine transporter (DAT) binding in the striatum in individuals with and without attention-deficit/hyperactivity disorder (ADHD), attending to the 3′-untranslated region of the gene (3′-UTR) and intron8 variable number of tandem repeats (VNTR) polymorphisms of the DAT (SLC6A3) gene. Methods Subjects consisted of 68 psychotropic (including stimulant)-naïve and smoking-naïve volunteers between 18 and 55 years of age (ADHD n = 34; control subjects n = 34). Striatal DAT binding was measured with positron emission tomography with 11C altropane. Genotyping of the two DAT (SLC6A3) 3′-UTR and intron8 VNTRs used standard protocols. Results The gene frequencies of each of the gene polymorphisms assessed did not differ between the ADHD and control groups. The ADHD status (t = 2.99; p < .004) and 3′-UTR of SLC6A3 9 repeat carrier status (t = 2.74; p < .008) were independently and additively associated with increased DAT binding in the caudate. The ADHD status was associated with increased striatal (caudate) DAT binding regardless of 3′-UTR genotype, and 3′-UTR genotype was associated with increased striatal (caudate) DAT binding regardless of ADHD status. In contrast, there were no significant associations between polymorphisms of DAT intron8 or the 3′-UTR-intron8 haplotype with DAT binding. Conclusions The 3′-UTR but not intron8 VNTR genotypes were associated with increased DAT binding in both ADHD patients and healthy control subjects. Both ADHD status and the 3′-UTR polymorphism status had an additive effect on DAT binding. Our findings suggest that an ADHD risk polymorphism (3′-UTR) of SLC6A3 has functional consequences on central nervous system DAT binding in humans. PMID:23273726

  2. Structure of the human gene encoding the protein repair L-isoaspartyl (D-aspartyl) O-methyltransferase.

    PubMed

    DeVry, C G; Tsai, W; Clarke, S

    1996-11-15

    The protein L-isoaspartyl/D-aspartyl O-methyltransferase (EC 2.1.1.77) catalyzes the first step in the repair of proteins damaged in the aging process by isomerization or racemization reactions at aspartyl and asparaginyl residues. A single gene has been localized to human chromosome 6 and multiple transcripts arising through alternative splicing have been identified. Restriction enzyme mapping, subcloning, and DNA sequence analysis of three overlapping clones from a human genomic library in bacteriophage P1 indicate that the gene spans approximately 60 kb and is composed of 8 exons interrupted by 7 introns. Analysis of intron/exon splice junctions reveals that all of the donor and acceptor splice sites are in agreement with the mammalian consensus splicing sequence. Determination of transcription initiation sites by primer extension analysis of poly(A)+ mRNA from human brain identifies multiple start sites, with a major site 159 nucleotides upstream from the ATG start codon. Sequence analysis of the 5'-untranslated region demonstrates several potential cis-acting DNA elements including SP1, ETF, AP1, AP2, ARE, XRE, CREB, MED-1, and half-palindromic ERE motifs. The promoter of this methyltransferase gene lacks an identifiable TATA box but is characterized by a CpG island which begins approximately 723 nucleotides upstream of the major transcriptional start site and extends through exon 1 and into the first intron. These features are characteristic of housekeeping genes and are consistent with the wide tissue distribution observed for this methyltransferase activity.

  3. Analysis of PAC1 receptor gene variants in Caucasian and African American infants dying of sudden infant death syndrome.

    PubMed

    Barrett, Karlene T; Rodikova, Ekaterina; Weese-Mayer, Debra E; Rand, Casey M; Marazita, Mary L; Cooper, Margaret E; Berry-Kravis, Elizabeth M; Bech-Hansen, N Torben; Wilson, Richard J A

    2013-12-01

    Stress peptide, pituitary adenylate cyclase-activating polypeptide (PACAP), has been implicated in sudden infant death syndrome (SIDS). The aim of this exploratory study was to determine whether variants in the gene encoding the PACAP-specific receptor, PAC1, are associated with SIDS in Caucasian and African American infants. Polymerase chain reaction and Sanger DNA sequencing was used to compare variants in the 5'-untranslated region, exons and intron-exon boundaries of the PAC1 gene in 96 SIDS cases and 96 race- and gender-matched controls. The intron 3 variant, A/G: rs758995 (variant 'h'), and the intron 6 variant, C/T: rs10081254 (variant 'n'), were significantly associated with SIDS in Caucasians and African Americans, respectively (p < 0.05). Also associated with SIDS were interactions between the variants rs2302475 (variant 'i') in PAC1 and rs8192597 and rs2856966 in PACAP among Caucasians (p < 0.02) and rs2267734 (variant 'q') in PAC1 and rs1893154 in PACAP among African Americans (p < 0.01). However, none of these differences survived post hoc analysis. Overall, this study does not support a strong association between variants in the PAC1 gene and SIDS; however, a number of potential associations between race-specific variants and SIDS were identified that warrant targeted investigations in future studies. ©2013 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.

  4. Genetic polymorphisms of the drug-metabolizing enzyme cytochrome P450 3A5 in a Uyghur Chinese population.

    PubMed

    Chen, Zhengshuai; Li, Jingjie; Chen, Peng; Wang, Fengjiao; Zhang, Ning; Yang, Min; Jin, Tianbo; Chen, Chao

    2016-09-01

    1.  Detection of CYP3A5 variant alleles, and knowledge about their allelic frequency in Uyghur ethnic groups, is important to establish the clinical relevance of screening for these polymorphisms to optimize pharmacotherapy. 2. We used DNA sequencing to investigate the promoter, exons and surrounding introns, and 3'-untranslated region of the CYP3A5 gene in 96 unrelated healthy Uyghur individuals. We also used SIFT and PolyPhen-2 to predict the protein function of the novel non-synonymous mutation in CYP3A5 coding regions. 3. We found 24 different CYP3A5 polymorphisms in the Uyghur population, three of which were novel: the synonymous mutation 43C > T in exon 1, two mutations 32120C > G and 32245T > C in 3'-untranslated region, and we detected the allele frequencies of CYP3A5*1 and *3 as 64.58% and 35.42%, respectively. While no subjects with CYP3A5*6 were identified. Other identified genotypes included the heterozygous genotype 1A/3A (59.38%) and 1A/3E (11.46%), which lead to decreased enzyme activity. In addition, the frequency of haplotype "TTAGGT" was the most prevalent with 0.781. 4. Our data provide new information regarding CYP3A5 genetic polymorphisms in Uyghur individuals, which may help to improve individualization of drug therapy and offer a preliminary basis for more rational use of drugs.

  5. Germline EMSY sequence alterations in hereditary breast cancer and ovarian cancer families.

    PubMed

    Määttä, Kirsi M; Nurminen, Riikka; Kankuri-Tammilehto, Minna; Kallioniemi, Anne; Laasanen, Satu-Leena; Schleutker, Johanna

    2017-07-24

    BRCA1 and BRCA2 mutations explain approximately one-fifth of the inherited susceptibility in high-risk Finnish hereditary breast and ovarian cancer (HBOC) families. EMSY is located in the breast cancer-associated chromosomal region 11q13. The EMSY gene encodes a BRCA2-interacting protein that has been implicated in DNA damage repair and genomic instability. We analysed the role of germline EMSY variation in breast/ovarian cancer predisposition. The present study describes the first EMSY screening in patients with high familial risk for this disease. Index individuals from 71 high-risk, BRCA1/2-negative HBOC families were screened for germline EMSY sequence alterations in protein coding regions and exon-intron boundaries using Sanger sequencing and TaqMan assays. The identified variants were further screened in 36 Finnish HBOC patients and 904 controls. Moreover, one novel intronic deletion was screened in a cohort of 404 breast cancer patients unselected for family history. Haplotype block structure and the association of haplotypes with breast/ovarian cancer were analysed using Haploview. The functionality of the identified variants was predicted using Haploreg, RegulomeDB, Human Splicing Finder, and Pathogenic-or-Not-Pipeline 2. Altogether, 12 germline EMSY variants were observed. Two alterations were located in the coding region, five alterations were intronic, and five alterations were located in the 3'untranslated region (UTR). Variant frequencies did not significantly differ between cases and controls. The novel variant, c.2709 + 122delT, was detected in 1 out of 107 (0.9%) breast cancer patients, and the carrier showed a bilateral form of the disease. The deletion was absent in 897 controls (OR = 25.28; P = 0.1) and in 404 breast cancer patients unselected for family history. No haplotype was identified to increase the risk of breast/ovarian cancer. Functional analyses suggested that variants, particularly in the 3'UTR, were located within regulatory elements. The novel deletion was predicted to affect splicing regulatory elements. These results suggest that the identified EMSY variants are likely neutral at the population level. However, these variants may contribute to breast/ovarian cancer risk in single families. Additional analyses are warranted for rare novel intronic deletions and the 3'UTR variants predicted to have functional roles.

  6. Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs

    PubMed Central

    Song, Junfang; Duc, Céline; Storey, Kate G.; McLean, W. H. Irwin; Brown, Sara J.; Simpson, Gordon G.; Barton, Geoffrey J.

    2014-01-01

    The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3′ untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3′ polyadenylation sites to within +/− 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3′ UTR re-annotation (including extension of one 3′ UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data. PMID:24722185

  7. Discordant expression and variable numbers of neighboring GGA- and GAA-rich triplet repeats in the 3' untranslated regions of two groups of messenger RNAs encoded by the rat polymeric immunoglobulin receptor gene.

    PubMed Central

    Koch, K S; Gleiberman, A S; Aoki, T; Leffert, H L; Feren, A; Jones, A L; Fodor, E J

    1995-01-01

    An unusual S1-nuclease sensitive microsatellite (STMS) has been found in the single copy, rat polymeric immunoglobulin receptor gene (PIGR) terminal exon. In Fisher rats, elements within or beyond the STMS are expressed variably in the 3' untranslated regions (3'UTRs) of two 'Groups' of PIGR-encoded hepatic mRNAs (pIg-R) during liver regeneration. STMS elements include neighboring constant regions (a 60-bp d[GA]-rich tract with a chi-like octamer, followed by 15 tandem d[GGA] repeats) that merge directly with 36 or 39 tandem d[GAA] repeats (Fisher or Wistar strains, respectively) interrupted by d[AA] between their 5th-6th repeat units. The Wistar STMS is flanked upstream by two regions of nearly contiguous d[CA] or d[CT] repeats in the 3' end of intron 8; and downstream, by a 283 bp 'unit' containing several inversions at its 5' end, and two polyadenylation signals at its 3' end. The 283 nt unit is expressed in Group 1 pIg-R mRNAs; but it is absent in the Group 2 family so that their GAA repeats merge with their poly A tails. In contrast to genomic sequence, GGA triplet repeats are amplified (n > or = 24-26), whereas GAA triplet repeats are truncated variably (n < or = 9-37) and expressed uninterruptedly in both mRNA Groups. These results suggest that 3' end processing of the rat PIGR gene may involve misalignment, slippage and premature termination of RNA polymerase II. The function of this unusual processing and possible roles of chi-like octamers in quiescent or extrahepatic tissues are discussed. Images PMID:7739889

  8. Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing

    PubMed Central

    Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin

    2012-01-01

    Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633

  9. New genetic variants of LATS1 detected in urinary bladder and colon cancer.

    PubMed

    Saadeldin, Mona K; Shawer, Heba; Mostafa, Ahmed; Kassem, Neemat M; Amleh, Asma; Siam, Rania

    2014-01-01

    LATS1, the large tumor suppressor 1 gene, encodes for a serine/threonine kinase protein and is implicated in cell cycle progression. LATS1 is down-regulated in various human cancers, such as breast cancer, and astrocytoma. Point mutations in LATS1 were reported in human sarcomas. Additionally, loss of heterozygosity of LATS1 chromosomal region predisposes to breast, ovarian, and cervical tumors. In the current study, we investigated LATS1 genetic variations including single nucleotide polymorphisms (SNPs), in 28 Egyptian patients with either urinary bladder or colon cancers. The LATS1 gene was amplified and sequenced and the expression of LATS1 at the RNA level was assessed in 12 urinary bladder cancer samples. We report, the identification of a total of 29 variants including previously identified SNPs within LATS1 coding and non-coding sequences. A total of 18 variants were novel. Majority of the novel variants, 13, were mapped to intronic sequences and un-translated regions of the gene. Four of the five novel variants located in the coding region of the gene, represented missense mutations within the serine/threonine kinase catalytic domain. Interestingly, LATS1 RNA steady state levels was lost in urinary bladder cancerous tissue harboring four specific SNPs (16045 + 41736 + 34614 + 56177) positioned in the 5'UTR, intron 6, and two silent mutations within exon 4 and exon 8, respectively. This study identifies novel single-base-sequence alterations in the LATS1 gene. These newly identified variants could potentially be used as novel diagnostic or prognostic tools in cancer.

  10. Impact of genetic variants on haematopoiesis in patients with thrombocytopenia absent radii (TAR) syndrome.

    PubMed

    Manukjan, Georgi; Bösing, Hendrik; Schmugge, Markus; Strauß, Gabriele; Schulze, Harald

    2017-11-01

    Thrombocytopenia absent radii (TAR) syndrome is clearly defined by the combination of radial aplasia and reduced platelet counts. The genetics of TAR syndrome has recently been resolved and comprises a microdeletion on Chromosome 1 including the RBM8A gene and a single nucleotide polymorphism (SNP) either at the 5' untranslated region (5'UTR) or within the first intron of RBM8A. Although phenotypically readily diagnosed after birth, the genetic determination of particular SNPs in TAR syndrome harbours valuable information to evaluate disease severity and treatment decisions. Here, we present clinical data in a cohort of 38 patients and observed that platelet counts in individuals with 5'UTR SNP are significantly lower compared to patients bearing the SNP in intron 1. Moreover, elevated haemoglobin values could only be assessed in patients with 5'UTR SNP whereas white blood cell count is unaffected, indicating that frequently observed anaemia in TAR patients could also be SNP-dependent whereas leucocytosis does not correlate with genetic background. However, this report on a large cohort provides an overview of important haematological characteristics in TAR patients, facilitating evaluation of the various traits in this disease and indicating the importance of genetic validation for TAR syndrome. © 2017 John Wiley & Sons Ltd.

  11. Mutations in the Promoter Region of the Aldolase B Gene that cause Hereditary Fructose Intolerance

    PubMed Central

    Coffee, Erin M.; Tolan, Dean R.

    2010-01-01

    SUMMARY Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity. PMID:20882353

  12. Isolated familial somatotropinomas: clinical features and analysis of the MEN1 gene.

    PubMed

    De Menis, Ernesto; Prezant, Toni R

    2002-01-01

    Isolated familial somatotropinomas (IFS) rarely occurs in the absence of multiple endocrine neoplasia type I (MEN1) or the Carney complex. In the present study we report two Italian siblings affected by GH-secreting adenomas. There was no history of parental consanguinity. The sister presented at 18 years of age with secondary amenorrhea and acromegalic features and one of her two brothers presented with gigantism at the same age. Endocrinological investigations confirmed GH hypersecretion in both cases. Although a pituitary microadenoma was detected in both patients, transsphenoidal surgery was not successful. The sister received conventional radiotherapy and acromegaly is now considered controlled; the brother is being treated with octreotide LAR 30 mg monthly and the disease is considered clinically active. Patients, their parents and the unaffected brother underwent extensive evaluation, and no features of MEN1 or Carney complex were found. Analysis of polymorphic microsatellite markers from chromosome 11q13 (D11S599, D11S4945, D11S4939, D11S4938 and D11S987) showed that the acromegalic siblings had inherited different maternal chromosomes and shared the paternal chromosome. No pathogenic MEN1 sequence changes were detected by sequencing or dideoxy fingerprinting of the coding sequence (exons 2-10) and exon/intron junctions. Although mutations in the promoter, introns or untranslated regions of the MEN1 gene cannot be excluded, germline mutations within the coding region of this gene do not appear responsible for IFS in this family.

  13. A mutation in an alternative untranslated exon of hexokinase 1 associated with hereditary motor and sensory neuropathy -- Russe (HMSNR).

    PubMed

    Hantke, Janina; Chandler, David; King, Rosalind; Wanders, Ronald J A; Angelicheva, Dora; Tournev, Ivailo; McNamara, Elyshia; Kwa, Marcel; Guergueltcheva, Velina; Kaneva, Radka; Baas, Frank; Kalaydjieva, Luba

    2009-12-01

    Hereditary Motor and Sensory Neuropathy -- Russe (HMSNR) is a severe autosomal recessive disorder, identified in the Gypsy population. Our previous studies mapped the gene to 10q22-q23 and refined the gene region to approximately 70 kb. Here we report the comprehensive sequencing analysis and fine mapping of this region, reducing it to approximately 26 kb of fully characterised sequence spanning the upstream exons of Hexokinase 1 (HK1). We identified two sequence variants in complete linkage disequilibrium, a G>C in a novel alternative untranslated exon (AltT2) and a G>A in the adjacent intron, segregating with the disease in affected families and present in the heterozygote state in only 5/790 population controls. Sequence conservation of the AltT2 exon in 16 species with invariable preservation of the G allele at the mutated site, strongly favour the exonic change as the pathogenic mutation. Analysis of the Hk1 upstream region in mouse mRNA from testis and neural tissues showed an abundance of AltT2-containing transcripts generated by extensive, developmentally regulated alternative splicing. Expression is very low compared with ubiquitous Hk1 and all transcripts skip exon1, which encodes the protein domain responsible for binding to the outer mitochondrial membrane, and regulation of energy production and apoptosis. Hexokinase activity measurement and immunohistochemistry of the peripheral nerve showed no difference between patients and controls. The mutational mechanism and functional effects remain unknown and could involve disrupted translational regulation leading to increased anti-apoptotic activity (suggested by the profuse regenerative activity in affected nerves), or impairment of an unknown HK1 function in the peripheral nervous system (PNS).

  14. A mutation in an alternative untranslated exon of hexokinase 1 associated with Hereditary Motor and Sensory Neuropathy – Russe (HMSNR)

    PubMed Central

    Hantke, Janina; Chandler, David; King, Rosalind; Wanders, Ronald JA; Angelicheva, Dora; Tournev, Ivailo; McNamara, Elyshia; Kwa, Marcel; Guergueltcheva, Velina; Kaneva, Radka; Baas, Frank; Kalaydjieva, Luba

    2009-01-01

    Hereditary Motor and Sensory Neuropathy – Russe (HMSNR) is a severe autosomal recessive disorder, identified in the Gypsy population. Our previous studies mapped the gene to 10q22-q23 and refined the gene region to ∼70 kb. Here we report the comprehensive sequencing analysis and fine mapping of this region, reducing it to ∼26 kb of fully characterised sequence spanning the upstream exons of Hexokinase 1 (HK1). We identified two sequence variants in complete linkage disequilibrium, a G>C in a novel alternative untranslated exon (AltT2) and a G>A in the adjacent intron, segregating with the disease in affected families and present in the heterozygote state in only 5/790 population controls. Sequence conservation of the AltT2 exon in 16 species with invariable preservation of the G allele at the mutated site, strongly favour the exonic change as the pathogenic mutation. Analysis of the Hk1 upstream region in mouse mRNA from testis and neural tissues showed an abundance of AltT2-containing transcripts generated by extensive, developmentally regulated alternative splicing. Expression is very low compared with ubiquitous Hk1 and all transcripts skip exon1, which encodes the protein domain responsible for binding to the outer mitochondrial membrane, and regulation of energy production and apoptosis. Hexokinase activity measurement and immunohistochemistry of the peripheral nerve showed no difference between patients and controls. The mutational mechanism and functional effects remain unknown and could involve disrupted translational regulation leading to increased anti-apoptotic activity (suggested by the profuse regenerative activity in affected nerves), or impairment of an unknown HK1 function in the peripheral nervous system (PNS). PMID:19536174

  15. Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes.

    PubMed

    Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen

    2012-11-01

    Recent studies demonstrated that sequence divergence in both transcriptional regulatory region and coding region contributes to the subfunctionalization of duplicate gene. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may arise from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 expresses during blastula and gastrula stages and in adult retina but silences from segmentation stage to hatching stage, vsx1A1 starts expression from segmentation onward. Comparing to that zebrafish vsx1 expresses in all the developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are under going to share the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 remains but the 3'-UTR of vsx1A2 has lost the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.

  16. Discovery of SCORs: Anciently derived, highly conserved gene-associated repeats in stony corals.

    PubMed

    Qiu, Huan; Zelzion, Ehud; Putnam, Hollie M; Gates, Ruth D; Wagner, Nicole E; Adams, Diane K; Bhattacharya, Debashish

    2017-10-01

    Stony coral (Scleractinia) genomes are still poorly explored and many questions remain about their evolution and contribution to the success and longevity of reefs. We analyzed transcriptome and genome data from Montipora capitata, Acropora digitifera, and transcriptome data from 20 other coral species. To our surprise, we found highly conserved, anciently derived, Scleractinia COral-specific Repeat families (SCORs) that are abundant in all the studied lineages. SCORs form complex secondary structures and are located in untranslated regions and introns, but most abundant in intergenic DNA. These repeat families have undergone frequent duplication and degradation, suggesting a 'boom and bust' cycle of invasion and loss. We speculate that due to their surprisingly high sequence identities across deeply diverged corals, physical association with genes, and dynamic evolution, SCORs might have adaptive functions in corals that need to be explored using population genomic and function-based approaches. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Large-scale analysis of full-length cDNAs from the tomato (Solanum lycopersicum) cultivar Micro-Tom, a reference system for the Solanaceae genomics.

    PubMed

    Aoki, Koh; Yano, Kentaro; Suzuki, Ayako; Kawamura, Shingo; Sakurai, Nozomu; Suda, Kunihiro; Kurabayashi, Atsushi; Suzuki, Tatsuya; Tsugane, Taneaki; Watanabe, Manabu; Ooga, Kazuhide; Torii, Maiko; Narita, Takanori; Shin-I, Tadasu; Kohara, Yuji; Yamamoto, Naoki; Takahashi, Hideki; Watanabe, Yuichiro; Egusa, Mayumi; Kodama, Motoichiro; Ichinose, Yuki; Kikuchi, Mari; Fukushima, Sumire; Okabe, Akiko; Arie, Tsutomu; Sato, Yuko; Yazawa, Katsumi; Satoh, Shinobu; Omura, Toshikazu; Ezura, Hiroshi; Shibata, Daisuke

    2010-03-30

    The Solanaceae family includes several economically important vegetable crops. The tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. Recently, a number of tomato resources have been developed in parallel with the ongoing tomato genome sequencing project. In particular, a miniature cultivar, Micro-Tom, is regarded as a model system in tomato genomics, and a number of genomics resources in the Micro-Tom-background, such as ESTs and mutagenized lines, have been established by an international alliance. To accelerate the progress in tomato genomics, we developed a collection of fully-sequenced 13,227 Micro-Tom full-length cDNAs. By checking redundant sequences, coding sequences, and chimeric sequences, a set of 11,502 non-redundant full-length cDNAs (nrFLcDNAs) was generated. Analysis of untranslated regions demonstrated that tomato has longer 5'- and 3'-untranslated regions than most other plants but rice. Classification of functions of proteins predicted from the coding sequences demonstrated that nrFLcDNAs covered a broad range of functions. A comparison of nrFLcDNAs with genes of sixteen plants facilitated the identification of tomato genes that are not found in other plants, most of which did not have known protein domains. Mapping of the nrFLcDNAs onto currently available tomato genome sequences facilitated prediction of exon-intron structure. Introns of tomato genes were longer than those of Arabidopsis and rice. According to a comparison of exon sequences between the nrFLcDNAs and the tomato genome sequences, the frequency of nucleotide mismatch in exons between Micro-Tom and the genome-sequencing cultivar (Heinz 1706) was estimated to be 0.061%. The collection of Micro-Tom nrFLcDNAs generated in this study will serve as a valuable genomic tool for plant biologists to bridge the gap between basic and applied studies. The nrFLcDNA sequences will help annotation of the tomato whole-genome sequence and aid in tomato functional genomics and molecular breeding. Full-length cDNA sequences and their annotations are provided in the database KaFTom http://www.pgb.kazusa.or.jp/kaftom/ via the website of the National Bioresource Project Tomato http://tomato.nbrp.jp.

  18. Systematic Profiling of Poly(A)+ Transcripts Modulated by Core 3’ End Processing and Splicing Factors Reveals Regulatory Rules of Alternative Cleavage and Polyadenylation

    PubMed Central

    Li, Wencheng; You, Bei; Hoque, Mainul; Zheng, Dinghai; Luo, Wenting; Ji, Zhe; Park, Ji Yeon; Gunderson, Samuel I.; Kalsotra, Auinash; Manley, James L.; Tian, Bin

    2015-01-01

    Alternative cleavage and polyadenylation (APA) results in mRNA isoforms containing different 3’ untranslated regions (3’UTRs) and/or coding sequences. How core cleavage/polyadenylation (C/P) factors regulate APA is not well understood. Using siRNA knockdown coupled with deep sequencing, we found that several C/P factors can play significant roles in 3’UTR-APA. Whereas Pcf11 and Fip1 enhance usage of proximal poly(A) sites (pAs), CFI-25/68, PABPN1 and PABPC1 promote usage of distal pAs. Strong cis element biases were found for pAs regulated by CFI-25/68 or Fip1, and the distance between pAs plays an important role in APA regulation. In addition, intronic pAs are substantially regulated by splicing factors, with U1 mostly inhibiting C/P events in introns near the 5’ end of gene and U2 suppressing those in introns with features for efficient splicing. Furthermore, PABPN1 inhibits expression of transcripts with pAs near the transcription start site (TSS), a property possibly related to its role in RNA degradation. Finally, we found that groups of APA events regulated by C/P factors are also modulated in cell differentiation and development with distinct trends. Together, our results support an APA code where an APA event in a given cellular context is regulated by a number of parameters, including relative location to the TSS, splicing context, distance between competing pAs, surrounding cis elements and concentrations of core C/P factors. PMID:25906188

  19. Polypyrimidine Tract Binding Protein Homologs from Arabidopsis Are Key Regulators of Alternative Splicing with Implications in Fundamental Developmental Processes[W

    PubMed Central

    Rühl, Christina; Stauffer, Eva; Kahles, André; Wagner, Gabriele; Drechsel, Gabriele; Rätsch, Gunnar; Wachter, Andreas

    2012-01-01

    Alternative splicing (AS) generates transcript variants by variable exon/intron definition and massively expands transcriptome diversity. Changes in AS patterns have been found to be linked to manifold biological processes, yet fundamental aspects, such as the regulation of AS and its functional implications, largely remain to be addressed. In this work, widespread AS regulation by Arabidopsis thaliana Polypyrimidine tract binding protein homologs (PTBs) was revealed. In total, 452 AS events derived from 307 distinct genes were found to be responsive to the levels of the splicing factors PTB1 and PTB2, which predominantly triggered splicing of regulated introns, inclusion of cassette exons, and usage of upstream 5′ splice sites. By contrast, no major AS regulatory function of the distantly related PTB3 was found. Dependent on their position within the mRNA, PTB-regulated events can both modify the untranslated regions and give rise to alternative protein products. We find that PTB-mediated AS events are connected to diverse biological processes, and the functional implications of selected instances were further elucidated. Specifically, PTB misexpression changes AS of PHYTOCHROME INTERACTING FACTOR6, coinciding with altered rates of abscisic acid–dependent seed germination. Furthermore, AS patterns as well as the expression of key flowering regulators were massively changed in a PTB1/2 level-dependent manner. PMID:23192226

  20. A novel ENU-induced mutation, peewee, causes dwarfism in the mouse

    PubMed Central

    Bon-Ryon, Lee; Kano, Kiyoshi; Young, Jay; John, Simon; Nishina, Patsy M; Naggert, Jurgen K; Naito, Kunihiko

    2010-01-01

    We identified a novel fertile, autosomal recessive mutation, called peewee and that results in dwarfing, in a region-specific ENU-induced mutagenesis. These mice at litter size were smaller those of other strains. Histological analysis revealed that the major organs appear normal, but abnormalities in cellular proliferation were observed in bone, liver and testis. Haplotype analysis localized the peewee gene to a 3.3-Mb region between D5Mit83 and D5Mit356.3. There are 18 genes in this linkage area, and we also performed in silico mapping using the PosMed℠ program, which searches for connections among keywords and genes in an interval, but no similar phenotype descriptions were found for these genes. In the peewee mutant compared to the normal, C57BL/6J mouse, only Slc10a4 expression was lower. Our preliminary mutation analysis examining the nucleotide sequence of three exons, two introns and an untranslated region of Slc10a4 did not find any sequence difference between the peewee mouse and the C57BL/6J mouse. Detailed analysis of peewee mice might provide novel molecular insights into the complex mechanisms regulating body growth. PMID:19513787

  1. Integrated analyses of microRNAs demonstrate their widespread influence on gene expression in high-grade serous ovarian carcinoma.

    PubMed

    Creighton, Chad J; Hernandez-Herrera, Anadulce; Jacobsen, Anders; Levine, Douglas A; Mankoo, Parminder; Schultz, Nikolaus; Du, Ying; Zhang, Yiqun; Larsson, Erik; Sheridan, Robert; Xiao, Weimin; Spellman, Paul T; Getz, Gad; Wheeler, David A; Perou, Charles M; Gibbs, Richard A; Sander, Chris; Hayes, D Neil; Gunaratne, Preethi H

    2012-01-01

    The Cancer Genome Atlas (TCGA) Network recently comprehensively catalogued the molecular aberrations in 487 high-grade serous ovarian cancers, with much remaining to be elucidated regarding the microRNAs (miRNAs). Here, using TCGA ovarian data, we surveyed the miRNAs, in the context of their predicted gene targets. Integration of miRNA and gene patterns yielded evidence that proximal pairs of miRNAs are processed from polycistronic primary transcripts, and that intronic miRNAs and their host gene mRNAs derive from common transcripts. Patterns of miRNA expression revealed multiple tumor subtypes and a set of 34 miRNAs predictive of overall patient survival. In a global analysis, miRNA:mRNA pairs anti-correlated in expression across tumors showed a higher frequency of in silico predicted target sites in the mRNA 3'-untranslated region (with less frequency observed for coding sequence and 5'-untranslated regions). The miR-29 family and predicted target genes were among the most strongly anti-correlated miRNA:mRNA pairs; over-expression of miR-29a in vitro repressed several anti-correlated genes (including DNMT3A and DNMT3B) and substantially decreased ovarian cancer cell viability. This study establishes miRNAs as having a widespread impact on gene expression programs in ovarian cancer, further strengthening our understanding of miRNA biology as it applies to human cancer. As with gene transcripts, miRNAs exhibit high diversity reflecting the genomic heterogeneity within a clinically homogeneous disease population. Putative miRNA:mRNA interactions, as identified using integrative analysis, can be validated. TCGA data are a valuable resource for the identification of novel tumor suppressive miRNAs in ovarian as well as other cancers.

  2. Association of a 3' untranslated region polymorphism in proprotein convertase subtilisin/kexin type 9 with HIV viral load and CD4+ levels in HIV/hepatitis C virus coinfected women.

    PubMed

    Kuniholm, Mark H; Liang, Hua; Anastos, Kathryn; Gustafson, Deborah; Kassaye, Seble; Nowicki, Marek; Sha, Beverly E; Pawlowski, Emilia J; Gange, Stephen J; Aouizerat, Bradley E; Pushkarsky, Tatiana; Bukrinsky, Michael I; Prasad, Vinayaka R

    2017-11-28

    To assess variation in genes that regulate cholesterol metabolism in relation to the natural history of HIV infection. Cross-sectional and longitudinal analysis of the Women's Interagency HIV Study. We examined 2050 single nucleotide polymorphisms (SNPs) in 19 genes known to regulate cholesterol metabolism in relation to HIV viral load and CD4 T-cell levels in a multiracial cohort of 1066 antiretroviral therapy-naive women. Six SNPs were associated with both HIV viral load and CD4 T-cell levels at a false discovery rate of 0.01. Bioinformatics tools did not predict functional activity for five SNPs, located in introns of nuclear receptor corepressor 2, retinoid X receptor alpha (RXRA), and tetratricopeptide repeat domain 39B. Rs17111557 located in the 3' untranslated region of proprotein convertase subtilisin/kexin type 9 (PCSK9) putatively affects binding of hsa-miR-548t-5p and hsa-miR-4796-3p, which could regulate PCSK9 expression levels. Interrogation of rs17111557 revealed stronger associations in the subset of women with HIV/hepatitis C virus (HCV) coinfection (n = 408, 38% of women). Rs17111557 was also associated with low-density lipoprotein cholesterol levels in HIV/HCV coinfected (β: -10.4; 95% confidence interval: -17.9, -2.9; P = 0.007), but not in HIV monoinfected (β:1.2; 95% confidence interval: -6.3, 8.6; P = 0.76) women in adjusted analysis. PCSK9 polymorphism may affect HIV pathogenesis, particularly in HIV/HCV coinfected women. A likely mechanism for this effect is PCSK9-mediated regulation of cholesterol metabolism. Replication in independent cohorts is needed to clarify the generalizability of the observed associations.

  3. Identification of Polymorphisms in the 3′-Untranslated Region of the Human Pregnane X Receptor (PXR) Gene Associated with Variability in Cytochrome P450 3A (CYP3A) Metabolism

    PubMed Central

    Oleson, Lauren; von Moltke, Lisa L.; Greenblatt, David J.; Court, Michael H.

    2013-01-01

    Single nucleotide polymorphisms (SNPs) in the 3′untranslated region (3′UTR) of human pregnane X receptor (PXR) gene may contribute to interindividual variability in cytochrome P450 3A (CYP3A) activity. Genotype-phenotype associations involving PXR-3′UTR SNPs were investigated through in vitro (53 human livers from primarily white donors) and in vivo (26 white or African-American volunteers) studies using midazolam 1′-hydroxylation and midazolam apparent oral clearance (CL/F), respectively, as CYP3A-specific probes. PXR-3′UTR resequencing identified 12 SNPs, including 2 that were novel. Although none of the SNPs evaluated were associated with altered midazolam 1′-hydroxylation in the liver bank, both rs3732359 homozygotes and rs3732360 carriers showed 80% higher (P<0.05) CL/F compared with homozygous reference individuals. These differences in CL/F were even larger (100 and 120% higher, respectively; P<0.01) when only African-American subjects (n=14) were considered. Five major haplotypes were identified containing the PXR-3′UTR SNPs and previously identified intron SNPs. Although CL/F differences were not statistically significant within the entire study cohort, African-American carriers of Haplotype-1 (which includes both rs3732359 and rs3732360 variants) exhibited 70% higher median CL/F compared with African-American non-carriers (P=0.036). Our results identify rs3732359 and rs3732360 as PXR-3′UTR SNPs associated with higher CYP3A activity in vivo in African-Americans. PMID:20082578

  4. Hepatitis C virus genotypes in Singapore and Indonesia.

    PubMed

    Ng, W C; Guan, R; Tan, M F; Seet, B L; Lim, C A; Ngiam, C M; Sjaifoellah Noer, H M; Lesmana, L

    1995-01-01

    5' untranslated and partial core (C) region sequence of hepatitis C virus (HCV) in 21 Singaporean and 15 Indonesian isolates were amplified by reverse-transcription polymerase chain reaction and sequenced with the use of conserved primer sequences deduced from HCV genomes identified in other geographical regions. The HCV genotypes are predominantly that of Simmonds type 1 and less of type 2 and 3 with the latter genotype currently not detected in Indonesia. The 5' untranslated sequences are related to HCV-1. DK-7 (Denmark), US-11 (United States of America), HCV-J4, SA-10 (South Africa), T-3 (Taiwan), HCV-J6, HCV-J8, Eb-1 and Eb-8. When compared with the prototype HCV-1, insertions are found within the 5' untranslated region of Singaporean isolates and not in the Indonesians. There are Singaporean and Indonesian isolates that have sequences within the 5' untranslated region that differ slightly from each other. Microheterogeneity is observed in the core region of two Singaporeans and one Indonesian isolate. Finally, not all HCV isolates can be amplified with the conserved core sequence primers when compared with the ease with which these isolates can be amplified with 5' untranslated region conserved primers.

  5. Influence of intron length on interaction characters between post-spliced intron and its CDS in ribosomal protein genes

    NASA Astrophysics Data System (ADS)

    Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang

    2012-09-01

    Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.

  6. A Genome-Wide Scan of Selective Sweeps and Association Mapping of Fruit Traits Using Microsatellite Markers in Watermelon

    PubMed Central

    Reddy, Umesh K.; Abburi, Lavanya; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Cantrell, Robert; Vajja, Venkata Gopinath; Reddy, Rishi; Tomason, Yan R.; Levi, Amnon; Wehner, Todd C.; Nimmakayala, Padma

    2015-01-01

    Our genetic diversity study uses microsatellites of known map position to estimate genome level population structure and linkage disequilibrium, and to identify genomic regions that have undergone selection during watermelon domestication and improvement. Thirty regions that showed evidence of selective sweep were scanned for the presence of candidate genes using the watermelon genome browser (www.icugi.org). We localized selective sweeps in intergenic regions, close to the promoters, and within the exons and introns of various genes. This study provided an evidence of convergent evolution for the presence of diverse ecotypes with special reference to American and European ecotypes. Our search for location of linked markers in the whole-genome draft sequence revealed that BVWS00358, a GA repeat microsatellite, is the GAGA type transcription factor located in the 5′ untranslated regions of a structure and insertion element that expresses a Cys2His2 Zinc finger motif, with presumed biological processes related to chitin response and transcriptional regulation. In addition, BVWS01708, an ATT repeat microsatellite, located in the promoter of a DTW domain-containing protein (Cla002761); and 2 other simple sequence repeats that association mapping link to fruit length and rind thickness. PMID:25425675

  7. Molecular Cloning and Expression of Three Polygalacturonase cDNAs from the Tarnished Plant Bug, Lygus lineolaris

    PubMed Central

    Allen, Margaret L.; Mertens, Jeffrey A.

    2008-01-01

    Three unique cDNAs encoding putative polygalacturonase enzymes were isolated from the tarnished plant bug, Lygus lineolaris (Palisot de Beauvois) (Hemiptera: Miridae). The three nucleotide sequences were dissimilar to one another, but the deduced amino acid sequences were similar to each other and to other polygalacturonases from insects, fungi, plants, and bacteria. Four conserved segments characteristic of polygalacturonases were present, but with some notable semiconservative substitutions. Two of four expected disulfide bridge—forming cysteine pairs were present. All three inferred protein translations included predicted signal sequences of 17 to 20 amino acids. Amplification of genomic DNA identified an intron in one of the genes, Llpg1, in the 5′ untranslated region. Semiquantitative RT-PCR revealed expression in all stages of the insect except the eggs. Expression in adults, male and female, was highly variable, indicating a family of highly inducible and diverse enzymes adapted to the generalist polyphagous nature of this important pest. PMID:20233096

  8. Nucleotide sequence of the ribosomal RNA gene of Physarum polycephalum: intron 2 and its flanking regions of the 26S rRNA gene.

    PubMed Central

    Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y

    1981-01-01

    The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776

  9. Molecular evolution of multiple-level control of heme biosynthesis pathway in animal kingdom.

    PubMed

    Tzou, Wen-Shyong; Chu, Ying; Lin, Tzung-Yi; Hu, Chin-Hwa; Pai, Tun-Wen; Liu, Hsin-Fu; Lin, Han-Jia; Cases, Ildeofonso; Rojas, Ana; Sanchez, Mayka; You, Zong-Ye; Hsu, Ming-Wei

    2014-01-01

    Adaptation of enzymes in a metabolic pathway can occur not only through changes in amino acid sequences but also through variations in transcriptional activation, mRNA splicing and mRNA translation. The heme biosynthesis pathway, a linear pathway comprised of eight consecutive enzymes in animals, provides researchers with ample information for multiple types of evolutionary analyses performed with respect to the position of each enzyme in the pathway. Through bioinformatics analysis, we found that the protein-coding sequences of all enzymes in this pathway are under strong purifying selection, from cnidarians to mammals. However, loose evolutionary constraints are observed for enzymes in which self-catalysis occurs. Through comparative genomics, we found that in animals, the first intron of the enzyme-encoding genes has been co-opted for transcriptional activation of the genes in this pathway. Organisms sense the cellular content of iron, and through iron-responsive elements in the 5' untranslated regions of mRNAs and the intron-exon boundary regions of pathway genes, translational inhibition and exon choice in enzymes may be enabled, respectively. Pathway product (heme)-mediated negative feedback control can affect the transport of pathway enzymes into the mitochondria as well as the ubiquitin-mediated stability of enzymes. Remarkably, the positions of these controls on pathway activity are not ubiquitous but are biased towards the enzymes in the upstream portion of the pathway. We revealed that multiple-level controls on the activity of the heme biosynthesis pathway depend on the linear depth of the enzymes in the pathway, indicating a new strategy for discovering the molecular constraints that shape the evolution of a metabolic pathway.

  10. Association Mapping of the High-Grade Myopia MYP3 Locus Reveals Novel Candidates UHRF1BP1L, PTPRR, and PPFIA2

    PubMed Central

    Hawthorne, Felicia; Feng, Sheng; Metlapally, Ravikanth; Li, Yi-Ju; Tran-Viet, Khanh-Nhat; Guggenheim, Jeremy A.; Malecaze, Francois; Calvas, Patrick; Rosenberg, Thomas; Mackey, David A.; Venturini, Cristina; Hysi, Pirro G.; Hammond, Christopher J.; Young, Terri L.

    2013-01-01

    Purpose. Myopia, or nearsightedness, is a common ocular genetic disease for which over 20 candidate genomic loci have been identified. The high-grade myopia locus, MYP3, has been reported on chromosome 12q21–23 by four independent linkage studies. Methods. We performed a genetic association study of the MYP3 locus in a family-based high-grade myopia cohort (n = 82) by genotyping 768 single-nucleotide polymorphisms (SNPs) within the linkage region. Qualitative testing for high-grade myopia (sphere ≤ −5 D affected, > −0.5 D unaffected) and quantitative testing on the average dioptric sphere were performed. Results. Several genetic markers were nominally significantly associated with high-grade myopia in qualitative testing, including rs3803036, a missense mutation in PTPRR (P = 9.1 × 10−4) and rs4764971, an intronic SNP in UHRF1BP1L (P = 6.1 × 10−4). Quantitative testing determined statistically significant SNPs rs4764971, also found by qualitative testing (P = 3.1 × 10−6); rs7134216, in the 3′ untranslated region (UTR) of DEPDC4 (P = 5.4 × 10−7); and rs17306116, an intronic SNP within PPFIA2 (P < 9 × 10−4). Independently conducted whole genome expression array analyses identified protein tyrosine phosphatase genes PTPRR and PPFIA2, which are in the same gene family, as differentially expressed in normal rapidly growing fetal relative to normal adult ocular tissue (confirmed by RT-qPCR). Conclusions. In an independent high-grade myopia cohort, an intronic SNP in UHRF1BP1L, rs4764971, was validated for quantitative association, and SNPs within PTPRR (quantitative) and PPFIA2 (qualitative and quantitative) approached significance. Three genes identified by our association study and supported by ocular expression and/or replication, UHRF1BP1L, PTPRR, and PPFIA2, are novel candidates for myopic development within the MYP3 locus that should be further studied. PMID:23422819

  11. Evaluation of genetic variations in miRNA-binding sites of BRCA1 and BRCA2 genes as risk factors for the development of early-onset and/or familial breast cancer.

    PubMed

    Erturk, Elif; Cecener, Gulsah; Polatkan, Volkan; Gokgoz, Sehsuvar; Egeli, Unal; Tunca, Berrin; Tezcan, Gulcin; Demirdogen, Elif; Ak, Secil; Tasdelen, Ismet

    2014-01-01

    Although genetic markers identifying women at an increased risk of developing breast cancer exist, the majority of inherited risk factors remain elusive. Mutations in the BRCA1/BRCA2 gene confer a substantial increase in breast cancer risk, yet routine clinical genetic screening is limited to the coding regions and intron- exon boundaries, precluding the identification of mutations in noncoding and untranslated regions. Because 3' untranslated region (3'UTR) polymorphisms disrupting microRNA (miRNA) binding can be functional and can act as genetic markers of cancer risk, we aimed to determine genetic variation in the 3'UTR of BRCA1/BRCA2 in familial and early-onset breast cancer patients with and without mutations in the coding regions of BRCA1/ BRCA2 and to identify specific 3'UTR variants that may be risk factors for cancer development. The 3'UTRs of the BRCA1 and BRCA2 genes were screened by heteroduplex analysis and DNA sequencing in 100 patients from 46 BRCA1/2 families, 54 non-BRCA1/2 families, and 47 geographically matched controls. Two polymorphisms were identified. SNPs c.*1287C>T (rs12516) (BRCA1) and c.*105A>C (rs15869) (BRCA2) were identified in 27% and 24% of patients, respectively. These 2 variants were also identified in controls with no family history of cancer (23.4% and 23.4%, respectively). In comparison to variations in the 3'UTR region of the BRCA1/2 genes and the BRCA1/2 mutational status in patients, there was a statistically significant relationship between the BRCA1 gene polymorphism c.*1287C>T (rs12516) and BRCA1 mutations (p=0.035) by Fisher's Exact Test. SNP c.*1287C>T (rs12516) of the BRCA1 gene may have potential use as a genetic marker of an increased risk of developing breast cancer and likely represents a non-coding sequence variation in BRCA1 that impacts BRCA1 function and leads to increased early-onset and/or familial breast cancer risk in the Turkish population.

  12. Lack of association of ghrelin precursor gene variants and percentage body fat or serum lipid profiles.

    PubMed

    Martin, Glynn R; Loredo, J C; Sun, Guang

    2008-04-01

    Ghrelin has been recognized for its involvement in food intake, control of energy homeostasis, and lipid metabolism. However, the roles of genetic variations in the ghrelin precursor gene (GHRL) on body compositions and serum lipids are not clear in humans. Our study investigated five single-nucleotide polymorphisms (SNPs) within GHRL to determine their relationship with body fat percentage (BF), trunk fat percentage (TF), lower body (legs) fat percentage (LF), and serum lipids in 1,464 subjects, which were recruited from the genetically homogeneous population of Newfoundland and Labrador (NL), Canada. Serum glucose, insulin, total cholesterol, high-density lipoprotein-cholesterol, low-density lipoprotein-cholesterol, and triglycerides were determined. Five SNPs are rs35684 (A/G: a transition substitution in exon 1), rs4684677 (A/T: a missense mutation), rs2075356 (C/T: intron), rs26802 (G/T: intron), and rs26311 (A/G: near the 3' untranslated region) of GHRL were genotyped using TaqMan validated or functionally tested SNP genotyping assays. Our study found no significant evidence of an allele or genotype association between any of the variant sites and body compositions or serum lipids. Furthermore, haplotype frequencies were not found to be significantly different between lean and obese subjects. In summary, the results of our study do not support a significant role for genetic variations in GHRL in the differences of body fat and serum lipid profiles in the NL population.

  13. The Heterotrimeric G-Protein Subunits GNG-1 and GNB-1 Form a Gβγ Dimer Required for Normal Female Fertility, Asexual Development, and Gα Protein Levels in Neurospora crassa

    PubMed Central

    Krystofova, Svetlana; Borkovich, Katherine A.

    2005-01-01

    We have identified a gene encoding a heterotrimeric G protein γ subunit, gng-1, from the filamentous fungus Neurospora crassa. gng-1 possesses a gene structure similar to that of mammalian Gγ genes, consisting of three exons and two introns, with introns present in both the open reading frame and 5′-untranslated region. The GNG-1 amino acid sequence displays high identity to predicted Gγ subunits from other filamentous fungi, including Giberella zeae, Cryphonectria parasitica, Trichoderma harzianum, and Magnaporthe grisea. Deletion of gng-1 leads to developmental defects similar to those previously characterized for Δgnb-1 (Gβ) mutants. Δgng-1, Δgnb-1, and Δgng-1 Δgnb-1 strains conidiate inappropriately in submerged cultures and are female sterile, producing aberrant female reproductive structures. Similar to previous results obtained with Δgnb-1 mutants, loss of gng-1 negatively influences levels of Gα proteins (GNA-1, GNA-2, and GNA-3) in plasma membrane fractions isolated from various tissues of N. crassa and leads to a significant reduction in the amount of intracellular cyclic AMP. In addition, we show that GNB-1 is essential for maintenance of normal steady-state levels of GNG-1, suggesting a functional interaction between GNB-1 and GNG-1. Direct evidence for a physical association between GNB-1 and GNG-1 in vivo was provided by coimmunoprecipitation. PMID:15701799

  14. Remarkable sequence conservation of the last intron in the PKD1 gene.

    PubMed

    Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

    2003-10-01

    The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.

  15. Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.

    PubMed

    Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D

    1996-06-01

    Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.

  16. Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui

    1996-06-01

    Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less

  17. Genetic variations of the SLCO1B1 gene in the Chinese, Malay and Indian populations of Singapore.

    PubMed

    Ho, Woon Fei; Koo, Seok Hwee; Yee, Jie Yin; Lee, Edmund Jon Deoon

    2008-01-01

    OATP1B1 is a liver-specific transporter that mediates the uptake of various endogenous and exogenous compounds including many clinically used drugs from blood into hepatocytes. This study aims to identify genetic variations of SLCO1B1 gene in three distinct ethnic groups of the Singaporean population (n=288). The coding region of the gene encoding the transporter protein was screened for genetic variations in the study population by denaturing high-performance liquid chromatography and DNA sequencing. Twenty-five genetic variations of SLCO1B1, including 10 novel ones, were found: 13 in the coding exons (9 nonsynonymous and 4 synonymous variations), 6 in the introns, and 6 in the 3' untranslated region. Four novel nonsynonymous variations: 633A>G (Ile211Met), 875C>T (Ala292Val), 1837T>C (Cys613Arg), and 1877T>A (Leu626Stop) were detected as heterozygotes. Among the novel nonsynonymous variations, 633A>G, 1837T>C, and 1877T>A were predicted to be functionally significant. These data would provide fundamental and useful information for pharmacogenetic studies on drugs that are substrates of OATP1B1 in Asians.

  18. Genetic variation of the porcine NR5A1 is associated with meat color.

    PubMed

    Görres, Andreas; Ponsuksili, Siriluck; Wimmers, Klaus; Muráni, Eduard

    2016-02-01

    Because of the central role of Steroidogenic factor 1 in the regulation of the development and function of steroidogenic tissues, including the adrenal gland, we chose the encoding gene NR5A1 as a candidate for stress response, meat quality and carcass composition in the domestic pig. To identify polymorphisms of the porcine NR5A1 we comparatively sequenced the coding, untranslated and regulatory regions in four commercial pig lines. Single nucleotide polymorphisms could be found in the 3' UTR and in an intronic enhancer, whereas no polymorphisms were detected in the proximal promoter and coding region. A subset of the detected polymorphisms was genotyped in Piétrain x (German Large White x German Landrace) and German Landrace pigs. For the same animals, carcass composition traits, meat quality characteristics and parameters of adrenal function were recorded. Associations with meat color were found for two of the discovered SNPs in Piétrain x (German Large White x German Landrace) and German Landrace pigs but no connections to parameters of adrenal function could be established. We conclude that NR5A1 variations influence meat color in a hypothalamus-pituitary-adrenal axis independent manner and that further regulatory regions need to be analyzed for genetic variations to understand the discovered effects.

  19. Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus

    PubMed Central

    Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia

    2006-01-01

    Background To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the cogeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the cogeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333

  20. Molecular characterization of Quercus suber MYB1, a transcription factor up-regulated in cork tissues.

    PubMed

    Almeida, Tânia; Menéndez, Esther; Capote, Tiago; Ribeiro, Teresa; Santos, Conceição; Gonçalves, Sónia

    2013-01-15

    The molecular processes associated with cork development in Quercus suber L. are poorly understood. A previous molecular approach identified a list of genes potentially important for cork formation and differentiation, providing a new basis for further molecular studies. This report is the first molecular characterization of one of these candidate genes, QsMYB1, coding for an R2R3-MYB transcription factor. The R2R3-MYB gene sub-family has been described as being involved in the phenylpropanoid and lignin pathways, both involved in cork biosynthesis. The results showed that the expression of QsMYB1 is putatively mediated by an alternative splicing (AS) mechanism that originates two different transcripts (QsMYB1.1 and QsMYB1.2), differing only in the 5'-untranslated region, due to retention of the first intron in one of the variants. Moreover, within the retained intron, a simple sequence repeat (SSR) was identified. The upstream regulatory region of QsMYB1 was extended by a genome walking approach, which allowed the identification of the putative gene promoter region. The relative expression pattern of QsMYB1 transcripts determined by reverse transcription quantitative polymerase chain reaction (RT-qPCR) revealed that both transcripts were up-regulated in cork tissues; the detected expression was several times higher in newly formed cork harvested from trees producing virgin, second or reproduction cork when compared with wood. Moreover, the expression analysis of QsMYB1 in several Q. suber organs showed very low expression in young branches and roots, whereas in leaves, immature acorns or male flowers, no expression was detected. These preliminary results suggest that QsMYB1 may be related to secondary growth and, in particular, with the cork biosynthesis process with a possible alternative splicing mechanism associated with its regulatory function. Copyright © 2012 Elsevier GmbH. All rights reserved.

  1. Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence.

    PubMed

    Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei

    2014-09-10

    Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Integrated Analyses of microRNAs Demonstrate Their Widespread Influence on Gene Expression in High-Grade Serous Ovarian Carcinoma

    PubMed Central

    Levine, Douglas A.; Mankoo, Parminder; Schultz, Nikolaus; Du, Ying; Zhang, Yiqun; Larsson, Erik; Sheridan, Robert; Xiao, Weimin; Spellman, Paul T.; Getz, Gad; Wheeler, David A.; Perou, Charles M.; Gibbs, Richard A.; Sander, Chris; Hayes, D. Neil; Gunaratne, Preethi H.

    2012-01-01

    Background The Cancer Genome Atlas (TCGA) Network recently comprehensively catalogued the molecular aberrations in 487 high-grade serous ovarian cancers, with much remaining to be elucidated regarding the microRNAs (miRNAs). Here, using TCGA ovarian data, we surveyed the miRNAs, in the context of their predicted gene targets. Methods and Results Integration of miRNA and gene patterns yielded evidence that proximal pairs of miRNAs are processed from polycistronic primary transcripts, and that intronic miRNAs and their host gene mRNAs derive from common transcripts. Patterns of miRNA expression revealed multiple tumor subtypes and a set of 34 miRNAs predictive of overall patient survival. In a global analysis, miRNA:mRNA pairs anti-correlated in expression across tumors showed a higher frequency of in silico predicted target sites in the mRNA 3′-untranslated region (with less frequency observed for coding sequence and 5′-untranslated regions). The miR-29 family and predicted target genes were among the most strongly anti-correlated miRNA:mRNA pairs; over-expression of miR-29a in vitro repressed several anti-correlated genes (including DNMT3A and DNMT3B) and substantially decreased ovarian cancer cell viability. Conclusions This study establishes miRNAs as having a widespread impact on gene expression programs in ovarian cancer, further strengthening our understanding of miRNA biology as it applies to human cancer. As with gene transcripts, miRNAs exhibit high diversity reflecting the genomic heterogeneity within a clinically homogeneous disease population. Putative miRNA:mRNA interactions, as identified using integrative analysis, can be validated. TCGA data are a valuable resource for the identification of novel tumor suppressive miRNAs in ovarian as well as other cancers. PMID:22479643

  3. Mutations in the PDE6B gene in autosomal recessive retinitis pigmentosa

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Danciger, M.; Blaney, J.; Gao, Y.Q.

    1995-11-01

    We have studied 24 small families with presumed autosomal recessive inheritance of retinitis pigmentosa by a combination of haplotype analysis and exon screening. Initial analysis of the families was made with a dinucleotide repeat polymorphism adjacent to the gene for rod cGMP-phosphodiesterase (PDE6B). This was followed by denaturing gradient gel electrophoresis (DGGE) and single-strand conformation polymorphism electrophoresis (SSCPE) of the 22 exons and a portion of the 5{prime} untranslated region of the PDE6B gene in the probands of each family in which the PDE6B locus could not be ruled out from segregating with disease. Two probands were found with compoundmore » heterozygous mutations: Gly576Asp and His620(1-bp del) mutations were present in one proband, and a Lys706X null mutation and an AG to AT splice acceptor site mutation in intron 2 were present in the other. Only the affecteds of each of the two families carried both corresponding mutations. 29 refs., 3 figs., 1 tab.« less

  4. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP

    PubMed Central

    Hafner, Markus; Landthaler, Markus; Burger, Lukas; Khorshid, Mohsen; Hausser, Jean; Berninger, Philipp; Rothballer, Andrea; Ascano, Manuel; Jungkamp, Anna-Carina; Munschauer, Mathias; Ulrich, Alexander; Wardle, Greg S.; Dewell, Scott; Zavolan, Mihaela; Tuschl, Thomas

    2010-01-01

    Summary RNA transcripts are subject to post-transcriptional gene regulation involving hundreds of RNA-binding proteins (RBPs) and microRNA-containing ribonucleoprotein complexes (miRNPs) expressed in a cell-type dependent fashion. We developed a cell-based crosslinking approach to determine at high resolution and transcriptome-wide the binding sites of cellular RBPs and miRNPs. The crosslinked sites are revealed by thymidine to cytidine transitions in the cDNAs prepared from immunopurified RNPs of 4-thiouridine-treated cells. We determined the binding sites and regulatory consequences for several intensely studied RBPs and miRNPs, including PUM2, QKI, IGF2BP1-3, AGO/EIF2C1-4 and TNRC6A-C. Our study revealed that these factors bind thousands of sites containing defined sequence motifs and have distinct preferences for exonic versus intronic or coding versus untranslated transcript regions. The precise mapping of binding sites across the transcriptome will be critical to the interpretation of the rapidly emerging data on genetic variation between individuals and how these variations contribute to complex genetic diseases. PMID:20371350

  5. Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

    PubMed

    Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

    2013-12-10

    Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.

  6. Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

    PubMed

    Vouille, V; Amiche, M; Nicolas, P

    1997-09-01

    We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.

  7. Prion gene haplotypes of U.S. cattle

    PubMed Central

    Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W

    2006-01-01

    Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337

  8. Gene Expression in Archaea: Studies of Transcriptional Promoters, Messenger RNA Processing, and Five Prime Untranslated Regions in "Methanocaldococcus Jannashchii"

    ERIC Educational Resources Information Center

    Zhang, Jian

    2009-01-01

    Gene expression in Archaea is less understood than those in Bacteria and Eucarya. In general, three steps are involved in gene expression--transcription, RNA processing, and translation. To expand our knowledge of these processes in Archaea, I have studied transcriptional promoters, messenger RNA processing, and 5'-untranslated regions in…

  9. UTRdb and UTRsite: a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs

    PubMed Central

    Mignone, Flavio; Grillo, Giorgio; Licciulli, Flavio; Iacono, Michele; Liuni, Sabino; Kersey, Paul J.; Duarte, Jorge; Saccone, Cecilia; Pesole, Graziano

    2005-01-01

    The 5′ and 3′ untranslated regions of eukaryotic mRNAs play crucial roles in the post-transcriptional regulation of gene expression through the modulation of nucleo-cytoplasmic mRNA transport, translation efficiency, subcellular localization and message stability. UTRdb is a curated database of 5′ and 3′ untranslated sequences of eukaryotic mRNAs, derived from several sources of primary data. Experimentally validated functional motifs are annotated (and also collated as the UTRsite database) and cross-links to genomic and protein data are provided. The integration of UTRdb with genomic and protein data has allowed the implementation of a powerful retrieval resource for the selection and extraction of UTR subsets based on their genomic coordinates and/or features of the protein encoded by the relevant mRNA (e.g. GO term, PFAM domain, etc.). All internet resources implemented for retrieval and functional analysis of 5′ and 3′ untranslated regions of eukaryotic mRNAs are accessible at http://www.ba.itb.cnr.it/UTR/. PMID:15608165

  10. Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes1

    PubMed Central

    Rombauts, Stephane; Florquin, Kobe; Lescot, Magali; Marchal, Kathleen; Rouzé, Pierre; Van de Peer, Yves

    2003-01-01

    The identification of promoters and their regulatory elements is one of the major challenges in bioinformatics and integrates comparative, structural, and functional genomics. Many different approaches have been developed to detect conserved motifs in a set of genes that are either coregulated or orthologous. However, although recent approaches seem promising, in general, unambiguous identification of regulatory elements is not straightforward. The delineation of promoters is even harder, due to its complex nature, and in silico promoter prediction is still in its infancy. Here, we review the different approaches that have been developed for identifying promoters and their regulatory elements. We discuss the detection of cis-acting regulatory elements using word-counting or probabilistic methods (so-called “search by signal” methods) and the delineation of promoters by considering both sequence content and structural features (“search by content” methods). As an example of search by content, we explored in greater detail the association of promoters with CpG islands. However, due to differences in sequence content, the parameters used to detect CpG islands in humans and other vertebrates cannot be used for plants. Therefore, a preliminary attempt was made to define parameters that could possibly define CpG and CpNpG islands in Arabidopsis, by exploring the compositional landscape around the transcriptional start site. To this end, a data set of more than 5,000 gene sequences was built, including the promoter region, the 5′-untranslated region, and the first introns and coding exons. Preliminary analysis shows that promoter location based on the detection of potential CpG/CpNpG islands in the Arabidopsis genome is not straightforward. Nevertheless, because the landscape of CpG/CpNpG islands differs considerably between promoters and introns on the one side and exons (whether coding or not) on the other, more sophisticated approaches can probably be developed for the successful detection of “putative” CpG and CpNpG islands in plants. PMID:12857799

  11. Submesoscale characteristics and transcription of a fatty acid elongase gene from a freshwater green microalgae, Myrmecia incisa Reisigl

    NASA Astrophysics Data System (ADS)

    Yu, Shuiyan; Liu, Shicheng; Li, Chunyang; Zhou, Zhigang

    2011-01-01

    Myrmecia incisa is a green coccoid freshwater microalgae, which is rich in arachidonic acid (ArA, C20: 4ω-6, δ5, 8, 11, 14), a long chain polyunsaturated fatty acid (PUFA), especially under nitrogen starvation stress. A cDNA library of M. incisa was constructed with λ phage vectors and a 545 nt expressed sequence tag (EST) was screened from this library as a putative elongase gene due to its 56% and 49% identity to Marchantia polymorpha L. and Ostreococcus tauri Courties et Chrétiennot-Dinet, respectively. Based upon this EST sequence, an elongase gene designated MiFAE was isolated from M. incisa via 5'/3' rapid amplification of cDNA ends (RACE). The cDNA sequence was 1 331 bp long and included a 33 bp 5'-untranslated region (UTR) and a 431 bp 3'-UTR with a typical poly-A tail. The 867 bp ORF encoded a predicted protein of 288 amino acids. This protein was characterized by a conserved histidine-rich box and a MYxYY motif that was present in other members of the elongase family. The genomic DNA sequence of MiFAE was found to be interrupted by three introns with splicing sites of Introns I (81 bp), II (81 bp), and III (67 bp) that conformed to the GT-AG rule. Quantitative real-time PCR showed that the transcription level of MiFAE in this microalga under nitrogen starvation was higher than that under normal condition. Prior to the ArA content accumulation, the transcription of MiFAE was enhanced, suggesting that it was possibly responsible for the ArA accumulation in this microalga cultured under nitrogen starvation conditions.

  12. Comparative analysis of the 5{prime} genomic and promoter regions between the mouse (Hdh) and human Huntington disease (HD) gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kalchman, M.; Lin, B.; Nasir, J.

    1994-09-01

    The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less

  13. The insulin-like growth factor 2 (IGF2) gene intron3-g.3072G>A polymorphism is not the only Sus scrofa chromosome 2p mutation affecting meat production and carcass traits in pigs: evidence from the effects of a cathepsin D (CTSD) gene polymorphism.

    PubMed

    Fontanesi, L; Speroni, C; Buttazzoni, L; Scotti, E; Dall'Olio, S; Nanni Costa, L; Davoli, R; Russo, V

    2010-07-01

    The objective of this study was to evaluate the effects of mutations in 2 genes [IGF2 and cathepsin D (CTSD)] that map on the telomeric end of the p arm of SSC2. In this region, an imprinted QTL affecting muscle mass and fat deposition was reported, and the IGF2 intron3-g.3072G>A substitution was identified as the causative mutation. In the same chromosome region, we assigned, by linkage mapping, the CTSD gene, a lysosomal proteinase, for which we previously identified an SNP in the 3'-untranslated region (AM933484, g.70G>A). We have already shown strong effects of this CTSD mutation on several production traits in Italian Large White pigs, suggesting a possible independent role of this marker in fatness and meat deposition in pigs. To evaluate this hypothesis, after having refined the map position of the CTSD gene by radiation hybrid mapping, we analyzed the IGF2 and the CTSD polymorphisms in 270 Italian Large White and 311 Italian Duroc pigs, for which EBV and random residuals from fixed models were calculated for several traits. Different association analyses were carried out to distinguish the effects of the 2 close markers. In the Italian Large White pigs, the results for IGF2 were highly significant for all traits when using either EBV or random residuals (e.g., using EBV: lean cuts, P = 2.2 x 10(-18); ADG, P = 2.6 x 10(-16); backfat thickness, P = 2.2 x 10(-9); feed:gain ratio, P = 2.3 x 10(-9); ham weight, P = 1.5 x 10(-6)). No effect was observed for meat quality traits. The IGF2 intron3-g.3072G>A mutation did not show any association in the Italian Duroc pigs, probably because of the small variability at this polymorphic site for this breed. However, a significant association was evident for the CTSD marker (P < 0.001) with EBV of all carcass and production traits in Italian Duroc pigs (lean content, ADG, backfat thickness, feed:gain ratio) after excluding possible confounding effects of the IGF2 mutation. The effects of the CTSD g.70G>A mutation were also confirmed in a subset of Italian Large White animals carrying the homozygous genotype IGF2 intron3-g.3072GG, and by haplotype analysis between the markers of the 2 considered genes in the complete data set. Overall, these results indicate that the IGF2 intron3-g.3072G>A mutation is not the only polymorphism affecting fatness and muscle deposition on SSC2p. Therefore, the CTSD g.70G>A polymorphism could be used to increase selection efficiency in marker-assisted selection programs that already use the IGF2 mutation. However, for practical applications, because the CTSD gene should not be imprinted (we obtained this information from expression analysis in adult skeletal muscle), the different modes of inheritance of the 2 genes have to be considered.

  14. Peroxisomal monodehydroascorbate reductase. Genomic clone characterization and functional analysis under environmental stress conditions.

    PubMed

    Leterrier, Marina; Corpas, Francisco J; Barroso, Juan B; Sandalio, Luisa M; del Río, Luis A

    2005-08-01

    In plant cells, ascorbate is a major antioxidant that is involved in the ascorbate-glutathione cycle. Monodehydroascorbate reductase (MDAR) is the enzymatic component of this cycle involved in the regeneration of reduced ascorbate. The identification of the intron-exon organization and the promoter region of the pea (Pisum sativum) MDAR 1 gene was achieved in pea leaves using the method of walking polymerase chain reaction on genomic DNA. The nuclear gene of MDAR 1 comprises nine exons and eight introns, giving a total length of 3,770 bp. The sequence of 544 bp upstream of the initiation codon, which contains the promoter and 5' untranslated region, and 190 bp downstream of the stop codon were also determined. The presence of different regulatory motifs in the promoter region of the gene might indicate distinct responses to various conditions. The expression analysis in different plant organs by northern blots showed that fruits had the highest level of MDAR. Confocal laser scanning microscopy analysis of pea leaves transformed with Agrobacterium tumefaciens having the binary vectors pGD, which contain the autofluorescent proteins enhanced green fluorescent protein and enhanced yellow fluorescent protein with the full-length cDNA for MDAR 1 and catalase, indicated that the MDAR 1 encoded the peroxisomal isoform. The functional analysis of MDAR by activity and protein expression was studied in pea plants grown under eight stress conditions, including continuous light, high light intensity, continuous dark, mechanical wounding, low and high temperature, cadmium, and the herbicide 2,4-dichlorophenoxyacetic acid. This functional analysis is representative of all the MDAR isoforms present in the different cell compartments. Results obtained showed a significant induction by high light intensity and cadmium. On the other hand, expression studies, performed by semiquantitative reverse transcription-polymerase chain reaction demonstrated differential expression patterns of peroxisomal MDAR 1 transcripts in pea plants grown under the mentioned stress conditions. These findings show that the peroxisomal MDAR 1 has a differential regulation that could be indicative of its specific function in peroxisomes. All these biochemical and molecular data represent a significant step to understand the specific physiological role of each MDAR isoenzyme and its participation in the antioxidant mechanisms of plant cells.

  15. Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

    PubMed

    Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

    2004-01-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.

  16. Exome capture from the spruce and pine giga-genomes.

    PubMed

    Suren, H; Hodgins, K A; Yeaman, S; Nurkowski, K A; Smets, P; Rieseberg, L H; Aitken, S N; Holliday, J A

    2016-09-01

    Sequence capture is a flexible tool for generating reduced representation libraries, particularly in species with massive genomes. We used an exome capture approach to sequence the gene space of two of the dominant species in Canadian boreal and montane forests - interior spruce (Picea glauca x engelmanii) and lodgepole pine (Pinus contorta). Transcriptome data generated with RNA-seq were coupled with draft genome sequences to design baits corresponding to 26 824 genes from pine and 28 649 genes from spruce. A total of 579 samples for spruce and 631 samples for pine were included, as well as two pine congeners and six spruce congeners. More than 50% of targeted regions were sequenced at >10× depth in each species, while ~12% captured near-target regions within 500 bp of a bait position were sequenced to a depth >10×. Much of our read data arose from off-target regions, which was likely due to the fragmented and incomplete nature of the draft genome assemblies. Capture in general was successful for the related species, suggesting that baits designed for a single species are likely to successfully capture sequences from congeners. From these data, we called approximately 10 million SNPs and INDELs in each species from coding regions, introns, untranslated and flanking regions, as well as from the intergenic space. Our study demonstrates the utility of sequence capture for resequencing in complex conifer genomes, suggests guidelines for improving capture efficiency and provides a rich resource of genetic variants for studies of selection and local adaptation in these species. © 2016 John Wiley & Sons Ltd.

  17. Structure and expression of canary myc family genes.

    PubMed Central

    Collum, R G; Clayton, D F; Alt, F W

    1991-01-01

    We found that the canary N-myc gene is highly related to mammalian N-myc genes in both the protein-coding region and the long 3' untranslated region. Examined coding regions of the canary c-myc gene were also highly related to their mammalian counterparts, but in contrast to N-myc, the canary and mammalian c-myc genes were quite divergent in their 3' untranslated regions. We readily detected N-myc and c-myc expression in the adult canary brain and found N-myc expression both at sites of proliferating neuronal precursors and in mature neurons. Images PMID:1996121

  18. Polymorphisms and linkage analysis for ICAM-1 and the selectin gene cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vora, D.K.; Rosenbloom, C.L.; Cottingham, R.W.

    1994-06-01

    Genetic polymorphisms in leukocyte and endothelial cell adhesion molecules may be important variables with regard to susceptibility to multifactorial disease processes that include an inflammatory component. For this reason, polymorphisms were sought for intercellular adhesion molecule-1 (ICAM-1; gene symbol ICAM1) and for the three genes in the selectin cluster, P-selectin, L-selectin, and E-selectin (gene symbols SELP, SELL, and SELE, respectively). Two amino acid polymorphisms were identified for ICAM-1; Gly or Arg at codon 241 and Lys or Glu at codon 469. Dinucleotide repeat polymorphisms were identified in the 3{prime}-untranslated region for ICAM-1 and in intron 9 for P-selectin. Restriction fragmentmore » length polymorphisms were found using cDNAs for each of the three selectin genes as probes; E-selectin with BglII, P-selectin with ScaI, and L-selectin with HincII. Linkage analysis was performed for the selectin gene cluster and for ICAM-1 using the CEPH families; ICAM-1 is very tightly linked to the LDL receptor on chromosome 19, and the selectin cluster is linked to markers at chromosome 1q23. 41 refs., 2 tabs.« less

  19. COL5A1: Genetic mapping and exclusion as candidate gene in families with nail-patella syndrome, tuberous sclerosis 1, hereditary hemorrhagic telangiectasia, and Ehlers-Danlos syndrome type II

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Greenspan, D.S.; Northrup, H.; Au, K.S.

    1995-02-10

    COL5A1, the gene for the {alpha}1 chain of type V collagen, has been considered a candidate gene for certain diseases based on chromosomal location and/or disease phenotype. We have employed 3{prime}-untranslated region RFLPs to exclude COL5A1 as a candidate gene in families with tuberous sclerosis 1, Ehlers-Danlos syndrome type H, and nail-patella syndrome. In addition, we describe a polymorphic simple sequence repeat (SSR) within a COL5A1 intron. This SSR is used to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia (Osler-Rendu-Weber disease) and to add COL5A1 to the existing map of {open_quotes}index{close_quotes} markers of chromosome 9 by evaluationmore » of the COL5A1 locus on the CEPH 40-family reference pedigree set. This genetic mapping places COL5A1 between markers D9S66 and D9S67. 14 refs., 1 fig., 2 tabs.« less

  20. Genetic analysis of drug metabolizing phase-I enzymes CYP3A4 in Tibetan populations.

    PubMed

    Liu, Lijun; Chang, Yu; Du, Shuli; Shi, Xugang; Yang, Hua; Kang, Longli; Jin, Tianbo; Yuan, Dongya; He, Yongjun

    2017-06-01

    The enzymatic activity of CYP3A4 results in broad interindividual variability in response to certain pharmacotherapies. The present study aimed to screen Tibetan volunteers for CYP3A4 genetic polymorphisms. Previous research has focussed on Han Chinese patients, while little is known about the genetic variation of CYP3A4 in the Tibetan populations. Here, we adopted DNA sequencing to investigate the promoter, exons and surrounding introns, and 3'-untranslated region of the CYP3A4 gene in 96 unrelated healthy Tibetan individuals.We identified 20 different CYP3A4 polymorphisms in the Tibetan population, including two novel variants (21824 A>G and 15580 G>C). In addition, we also determined the allele frequencies of CYP3A4*1A and CYP3A4*1H were 82.29% and 28.13%, respectively. CYP3A4*1P and *1G were relatively rare with frequencies of only 1.04% and 0.52%, respectively. Our results provide information on CYP3A4 polymorphisms in Tibetan individuals which may help to optimize pharmacotherapy effectiveness by providing personalized medicine to this ethnic group.

  1. Beta-keratins of differentiating epidermis of snake comprise glycine-proline-serine-rich proteins with an avian-like gene organization.

    PubMed

    Dalla Valle, Luisa; Nardi, Alessia; Belvedere, Paola; Toni, Mattia; Alibardi, Lorenzo

    2007-07-01

    Beta-keratins of reptilian scales have been recently cloned and characterized in some lizards. Here we report for the first time the sequence of some beta-keratins from the snake Elaphe guttata. Five different cDNAs were obtained using 5'- and 3'-RACE analyses. Four sequences differ by only few nucleotides in the coding region, whereas the last cDNA shows, in this region, only 84% of identity. The gene corresponding to one of the cDNA sequences has a single intron present in the 5'-untranslated region. This genomic organization is similar to that of birds' beta-keratins. Cloning and Southern blotting analysis suggest that snake beta-keratins belong to a family of high-related genes as for geckos. PCR analysis suggests a head-to-tail orientation of genes in the same chromosome. In situ hybridization detected beta-keratin transcripts almost exclusively in differentiating oberhautchen and beta-cells of the snake epidermis in renewal phase. This is confirmed by Northern blotting that showed, in this phase, a high expression of two different transcripts whereas only the longer transcript is expressed at a much lower level in resting skin. The cDNA coding sequences encoded putative glycine-proline-serine rich proteins containing 137-139 amino acids, with apparent isoelectric point at 7.5 and 8.2. A central region, rich in proline, shows over 50% homology with avian scale, claw, and feather keratins. The prediction of secondary structure shows mainly a random coil conformation and few beta-strand regions in the central region, likely involved in the formation of a fibrous framework of beta-keratins. This region was possibly present in basic reptiles that originated reptiles and birds. Copyright 2007 Wiley-Liss, Inc.

  2. Evolution of group I introns in Porifera: new evidence for intron mobility and implications for DNA barcoding.

    PubMed

    Schuster, Astrid; Lopez, Jose V; Becking, Leontine E; Kelly, Michelle; Pomponi, Shirley A; Wörheide, Gert; Erpenbeck, Dirk; Cárdenas, Paco

    2017-03-20

    Mitochondrial introns intermit coding regions of genes and feature characteristic secondary structures and splicing mechanisms. In metazoans, mitochondrial introns have only been detected in sponges, cnidarians, placozoans and one annelid species. Within demosponges, group I and group II introns are present in six families. Based on different insertion sites within the cox1 gene and secondary structures, four types of group I and two types of group II introns are known, which can harbor up to three encoding homing endonuclease genes (HEG) of the LAGLIDADG family (group I) and/or reverse transcriptase (group II). However, only little is known about sponge intron mobility, transmission, and origin due to the lack of a comprehensive dataset. We analyzed the largest dataset on sponge mitochondrial group I introns to date: 95 specimens, from 11 different sponge genera which provided novel insights into the evolution of group I introns. For the first time group I introns were detected in four genera of the sponge family Scleritodermidae (Scleritoderma, Microscleroderma, Aciculites, Setidium). We demonstrated that group I introns in sponges aggregate in the most conserved regions of cox1. We showed that co-occurrence of two introns in cox1 is unique among metazoans, but not uncommon in sponges. However, this combination always associates an active intron with a degenerating one. Earlier hypotheses of HGT were confirmed and for the first time VGT and secondary losses of introns conclusively demonstrated. This study validates the subclass Spirophorina (Tetractinellida) as an intron hotspot in sponges. Our analyses confirm that most sponge group I introns probably originated from fungi. DNA barcoding is discussed and the application of alternative primers suggested.

  3. Structure and expression of the human XPBC/ERCC-3 gene involved in DNA repair disorders xeroderma pigmentosum and Cockayne's syndrome.

    PubMed Central

    Weeda, G; Ma, L B; van Ham, R C; van der Eb, A J; Hoeijmakers, J H

    1991-01-01

    The human XPBC/ERCC-3 was cloned by virtue of its ability to correct the excision repair defect of UV-sensitive rodent mutants of complementation group 3. The gene appeared to be in addition implicated in the human, cancer prone repair disorder xeroderma pigmentosum group B, which is also associated with Cockayne's syndrome. Here we present the genomic architecture of the gene and its expression. The XPBC/ERCC-3 gene consists of at least 14 exons spread over approximately 45 kb. Notably, the donor splice site of the third exon contains a GC instead of the canonical GT dinucleotide. The promoter region, first exon and intron comprise a CpG island with several putative GC boxes. The promoter was confined to a region of 260 bp upstream of the presumed cap site and acts bidirectionally. Like the promoter of another excision repair gene, ERCC-1, it lacks classical promoter elements such as CAAT and TATA boxes, but it shares with ERCC-1 a hitherto unknown 12 nucleotide sequence element, preceding a polypyrimidine track. Despite the presence of (AU)-rich elements in the 3'-untranslated region, which are thought to be associated with short mRNA half-life actinomycin-D experiments indicate that the mRNA is very stable (t 1/2 greater than 3h). Southern blot analysis revealed the presence of XPBC/ERCC-3 cross-hybridizing fragments elsewhere in the genome, which may belong to a related gene. Images PMID:1956789

  4. Structure of the coding region and mRNA variants of the apyrase gene from pea (Pisum sativum)

    NASA Technical Reports Server (NTRS)

    Shibata, K.; Abe, S.; Davies, E.

    2001-01-01

    Partial amino acid sequences of a 49 kDa apyrase (ATP diphosphohydrolase, EC 3.6.1.5) from the cytoskeletal fraction of etiolated pea stems were used to derive oligonucleotide DNA primers to generate a cDNA fragment of pea apyrase mRNA by RT-PCR and these primers were used to screen a pea stem cDNA library. Two almost identical cDNAs differing in just 6 nucleotides within the coding regions were found, and these cDNA sequences were used to clone genomic fragments by PCR. Two nearly identical gene fragments containing 8 exons and 7 introns were obtained. One of them (H-type) encoded the mRNA sequence described by Hsieh et al. (1996) (DDBJ/EMBL/GenBank Z32743), while the other (S-type) differed by the same 6 nucleotides as the mRNAs, suggesting that these genes may be alleles. The six nucleotide differences between these two alleles were found solely in the first exon, and these mutation sites had two types of consensus sequences. These mRNAs were found with varying lengths of 3' untranslated regions (3'-UTR). There are some similarities between the 3'-UTR of these mRNAs and those of actin and actin binding proteins in plants. The putative roles of the 3'-UTR and alternative polyadenylation sites are discussed in relation to their possible role in targeting the mRNAs to different subcellular compartments.

  5. Bison PRNP genotyping and potential association with Brucella spp. seroprevalence

    USGS Publications Warehouse

    Seabury, C.M.; Halbert, N.D.; Gogan, P.J.P.; Templeton, J.W.; Derr, J.N.

    2005-01-01

    The implication that host cellular prion protein (PrPC) may function as a cell surface receptor and/or portal protein for Brucella abortus in mice prompted an evaluation of nucleotide and amino acid variation within exon 3 of the prion protein gene (PRNP) for six US bison populations. A non-synonymous single nucleotide polymorphism (T50C), resulting in the predicted amino acid replacement M17T (Met ??? Thr), was identified in each population. To date, no variation (T50: Met) has been detected at the corresponding exon 3 nucleotide and/or amino acid position for domestic cattle. Notably, 80% (20 of 25) of the Yellowstone National Park bison possessing the C/C genotype were Brucella spp. seropositive, representing a significant (P = 0.021) association between seropositivity and the C/C genotypic class. Moreover, significant differences in the distribution of PRNP exon 3 alleles and genotypes were detected between Yellowstone National Park bison and three bison populations that were either founded from seronegative stock or previously subjected to test-and-slaughter management to eradicate brucellosis. Unlike domestic cattle, no indel polymorphisms were detected within the corresponding regions of the putative bison PRNP promoter, intron 1, octapeptide repeat region or 3???-untranslated region for any population examined. This study provides the first evidence of a potential association between nucleotide variation within PRNP exon 3 and the presence of Brucella spp. antibodies in bison, implicating PrPC in the natural resistance of bison to brucellosis infection. ?? 2005 International Society for Animal Genetics.

  6. Specification of skeletal muscle differentiation by repressor element-1 silencing transcription factor (REST)-regulated Kv7.4 potassium channels

    PubMed Central

    Iannotti, Fabio Arturo; Barrese, Vincenzo; Formisano, Luigi; Miceli, Francesco; Taglialatela, Maurizio

    2013-01-01

    Changes in the expression of potassium (K+) channels is a pivotal event during skeletal muscle differentiation. In mouse C2C12 cells, similarly to human skeletal muscle cells, myotube formation increased the expression of Kv7.1, Kv7.3, and Kv7.4, the last showing the highest degree of regulation. In C2C12 cells, Kv7.4 silencing by RNA interference reduced the expression levels of differentiation markers (myogenin, myosin heavy chain, troponinT-1, and Pax3) and impaired myotube formation and multinucleation. In Kv7.4-silenced cells, the differentiation-promoting effect of the Kv7 activator N-(2-amino-4-(4-fluorobenzylamino)-phenyl)-carbamic acid ethyl ester (retigabine) was abrogated. Expression levels for the repressor element-1 silencing transcription factor (REST) declined during myotube formation. Transcript levels for Kv7.4, as well as for myogenin, troponinT-1, and Pax3, were reduced by REST overexpression and enhanced upon REST suppression by RNA interference. Four regions containing potential REST-binding sites in the 5′ untranslated region and in the first intron of the Kv7.4 gene were identified by bioinformatic analysis. Chromatin immunoprecipitation assays showed that REST binds to these regions, exhibiting a higher efficiency in myoblasts than in myotubes. These data suggest that Kv7.4 plays a permissive role in skeletal muscle differentiation and highlight REST as a crucial transcriptional regulator for this K+ channel subunit. PMID:23242999

  7. Analysis of CYP3A4 genetic polymorphisms in Han Chinese.

    PubMed

    Zhou, Qing; Yu, Xiaomin; Shu, Chang; Cai, Yimei; Gong, Wei; Wang, Xumin; Wang, Duen-mei; Hu, Songnian

    2011-06-01

    Our study aimed to comprehensively investigate the genetic polymorphisms of CYP3A4 in Han Chinese. We sequenced the gene regions of CYP3A4, including its promoter, exons, surrounding introns and 3' untranslated region (3'UTR), from 100 unrelated-healthy Han Chinese individuals. We detected 11 SNPs, three of which are novel. According to in silico functional prediction of novel variants, 20148 A>G in exon 10, resulting in substitution of Tyr319 with Cys (CYP3A4*21), may induce dramatic alteration of protein conformation, and 26908 G>A in 3'UTR may disrupt post-transcriptional regulation. We identified five alleles in Han Chinese, the allele frequencies of CYP3A4*1, *5, *6, *18 and *21 are 97, 0.5, 1, 1 and 0.5%, respectively. Haplotype inference revealed 14 haplotypes, of which the major haplotype CYP3A4*1A constitutes 59% of the total chromosomes. We also examined the possible role of natural selection in shaping the variation of CYP3A4 and confirmed a trend, consistent with the action of positive selection. We systematically screened the genetic polymorphisms of CYP3A4 in Han Chinese, highlighted possible functional impairment of the novel allele and summarized the distinct allele and haplotype frequency distribution, with an emphasis on detecting the footprint of recent positive selection on the CYP3A4 gene in Han Chinese.

  8. Group I introns are inherited through common ancestry in the nuclear-encoded rRNA of Zygnematales (Charophyceae).

    PubMed Central

    Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M

    1994-01-01

    Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917

  9. The Role of CYP3A4 mRNA Transcript with Shortened 3′-Untranslated Region in Hepatocyte Differentiation, Liver Development, and Response to Drug InductionS⃞

    PubMed Central

    Li, Dan; Gaedigk, Roger; Hart, Steven N.; Leeder, J. Steven

    2012-01-01

    Cytochrome P450 3A4 (CYP3A4) metabolizes more than 50% of prescribed drugs. The expression of CYP3A4 changes during liver development and may be affected by the administration of some drugs. Alternative mRNA transcripts occur in more than 90% of human genes and are frequently observed in cells responding to developmental and environmental signals. Different mRNA transcripts may encode functionally distinct proteins or contribute to variability of mRNA stability or protein translation efficiency. The purpose of this study was to examine expression of alternative CYP3A4 mRNA transcripts in hepatocytes in response to developmental signals and drugs. cDNA cloning and RNA sequencing (RNA-Seq) were used to identify CYP3A4 mRNA transcripts. Three transcripts were found in HepaRG cells and liver tissues: one represented a canonical mRNA with full-length 3′-untranslated region (UTR), one had a shorter 3′-UTR, and one contained partial intron-6 retention. The alternative mRNA transcripts were validated by either rapid amplification of cDNA 3′-end or endpoint polymerase chain reaction (PCR). Quantification of the transcripts by RNA-Seq and real time quantitative PCR revealed that the CYP3A4 transcript with shorter 3′-UTR was preferentially expressed in developed livers, differentiated hepatocytes, and in rifampicin- and phenobarbital-induced hepatocytes. The CYP3A4 transcript with shorter 3′-UTR was more stable and produced more protein compared with the CYP3A4 transcript with canonical 3′-UTR. We conclude that the 3′-end processing of CYP3A4 contributes to the quantitative regulation of CYP3A4 gene expression through alternative polyadenylation, which may serve as a regulatory mechanism explaining changes of CYP3A4 expression and activity during hepatocyte differentiation and liver development and in response to drug induction. PMID:21998292

  10. Structural characterization and chromosomal location of the mouse macrophage migration inhibitory factor gene and pseudogenes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bozza, M.; Gerard, C.; Kolakowski, L.F. Jr.

    1995-06-10

    Macrophage migration inhibitory factor, MIF, is a cytokine released by T-lymphocytes, macrophages, and the pituitary gland that serves to integrate peripheral and central inflammatory responses. Ubiquitous expression and developmental regulation suggest that MIF may have additional roles outside of the immune system. Here we report the structure and chromosomal location of the mouse Mif gene and the partial characterization of five Mif pseudogenes. The mouse Mif gene spans less than 0.7 kb of chromosomal DNA and is composed of three exons. A comparison between the mouse and the human genes shows a similar gene structure and common regulatory elements inmore » both promoter regions. The mouse Mif gene maps to the middle region of chromosome 10, between Bcr and S100b, which have been mapped to human chromosomes 22q11 and 21q22.3, respectively. The entire sequence of two pseudogenes demonstrates the absence of introns, the presence of the 5{prime} untranslated region of the cDNA, a 3{prime} poly(A) tail, and the lack of sequence similarity with untranscribed regions of the gene. The five pseudogenes are highly homologous to the cDNA, but contain a variable number of mutations that would produce mutated or truncated MIF-like proteins. Phylogenetic analyses of MIF genes and pseudogenes indicate several independent genetic events that can account for multiple genomic integrations. Three of the Mif pseudogenes were also mapped by interspecific backcross to chromosomes 1, 9, and 17. These results suggest that Mif pseudogenes originated by retrotransposition. 46 refs., 5 figs., 1 tab.« less

  11. Intriguing Balancing Selection on the Intron 5 Region of LMBR1 in Human Population

    PubMed Central

    He, Fang; Wu, Dong-Dong; Kong, Qing-Peng; Zhang, Ya-Ping

    2008-01-01

    Background The intron 5 of gene LMBR1 is the cis-acting regulatory module for the sonic hedgehog (SHH) gene. Mutation in this non-coding region is associated with preaxial polydactyly, and may play crucial roles in the evolution of limb and skeletal system. Methodology/Principal Findings We sequenced a region of the LMBR1 gene intron 5 in East Asian human population, and found a significant deviation of Tajima's D statistics from neutrality taking human population growth into account. Data from HapMap also demonstrated extended linkage disequilibrium in the region in East Asian and European population, and significantly low degree of genetic differentiation among human populations. Conclusion/Significance We proposed that the intron 5 of LMBR1 was presumably subject to balancing selection during the evolution of modern human. PMID:18698406

  12. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Monte, D.; Coutte, L.; Dewitte, F.

    The ERM protein belongs to the family of Ets transcription factors. We show here that the human ERM gene is organized into 14 exons distributed along 65 kb of genomic DNA on chromosome 3. The two main functional domains of ERM, the acidic domain and the DNA-binding ETS domain, are overlapped by three different exons each. The 3{prime}-untranslated region of ERM is 2.1 kb, whereas the 5{prime}-untranslated region is about 0.3 kb; this allows the transcription of ERM transcripts of approximately 4 kb. The human ERM gene is localized to the q27-q29 region of chromosome 3. 17 refs., 3 figs.

  13. Extensive intron gain in the ancestor of placental mammals

    PubMed Central

    2011-01-01

    Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The findings of this comprehensive study slightly challenge the current view on the evolutionary stasis in intron dynamics during the last 100 - 200 My. Domesticated genes could constitute an excellent system on which to analyse the mechanisms of intron gain in placental mammals. Reviewers: this article was reviewed by Dan Graur, Eugene V. Koonin and Jürgen Brosius. PMID:22112745

  14. Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature

    PubMed Central

    Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela

    2003-01-01

    The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896

  15. Secondary structure model of the RNA recognized by the reverse transcriptase from the R2 retrotransposable element.

    PubMed Central

    Mathews, D H; Banerjee, A R; Luan, D D; Eickbush, T H; Turner, D H

    1997-01-01

    RNA transcripts corresponding to the 250-nt 3' untranslated region of the R2 non-LTR retrotransposable element are recognized by the R2 reverse transcriptase and are sufficient to serve as templates in the target DNA-primed reverse transcription (TPRT) reaction. The R2 protein encoded by the Bombyx mori R2 can recognize this region from both the B. mori and Drosophila melanogaster R2 elements even though these regions show little nucleotide sequence identity. A model for the RNA secondary structure of the 3' untranslated region of the D. melanogaster R2 retrotransposon was developed by sequence comparison of 10 species aided by free energy minimization. Chemical modification experiments are consistent with this prediction. A secondary structure model for the 3' untranslated region of R2 RNA from the R2 element from B. mori was obtained by a combination of chemical modification data and free energy minimization. These two secondary structure models, found independently, share several common sites. This study shows the utility of combining free energy minimization, sequence comparison, and chemical modification to model an RNA secondary structure. PMID:8990394

  16. Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

    PubMed Central

    Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

    1988-01-01

    In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125

  17. SelTarbase, a database of human mononucleotide-microsatellite mutations and their potential impact to tumorigenesis and immunology

    PubMed Central

    Woerner, Stefan M.; Yuan, Yan P.; Benner, Axel; Korff, Sebastian; von Knebel Doeberitz, Magnus; Bork, Peer

    2010-01-01

    About 15% of human colorectal cancers and, at varying degrees, other tumor entities as well as nearly all tumors related to Lynch syndrome are hallmarked by microsatellite instability (MSI) as a result of a defective mismatch repair system. The functional impact of resulting mutations depends on their genomic localization. Alterations within coding mononucleotide repeat tracts (MNRs) can lead to protein truncation and formation of neopeptides, whereas alterations within untranslated MNRs can alter transcription level or transcript stability. These mutations may provide selective advantage or disadvantage to affected cells. They may further concern the biology of microsatellite unstable cells, e.g. by generating immunogenic peptides induced by frameshifts mutations. The Selective Targets database (http://www.seltarbase.org) is a curated database of a growing number of public MNR mutation data in microsatellite unstable human tumors. Regression calculations for various MSI–H tumor entities indicating statistically deviant mutation frequencies predict TGFBR2, BAX, ACVR2A and others that are shown or highly suspected to be involved in MSI tumorigenesis. Many useful tools for further analyzing genomic DNA, derived wild-type and mutated cDNAs and peptides are integrated. A comprehensive database of all human coding, untranslated, non-coding RNA- and intronic MNRs (MNR_ensembl) is also included. Herewith, SelTarbase presents as a plenty instrument for MSI-carcinogenesis-related research, diagnostics and therapy. PMID:19820113

  18. Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity

    PubMed Central

    Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.

    2014-01-01

    Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953

  19. Comparative phylogenomic analysis provides insights into TCP gene functions in Sorghum

    PubMed Central

    Francis, Aleena; Dhaka, Namrata; Bakshi, Mohit; Jung, Ki-Hong; Sharma, Manoj K.; Sharma, Rita

    2016-01-01

    Sorghum is a highly efficient C4 crop with potential to mitigate challenges associated with food, feed and fuel. TCP proteins are of particular interest for crop improvement programs due to their well-demonstrated roles in crop domestication and shaping plant architecture thereby, affecting agronomic traits. We identified 20 TCP genes from Sorghum. Except SbTCP8, all are either intronless or contain introns in the untranslated regions. Comparative phylogenetic analysis of Arabidopsis, rice, Brachypodium and Sorghum TCP proteins revealed two distinct classes categorized into ten sub-clades. Sub-clade F is dicot-specific, whereas A2, G1 and I1 groups only contained genes from grasses. Sub-clade B was missing in Sorghum, whereas group A1 was missing in rice indicating species-specific divergence of TCP proteins. TCP proteins of Sorghum are enriched in disorder promoting residues with class I containing higher percent disorder than class II proteins. Seven pairs of paralogous TCP genes were identified from Sorghum, five of which seem to predate Rice-Sorghum divergence. All of them have diverged in their expression. Based on the expression and orthology analysis, five Sorghum genes have been shortlisted for further investigation for their roles in regulating plant morphology. Whereas, three genes have been identified as candidates for engineering abiotic stress tolerance. PMID:27917941

  20. Comprehensive genetic testing for female and male infertility using next-generation sequencing.

    PubMed

    Patel, Bonny; Parets, Sasha; Akana, Matthew; Kellogg, Gregory; Jansen, Michael; Chang, Chihyu; Cai, Ying; Fox, Rebecca; Niknazar, Mohammad; Shraga, Roman; Hunter, Colby; Pollock, Andrew; Wisotzkey, Robert; Jaremko, Malgorzata; Bisignano, Alex; Puig, Oscar

    2018-05-19

    To develop a comprehensive genetic test for female and male infertility in support of medical decisions during assisted reproductive technology (ART) protocols. We developed a next-generation sequencing (NGS) gene panel consisting of 87 genes including promoters, 5' and 3' untranslated regions, exons, and selected introns. In addition, sex chromosome aneuploidies and Y chromosome microdeletions were analyzed concomitantly using the same panel. The NGS panel was analytically validated by retrospective analysis of 118 genomic DNA samples with known variants in loci representative of female and male infertility. Our results showed analytical accuracy of > 99%, with > 98% sensitivity for single-nucleotide variants (SNVs) and > 91% sensitivity for insertions/deletions (indels). Clinical sensitivity was assessed with samples containing variants representative of male and female infertility, and it was 100% for SNVs/indels, CFTR IVS8-5T variants, sex chromosome aneuploidies, and copy number variants (CNVs) and > 93% for Y chromosome microdeletions. Cost analysis shows potential savings when comparing this single NGS assay with the standard approach, which includes multiple assays. A single, comprehensive, NGS panel can simplify the ordering process for healthcare providers, reduce turnaround time, and lower the overall cost of testing for genetic assessment of infertility in females and males, while maintaining accuracy.

  1. Comparative phylogenomic analysis provides insights into TCP gene functions in Sorghum.

    PubMed

    Francis, Aleena; Dhaka, Namrata; Bakshi, Mohit; Jung, Ki-Hong; Sharma, Manoj K; Sharma, Rita

    2016-12-05

    Sorghum is a highly efficient C4 crop with potential to mitigate challenges associated with food, feed and fuel. TCP proteins are of particular interest for crop improvement programs due to their well-demonstrated roles in crop domestication and shaping plant architecture thereby, affecting agronomic traits. We identified 20 TCP genes from Sorghum. Except SbTCP8, all are either intronless or contain introns in the untranslated regions. Comparative phylogenetic analysis of Arabidopsis, rice, Brachypodium and Sorghum TCP proteins revealed two distinct classes categorized into ten sub-clades. Sub-clade F is dicot-specific, whereas A2, G1 and I1 groups only contained genes from grasses. Sub-clade B was missing in Sorghum, whereas group A1 was missing in rice indicating species-specific divergence of TCP proteins. TCP proteins of Sorghum are enriched in disorder promoting residues with class I containing higher percent disorder than class II proteins. Seven pairs of paralogous TCP genes were identified from Sorghum, five of which seem to predate Rice-Sorghum divergence. All of them have diverged in their expression. Based on the expression and orthology analysis, five Sorghum genes have been shortlisted for further investigation for their roles in regulating plant morphology. Whereas, three genes have been identified as candidates for engineering abiotic stress tolerance.

  2. Molecular and bioinformatical characterization of a novel superfamily of cysteine-rich peptides from arthropods.

    PubMed

    Zeng, Xian-Chun; Nie, Yao; Luo, Xuesong; Wu, Shifen; Shi, Wanxia; Zhang, Lei; Liu, Yichen; Cao, Hanjun; Yang, Ye; Zhou, Jianping

    2013-03-01

    The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss. Copyright © 2012 Elsevier Inc. All rights reserved.

  3. The chloroplast tRNALys(UUU) gene from mustard (Sinapis alba) contains a class II intron potentially coding for a maturase-related polypeptide.

    PubMed

    Neuhaus, H; Link, G

    1987-01-01

    The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.

  4. Association of the distal region of the ectonucleotide pyrophosphatase/phosphodiesterase 1 gene with type 2 diabetes in an African-American population enriched for nephropathy.

    PubMed

    Keene, Keith L; Mychaleckyj, Josyf C; Smith, Shelly G; Leak, Tennille S; Perlegas, Peter S; Langefeld, Carl D; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2008-04-01

    Variants in the ectonucleotide pyrophosphatase/phosphodiesterase 1 (ENPP1) gene have shown positive associations with diabetes and related phenotypes, including insulin resistance, metabolic syndrome, and type 1 diabetic nephropathy. Additionally, evidence for linkage for type 2 diabetes in African Americans was observed at 6q24-27, with the proximal edge of the peak encompassing the ENPP1 gene. Our objective was to comprehensively evaluate variants in ENPP1 for association with type 2 diabetic end-stage renal disease (ESRD). Forty-nine single nucleotide polymorphisms (SNPs) located in the coding and flanking regions of ENPP1 were genotyped in 577 African-American individuals with type 2 diabetic ESRD and 596 African-American control subjects. Haplotypic association and genotypic association for the dominant, additive, and recessive models were tested by calculating a chi(2) statistic and corresponding P value. Nine SNPs showed nominal evidence for association (P < 0.05) with type 2 diabetic ESRD in one or more genotypic model. The most significant associations were observed with rs7754586 (P = 0.003 dominant model, P = 0.0005 additive, and P = 0.007 recessive), located in the 3' untranslated region, and an intron 24 SNP (rs1974201: P = 0.004 dominant, P = 0.0005 additive, and P = 0.005 recessive). However, the extensively studied K121Q variant (rs1044498) did not reveal evidence for association with type 2 diabetic ESRD in this African-American population. This study was the first to comprehensively evaluate variants of the ENPP1 gene for association in an African-American population with type 2 diabetes and ESRD and suggests that variants in the distal region of the ENPP1 gene may contribute to diabetes or diabetic nephropathy susceptibility in African Americans.

  5. Genomewide analysis indicates that queen larvae have lower methylation levels in the honey bee ( Apis mellifera)

    NASA Astrophysics Data System (ADS)

    Shi, Yuan Yuan; Yan, Wei Yu; Huang, Zachary Y.; Wang, Zi Long; Wu, Xiao Bo; Zeng, Zhi Jiang

    2013-02-01

    The honey bee is a social insect characterized by caste differentiation, by which a young larva can develop into either a queen or a worker. Despite possessing the same genome, queen and workers display marked differences in reproductive capacity, physiology, and behavior. Recent studies have shown that DNA methylation plays important roles in caste differentiation. To further explore the roles of DNA methylation in this process, we analyzed DNA methylome profiles of both queen larvae (QL) and worker larvae (WL) of different ages (2, 4, and 6 day old), by using methylated DNA immunoprecipitation-sequencing (meDIP-seq) technique. The global DNA methylation levels varied between the larvae of two castes. DNA methylation increased from 2-day- to 4-day-old QL and then decreased in 6-day-old larvae. In WL, methylation levels increased with age. The methylcytosines in both larvae were enriched in introns, followed by coding sequence (CDS) regions, CpG islands, 2 kbp downstream and upstream of genes, and 5' and 3' untranslated regions (UTRs). The number of differentially methylated genes (DMGs) in 2-, 4-, and 6-day-old QL and WL was 725, 3,013, and 5,049, respectively. Compared to 4- and 6-day-old WL, a large number of genes in QL were downmethylated, which were involved in many processes including development, reproduction, and metabolic regulation. In addition, some DMGs were concerned with caste differentiation.

  6. Genome Wide Analysis of Fatty Acid Desaturation and Its Response to Temperature1[OPEN

    PubMed Central

    Menard, Guillaume N.; Moreno, Jose Martin; Bryant, Fiona M.; Munoz-Azcarate, Olaya; Hassani-Pak, Keywan; Kurup, Smita

    2017-01-01

    Plants modify the polyunsaturated fatty acid content of their membrane and storage lipids in order to adapt to changes in temperature. In developing seeds, this response is largely controlled by the activities of the microsomal ω-6 and ω-3 fatty acid desaturases, FAD2 and FAD3. Although temperature regulation of desaturation has been studied at the molecular and biochemical levels, the genetic control of this trait is poorly understood. Here, we have characterized the response of Arabidopsis (Arabidopsis thaliana) seed lipids to variation in ambient temperature and found that heat inhibits both ω-6 and ω-3 desaturation in phosphatidylcholine, leading to a proportional change in triacylglycerol composition. Analysis of the 19 parental accessions of the multiparent advanced generation intercross (MAGIC) population showed that significant natural variation exists in the temperature responsiveness of ω-6 desaturation. A combination of quantitative trait locus (QTL) analysis and genome-wide association studies (GWAS) using the MAGIC population suggests that ω-6 desaturation is largely controlled by cis-acting sequence variants in the FAD2 5′ untranslated region intron that determine the expression level of the gene. However, the temperature responsiveness of ω-6 desaturation is controlled by a separate QTL on chromosome 2. The identity of this locus is unknown, but genome-wide association studies identified potentially causal sequence variants within ∼40 genes in an ∼450-kb region of the QTL. PMID:28108698

  7. Characterization of the molecular basis of group II intron RNA recognition by CRS1-CRM domains.

    PubMed

    Keren, Ido; Klipcan, Liron; Bezawork-Geleta, Ayenachew; Kolton, Max; Shaya, Felix; Ostersetzer-Biran, Oren

    2008-08-22

    CRM (chloroplast RNA splicing and ribosome maturation) is a recently recognized RNA-binding domain of ancient origin that has been retained in eukaryotic genomes only within the plant lineage. Whereas in bacteria CRM domains exist as single domain proteins involved in ribosome maturation, in plants they are found in a family of proteins that contain between one and four repeats. Several members of this family with multiple CRM domains have been shown to be required for the splicing of specific plastidic group II introns. Detailed biochemical analysis of one of these factors in maize, CRS1, demonstrated its high affinity and specific binding to the single group II intron whose splicing it facilitates, the plastid-encoded atpF intron RNA. Through its association with two intronic regions, CRS1 guides the folding of atpF intron RNA into its predicted "catalytically active" form. To understand how multiple CRM domains cooperate to achieve high affinity sequence-specific binding to RNA, we analyzed the RNA binding affinity and specificity associated with each individual CRM domain in CRS1; whereas CRM3 bound tightly to the RNA, CRM1 associated specifically with a unique region found within atpF intron domain I. CRM2, which demonstrated only low binding affinity, also seems to form specific interactions with regions localized to domains I, III, and IV. We further show that CRM domains share structural similarities and RNA binding characteristics with the well known RNA recognition motif domain.

  8. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

    PubMed

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-10-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.

  9. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

    PubMed Central

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-01-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785

  10. BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES

    EPA Science Inventory

    Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...

  11. A conservative assessment of the major genetic causes of idiopathic chronic pancreatitis: data from a comprehensive analysis of PRSS1, SPINK1, CTRC and CFTR genes in 253 young French patients.

    PubMed

    Masson, Emmanuelle; Chen, Jian-Min; Audrézet, Marie-Pierre; Cooper, David N; Férec, Claude

    2013-01-01

    Idiopathic chronic pancreatitis (ICP) has traditionally been defined as chronic pancreatitis in the absence of any obvious precipitating factors (e.g. alcohol abuse) and family history of the disease. Studies over the past 15 years have revealed that ICP has a highly complex genetic architecture involving multiple gene loci. Here, we have attempted to provide a conservative assessment of the major genetic causes of ICP in a sample of 253 young French ICP patients. For the first time, conventional types of mutation (comprising coding sequence variants and variants at intron/exon boundaries) and gross genomic rearrangements were screened for in all four major pancreatitis genes, PRSS1, SPINK1, CTRC and CFTR. For the purposes of the study, synonymous, intronic and 5'- or 3'-untranslated region variants were excluded from the analysis except where there was persuasive evidence of functional consequences. The remaining sequence variants/genotypes were classified into causative, contributory or neutral categories by consideration of (i) their allele frequencies in patient and normal control populations, (ii) their presumed or experimentally confirmed functional effects, (iii) the relative importance of their associated genes in the pathogenesis of chronic pancreatitis and (iv) gene-gene interactions wherever applicable. Adoption of this strategy allowed us to assess the pathogenic relevance of specific variants/genotypes to their respective carriers to an unprecedented degree. The genetic cause of ICP could be assigned in 23.7% of individuals in the study group. A strong genetic susceptibility factor was also present in an additional 24.5% of cases. Taken together, up to 48.2% of the studied ICP patients were found to display evidence of a genetic basis for their pancreatitis. Whereas these particular proportions may not be extrapolable to all ICP patients, the approach employed should serve as a useful framework for acquiring a better understanding of the role of genetic factors in causing this oligogenic disease.

  12. Analysis of Claviceps africana and C. sorghi from India using AFLPs, EF-1alpha gene intron 4, and beta-tubulin gene intron 3.

    PubMed

    Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie

    2006-04-01

    Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.

  13. Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA

    PubMed Central

    Howe, Kenneth James; Ares, Manuel

    1997-01-01

    Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473

  14. Cell proteins bind to multiple sites within the 5' untranslated region of poliovirus RNA.

    PubMed Central

    del Angel, R M; Papavassiliou, A G; Fernández-Tomás, C; Silverstein, S J; Racaniello, V R

    1989-01-01

    The 5' noncoding region of poliovirus RNA contains sequences necessary for translation and replication. These functions are probably carried out by recognition of poliovirus RNA by cellular and/or viral proteins. Using a mobility-shift electrophoresis assay and 1,10-phenanthroline/Cu+ footprinting, we demonstrate specific binding of cytoplasmic factors with a sequence from nucleotides 510-629 within the 5' untranslated region (UTR). Complex formation was also observed with a second sequence (nucleotides 97-182) within the 5' UTR. These two regions of the 5' UTR appear to be recognized by distinct cell factors as determined by competition analysis and the effects of ionic strength on complex formation. However, both complexes contain eukaryotic initiation factor 2 alpha, as revealed by their reaction with specific antibody. Images PMID:2554308

  15. Insertion of a self-splicing intron into the mtDNA of atriploblastic animal

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Valles, Y.; Halanych, K.; Boore, J.L.

    2006-04-14

    Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less

  16. Sdt97: A Point Mutation in the 5′ Untranslated Region Confers Semidwarfism in Rice

    PubMed Central

    Tong, Jiping; Han, Zhengshu; Han, Aonan; Liu, Xuejun; Zhang, Shiyong; Fu, Binying; Hu, Jun; Su, Jingping; Li, Shaoqing; Wang, Shengjun; Zhu, Yingguo

    2016-01-01

    Semidwarfism is an important agronomic trait in rice breeding programs. The semidwarf mutant gene Sdt97 was previously described. However, the molecular mechanism underlying the mutant is yet to be elucidated. In this study, we identified the mutant gene by a map-based cloning method. Using a residual heterozygous line (RHL) population, Sdt97 was mapped to the long arm of chromosome 6 in the interval of nearly 60 kb between STS marker N6 and SNP marker N16 within the PAC clone P0453H04. Sequencing of the candidate genes in the target region revealed that a base transversion from G to C occurred in the 5′ untranslated region of Sdt97. qRT-PCR results confirmed that the transversion induced an obvious change in the expression pattern of Sdt97 at different growth and developmental stages. Plants transgenic for Sdt97 resulted in the restoration of semidwarfism of the mutant phenotype, or displayed a greater dwarf phenotype than the mutant. Our results indicate that a point mutation in the 5′ untranslated region of Sdt97 confers semidwarfism in rice. Functional analysis of Sdt97 will open a new field of study for rice semidwarfism, and also expand our knowledge of the molecular mechanism of semidwarfism in rice. PMID:27172200

  17. Differential contribution of genomic regions to marked genetic variation and prediction of quantitative traits in broiler chickens.

    PubMed

    Abdollahi-Arpanahi, Rostam; Morota, Gota; Valente, Bruno D; Kranis, Andreas; Rosa, Guilherme J M; Gianola, Daniel

    2016-02-03

    Genome-wide association studies in humans have found enrichment of trait-associated single nucleotide polymorphisms (SNPs) in coding regions of the genome and depletion of these in intergenic regions. However, a recent release of the ENCyclopedia of DNA elements showed that ~80 % of the human genome has a biochemical function. Similar studies on the chicken genome are lacking, thus assessing the relative contribution of its genic and non-genic regions to variation is relevant for biological studies and genetic improvement of chicken populations. A dataset including 1351 birds that were genotyped with the 600K Affymetrix platform was used. We partitioned SNPs according to genome annotation data into six classes to characterize the relative contribution of genic and non-genic regions to genetic variation as well as their predictive power using all available quality-filtered SNPs. Target traits were body weight, ultrasound measurement of breast muscle and hen house egg production in broiler chickens. Six genomic regions were considered: intergenic regions, introns, missense, synonymous, 5' and 3' untranslated regions, and regions that are located 5 kb upstream and downstream of coding genes. Genomic relationship matrices were constructed for each genomic region and fitted in the models, separately or simultaneously. Kernel-based ridge regression was used to estimate variance components and assess predictive ability. Contribution of each class of genomic regions to dominance variance was also considered. Variance component estimates indicated that all genomic regions contributed to marked additive genetic variation and that the class of synonymous regions tended to have the greatest contribution. The marked dominance genetic variation explained by each class of genomic regions was similar and negligible (~0.05). In terms of prediction mean-square error, the whole-genome approach showed the best predictive ability. All genic and non-genic regions contributed to phenotypic variation for the three traits studied. Overall, the contribution of additive genetic variance to the total genetic variance was much greater than that of dominance variance. Our results show that all genomic regions are important for the prediction of the targeted traits, and the whole-genome approach was reaffirmed as the best tool for genome-enabled prediction of quantitative traits.

  18. AML1/ETO trans-activates c-KIT expression through the long range interaction between promoter and intronic enhancer.

    PubMed

    Tian, Ying; Wang, Genjie; Hu, Qingzhu; Xiao, Xichun; Chen, Shuxia

    2018-04-01

    The AML1/ETO onco-fusion protein is crucial for the genesis of t(8;21) acute myeloid leukemia (AML) and is well documented as a transcriptional repressor through dominant-negative effect. However, little is known about the transactivation mechanism of AML1/ETO. Through large cohort of patient's expression level data analysis and a series of experimental validation, we report here that AML1/ETO transactivates c-KIT expression through directly binding to and mediating the long-range interaction between the promoter and intronic enhancer regions of c-KIT. Gene expression analyses verify that c-KIT expression is significantly high in t(8;21) AML. Further ChIP-seq analysis and motif scanning identify two regulatory regions located in the promoter and intronic enhancer region of c-KIT, respectively. Both regions are enriched by co-factors of AML1/ETO, such as AML1, CEBPe, c-Jun, and c-Fos. Further luciferase reporter assays show that AML1/ETO trans-activates c-KIT promoter activity through directly recognizing the AML1 motif and the co-existence of co-factors. The induction of c-KIT promoter activity is reinforced with the existence of intronic enhancer region. Furthermore, ChIP-3C-qPCR assays verify that AML1/ETO mediates the formation of DNA-looping between the c-KIT promoter and intronic enhancer region through the long-range interaction. Collectively, our data uncover a novel transcriptional activity mechanism of AML1/ETO and enrich our knowledge of the onco-fusion protein mediated transcription regulation. © 2017 Wiley Periodicals, Inc.

  19. Isolation and identification of gene-specific microRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2006-01-01

    Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.

  20. Isolation and identification of gene-specific microRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2013-01-01

    Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.

  1. Beta-globin LCR and intron elements cooperate and direct spatial reorganization for gene therapy.

    PubMed

    Buzina, Alla; Lo, Mandy Y M; Moffett, Angela; Hotta, Akitsu; Fussner, Eden; Bharadwaj, Rikki R; Pasceri, Peter; Garcia-Martinez, J Victor; Bazett-Jones, David P; Ellis, James

    2008-04-11

    The Locus Control Region (LCR) requires intronic elements within beta-globin transgenes to direct high level expression at all ectopic integration sites. However, these essential intronic elements cannot be transmitted through retrovirus vectors and their deletion may compromise the therapeutic potential for gene therapy. Here, we systematically regenerate functional beta-globin intron 2 elements that rescue LCR activity directed by 5'HS3. Evaluation in transgenic mice demonstrates that an Oct-1 binding site and an enhancer in the intron cooperate to increase expression levels from LCR globin transgenes. Replacement of the intronic AT-rich region with the Igmu 3'MAR rescues LCR activity in single copy transgenic mice. Importantly, a combination of the Oct-1 site, Igmu 3'MAR and intronic enhancer in the BGT158 cassette directs more consistent levels of expression in transgenic mice. By introducing intron-modified transgenes into the same genomic integration site in erythroid cells, we show that BGT158 has the greatest transcriptional induction. 3D DNA FISH establishes that induction stimulates this small 5'HS3 containing transgene and the endogenous locus to spatially reorganize towards more central locations in erythroid nuclei. Electron Spectroscopic Imaging (ESI) of chromatin fibers demonstrates that ultrastructural heterochromatin is primarily perinuclear and does not reorganize. Finally, we transmit intron-modified globin transgenes through insulated self-inactivating (SIN) lentivirus vectors into erythroid cells. We show efficient transfer and robust mRNA and protein expression by the BGT158 vector, and virus titer improvements mediated by the modified intron 2 in the presence of an LCR cassette composed of 5'HS2-4. Our results have important implications for the mechanism of LCR activity at ectopic integration sites. The modified transgenes are the first to transfer intronic elements that potentiate LCR activity and are designed to facilitate correction of hemoglobinopathies using single copy vectors.

  2. Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci.

    PubMed

    Mills, D A; McKay, L L; Dunny, G M

    1996-06-01

    Analysis of a region involved in the conjugative transfer of the lactococcal conjugative element pRS01 has revealed a bacteria] group II intron. Splicing of this lactococcal intron (designated Ll.ltrB) in vivo resulted in the ligation of two exon messages (ltrBE1 and ltrBE2) which encoded a putative conjugative relaxase essential for the transfer of pRS01. Like many group II introns, the Ll.ltrB intron possessed an open reading frame (ltrA) with homology to reverse transcriptases. Remarkably, sequence analysis of ltrA suggested a greater similarity to open reading frames encoded by eukaryotic mitochondrial group II introns than to those identified to date from other bacteria. Several insertional mutations within ltrA resulted in plasmids exhibiting a conjugative transfer-deficient phenotype. These results provide the first direct evidence for splicing of a prokaryotic group II intron in vivo and suggest that conjugative transfer is a mechanism for group II intron dissemination in bacteria.

  3. The complete plastid genome sequence of Eustrephus latifolius (Asparagaceae: Lomandroideae).

    PubMed

    Kim, Hyoung Tae; Kim, Jung Sung; Kim, Joo-Hwan

    2016-01-01

    The complete chloroplast (cp) genome sequence of Eustrephus latifolius was firstly determined in subfamily Lomandriodeae of family Asparagaceae. It was 159,736 bp and contained a large single copy region (82,403 bp) and a small single copy region (13,607 bp) which were separated by two inverted repeat regions (31,863 bp). In total, 132 genes were identified and they were consisted of 83 coding genes, 8 rRNA genes, 38 tRNA genes, 3 pseudogenes. rpl23 and clpP were pseudogenes due to sequence deletions. Among 23 genes containing introns, rps12 and ycf3 contained two introns and the rest had just one intron. The intact ycf68 was identified within an intron of trnI-GAU. The amino acid sequence was almost identical with Phoenix dactylifera in Aracales. Ycf1 of E. latifolius was completely located in IR. It was similar to cp genome structure of Lemna minor, Spirodela polyrhiza, Wolffiella lingulata, Wolffia australiana in Alismatales.

  4. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan

    PubMed Central

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-01-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621

  5. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan.

    PubMed

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-04-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.

  6. Sequence analysis of tau 3'untranslated region and saitohin gene in sporadic progressive supranuclear palsy

    PubMed Central

    Ezquerra, M; Campdelacreu, J; Munoz, E; Oliva, R; Tolosa, E

    2004-01-01

    Objectives: To search for genetic changes in the 3'untranslated region (3'UTR) of tau and adjacent sequence LOC147077, and in the coding region of STH in PSP patients. Methods: The study included 57 PSP patients and 83 healthy controls. The genetic analysis of each region was performed through sequencing. The Q7R polymorphism was studied through restriction enzyme and electrophoresis analysis. Results: No mutations were found in the regions analysed. The QQ genotype of the STH polymorphism was over-represented in participants with PSP (91.5%) compared with control subjects (47%) (p⩽0.00001). This genotype co-segregated with the H1/H1 haplotype in our PSP cases. Conclusions: Our results do not support a major role for the tau 3'UTR in PSP genetics. The QQ genotype of STH confers susceptibility for PSP and is in linkage disequilibrium with the H1/H1 haplotype. PMID:14707330

  7. Sequence of the structural gene for granule-bound starch synthase of potato (Solanum tuberosum L.) and evidence for a single point deletion in the amf allele.

    PubMed

    van der Leij, F R; Visser, R G; Ponstein, A S; Jacobsen, E; Feenstra, W J

    1991-08-01

    The genomic sequence of the potato gene for starch granule-bound starch synthase (GBSS; "waxy protein") has been determined for the wild-type allele of a monoploid genotype from which an amylose-free (amf) mutant was derived, and for the mutant part of the amf allele. Comparison of the wild-type sequence with a cDNA sequence from the literature and a newly isolated cDNA revealed the presence of 13 introns, the first of which is located in the untranslated leader. The promoter contains a G-box-like sequence. The deduced amino acid sequence of the precursor of GBSS shows a high degree of identity with monocot waxy protein sequences in the region corresponding to the mature form of the enzyme. The transit peptide of 77 amino acids, required for routing of the precursor to the plastids, shows much less identity with the transit peptides of the other waxy preproteins, but resembles the hydropathic distributions of these peptides. Alignment of the amino acid sequences of the four mature starch synthases with the Escherichia coli glgA gene product revealed the presence of at least three conserved boxes; there is no homology with previously proposed starch-binding domains of other enzymes involved in starch metabolism. We report the use of chimeric constructs with wild-type and amf sequences to localize, via complementation experiments, the region of the amf allele in which the mutation resides. Direct sequencing of polymerase chain reaction products confirmed that the amf mutation is a deletion of a single AT basepair in the region coding for the transit peptide.(ABSTRACT TRUNCATED AT 250 WORDS)

  8. Mutation spectrum of the Norrie disease pseudoglioma (NDP) gene in Indian patients with FEVR.

    PubMed

    Musada, Ganeswara Rao; Jalali, Subhadra; Hussain, Anjli; Chururu, Anupama Reddy; Gaddam, Pramod Reddy; Chakrabarti, Subhabrata; Kaur, Inderjeet

    2016-01-01

    Mutations in the Norrie disease pseudoglioma (NDP; Xp11.3) gene have been involved in retinal blood vessel formation and neural differentiation and are implicated in familial exudative vitreoretinopathy (FEVR) cases. However, the role of the gene has not been explored in the Indian context. Thus, this study was designed to understand the involvement of NDP among Indian patients with FEVR. The study cohort comprised 225 subjects, including unrelated patients with FEVR (n = 110) and ethnically matched healthy subjects (n = 115) recruited from a tertiary eye care center in India. The entire coding regions, intron-exon boundaries, along with the 5' and 3' untranslated regions of NDP were screened with resequencing following standard protocols. The spectrum of the observed variants was analyzed in conjunction with data available from other populations. Eight potentially pathogenic mutations (p.His4ArgfsX21, p.Asp23GlufsX9, p.Ile48ValfsX55, p.His50Asp, p.Ser57*, p.Gly113Asp, p.Arg121Gln, and p.Cys126Arg, including five novel ones), were observed in the coding region of the NDP gene in ten unrelated FEVR probands (9%). The novel changes were not observed in the control subjects and were unavailable in the dbSNP, ESP5400, NIEHS95, and ExAC databases. All probands with NDP mutations exhibited classical features of the disease as observed among patients with FEVR worldwide. This is perhaps the first study to demonstrate the involvement of NDP among patients with Indian FEVR that further expands its mutation spectrum. The data generated could have broad implications in genetic counseling, disease management, and early intervention for a better prognosis in FEVR.

  9. GeneBuilder: interactive in silico prediction of gene structure.

    PubMed

    Milanesi, L; D'Angelo, D; Rogozin, I B

    1999-01-01

    Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.

  10. Genome-scale characterization of RNA tertiary structures and their functional impact by RNA solvent accessibility prediction.

    PubMed

    Yang, Yuedong; Li, Xiaomei; Zhao, Huiying; Zhan, Jian; Wang, Jihua; Zhou, Yaoqi

    2017-01-01

    As most RNA structures are elusive to structure determination, obtaining solvent accessible surface areas (ASAs) of nucleotides in an RNA structure is an important first step to characterize potential functional sites and core structural regions. Here, we developed RNAsnap, the first machine-learning method trained on protein-bound RNA structures for solvent accessibility prediction. Built on sequence profiles from multiple sequence alignment (RNAsnap-prof), the method provided robust prediction in fivefold cross-validation and an independent test (Pearson correlation coefficients, r, between predicted and actual ASA values are 0.66 and 0.63, respectively). Application of the method to 6178 mRNAs revealed its positive correlation to mRNA accessibility by dimethyl sulphate (DMS) experimentally measured in vivo (r = 0.37) but not in vitro (r = 0.07), despite the lack of training on mRNAs and the fact that DMS accessibility is only an approximation to solvent accessibility. We further found strong association across coding and noncoding regions between predicted solvent accessibility of the mutation site of a single nucleotide variant (SNV) and the frequency of that variant in the population for 2.2 million SNVs obtained in the 1000 Genomes Project. Moreover, mapping solvent accessibility of RNAs to the human genome indicated that introns, 5' cap of 5' and 3' cap of 3' untranslated regions, are more solvent accessible, consistent with their respective functional roles. These results support conformational selections as the mechanism for the formation of RNA-protein complexes and highlight the utility of genome-scale characterization of RNA tertiary structures by RNAsnap. The server and its stand-alone downloadable version are available at http://sparks-lab.org. © 2016 Yang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  11. X-linked Charcot-Marie-Tooth (CMT) neuropathies (CMTX1, CMTX2, CMTX3) show different clinical phenotype and molecular genetics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ionasescu, V.V.; Searby, C.C.; Ionasescu, R.

    1994-09-01

    The purpose of this study was to compare the X-linked dominant type CMTX1 (20 families) with X-linked recessive types CMTX2 and CMTX3 (2 families). The clinical phenotype was consistent with CMT peripheral neuropathy in all cases including distal weakness, atrophy and sensory loss, pes cavus and areflexia. Additional clinicial involvement of the central nervous system was present in one family with CMTX2 (mental retardation) and one family with CMTX3 (spastic paraparesis). Tight genetic linkage to Xq13.1 was present in 20 families with CMTX1 (Z=34.07 at {theta}=0) for the marker DXS453. Fifteen of the CMTX1 families showed point mutations of themore » connexin 32 coding region (5 nonsense mutations, 8 missense mutations, 2 deletions). Five CMTX1 neuropathy families showed no evidence of point mutations of the CX32 coding sequence. These findings suggest that the CMTX1 neuropathy genotype in these families may be the result of promoter mutations, 3{prime}-untranslated region mutations or exon/intron splice site mutations or a mutation with a different type of connexin but which has close structural similarities to CX32. No mutations of the CX32 coding region were found in the CMTX2 or CMTX3 families. Linkage to Xq13.1 was excluded in both families. Genetic linkage to Xp22.2 was present in the CMTX2 family (Z=3.54 at {theta}=0) for the markers DXS987 and DXS999. Suggestion of linkage to Xq26 (Z=1.81 at {theta}=0) for the marker DXS86 was present in the CMTX3 family.« less

  12. Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

    PubMed Central

    Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

    1985-01-01

    Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512

  13. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome

    PubMed Central

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895

  14. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

    PubMed

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.

  15. Mechanisms Used for Genomic Proliferation by Thermophilic Group II Introns

    PubMed Central

    Mohr, Georg; Ghanem, Eman; Lambowitz, Alan M.

    2010-01-01

    Mobile group II introns, which are found in bacterial and organellar genomes, are site-specific retroelments hypothesized to be evolutionary ancestors of spliceosomal introns and retrotransposons in higher organisms. Most bacteria, however, contain no more than one or a few group II introns, making it unclear how introns could have proliferated to higher copy numbers in eukaryotic genomes. An exception is the thermophilic cyanobacterium Thermosynechococcus elongatus, which contains 28 closely related copies of a group II intron, constituting ∼1.3% of the genome. Here, by using a combination of bioinformatics and mobility assays at different temperatures, we identified mechanisms that contribute to the proliferation of T. elongatus group II introns. These mechanisms include divergence of DNA target specificity to avoid target site saturation; adaptation of some intron-encoded reverse transcriptases to splice and mobilize multiple degenerate introns that do not encode reverse transcriptases, leading to a common splicing apparatus; and preferential insertion within other mobile introns or insertion elements, which provide new unoccupied sites in expanding non-essential DNA regions. Additionally, unlike mesophilic group II introns, the thermophilic T. elongatus introns rely on elevated temperatures to help promote DNA strand separation, enabling access to a larger number of DNA target sites by base pairing of the intron RNA, with minimal constraint from the reverse transcriptase. Our results provide insight into group II intron proliferation mechanisms and show that higher temperatures, which are thought to have prevailed on Earth during the emergence of eukaryotes, favor intron proliferation by increasing the accessibility of DNA target sites. We also identify actively mobile thermophilic introns, which may be useful for structural studies, gene targeting in thermophiles, and as a source of thermostable reverse transcriptases. PMID:20543989

  16. The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

    PubMed

    Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

    2012-05-01

    This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.

  17. Intron Definition Is Required for Excision of the Minute Virus of Mice Small Intron and Definition of the Upstream Exon

    PubMed Central

    Haut, Donald D.; Pintel, D. J.

    1998-01-01

    Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034

  18. Isolation and Identification of Gene-Specific MicroRNAs.

    PubMed

    Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

    2018-01-01

    Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.

  19. A multigene locus containing the Manx and bobcat genes is required for development of chordate features in the ascidian tadpole larva.

    PubMed

    Swalla, B J; Just, M A; Pederson, E L; Jeffery, W R

    1999-04-01

    The Manx gene is required for the development of the tail and other chordate features in the ascidian tadpole larva. To determine the structure of the Manx gene, we isolated and sequenced genomic clones from the tailed ascidian Molgula oculata. The Manx gene contains 9 exons and encodes both major and minor Manx mRNAs, which differ in the length of their 5' untranslated regions. The coding region of the single-copy bobcat gene, which encodes a DEAD-box RNA helicase, is embedded within the first Manx intron. The organization of the bobcat and Manx transcription units was determined by comparing genomic and cDNA clones. The Manx-bobcat gene locus has an unusual organization in which a non-coding first exon is alternatively spliced at the 5' end of two different mRNAs. The bobcat and Manx genes are expressed coordinately during oogenesis and embryogenesis, but not during spermatogenesis, in which bobcat mRNA accumulates independently of Manx mRNA. Similar to Manx, zygotic bobcat transcripts accumulate in the embryonic primordia responsible for generating chordate features, including the dorsal neural tube and notochord, are downregulated during embryogenesis in the tailless species Molgula occulta and are upregulated in M. occulta X M. oculata hybrids, which restore these chordate features. Antisense experiments indicate that zygotic bobcat expression is required for development of the same suite of chordate features as Manx. The results show that the Manx-bobcat gene complex has a role in the development of chordate features in ascidian tadpole larvae.

  20. Multi-functional acetyl-CoA carboxylase from Brassica napus is encoded by a multi-gene family: indication for plastidic localization of at least one isoform.

    PubMed

    Schulte, W; Töpfer, R; Stracke, R; Schell, J; Martini, N

    1997-04-01

    Three genes coding for different multifunctional acetyl-CoA carboxylase (ACCase; EC 6.4.1.2) isoenzymes from Brassica napus were isolated and divided into two major classes according to structural features in their 5' regions: class I comprises two genes with an additional coding exon of approximately 300 bp at the 5' end, and class II is represented by one gene carrying an intron of 586 bp in its 5' untranslated region. Fusion of the peptide sequence encoded by the additional first exon of a class I ACCase gene to the jellyfish Aequorea victoria green fluorescent protein (GFP) and transient expression in tobacco protoplasts targeted GFP to the chloroplasts. In contrast to the deduced primary structure of the biotin carboxylase domain encoded by the class I gene, the corresponding amino acid sequence of the class II ACCase shows higher identity with that of the Arabidopsis ACCase, both lacking a transit peptide. The Arabidopsis ACCase has been proposed to be a cytosolic isoenzyme. These observations indicate that the two classes of ACCase genes encode plastidic and cytosolic isoforms of multi-functional, eukaryotic type, respectively, and that B. napus contains at least one multi-functional ACCase besides the multi-subunit, prokaryotic type located in plastids. Southern blot analysis of genomic DNA from B. napus, Brassica rapa, and Brassica oleracea, the ancestors of amphidiploid rapeseed, using a fragment of a multi-functional ACCase gene as a probe revealed that ACCase is encoded by a multi-gene family of at least five members.

  1. RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity

    PubMed Central

    2013-01-01

    A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution. PMID:23984183

  2. Genomic organization of the neurofibromatosis 1 gene (NF1)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.

    Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less

  3. APASdb: a database describing alternative poly(A) sites and selection of heterogeneous cleavage sites downstream of poly(A) signals

    PubMed Central

    You, Leiming; Wu, Jiexin; Feng, Yuchao; Fu, Yonggui; Guo, Yanan; Long, Liyuan; Zhang, Hui; Luan, Yijie; Tian, Peng; Chen, Liangfu; Huang, Guangrui; Huang, Shengfeng; Li, Yuxin; Li, Jie; Chen, Chengyong; Zhang, Yaqing; Chen, Shangwu; Xu, Anlong

    2015-01-01

    Increasing amounts of genes have been shown to utilize alternative polyadenylation (APA) 3′-processing sites depending on the cell and tissue type and/or physiological and pathological conditions at the time of processing, and the construction of genome-wide database regarding APA is urgently needed for better understanding poly(A) site selection and APA-directed gene expression regulation for a given biology. Here we present a web-accessible database, named APASdb (http://mosas.sysu.edu.cn/utr), which can visualize the precise map and usage quantification of different APA isoforms for all genes. The datasets are deeply profiled by the sequencing alternative polyadenylation sites (SAPAS) method capable of high-throughput sequencing 3′-ends of polyadenylated transcripts. Thus, APASdb details all the heterogeneous cleavage sites downstream of poly(A) signals, and maintains near complete coverage for APA sites, much better than the previous databases using conventional methods. Furthermore, APASdb provides the quantification of a given APA variant among transcripts with different APA sites by computing their corresponding normalized-reads, making our database more useful. In addition, APASdb supports URL-based retrieval, browsing and display of exon-intron structure, poly(A) signals, poly(A) sites location and usage reads, and 3′-untranslated regions (3′-UTRs). Currently, APASdb involves APA in various biological processes and diseases in human, mouse and zebrafish. PMID:25378337

  4. hnRNP L binds to CA repeats in the 3'UTR of bcl-2 mRNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lee, Dong-Hyoung; Lim, Mi-Hyun; Youn, Dong-Ye

    We previously reported that the CA-repeat sequence in the 3'-untranslated region (3'UTR) of bcl-2 mRNA is involved in the decay of bcl-2 mRNA. However, the trans-acting factor for the CA element in bcl-2 mRNA remains unidentified. The heterogeneous nuclear ribonucleoprotein L (hnRNP L), an intron splicing factor, has been reported to bind to CA repeats and CA clusters in the 3'UTR of several genes. We reported herein that the CA repeats of bcl-2 mRNA have the potential to form a distinct ribonuclear protein complex in cytoplasmic extracts of MCF-7 cells, as evidenced by RNA electrophoretic mobility shift assays (REMSA). Amore » super-shift assay using the hnRNP L antibody completely shifted the complex. Immunoprecipitation with the hnRNP L antibody and MCF-7 cells followed by RT-PCR revealed that hnRNP L interacts with endogenous bcl-2 mRNA in vivo. Furthermore, the suppression of hnRNP L in MCF-7 cells by the transfection of siRNA for hnRNP L resulted in a delay in the degradation of RNA transcripts including CA repeats of bcl-2 mRNA in vitro, suggesting that the interaction between hnRNPL and CA repeats of bcl-2 mRNA participates in destabilizing bcl-2 mRNA.« less

  5. Common FABP4 genetic variants and plasma levels of fatty acid binding protein 4 in older adults.

    PubMed

    Mukamal, Kenneth J; Wilk, Jemma B; Biggs, Mary L; Jensen, Majken K; Ix, Joachim H; Kizer, Jorge R; Tracy, Russell P; Zieman, Susan J; Mozaffarian, Dariush; Psaty, Bruce M; Siscovick, David S; Djoussé, Luc

    2013-11-01

    We examined common variants in the fatty acid binding protein 4 gene (FABP4) and plasma levels of FABP4 in adults aged 65 and older from the Cardiovascular Health Study. We genotyped rs16909187, rs1054135, rs16909192, rs10808846, rs7018409, rs2290201, and rs6992708 and measured circulating FABP4 levels among 3190 European Americans and 660 African Americans. Among European Americans, the minor alleles of six single nucleotide polymorphisms (SNP) were associated with lower FABP4 levels (all p ≤ 0.01). Among African Americans, the SNP with the lowest minor allele frequency was associated with lower FABP4 levels (p = 0.015). The C-A haplotype of rs16909192 and rs2290201 was associated with lower FABP4 levels in both European Americans (frequency = 16 %; p = 0.001) and African Americans (frequency = 8 %; p = 0.04). The haplotype combined a SNP in the first intron with one in the 3'untranslated region. However, the alleles associated with lower FABP4 levels were associated with higher fasting glucose in meta-analyses from the MAGIC consortium. These results demonstrate associations of common SNP and haplotypes in the FABP4 gene with lower plasma FABP4 but higher fasting glucose levels.

  6. Branchpoint selection in the splicing of U12-dependent introns in vitro.

    PubMed

    McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

    2002-05-01

    In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.

  7. Branchpoint selection in the splicing of U12-dependent introns in vitro.

    PubMed Central

    McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A

    2002-01-01

    In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225

  8. A contracted DNA repeat in LHX3 intron 5 is associated with aberrant splicing and pituitary dwarfism in German shepherd dogs.

    PubMed

    Voorbij, Annemarie M W Y; van Steenbeek, Frank G; Vos-Loohuis, Manon; Martens, Ellen E C P; Hanson-Nilsson, Jeanette M; van Oost, Bernard A; Kooistra, Hans S; Leegwater, Peter A

    2011-01-01

    Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism.

  9. A Contracted DNA Repeat in LHX3 Intron 5 Is Associated with Aberrant Splicing and Pituitary Dwarfism in German Shepherd Dogs

    PubMed Central

    Voorbij, Annemarie M. W. Y.; van Steenbeek, Frank G.; Vos-Loohuis, Manon; Martens, Ellen E. C. P.; Hanson-Nilsson, Jeanette M.; van Oost, Bernard A.; Kooistra, Hans S.; Leegwater, Peter A.

    2011-01-01

    Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism. PMID:22132174

  10. Role of transcription factor Sp1 and RNA binding protein HuR in the down-regulation of Dr+ Escherichia coli receptor protein Decay Accelerating Factor (DAF or CD55) by Nitric oxide

    PubMed Central

    Banadakoppa, Manu; Liebenthal, Daniel; Nowak, David E; Urvil, Petri; Yallampalli, Uma; Wilson, Gerald M; Kishor, Aparna; Yallampalli, Chandra

    2012-01-01

    We previously reported that nitric oxide (NO) reduces the rate of bacteremia and maternal mortality in pregnant rats with uterine infection by Escherichia coli expressing the Dr Fimbria (Dr+). The epithelial invasion of Dr+ E. coli is dependent on the expression level of its cellular receptor decay accelerating factor (DAF). NO reduces the rate of bacteremia by down-regulating the expression of DAF. In this study, we elucidated the role of transcription factor Sp1 and RNA binding protein HuR in the down-regulation of human DAF by NO. We generated a series of deletion mutant constructs of DAF gene 5′-untranslated region and mapped NO-response region upstream to the core promoter region of the DAF gene. One of the several Sp1 binding sites in the DAF 5′-untranslated region was located within the NO-response region. The binding of Sp1 to this site was inhibited by NO. Furthermore, NO also promoted the degradation of DAF mRNA. The 3′-untranslated region of DAF harbors an AU-rich element and this element destabilized the mRNA transcript. The NO promoted the rapid degradation of DAF mRNA by inhibiting the binding of mRNA stabilizing protein HuR to this AU-rich region. The inhibition of binding of HuR to AU-rich region was due to the S-nitrosylation of one or more cysteine residues by NO. Thus, these data reveal the molecular mediators of transcriptional and post-transcriptional regulation of DAF by NO with implications in pathophysiology related to DAF. PMID:23176121

  11. Role of transcription factor Sp1 and RNA binding protein HuR in the downregulation of Dr+ Escherichia coli receptor protein decay accelerating factor (DAF or CD55) by nitric oxide.

    PubMed

    Banadakoppa, Manu; Liebenthal, Daniel; Nowak, David E; Urvil, Petri; Yallampalli, Uma; Wilson, Gerald M; Kishor, Aparna; Yallampalli, Chandra

    2013-02-01

    We previously reported that nitric oxide (NO) reduces the rate of bacteremia and maternal mortality in pregnant rats with uterine infection by Escherichia coli expressing the Dr Fimbria (Dr(+) ). The epithelial invasion of Dr(+) E. coli is dependent on the expression level of its cellular receptor decay accelerating factor (DAF). NO reduces the rate of bacteremia by downregulating the expression of DAF. In this study, we elucidated the role of transcription factor Sp1 and RNA binding protein HuR in the downregulation of human DAF by NO. We generated a series of deletion mutant constructs of DAF gene 5'-untranslated region and mapped the NO-response region upstream to the core promoter region of the DAF gene. One of the several Sp1 binding sites in the DAF 5'-untranslated region was located within the NO-response region. The binding of Sp1 to this site was inhibited by NO. Furthermore, NO also promoted the degradation of DAF mRNA. The 3'-untranslated region of DAF harbors an AU-rich element and this element destabilized the mRNA transcript. NO promoted the rapid degradation of DAF mRNA by inhibiting the binding of mRNA stabilizing protein HuR to this AU-rich region. The inhibition of binding of HuR to the AU-rich region was due to the S-nitrosylation of one or more cysteine residues by NO. Thus, these data reveal the molecular mediators of transcriptional and post-transcriptional regulation of DAF by NO with implications in pathophysiology related to DAF. © 2012 The Authors Journal compilation © 2012 FEBS.

  12. A novel transcript of cyclin-dependent kinase-like 5 (CDKL5) has an alternative C-terminus and is the predominant transcript in brain.

    PubMed

    Williamson, Sarah L; Giudici, Laura; Kilstrup-Nielsen, Charlotte; Gold, Wendy; Pelka, Gregory J; Tam, Patrick P L; Grimm, Andrew; Prodi, Dionigio; Landsberger, Nicoletta; Christodoulou, John

    2012-02-01

    The X-linked cyclin-dependent kinase-like 5 (CDKL5) gene is an important molecular determinant of early-onset intractable seizures with infantile spasms and Rett syndrome-like phenotype. The gene encodes a kinase that may influence components of molecular pathways associated with MeCP2. In humans there are two previously reported splice variants that differ in the 5' untranslated exons and produce the same 115 kDa protein. Furthermore, very recently, a novel transcript including a novel exon (16b) has been described. By aligning both the human and mouse CDKL5 proteins to the orthologs of other species, we identified a theoretical 107 kDa isoform with an alternative C-terminus that terminates in intron 18. In human brain and all other tissues investigated except the testis, this novel isoform is the major CDKL5 transcript. The detailed characterisation of this novel isoform of CDKL5 reveals functional and subcellular localisation attributes that overlap greatly, but not completely, with that of the previously studied human CDKL5 protein. Considering its predominant expression in the human and mouse brain, we believe that this novel isoform is likely to be of primary pathogenic importance in human diseases associated with CDKL5 deficiency, and suggest that screening of the related intronic sequence should be included in the molecular genetic analyses of patients with a suggestive clinical phenotype.

  13. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    PubMed Central

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369

  14. Characterization of the intronic portion of cadherin superfamily members, common cancer orchestrators

    PubMed Central

    Oliveira, Patrícia; Sanges, Remo; Huntsman, David; Stupka, Elia; Oliveira, Carla

    2012-01-01

    Cadherins are cell–cell adhesion proteins essential for the maintenance of tissue architecture and integrity, and their impairment is often associated with human cancer. Knowledge regarding regulatory mechanisms associated with cadherin misexpression in cancer is scarce. Specific features of the intronic-structure and intronic-based regulatory mechanisms in the cadherin superfamily are unidentified. This study aims at systematically characterizing the intronic portion of cadherin superfamily members and the identification of intronic regions constituting putative targets/triggers of regulation, using a bioinformatic approach and biological data mining. Our study demonstrates that the cadherin superfamily genes harbour specific characteristics in comparison to all non-cadherin genes, both from the genomic and transcriptional standpoints. Cadherin superfamily genes display higher average total intron number and significantly longer introns than other genes and across the entire vertebrate lineage. Moreover, in the human genome, we observed an uncommon high frequency of MIR (mammalian-wide interspersed repeats) and MaLR (mammalian-wide interspersed repeats, a subtype of LTR) regulatory-associated repetitive elements at 5′-located introns, concomitantly with increased de novo intronic transcription. Using this approach, we identified cadherin intronic-specific sites that may constitute novel targets/triggers of cadherin superfamily expression regulation. These findings pinpoint the need to identify mechanisms affecting particularly MIR and MaLR elements located in introns 2 and 3 of human cadherin genes, possibly important in the expression modulation of this superfamily in homeostasis and cancer. PMID:22317972

  15. Alternative Polyadenylation of mRNAs: 3′-Untranslated Region Matters in Gene Expression

    PubMed Central

    Yeh, Hsin-Sung; Yong, Jeongsik

    2016-01-01

    Almost all of eukaryotic mRNAs are subjected to polyadenylation during mRNA processing. Recent discoveries showed that many of these mRNAs contain more than one polyadenylation sites in their 3′ untranslated regions (UTR) and that alternative polyadenylation (APA) is prevalent among these genes. Many biological processes such as differentiation, proliferation, and tumorigenesis have been correlated to global APA events in the 3′ UTR of mRNAs, suggesting that these APA events are tightly regulated and may play important physiological roles. In this review, recent discoveries in the physiological roles of APA events, as well as the known and proposed mechanisms are summarized. Perspective for future directions is also discussed. PMID:26912084

  16. Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.

    PubMed

    Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia

    2009-06-16

    A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode Initiative. The large scale investigation of Ascomycota mitochondrial introns performed through this tool, allowing to exclude the introns-rich sequences from the barcode candidates exploration, could be the first step towards a mitochondrial barcoding strategy for these organisms, similar to the standard approach employed in metazoans.

  17. Rare intronic variants of TCF7L2 arising by selective sweeps in an indigenous population from Mexico.

    PubMed

    Acosta, Jose Luis; Hernández-Mondragón, Alma Cristal; Correa-Acosta, Laura Carolina; Cazañas-Padilla, Sandra Nathaly; Chávez-Florencio, Berenice; Ramírez-Vega, Elvia Yamilet; Monge-Cázares, Tulia; Aguilar-Salinas, Carlos A; Tusié-Luna, Teresa; Del Bosque-Plata, Laura

    2016-05-26

    Genetic variations of the TCF7L2 gene are associated with the development of Type 2 diabetes (T2D). The associated mutations have demonstrated an adaptive role in some human populations, but no studies have determined the impact of evolutionary forces on genetic diversity in indigenous populations from Mexico. Here, we sequenced and analyzed the variation of the TCF7L2 gene in three Amerindian populations and compared the results with whole-exon-sequencing of Mestizo populations from Sigma and the 1000 Genomes Project to assess the roles of selection and recombination in diversity. The diversity in the indigenous populations was biased to intronic regions. Most of the variation was low frequency. Only mutations rs77961654 and rs61724286 were located on exon 15. We did not observe variation in intronic region 4-6 in any of the three indigenous populations. In addition, we identified peaks of selective sweeps in the mestizo samples from the Sigma Project within this region. By replicating the analysis of association with T2D between case-controls from the Sigma Project, we determined that T2D was most highly associated with the rs7903146 risk allele and to a lesser extent with the other six variants. All associated markers were located in intronic region 4-6, and their r(2) values of linkage disequilibrium were significantly higher in the Mexican population than in Africans from the 1000 Genomes Project. We observed reticulations in both the haplotypes network analysis from seven marker associates and the neighborNet tree based on 6061 markers in the TCF7L2 gene identified from all samples of the 1000 Genomes Project. Finally, we identified two recombination hotspots in the upstream region and 3' end of the TCF7L2 gene. The lack of diversity in intronic region 4-6 in Indigenous populations could be an effect of selective sweeps generated by the selection of neighboring rare variants at T2D-associated mutations. The survivors' variants make the intronic region 4-6 the area of the greatest population differentiation within the TCF7L2 gene. The abundance of selective peak sweeps in the downstream region of the TCF7L2 gene suggests that the TCF7L2 gene is part of a region that is in constant recombination between populations.

  18. Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

    PubMed

    Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

    1997-05-01

    The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.

  19. X-linked hypophosphatemia attributable to pseudoexons of the PHEX gene.

    PubMed

    Christie, P T; Harding, B; Nesbit, M A; Whyte, M P; Thakker, R V

    2001-08-01

    X-linked hypophosphatemia is commonly caused by mutations of the coding region of PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). However, such PHEX mutations are not detected in approximately one third of X-linked hypophosphatemia patients who may harbor defects in the noncoding or intronic regions. We have therefore investigated 11 unrelated X-linked hypophosphatemia patients in whom coding region mutations had been excluded, for intronic mutations that may lead to mRNA splicing abnormalities, by the use of lymphoblastoid RNA and RT-PCRs. One X-linked hypophosphatemia patient was found to have 3 abnormally large transcripts, resulting from 51-bp, 100-bp, and 170-bp insertions, all of which would lead to missense peptides and premature termination codons. The origin of these transcripts was a mutation (g to t) at position +1268 of intron 7, which resulted in the occurrence of a high quality novel donor splice site (ggaagg to gtaagg). Splicing between this novel donor splice site and 3 preexisting, but normally silent, acceptor splice sites within intron 7 resulted in the occurrences of the 3 pseudoexons. This represents the first report of PHEX pseudoexons and reveals further the diversity of genetic abnormalities causing X-linked hypophosphatemia.

  20. Sequence and Expression Analysis of Interferon Regulatory Factor 10 (IRF10) in Three Diverse Teleost Fish Reveals Its Role in Antiviral Defense.

    PubMed

    Xu, Qiaoqing; Jiang, Yousheng; Wangkahart, Eakapol; Zou, Jun; Chang, Mingxian; Yang, Daiqin; Secombes, Chris J; Nie, Pin; Wang, Tiehui

    2016-01-01

    Interferon regulatory factor (IRF) 10 was first found in birds and is present in the genome of other tetrapods (but not humans and mice), as well as in teleost fish. The functional role of IRF10 in vertebrate immunity is relatively unknown compared to IRF1-9. The target of this research was to clone and characterize the IRF10 genes in three economically important fish species that will facilitate future evaluation of this molecule in fish innate and adaptive immunity. In the present study, a single IRF10 gene was cloned in grass carp Ctenopharyngodon idella and Asian swamp eel Monopterus albus, and two, named IRF10a and IRF10b, in rainbow trout Oncorhynchus mykiss. The fish IRF10 molecules share highest identities to other vertebrate IRF10s, and have a well conserved DNA binding domain, IRF-associated domain, and an 8 exon/7 intron structure with conserved intron phase. The presence of an upstream ATG or open reading frame (ORF) in the 5'-untranslated region of different fish IRF10 cDNA sequences suggests potential regulation at the translational level, and this has been verified by in vitro transcription/translation experiments of the trout IRF10a cDNA, but would still need to be validated in fish cells. Both trout IRF10 paralogues are highly expressed in thymus, blood and spleen but are relatively low in head kidney and caudal kidney. Trout IRF10b expression is significantly higher than IRF10a in integumentary tissues i.e. gills, scales, skin, intestine, adipose fin and tail fins, suggesting that IRF10b may be more important in mucosal immunity. The expression of both trout IRF10 paralogues is up-regulated by recombinant IFN-γ. The expression of the IRF10 genes is highly induced by Poly I:C in vitro and in vivo, and by viral infection, but is less responsive to peptidoglycan and bacterial infection, suggesting an important role of fish IRF10 in antiviral defense.

  1. CYP3A5 mRNA degradation by nonsense-mediated mRNA decay.

    PubMed

    Busi, Florent; Cresteil, Thierry

    2005-09-01

    The total CYP3A5 mRNA level is significantly greater in carriers of the CYP3A5*1 allele than in CYP3A5*3 homozygotes. Most of the CYP3A5*3 mRNA includes an intronic sequence (exon 3B) containing premature termination codons (PTCs) between exons 3 and 4. Two models were used to investigate the degradation of CYP3A5 mRNA: a CYP3A5 minigene consisting of CYP3A5 exons and introns 3 to 6 transfected into MCF7 cells, and the endogenous CYP3A5 gene expressed in HepG2 cells. The 3'-untranslated region g.31611C>T mutation has no effect on CYP3A5 mRNA decay. Splice variants containing exon 3B were more unstable than wild-type (wt) CYP3A5 mRNA. Cycloheximide prevents the recognition of PTCs by ribosomes: in transfected MCF7 and HepG2 cells, cycloheximide slowed down the degradation of exon 3B-containing splice variants, suggesting the participation of nonsense-mediated decay (NMD). When PTCs were removed from pseudoexon 3B or when UPF1 small interfering RNA was used to impair the NMD mechanism, the decay of the splice variant was reduced, confirming the involvement of NMD in the degradation of CYP3A5 splice variants. Induction could represent a source of variability for CYP3A5 expression and could modify the proportion of splice variants. The extent of CYP3A5 induction was investigated after exposure to barbiturates or steroids: CYP3A4 was markedly induced in a pediatric population compared with untreated neonates. However, no effect could be detected in either the total CYP3A5 RNA, the proportion of splice variant RNA, or the protein level. Therefore, in these carriers, induction is unlikely to switch on the phenotypic CYP3A5 expression in carriers of CYP3A5*3/*3.

  2. Two distinct promoters drive transcription of the human D1A dopamine receptor gene.

    PubMed

    Lee, S H; Minowa, M T; Mouradian, M M

    1996-10-11

    The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.

  3. A 5′ Noncoding Exon Containing Engineered Intron Enhances Transgene Expression from Recombinant AAV Vectors in vivo

    PubMed Central

    Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.

    2017-01-01

    We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072

  4. Genetic stability of Ross River virus during epidemic spread in nonimmune humans.

    PubMed

    Burness, A T; Pardoe, I; Faragher, S G; Vrati, S; Dalgarno, L

    1988-12-01

    We have examined the rate of evolution of Ross River virus, a mosquito-borne RNA virus, during epidemic spread through tens of thousands of nonimmune humans over a period of 10 months. Two regions of the Ross River virus genome were sequenced: the E2 gene (1.2 kb in length), which encodes the major neutralization determinant of the virus, and 0.4 kb of the 3'-untranslated region. In the E2 gene, a single nucleotide change was selected which led to a predicted amino acid change at residue 219. No changes were selected in the 3'-untranslated region. By comparison with rates of evolution reported for non-arthropod-borne RNA viruses, the rate for Ross River virus is surprisingly low. We identify three features of the Ross River virus replication and transmission cycle which may limit the rate of evolution of arthropod-borne viruses in the field.

  5. Detection and genotyping of bovine diarrhea virus by reverse transcription-polymerase chain amplification of the 5' untranslated region.

    PubMed

    Letellier, C; Kerkhofs, P; Wellemans, G; Vanopdenbosch, E

    1999-01-01

    A reverse-transcription polymerase chain reaction (RT-PCR) was developed to differentiate the bovine diarrhea virus (BVDV) from other pestiviruses, and to determine the genotype of the BVDV isolates. For this purpose, primer pairs were selected in the 5' untranslated region (5'UTR). The primers BE and B2 were located in highly conserved regions and were pestivirus-specific. Two primer pairs named B3B4 and B5B6 were specific of BVDV genotypes I and II, respectively. With this technique, an amplification product of the expected size was obtained with either the B3B4 or the B5B6 primer pairs for the 107 BVDV isolates tested but not for BDV or CSFV. For some isolates that were grouped in the genotype II, sequence analysis of the PCR fragments confirmed their classification into this genotype.

  6. Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2010-01-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  7. a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2009-02-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  8. Characterization of Toll-like receptor 3 gene in large yellow croaker, Pseudosciaena crocea.

    PubMed

    Huang, Xue-Na; Wang, Zhi-Yong; Yao, Cui-Luan

    2011-07-01

    Toll-like receptor 3 (TLR3) plays an important role in innate immune responses. In this report, the full-length cDNA sequence and genomic structure of Pseudosciaena crocea TLR3 (PcTLR3) were identified and characterized. The full-length cDNA of PcTLR3 was of 3384 bp, including a 5'-terminal untranslated region (UTR) of 65 bp, a 3'-terminal UTR of 589 bp and an open reading frame (ORF) of 2730 bp encoding a polypeptide of 909 amino acid residues. The full-length genome sequence of PcTLR3 was composed of 5721 nucleotides, including five exons and four introns. The putative PcTLR3 protein contained a signal peptide sequence, 16 leucine-rich repeat (LRR) motifs, a transmembrane region and a Toll/interleukin-1 receptor (TIR) domain. Quantitative real-time reverse transcription PCR analysis revealed a broad expression of PcTLR3 in most tissues, with the predominant expression in liver, then intestine, and the weakest expression in blood cells. The expression of PcTLR3 after injection with poly inosinic:cytidylic (I:C) and Vibrio parahemolyticus was tested in spleen, blood cells and liver. The results indicated that PcTLR3 transcripts could be induced in the three tissues by injection with poly I:C. The highest expression was in the blood cells with 43.5 times (at 6h) greater expression than in the control (p<0.05). In addition, after V. parahemolyticus challenge, a moderate up-regulation and down-regulation of PcTLR3 was found in blood cells and liver, respectively. Our results suggested that PcTLR3 might play an important role in fish's defense against both viral and bacterial infection. Copyright © 2011 Elsevier Ltd. All rights reserved.

  9. Genotype and phenotype relationships in 10 Pakistani unrelated patients with inherited factor VII deficiency.

    PubMed

    Borhany, M; Boijout, H; Pellequer, J-L; Shamsi, T; Moulis, G; Aguilar-Martinez, P; Schved, J-F; Giansily-Blaizot, M

    2013-11-01

    Inherited factor VII (FVII) deficiency is one of the commonest rare bleeding disorders. It is characterized by a wide molecular and clinical heterogeneity and an autosomal recessive pattern of inheritance. Factor VII-deficient patients are still scarcely explored in Pakistan although rare bleeding disorders became quite common as a result of traditional consanguineous marriages. The aim of the study was to give a first insight of F7 gene mutations in Pakistani population. Ten unrelated FVII-deficient patients living in Pakistan were investigated (median FVII:C = 2%; range = 2-37%). A clinical questionnaire was filled out for each patient and direct sequencing was performed on the coding regions, intron/exon boundaries and 5' and 3' untranslated regions of the F7 gene. Nine different mutations (eight missense mutations and one located within the F7 promoter) were identified on the F7 gene. Five of them were novel (p.Cys82Tyr, p.Cys322Ser, p.Leu357Phe, p.Thr410Ala, c-57C>T, the last being predicted to alter the binding site of transcription factor HNF-4). Half of the patients had single mutations in Cys residues involved in disulfide bridges. The p.Cys82Arg mutation was the most frequent in our series. Six of seven patients with FVII:C levels below 10% were homozygous in connection with the high percentage of consanguinity in our series. In addition, we graded the 10 patients according to three previously published classifications for rare bleeding disorders. The use of the bleeding score proposed by Tosetto and co-workers in 2006 appears to well qualify the bleeding tendency in our series. © 2013 John Wiley & Sons Ltd.

  10. Combinatory RNA-Sequencing Analyses Reveal a Dual Mode of Gene Regulation by ADAR1 in Gastric Cancer.

    PubMed

    Cho, Charles J; Jung, Jaeeun; Jiang, Lushang; Lee, Eun Ji; Kim, Dae-Soo; Kim, Byung Sik; Kim, Hee Sung; Jung, Hwoon-Yong; Song, Ho-June; Hwang, Sung Wook; Park, Yangsoon; Jung, Min Kyo; Pack, Chan Gi; Myung, Seung-Jae; Chang, Suhwan

    2018-04-25

    Adenosine deaminase acting on RNA 1 (ADAR1) is known to mediate deamination of adenosine-to-inosine through binding to double-stranded RNA, the phenomenon known as RNA editing. Currently, the function of ADAR1 in gastric cancer is unclear. This study was aimed at investigating RNA editing-dependent and editing-independent functions of ADAR1 in gastric cancer, especially focusing on its influence on editing of 3' untranslated regions (UTRs) and subsequent changes in expression of messenger RNAs (mRNAs) as well as microRNAs (miRNAs). RNA-sequencing and small RNA-sequencing were performed on AGS and MKN-45 cells with a stable ADAR1 knockdown. Changed frequencies of editing and mRNA and miRNA expression were then identified by bioinformatic analyses. Targets of RNA editing were further validated in patients' samples. In the Alu region of both gastric cell lines, editing was most commonly of the A-to-I type in 3'-UTR or intron. mRNA and protein levels of PHACTR4 increased in ADAR1 knockdown cells, because of the loss of seed sequences in 3'-UTR of PHACTR4 mRNA that are required for miRNA-196a-3p binding. Immunohistochemical analyses of tumor and paired normal samples from 16 gastric cancer patients showed that ADAR1 expression was higher in tumors than in normal tissues and inversely correlated with PHACTR4 staining. On the other hand, decreased miRNA-148a-3p expression in ADAR1 knockdown cells led to increased mRNA and protein expression of NFYA, demonstrating ADAR1's editing-independent function. ADAR1 regulates post-transcriptional gene expression in gastric cancer through both RNA editing-dependent and editing-independent mechanisms.

  11. Distinct patterns of alteration of myc genes associated with integration of human papillomavirus type 16 or type 45 DNA in two genital tumours.

    PubMed

    Sastre-Garau, X; Favre, M; Couturier, J; Orth, G

    2000-08-01

    We previously described two genital carcinomas (IC2, IC4) containing human papillomavirus type 16 (HPV-16)- or HPV-18-related sequences integrated in chromosomal bands containing the c-myc (8q24) or N-myc (2p24) gene, respectively. The c-myc gene was rearranged and amplified in IC2 cells without evidence of overexpression. The N-myc gene was amplified and highly transcribed in IC4 cells. Here, the sequence of an 8039 bp IC4 DNA fragment containing the integrated viral sequences and the cellular junctions is reported. A 3948 bp segment of the genome of HPV-45 encompassing the upstream regulatory region and the E6 and E7 ORFs was integrated into the untranslated part of N-myc exon 3, upstream of the N-myc polyadenylation signal. Both N-myc and HPV-45 sequences were amplified 10- to 20-fold. The 3' ends of the major N-myc transcript were mapped upstream of the 5' junction. A minor N-myc/HPV-45 fusion transcript was also identified, as well as two abundant transcripts from the HPV-45 E6-E7 region. Large amounts of N-myc protein were detected in IC4 cells. A major alteration of c-myc sequences in IC2 cells involved the insertion of a non-coding sequence into the second intron and their co-amplification with the third exon, without any evidence for the integration of HPV-16 sequences within or close to the gene. Different patterns of myc gene alterations may thus be associated with integration of HPV DNA in genital tumours, including the activation of the protooncogene via a mechanism of insertional mutagenesis and/or gene amplification.

  12. Antisense Masking of an hnRNP A1/A2 Intronic Splicing Silencer Corrects SMN2 Splicing in Transgenic Mice

    PubMed Central

    Hua, Yimin; Vickers, Timothy A.; Okunola, Hazeem L.; Bennett, C. Frank; Krainer, Adrian R.

    2008-01-01

    survival of motor neuron 2, centromeric (SMN2) is a gene that modifies the severity of spinal muscular atrophy (SMA), a motor-neuron disease that is the leading genetic cause of infant mortality. Increasing inclusion of SMN2 exon 7, which is predominantly skipped, holds promise to treat or possibly cure SMA; one practical strategy is the disruption of splicing silencers that impair exon 7 recognition. By using an antisense oligonucleotide (ASO)-tiling method, we systematically screened the proximal intronic regions flanking exon 7 and identified two intronic splicing silencers (ISSs): one in intron 6 and a recently described one in intron 7. We analyzed the intron 7 ISS by mutagenesis, coupled with splicing assays, RNA-affinity chromatography, and protein overexpression, and found two tandem hnRNP A1/A2 motifs within the ISS that are responsible for its inhibitory character. Mutations in these two motifs, or ASOs that block them, promote very efficient exon 7 inclusion. We screened 31 ASOs in this region and selected two optimal ones to test in human SMN2 transgenic mice. Both ASOs strongly increased hSMN2 exon 7 inclusion in the liver and kidney of the transgenic animals. Our results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively. Most importantly, our results highlight the therapeutic potential of some of these ASOs in the context of SMA. PMID:18371932

  13. Genomic structure of two ras family genes in the slime mold Physarum polycephalum.

    PubMed

    Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan

    2002-08-01

    Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.

  14. Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta

    PubMed Central

    McNeal, Joel R.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Leebens-Mack, Jim; dePamphilis, Claude W.

    2009-01-01

    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta. PMID:19543388

  15. Parallel loss of plastid introns and their maturase in the genus Cuscuta.

    PubMed

    McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W

    2009-06-19

    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.

  16. Effective suppression of dengue virus using a novel group-I intron that induces apoptotic cell death upon infection through conditional expression of the Bax C-terminal domain.

    PubMed

    Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J

    2014-06-13

    Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.

  17. Mitochondrial Group II Introns, Cytochrome c Oxidase, and Senescence in Podospora anserina†

    PubMed Central

    Begel, Odile; Boulay, Jocelyne; Albert, Beatrice; Dufour, Eric; Sainsard-Chanet, Annie

    1999-01-01

    Podospora anserina is a filamentous fungus with a limited life span. It expresses a degenerative syndrome called senescence, which is always associated with the accumulation of circular molecules (senDNAs) containing specific regions of the mitochondrial chromosome. A mobile group II intron (α) has been thought to play a prominent role in this syndrome. Intron α is the first intron of the cytochrome c oxidase subunit I gene (COX1). Mitochondrial mutants that escape the senescence process are missing this intron, as well as the first exon of the COX1 gene. We describe here the first mutant of P. anserina that has the α sequence precisely deleted and whose cytochrome c oxidase activity is identical to that of wild-type cells. The integration site of the intron is slightly modified, and this change prevents efficient homing of intron α. We show here that this mutant displays a senescence syndrome similar to that of the wild type and that its life span is increased about twofold. The introduction of a related group II intron into the mitochondrial genome of the mutant does not restore the wild-type life span. These data clearly demonstrate that intron α is not the specific senescence factor but rather an accelerator or amplifier of the senescence process. They emphasize the role that intron α plays in the instability of the mitochondrial chromosome and the link between this instability and longevity. Our results strongly support the idea that in Podospora, “immortality” can be acquired not by the absence of intron α but rather by the lack of active cytochrome c oxidase. PMID:10330149

  18. Dopamine Inactivation Efficacy Related to Functional DAT1 and COMT Variants Influences Motor Response Evaluation

    PubMed Central

    Bender, Stephan; Rellum, Thomas; Freitag, Christine; Resch, Franz; Rietschel, Marcella; Treutlein, Jens; Jennen-Steinmetz, Christine; Brandeis, Daniel; Banaschewski, Tobias; Laucht, Manfred

    2012-01-01

    Background Dopamine plays an important role in orienting, response anticipation and movement evaluation. Thus, we examined the influence of functional variants related to dopamine inactivation in the dopamine transporter (DAT1) and catechol-O-methyltransferase genes (COMT) on the time-course of motor processing in a contingent negative variation (CNV) task. Methods 64-channel EEG recordings were obtained from 195 healthy adolescents of a community-based sample during a continuous performance task (A-X version). Early and late CNV as well as motor postimperative negative variation were assessed. Adolescents were genotyped for the COMT Val158Met and two DAT1 polymorphisms (variable number tandem repeats in the 3′-untranslated region and in intron 8). Results The results revealed a significant interaction between COMT and DAT1, indicating that COMT exerted stronger effects on lateralized motor post-processing (centro-parietal motor postimperative negative variation) in homozygous carriers of a DAT1 haplotype increasing DAT1 expression. Source analysis showed that the time interval 500–1000 ms after the motor response was specifically affected in contrast to preceding movement anticipation and programming stages, which were not altered. Conclusions Motor slow negative waves allow the genomic imaging of dopamine inactivation effects on cortical motor post-processing during response evaluation. This is the first report to point towards epistatic effects in the motor system during response evaluation, i.e. during the post-processing of an already executed movement rather than during movement programming. PMID:22649558

  19. Organization and alternative splicing of the Caenorhabditis elegans cAMP-dependent protein kinase catalytic-subunit gene (kin-1).

    PubMed

    Tabish, M; Clegg, R A; Rees, H H; Fisher, M J

    1999-04-01

    The cAMP-dependent protein kinase (protein kinase A, PK-A) is multifunctional in nature, with key roles in the control of diverse aspects of eukaryotic cellular activity. In the case of the free-living nematode, Caenorhabditis elegans, a gene encoding the PK-A catalytic subunit has been identified and two isoforms of this subunit, arising from a C-terminal alternative-splicing event, have been characterized [Gross, Bagchi, Lu and Rubin (1990) J. Biol. Chem. 265, 6896-6907]. Here we report the occurrence of N-terminal alternative-splicing events that, in addition to generating a multiplicity of non-myristoylatable isoforms, also generate the myristoylated variant(s) of the catalytic subunit that we have recently characterized [Aspbury, Fisher, Rees and Clegg (1997) Biochem. Biophys. Res. Commun. 238, 523-527]. The gene spans more than 36 kb and is divided into a total of 13 exons. Each of the mature transcripts contains only 7 exons. In addition to the already characterized exon 1, the 5'-untranslated region and first intron actually contain 5 other exons, any one of which may be alternatively spliced on to exon 2 at the 5' end of the pre-mRNA. This N-terminal alternative splicing occurs in combination with either of the already characterized C-terminal alternative exons. Thus, C. elegans expresses at least 12 different isoforms of the catalytic subunit of PK-A. The significance of this unprecedented structural diversity in the family of PK-A catalytic subunits is discussed.

  20. A novel homozygous stop-codon mutation in human HFE responsible for nonsense-mediated mRNA decay.

    PubMed

    Padula, Maria Carmela; Martelli, Giuseppe; Larocca, Marilena; Rossano, Rocco; Olivieri, Attilio

    2014-09-01

    HFE-hemochromatosis (HH) is an autosomal disease characterized by excessive iron absorption. Homozygotes for H63D variant, and still less H63D heterozygotes, generally do not express HH phenotype. The data collected in our previous study in the province of Matera (Basilicata, Italy) underlined that some H63D carriers showed altered iron metabolism, without additional factors. In this study, we selected a cohort of 10/22 H63D carriers with severe biochemical iron overload (BIO). Additional analysis was performed for studying HFE exons, exon-intron boundaries, and untranslated regions (UTRs) by performing DNA extraction, PCR amplification and sequencing. The results showed a novel substitution (NM_000410.3:c.847C>T) in a patient exon 4 (GenBankJQ478433); it introduces a premature stop-codon (PTC). RNA extraction and reverse-transcription were also performed. Quantitative real-time PCR was carried out for verifying if our aberrant mRNA is targeted for nonsense-mediated mRNA decay (NMD); we observed that patient HFE mRNA was expressed much less than calibrator, suggesting that the mutated HFE protein cannot play its role in iron metabolism regulation, resulting in proband BIO. Our finding is the first evidence of a variation responsible for a PTC in iron cycle genes. The genotype-phenotype correlation observed in our cases could be related to the additional mutation. Copyright © 2014 Elsevier Inc. All rights reserved.

  1. Patterns of population differentiation of candidate genes for cardiovascular disease.

    PubMed

    Kullo, Iftikhar J; Ding, Keyue

    2007-07-12

    The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. We calculated FST for 15,559 common SNPs (minor allele frequency > or = 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.

  2. Do PTK2 gene polymorphisms contribute to the interindividual variability in muscle strength and the response to resistance training? A preliminary report.

    PubMed

    Erskine, Robert M; Williams, Alun G; Jones, David A; Stewart, Claire E; Degens, Hans

    2012-04-01

    The protein tyrosine kinase-2 (PTK2) gene encodes focal adhesion kinase, a structural protein involved in lateral transmission of muscle fiber force. We investigated whether single-nucleotide polymorphisms (SNPs) of the PTK2 gene were associated with various indexes of human skeletal muscle strength and the interindividual variability in the strength responses to resistance training. We determined unilateral knee extension single repetition maximum (1-RM), maximum isometric voluntary contraction (MVC) knee joint torque, and quadriceps femoris muscle specific force (maximum force per unit physiological cross-sectional area) before and after 9 wk of knee extension resistance training in 51 untrained young men. All participants were genotyped for the PTK2 intronic rs7843014 A/C and 3'-untranslated region (UTR) rs7460 A/T SNPs. There were no genotype associations with baseline measures or posttraining changes in 1-RM or MVC. Although the training-induced increase in specific force was similar for all PTK2 genotypes, baseline specific force was higher in PTK2 rs7843014 AA and rs7460 TT homozygotes than in the respective rs7843014 C- (P = 0.016) and rs7460 A-allele (P = 0.009) carriers. These associations between muscle specific force and PTK2 SNPs suggest that interindividual differences exist in the way force is transmitted from the muscle fibers to the tendon. Therefore, our results demonstrate for the first time the impact of genetic variation on the intrinsic strength of human skeletal muscle.

  3. Analysing grouping of nucleotides in DNA sequences using lumped processes constructed from Markov chains.

    PubMed

    Guédon, Yann; d'Aubenton-Carafa, Yves; Thermes, Claude

    2006-03-01

    The most commonly used models for analysing local dependencies in DNA sequences are (high-order) Markov chains. Incorporating knowledge relative to the possible grouping of the nucleotides enables to define dedicated sub-classes of Markov chains. The problem of formulating lumpability hypotheses for a Markov chain is therefore addressed. In the classical approach to lumpability, this problem can be formulated as the determination of an appropriate state space (smaller than the original state space) such that the lumped chain defined on this state space retains the Markov property. We propose a different perspective on lumpability where the state space is fixed and the partitioning of this state space is represented by a one-to-many probabilistic function within a two-level stochastic process. Three nested classes of lumped processes can be defined in this way as sub-classes of first-order Markov chains. These lumped processes enable parsimonious reparameterizations of Markov chains that help to reveal relevant partitions of the state space. Characterizations of the lumped processes on the original transition probability matrix are derived. Different model selection methods relying either on hypothesis testing or on penalized log-likelihood criteria are presented as well as extensions to lumped processes constructed from high-order Markov chains. The relevance of the proposed approach to lumpability is illustrated by the analysis of DNA sequences. In particular, the use of lumped processes enables to highlight differences between intronic sequences and gene untranslated region sequences.

  4. Introduction of a novel 18S rDNA gene arrangement along with distinct ITS region in the saline water microalga Dunaliella

    PubMed Central

    2010-01-01

    Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865

  5. cisprimertool: software to implement a comparative genomics strategy for the development of conserved intron scanning (CIS) markers.

    PubMed

    Jayashree, B; Jagadeesh, V T; Hoisington, D

    2008-05-01

    The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.

  6. Mollusk genes encoding lysine tRNA (UUU) contain introns.

    PubMed

    Matsuo, M; Abe, Y; Saruta, Y; Okada, N

    1995-11-20

    New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.

  7. Comparative analyses of simple sequence repeats (SSRs) in 23 mosquito species genomes: Identification, characterization and distribution (Diptera: Culicidae).

    PubMed

    Wang, Xiao-Ting; Zhang, Yu-Juan; Qiao, Liang; Chen, Bin

    2018-02-27

    Simple sequence repeats (SSRs) exist in both eukaryotic and prokaryotic genomes and are the most popular genetic markers, but the SSRs of mosquito genomes are still not well understood. In this study, we identified and analyzed the SSRs in 23 mosquito species using Drosophila melanogaster as reference at the whole-genome level. The results show that SSR numbers (33 076-560 175/genome) and genome sizes (574.57-1342.21 Mb) are significantly positively correlated (R 2 = 0.8992, P < 0.01), but the correlation in individual species varies in these mosquito species. In six types of SSR, mono- to trinucleotide SSRs are dominant with cumulative percentages of 95.14%-99.00% and densities of 195.65/Mb-787.51/Mb, whereas tetra- to hexanucleotide SSRs are rare with 1.12%-4.22% and 3.76/Mb-40.23/Mb. The (A/T)n, (AC/GT)n and (AGC/GCT)n are the most frequent motifs in mononucleotide, dinucleotide and trinucleotide SSRs, respectively, and the motif frequencies of tetra- to hexanucleotide SSRs appear to be species-specific. The 10-20 bp length of SSRs are dominant with the number of 110 561 ± 93 482 and the frequency of 87.25% ± 5.73% on average, and the number and frequency decline with the increase of length. Most SSRs (83.34% ± 7.72%) are located in intergenic regions, followed by intron regions (11.59% ± 5.59%), exon regions (3.74% ± 1.95%), and untranslated regions (1.32% ± 1.39%). The mono-, di- and trinucleotide SSRs are the main SSRs in both gene regions (98.55% ± 0.85%) and exon regions (99.27% ± 0.52%). An average of 42.52% of total genes contains SSRs, and the preference for SSR occurrence in different gene subcategories are species-specific. The study provides useful insights into the SSR diversity, characteristics and distribution in 23 mosquito species of genomes. © 2018 Institute of Zoology, Chinese Academy of Sciences.

  8. High-throughput sequencing of the entire genomic regions of CCM1/KRIT1, CCM2 and CCM3/PDCD10 to search for pathogenic deep-intronic splice mutations in cerebral cavernous malformations.

    PubMed

    Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute

    2017-09-01

    Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  9. The roles of picornavirus untranslated regions in infection and innate immunity

    USDA-ARS?s Scientific Manuscript database

    Viral genomes have evolved to maximize their potential of overcoming host defense mechanisms and to induce a variety of disease syndromes. Structurally, a genome of a virus consists of coding and noncoding regions, and both have been shown to contribute to initiation and progression of disease. Ac...

  10. The Third Intron of the Interferon Regulatory Factor-8 Is an Initiator of Repressed Chromatin Restricting Its Expression in Non-Immune Cells

    PubMed Central

    Barnea-Yizhar, Ofer; Ram, Sigal; Kovalev, Ekaterina; Azriel, Aviva; Rand, Ulfert; Nakayama, Manabu; Hauser, Hansjörg; Gepstein, Lior; Levi, Ben-Zion

    2016-01-01

    Interferon Regulatory Factor-8 (IRF-8) serves as a key factor in the hierarchical differentiation towards monocyte/dendritic cell lineages. While much insight has been accumulated into the mechanisms essential for its hematopoietic specific expression, the mode of restricting IRF-8 expression in non-hematopoietic cells is still unknown. Here we show that the repression of IRF-8 expression in restrictive cells is mediated by its 3rd intron. Removal of this intron alleviates the repression of Bacterial Artificial Chromosome (BAC) IRF-8 reporter gene in these cells. Fine deletion analysis points to conserved regions within this intron mediating its restricted expression. Further, the intron alone selectively initiates gene silencing only in expression-restrictive cells. Characterization of this intron’s properties points to its role as an initiator of sustainable gene silencing inducing chromatin condensation with suppressive histone modifications. This intronic element cannot silence episomal transgene expression underlining a strict chromatin-dependent silencing mechanism. We validated this chromatin-state specificity of IRF-8 intron upon in-vitro differentiation of induced pluripotent stem cells (iPSCs) into cardiomyocytes. Taken together, the IRF-8 3rd intron is sufficient and necessary to initiate gene silencing in non-hematopoietic cells, highlighting its role as a nucleation core for repressed chromatin during differentiation. PMID:27257682

  11. A systematic evaluation of expression of HERV-W elements; influence of genomic context, viral structure and orientation

    PubMed Central

    2011-01-01

    Background One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this family has also been associated with human disease such as multiple sclerosis (MS) and schizophrenia. It is not known whether this broad expression of HERV-W elements represents transcriptional leakage or specific transcription initiated from the retroviral promoter in the long terminal repeat (LTR) region. Therefore, potential influences of genomic context, structure and orientation on the expression levels of individual HERV-W elements in normal human tissues were systematically investigated. Results Whereas intronic HERV-W elements with a pseudogene structure exhibited a strong anti-sense orientation bias, intronic elements with a proviral structure and solo LTRs did not. Although a highly variable expression across tissues and elements was observed, systematic effects of context, structure and orientation were also observed. Elements located in intronic regions appeared to be expressed at higher levels than elements located in intergenic regions. Intronic elements with proviral structures were expressed at higher levels than those elements bearing hallmarks of processed pseudogenes or solo LTRs. Relative to their corresponding genes, intronic elements integrated on the sense strand appeared to be transcribed at higher levels than those integrated on the anti-sense strand. Moreover, the expression of proviral elements appeared to be independent from that of their corresponding genes. Conclusions Intronic HERV-W provirus integrations on the sense strand appear to have elicited a weaker negative selection than pseudogene integrations of transcripts from such elements. Our current findings suggest that the previously observed diversified and tissue-specific expression of elements in the HERV-W family is the result of both directed transcription (involving both the LTR and internal sequence) and leaky transcription of HERV-W elements in normal human tissues. PMID:21226900

  12. The 5' untranslated region of the VR-ACS1 mRNA acts as a strong translational enhancer in plants.

    PubMed

    Wever, Willem; McCallum, Emily J; Chakravorty, David; Cazzonelli, Christopher I; Botella, José R

    2010-08-01

    The structure and function of untranslated mRNA leader sequences and their role in controlling gene expression remains poorly understood. Previous research has suggested that the 5' untranslated region (5'UTR) of the Vigna radiata aminocyclopropane-1-carboxylate synthase synthase (VR-ACS1) gene may function as a translational enhancer in plants. To test such hypothesis we compared the translation enhancing properties of three different 5'UTRs; those from the VR-ACS1, the chlorophyll a/b binding gene from petunia (Cab22L; a known translational enhancer) and the Vigna radiata pectinacetylesterase gene (PAE; used as control). Identical constructs in which the coding region of the beta-glucuronidase (GUS) gene was fused to each of the three 5'UTRs and placed under the control of the cauliflower mosaic virus 35S promoter were prepared. Transient expression assays in tobacco cell cultures and mung bean leaves showed that the VR-ACS1 and Cab22L 5'UTRs directed higher levels of GUS activity than the PAE 5'UTR. Analysis of transgenic Arabidopsis thaliana seedlings, as well as different tissues from mature plants, confirmed that while transcript levels were equivalent for all constructs, the 5'UTRs from the VR-ACS1 and Cab22L genes can increase GUS activity twofold to fivefold compared to the PAE 5'UTR, therefore confirming the translational enhancing properties of the VR-ACS1 5'UTR.

  13. Role of the 2 adenine (g.11293_11294insAA) insertion polymorphism in the 3' untranslated region of the factor VII (FVII) gene: molecular characterization of a patient with severe FVII deficiency.

    PubMed

    Peyvandi, F; Garagiola, I; Palla, R; Marziliano, N; Mannucci, P M

    2005-11-01

    Polymorphic variants in the gene encoding factor VII (F7) affect the plasma levels of this coagulation protein and modify the clinical phenotype of FVII deficiency in some patients. In this study we report the in vitro functional analysis of a novel polymorphic variant located in the 3' untranslated region of F7: g.11293_11294insAA. To determine whether this variant regulates FVII expression, we initially compared an expression vector containing FVII cDNA with g.11293_11294insAA with the FVII wild-type (WT) construct. The kinetics of mRNA production showed that the insertion decreases the steady-state FVII mRNA levels. To assess whether the insertion influences the phenotype of FVII-deficient patients, we evaluated its effect on the expression of FVII in a patient with severe FVII deficiency (undetectable FVII activity and antigen) carrying two additional homozygous missense variations (p.Arg277Cys and p.Arg353Gln). The two substitutions alone reduced the expression of FVII activity and antigen in vitro, but with the insertion polymorphism in our expression vector the patient's phenotype of undetectable plasma FVII was recapitulated. The insertion polymorphism in the 3' untranslated region of F7 is another modifier of FVII expression that might explain the poor genotype-phenotype correlation in some FVII-deficient patients. Copyright 2005 Wiley-Liss, Inc.

  14. A piggyBac-based reporter system for scalable in vitro and in vivo analysis of 3′ untranslated region-mediated gene regulation

    PubMed Central

    Chaudhury, Arindam; Kongchan, Natee; Gengler, Jon P.; Mohanty, Vakul; Christiansen, Audrey E.; Fachini, Joseph M.; Martin, James F.; Neilson, Joel R.

    2014-01-01

    Regulation of messenger ribonucleic acid (mRNA) subcellular localization, stability and translation is a central aspect of gene expression. Much of this control is mediated via recognition of mRNA 3′ untranslated regions (UTRs) by microRNAs (miRNAs) and RNA-binding proteins. The gold standard approach to assess the regulation imparted by a transcript's 3′ UTR is to fuse the UTR to a reporter coding sequence and assess the relative expression of this reporter as compared to a control. Yet, transient transfection approaches or the use of highly active viral promoter elements may overwhelm a cell's post-transcriptional regulatory machinery in this context. To circumvent this issue, we have developed and validated a novel, scalable piggyBac-based vector for analysis of 3′ UTR-mediated regulation in vitro and in vivo. The vector delivers three independent transcription units to the target genome—a selection cassette, a turboGFP control reporter and an experimental reporter expressed under the control of a 3′ UTR of interest. The pBUTR (piggyBac-based 3′ UnTranslated Region reporter) vector performs robustly as a siRNA/miRNA sensor, in established in vitro models of post-transcriptional regulation, and in both arrayed and pooled screening approaches. The vector is robustly expressed as a transgene during murine embryogenesis, highlighting its potential usefulness for revealing post-transcriptional regulation in an in vivo setting. PMID:24753411

  15. A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

    PubMed

    Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

    2006-04-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.

  16. A Comparative Genomics Strategy for Targeted Discovery of Single-Nucleotide Polymorphisms and Conserved-Noncoding Sequences in Orphan Crops1[W

    PubMed Central

    Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.

    2006-01-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031

  17. Investigation of the estrogen receptor-alpha gene with type 2 diabetes and/or nephropathy in African-American and European-American populations.

    PubMed

    Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2007-03-01

    The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.

  18. Low-copy nuclear primers and ycf1 primers in Cactaceae.

    PubMed

    Franck, Alan R; Cochrane, Bruce J; Garey, James R

    2012-10-01

    To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). • Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). • The isi, nhx, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.

  19. Bio—Cryptography: A Possible Coding Role for RNA Redundancy

    NASA Astrophysics Data System (ADS)

    Regoli, M.

    2009-03-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  20. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    PubMed

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  1. Mutation analysis of FANCD2, BRIP1/BACH1, LMO4 and SFN in familial breast cancer.

    PubMed

    Lewis, Aaron G; Flanagan, James; Marsh, Anna; Pupo, Gulietta M; Mann, Graham; Spurdle, Amanda B; Lindeman, Geoffrey J; Visvader, Jane E; Brown, Melissa A; Chenevix-Trench, Georgia

    2005-01-01

    Mutations in known predisposition genes account for only about a third of all multiple-case breast cancer families. We hypothesized that germline mutations in FANCD2, BRIP1/BACH1, LMO4 and SFN may account for some of the unexplained multiple-case breast cancer families. The families used in this study were ascertained through the Kathleen Cuningham Foundation Consortium for Research into Familial Breast Cancer (kConFab). Denaturing high performance liquid chromatography (DHPLC) analysis of the coding regions of these four genes was conducted in the youngest affected cases of 30 to 267 non-BRCA1/2 breast cancer families. In addition, a further 399 index cases were also screened for mutations in two functionally significant regions of the FANCD2 gene and 253 index cases were screened for two previously reported mutations in BACH1 (p. P47A and p. M299I). DHPLC analysis of FANCD2 identified six silent exonic variants, and a large number of intronic variants, which tagged two common haplotypes. One protein truncating variant was found in BRIP1/BACH1, as well as four missense variants, a silent change and a variant in the 3' untranslated region. No missense or splice site mutations were found in LMO4 or SFN. Analysis of the missense, silent and frameshift variants of FANCD2 and BACH1 in relatives of the index cases, and in a panel of controls, found no evidence suggestive of pathogenicity. There is no evidence that highly penetrant exonic or splice site mutations in FANCD2, BRIP1/BACH1, LMO4 or SFN contribute to familial breast cancer. Large scale association studies will be necessary to determine whether any of the polymorphisms or haplotypes identified in these genes contributes to breast cancer risk.

  2. Genetic variation in HTR2A influences serotonin transporter binding potential as measured using PET and [11C]DASB.

    PubMed

    Laje, Gonzalo; Cannon, Dara M; Allen, Andrew S; Klaver, Jackie M; Peck, Summer A; Liu, Xinmin; Manji, Husseini K; Drevets, Wayne C; McMahon, Francis J

    2010-07-01

    In a previous study we showed that genetic variation in HTR2A, which encodes the serotonin 2A receptor, influenced outcome of citalopram treatment in patients with major depressive disorder. Since chronic administration of citalopram, which selectively and potently inhibits the serotonin transporter (5-HTT), putatively enhances serotonergic transmission, it is conceivable that genetic variation within HTR2A also influences pretreatment 5-HTT function or serotonergic transmission. The present study used positron emission tomography (PET) and the selective 5-HTT ligand, [11C]DASB, to investigate whether the HTR2A marker alleles that predict treatment outcome also predict differences in 5-HTT binding. Brain levels of 5-HTT were assessed in vivo using PET measures of the non-displaceable component of the [11C]DASB binding potential (BPND). DNA from 43 patients and healthy volunteers, all unmedicated, was genotyped with 14 single nucleotide polymorphisms located within or around HTR2A. Allelic association with BPND was assessed in eight brain regions, with covariates to control for race and ethnicity. We detected allelic association between [11C]DASB BPND in thalamus and three markers in a region spanning the 3' untranslated region and second intron of HTR2A (rs7333412, p=0.000045; rs7997012, p=0.000086; rs977003, p=0.000069). The association signal at rs7333412 remained significant (p<0.05) after applying corrections for multiple testing via permutation. Genetic variation in HTR2A that was previously associated with citalopram treatment outcome was also associated with thalamic 5-HTT binding. While further work is needed to identify the actual functional genetic variants involved, these results suggest that a relationship exists between genetic variation in HTR2A and either 5-HTT expression or central serotonergic transmission that influences the therapeutic response to 5-HTT inhibition in major depression.

  3. Using the candidate gene approach for detecting genes underlying seed oil concentration and yield in soybean.

    PubMed

    Eskandari, Mehrzad; Cober, Elroy R; Rajcan, Istvan

    2013-07-01

    Increasing the oil concentration in soybean seeds has been given more attention in recent years because of demand for both edible oil and biodiesel production. Oil concentration in soybean is a complex quantitative trait regulated by many genes as well as environmental conditions. To identify genes governing seed oil concentration in soybean, 16 putative candidate genes of three important gene families (GPAT: acyl-CoA:sn-glycerol-3-phosphate acyltransferase, DGAT: acyl-CoA:diacylglycerol acyltransferase, and PDAT: phospholipid:diacylglycerol acyltransferase) involved in triacylglycerol (TAG) biosynthesis pathways were selected and their sequences retrieved from the soybean database ( http://www.phytozome.net/soybean ). Three sequence mutations were discovered in either coding or noncoding regions of three DGAT soybean isoforms when comparing the parents of a 203 recombinant inbreed line (RIL) population; OAC Wallace and OAC Glencoe. The RIL population was used to study the effects of these mutations on seed oil concentration and other important agronomic and seed composition traits, including seed yield and protein concentration across three field locations in Ontario, Canada, in 2009 and 2010. An insertion/deletion (indel) mutation in the GmDGAT2B gene in OAC Wallace was significantly associated with reduced seed oil concentration across three environments and reduced seed yield at Woodstock in 2010. A mutation in the 3' untranslated (3'UTR) region of GmDGAT2C was associated with seed yield at Woodstock in 2009. A mutation in the intronic region of GmDGAR1B was associated with seed yield and protein concentration at Ottawa in 2010. The genes identified in this study had minor effects on either seed yield or oil concentration, which was in agreement with the quantitative nature of the traits. However, the novel gene-specific markers designed in the present study can be used in soybean breeding for marker-assisted selection aimed at increasing seed yield and oil concentration with no significant impact on seed protein concentration.

  4. Bottomless barrel-sponge species in the Indo-Pacific?

    PubMed

    Setiawan, Edwin; Voogd, Nicole J De; Wörheide, Gert; Erpenbeck, Dirk

    2016-07-06

    The use of nuclear markers, in addition to traditional mitochondrial markers, helps to clarify hidden patterns of genetic structure in natural populations (Palumbi & Baker, 1994). This is particularly evident among demosponges that possess slow mitochondrial evolutionary rates compared to Bilateria, where nuclear intron markers can aid in the understanding of shallow level phylogenetic relationships (Shearer et al., 2002). Ideally, these nuclear markers (i) are evolutionary well-conserved across different lineages, (ii) produce amplicons holding a number of sites with sufficient variability to answer the relevant phylogenetic question, (iii) derive from single copy genes (see review in Zhang & Hewitt, 2003). A popular method to amplify intron markers uses EPIC (Exon-Primed, Intron-Crossing) primers that anneal to the more conserved flanking exon regions and subsequently bridge the intron during amplification (Palumbi & Baker, 1994).

  5. Sequence Variation of the tRNALeu Intron as a Marker for Genetic Diversity and Specificity of Symbiotic Cyanobacteria in Some Lichens

    PubMed Central

    Paulsrud, Per; Lindblad, Peter

    1998-01-01

    We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083

  6. Deep intronic GPR143 mutation in a Japanese family with ocular albinism

    PubMed Central

    Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

    2015-01-01

    Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757

  7. Deep intronic GPR143 mutation in a Japanese family with ocular albinism.

    PubMed

    Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

    2015-06-10

    Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.

  8. Gross rearrangements within the 5'-untranslated region of the picornaviral genomes.

    PubMed

    Pilipenko, E V; Blinov, V M; Agol, V I

    1990-06-11

    An analysis of reported nucleotide sequences revealed several cases of gross rearrangements in the 5'-untranslated region (5-UTR) of picornaviral genomes. A large (greater than 100 nt) duplication was discovered in a downstream region of poliovirus 5-UTR involved in the translational control. Properties of the poliovirus mutants with large deletions [Kuge and Nomoto (1987) J. Virol. 61, 1478-1487] show that a single copy of the appropriate repeating unit is compatible with a wild type phenotype of the virus. In contrast to poliovirus and another enterovirus genomes, human rhinovirus RNAs contain only a single copy of this repeating unit. Another similarly large repeat was found in an upstream segment of the bovine enterovirus 5-UTR. A comparison of the primary and secondary structures of cardio- and aphthovirus 5-UTRs demonstrated the existence of a large (ca. 250 nucleotides) insertion/deletion in a region preceding the poly(C) tract. The two latter rearrangements appear to involve elements of the viral genome replication machinery. Possible origin as well as evolutionary and functional implications of these structural peculiarities are discussed.

  9. Myostatin-2 gene structure and polymorphism of the promoter and first intron in the marine fish Sparus aurata: evidence for DNA duplications and/or translocations.

    PubMed

    Nadjar-Boger, Elisabeth; Funkenstein, Bruria

    2011-02-01

    Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.

  10. Human somatostatin I: sequence of the cDNA.

    PubMed Central

    Shen, L P; Pictet, R L; Rutter, W J

    1982-01-01

    RNA has been isolated from a human pancreatic somatostatinoma and used to prepare a cDNA library. After prescreening, clones containing somatostatin I sequences were identified by hybridization with an anglerfish somatostatin I-cloned cDNA probe. From the nucleotide sequence of two of these clones, we have deduced an essentially full-length mRNA sequence, including the preprosomatostatin coding region, 105 nucleotides from the 5' untranslated region and the complete 150-nucleotide 3' untranslated region. The coding region predicts a 116-amino acid precursor protein (Mr, 12.727) that contains somatostatin-14 and -28 at its COOH terminus. The predicted amino acid sequence of human somatostatin-28 is identical to that of somatostatin-28 isolated from the porcine and ovine species. A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH-terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. However, many of the amino acid differences found in the pro region of the human and anglerfish proteins are conservative changes. This suggests that the propeptides have a similar secondary structure, which in turn may imply a biological function for this region of the molecule. Images PMID:6126875

  11. Comprehensive evaluation of the Estrogen Receptor Alpha gene reveals further evidence for association with type 2 diabetes enriched for nephropathy in an African American population

    PubMed Central

    Keene, Keith L.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Leak, Tennille S.; Perlegas, Peter S.; Langefeld, Carl D.; Herrington, David M.; Freedman, Barry I.; Rich, Stephen S.; Bowden, Donald W.; Sale, Michèle M.

    2009-01-01

    We previously investigated the estrogen receptor α gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and type 2 diabetes and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and End Stage Renal Disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a χ2 statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P< 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P=0.000291, adjusted P=0.0289), and intron 4 SNPs rs1569788 (nominal P=0.000754, adjusted P=0.0278) and rs9340969 (nominal P=0.00109, adjusted P=0.0467) remained significant at experimentwise error rate (EER) P<0.05 for the dominant class of tests. Twenty-three of the thirty-one associated SNPs cluster within the intron 4-intron 6 region. Gender stratification revealed nominal evidence for association with 35 SNPs in females (352 cases; 306 controls) and seven SNPs in males (225 cases; 290 controls). We have identified a novel region of the ESR1 gene that may contain important functional polymorphisms in relation to susceptibility to T2DM and/or diabetic nephropathy. PMID:18305958

  12. Comprehensive evaluation of the estrogen receptor alpha gene reveals further evidence for association with type 2 diabetes enriched for nephropathy in an African American population.

    PubMed

    Keene, Keith L; Mychaleckyj, Josyf C; Smith, Shelly G; Leak, Tennille S; Perlegas, Peter S; Langefeld, Carl D; Herrington, David M; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2008-05-01

    We previously investigated the estrogen receptor alpha gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and T2DM and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and end stage renal disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a chi(2) statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P < 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P = 0.000291, adjusted P = 0.0289), and intron 4 SNPs rs1569788 (nominal P = 0.000754, adjusted P = 0.0278) and rs9340969 (nominal P = 0.00109, adjusted P = 0.0467) remained significant at experimentwise error rate (EER) P

  13. Polymorphism of intron-1 in the voltage-gated sodium channel gene of Anopheles gambiae s.s. populations from Cameroon with emphasis on insecticide knockdown resistance mutations.

    PubMed

    Etang, Josiane; Vicente, Jose L; Nwane, Philippe; Chouaibou, Mouhamadou; Morlais, Isabelle; Do Rosario, Virgilio E; Simard, Frederic; Awono-Ambene, Parfait; Toto, Jean Claude; Pinto, Joao

    2009-07-01

    Sequence variation at the intron-1 of the voltage-gated sodium channel gene in Anopheles gambiae M- and S-forms from Cameroon was assessed to explore the number of mutational events originating knockdown resistance (kdr) alleles. Mosquitoes were sampled between December 2005 and June 2006 from three geographical areas: (i) Magba in the western region; (ii) Loum, Tiko, Douala, Kribi, and Campo along the Atlantic coast; and (iii) Bertoua, in the eastern continental plateau. Both 1014S and 1014F kdr alleles were found in the S-form with overall frequencies of 14% and 42% respectively. Only the 1014F allele was found in the M-form at lower frequency (11%). Analysis of a 455 bp region of intron-1 upstream the kdr locus revealed four independent mutation events originating kdr alleles, here named MS1 -1014F, S1-1014S and S2-1014S kdr-intron-1 haplotypes in S-form and MS3-1014F kdr-intron-1 haplotype in the M-form. Furthermore, there was evidence for mutual introgression of kdr 1014F allele between the two molecular forms, MS1 and MS3 being widely shared by them. Although no M/S hybrid was observed in analysed samples, this wide distribution of haplotypes MS1 and MS3 suggests inter-form hybridizing at significant level and emphasizes the rapid diffusion of the kdr alleles in Africa. The mosaic of genetic events found in Cameroon is representative of the situation in the West-Central African region and highlights the importance of evaluating the spatial and temporal evolution of kdr alleles for a better management of insecticide resistance.

  14. Bipolar localization of the group II intron Ll.LtrB is maintained in Escherichia coli deficient in nucleoid condensation, chromosome partitioning and DNA replication.

    PubMed

    Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J

    2006-11-01

    Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.

  15. Molecular phylogeny of C1 inhibitor depicts two immunoglobulin-like domains fusion in fishes and ray-finned fishes specific intron insertion after separation from zebrafish

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kumar, Abhishek, E-mail: akumar@bot.uni-kiel.de; Bhandari, Anita; Sarde, Sandeep J.

    Highlights: • C1 inhibitors of fishes have two Ig domains fused in the N-terminal end. • Spliceosomal introns gain in two Ig domains of selected ray-finned fishes. • C1 inhibitors gene is maintained from 450 MY on the same locus. • C1 inhibitors gene is missing in frog and lampreys. • C1 inhibitors of tetrapod and fishes differ in the RCL region. - Abstract: C1 inhibitor (C1IN) is a multi-facet serine protease inhibitor in the plasma cascades, inhibiting several proteases, notably, regulates both complement and contact system activation. Despite huge advancements in the understanding of C1IN based on biochemical propertiesmore » and its roles in the plasma cascades, the phylogenetic history of C1IN remains uncharacterized. To date, there is no comprehensive study illustrating the phylogenetic history of C1IN. Herein, we explored phylogenetic history of C1IN gene in vertebrates. Fishes have C1IN with two immunoglobulin like domains attached in the N-terminal region. The RCL regions of CIIN from fishes and tetrapod genomes have variations at the positions P2 and P1′. Gene structures of C1IN gene from selected ray-finned fishes varied in the Ig domain region with creation of novel intron splitting exon Im2 into Im2a and Im2b. This intron is limited to ray-finned fishes with genome size reduced below 1 Gb. Hence, we suggest that genome compaction and associated double-strand break repairs are behind this intron gain. This study reveals the evolutionary history of C1IN and confirmed that this gene remains the same locus for ∼450 MY in 52 vertebrates analysed, but it is not found in frogs and lampreys.« less

  16. Gene encoding the human. beta. -hexosaminidase. beta. chain: Extensive homology of intron placement in the. alpha. - and. beta. -chain genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Proia, R.L.

    1988-03-01

    Lysosomal {beta}-hexosaminidase is composed of two structurally similar chains, {alpha} and {beta}, that are the products of different genes. Mutations in either gene causing {beta}-hexosaminidase deficiency result in the lysosomal storage disease GM2-gangliosidosis. To enable the investigation of the molecular lesions in this disorder and to study the evolutionary relationship between the {alpha} and {beta} chains, the {beta}-chain gene was isolated, and its organization was characterized. The {beta}-chain coding region is divided into 14 exons distributed over {approx}40 kilobases of DNA. Comparison with the {alpha}-chain gene revealed that 12 of the 13 introns interrupt the coding regions at homologous positions.more » This extensive sharing of intron placement demonstrates that the {alpha} and {beta} chains evolved by way of the duplication of a common ancestor.« less

  17. Genome sequences of a mouse-avirulent and a mouse-virulent strain of Ross River virus.

    PubMed

    Faragher, S G; Meek, A D; Rice, C M; Dalgarno, L

    1988-04-01

    The nucleotide sequence of the genomic RNA of a mouse-avirulent strain of Ross River virus, RRV NB5092 (isolated in 1969), has been determined and the corresponding sequence for the prototype mouse-virulent strain, RRV T48 (isolated in 1959), has been completed. The RRV NB5092 genome is approximately 11,674 nucleotides in length, compared with 11,853 nucleotides for RRV T48. RRV NB5092 and RRV T48 have the same genome organization. For both viruses an untranslated region of 80 nucleotides at the 5' end of the genome is followed by a 7440-nucleotide open reading frame which is interrupted after 5586 nucleotides by a single opal termination codon. By homology with other alphaviruses, the 5586-nucleotide open reading frame encodes the nonstructural proteins nsP1, nsP2, and nsP3; a fourth nonstructural protein, nsP4, is produced by read-through of the opal codon. The RRV nonstructural proteins show strong homology with the corresponding proteins of Sindbis virus and Semliki Forest virus in terms of size, net charge, and hydropathy characteristics. However, homology is not uniform between or within the proteins; nsP1, nsP2, and nsP4 contain extended domains which are highly conserved between alphaviruses, while the C-terminal region of nsP3 shows little conservation in sequence or length between alphaviruses. An untranslated "junction" region of 44 nucleotides (for RRV NB5092) or 47 nucleotides (for RRV T48) separates the nonstructural and structural protein coding regions. The structural proteins (capsid-E3-E2-6K-E1) are translated from an open reading frame of 3762 nucleotides which is followed by a 3'-untranslated region of approximately 348 nucleotides (for RRV NB5092) or 524 nucleotides (for RRV T48). Excluding deletions and insertions, the genomes of RRV NB5092 and RRV T48 differ at 284 nucleotides, representing a sequence divergence of 2.38%. Sequence deletions or insertions were found only in the noncoding regions and include a 173-nucleotide deletion in the 3'-untranslated region of RRV NB5092, compared with RRV T48. In the coding regions, most of the nucleotide differences are silent; there are 36 amino acid differences in the nonstructural proteins and 12 in the structural proteins. The distribution of amino acid differences between the two RRV strains correlates with the location of domains which are poorly conserved in sequence between alphaviruses. The possible role of amino acid differences in envelope glycoproteins E1 and E2 in determining the different antigenic and biological properties of RRV NB5092 and RRV T48 is discussed.

  18. SECIS elements in the coding regions of selenoprotein transcripts are functional in higher eukaryotes

    PubMed Central

    Mix, Heiko; Lobanov, Alexey V.; Gladyshev, Vadim N.

    2007-01-01

    Expression of selenocysteine (Sec)-containing proteins requires the presence of a cis-acting mRNA structure, called selenocysteine insertion sequence (SECIS) element. In bacteria, this structure is located in the coding region immediately downstream of the Sec-encoding UGA codon, whereas in eukaryotes a completely different SECIS element has evolved in the 3′-untranslated region. Here, we report that SECIS elements in the coding regions of selenoprotein mRNAs support Sec insertion in higher eukaryotes. Comprehensive computational analysis of all available viral genomes revealed a SECIS element within the ORF of a naturally occurring selenoprotein homolog of glutathione peroxidase 4 in fowlpox virus. The fowlpox SECIS element supported Sec insertion when expressed in mammalian cells as part of the coding region of viral or mammalian selenoproteins. In addition, readthrough at UGA was observed when the viral SECIS element was located upstream of the Sec codon. We also demonstrate successful de novo design of a functional SECIS element in the coding region of a mammalian selenoprotein. Our data provide evidence that the location of the SECIS element in the untranslated region is not a functional necessity but rather is an evolutionary adaptation to enable a more efficient synthesis of selenoproteins. PMID:17169995

  19. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication

    PubMed Central

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-01-01

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615

  20. Evaluation of non-coding variation in GLUT1 deficiency.

    PubMed

    Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S

    2016-12-01

    Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.

  1. A novel non-coding RNA within an intron of CDH2 and association of its SNP with non-syndromic cleft lip and palate.

    PubMed

    Kumari, Priyanka; Singh, Subodh Kumar; Raman, Rajiva

    2018-06-05

    Genome-wide linkage analysis and whole genome sequencing in a Van der Woude syndrome (VWS) family revealed that the SNP, rs539075, within intron 2 of the cadherin 2 gene (CDH2) co-segregated with the disease phenotype. A study with nonsyndromic cleft lip with or without cleft palate (NSCL ± P) cases (N = 292) and controls (N = 287) established association of this SNP with NSCL ± P as a risk factor. RT-PCR based expression analysis of the SNP-harbouring region of intron 2 of CDH2 in the clefted lip and/or palate tissues of 16 patients revealed that the mutant allele expressed in all those individuals having it (hetero-/homozygous), whereas the wild type allele expressed in <50% of the samples in which it was present. The intronic transcript was also present in the prospective lip and palate region of 13.5 dpc mouse embryo, detected by RNA in situ hybridization and RT-PCR. These results including the in silico, characterization of the ~200 nt-intronic transcript showed that conformationally it fits best with noncoding small RNA, possibly a precursor of miRNA. Its function in the orofacial organogenesis remains to be elucidated which will enable us to define the role of this mutant ncRNA in the clefting of lip and palate. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum).

    PubMed

    Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi

    2016-01-01

    The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.

  3. Patterns of population differentiation of candidate genes for cardiovascular disease

    PubMed Central

    Kullo, Iftikhar J; Ding, Keyue

    2007-01-01

    Background The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups. PMID:17626638

  4. HLA-E coding and 3' untranslated region variability determined by next-generation sequencing in two West-African population samples.

    PubMed

    Castelli, Erick C; Mendes-Junior, Celso T; Sabbagh, Audrey; Porto, Iane O P; Garcia, André; Ramalho, Jaqueline; Lima, Thálitta H A; Massaro, Juliana D; Dias, Fabrício C; Collares, Cristhianna V A; Jamonneau, Vincent; Bucheton, Bruno; Camara, Mamadou; Donadi, Eduardo A

    2015-12-01

    HLA-E is a non-classical Human Leucocyte Antigen class I gene with immunomodulatory properties. Whereas HLA-E expression usually occurs at low levels, it is widely distributed amongst human tissues, has the ability to bind self and non-self antigens and to interact with NK cells and T lymphocytes, being important for immunosurveillance and also for fighting against infections. HLA-E is usually the most conserved locus among all class I genes. However, most of the previous studies evaluating HLA-E variability sequenced only a few exons or genotyped known polymorphisms. Here we report a strategy to evaluate HLA-E variability by next-generation sequencing (NGS) that might be used to other HLA loci and present the HLA-E haplotype diversity considering the segment encoding the entire HLA-E mRNA (including 5'UTR, introns and the 3'UTR) in two African population samples, Susu from Guinea-Conakry and Lobi from Burkina Faso. Our results indicate that (a) the HLA-E gene is indeed conserved, encoding mainly two different protein molecules; (b) Africans do present several unknown HLA-E alleles presenting synonymous mutations; (c) the HLA-E 3'UTR is quite polymorphic and (d) haplotypes in the HLA-E 3'UTR are in close association with HLA-E coding alleles. NGS has proved to be an important tool on data generation for future studies evaluating variability in non-classical MHC genes. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  5. Alternative exon definition events control the choice between nuclear retention and cytoplasmic export of U11/U12-65K mRNA.

    PubMed

    Verbeeren, Jens; Verma, Bhupendra; Niemelä, Elina H; Yap, Karen; Makeyev, Eugene V; Frilander, Mikko J

    2017-05-01

    Cellular homeostasis of the minor spliceosome is regulated by a negative feed-back loop that targets U11-48K and U11/U12-65K mRNAs encoding essential components of the U12-type intron-specific U11/U12 di-snRNP. This involves interaction of the U11 snRNP with an evolutionarily conserved splicing enhancer giving rise to unproductive mRNA isoforms. In the case of U11/U12-65K, this mechanism controls the length of the 3' untranslated region (3'UTR). We show that this process is dynamically regulated in developing neurons and some other cell types, and involves a binary switch between translation-competent mRNAs with a short 3'UTR to non-productive isoforms with a long 3'UTR that are retained in the nucleus or/and spliced to the downstream amylase locus. Importantly, the choice between these alternatives is determined by alternative terminal exon definition events regulated by conserved U12- and U2-type 5' splice sites as well as sequence signals used for pre-mRNA cleavage and polyadenylation. We additionally show that U11 snRNP binding to the U11/U12-65K mRNA species with a long 3'UTR is required for their nuclear retention. Together, our studies uncover an intricate molecular circuitry regulating the abundance of a key spliceosomal protein and shed new light on the mechanisms limiting the export of non-productively spliced mRNAs from the nucleus to the cytoplasm.

  6. Translational Control of FOG-2 Expression in Cardiomyocytes by MicroRNA-130a

    PubMed Central

    Kim, Gene H.; Samant, Sadhana A.; Earley, Judy U.; Svensson, Eric C.

    2009-01-01

    MicroRNAs are increasingly being recognized as regulators of embryonic development; however, relatively few microRNAs have been identified to regulate cardiac development. FOG-2 (also known as zfpm2) is a transcriptional co-factor that we have previously shown is critical for cardiac development. In this report, we demonstrate that FOG-2 expression is controlled at the translational level by microRNA-130a. We identified a conserved region in the FOG-2 3′ untranslated region predicted to be a target for miR-130a. To test the functional significance of this site, we generated an expression construct containing the luciferase coding region fused with the 3′ untranslated region of FOG-2 or a mutant version lacking this microRNA binding site. When these constructs were transfected into NIH 3T3 fibroblasts (which are known to express miR-130a), we observed a 3.3-fold increase in translational efficiency when the microRNA target site was disrupted. Moreover, knockdown of miR-130a in fibroblasts resulted in a 3.6-fold increase in translational efficiency. We also demonstrate that cardiomyocytes express miR-130a and can attenuate translation of mRNAs with a FOG-2 3′ untranslated region. Finally, we generated transgenic mice with cardiomyocyte over-expression of miR-130a. In the hearts of these mice, FOG-2 protein levels were reduced by as much as 80%. Histological analysis of transgenic embryos revealed ventricular wall hypoplasia and ventricular septal defects, similar to that seen in FOG-2 deficient hearts. These results demonstrate the importance of miR-130a for the regulation of FOG-2 protein expression and suggest that miR-130a may also play a role in the regulation of cardiac development. PMID:19582148

  7. Intron loss from the NADH dehydrogenase subunit 4 gene of lettuce mitochondrial DNA: evidence for homologous recombination of a cDNA intermediate.

    PubMed

    Geiss, K T; Abbas, G M; Makaroff, C A

    1994-04-01

    The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.

  8. Sequence and Expression Analysis of Interferon Regulatory Factor 10 (IRF10) in Three Diverse Teleost Fish Reveals Its Role in Antiviral Defense

    PubMed Central

    Xu, Qiaoqing; Jiang, Yousheng; Wangkahart, Eakapol; Zou, Jun; Chang, Mingxian; Yang, Daiqin; Secombes, Chris J.; Nie, Pin; Wang, Tiehui

    2016-01-01

    Background Interferon regulatory factor (IRF) 10 was first found in birds and is present in the genome of other tetrapods (but not humans and mice), as well as in teleost fish. The functional role of IRF10 in vertebrate immunity is relatively unknown compared to IRF1-9. The target of this research was to clone and characterize the IRF10 genes in three economically important fish species that will facilitate future evaluation of this molecule in fish innate and adaptive immunity. Molecular Characterization of IRF10 in Three Fish Species In the present study, a single IRF10 gene was cloned in grass carp Ctenopharyngodon idella and Asian swamp eel Monopterus albus, and two, named IRF10a and IRF10b, in rainbow trout Oncorhynchus mykiss. The fish IRF10 molecules share highest identities to other vertebrate IRF10s, and have a well conserved DNA binding domain, IRF-associated domain, and an 8 exon/7 intron structure with conserved intron phase. The presence of an upstream ATG or open reading frame (ORF) in the 5’-untranslated region of different fish IRF10 cDNA sequences suggests potential regulation at the translational level, and this has been verified by in vitro transcription/translation experiments of the trout IRF10a cDNA, but would still need to be validated in fish cells. Expression Analysis of IRF10 In Vivo and In Vitro Both trout IRF10 paralogues are highly expressed in thymus, blood and spleen but are relatively low in head kidney and caudal kidney. Trout IRF10b expression is significantly higher than IRF10a in integumentary tissues i.e. gills, scales, skin, intestine, adipose fin and tail fins, suggesting that IRF10b may be more important in mucosal immunity. The expression of both trout IRF10 paralogues is up-regulated by recombinant IFN-γ. The expression of the IRF10 genes is highly induced by Poly I:C in vitro and in vivo, and by viral infection, but is less responsive to peptidoglycan and bacterial infection, suggesting an important role of fish IRF10 in antiviral defense. PMID:26783745

  9. New COL6A6 variant detected by whole-exome sequencing is linked to break points in intron 4 and 3′-UTR, deleting exon 5 of RHO, and causing adRP

    PubMed Central

    de Sousa Dias, Miguel; Hernan, Imma; Delás, Barbara; Pascual, Beatriz; Borràs, Emma; Gamundi, Maria José; Mañé, Begoña; Fernández-San José, Patricia; Ayuso, Carmen

    2015-01-01

    Purpose This study aimed to test a newly devised cost-effective multiplex PCR assay for the molecular diagnosis of autosomal dominant retinitis pigmentosa (adRP), as well as the use of whole-exome sequencing (WES) to detect disease-causing mutations in adRP. Methods Genomic DNA was extracted from peripheral blood lymphocytes of index patients with adRP and their affected and unaffected family members. We used a newly devised multiplex PCR assay capable of amplifying the genetic loci of RHO, PRPH2, RP1, PRPF3, PRPF8, PRPF31, IMPDH1, NRL, CRX, KLHL7, and NR2E3 to molecularly diagnose 18 index patients with adRP. We also performed WES in affected and unaffected members of four families with adRP in whom a disease-causing mutation was previously not found. Results We identified five previously reported mutations (p.Arg677X in the RP1 gene, p.Asp133Val and p.Arg195Leu in the PRPH2 gene, and p.Pro171Leu and p.Pro215Leu in the RHO gene) and one novel mutation (p.Val345Gly in the RHO gene) representing 33% detection of causative mutations in our adRP cohort. Comparative WES analysis showed a new variant (p.Gly103Arg in the COL6A6 gene) that segregated with the disease in one family with adRP. As this variant was linked with the RHO locus, we sequenced the complete RHO gene, which revealed a deletion in intron 4 that encompassed all of exon 5 and 28 bp of the 3′-untranslated region (UTR). Conclusions The novel multiplex PCR assay with next-generation sequencing (NGS) proved effective for detecting most of the adRP-causing mutations. A WES approach led to identification of a deletion in RHO through detection of a new linked variant in COL6A6. No pathogenic variants were identified in the remaining three families. Moreover, NGS and WES were inefficient for detecting the complete deletion of exon 5 in the RHO gene in one family with adRP. Carriers of this deletion showed variable clinical status, and two of these carriers had not previously been diagnosed with RP. PMID:26321861

  10. Growth hormone and Pit-1 expression in bovine fetal lymphoid cells.

    PubMed

    Chen, H T; Schuler, L A; Schultz, R D

    1997-11-01

    Bovine fetal lymphoid cells were examined for growth hormone (GH) and the transcription factor Pit-1/GHF-1 mRNA. GH and Pit-1/GHF-1 transcripts were detected in thymocytes and splenocytes from fetuses at 60, 90, 120, and 270 d of gestation using reverse transcription-polymerase chain reaction (RT-PCR). Northern analysis indicated that the lymphoid GH mRNA was approximately 350 nucleotides larger than in the pituitary. RT-PCR analysis demonstrated that the coding regions as well as 3' untranslated region of the lymphocyte GH and pituitary transcripts were the same. Analysis of the 5'-untranslated region of the lymphocyte GH mRNA showed that transcription began upstream from the start site in the pituitary gland, suggesting differences in regulation in these tissues. Fetal thymocytes and splenocytes expressed Pit-1/GHF-1 mRNA; however, they contained only the 2.5-kb transcript. The GH and Pit-1/GHF-1 mRNA in fetal lymphoid cells supports the hypothesis that lymphocyte-derived GH may function as an autocrine and/or paracrine factor during the development and maturation of the bovine fetal immune system.

  11. Analysis of the 5′ untranslated region (5′UTR) of the alcohol oxidase 1 (AOX1) gene in recombinant protein expression in Pichia pastoris

    PubMed Central

    Staley, Chris A.; Huang, Amy; Nattestad, Maria; Oshiro, Kristin T.; Ray, Laura E.; Mulye, Tejas; Li, Zhiguo Harry; Le, Thu; Stephens, Justin J.; Gomez, Seth R.; Moy, Allison D.; Nguyen, Jackson C.; Franz, Andreas H.; Lin-Cereghino, Joan; Lin-Cereghino, Geoff P.

    2012-01-01

    Pichia pastoris is a methylotrophic yeast that has been genetically engineered to express over one thousand heterologous proteins valued for industrial, pharmaceutical and basic research purposes. In most cases, the 5′ untranslated region (UTR) of the alcohol oxidase 1 (AOX1) gene is fused to the coding sequence of the recombinant gene for protein expression in this yeast. Because the effect of the AOX1 5′UTR on protein expression is not known, site-directed mutagenesis was performed in order to decrease or increase the length of this region. Both of these types of changes were shown to affect translational efficiency, not transcript stability. While increasing the length of the 5′UTR clearly decreased expression of a β-galactosidase reporter in a proportional manner, a deletion analysis demonstrated that the AOX1 5′UTR contains a complex mixture of both positive and negative cis-acting elements, suggesting that the construction of a synthetic 5′UTR optimized for a higher level of expression may be challenging. PMID:22285974

  12. Leukocyte common antigen-related phosphatase (LRP) gene structure: Conservation of the genomic organization of transmembrane protein tyrosine phosphatases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wong, E.C.C.; Mullersman, J.E.; Thomas, M.L.

    1993-07-01

    The leukocyte common antigen-related protein tyrosine phosphatase (LRP) is a widely expressed transmembrane glycoprotein thought to be involved in cell growth and differentiation. Similar to most other transmembrane protein tyrosine phosphatases, LRP contains two tandem cytoplasmic phosphatase domains. To understand further the regulation and evolution of LRP, the authors have isolated and characterized mouse [lambda] genomic clones. Thirteen genomic clones could be divided into two non-overlapping clusters. The first cluster contained the transcription initiation site and the exon encoding most of the 5[prime] untranslated region. The second cluster contained the remaining exons encoding the protein and the 3[prime] untranslated region.more » The gene consists of 22 exons spanning over 75 kb. The distance between exon 1 and exon 2 is at least 25 kb. Characterization of the 5[prime] ends of LRP mRNA by S1 nuclease protection identifies putative initiation start sites within a G/C-rich region. The upstream region does not contain a TATA box. Comparison of the LRP gene structure to the mammalian protein tyrosine phosphatase gene, CD45, shows striking similarities in size and genomic organization. 29 refs., 5 figs., 1 tab.« less

  13. In vitro mapping of Myotonic Dystrophy (DM) gene promoter

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Storbeck, C.J.; Sabourin, L.; Baird, S.

    1994-09-01

    The Myotonic Dystrophy Kinase (DMK) gene has been cloned and shared homology to serine/threonine protein kinases. Overexpression of this gene in stably transfected mouse myoblasts has been shown to inhibit fusion into myotubes while myoblasts stably transfected with an antisense construct show increased fusion potential. These experiments, along with data showing that the DM gene is highly expressed in muscle have highlighted the possibility of DMK being involved in myogenesis. The promoter region of the DM gene lacks a consensus TATA box and CAAT box, but harbours numerous transcription binding sites. Clones containing extended 5{prime} upstream sequences (UPS) of DMKmore » only weakly drive the reporter gene chloramphenicol acetyl transferase (CAT) when transfected into C2C12 mouse myoblasts. However, four E-boxes are present in the first intron of the DM gene and transient assays show increased expression of the CAT gene when the first intron is present downstream of these 5{prime} UPS in an orientation dependent manner. Comparison between mouse and human sequence reveals that the regions in the first intron where the E-boxes are located are highly conserved. The mapping of the promoter and the importance of the first intron in the control of DMK expression will be presented.« less

  14. What does it take to resolve relationships and to identify species with molecular markers? An example from the epiphytic Rhipsalideae (Cactaceae).

    PubMed

    Korotkova, Nadja; Borsch, Thomas; Quandt, Dietmar; Taylor, Nigel P; Müller, Kai F; Barthlott, Wilhelm

    2011-09-01

    The Cactaceae are a major New World plant family and popular in horticulture. Still, taxonomic units and species limits have been difficult to define, and molecular phylogenetic studies so far have yielded largely unresolved trees, so relationships within Cactaceae remain insufficiently understood. This study focuses on the predominantly epiphytic tribe Rhipsalideae and evaluates the utility of a spectrum of plastid genomic regions. • We present a phylogenetic study including 52 of the 53 Rhipsalideae species and all the infraspecific taxa. Seven regions (trnK intron, matK, rbcL, rps3-rpl16, rpl16 intron, psbA-trnH, trnQ-rps16), ca. 5600 nucleotides (nt) were sequenced per sample. The regions used were evaluated for their phylogenetic performance and performance in DNA-based species recognition based on operational taxonomic units (OTUs) defined beforehand. • The Rhipsalideae are monophyletic and contain five clades that correspond to the genera Rhipsalis, Lepismium, Schlumbergera, Hatiora, and Rhipsalidopsis. The species-level tree was well resolved and supported; the rpl16 and trnK introns yielded the best phylogenetic signal. Although the psbA-trnH and trnQ-rps16 spacers were the most successful individual regions for OTU identification, their success rate did not significantly exceed 70%. The highest OTU identification rate of 97% was found using the combination of psbA-trnH, rps3-rpl16, trnK intron, and trnQ-rps16 as a minimum possible marker length (ca. 1660 nt). • The phylogenetic performance of a marker is not determined by the level of sequence variability, and species discrimination power does not necessarily correlate with phylogenetic utility.

  15. The whole chloroplast genome of wild rice (Oryza australiensis).

    PubMed

    Wu, Zhiqiang; Ge, Song

    2016-01-01

    The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224  bp, exhibiting a typical circular structure including a pair of 25,776  bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212  bp and a small single-copy region (SSC) of 12,470  bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30t RNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.

  16. A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa.

    PubMed Central

    Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R

    2004-01-01

    A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563

  17. Development of Plant Gene Vectors for Tissue-Specific Expression Using GFP as a Reporter Gene

    NASA Technical Reports Server (NTRS)

    Jackson, Jacquelyn; Egnin, Marceline; Xue, Qi-Han; Prakash, C. S.

    1997-01-01

    Reporter genes are widely employed in plant molecular biology research to analyze gene expression and to identify promoters. Gus (UidA) is currently the most popular reporter gene but its detection requires a destructive assay. The use of jellyfish green fluorescent protein (GFP) gene from Aequorea Victoria holds promise for noninvasive detection of in vivo gene expression. To study how various plant promoters are expressed in sweet potato (Ipomoea batatas), we are transcriptionally fusing the intron-modified (mGFP) or synthetic (modified for codon-usage) GFP coding regions to these promoters: double cauliflower mosaic virus 35S (CaMV 35S) with AMV translational enhancer, ubiquitin7-intron-ubiquitin coding region (ubi7-intron-UQ) and sporaminA. A few of these vectors have been constructed and introduced into E. coli DH5a and Agrobacterium tumefaciens EHA105. Transient expression studies are underway using protoplast-electroporation and particle bombardment of leaf tissues.

  18. The helicase Ded1p controls use of near-cognate translation initiation codons in 5' UTRs.

    PubMed

    Guenther, Ulf-Peter; Weinberg, David E; Zubradt, Meghan M; Tedeschi, Frank A; Stawicki, Brittany N; Zagore, Leah L; Brar, Gloria A; Licatalosi, Donny D; Bartel, David P; Weissman, Jonathan S; Jankowsky, Eckhard

    2018-06-27

    The conserved and essential DEAD-box RNA helicase Ded1p from yeast and its mammalian orthologue DDX3 are critical for the initiation of translation 1 . Mutations in DDX3 are linked to tumorigenesis 2-4 and intellectual disability 5 , and the enzyme is targeted by a range of viruses 6 . How Ded1p and its orthologues engage RNAs during the initiation of translation is unknown. Here we show, by integrating transcriptome-wide analyses of translation, RNA structure and Ded1p-RNA binding, that the effects of Ded1p on the initiation of translation are connected to near-cognate initiation codons in 5' untranslated regions. Ded1p associates with the translation pre-initiation complex at the mRNA entry channel and repressing the activity of Ded1p leads to the accumulation of RNA structure in 5' untranslated regions, the initiation of translation from near-cognate start codons immediately upstream of these structures and decreased protein synthesis from the corresponding main open reading frames. The data reveal a program for the regulation of translation that links Ded1p, the activation of near-cognate start codons and mRNA structure. This program has a role in meiosis, in which a marked decrease in the levels of Ded1p is accompanied by the activation of the alternative translation initiation sites that are seen when the activity of Ded1p is repressed. Our observations indicate that Ded1p affects translation initiation by controlling the use of near-cognate initiation codons that are proximal to mRNA structure in 5' untranslated regions.

  19. Regulation of the mRNA-binding protein AUF1 by activation of the beta-adrenergic receptor signal transduction pathway.

    PubMed

    Pende, A; Tremmel, K D; DeMaria, C T; Blaxall, B C; Minobe, W A; Sherman, J A; Bisognano, J D; Bristow, M R; Brewer, G; Port, J

    1996-04-05

    In both cell culture based model systems and in the failing human heart, beta-adrenergic receptors ( beta-AR) undergo agonist-mediated down-regulation. This decrease correlates closely with down-regulation of its mRNA, an effect regulated in part by changes in mRNA stability. Regulation of mRNA stability has been associated with mRNA-binding proteins that recognize A + U-rich elements within the 3'-untranslated regions of many mRNAs encoding proto-oncogene and cytokine mRNAs. We demonstrate here that the mRNA-binding protein, AUF1, is present in both human heart and in hamster DDT1-MF2 smooth muscle cells and that its abundance is regulated by beta-AR agonist stimulation. In human heart, AUF1 mRNA and protein was significantly increased in individuals with myocardial failure, a condition associated with increases in the beta-adrenergic receptor agonist norepinephrine. In the same hearts, there was a significant decrease (approximately 50%) in the abundance of beta1-AR mRNA and protein. In DDT1-MF2 cells, where agonist-mediated destabilization of beta2-AR mRNA was first described, exposure to beta-AR agonist resulted in a significant increase in AUF1 mRNA and protein (approximately 100%). Conversely, agonist exposure significantly decreased (approximately 40%) beta2-adrenergic receptor mRNA abundance. Last, we demonstrate that AUF1 can be immunoprecipitated from polysome-derived proteins following UV cross-linking to the 3'-untranslated region of the human beta1-AR mRNA and that purified, recombinant p37AUF1 protein also binds to beta1-AR 3'-untranslated region mRNA.

  20. Cloning and characterization of an abalone (Haliotis discus hannai) actin gene

    NASA Astrophysics Data System (ADS)

    Ma, Hongming; Xu, Wei; Mai, Kangsen; Liufu, Zhiguo; Chen, Hong

    2004-10-01

    An actin encoding gene was cloned by using RT-PCR, 3‧ RACE and 5‧ RACE from abalone Haliotis discus hannai. The full length of the gene is 1532 base pairs, which contains a long 3‧ untranslated region of 307 base pairs and 79 base pairs of 5‧ untranslated sequence. The open reading frame encodes 376 amino acid residues. Sequence comparison with those of human and other mollusks showed high conservation among species at amino acid level. The identities was 96%, 97% and 96% respectively compared with Aplysia californica, Biomphalaria glabrata and Homo sapience β-actin. It is also indicated that this actin is more similar to the human cytoplasmic actin (β-actin) than to human muscle actin.

  1. Nucleotide sequence of the COX1 gene in Kluyveromyces lactis mitochondrial DNA: evidence for recent horizontal transfer of a group II intron.

    PubMed

    Hardy, C M; Clark-Walker, G D

    1991-07-01

    The cytochrome oxidase subunit 1 gene (COX1) in K. lactis K8 mtDNA spans 8,826 bp and contains five exons (termed E1-E5) totalling 1,602 bp that show 88% nucleotide base matching and 91% amino acid homology to the equivalent gene in S. cerevisiae. The four introns (termed K1 cox1.1-1.4) contain open reading frames encoding proteins of 786, 333, 319 and 395 amino acids respectively that potentially encode maturase enzymes. The first intron belongs to group II whereas the remaining three are group I type B. Introns K1 cox1.1, 1.3, and 1.4 are found at identical locations to introns Sc cox1.2, 1.5 a, and 1.5 b respectively from S. cerevisiae. Horizontal transfer of an intron between recent progenitors of K. lactis and S. cerevisiae is suggested by the observation that K1 cox1.1 and Sc cox1.2 show 96% base matching. Sequence comparisons between K1 cox1.3/Sc cox1.5 a and K1 cox1.4/Sc cox1.5 b suggest that these introns are likely to have been present in the ancestral COX1 gene of these yeasts. Intron K1 cox1.2 is not found in S. cerevisiae and appears at an unique location in K. lactis. A feature of the DNA sequences of the group I introns K1 cox1.2, 1.3, and 1.4 is the presence of 11 GC-rich clusters inserted into both coding and noncoding regions. Immediately downstream of the COX1 gene is the ATPase subunit 8 gene (A8) that shows 82.6% base matching to its counterpart in S. cerevisiae mtDNA.

  2. Chloroplast genome expansion by intron multiplication in the basal psychrophilic euglenoid Eutreptiella pomquetensis

    PubMed Central

    Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika

    2017-01-01

    Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596

  3. The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.

    PubMed

    Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye

    2016-07-01

    The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, has been studied in this article. The genome size is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) are separated by a large single copy region (LSC, 82,459 bp) and a small single-copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. In these genes, 15 genes contained 1 intron and 3 genes comprised of 2 introns.

  4. The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

    PubMed

    Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

    2016-05-01

    The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.

  5. COL1A1 transgene expression in stably transfected osteoblastic cells. Relative contributions of first intron, 3'-flanking sequences, and sequences derived from the body of the human COL1A1 minigene

    NASA Technical Reports Server (NTRS)

    Breault, D. T.; Lichtler, A. C.; Rowe, D. W.

    1997-01-01

    Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.

  6. Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes

    PubMed Central

    López-Garriga, Juan; Cadilla, Carmen L.

    2016-01-01

    The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233

  7. Identification of a deep intronic mutation in the COL6A2 gene by a novel custom oligonucleotide CGH array designed to explore allelic and genetic heterogeneity in collagen VI-related myopathies

    PubMed Central

    2010-01-01

    Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629

  8. Structural analysis of the 5{prime} region of mouse and human Huntington disease genes reveals conservation of putative promoter region and Di- and trinucleotide polymorphisms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lin, Biaoyang; Nasir, J.; Kalchman, M.A.

    1995-02-10

    We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less

  9. Distinct polyadenylation landscapes of diverse human tissues revealed by a modified PA-seq strategy

    PubMed Central

    2013-01-01

    Background Polyadenylation is a key regulatory step in eukaryotic gene expression and one of the major contributors of transcriptome diversity. Aberrant polyadenylation often associates with expression defects and leads to human diseases. Results To better understand global polyadenylation regulation, we have developed a polyadenylation sequencing (PA-seq) approach. By profiling polyadenylation events in 13 human tissues, we found that alternative cleavage and polyadenylation (APA) is prevalent in both protein-coding and noncoding genes. In addition, APA usage, similar to gene expression profiling, exhibits tissue-specific signatures and is sufficient for determining tissue origin. A 3′ untranslated region shortening index (USI) was further developed for genes with tandem APA sites. Strikingly, the results showed that different tissues exhibit distinct patterns of shortening and/or lengthening of 3′ untranslated regions, suggesting the intimate involvement of APA in establishing tissue or cell identity. Conclusions This study provides a comprehensive resource to uncover regulated polyadenylation events in human tissues and to characterize the underlying regulatory mechanism. PMID:24025092

  10. The zebrafish dorsal axis is apparent at the four-cell stage.

    PubMed

    Gore, Aniket V; Maegawa, Shingo; Cheong, Albert; Gilligan, Patrick C; Weinberg, Eric S; Sampath, Karuna

    2005-12-15

    A central question in the development of multicellular organisms pertains to the timing and mechanisms of specification of the embryonic axes. In many organisms, specification of the dorsoventral axis requires signalling by proteins of the Transforming growth factor-beta and Wnt families. Here we show that maternal transcripts of the zebrafish Nodal-related morphogen, Squint (Sqt), can localize to two blastomeres at the four-cell stage and predict the dorsal axis. Removal of cells containing sqt transcripts from four-to-eight-cell embryos or injection of antisense morpholino oligonucleotides targeting sqt into oocytes can cause a loss of dorsal structures. Localization of sqt transcripts is independent of maternal Wnt pathway function and requires a highly conserved sequence in the 3' untranslated region. Thus, the dorsoventral axis is apparent by early cleavage stages and may require the maternally encoded morphogen Sqt and its associated factors. Because the 3' untranslated region of the human nodal gene can also localize exogenous sequences to dorsal cells, this mechanism may be evolutionarily conserved.

  11. Polymorphisms in the canine monoamine oxidase a (MAOA) gene: identification and variation among five broad dog breed groups.

    PubMed

    Sacco, James; Ruplin, Andrew; Skonieczny, Paul; Ohman, Michael

    2017-01-01

    In humans, reduced activity of the enzyme monoamine oxidase type A (MAOA) due to genetic polymorphisms within the MAOA gene leads to increased brain neurotransmitter levels associated with aggression. In order to study MAOA genetic diversity in dogs, we designed a preliminary study whose objectives were to identify novel alleles in functionally important regions of the canine MAOA gene, and to investigate whether the frequencies of these polymorphisms varied between five broad breed groups (ancient, herding, mastiff, modern European, and mountain). Fifty dogs representing these five breed groups were sequenced. A total of eleven polymorphisms were found. Seven were single nucleotide polymorphisms (SNPs; two exonic, two intronic and three in the promoter), while four were repeat intronic variations. The most polymorphic loci were repeat regions in introns 1, 2 (7 alleles) and 10 (3 alleles), while the exonic and the promoter regions were highly conserved. Comparison of the allele frequencies of certain microsatellite polymorphisms among the breed groups indicated a decreasing or increasing trend in the number of repeats at different microsatellite loci, as well as the highest genetic diversity for the ancient breeds and the lowest for the most recent mountain breeds, perhaps attributable to canine domestication and recent breed formation. While a specific promoter SNP (-212A > G) is rare in the dog, it is the major allele in wolves. Replacement of this ancestral allele in domestic dogs may lead to the deletion of heat shock factor binding sites on the MAOA promoter. Dogs exhibit significant variation in certain intronic regions of the MAOA gene, while the coding and promoter regions are well-conserved. Distinct genetic differences were observed between breed groups. Further studies are now required to establish whether such polymorphisms are associated in any way with MAOA level and canine behaviour including aggression.

  12. HFE gene polymorphism defined by sequence-based typing of the Brazilian population and a standardized nomenclature for HFE allele sequences.

    PubMed

    Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A

    2017-10-01

    The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  13. Base pairing between the 3' exon and an internal guide sequence increases 3' splice site specificity in the Tetrahymena self-splicing rRNA intron.

    PubMed Central

    Suh, E R; Waring, R B

    1990-01-01

    It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465

  14. Familial early-onset dementia with tau intron 10 + 16 mutation with clinical features similar to those of Alzheimer disease.

    PubMed

    Doran, Mark; du Plessis, Daniel G; Ghadiali, Eric J; Mann, David M A; Pickering-Brown, Stuart; Larner, Andrew J

    2007-10-01

    Frontotemporal dementia with parkinsonism linked to chromosome 17 (FTDP-17) owing to the tau intron 10 + 16 mutation usually occurs with a prototypical frontotemporal dementia phenotype with prominent disinhibition and affective disturbances. To report a new FTDP-17 pedigree with the tau intron 10 + 16 mutation demonstrating a clinical phenotype suggestive of Alzheimer disease. Case reports. Regional neuroscience centers in northwest England. Patients We examined 4 members of a kindred in which 8 individuals were affected in 3 generations. All 4 patients reported memory difficulty. Marked anomia was also present, but behavioral disturbances were conspicuously absent in the early stages of disease. All patients had an initial clinical diagnosis of Alzheimer disease. No mutations were found in the presenilin or amyloid precursor protein genes. Pathologic examination of the proband showed features typical of FTDP-17, and tau gene analysis showed the intron 10 + 16 mutation. This pedigree illustrates the phenotypic variability of tau intron 10 + 16 mutations. In pedigrees with a clinical diagnosis of Alzheimer disease but without presenilin or amyloid precursor protein gene mutations, tau gene mutations may be found.

  15. Efficiency of introns from various origins in fish cells.

    PubMed

    Bétancourt, O H; Attal, J; Théron, M C; Puissant, C; Houdebine, L M

    1993-06-01

    Several vectors containing (1) regulatory regions from Rous sarcoma virus (RSV), human cytomegalovirus (CMV), and herpes simplex thymidine kinase (TK); (2) introns from early or late SV40 genes and from trout growth hormone gene (tGH); (3) chloramphenicol acetyltransferase gene (CAT); and (4) transcription terminators from SV40 were transfected into carp EPC cells, salmon CHSE cells, tilapia TO2 cells, quail QT6 cells, and hamster CHO cells. CAT activity was measured in extracts from several cell lines 3 days after transfection and in the fish EPC stable clones. The CMV and RSV promoters were the most potent in all cell types. The intron from late SV40 genes (VP1 intron) worked properly in QT6 and CHO cells but not in EPC and very weakly in TO2 cells. The tGH intron was efficient in all cell types but preferentially in fish cells. The small t intron from SV40 was processed in all cell types. The small t and, to a lesser extent, the tGH introns amplified expression of cat gene in stable clones, in comparison to the transiently transfected cells. These results indicate that elements from mammalian genes may not be properly recognized by the fish cellular machinery and in an unpredictable manner. This finding suggests that vectors prepared to express foreign genes in transfected cultured fish cells and transgenic fish should preferably contain DNA sequences from fish genes or, alternatively, those sequences from mammalian genes that have been previously proved to be compatible with the fish cellular machinery.

  16. Structure, organization and expression of common carp (Cyprinus carpio L.) SLP-76 gene.

    PubMed

    Huang, Rong; Sun, Xiao-Feng; Hu, Wei; Wang, Ya-Ping; Guo, Qiong-Lin

    2008-05-01

    SLP-76 is an important member of the SLP-76 family of adapters, and it plays a key role in TCR signaling and T cell function. Partial cDNA sequence of SLP-76 of common carp (Cyprinus carpio L.) was isolated from thymus cDNA library by the method of suppression subtractive hybridization (SSH). Subsequently, the full length cDNA of carp SLP-76 was obtained by means of 3' RACE and 5' RACE, respectively. The full length cDNA of carp SLP-76 was 2007 bp, consisting of a 5'-terminal untranslated region (UTR) of 285 bp, a 3'-terminal UTR of 240 bp, and an open reading frame of 1482 bp. Sequence comparison showed that the deduced amino acid sequence of carp SLP-76 had an overall similarity of 34-73% to that of other species homologues, and it was composed of an NH2-terminal domain, a central proline-rich domain, and a C-terminal SH2 domain. Amino acid sequence analysis indicated the existence of a Gads binding site R-X-X-K, a 10-aa-long sequence which binds to the SH3 domain of LCK in vitro, and three conserved tyrosine-containing sequence in the NH2-terminal domain. Then we used PCR to obtain a genomic DNA which covers the entire coding region of carp SLP-76. In the 9.2k-long genomic sequence, twenty one exons and twenty introns were identified. RT-PCR results showed that carp SLP-76 was expressed predominantly in hematopoietic tissues, and was upregulated in thymus tissue of four-month carp compared to one-year old carp. RT-PCR and virtual northern hybridization results showed that carp SLP-76 was also upregulated in thymus tissue of GH transgenic carp at the age of four-months. These results suggest that the expression level of SLP-76 gene may be related to thymocyte development in teleosts.

  17. SC*994C>T causes the Sc(null) phenotype in Pacific Islanders and successful transfusion of Sc3+ blood to a patient with anti-Sc3.

    PubMed

    Reid, Marion E; Hue-Roye, Kim; Velliquette, Randall W; Larimore, Kathleen; Moscarelli, Sue; Ohswaldt, Nicolas; Lomas-Francis, Christine

    2013-01-01

    Antigens in the SC blood group system are expressed by the human erythrocyte membrane-associated protein (ERMAP).Two molecular bases have been reported for the Sc,un phenotype:SC*307del2 and SC*994C>T. We report our investigation of the molecular background of five Sc,n1 individuals from the Pacific Islands and describe the successful transfusion of Sc3+ blood to a patient with anti-Sc3 in her plasma. SC (ERMAP) exons 2,3, and 12 and their flanking intronic regions were analyzed. TheSC*994C>T change introduces a restriction enzyme cleavage site for Tsp45I, and polymerase chain reaction (PCR) products from exon 12 were subjected to this PCR-restriction fragment length polymorphism (RFLP) assay. The five samples had the variant SC*994T/T. One sample, from a first cousin of one Marshallese proband, was heterozygous for SC*1514C/T (in the 3' untranslated region); the other four samples were SC*1514C/C(consensus sequence). Samples from white donors (n = 100) and African American donors (n = 99) were tested using the Tsp45IPCR-RFLP assay; all gave a banding pattern that was consistent with the SC*994C/C consensus sequence. In all five samples,our analyses showed homozygosity for the nonsense nucleotide change SC*994C>Tin an allele carrying the nucleotide associated with SLd. Further investigation determined that one of the probands reported previously with the SC*994C>T change was from the Marshall Islands (which form part of the Micronesian Pacific Islands) and the other was from an unspecified location within the large collection of Pacific Islands. Taken together, the five known probands with the SC*994C>T silencing nucleotide change were from the Pacific Islands.

  18. Biallelic expression of the L-arginine:glycine amidinotransferase gene with different methylation status between male and female primordial germ cells in chickens.

    PubMed

    Jang, H J; Lee, M O; Kim, S; Kim, T H; Kim, S K; Song, G; Womack, J E; Han, J Y

    2013-03-01

    The basic functions of DNA methylation include in gene silencing by methylation of specific gene promoters, defense of the host genome from retrovirus, and transcriptional suppression of transgenes. In addition, genomic imprinting, by which certain genes are expressed in a parent-of-origin-specific manner, has been observed in a wide range of plants and animals and has been associated with differential methylation. However, imprinting phenomena of DNA methylation effects have not been revealed in chickens. To analyze whether genomic imprinting occurs in chickens, methyl-DNA immunoprecipitation array analysis was applied across the entire genome of germ cells in early chick embryos. A differentially methylated region (DMR) was detected in the eighth intron of the l-arginine:glycine amidinotransferase (GATM) gene. When the DMR in GATM was analyzed by bisulfite sequencing, the methylation in male primordial germ cells (PGC) of 6-d-old embryos was higher than that in female PGC (57.5 vs. 35.0%). At 8 d, the DMR methylation of GATM in male PGC was 3.7-fold higher than that in female PGC (65.0 vs. 17.5%). Subsequently, to investigate mono- or biallelic expression of the GATM gene during embryo development, we found 2 indel sequences (GTTTAATGC and CAAAAA) within the GATM 3'-untranslated region in Korean Oge (KO) and White Leghorn (WL) chickens. When individual WL and KO chickens were genotyped for indel sequences, 3 allele combinations (homozygous insertion, homozygous deletion, and heterozygotes) were detected in both breeds using a gel shift assay and high-resolution melt assay. The deletion allele was predominant in KO, whereas the insertion allele was predominant in WL. Heterozygous animals were evenly distributed in both breeds (P < 0.01). Despite the different methylation status between male and female PGC, the GATM gene conclusively displayed biallelic expression in PGC as well as somatic embryonic, extraembryonic, and adult chicken tissues.

  19. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences.

    PubMed

    Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

    2017-01-26

    The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.

  20. Crystal structure of group II intron domain 1 reveals a template for RNA assembly

    DOE PAGES

    Zhao, Chen; Rajashankar, Kanagalaghatta R.; Marcia, Marco; ...

    2015-10-26

    Although the importance of large noncoding RNAs is increasingly appreciated, our understanding of their structures and architectural dynamics remains limited. In particular, we know little about RNA folding intermediates and how they facilitate the productive assembly of RNA tertiary structures. In this paper, we report the crystal structure of an obligate intermediate that is required during the earliest stages of group II intron folding. Composed of domain 1 from the Oceanobacillus iheyensis group II intron (266 nucleotides), this intermediate retains native-like features but adopts a compact conformation in which the active site cleft is closed. Transition between this closed andmore » the open (native) conformation is achieved through discrete rotations of hinge motifs in two regions of the molecule. Finally, the open state is then stabilized by sequential docking of downstream intron domains, suggesting a 'first come, first folded' strategy that may represent a generalizable pathway for assembly of large RNA and ribonucleoprotein structures.« less

  1. Mutant tristetraprolin: a potent inhibitor of malignant glioma cell growth

    USDA-ARS?s Scientific Manuscript database

    Malignant gliomas rely on the production of certain critical growth factors including VEGF, interleukin (IL)-6 and IL-8, to fuel rapid tumor growth, angiogenesis, and treatment resistance. Post-transcriptional regulation through adenine and uridine-rich elements of the 3' untranslated region is one ...

  2. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    PubMed

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.

  3. The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

    PubMed

    Choi, Kyoung Su; Park, SeonJoo

    2016-09-01

    The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.

  4. Population genetic structure and conservation of marbled murrelets (Brachyramphus marmoratus)

    USGS Publications Warehouse

    Friesen, Vicki L.; Birt, T.P.; Piatt, John F.; Golightly, R.T.; Newman, S.H.; Hebert, P.N.; Congdon, B.C.; Gissing, G.

    2005-01-01

    Marbled murrelets (Brachyramphus marmoratus) are coastal seabirds that nest from California to the Aleutian Islands. They are declining and considered threatened in several regions. We compared variation in the mitochondrial control region, four nuclear introns and three microsatellite loci among 194 murrelets from throughout their range except Washington and Oregon. Significant population genetic structure was found: nine private control region haplotypes and three private intron alleles occurred at high frequency in the Aleutians and California; global estimates of FST or ??ST and most pairwise estimates involving the Aleutians and/or California were significant; and marked isolation-by-distance was found. Given the available samples, murrelets appear to comprise five genetic management units: (1) western Aleutian Islands, (2) central Aleutian Islands, (3) mainland Alaska and British Columbia, (4) northern California, and (5) central California.

  5. Complete Genome Sequence of the Circulatory Foot-and-Mouth Disease Virus Serotype Asia1 in Bangladesh

    PubMed Central

    Ali, M. Rahmat; Alam, A. S. M. Rubayet Ul; Amin, M. Al; Ullah, Huzzat; Siddique, Mohammad Anwar; Momtaz, Samina; Sultana, Munawar

    2017-01-01

    ABSTRACT The complete genome sequence of foot-and-mouth disease virus (FMDV) serotype Asia1 isolated from Bangladesh is reported here. Genome analysis revealed amino acid substitutions in the VP1 antigenic region and deletions in both the 5′ and 3′ untranslated regions (UTRs) compared to the genome of the existing vaccine strain (GenBank accession no. AY304994). PMID:29074654

  6. The genetic architecture of 3'untranslated region of the MICA gene: polymorphisms and haplotypes.

    PubMed

    Luo, Jia; Tian, Wei; Liu, Xue Xiang; Yu, JunLong; Li, LiXin; Pan, FengHua

    2013-10-01

    In this study, the 3'untranslated region (3'UTR) of MHC class I chain-related gene A (MICA) were investigated in 104 healthy, unrelated Han individuals recruited from northern China, using PCR-sequencing method. Nine polymorphic sites were detected, which were in very strong linkage disequilibrium with each other .Seven different MICA 3'UTR alleles were identified, among which UTR1 predominated (0.6971),followed by UTR2 (0.2356). Twenty-one extended haplotypes incorporating the 3'UTR and MICA exons 2-5 were observed in this population. Phylogenetic analysis revealed the existence of two MICA lineages, each with multiple subsets. The 2 lineages were primarily linked to UTR1 and UTR2 in the 3'UTR, respectively. Ewens-Watterson homozygosity statistics at MICA coding and 3'UTR regions were consistent with neutral expectations. Our data provided for the first time the data of genetic variation in the 3'UTR of MICA gene in human populations. The findings are valuable for future studies of the mechanisms underlying MICA post-transcriptional regulation, and will inform studies of evolution of the MHC gene complex. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  7. Reduced mutation rate in exons due to differential mismatch repair

    PubMed Central

    Mularoni, Loris; Muiños, Ferran; Gonzalez-Perez, Abel; López-Bigas, Núria

    2017-01-01

    While recent studies have revealed higher than anticipated heterogeneity of mutation rate across genomic regions, mutations in exons and introns are assumed to be generated at the same rate. Here we find fewer somatic mutations in exons than expected based on their sequence content, and demonstrate that this is not due to purifying selection. Moreover, we show that it is caused by higher mismatch repair activity in exonic than in intronic regions. Our findings have important implications for our understanding of mutational and DNA repair processes, our knowledge of the evolution of eukaryotic genes, and practical ramifications for the study of the evolution of both tumors and species. PMID:29106418

  8. Brief Note Low diversity of the major histocompatibility complex class II DRA gene in domestic goats (Capra hircus) in Southern China.

    PubMed

    Chen, L P; E, G X; Zhao, Y J; Na, R S; Zhao, Z Q; Zhang, J H; Ma, Y H; Sun, Y W; Zhong, T; Zhang, H P; Huang, Y F

    2015-06-18

    DRA encodes the alpha chain of the DR heterodimer, is closely linked to DRB and is considered almost monomorphic in major histocompatibility complex region. In this study, we identified the exon 2 of DRA to evaluate the immunogenetic diversity of Chinese south indigenous goat. Two single nucleotide polymorphisms in an untranslated region and one synonymous substitution in coding region were identified. These data suggest that high immunodiversity in native Chinese population.

  9. Carotenogenesis Is Regulated by 5′UTR-Mediated Translation of Phytoene Synthase Splice Variants1[OPEN

    PubMed Central

    Voß, Björn; Maass, Dirk; Beyer, Peter

    2016-01-01

    Phytoene synthase (PSY) catalyzes the highly regulated, frequently rate-limiting synthesis of the first biosynthetically formed carotene. While PSY constitutes a small gene family in most plant taxa, the Brassicaceae, including Arabidopsis (Arabidopsis thaliana), predominantly possess a single PSY gene. This monogenic situation is compensated by the differential expression of two alternative splice variants (ASV), which differ in length and in the exon/intron retention of their 5′UTRs. ASV1 contains a long 5′UTR (untranslated region) and is involved in developmentally regulated carotenoid formation, such as during deetiolation. ASV2 contains a short 5′UTR and is preferentially induced when an immediate increase in the carotenoid pathway flux is required, such as under salt stress or upon sudden light intensity changes. We show that the long 5′UTR of ASV1 is capable of attenuating the translational activity in response to high carotenoid pathway fluxes. This function resides in a defined 5′UTR stretch with two predicted interconvertible RNA conformations, as known from riboswitches, which might act as a flux sensor. The translation-inhibitory structure is absent from the short 5′UTR of ASV2 allowing to bypass translational inhibition under conditions requiring rapidly increased pathway fluxes. The mechanism is not found in the rice (Oryza sativa) PSY1 5′UTR, consistent with the prevalence of transcriptional control mechanisms in taxa with multiple PSY genes. The translational control mechanism identified is interpreted in terms of flux adjustments needed in response to retrograde signals stemming from intermediates of the plastid-localized carotenoid biosynthesis pathway. PMID:27729470

  10. Alternative exon definition events control the choice between nuclear retention and cytoplasmic export of U11/U12-65K mRNA

    PubMed Central

    Verbeeren, Jens; Verma, Bhupendra

    2017-01-01

    Cellular homeostasis of the minor spliceosome is regulated by a negative feed-back loop that targets U11-48K and U11/U12-65K mRNAs encoding essential components of the U12-type intron-specific U11/U12 di-snRNP. This involves interaction of the U11 snRNP with an evolutionarily conserved splicing enhancer giving rise to unproductive mRNA isoforms. In the case of U11/U12-65K, this mechanism controls the length of the 3′ untranslated region (3′UTR). We show that this process is dynamically regulated in developing neurons and some other cell types, and involves a binary switch between translation-competent mRNAs with a short 3′UTR to non-productive isoforms with a long 3′UTR that are retained in the nucleus or/and spliced to the downstream amylase locus. Importantly, the choice between these alternatives is determined by alternative terminal exon definition events regulated by conserved U12- and U2-type 5′ splice sites as well as sequence signals used for pre-mRNA cleavage and polyadenylation. We additionally show that U11 snRNP binding to the U11/U12-65K mRNA species with a long 3′UTR is required for their nuclear retention. Together, our studies uncover an intricate molecular circuitry regulating the abundance of a key spliceosomal protein and shed new light on the mechanisms limiting the export of non-productively spliced mRNAs from the nucleus to the cytoplasm. PMID:28549066

  11. Detection of canonical A-to-G editing events at 3′ UTRs and microRNA target sites in human lungs using next-generation sequencing

    PubMed Central

    Soundararajan, Ramani; Stearns, Timothy M.; Griswold, Anthony J.; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F.; King, Benjamin L.; Kolliputi, Narasaiah

    2015-01-01

    RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, and with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats, and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3′ untranslated regions (UTRs) or introns close to splice sites; whereas, only few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3′ UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated a RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states. PMID:26486088

  12. Detection of canonical A-to-G editing events at 3' UTRs and microRNA target sites in human lungs using next-generation sequencing.

    PubMed

    Soundararajan, Ramani; Stearns, Timothy M; Griswold, Anthony L; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F; King, Benjamin L; Kolliputi, Narasaiah

    2015-11-03

    RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, and with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats, and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3' untranslated regions (UTRs) or introns close to splice sites; whereas, only few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3' UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated a RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states.

  13. Species-Specific Exon Loss in Human Transcriptomes

    PubMed Central

    Wang, Jinkai; Lu, Zhi-xiang; Tokheim, Collin J.; Miller, Sara E.; Xing, Yi

    2015-01-01

    Changes in exon–intron structures and splicing patterns represent an important mechanism for the evolution of gene functions and species-specific regulatory networks. Although exon creation is widespread during primate and human evolution and has been studied extensively, much less is known about the scope and potential impact of human-specific exon loss events. Historically, transcriptome data and exon annotations are significantly biased toward humans over nonhuman primates. This ascertainment bias makes it challenging to discover human-specific exon loss events. We carried out a transcriptome-wide search of human-specific exon loss events, by taking advantage of RNA sequencing (RNA-seq) as a powerful and unbiased tool for exon discovery and annotation. Using RNA-seq data of humans, chimpanzees, and other primates, we reconstructed and compared transcript structures across the primate phylogeny. We discovered 33 candidate human-specific exon loss events, among which six exons passed stringent experimental filters for the complete loss of splicing activities in diverse human tissues. These events may result from human-specific deletion of genomic DNA, or small-scale sequence changes that inactivated splicing signals. The impact of human-specific exon loss events is predominantly regulatory. Three of the six events occurred in the 5′ untranslated region (5′-UTR) and affected cis-regulatory elements of mRNA translation. In SLC7A6, a gene encoding an amino acid transporter, luciferase reporter assays suggested that both a human-specific exon loss event and an independent human-specific single nucleotide substitution in the 5′-UTR increased mRNA translational efficiency. Our study provides novel insights into the molecular mechanisms and evolutionary consequences of exon loss during human evolution. PMID:25398629

  14. Genome Modification Leads to Phenotype Reversal in Human Myotonic Dystrophy type 1 iPS-cell Derived Neural Stem Cells

    PubMed Central

    Xia, Guangbin; Gao, Yuanzheng; Jin, Shouguang; Subramony, SH.; Terada, Naohiro; Ranum, Laura P.W.; Swanson, Maurice S.; Ashizawa, Tetsuo

    2015-01-01

    Objective Myotonic dystrophy type 1 (DM1) is caused by expanded CTG repeats in the 3'-untranslated region (3’ UTR) of the DMPK gene. Correcting the mutation in DM1 stem cells would be an important step towards autologous stem cell therapy. The objective of this study is to demonstrate in vitro genome editing to prevent production of toxic mutant transcripts and reverse phenotypes in DM1 stem cells. Methods Genome editing was performed in DM1 neural stem cells (NSCs) derived from human DM1 iPS cells. An editing cassette containing SV40/bGH polyA signals was integrated upstream of the CTG repeats by TALEN-mediated homologous recombination (HR). The expression of mutant CUG repeats transcript was monitored by nuclear RNA foci, the molecular hallmarks of DM1, using RNA fluorescence in situ hybridization (RNA-FISH). Alternative splicing of microtubule-associated protein tau (MAPT) and muscleblind-like (MBNL) proteins were analyzed to further monitor the phenotype reversal after genome modification. Results The cassette was successfully inserted into DMPK intron 9 and this genomic modification led to complete disappearance of nuclear RNA foci. MAPT and MBNL 1, 2 aberrant splicing in DM1 NSCs was reversed to normal pattern in genome-modified NSCs. Interpretation Genome modification by integration of exogenous polyA signals upstream of the DMPK CTG repeat expansion prevents the production of toxic RNA and leads to phenotype reversal in human DM1 iPS-cells derived stem cells. Our data provide proof-of-principle evidence that genome modification may be used to generate genetically modified progenitor cells as a first step toward autologous cell transfer therapy for DM1. PMID:25702800

  15. Phenotypic and genotypic characterization of four factor VII deficiency patients from central China.

    PubMed

    Liu, Hui; Wang, Hua-Fang; Cheng, Zhi-peng; Wang, Qing-yun; Hu, Bei; Zeng, Wei; Wu, Ying-ying; Guo, Tao; Tang, Liang; Hu, Yu

    2015-06-01

    Hereditary coagulation factor VII deficiency (FVIID) is a rare autosomal, recessive inherited hemorrhagic disorder related to a variety of mutations or polymorphisms throughout the factor VII (FVII) gene (F7). The aims of this study were to characterize the molecular defect of the F7 gene in four unrelated patients with FVIID and to find the genotype-phenotype correlation. All nine exons, exon-intron boundaries, and 5' and 3'-untranslated regions of the F7 gene were amplified by PCR and the purified PCR products were sequenced directly. Suspected mutations were confirmed by another PCR and sequencing of the opposite strand. Family studies were also performed. A total of five unique lesions were identified, including three missense mutations (c.384A>G, c.839A>C, c.1163T>G, predicting p.Tyr128Cys, p.Glu280Ala and p.Phe388Cys substitution, respectively) and two splice junction mutations (c.572-1G>A, c.681+1G>T), among which two (p.Glu280Ala, p.Phe388Cys) were novel. A previously reported mutation p.Tyr128Cys was seen in the homozygous state in two unrelated patients. The other two cases were both compound heterozygotes of a missense mutation and a splicing site mutation. Multiple sequence alignment using DNAMAN analysis showed that all the missense mutations were found in residues that highly conserved across species and vitamin K-dependent serine proteases. Online software Polyphen and SIFT were used to confirm the pathogenic of the missense mutation. p.Tyr128Cys seems to be a hotspot of the F7 gene in ethnic Han Chinese population.

  16. Role of the DLGAP2 Gene Encoding the SAP90/PSD-95-Associated Protein 2 in Schizophrenia

    PubMed Central

    Li, Jun-Ming; Lu, Chao-Lin; Cheng, Min-Chih; Luu, Sy-Ueng; Hsu, Shih-Hsin; Hu, Tsung-Ming; Tsai, Hsin-Yao; Chen, Chia-Hsiang

    2014-01-01

    Aberrant synaptic dysfunction is implicated in the pathogenesis of schizophrenia. The DLGAP2 gene encoding the SAP90/PSD-95-associated protein 2 (SAPAP2) located at the post-synaptic density of neuronal cells is involved in the neuronal synaptic function. This study aimed to investigate whether the DLGAP2 gene is associated with schizophrenia. We resequenced the putative promoter region and all the exons of the DLGAP2 gene in 523 patients with schizophrenia and 596 non-psychotic controls from Taiwan and conducted a case-control association analysis. We identified 19 known SNPs in this sample. Association analysis of 9 SNPs with minor allele frequency greater than 5% showed no association with schizophrenia. However, we found a haplotype (CCACCAACT) significantly associated with schizophrenia (odds ratio:2.5, p<0.001). We also detected 16 missense mutations and 1 amino acid-insertion mutation in this sample. Bioinformatic analysis showed some of these mutations were damaging or pathological to the protein function, but we did not find increased burden of these mutations in the patient group. Notably, we identified 5 private rare variants in 5 unrelated patients, respectively, including c.−69+9C>T, c.−69+13C>T, c.−69+47C>T, c.−69+55C>T at intron 1 and c.−32A>G at untranslated exon 2 of the DLGAP2 gene. These rare variants were not detected in 559 control subjects. Further reporter gene assay of these rare variants except c.−69+13C>T showed significantly elevated promoter activity than the wild type, suggesting increased DLGAP2 gene expression may contribute to the pathogenesis of schizophrenia. Our results indicate that DLGAP2 is a susceptible gene of schizophrenia. PMID:24416398

  17. Association between CYP2E1 polymorphisms and risk of differentiated thyroid carcinoma.

    PubMed

    Pellé, Lucia; Cipollini, Monica; Tremmel, Roman; Romei, Cristina; Figlioli, Gisella; Gemignani, Federica; Melaiu, Ombretta; De Santi, Chiara; Barone, Elisa; Elisei, Rossella; Seiser, Eric; Innocenti, Federico; Zanger, Ulrich M; Landi, Stefano

    2016-12-01

    Differentiated thyroid carcinoma (DTC) results from complex interactions between genetic and environmental factors. Known etiological factors include exposure to ionizing radiations, previous thyroid diseases, and hormone factors. It has been speculated that dietary acrylamide (AA) formed in diverse foods following the Maillard's reaction could be a contributing factor for DTC in humans. Upon absorption, AA is biotransformed mainly by cytochrome P450 2E1 (CYP2E1) to glycidamide (GA). Considering that polymorphisms within CYP2E1 were found associated with endogenous levels of AA-Valine and GA-Valine hemoglobin adducts in humans, we raised the hypothesis that specific CYP2E1 genotypes could be associated with the risk of DTC. Analysis of four haplotype tagging SNPs (ht-SNPs) within the locus in a discovery case-control study (N = 350/350) indicated an association between rs2480258 and DTC risk. This ht-SNP resides within a linkage disequilibrium block spanning intron VIII and the 3'-untranslated region. Extended analysis in a large replication set (2429 controls and 767 cases) confirmed the association, with odds ratios for GA and AA genotypes of 1.24 (95 % confidence interval (CI) 1.03-1.48) and 1.56 (95 % CI, 1.06-2.30), respectively. Functionally, the minor allele was associated with low levels of CYP2E1 mRNA and protein expression as well as lower enzymatic activity in a series of 149 human liver samples. Our data support the hypothesis that inter-individual differences in CYP2E1 activity could modulate the risk of developing DTC suggesting that the exposure to specific xenobiotics, such as AA, could play a role in this process.

  18. Naturally Occurring Variations in the Human Cholinesterase Genes: Heritability and Association with Cardiovascular and Metabolic Traits

    PubMed Central

    Valle, Anne M.; Radić, Zoran; Rana, Brinda K.; Mahboubi, Vafa; Wessel, Jennifer; Shih, Pei-an Betty; Rao, Fangwen; O'Connor, Daniel T.

    2011-01-01

    Cholinergic neurotransmission in the central and autonomic nervous systems regulates immediate variations in and longer-term maintenance of cardiovascular function with acetylcholinesterase (AChE) activity that is critical to temporal responsiveness. Butyrylcholinesterase (BChE), largely confined to the liver and plasma, subserves metabolic functions. AChE and BChE are found in hematopoietic cells and plasma, enabling one to correlate enzyme levels in whole blood with hereditary traits in twins. Using both twin and unrelated subjects, we found certain single nucleotide polymorphisms (SNPs) in the ACHE gene correlated with catalytic properties and general cardiovascular functions. SNP discovery from ACHE resequencing identified 19 SNPs: 7 coding SNPs (cSNPs), of which 4 are nonsynonymous, and 12 SNPs in untranslated regions, of which 3 are in a conserved sequence of an upstream intron. Both AChE and BChE activity traits in blood were heritable: AChE at 48.8 ± 6.1% and BChE at 81.4 ± 2.8%. Allelic and haplotype variations in the ACHE and BCHE genes were associated with changes in blood AChE and BChE activities. AChE activity was associated with BP status and SBP, whereas BChE activity was associated with features of the metabolic syndrome (especially body weight and BMI). Gene products from cDNAs with nonsynonymous cSNPs were expressed and purified. Protein expression of ACHE nonsynonymous variant D134H (SNP6) is impaired: this variant shows compromised stability and altered rates of organophosphate inhibition and oxime-assisted reactivation. A substantial fraction of the D134H instability could be reversed in the D134H/R136Q mutant. Hence, common genetic variations at ACHE and BCHE loci were associated with changes in corresponding enzymatic activities in blood. PMID:21493754

  19. A genomic survey of the fish parasite Spironucleus salmonicida indicates genomic plasticity among diplomonads and significant lateral gene transfer in eukaryote genome evolution

    PubMed Central

    Andersson, Jan O; Sjögren, Åsa M; Horner, David S; Murphy, Colleen A; Dyal, Patricia L; Svärd, Staffan G; Logsdon, John M; Ragan, Mark A; Hirt, Robert P; Roger, Andrew J

    2007-01-01

    Background Comparative genomic studies of the mitochondrion-lacking protist group Diplomonadida (diplomonads) has been lacking, although Giardia lamblia has been intensively studied. We have performed a sequence survey project resulting in 2341 expressed sequence tags (EST) corresponding to 853 unique clones, 5275 genome survey sequences (GSS), and eleven finished contigs from the diplomonad fish parasite Spironucleus salmonicida (previously described as S. barkhanus). Results The analyses revealed a compact genome with few, if any, introns and very short 3' untranslated regions. Strikingly different patterns of codon usage were observed in genes corresponding to frequently sampled ESTs versus genes poorly sampled, indicating that translational selection is influencing the codon usage of highly expressed genes. Rigorous phylogenomic analyses identified 84 genes – mostly encoding metabolic proteins – that have been acquired by diplomonads or their relatively close ancestors via lateral gene transfer (LGT). Although most acquisitions were from prokaryotes, more than a dozen represent likely transfers of genes between eukaryotic lineages. Many genes that provide novel insights into the genetic basis of the biology and pathogenicity of this parasitic protist were identified including 149 that putatively encode variant-surface cysteine-rich proteins which are candidate virulence factors. A number of genomic properties that distinguish S. salmonicida from its human parasitic relative G. lamblia were identified such as nineteen putative lineage-specific gene acquisitions, distinct mutational biases and codon usage and distinct polyadenylation signals. Conclusion Our results highlight the power of comparative genomic studies to yield insights into the biology of parasitic protists and the evolution of their genomes, and suggest that genetic exchange between distantly-related protist lineages may be occurring at an appreciable rate in eukaryote genome evolution. PMID:17298675

  20. Abiotic Stresses Modulate Landscape of Poplar Transcriptome via Alternative Splicing, Differential Intron Retention, and Isoform Ratio Switching

    PubMed Central

    Filichkin, Sergei A.; Hamilton, Michael; Dharmawardhana, Palitha D.; Singh, Sunil K.; Sullivan, Christopher; Ben-Hur, Asa; Reddy, Anireddy S. N.; Jaiswal, Pankaj

    2018-01-01

    Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion of genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen millions of novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns – a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses. PMID:29483921

  1. Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.

    PubMed

    Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat

    2016-09-01

    CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.

  2. Mitochondrial genomes of the green macroalga Ulva pertusa (Ulvophyceae, Chlorophyta): novel insights into the evolution of mitogenomes in the Ulvophyceae.

    PubMed

    Liu, Feng; Melton, James T; Bi, Yuping

    2017-10-01

    To further understand the trends in the evolution of mitochondrial genomes (mitogenomes or mtDNAs) in the Ulvophyceae, the mitogenomes of two separate thalli of Ulva pertusa were sequenced. Two U. pertusa mitogenomes (Up1 and Up2) were 69,333 bp and 64,602 bp in length. These mitogenomes shared two ribosomal RNAs (rRNAs), 28 transfer RNAs (tRNAs), 29 protein-coding genes, and 12 open reading frames. The 4.7 kb difference in size was attributed to variation in intron content and tandem repeat regions. A total of six introns were present in the smaller U. pertusa mtDNA (Up2), while the larger mtDNA (Up1) had eight. The larger mtDNA had two additional group II introns in two genes (cox1 and cox2) and tandem duplication mutations in noncoding regions. Our results showed the first case of intraspecific variation in chlorophytan mitogenomes and provided further genomic data for the undersampled Ulvophyceae. © 2017 Phycological Society of America.

  3. Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk

    PubMed Central

    Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.

    2008-01-01

    Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408

  4. Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis

    PubMed Central

    D’Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; He, Hong; Li, Shibo; Hejtmancik, James F.; Sieving, Paul A.; Wang, Xinjing

    2013-01-01

    Purpose X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4–5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Methods Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Results Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5′ region of the RS1 gene (including the promoter) through intron 1 (c.(−35)-1723_c.51+2664del4472). The exon 4–5 deletion spans introns 3 to intron 5 (c.185–1020_c.522+1844del5764). Conclusions Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes. PMID:24227916

  5. Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis.

    PubMed

    D'Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; Lee, Ji-Yun; He, Hong; Li, Shibo; Smaoui, Nizar; Hejtmancik, James F; Sieving, Paul A; Wang, Xinjing

    2013-01-01

    X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4-5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5' region of the RS1 gene (including the promoter) through intron 1 (c.(-35)-1723_c.51+2664del4472). The exon 4-5 deletion spans introns 3 to intron 5 (c.185-1020_c.522+1844del5764). Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes.

  6. Design of retrovirus vectors for transfer and expression of the human. beta. -globin gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Miller, A.D.; Bender, M.A.; Harris, E.A.S.

    1988-11-01

    Regulated expression of the human ..beta..-globin gene has been demonstrated in cultured murine erythroleukemia cells and in mice after retrovirus-mediated gene transfer. However, the low titer of recombinant viruses described to date results in relatively inefficient gene transfer, which limits their usefulness for animal studies and for potential gene therapy in humans for diseases involving defective ..beta..-globin genes. The authors found regions that interfered with virus production within intron 2 of the ..beta..-globin gene and on both sides of the gene. The flanking regions could be removed, but intron 2 was required for ..beta..-globin expression. Inclusion of ..beta..-globin introns necessitatesmore » an antisense orientation of the gene within the retrovirus vector. However, they found no effect of the antisense ..beta..-globin transcription on virus production. A region downstream of the ..beta..-globin gene that stimulates expression of the gene in transgenic mice was included in the viruses without detrimental effects on virus titer. Virus titers of over 10/sup 6/ CFU/ml were obtained with the final vector design, which retained the ability to direct regulated expression of human ..beta..-globin in murine erythroleukemia cells. The vector also allowed transfer and expression of the human ..beta..-globin gene in hematopoietic cells (CFU-S cells) in mice.« less

  7. The Complete Plastid Genome of Lagerstroemia fauriei and Loss of rpl2 Intron from Lagerstroemia (Lythraceae)

    PubMed Central

    Gu, Cuihua; Tembrock, Luke R.; Johnson, Nels G.; Simmons, Mark P.; Wu, Zhiqiang

    2016-01-01

    Lagerstroemia (crape myrtle) is an important plant genus used in ornamental horticulture in temperate regions worldwide. As such, numerous hybrids have been developed. However, DNA sequence resources and genome information for Lagerstroemia are limited, hindering evolutionary inferences regarding interspecific relationships. We report the complete plastid genome of Lagerstroemia fauriei. To our knowledge, this is the first reported whole plastid genome within Lythraceae. This genome is 152,440 bp in length with 38% GC content and consists of two single-copy regions separated by a pair of 25,793 bp inverted repeats. The large single copy and the small single copy regions span 83,921 bp and 16,933 bp, respectively. The genome contains 129 genes, including 17 located in each inverted repeat. Phylogenetic analysis of genera sampled from Geraniaceae, Myrtaceae, and Onagraceae corroborated the sister relationship between Lythraceae and Onagraceae. The plastid genomes of L. fauriei and several other Lythraceae species lack the rpl2 intron, which indicating an early loss of this intron within the Lythraceae lineage. The plastid genome of L. fauriei provides a much needed genetic resource for further phylogenetic research in Lagerstroemia and Lythraceae. Highly variable markers were identified for application in phylogenetic, barcoding and conservation genetic applications. PMID:26950701

  8. The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

    PubMed Central

    Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

    1984-01-01

    We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565

  9. Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

    NASA Technical Reports Server (NTRS)

    Kretsinger, R. H.; Nakayama, S.

    1993-01-01

    In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.

  10. The paradox of MHC-DRB exon/intron evolution: alpha-helix and beta-sheet encoding regions diverge while hypervariable intronic simple repeats coevolve with beta-sheet codons.

    PubMed

    Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T

    1993-09-01

    Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.

  11. Phase distribution of spliceosomal introns: implications for intron origin

    PubMed Central

    Nguyen, Hung D; Yoshihama, Maki; Kenmochi, Naoya

    2006-01-01

    Background The origin of spliceosomal introns is the central subject of the introns-early versus introns-late debate. The distribution of intron phases is non-uniform, with an excess of phase-0 introns. Introns-early explains this by speculating that a fraction of present-day introns were present between minigenes in the progenote and therefore must lie in phase-0. In contrast, introns-late predicts that the nonuniformity of intron phase distribution reflects the nonrandomness of intron insertions. Results In this paper, we tested the two theories using analyses of intron phase distribution. We inferred the evolution of intron phase distribution from a dataset of 684 gene orthologs from seven eukaryotes using a maximum likelihood method. We also tested whether the observed intron phase distributions from 10 eukaryotes can be explained by intron insertions on a genome-wide scale. In contrast to the prediction of introns-early, the inferred evolution of intron phase distribution showed that the proportion of phase-0 introns increased over evolution. Consistent with introns-late, the observed intron phase distributions matched those predicted by an intron insertion model quite well. Conclusion Our results strongly support the introns-late hypothesis of the origin of spliceosomal introns. PMID:16959043

  12. Estrogen receptor alpha regulates expression of the breast cancer 1 associated ring domain 1 (BARD1) gene through intronic DNA sequence.

    PubMed

    Creekmore, Amy L; Ziegler, Yvonne S; Bonéy, Jamie L; Nardulli, Ann M

    2007-03-15

    We have used a chromatin immunoprecipitation (ChIP)-based cloning strategy to isolate and identify genes associated with estrogen receptor alpha (ERalpha) in MCF-7 human breast cancer cells. One of the gene regions isolated was a 288bp fragment from the ninth intron of the breast cancer 1 associated ring domain (BARD1) gene. We demonstrated that ERalpha associated with this region of the endogenous BARD 1 gene in MCF-7 cells, that ERalpha bound to three of five ERE half sites located in the 288bp BARD1 region, and that this 288bp BARD1 region conferred estrogen responsiveness to a heterologous promoter. Importantly, treatment of MCF-7 cells with estrogen increased BARD1 mRNA and protein levels. These findings demonstrate that ChIP cloning strategies can be utilized to successfully isolate regulatory regions that are far removed from the transcription start site and assist in identifying cis elements involved in conferring estrogen responsiveness.

  13. A functional study of proximal goat β-casein promoter and intron 1 in immortalized goat mammary epithelial cells.

    PubMed

    Kung, M H; Lee, Y J; Hsu, J T; Huang, M C; Ju, Y T

    2015-06-01

    Goat β-casein (CSN2) promoter has been extensively used to derive expression of recombinant therapeutic protein in transgenic goats; however, little direct evidence exists for signaling molecules and the cis-elements of goat CSN2 promoter in response to lactogenic hormone stimulation in goat mammary epithelial cells. Here, we use an immortalized caprine mammary epithelial cell line (CMC) to search for evidence of the above. Serial 5'-flanking regions deleted of promoter and intron 1 in goat CSN2 (-4,047 to +2,054) driven by firefly luciferase reporter gene were constructed and applied to measure promoter activity in CMC. The intron 1 region (+393 to +501) significantly decreased basal activity of the promoter. This finding contradicts other studies of the role of intron 1. The signal transducer and activator of transcription (STAT)5a played a significant role in activating promoter activity by prolactin stimulation. Hydrocortisone enhanced and prolonged the activity of STAT5a and promoter in CMC, but was independent of the glucocorticoid receptor response element. The minimum length of the CSN2 promoter segment in response to lactogenic stimulation was confirmed by 5' serial deletions. A cis-element located from -300 to -90 in proximal goat CSN2 promoter that is absent in bovine and human CSN2 promoter was newly identified. We demonstrated the presence of a STAT5a binding site (-102 to -82) and preservation of the guanosine nucleotide at position -90 based on responses to the presence of lactogenic hormone using internal deletions and point mutations of the predicted STAT5a binding site, and chromatin immunoprecipitation assay. Together, these findings demonstrate that the proximal -300 bp of goat CSN2 promoter containing the STAT5a binding site (-102 to -82) is the response element for lactogenic hormone stimulation. Additionally, intron 1 may be required for tissue or developmental stage-specific expression in mammary gland. The role of the far-distal regions of goat CSN2 promoter in high-level lactogenic hormone induction and specific expression require further examination. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  14. Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean

    PubMed Central

    2012-01-01

    Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675

  15. Associations of polymorphisms in the Pit-1 gene with growth and carcass traits in Angus beef cattle.

    PubMed

    Zhao, Q; Davis, M E; Hines, H C

    2004-08-01

    The Pit-1 gene was studied as a candidate for genetic markers of growth and carcass traits. Angus beef cattle that were divergently selected for high- or low-blood serum IGF-I concentration were used in this study. The single-strand conformation polymorphism method was used to identify polymorphism in the Pit-1 gene including regions from intron 2 to exon 6. Two polymorphisms, Pit1I3H (HinfI) and Pit1I3NL (NlaIII), were detected in intron 3 of the Pit-1 gene. One polymorphism, Pit1I4N (BstNI), was found in intron 4, and a single nucleotide polymorphism, Pit1I5, was found in intron 5. The previously reported polymorphism in exon 6, Pit1E6H (HinfI), was also studied in 416 Angus beef cattle. Associations of the polymorphisms with growth traits, carcass traits, and IGF-I concentration were analyzed using a general linear model procedure. No significant associations were observed between these polymorphisms and growth and carcass traits.

  16. mRNA-based detection of rare CFTR mutations improves genetic diagnosis of cystic fibrosis in populations with high genetic heterogeneity.

    PubMed

    Felício, V; Ramalho, A S; Igreja, S; Amaral, M D

    2017-03-01

    Even with advent of next generation sequencing complete sequencing of large disease-associated genes and intronic regions is economically not feasible. This is the case of cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible for cystic fibrosis (CF). Yet, to confirm a CF diagnosis, proof of CFTR dysfunction needs to be obtained, namely by the identification of two disease-causing mutations. Moreover, with the advent of mutation-based therapies, genotyping is an essential tool for CF disease management. There is, however, still an unmet need to genotype CF patients by fast, comprehensive and cost-effective approaches, especially in populations with high genetic heterogeneity (and low p.F508del incidence), where CF is now emerging with new diagnosis dilemmas (Brazil, Asia, etc). Herein, we report an innovative mRNA-based approach to identify CFTR mutations in the complete coding and intronic regions. We applied this protocol to genotype individuals with a suspicion of CF and only one or no CFTR mutations identified by routine methods. It successfully detected multiple intronic mutations unlikely to be detected by CFTR exon sequencing. We conclude that this is a rapid, robust and inexpensive method to detect any CFTR coding/intronic mutation (including rare ones) that can be easily used either as primary approach or after routine DNA analysis. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Vaccine induced differential expressions of miRNAs at cytolytic stage in chickens resistant or susceptible to Marek’s disease

    USDA-ARS?s Scientific Manuscript database

    Gene expression regulation is critical for all cellular processes since dysregulation of it often results in elevated disease risk and compromised cellular immunity. MicroRNAs (miRNAs) directly regulate gene expression post-transcriptionally through base-pairing with regions in the 3’-untranslated s...

  18. Genome-wide association identifies a deletion in the 3’ untranslated region of Striatin in a canine model of arrhythmogenic right ventricular cardiomyopathy

    USDA-ARS?s Scientific Manuscript database

    Arrhythmogenic right ventricular cardiomyopathy (ARVC) is a familial cardiac disease characterized by rapid ventricular tachycardia and sudden cardiac death. It is most frequently inherited as an autosomal dominant trait with incomplete and age-related penetrance and variable clinical expression. Th...

  19. Expression of exogenous human hepatic nuclear factor-1α by a lentiviral vector and its interactions with Plasmodium falciparum subtilisin-like protease 2.

    PubMed

    Liao, Shunyao; Liu, Yunqiang; Zheng, Bing; Cho, Pyo Yun; Song, Hyun Ok; Lee, Yun-Seok; Jung, Suk-Yul; Park, Hyun

    2011-12-01

    The onset, severity, and ultimate outcome of malaria infection are influenced by parasite-expressed virulence factors as well as by individual host responses to these determinants. In both humans and mice, liver injury follows parasite entry, persisting to the erythrocytic stage in the case of infection with the fatal strain of Plasmodium falciparum. Hepatic nuclear factor (HNF)-1α is a master regulator of not only the liver damage and adaptive responses but also diverse metabolic functions. In this study, we analyzed the expression of host HNF-1α in relation to malaria infection and evaluated its interaction with the 5'-untranslated region of subtilisin-like protease 2 (subtilase, Sub2). Recombinant human HNF-1α expressed by a lentiviral vector (LV HNF-1α) was introduced into mice. Interestingly, differences in the activity of the 5'-untranslated region of the Pf-Sub2 promoter were detected in 293T cells, and LV HNF-1α was observed to influence promoter activity, suggesting that host HNF-1α interacts with the Sub2 gene.

  20. [Cloning and sequence analysis of full-length cDNA of secoisolariciresinol dehydrogenase of Dysosma versipellis].

    PubMed

    Xu, Li; Ding, Zhi-Shan; Zhou, Yun-Kai; Tao, Xue-Fen

    2009-06-01

    To obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis by RACE PCR,then investigate the character of Secoisolariciresinol Dehydrogenase gene. The full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene was obtained by 3'-RACE and 5'-RACE from Dysosma versipellis. We first reported the full cDNA sequences of Secoisolariciresinol Dehydrogenase in Dysosma versipellis. The acquired gene was 991bp in full length, including 5' untranslated region of 42bp, 3' untranslated region of 112bp with Poly (A). The open reading frame (ORF) encoding 278 amino acid with molecular weight 29253.3 Daltons and isolectric point 6.328. The gene accession nucleotide sequence number in GeneBank was EU573789. Semi-quantitative RT-PCR analysis revealed that the Secoisolariciresinol Dehydrogenase gene was highly expressed in stem. Alignment of the amino acid sequence of Secoisolariciresinol Dehydrogenase indicated there may be some significant amino acid sequence difference among different species. Obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis.

  1. Selection of mRNA 5'-untranslated region sequence with high translation efficiency through ribosome display

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mie, Masayasu; Shimizu, Shun; Takahashi, Fumio

    2008-08-15

    The 5'-untranslated region (5'-UTR) of mRNAs functions as a translation enhancer, promoting translation efficiency. Many in vitro translation systems exhibit a reduced efficiency in protein translation due to decreased translation initiation. The use of a 5'-UTR sequence with high translation efficiency greatly enhances protein production in these systems. In this study, we have developed an in vitro selection system that favors 5'-UTRs with high translation efficiency using a ribosome display technique. A 5'-UTR random library, comprised of 5'-UTRs tagged with a His-tag and Renilla luciferase (R-luc) fusion, were in vitro translated in rabbit reticulocytes. By limiting the translation period, onlymore » mRNAs with high translation efficiency were translated. During translation, mRNA, ribosome and translated R-luc with His-tag formed ternary complexes. They were collected with translated His-tag using Ni-particles. Extracted mRNA from ternary complex was amplified using RT-PCR and sequenced. Finally, 5'-UTR with high translation efficiency was obtained from random 5'-UTR library.« less

  2. Phosphorodiamidate morpholino targeting the 5' untranslated region of the ZIKV RNA inhibits virus replication.

    PubMed

    Popik, Waldemar; Khatua, Atanu; Hildreth, James E K; Lee, Benjamin; Alcendor, Donald J

    2018-06-01

    Zika virus (ZIKV) infection has been associated with microcephaly in infants. Currently there is no treatment or vaccine. Here we explore the use of a morpholino oligonucleotide targeted to the 5' untranslated region (5'-UTR) of the ZIKV RNA to prevent ZIKV replication. Morpholino DWK-1 inhibition of ZIKV replication in human glomerular podocytes was examined by qRT-PCR, reduction in ZIKV genome copy number, western blot analysis, immunofluorescence and proinflammatory cytokine gene expression. Podocytes pretreated with DWK-1 showed reduced levels of both viral mRNA and ZIKV E protein expression compared to controls. We observed suppression in proinflammatory gene expression for IFN-β (interferon β) RANTES (regulated on activation, normal T cell expressed and secreted), MIP-1α (macrophage inflammatory protein-1α), TNF-α (tumor necrosis factor-α) and IL1-α (interleukin 1-α) in ZIKV-infected podocytes pretreated with DWK-1. Morpholino DWK-1 targeting the ZIKV 5'-UTR effectively inhibits ZIKV replication and suppresses ZIKV-induced proinflammatory gene expression. Copyright © 2018 Elsevier Inc. All rights reserved.

  3. Differential regulation of oestrogen receptor β isoforms by 5′ untranslated regions in cancer

    PubMed Central

    Smith, Laura; Brannan, Rebecca A; Hanby, Andrew M; Shaaban, Abeer M; Verghese, Eldo T; Peter, Mark B; Pollock, Steven; Satheesha, Sampoorna; Szynkiewicz, Marcin; Speirs, Valerie; Hughes, Thomas A

    2010-01-01

    Abstract Oestrogen receptors (ERs) are critical regulators of the behaviour of many cancers. Despite this, the roles and regulation of one of the two known ERs – ERβ– are poorly understood. This is partly because analyses have been confused by discrepancies between ERβ expression at mRNA and proteins levels, and because ERβ is expressed as several functionally distinct isoforms. We investigated human ERβ 5′ untranslated regions (UTRs) and their influences on ERβ expression and function. We demonstrate that two alternative ERβ 5′UTRs have potent and differential influences on expression acting at the level of translation. We show that their influences are modulated by cellular context and in carcinogenesis, and demonstrate the contributions of both upstream open reading frames and RNA secondary structure. These regulatory mechanisms offer explanations for the non-concordance of ERβ mRNA and protein. Importantly, we also demonstrate that 5′UTRs allow the first reported mechanisms for differential regulation of the expression of the ERβ isoforms 1, 2 and 5, and thereby have critical influences on ERβ function. PMID:20920096

  4. Eukaryotic Elongation Factor 1A Interacts with the Upstream Pseudoknot Domain in the 3′ Untranslated Region of Tobacco Mosaic Virus RNA

    PubMed Central

    Zeenko, Vladimir V.; Ryabova, Lyubov A.; Spirin, Alexander S.; Rothnie, Helen M.; Hess, Daniel; Browning, Karen S.; Hohn, Thomas

    2002-01-01

    The genomic RNA of tobacco mosaic virus (TMV), like that of other positive-strand RNA viruses, acts as a template for both translation and replication. The highly structured 3′ untranslated region (UTR) of TMV RNAs plays an important role in both processes; it is not polyadenylated but ends with a tRNA-like structure (TLS) preceded by a conserved upstream pseudoknot domain (UPD). The TLS of tobamoviral RNAs can be specifically aminoacylated and, in this state, can interact with eukaryotic elongation factor 1A (eEF1A)/GTP with high affinity. Using a UV cross-linking assay, we detected another specific binding site for eEF1A/GTP, within the UPDs of TMV and crucifer-infecting tobamovirus (crTMV), that does not require aminoacylation. A mutational analysis revealed that UPD pseudoknot conformation and some conserved primary sequence elements are required for this interaction. Its possible role in the regulation of tobamovirus gene expression and replication is discussed. PMID:11991996

  5. The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes

    PubMed Central

    Pombert, Jean-François; Lemieux, Claude; Turmel, Monique

    2006-01-01

    Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375

  6. New encoded single-indicator sequences based on physico-chemical parameters for efficient exon identification.

    PubMed

    Meher, J K; Meher, P K; Dash, G N; Raval, M K

    2012-01-01

    The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.

  7. CelF of Orpinomyces PC-2 has an intron and encodes a cellulase (CelF) containing a carbohydrate-binding module.

    PubMed

    Chen, Huizhong; Li, Xin-Liang; Blum, David L; Ximenes, Eduardo A; Ljungdahl, Lars G

    2003-01-01

    A cDNA, designated celF, encoding a cellulase (CelF) was isolated from the anaerobic fungus Orpinomyces PC-2. The open reading frame contains regions coding for a signal peptide, a carbohydrate-binding module (CBM), a linker, and a catalytic domain. The catalytic domain was homologous to those of CelA and CelC of the same fungus and to that of the Neocallimastix patriciarum CELA, but CelF lacks a docking domain, characteristic for enzymes of cellulosomes. It was also homologous to the cellobiohydrolase IIs and endoglucanases of aerobic organisms. The gene has a 111-bp intron, located within the CBM-coding region. Some biochemical properties of the purified recombinant enzyme are described.

  8. A homozygous mutation in the stem II domain of RNU4ATAC causes typical Roifman syndrome.

    PubMed

    Dinur Schejter, Yael; Ovadia, Adi; Alexandrova, Roumiana; Thiruvahindrapuram, Bhooma; Pereira, Sergio L; Manson, David E; Vincent, Ajoy; Merico, Daniele; Roifman, Chaim M

    2017-01-01

    Roifman syndrome (OMIM# 616651) is a complex syndrome encompassing skeletal dysplasia, immunodeficiency, retinal dystrophy and developmental delay, and is caused by compound heterozygous mutations involving the Stem II region and one of the other domains of the RNU4ATAC gene. This small nuclear RNA gene is essential for minor intron splicing. The Canadian Centre for Primary Immunodeficiency Registry and Repository were used to derive patient information as well as tissues. Utilising RNA sequencing methodologies, we analysed samples from patients with Roifman syndrome and assessed intron retention. We demonstrate that a homozygous mutation in Stem II is sufficient to cause the full spectrum of features associated with typical Roifman syndrome. Further, we demonstrate the same pattern of aberration in minor intron retention as found in cases with compound heterozygous mutations.

  9. Is “Junk” DNA Mostly Intron DNA?

    PubMed Central

    Wong, Gane Ka-Shu; Passey, Douglas A.; Huang, Ying-zong; Yang, Zhiyong; Yu, Jun

    2000-01-01

    Among higher eukaryotes, very little of the genome codes for protein. What is in the rest of the genome, or the “junk” DNA, that, in Homo sapiens, is estimated to be almost 97% of the genome? Is it possible that much of this “junk” is intron DNA? This is not a question that can be answered just by looking at the published data, even from the finished genomes. One cannot assume that there are no genes in a sequenced region, just because no genes were annotated. We introduce another approach to this problem, based on an analysis of the cDNA-to-genomic alignments, in all of the complete or nearly-complete genomes from the multicellular organisms. Our conclusion is that, in animals but not in plants, most of the “junk” is intron DNA. PMID:11076852

  10. Characterization of a Maize Wip1 Promoter in Transgenic Plants

    PubMed Central

    Zhang, Shengxue; Lian, Yun; Liu, Yan; Wang, Xiaoqing; Liu, Yunjun; Wang, Guoying

    2013-01-01

    The Maize Wip1 gene encodes a wound-induced Bowman-Birk inhibitor (BBI) protein which is a type of serine protease inhibitor, and its expression is induced by wounding or infection, conferring resistance against pathogens and pests. In this study, the maize Wip1 promoter was isolated and its function was analyzed. Different truncated Wip1 promoters were fused upstream of the GUS reporter gene and transformed into Arabidopsis, tobacco and rice plants. We found that (1) several truncated maize Wip1 promoters led to strong GUS activities in both transgenic Arabidopsis and tobacco leaves, whereas low GUS activity was detected in transgenic rice leaves; (2) the Wip1 promoter was not wound-induced in transgenic tobacco leaves, but was induced by wounding in transgenic rice leaves; (3) the truncated Wip1 promoter had different activity in different organs of transgenic tobacco plants; (4) the transgenic plant leaves containing different truncated Wip1 promoters had low GUS transcripts, even though high GUS protein level and GUS activities were observed; (5) there was one transcription start site of Wip1 gene in maize and two transcription start sites of GUS in Wip1::GUS transgenic lines; (6) the adjacent 35S promoter which is present in the transformation vectors enhanced the activity of the truncated Wip1 promoters in transgenic tobacco leaves, but did not influence the disability of truncated Wip1231 promoter to respond to wounding signals. We speculate that an ACAAAA hexamer, several CAA trimers and several elements similar to ACAATTAC octamer in the 5′-untranslated region might contribute to the strong GUS activity in Wip1231 transgenic lines, meanwhile, compared to the 5′-untranslated region from Wip1231 transgenic lines, the additional upstream open reading frames (uORFs) in the 5′-untranslated region from Wip1737 transgenic lines might contribute to the lower level of GUS transcript and GUS activity. PMID:24322445

  11. Posttranscriptional Control of Photosynthetic mRNA Decay under Stress Conditions Requires 3′ and 5′ Untranslated Regions and Correlates with Differential Polysome Association in Rice1[W][OA

    PubMed Central

    Park, Su-Hyun; Chung, Pil Joong; Juntawong, Piyada; Bailey-Serres, Julia; Kim, Youn Shic; Jung, Harin; Bang, Seung Woon; Kim, Yeon-Ki; Do Choi, Yang; Kim, Ju-Kon

    2012-01-01

    Abiotic stress, including drought, salinity, and temperature extremes, regulates gene expression at the transcriptional and posttranscriptional levels. Expression profiling of total messenger RNAs (mRNAs) from rice (Oryza sativa) leaves grown under stress conditions revealed that the transcript levels of photosynthetic genes are reduced more rapidly than others, a phenomenon referred to as stress-induced mRNA decay (SMD). By comparing RNA polymerase II engagement with the steady-state mRNA level, we show here that SMD is a posttranscriptional event. The SMD of photosynthetic genes was further verified by measuring the half-lives of the small subunit of Rubisco (RbcS1) and Chlorophyll a/b-Binding Protein1 (Cab1) mRNAs during stress conditions in the presence of the transcription inhibitor cordycepin. To discern any correlation between SMD and the process of translation, changes in total and polysome-associated mRNA levels after stress were measured. Total and polysome-associated mRNA levels of two photosynthetic (RbcS1 and Cab1) and two stress-inducible (Dehydration Stress-Inducible Protein1 and Salt-Induced Protein) genes were found to be markedly similar. This demonstrated the importance of polysome association for transcript stability under stress conditions. Microarray experiments performed on total and polysomal mRNAs indicate that approximately half of all mRNAs that undergo SMD remain polysome associated during stress treatments. To delineate the functional determinant(s) of mRNAs responsible for SMD, the RbcS1 and Cab1 transcripts were dissected into several components. The expressions of different combinations of the mRNA components were analyzed under stress conditions, revealing that both 3′ and 5′ untranslated regions are necessary for SMD. Our results, therefore, suggest that the posttranscriptional control of photosynthetic mRNA decay under stress conditions requires both 3′ and 5′ untranslated regions and correlates with differential polysome association. PMID:22566494

  12. RNA-Mediated Thermoregulation of Iron-Acquisition Genes in Shigella dysenteriae and Pathogenic Escherichia coli

    PubMed Central

    Kouse, Andrew B.; Righetti, Francesco; Kortmann, Jens; Narberhaus, Franz; Murphy, Erin R.

    2013-01-01

    The initiation, progression and transmission of most bacterial infections is dependent upon the ability of the invading pathogen to acquire iron from each of the varied environments encountered during the course of a natural infection. In total, 95% of iron within the human body is complexed within heme, making heme a potentially rich source of host-associated nutrient iron for invading bacteria. As heme is encountered only within the host, pathogenic bacteria often regulate synthesis of heme utilization factors such that production is maximal under host-associated environmental conditions. This study examines the regulated production of ShuA, an outer-membrane receptor required for the utilization of heme as a source of nutrient iron by Shigella dysenteriae, a pathogenic bacterium that causes severe diarrheal diseases in humans. Specifically, the impact of the distinct environmental temperatures encountered during infection within a host (37°C) and transmission between hosts (25°C) on shuA expression is investigated. We show that shuA expression is subject to temperature-dependent post-transcriptional regulation resulting in increased ShuA production at 37°C. The observed thermoregulation is mediated by nucleic acid sequences within the 5′ untranslated region. In addition, we have identified similar nucleotide sequences within the 5′ untranslated region of the orthologous chuA transcript of enteropathogenic E. coli and have demonstrated that it also functions to confer temperature-dependent post-transcriptional regulation. In both function and predicted structure, the regulatory element within the shuA and chuA 5′ untranslated regions closely resembles a FourU RNA thermometer, a zipper-like RNA structure that occludes the Shine-Dalgarno sequence at low temperatures. Increased production of ShuA and ChuA in response to the host body temperature allows for maximal production of these heme acquisition factors within the environment where S. dysenteriae and pathogenic E. coli strains would encounter heme, a host-specific iron source. PMID:23704938

  13. Genomic organization of the Neurospora crassa gsn gene: possible involvement of the STRE and HSE elements in the modulation of transcription during heat shock.

    PubMed

    Freitas, F Zanolli; Bertolini, M C

    2004-12-01

    Glycogen synthase, an enzyme involved in glycogen biosynthesis, is regulated by phosphorylation and by the allosteric ligand glucose-6-phosphate (G6P). In addition, enzyme levels can be regulated by changes in gene expression. We recently cloned a cDNA for glycogen synthase ( gsn) from Neurospora crassa, and showed that gsn transcription decreased when cells were exposed to heat shock (shifted from 30 degrees C to 45 degrees C). In order to understand the mechanisms that control gsn expression, we isolated the gene, including its 5' and 3' flanking regions, from the genome of N. crassa. An ORF of approximately 2.4 kb was identified, which is interrupted by four small introns (II-V). Intron I (482 bp) is located in the 5'UTR region. Three putative Transcription Initiation Sites (TISs) were mapped, one of which lies downstream of a canonical TATA-box sequence (5'-TGTATAAA-3'). Analysis of the 5'-flanking region revealed the presence of putative transcription factor-binding sites, including Heat Shock Elements (HSEs) and STress Responsive Elements (STREs). The possible involvement of these motifs in the negative regulation of gsn transcription was investigated using Electrophoretic Mobility Shift Assays (EMSA) with nuclear extracts of N. crassa mycelium obtained before and after heat shock, and DNA fragments encompassing HSE and STRE elements from the 5'-flanking region. While elements within the promoter region are involved in transcription under heat shock, elements in the 5'UTR intron may participate in transcription during vegetative growth. The results thus suggest that N. crassa possesses trans -acting elements that interact with the 5'-flanking region to regulate gsn transcription during heat shock and vegetative growth.

  14. Surface Diversity in Mycoplasma agalactiae Is Driven by Site-Specific DNA Inversions within the vpma Multigene Locus

    PubMed Central

    Glew, Michelle D.; Marenda, Marc; Rosengarten, Renate; Citti, Christine

    2002-01-01

    The ruminant pathogen Mycoplasma agalactiae possesses a family of abundantly expressed variable surface lipoproteins called Vpmas. Phenotypic switches between Vpma members have previously been correlated with DNA rearrangements within a locus of vpma genes and are proposed to play an important role in disease pathogenesis. In this study, six vpma genes were characterized in the M. agalactiae type strain PG2. All vpma genes clustered within an 8-kb region and shared highly conserved 5′ untranslated regions, lipoprotein signal sequences, and short N-terminal sequences. Analyses of the vpma loci from consecutive clonal isolates showed that vpma DNA rearrangements were site specific and that cleavage and strand exchange occurred within a minimal region of 21 bp located within the 5′ untranslated region of all vpma genes. This process controlled expression of vpma genes by effectively linking the open reading frame (ORF) of a silent gene to a unique active promoter sequence within the locus. An ORF (xer1) immediately adjacent to one end of the vpma locus did not undergo rearrangement and had significant homology to a distinct subset of genes belonging to the λ integrase family of site-specific xer recombinases. It is proposed that xer1 codes for a site-specific recombinase that is not involved in chromosome dimer resolution but rather is responsible for the observed vpma-specific recombination in M. agalactiae. PMID:12374833

  15. The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

    1994-12-31

    Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less

  16. Polyoma virus small tumor antigen pre-mRNA splicing requires cooperation between two 3' splice sites.

    PubMed Central

    Ge, H; Noble, J; Colgan, J; Manley, J L

    1990-01-01

    We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146

  17. Molecular cloning, sequence analysis, prokaryotic expression, and function prediction of foot-specific peroxidase in Hydra magnipapillata Chinese strain.

    PubMed

    Pan, H C; Yang, H Q; Zhao, F X; Qian, X C

    2014-08-28

    The cDNA sequence of foot-specific peroxidase PPOD1 from the Chinese strain of Hydra magnipapillata was cloned by reverse transcription-polymerase chain reaction. The cDNA sequence contained a coding region with an 873-bp open reading frame, a 31-bp 5'-untranslated region, and a 36-bp 3'-untranslated region. The structure prediction results showed that PPOD1 contains 10.34% of α-helix, 38.62% of extended strand, 12.41% of β-turn, and 38.62% of random coil. The structural core was α-helix at the N terminus. The GenBank protein blast server showed that PPOD1 contains 2 fascin-like domains. In addition, high-level PPOD1 activity was only present in the ectodermal epithelial cells located on the edge of the adhesive face of the basal disc, and that these cells extended lamellipodia and filopodia when the basal disc was tightly attached to a glass slide. The fascin-like domains of Hydra PPOD1 might contribute to the bundling of the actin filament of these cells, and hence, the formation of filopodia. In conclusion, these cells might play an important role in strengthening the adsorbability of the basal disc to substrates.

  18. Identification of 15 novel partial SHOX deletions and 13 partial duplications, and a review of the literature reveals intron 3 to be a hotspot region.

    PubMed

    Benito-Sanz, Sara; Belinchon-Martínez, Alberta; Aza-Carmona, Miriam; de la Torre, Carolina; Huber, Celine; González-Casado, Isabel; Ross, Judith L; Thomas, N Simon; Zinn, Andrew R; Cormier-Daire, Valerie; Heath, Karen E

    2017-02-01

    Short stature homeobox gene (SHOX) is located in the pseudoautosomal region 1 of the sex chromosomes. It encodes a transcription factor implicated in the skeletal growth. Point mutations, deletions or duplications of SHOX or its transcriptional regulatory elements are associated with two skeletal dysplasias, Léri-Weill dyschondrosteosis (LWD) and Langer mesomelic dysplasia (LMD), as well as in a small proportion of idiopathic short stature (ISS) individuals. We have identified a total of 15 partial SHOX deletions and 13 partial SHOX duplications in LWD, LMD and ISS patients referred for routine SHOX diagnostics during a 10 year period (2004-2014). Subsequently, we characterized these alterations using MLPA (multiplex ligation-dependent probe amplification assay), fine-tiling array CGH (comparative genomic hybridation) and breakpoint PCR. Nearly half of the alterations have a distal or proximal breakpoint in intron 3. Evaluation of our data and that in the literature reveals that although partial deletions and duplications only account for a small fraction of SHOX alterations, intron 3 appears to be a breakpoint hotspot, with alterations arising by non-allelic homologous recombination, non-homologous end joining or other complex mechanisms.

  19. Arginine kinase in Toxocara canis: Exon-intron organization, functional analysis of site-directed mutants and evaluation of putative enzyme inhibitors.

    PubMed

    Wickramasinghe, Susiji; Yatawara, Lalani; Nagataki, Mitsuru; Agatsuma, Takeshi

    2016-10-01

    To determine exon/intron organization of the Toxocara canis (T. canis) AK (TCAK) and to test green and black tea and several other chemicals against the activity of recombinant TCAK in the guanidino-specific region by site-directed mutants. Amplification of genomic DNA fragments containing introns was carried out by PCRs. The open-reading frame (1200 bp) of TCAK (wild type) was cloned into the BamH1/SalI site of pMAL-c2X. The maltose-binding protein-TCAK fusion protein was expressed in Escherichia coli TB1 cells. The purity of the expressed enzyme was verified by SDS-PAGE. Mutations were introduced into the guanidino-specific region and other areas of pMAL/TCAK by PCR. Enzyme activity was measured with an NADH-linked assay at 25 °C for the forward reaction (phosphagen synthesis). Arginine kinase in T. canis has a seven-exon/six-intron gene structure. The lengths of the introns ranged from 542 bp to 2 500 bp. All introns begin with gt and end with ag. Furthermore, we measured the enzyme activity of site-directed mutants of the recombinant TCAK. The K m value of the mutant (Alanine to Serine) decreased indicating a higher affinity for substrate arginine than the wild-type. The K m value of the mutant (Serine to Glycine) increased to 0.19 mM. The K m value (0.19 mM) of the double mutant (Alanine-Serine to Serine-Glycine) was slightly greater than in the wild-type (0.12 mM). In addition, several other chemicals were tested; including plant extract Azadiracta indica (A. indica), an aminoglycoside antibiotic (aminosidine), a citrus flavonoid glycoside (rutin) and a commercially available catechin mixture against TCAK. Green and black tea (1:10 dilution) produced 15% and 25% inhibition of TCAK, respectively. The extract of A. indica produced 5% inhibition of TCAK. Moreover, green and black tea produced a non-competitive type of inhibition and A. indica produced a mixed-type of inhibition on TCAK. Arginine kinase in T. canis has a seven-exon/six-intron gene structure. However, further studies are needed to identify a specific compound within the extract causing the inhibitory effect and also to determine the molecular mechanisms behind inhibition of arginine kinase in T. canis. Copyright © 2016 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.

  20. Interaction of Dopamine Transporter Gene and Observed Parenting Behaviors on Attention-Deficit/Hyperactivity Disorder: A Structural Equation Modeling Approach

    ERIC Educational Resources Information Center

    Li, James J.; Lee, Steve S.

    2013-01-01

    Emerging evidence suggests that some individuals may be simultaneously more responsive to the effects from environmental adversity "and" enrichment (i.e., differential susceptibility). Given that parenting behavior and a variable number tandem repeat polymorphism in the 3'untranslated region of the dopamine transporter (DAT1) gene are…

  1. Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.

    PubMed

    Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T

    2006-03-01

    The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.

  2. Expression analysis and in silico characterization of intronic long noncoding RNAs in renal cell carcinoma: emerging functional associations

    PubMed Central

    2013-01-01

    Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC. Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation. PMID:24238219

  3. Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

    PubMed

    Proudhon, D; Wei, J; Briat, J; Theil, E C

    1996-03-01

    Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.

  4. Group I introns are widespread in archaea.

    PubMed

    Nawrocki, Eric P; Jones, Thomas A; Eddy, Sean R

    2018-05-18

    Group I catalytic introns have been found in bacterial, viral, organellar, and some eukaryotic genomes, but not in archaea. All known archaeal introns are bulge-helix-bulge (BHB) introns, with the exception of a few group II introns. It has been proposed that BHB introns arose from extinct group I intron ancestors, much like eukaryotic spliceosomal introns are thought to have descended from group II introns. However, group I introns have little sequence conservation, making them difficult to detect with standard sequence similarity searches. Taking advantage of recent improvements in a computational homology search method that accounts for both conserved sequence and RNA secondary structure, we have identified 39 group I introns in a wide range of archaeal phyla, including examples of group I introns and BHB introns in the same host gene.

  5. The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?

    PubMed Central

    Koonin, Eugene V

    2006-01-01

    Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971

  6. Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae)

    PubMed Central

    Jansen, Robert K.; Wojciechowski, Martin F.; Sanniyasi, Elumalai; Lee, Seung-Bum; Daniell, Henry

    2008-01-01

    Chickpea (Cicer arietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319 bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5′exon, similar to other legumes. Two genes have lost their introns, one in the 3′exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate “IR-lacking clade” (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering. PMID:18638561

  7. Reduced DNA methylation of FKBP5 in Cushing's syndrome.

    PubMed

    Resmini, Eugenia; Santos, Alicia; Aulinas, Anna; Webb, Susan M; Vives-Gilabert, Yolanda; Cox, Olivia; Wand, Gary; Lee, Richard S

    2016-12-01

    FKBP5 encodes a co-chaperone of HSP90 protein that regulates intracellular glucocorticoid receptor sensitivity. When it is bound to the glucocorticoid receptor complex, cortisol binds with lower affinity to glucocorticoid receptor. Cushing's syndrome is associated with memory deficits, smaller hippocampal volumes, and wide range of cognitive impairments. We aimed at evaluating blood DNA methylation of FKBP5 and its relationship with memory and hippocampal volumes in Cushing's syndrome patients. Polymorphism rs1360780 in FKBP5 has also been assessed to determine whether genetic variations can also govern CpG methylation. Thirty-two Cushing's syndrome patients and 32 matched controls underwent memory tests, 3-Tesla MRI of the brain, and DNA extraction from total leukocytes. DNA samples were bisulfite treated, PCR amplified, and pyrosequenced to assess a total of 41CpG-dinucleotides in the introns 1, 2, 5, and 7 of FKBP5. Significantly lower intronic FKBP5 DNA methylation in CS patients compared to controls was observed in ten CpG-dinucleotides. DNA methylation at these CpGs correlated with left and right HV (Intron-2-Region-2-CpG-3: LHV, r = 0.73, p = 0.02; RHV, r = 0.58, p = 0.03). Cured and active CS patients showed both lower methylation of intron 2 (92.37, 91.8, and 93.34 %, respectively, p = 0.03 for both) and of intron 7 (77.08, 73.74, and 79.71 %, respectively, p = 0.02 and p < 0.01) than controls. Twenty-two subjects had the CC genotype, 34 had the TC genotype, and eight had the TT genotype. Lower average DNA methylation in intron 7 was observed in the TT subjects compared to CC (72.5vs. 79.5 %, p = 0.02) and to TC (72.5 vs. 79.0 %, p = 0.03). Our data demonstrate, for the first time, a reduction of intronic DNA methylation of FKBP5 in CS patients.

  8. Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns.

    PubMed

    Wu, Baojun; Hao, Weilong

    2014-04-16

    Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.

  9. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Abraitiene, Asta; US Department of Agriculture, Agricultural Research Service, Molecular Plant Pathology Laboratory, Room 214 Building 004 BARC-West, 10300 Baltimore Avenue, Beltsville, MD 20705; Zhao Yan

    Transient expression of engineered reporter RNAs encoding an intron-containing green fluorescent protein (GFP) from a Potato virus X-based expression vector previously demonstrated the nuclear targeting capability of the 359 nucleotide Potato spindle tuber viroid (PSTVd) RNA genome. To further delimit the putative nuclear-targeting signal, PSTVd subgenomic fragments were embedded within the intron, and recombinant reporter RNAs were inoculated onto Nicotiana benthamiana plants. Appearance of green fluorescence in leaf tissue inoculated with PSTVd-fragment-containing constructs indicated shuttling of the RNA into the nucleus by fragments as short as 80 nucleotides in length. Plant-to-plant variation in the timing of intron removal and subsequentmore » GFP fluorescence was observed; however, earliest and most abundant GFP expression was obtained with constructs containing the conserved hairpin I palindrome structure and embedded upper central conserved region. Our results suggest that this conserved sequence and/or the stem-loop structure it forms is sufficient for import of PSTVd into the nucleus.« less

  10. [Identification and phylogenetic application of unique nucleotide sequence of nad7 intron2 in Rhodiola (Crassulaceae) species].

    PubMed

    Deng, Ke-Jun; Yang, Zu-Jun; Liu, Cheng; Zhao, Wei; Liu, Chang; Feng, Juan; Ren, Zheng-Long

    2007-03-01

    Genetic characterization of 9 populations of Rhodiola crenulata, R. fastigiata and R. sachalinensis (Crassulaceae) species from Sichuan and Jilin Provinces of China, was investigated using the conserved primer of nad7 intron 2. All PCR products about 800 bp long were shorter than other Crassulaceae plants, which were used as molecular markers to identify the Rhodiola species. The sequence of the products indicated that total exon of 53 bp and intron of 738 bp exhibit only 9 nucleotide variations. Blasting the nad7 sequences to GenBank and the phylogenetic analysis showed that the sequence of Rhodiola species was clusted independently, and the length was smaller than all the registered sequences of higher plants. The result suggests that the Rhiodola species had a unique sequence in this gene region, which might be related to the special growth condition.

  11. Identification of human short introns

    PubMed Central

    Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine

    2017-01-01

    Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720

  12. Intermediate introns in nuclear genes of euglenids - are they a distinct type?

    PubMed

    Milanowski, Rafał; Gumińska, Natalia; Karnkowska, Anna; Ishikawa, Takao; Zakryś, Bożena

    2016-02-29

    Nuclear genes of euglenids contain two major types of introns: conventional spliceosomal and nonconventional introns. The latter are characterized by variable non-canonical borders, RNA secondary structure that brings intron ends together, and an unknown mechanism of removal. Some researchers also distinguish intermediate introns, which combine features of both types. They form a stable RNA secondary structure and are classified into two subtypes depending on whether they contain one (intermediate/nonconventional subtype) or both (conventional/intermediate subtype) canonical spliceosomal borders. However, it has been also postulated that most introns classified as intermediate could simply be special cases of conventional or nonconventional introns. Sequences of tubB, hsp90 and gapC genes from six strains of Euglena agilis were obtained. They contain four, six, and two or three introns, respectively (the third intron in the gapC gene is unique for just one strain). Conventional introns were present at three positions: two in the tubB gene (at one position conventional/intermediate introns were also found) and one in the gapC gene. Nonconventional introns are present at ten positions: two in the tubB gene (at one position intermediate/nonconventional introns were also found), six in hsp90 (at four positions intermediate/nonconventional introns were also found), and two in the gapC gene. Sequence and RNA secondary structure analyses of nonconventional introns confirmed that their most strongly conserved elements are base pairing nucleotides at positions +4, +5 and +6/ -8, -7 and -6 (in most introns CAG/CTG nucleotides were observed). It was also confirmed that the presence of the 5' GT/C end in intermediate/nonconventional introns is not the result of kinship with conventional introns, but is due to evolutionary pressure to preserve the purine at the 5' end. However, an example of a nonconventional intron with GC-AG ends was shown, suggesting the possibility of intron type conversion between nonconventional and conventional. Furthermore, an analysis of conventional introns revealed that the ability to form a stable RNA secondary structure by some introns is probably not a result of their relationship with nonconventional introns. It was also shown that acquisition of new nonconventional introns is an ongoing process and can be observed at the level of a single species. In the recently acquired intron in the gapC gene an extended direct repeats at the intron-exon junctions are present, suggesting that double-strand break repair process could be the source of new nonconventional introns.

  13. Splicing-Related Features of Introns Serve to Propel Evolution

    PubMed Central

    Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang

    2013-01-01

    The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505

  14. Introns: The Functional Benefits of Introns in Genomes.

    PubMed

    Jo, Bong-Seok; Choi, Sun Shim

    2015-12-01

    The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  15. Limited MHC class I intron 2 repertoire variation in bonobos.

    PubMed

    de Groot, Natasja G; Heijmans, Corrine M C; Helsen, Philippe; Otting, Nel; Pereboom, Zjef; Stevens, Jeroen M G; Bontrop, Ronald E

    2017-10-01

    Common chimpanzees (Pan troglodytes) experienced a selective sweep, probably caused by a SIV-like virus, which targeted their MHC class I repertoire. Based on MHC class I intron 2 data analyses, this selective sweep took place about 2-3 million years ago. As a consequence, common chimpanzees have a skewed MHC class I repertoire that is enriched for allotypes that are able to recognise conserved regions of the SIV proteome. The bonobo (Pan paniscus) shared an ancestor with common chimpanzees approximately 1.5 to 2 million years ago. To investigate whether the signature of this selective sweep is also detectable in bonobos, the MHC class I gene repertoire of two bonobo panels comprising in total 29 animals was investigated by Sanger sequencing. We identified 14 Papa-A, 20 Papa-B and 11 Papa-C alleles, of which eight, five and eight alleles, respectively, have not been reported previously. Within this pool of MHC class I variation, we recovered only 2 Papa-A, 3 Papa-B and 6 Papa-C intron 2 sequences. As compared to humans, bonobos appear to have an even more diminished MHC class I intron 2 lineage repertoire than common chimpanzees. This supports the notion that the selective sweep may have predated the speciation of common chimpanzees and bonobos. The further reduction of the MHC class I intron 2 lineage repertoire observed in bonobos as compared to the common chimpanzee may be explained by a founding effect or other subsequent selective processes.

  16. Differential expression profiles of miRNAs induced by vaccination followed by Marek’s disease virus challenge at cytolytic stage in chickens resistant or susceptible to Marek’s disease

    USDA-ARS?s Scientific Manuscript database

    Mounting evidence shows microRNAs (miRNAs) directly regulate gene expression post-transcriptionally through base-pairing with regions in the 3’-untranslated sequences of target gene mRNAs, which results in dysregulation of gene expression/translation and subsequently modulates cellular processes. We...

  17. Differential Impact of the "FMR1" Gene on Visual Processing in Fragile X Syndrome

    ERIC Educational Resources Information Center

    Kogan, Cary S.; Boutet, Isabelle; Cornish, Kim; Zangenehpour, Shahin; Mullen, Kathy T.; Holden, Jeanette J. A.; Kaloustian, Vazken M. Der; Andermann, Eva; Chaudhuri, Avi

    2004-01-01

    Fragile X syndrome (FXS) is the most common form of heritable mental retardation, affecting (~ around) 1 in 4000 males. The syndrome arises from expansion of a trinucleotide repeat in the 5'-untranslated region of the fragile X mental retardation 1 ("FMR1") gene, leading to methylation of the promoter sequence and lack of the fragile X mental…

  18. Metal specificity of an iron-responsive element in Alzheimer's APP mRNA 5'untranslated region, tolerance of SH-SY5Y and H4 neural cells to desferrioxamine, clioquinol, VK-28, and a piperazine chelator.

    PubMed

    Bandyopadhyay, S; Huang, X; Cho, H; Greig, N H; Youdim, M B; Rogers, J T

    2006-01-01

    Iron closely regulates the expression of the Alzheimer's Amyloid Precursor Protein (APP) gene at the level of message translation by a pathway similar to iron control of the translation of the ferritin L- and H mRNAs by Iron-responsive Elements in their 5' untranslated regions (5'UTRs). Using transfection based assays in SH-SY5Y neuroblastoma cells we tested the relative efficiency by which iron, copper and zinc up-regulate IRE activity in the APP 5'UTR. Desferrioxamine (high affinity Fe3+ chelator), (ii) clioquinol (low affinity Fe/Cu/Zn chelator), (iii) piperazine-1 (oral Fe chelator), (iv) VK-28 (oral Fe chelator), were tested for their relative modulation of APP 5' UTR directed translation of a luciferase reporter gene. Iron chelation based therapeutic strategies for slowing the progression of Alzheimer's disease (and other neurological disorders that manifest iron imbalance) are discussed with regard to the relative neural toxic action of each chelator in SH-SY5Y cells and in H4 glioblastoma cells.

  19. Pathway optimization by re-design of untranslated regions for L-tyrosine production in Escherichia coli

    PubMed Central

    Cheol Kim, Seong; Eun Min, Byung; Gyu Hwang, Hyun; Woo Seo, Sang; Yeol Jung, Gyoo

    2015-01-01

    L-tyrosine is a commercially important compound in the food, pharmaceutical, chemical, and cosmetic industries. Although several attempts have been made to improve L-tyrosine production, translation-level expression control and carbon flux rebalancing around phosphoenolpyruvate (PEP) node still remain to be achieved for optimizing the pathway. Here, we demonstrate pathway optimization by altering gene expression levels for L-tyrosine production in Escherichia coli. To optimize the L-tyrosine biosynthetic pathway, a synthetic constitutive promoter and a synthetic 5′-untranslated region (5′-UTR) were introduced for each gene of interest to allow for control at both transcription and translation levels. Carbon flux rebalancing was achieved by controlling the expression level of PEP synthetase using UTR Designer. The L-tyrosine productivity of the engineered E. coli strain was increased through pathway optimization resulting in 3.0 g/L of L-tyrosine titer, 0.0354 g L-tyrosine/h/g DCW of productivity, and 0.102 g L-tyrosine/g glucose yield. Thus, this work demonstrates that pathway optimization by 5′-UTR redesign is an effective strategy for the development of efficient L-tyrosine-producing bacteria. PMID:26346938

  20. Scan for Motifs: a webserver for the analysis of post-transcriptional regulatory elements in the 3' untranslated regions (3' UTRs) of mRNAs.

    PubMed

    Biswas, Ambarish; Brown, Chris M

    2014-06-08

    Gene expression in vertebrate cells may be controlled post-transcriptionally through regulatory elements in mRNAs. These are usually located in the untranslated regions (UTRs) of mRNA sequences, particularly the 3'UTRs. Scan for Motifs (SFM) simplifies the process of identifying a wide range of regulatory elements on alignments of vertebrate 3'UTRs. SFM includes identification of both RNA Binding Protein (RBP) sites and targets of miRNAs. In addition to searching pre-computed alignments, the tool provides users the flexibility to search their own sequences or alignments. The regulatory elements may be filtered by expected value cutoffs and are cross-referenced back to their respective sources and literature. The output is an interactive graphical representation, highlighting potential regulatory elements and overlaps between them. The output also provides simple statistics and links to related resources for complementary analyses. The overall process is intuitive and fast. As SFM is a free web-application, the user does not need to install any software or databases. Visualisation of the binding sites of different classes of effectors that bind to 3'UTRs will facilitate the study of regulatory elements in 3' UTRs.

  1. MiR-26a downregulates retinoblastoma in colorectal cancer.

    PubMed

    López-Urrutia, Eduardo; Coronel-Hernández, Jossimar; García-Castillo, Verónica; Contreras-Romero, Carlos; Martínez-Gutierrez, Antonio; Estrada-Galicia, Diana; Terrazas, Luis Ignacio; López-Camarillo, César; Maldonado-Martínez, Hector; Jacobo-Herrera, Nadia; Pérez-Plasencia, Carlos

    2017-04-01

    MicroRNAs are non-coding short RNAs that target the 3' untranslated region of messenger RNAs (mRNAs) and lead to their degradation or to translational repression. Several microRNAs have been designated as oncomirs, owing to their regulating tumor suppressor genes. Interestingly, a few of them have been found to target multiple genes whose simultaneous suppression contributes to the development of a tumoral phenotype. Here, we have showed that miR-26a is overexpressed in colorectal cancer data obtained from TCGA Research Network and in human colon cancer pathological specimens; moreover, an orthotopic in vivo model of colon cancer showed overexpression of miR-26a, while Rb1 expression inversely correlated to miR-26a in TCGA Research Network data, pathological samples, and the in vivo model. Then, by means of luciferase assay, we demonstrated that miR-26a targets the 3' untranslated region of Rb1 mRNA directly. This is, to our knowledge, the first report of miR-26a targeting Rb1 in colon cancer. The results of this study suggested that miR-26a could serve as a progression biomarker in colorectal cancer. Further validation studies are still needed to confirm our findings.

  2. An 8-generation family with X-linked Charcot-Marie-Tooth: Confirmation Of the pathogenicity Of a 3' untranslated region mutation in GJB1 and its clinical features.

    PubMed

    Chen, Dong-Hui; Ma, Maxwell; Scavina, Mena; Blue, Elizabeth; Wolff, John; Karna, Prasanthi; Dorschner, Michael O; Raskind, Wendy H; Bird, Thomas D

    2018-05-01

    Mutations in gap junction protein beta 1 (GJB1) on the X chromosome represent one of the most common causes of hereditary neuropathy. We assessed manifestations associated with a rare 3' untranslated region mutation (UTR) of GJB1 in a large family with X-linked Charcot-Marie-Tooth disease (CMTX). Clinical, electrophysiological, and molecular genetic analyses were performed on an 8-generation family with CMTX. There were 22 affected males and 19 symptomatic females, including an 83-year-old woman followed for 40 years. Electrophysiological studies showed a primarily axonal neuropathy. The c.*15C>T mutation in the GJB1 3' UTR was identified in 4 branches of the family with a log of odds (LOD) of 4.91. This created a BstE II enzyme recognition site that enabled detection by restriction digestion. The c.*15C>T mutation in the GJB1 3' UTR segregates with CMTX1 in 8 generations. Penetrance in males and females is essentially complete. A straightforward genetic method to detect this mutation is described. Muscle Nerve 57: 859-862, 2018. © 2017 Wiley Periodicals, Inc.

  3. Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

    PubMed Central

    Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

    1982-01-01

    We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673

  4. Deep learning of the regulatory grammar of yeast 5′ untranslated regions from 500,000 random sequences

    PubMed Central

    Groves, Benjamin; Kuchina, Anna; Rosenberg, Alexander B.; Jojic, Nebojsa; Fields, Stanley; Seelig, Georg

    2017-01-01

    Our ability to predict protein expression from DNA sequence alone remains poor, reflecting our limited understanding of cis-regulatory grammar and hampering the design of engineered genes for synthetic biology applications. Here, we generate a model that predicts the protein expression of the 5′ untranslated region (UTR) of mRNAs in the yeast Saccharomyces cerevisiae. We constructed a library of half a million 50-nucleotide-long random 5′ UTRs and assayed their activity in a massively parallel growth selection experiment. The resulting data allow us to quantify the impact on protein expression of Kozak sequence composition, upstream open reading frames (uORFs), and secondary structure. We trained a convolutional neural network (CNN) on the random library and showed that it performs well at predicting the protein expression of both a held-out set of the random 5′ UTRs as well as native S. cerevisiae 5′ UTRs. The model additionally was used to computationally evolve highly active 5′ UTRs. We confirmed experimentally that the great majority of the evolved sequences led to higher protein expression rates than the starting sequences, demonstrating the predictive power of this model. PMID:29097404

  5. Molecular cloning, expression pattern, and chemical analysis of heat shock protein 70 (HSP70) in the mudskipper Boleophthalmus pectinirostris: Evidence for its role in regulating spermatogenesis.

    PubMed

    Han, Ying-Li; Yang, Wan-Xi; Long, Ling-Li; Sheng, Zhang; Zhou, Yang; Zhao, Yong-Qiang; Wang, You-Fa; Zhu, Jun-Quan

    2016-01-10

    Heat shock protein 70 (HSP70) is molecular chaperone that is important for reproductive biological processes. In this study, a full length HSP70 from the mudskipper (Boleophthalmus pectinirostris) was characterized. It was found to contain: a 108 bp 5'-untranslated region, a 208 bp 3'-untranslated region, and a 1953 bp open reading frame, which encodes a protein of 650 amino acids with a theoretical molecular weight of 71.1 kDa and an isoelectric point of 5.17. RT-PCR analysis revealed that HSP70 was ubiquitously expressed in all major tissues with differential expression levels. This suggests that HSP70 has vital and conserved biological functions. HSP70 was localized mainly in the cytoplasm of germinal cells, indicating an important role of this protein during spermatogenesis. In response to heat stress, the testes presented abnormal morphology in connective tissues, in which HSP70 immunoreactivity was not observed. HSP70 mRNA expression in the gill, liver, and testes was significantly increased, which suggests that HSP70 plays an important role in protection against heat stress. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Recurrent Loss of Specific Introns during Angiosperm Evolution

    PubMed Central

    Wang, Hao; Devos, Katrien M.; Bennetzen, Jeffrey L.

    2014-01-01

    Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. PMID:25474210

  7. Identification of Genetic Elements Associated with EPSPS Gene Amplification

    PubMed Central

    Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

    2013-01-01

    Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434

  8. Molecular differentiation of Russian wild ginseng using mitochondrial nad7 intron 3 region.

    PubMed

    Li, Guisheng; Cui, Yan; Wang, Hongtao; Kwon, Woo-Saeng; Yang, Deok-Chun

    2017-07-01

    Cultivated ginseng is often introduced as a substitute and adulterant of Russian wild ginseng due to its lower cost or misidentification caused by similarity in appearance with wild ginseng. The aim of this study is to develop a simple and reliable method to differentiate Russian wild ginseng from cultivated ginseng. The mitochondrial NADH dehydrogenase subunit 7 ( nad 7) intron 3 regions of Russian wild ginseng and Chinese cultivated ginseng were analyzed. Based on the multiple sequence alignment result, a specific primer for Russian wild ginseng was designed by introducing additional mismatch and allele-specific polymerase chain reaction (PCR) was performed for identification of wild ginseng. Real-time allele-specific PCR with endpoint analysis was used for validation of the developed Russian wild ginseng single nucleotide polymorphism (SNP) marker. An SNP site specific to Russian wild ginseng was exploited by multiple alignments of mitochondrial nad 7 intron 3 regions of different ginseng samples. With the SNP-based specific primer, Russian wild ginseng was successfully discriminated from Chinese and Korean cultivated ginseng samples by allele-specific PCR. The reliability and specificity of the SNP marker was validated by checking 20 individuals of Russian wild ginseng samples with real-time allele-specific PCR assay. An effective DNA method for molecular discrimination of Russian wild ginseng from Chinese and Korean cultivated ginseng was developed. The established real-time allele-specific PCR was simple and reliable, and the present method should be a crucial complement of chemical analysis for authentication of Russian wild ginseng.

  9. WES homozygosity mapping in a recessive form of Charcot-Marie-Tooth neuropathy reveals intronic GDAP1 variant leading to a premature stop codon.

    PubMed

    Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya

    2018-05-01

    Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 GDAP1-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.

  10. Trans-activation of the Tetrahymena group I intron ribozyme via a non-native RNA-RNA interaction.

    PubMed Central

    Ikawa, Y; Shiraishi, H; Inoue, T

    1999-01-01

    The peripheral P2.1 domain of the Tetrahymena group I intron ribozyme has been shown to be non-essential for splicing. We found, however, that separately prepared P2.1 RNA efficiently accelerates the 3' splice-site-specific hydrolysis reaction of a mutant ribozyme lacking both P2.1 and its upstream region in trans. We report here the unusual properties of this trans-activation. Compensatory mutational analysis revealed that non-native long-range base-pairings between the loop region of P2.1 RNA and L5c region of the mutant ribozyme are needed for the activation in spite of the fact that P2.1 forms base-pairings with P9.1 in the Tetrahymena ribozyme. The trans -activation depends on the non-native RNA-RNA interaction together with the higher order structure of P2.1 RNA. This activation is unique among the known trans-activations that utilize native tertiary interactions or RNA chaperons. PMID:10075996

  11. Fine-mapping identifies multiple prostate cancer risk loci at 5p15, one of which associates with TERT expression

    PubMed Central

    Kote-Jarai, Zsofia; Saunders, Edward J.; Leongamornlert, Daniel A.; Tymrakiewicz, Malgorzata; Dadaev, Tokhir; Jugurnauth-Little, Sarah; Ross-Adams, Helen; Al Olama, Ali Amin; Benlloch, Sara; Halim, Silvia; Russel, Roslin; Dunning, Alison M.; Luccarini, Craig; Dennis, Joe; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Muir, Ken; Giles, Graham G.; Severi, Gianluca; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Lindstrom, Sara; Kraft, Peter; Hunter, David J.; Gapstur, Susan; Chanock, Stephen; Berndt, Sonja I.; Albanes, Demetrius; Andriole, Gerald; Schleutker, Johanna; Weischer, Maren; Canzian, Federico; Riboli, Elio; Key, Tim J.; Travis, Ruth C.; Campa, Daniele; Ingles, Sue A.; John, Esther M.; Hayes, Richard B.; Pharoah, Paul; Khaw, Kay-Tee; Stanford, Janet L.; Ostrander, Elaine A.; Signorello, Lisa B.; Thibodeau, Stephen N.; Schaid, Dan; Maier, Christiane; Vogel, Walther; Kibel, Adam S.; Cybulski, Cezary; Lubinski, Jan; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y.; Kaneva, Radka; Batra, Jyotsna; Spurdle, Amanda; Clements, Judith A.; Teixeira, Manuel R.; Govindasami, Koveela; Guy, Michelle; Wilkinson, Rosemary A.; Sawyer, Emma J.; Morgan, Angela; Dicks, Ed; Baynes, Caroline; Conroy, Don; Bojesen, Stig E.; Kaaks, Rudolf; Vincent, Daniel; Bacot, François; Tessier, Daniel C.; Easton, Douglas F.; Eeles, Rosalind A.

    2013-01-01

    Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease. PMID:23535824

  12. Genetic Variation among Major Human Geographic Groups Supports a Peculiar Evolutionary Trend in PAX9

    PubMed Central

    Paixão-Côrtes, Vanessa R.; Meyer, Diogo; Pereira, Tiago V.; Mazières, Stéphane; Elion, Jacques; Krishnamoorthy, Rajagopal; Zago, Marco A.; Silva, Wilson A.; Salzano, Francisco M.; Bortolini, Maria Cátira

    2011-01-01

    A total of 172 persons from nine South Amerindian, three African and one Eskimo populations were studied in relation to the Paired box gene 9 (PAX9) exon 3 (138 base pairs) as well as its 5′and 3′flanking intronic segments (232 bp and 220 bp, respectively) and integrated with the information available for the same genetic region from individuals of different geographical origins. Nine mutations were scored in exon 3 and six in its flanking regions; four of them are new South American tribe-specific singletons. Exon3 nucleotide diversity is several orders of magnitude higher than its intronic regions. Additionally, a set of variants in the PAX9 and 101 other genes related with dentition can define at least some dental morphological differences between Sub-Saharan Africans and non-Africans, probably associated with adaptations after the modern human exodus from Africa. Exon 3 of PAX9 could be a good molecular example of how evolvability works. PMID:21298044

  13. High variability of mitochondrial gene order among fungi.

    PubMed

    Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni

    2014-02-01

    From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.

  14. Origin and evolution of spliceosomal introns

    PubMed Central

    2012-01-01

    Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section. PMID:22507701

  15. Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.

    PubMed

    Edwards, Scott V; Cloutier, Alison; Baker, Allan J

    2017-11-01

    Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.

  16. Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics

    PubMed Central

    Cloutier, Alison; Baker, Allan J.

    2017-01-01

    Abstract Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. PMID:28637293

  17. Patterns and rates of intron divergence between humans and chimpanzees

    PubMed Central

    Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi

    2007-01-01

    Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804

  18. Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium.

    PubMed

    Ma, Ming-Yue; Che, Xun-Ru; Porceddu, Andrea; Niu, Deng-Ke

    2015-12-18

    Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5' end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss.

  19. Primary structure of stanniocalcin in two basal Actinopterygii.

    PubMed

    Amemiya, Yutaka; Youson, John H

    2004-01-15

    The primary structure of stanniocalcin (STC), the principal product of the corpuscles of Stannius (CS) in ray-finned fishes, was deduced from STC cDNA clones for two species of holostean, the gar, Lepisosteus osseus and the bowfin, Amia calva. Overlapping partial cDNA clones were amplified by polymerase chain reaction (PCR) from single-strand cDNA of the CS. Excluding the poly(A) tail, the cDNAs of 1863 base pairs [bp] (gar) and 914 bp (bowfin) contained the 5' untranslated region followed by the coding region and the 3' untranslated region. Both the gar and bowfin STC cDNA encode a prehormone of 252 amino acids (aa) with a signal peptide of 32 aa and a mature protein of 220 aa. The deduced aa sequence of gar STC shows 87% identity with bowfin STC, 60-72% identity with most vertebrate STCs and 26% identity with mouse STC2. Phylogenetic analysis of the sequences support a view that the gar and bowfin form a monophyletic holostean clade. RT-PCR revealed in the gar and bowfin that, just as in mammals and rainbow trout, the expression of STC mRNA is widely spread in many tissues and organs. Since the gar and bowfin are representatives of the most ancient fishes known to possess CS, the corpuscular-derived STC molecule in fish has had a conserved evolution.

  20. An Indel Polymorphism in the MtnA 3' Untranslated Region Is Associated with Gene Expression Variation and Local Adaptation in Drosophila melanogaster

    PubMed Central

    Glaser-Schmitt, Amanda; Duchen, Pablo; Parsch, John

    2016-01-01

    Insertions and deletions (indels) are a major source of genetic variation within species and may result in functional changes to coding or regulatory sequences. In this study we report that an indel polymorphism in the 3’ untranslated region (UTR) of the metallothionein gene MtnA is associated with gene expression variation in natural populations of Drosophila melanogaster. A derived allele of MtnA with a 49-bp deletion in the 3' UTR segregates at high frequency in populations outside of sub-Saharan Africa. The frequency of the deletion increases with latitude across multiple continents and approaches 100% in northern Europe. Flies with the deletion have more than 4-fold higher MtnA expression than flies with the ancestral sequence. Using reporter gene constructs in transgenic flies, we show that the 3' UTR deletion significantly contributes to the observed expression difference. Population genetic analyses uncovered signatures of a selective sweep in the MtnA region within populations from northern Europe. We also find that the 3’ UTR deletion is associated with increased oxidative stress tolerance. These results suggest that the 3' UTR deletion has been a target of selection for its ability to confer increased levels of MtnA expression in northern European populations, likely due to a local adaptive advantage of increased oxidative stress tolerance. PMID:27120580

  1. Analysis of nonuniformity in intron phase distribution.

    PubMed Central

    Fedorov, A; Suboch, G; Bujakov, M; Fedorova, L

    1992-01-01

    The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling. PMID:1598214

  2. Tissue- and case-specific retention of intron 40 in mature dystrophin mRNA.

    PubMed

    Nishida, Atsushi; Minegishi, Maki; Takeuchi, Atsuko; Niba, Emma Tabe Eko; Awano, Hiroyuki; Lee, Tomoko; Iijima, Kazumoto; Takeshima, Yasuhiro; Matsuo, Masafumi

    2015-06-01

    The dystrophin gene, which is mutated in Duchenne muscular dystrophy (DMD), comprises 79 exons that show multiple alternative splicing events. Intron retention, a type of alternative splicing, may control gene expression. We examined intron retention in dystrophin introns by reverse-transcription PCR from skeletal muscle, focusing on the nine shortest (all <1000 bp), because these are more likely to be retained. Only one, intron 40, was retained in mRNA; sequencing revealed insertion of a complete intron 40 (851 nt) between exons 40 and 41. The intron 40 retention product accounted for 1.2% of the total product but had a premature stop codon at the fifth intronic codon. Intron 40 retention was most strongly observed in the kidney (36.6%) and was not obtained from the fetal liver, lung, spleen or placenta. This indicated that intron retention is a tissue-specific event whose level varies among tissues. In two DMD patients, intron 40 retention was observed in one patient but not in the other. Examination of splicing regulatory factors revealed that intron 40 had the highest guanine-cytosine content of all examined introns in a 30-nt segment at its 3' end. Further studies are needed to clarify the biological role of intron 40-retained dystrophin mRNA.

  3. An intronic open reading frame was released from one of group II introns in the mitochondrial genome of the haptophyte Chrysochromulina sp. NIES-1333

    PubMed Central

    Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji

    2014-01-01

    Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084

  4. Genetic Origins of Lactase Persistence and the Spread of Pastoralism in Africa

    PubMed Central

    Ranciaro, Alessia; Campbell, Michael C.; Hirbo, Jibril B.; Ko, Wen-Ya; Froment, Alain; Anagnostou, Paolo; Kotze, Maritha J.; Ibrahim, Muntaser; Nyambo, Thomas; Omar, Sabah A.; Tishkoff, Sarah A.

    2014-01-01

    In humans, the ability to digest lactose, the sugar in milk, declines after weaning because of decreasing levels of the enzyme lactase-phlorizin hydrolase, encoded by LCT. However, some individuals maintain high enzyme amounts and are able to digest lactose into adulthood (i.e., they have the lactase-persistence [LP] trait). It is thought that selection has played a major role in maintaining this genetically determined phenotypic trait in different human populations that practice pastoralism. To identify variants associated with the LP trait and to study its evolutionary history in Africa, we sequenced MCM6 introns 9 and 13 and ∼2 kb of the LCT promoter region in 819 individuals from 63 African populations and in 154 non-Africans from nine populations. We also genotyped four microsatellites in an ∼198 kb region in a subset of 252 individuals to reconstruct the origin and spread of LP-associated variants in Africa. Additionally, we examined the association between LP and genetic variability at candidate regulatory regions in 513 individuals from eastern Africa. Our analyses confirmed the association between the LP trait and three common variants in intron 13 (C-14010, G-13907, and G-13915). Furthermore, we identified two additional LP-associated SNPs in intron 13 and the promoter region (G-12962 and T-956, respectively). Using neutrality tests based on the allele frequency spectrum and long-range linkage disequilibrium, we detected strong signatures of recent positive selection in eastern African populations and the Fulani from central Africa. In addition, haplotype analysis supported an eastern African origin of the C-14010 LP-associated mutation in southern Africa. PMID:24630847

  5. Three distinct modes of intron dynamics in the evolution of eukaryotes.

    PubMed

    Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V

    2007-07-01

    Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.

  6. Changes in miRNAs Signal High-Risk HPV Infections | Center for Cancer Research

    Cancer.gov

    microRNAs (miRNAs) are approximately 21 nucleotide long, non-coding RNAs that regulate the expression of certain proteins. As part of the RNA-induced silencing complex or RISC, miRNAs bind to complementary sequences in the 3’ untranslated regions of target messenger RNAs, blocking protein synthesis and sometimes leading to the destruction of the target RNA. Numerous studies

  7. First Complete Genome Sequence of Zika Virus (Flaviviridae, Flavivirus) from an Autochthonous Transmission in Brazil

    PubMed Central

    Cunha, Mariana Sequetin; Esposito, Danillo Lucas Alves; Rocco, Iray Maria; Maeda, Adriana Yurika; Vasami, Fernanda Gisele Silva; Nogueira, Juliana Silva; de Souza, Renato Pereira; Suzuki, Akemi; Addas-Carvalho, Marcelo; Barjas-Castro, Maria de Lourdes; Resende, Mariângela Ribeiro; Stucchi, Raquel Silveira Bello; Boin, Ilka de Fátima Santana Ferreira; Katz, Gizelda; Angerami, Rodrigo Nogueira

    2016-01-01

    We report here the genome sequence of Zika virus, strain ZikaSPH2015, containing all structural and nonstructural proteins flanked by the 5′ and 3′ untranslated region. It was isolated in São Paulo state, Brazil, in 2015, from a patient who received a blood transfusion from an asymptomatic donor at the time of donation. PMID:26941134

  8. A stem–loop structure in the 59 untranslated region of bean pod mottle virus RNA2 is specifically required for RNA2 accumulation

    USDA-ARS?s Scientific Manuscript database

    Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus of the family Secoviridae. Its RNA1 encodes all proteins needed for genome replication and is capable of autonomous replication. By contrast, BPMV RNA2 must utilize RNA1-encoded proteins for replication. Here, we sought ...

  9. Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

    PubMed

    Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

    1994-07-08

    The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.

  10. Genetic characterization of Common Eiders breeding in the Yukon-Kuskokwim Delta, Alaska

    USGS Publications Warehouse

    Sonsthagen, Sarah A.; Talbot, Sandra L.; McCracken, Kevin G.

    2007-01-01

    We assessed population genetic subdivision among four colonies of Common Eiders (Somateria mollissima v-nigrum) breeding in the Yukon-Kuskokwim Delta (YKD), Alaska, using microsatellite genotypes and DNA sequences with differing modes of inheritance. Significant, albeit low, levels of genetic differentiation were observed between mainland populations and Kigigak Island for nuclear intron lamin A and mitochondrial DNA (mtDNA) control region. Intercolony variation in haplotypic frequencies also was observed at mtDNA. Positive growth signatures assayed from microsatellites, nuclear introns, and mtDNA indicate recent colonization of the YKD, and may explain the low levels of structuring observed. Gene flow estimates based on microsatellites, nuclear introns, and mtDNA suggest asymmetrical gene flow between mainland colonies and Kigigak Island, with more individuals on average dispersing from mainland populations to Kigigak Island than vice versa. The directionality of gene flow observed may be explained by the colonization of the YKD from northern glacial refugia or by YKD metapopulation dynamics.

  11. HybPiper: Extracting coding sequence and introns for phylogenetics from high-throughput sequencing reads using target enrichment1

    PubMed Central

    Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.

    2016-01-01

    Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175

  12. Intron open reading frames as mobile elements and evolution of a group I intron.

    PubMed

    Sellem, C H; Belcour, L

    1997-05-01

    Group I introns are proposed to have become mobile following the acquisition of open reading frames (ORFs) that encode highly specific DNA endonucleases. This proposal implies that intron ORFs could behave as autonomously mobile entities. This was supported by abundant circumstantial evidence but no experiment of ORF transfer from an ORF-containing intron to its ORF-less counterpart has been described. In this paper we present such experiments, which demonstrate the efficient mobility of the mitochondrial nad1-i4-orf1 between two Podospora strains. The homing of this mobile ORF was accompanied by a bidirectional co-conversion that did not systematically involve the whole intron sequence. Orf1 acquisition would be the most recent step in the evolution of the nad1-i4 intron, which has resulted in many strains of Podospora having an intron with two ORFs (biorfic) and four splicing pathways. We show that two of the splicing events that operate in this biorfic intron, as evidenced by PCR experiments, are generated by a 5'-alternative splice site, which is most probably a remnant of the monoorfic ancestral form of the intron. We propose a sequential evolution model that is consistent with the four organizations of the corresponding nad1 locus that we found among various species of the Pyrenomycete family; these organizations consist of no intron, an intron alone, a monoorfic intron, and a biorfic intron.

  13. Methylation of an alpha-foetoprotein gene intragenic site modulates gene activity.

    PubMed Central

    Opdecamp, K; Rivière, M; Molné, M; Szpirer, J; Szpirer, C

    1992-01-01

    By comparing the methylation pattern of Mspl/Hpall sites in the 5' region of the mouse alpha-foetoprotein (AFP) gene of different cells (hepatoma cells, foetal and adult liver, fibroblasts), we found a correlation between gene expression and unmethylation of a site located in the first intron of the gene. Other sites did not show this correlation. In transfection experiments of unmethylated and methylated AFP-CAT chimeric constructions, we then showed that methylation of the intronic site negatively modulates expression of CAT activity. We also found that a DNA segment centered on this site binds nuclear proteins; however methylation did not affect protein binding. Images PMID:1371343

  14. The brown algae Pl.LSU/2 group II intron-encoded protein has functional reverse transcriptase and maturase activities.

    PubMed

    Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier

    2013-01-01

    Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner.

  15. DIVERSITY OF THE TYPE 1 INTRON-ITS REGION OF THE 18S rRNA GENE IN PSEUDOGYMNOASCUS SPECIES FROM THE RED HILLS OF KANSAS.

    PubMed

    Chen, Xi; Crupper, Scott S

    2016-09-01

    Gypsum caves found throughout the Red Hills of Kansas have the state's most diverse and largest population of cave-roosting bats. White-nose syndrome (WNS), a disease caused by the fungus Pseudogymnoascus destructans, which threatens all temperate bat species, has not been previously detected in the gypsum caves as this disease moves westward from the eastern United States. Cave soil was obtained from the gypsum caves, and using the polymerase chain reaction, a 624-nucleotide DNA fragment specific to the Type 1 intron-internal transcribed spacer region of the 18S rRNA gene from Pseudogymnoascus species was amplified. Subsequent cloning and DNA sequencing indicated P. destructans DNA was present, along with 26 uncharacterized Pseudogymnoascus DNA variants. However, no evidence of WNS was observed in bat populations residing in these caves.

  16. Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

    PubMed

    Kawaguchi, Risa; Kiryu, Hisanori

    2016-05-06

    RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

  17. Phylogenetics and Gene Structure Dynamics of Polygalacturonase Genes in Aspergillus and Neurospora crassa

    PubMed Central

    Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul

    2013-01-01

    Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950

  18. RNA structure in splicing: An evolutionary perspective.

    PubMed

    Lin, Chien-Ling; Taggart, Allison J; Fairbrother, William G

    2016-09-01

    Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.

  19. Phylogenetic Distribution of Intron Positions in Alpha-Amylase Genes of Bilateria Suggests Numerous Gains and Losses

    PubMed Central

    Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise

    2011-01-01

    Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157

  20. Conservation and Sex-Specific Splicing of the transformer Gene in the Calliphorids Cochliomyia hominivorax, Cochliomyia macellaria and Lucilia sericata

    PubMed Central

    Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.

    2013-01-01

    Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170

  1. Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution

    PubMed Central

    Lambowitz, Alan M.; Belfort, Marlene

    2015-01-01

    SUMMARY This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome’s small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns. PMID:25878921

  2. De novo insertion of an intron into the mammalian sex determining gene, SRY

    PubMed Central

    O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall

    1998-01-01

    Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071

  3. Negative effect of the 5'-untranslated leader sequence on Ac transposon promoter expression.

    PubMed

    Scortecci, K C; Raina, R; Fedoroff, N V; Van Sluys, M A

    1999-08-01

    Transposable elements are used in heterologous plant hosts to clone genes by insertional mutagenesis. The Activator (Ac) transposable element has been cloned from maize, and introduced into a variety of plants. However, differences in regulation and transposition frequency have been observed between different host plants. The cause of this variability is still unknown. To better understand the activity of the Ac element, we analyzed the Ac promoter region and its 5'-untranslated leader sequence (5' UTL). Transient assays in tobacco NT1 suspension cells showed that the Ac promoter is a weak promoter and its activity was localized by deletion analyses. The data presented here indicate that the core of the Ac promoter is contained within 153 bp fragment upstream to transcription start sites. An important inhibitory effect (80%) due to the presence of the 5' UTL was found on the expression of LUC reporter gene. Here we demonstrate that the presence of the 5' UTL in the constructs reduces the expression driven by either strong or weak promoters.

  4. Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions.

    PubMed

    Lee, Hae-Lim; Jansen, Robert K; Chumley, Timothy W; Kim, Ki-Joong

    2007-05-01

    The chloroplast (cp) DNA sequence of Jasminum nudiflorum (Oleaceae-Jasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4-psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4-trnS-ycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.

  5. Intron-loss evolution of hatching enzyme genes in Teleostei

    PubMed Central

    2010-01-01

    Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321

  6. The Brown Algae Pl.LSU/2 Group II Intron-Encoded Protein Has Functional Reverse Transcriptase and Maturase Activities

    PubMed Central

    Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier

    2013-01-01

    Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner. PMID:23505475

  7. Introns in Cryptococcus.

    PubMed

    Janbon, Guilhem

    2018-01-01

    In Cryptococcus neoformans, nearly all genes are interrupted by small introns. In recent years, genome annotation and genetic analysis have illuminated the major roles these introns play in the biology of this pathogenic yeast. Introns are necessary for gene expression and alternative splicing can regulate gene expression in response to environmental cues. In addition, recent studies have revealed that C. neoformans introns help to prevent transposon dissemination and protect genome integrity. These characteristics of cryptococcal introns are probably not unique to Cryptococcus, and this yeast likely can be considered as a model for intron-related studies in fungi.

  8. DNA double-strand break in vivo at the 3' extremity of exons located upstream of group II introns. Senescence and circular DNA introns in Podospora mitochondria.

    PubMed

    Sainsard-Chanet, A; Begel, O; Belcour, L

    1994-10-07

    In the filamentous fungus Podospora anserina, the unavoidable phenomenon of senescence is associated with the amplification of the first intron of the mitochondrial cox1 that accumulates as circular DNA molecules consisting of tandem repeats. This group II intron (cox1-i1 or alpha) is able to transpose and contains an open reading frame with significant amino acid similarity with reverse transcriptases. The generation of these intronic circular DNA molecules, their amplification and their involvement in the senescence process are unresolved questions. We demonstrate here that: (1) another group II intron, the fourth intron of gene cox1, cox1-i4, is also able to give precise DNA end to end junctions; (2) this intronic sequence can be found amplified during senescence, although to a lesser extent than cox1-i1; (3) the amplification of the DNA multimeric cox1-i1 molecules likely does not proceed by autonomous replication; (4) the generation of the DNA intronic circles does not require efficient intron splicing; (5) a DNA double-strand break occurs in vivo at the 3' extremity of the cox1-e1 and cox1-e4 exons preceding the group II introns that form circular DNAs. On the whole, these results show that the ability to form DNA circular molecules is a property of some group II introns and they demonstrate the occurrence of a specific DNA cleavage at or near the integration site of these group II introns. The results strongly suggest that this cleavage is involved in the formation of the group II intronic DNA circles and could also be involved in the phenomenon of group II intron homing.

  9. Diversity in mRNA expression of the serine-type carboxypeptidase ocpG in Aspergillus oryzae through intron retention.

    PubMed

    Ishida, Ken; Kuboshima, Megumi; Morita, Hiroto; Maeda, Hiroshi; Okamoto, Ayako; Takeuchi, Michio; Yamagata, Youhei

    2014-01-01

    Alternative splicing is thought to be a means for diversification of products by mRNA modification. Although some intron retentions are predicted by transcriptome analysis in Aspergillus oryzae, its physiological significance remains unknown. We found that intron retention occurred occasionally in the serine-type carboxypeptidase gene, ocpG. Analysis under various culture conditions revealed that extracellular nitrogen conditions influence splicing patterns; this suggested that there might be a correlation between splicing efficiency and the necessity of OcpG activity for obtaining a nitrogen source. Since further analysis showed that splicing occurred independently in each intron, we constructed ocpG intron-exchanging strain by interchanging the positions of intron-1 and intron-2. The splicing pattern indicated the probability that ocpG intron retention was affected by the secondary structures of intronic mRNA.

  10. Alternative polyadenylation of tumor suppressor genes in small intestinal neuroendocrine tumors.

    PubMed

    Rehfeld, Anders; Plass, Mireya; Døssing, Kristina; Knigge, Ulrich; Kjær, Andreas; Krogh, Anders; Friis-Hansen, Lennart

    2014-01-01

    The tumorigenesis of small intestinal neuroendocrine tumors (SI-NETs) is poorly understood. Recent studies have associated alternative polyadenylation (APA) with proliferation, cell transformation, and cancer. Polyadenylation is the process in which the pre-messenger RNA is cleaved at a polyA site and a polyA tail is added. Genes with two or more polyA sites can undergo APA. This produces two or more distinct mRNA isoforms with different 3' untranslated regions. Additionally, APA can also produce mRNAs containing different 3'-terminal coding regions. Therefore, APA alters both the repertoire and the expression level of proteins. Here, we used high-throughput sequencing data to map polyA sites and characterize polyadenylation genome-wide in three SI-NETs and a reference sample. In the tumors, 16 genes showed significant changes of APA pattern, which lead to either the 3' truncation of mRNA coding regions or 3' untranslated regions. Among these, 11 genes had been previously associated with cancer, with 4 genes being known tumor suppressors: DCC, PDZD2, MAGI1, and DACT2. We validated the APA in three out of three cases with quantitative real-time-PCR. Our findings suggest that changes of APA pattern in these 16 genes could be involved in the tumorigenesis of SI-NETs. Furthermore, they also point to APA as a new target for both diagnostic and treatment of SI-NETs. The identified genes with APA specific to the SI-NETs could be further tested as diagnostic markers and drug targets for disease prevention and treatment.

  11. Alternative Polyadenylation of Tumor Suppressor Genes in Small Intestinal Neuroendocrine Tumors

    PubMed Central

    Rehfeld, Anders; Plass, Mireya; Døssing, Kristina; Knigge, Ulrich; Kjær, Andreas; Krogh, Anders; Friis-Hansen, Lennart

    2014-01-01

    The tumorigenesis of small intestinal neuroendocrine tumors (SI-NETs) is poorly understood. Recent studies have associated alternative polyadenylation (APA) with proliferation, cell transformation, and cancer. Polyadenylation is the process in which the pre-messenger RNA is cleaved at a polyA site and a polyA tail is added. Genes with two or more polyA sites can undergo APA. This produces two or more distinct mRNA isoforms with different 3′ untranslated regions. Additionally, APA can also produce mRNAs containing different 3′-terminal coding regions. Therefore, APA alters both the repertoire and the expression level of proteins. Here, we used high-throughput sequencing data to map polyA sites and characterize polyadenylation genome-wide in three SI-NETs and a reference sample. In the tumors, 16 genes showed significant changes of APA pattern, which lead to either the 3′ truncation of mRNA coding regions or 3′ untranslated regions. Among these, 11 genes had been previously associated with cancer, with 4 genes being known tumor suppressors: DCC, PDZD2, MAGI1, and DACT2. We validated the APA in three out of three cases with quantitative real-time-PCR. Our findings suggest that changes of APA pattern in these 16 genes could be involved in the tumorigenesis of SI-NETs. Furthermore, they also point to APA as a new target for both diagnostic and treatment of SI-NETs. The identified genes with APA specific to the SI-NETs could be further tested as diagnostic markers and drug targets for disease prevention and treatment. PMID:24782827

  12. Characterization of regulatory elements within the coat protein (CP) coding region of Tobacco mosaic virus affecting subgenomic transcription and green fluorescent protein expression from the CP subgenomic RNA promoter.

    PubMed

    Man, Michal; Epel, Bernard L

    2004-06-01

    A replicon based on Tobacco mosaic virus that was engineered to express the open reading frame (ORF) of the green fluorescent protein (GFP) gene in place of the native coat protein (CP) gene from a minimal CP subgenomic (sg) RNA promoter was found to accumulate very low levels of GFP. Regulatory regions within the CP ORF were identified that, when presented as untranslated regions flanking the GFP ORF, enhanced or inhibited sg transcription and GFP expression. Full GFP expression from the CP sgRNA promoter required more than the first 20 nt of the CP ORF but not beyond the first 56 nt. Further analysis indicated the presence of an enhancer element between nt +25 and +55 with respect to the CP translation start site. The inclusion of this enhancer sequence upstream of the GFP ORF led to elevated sg transcription and to a 50-fold increase in GFP accumulation in comparison with a minimal CP promoter in which the entire CP ORF was displaced by the GFP ORF. Inclusion of the 3'-terminal 22 nt had a minor positive effect on GFP accumulation, but the addition of extended untranslated sequences from the 3' terminus of the CP ORF downstream of the GFP ORF was basically found to inhibit sg transcription. Secondary structure analysis programs predicted the CP sgRNA promoter to reside within two stable stem-loop structures, which are followed by an enhancer region.

  13. Genome-wide identification of aquaporin encoding genes in Brassica oleracea and their phylogenetic sequence comparison to Brassica crops and Arabidopsis

    PubMed Central

    Diehn, Till A.; Pommerrenig, Benjamin; Bernhardt, Nadine; Hartmann, Anja; Bienert, Gerd P.

    2015-01-01

    Aquaporins (AQPs) are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia, and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea) and other Brassica species. The recent releases of the genome sequences of B. oleracea and Brassica rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins. In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of Arabidopsis thaliana, B. oleracea, B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re-) name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of Brassica crop AQPs. PMID:25904922

  14. Alternative RNA splicing and gastric cancer.

    PubMed

    Li, Ying; Yuan, Yuan

    2017-07-01

    Alternative splicing (AS) linked to diseases, especially to tumors. Recently, more and more studies focused on the relationship between AS and gastric cancer (GC). This review surveyed the hot topic from four aspects: First, the common types of AS in cancer, including exon skipping, intron retention, mutually exclusive exon, alternative 5 ' or 3' splice site, alternative first or last exon and alternative 3' untranslated regions. Second, basic mechanisms of AS and its relationship with cancer. RNA splicing in eukaryotes follows the GT-AG rule by both cis-elements and trans-acting factors regulatory. Through RNA splicing, different proteins with different forms and functions can be produced and may be associated with carcinogenesis. Third, AS types of GC-related genes and their splicing variants. In this paper, we listed 10 common genes with AS and illustrated its possible molecular mechanisms owing to genetic variation (mutation and /or polymorphism). Fourth, the splicing variants of GC-associated genes and gastric carcinogenesis, invasion and metastasis. Many studies have found that the different splicing variants of the same gene are differentially expressed in GC and its precancerous diseases, suggesting AS has important implications in GC development. Taking together, this review highlighted the role of AS and splicing variants in the process of GC. We hope that this is not only beneficial to advances in the study field of GC, but also can provide valuable information to other similar tumor research.Although we already know some gene splicing and splicing variants play an important role in the development of GC, but many phenomena and mechanisms are still unknown. For example, how the tumor microenvironment and signal transduction pathway effect the forming and function of AS? Unfortunately, this review did not cover the contents because the current study is limited. It is no doubt that clarifying the phenomena and mechanisms of these unknown may help to reveal the relationship of AS with complex tumor genetic variation and the occurrence and development of tumors. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. Development of unigene-derived SSR markers in cowpea (Vigna unguiculata) and their transferability to other Vigna species.

    PubMed

    Gupta, S K; Gopalakrishna, T

    2010-07-01

    Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.

  16. Variability of Creatine Metabolism Genes in Children with Autism Spectrum Disorder.

    PubMed

    Cameron, Jessie M; Levandovskiy, Valeriy; Roberts, Wendy; Anagnostou, Evdokia; Scherer, Stephen; Loh, Alvin; Schulze, Andreas

    2017-07-31

    Creatine deficiency syndrome (CDS) comprises three separate enzyme deficiencies with overlapping clinical presentations: arginine:glycine amidinotransferase ( GATM gene, glycine amidinotransferase), guanidinoacetate methyltransferase ( GAMT gene), and creatine transporter deficiency ( SLC6A8 gene, solute carrier family 6 member 8). CDS presents with developmental delays/regression, intellectual disability, speech and language impairment, autistic behaviour, epileptic seizures, treatment-refractory epilepsy, and extrapyramidal movement disorders; symptoms that are also evident in children with autism. The objective of the study was to test the hypothesis that genetic variability in creatine metabolism genes is associated with autism. We sequenced GATM , GAMT and SLC6A8 genes in 166 patients with autism (coding sequence, introns and adjacent untranslated regions). A total of 29, 16 and 25 variants were identified in each gene, respectively. Four variants were novel in GATM , and 5 in SLC6A8 (not present in the 1000 Genomes, Exome Sequencing Project (ESP) or Exome Aggregation Consortium (ExAC) databases). A single variant in each gene was identified as non-synonymous, and computationally predicted to be potentially damaging. Nine variants in GATM were shown to have a lower minor allele frequency (MAF) in the autism population than in the 1000 Genomes database, specifically in the East Asian population (Fisher's exact test). Two variants also had lower MAFs in the European population. In summary, there were no apparent associations of variants in GAMT and SLC6A8 genes with autism. The data implying there could be a lower association of some specific GATM gene variants with autism is an observation that would need to be corroborated in a larger group of autism patients, and with sub-populations of Asian ethnicities. Overall, our findings suggest that the genetic variability of creatine synthesis/transport is unlikely to play a part in the pathogenesis of autism spectrum disorder (ASD) in children.

  17. Muscle wasting in myotonic dystrophies: a model of premature aging.

    PubMed

    Mateos-Aierdi, Alba Judith; Goicoechea, Maria; Aiastui, Ana; Fernández-Torrón, Roberto; Garcia-Puga, Mikel; Matheu, Ander; López de Munain, Adolfo

    2015-01-01

    Myotonic dystrophy type 1 (DM1 or Steinert's disease) and type 2 (DM2) are multisystem disorders of genetic origin. Progressive muscular weakness, atrophy and myotonia are the most prominent neuromuscular features of these diseases, while other clinical manifestations such as cardiomyopathy, insulin resistance and cataracts are also common. From a clinical perspective, most DM symptoms are interpreted as a result of an accelerated aging (cataracts, muscular weakness and atrophy, cognitive decline, metabolic dysfunction, etc.), including an increased risk of developing tumors. From this point of view, DM1 could be described as a progeroid syndrome since a notable age-dependent dysfunction of all systems occurs. The underlying molecular disorder in DM1 consists of the existence of a pathological (CTG) triplet expansion in the 3' untranslated region (UTR) of the Dystrophia Myotonica Protein Kinase (DMPK) gene, whereas (CCTG)n repeats in the first intron of the Cellular Nucleic acid Binding Protein/Zinc Finger Protein 9 (CNBP/ZNF9) gene cause DM2. The expansions are transcribed into (CUG)n and (CCUG)n-containing RNA, respectively, which form secondary structures and sequester RNA-binding proteins, such as the splicing factor muscleblind-like protein (MBNL), forming nuclear aggregates known as foci. Other splicing factors, such as CUGBP, are also disrupted, leading to a spliceopathy of a large number of downstream genes linked to the clinical features of these diseases. Skeletal muscle regeneration relies on muscle progenitor cells, known as satellite cells, which are activated after muscle damage, and which proliferate and differentiate to muscle cells, thus regenerating the damaged tissue. Satellite cell dysfunction seems to be a common feature of both age-dependent muscle degeneration (sarcopenia) and muscle wasting in DM and other muscle degenerative diseases. This review aims to describe the cellular, molecular and macrostructural processes involved in the muscular degeneration seen in DM patients, highlighting the similarities found with muscle aging.

  18. Foot-and-mouth disease virus 5’-terminal S fragment is required for replication and modulation of the innate immune response in host cells

    USDA-ARS?s Scientific Manuscript database

    The foot-and-mouth disease virus (FMDV) contains a 5’ untranslated region (5’UTR) with multiple structural domains that regulate viral genome replication, translation, and virus-host interactions. At its 5’terminus, the S fragment of over 360 bp is predicted to form a stable stem-loop that is separ...

  19. Variation in the coding and 3’ untranslated regions of the porcine prolactin receptor short form modifies protein expression and function

    USDA-ARS?s Scientific Manuscript database

    The actions of prolactin (PRL) are mediated by both long (LF) and short isoforms (SF) of the PRL receptor (PRLR). Here, we report on a genetic and functional analysis of the porcine PRLR (pPRLR) SF. Three single nucleotide polymorphisms (SNPs) within exon 11 of the pPRLR-SF give rise to four amino a...

  20. First Complete Genome Sequence of Zika Virus (Flaviviridae, Flavivirus) from an Autochthonous Transmission in Brazil.

    PubMed

    Cunha, Mariana Sequetin; Esposito, Danillo Lucas Alves; Rocco, Iray Maria; Maeda, Adriana Yurika; Vasami, Fernanda Gisele Silva; Nogueira, Juliana Silva; de Souza, Renato Pereira; Suzuki, Akemi; Addas-Carvalho, Marcelo; Barjas-Castro, Maria de Lourdes; Resende, Mariângela Ribeiro; Stucchi, Raquel Silveira Bello; Boin, Ilka de Fátima Santana Ferreira; Katz, Gizelda; Angerami, Rodrigo Nogueira; da Fonseca, Benedito Antonio Lopes

    2016-03-03

    We report here the genome sequence of Zika virus, strain ZikaSPH2015, containing all structural and nonstructural proteins flanked by the 5' and 3' untranslated region. It was isolated in São Paulo state, Brazil, in 2015, from a patient who received a blood transfusion from an asymptomatic donor at the time of donation. Copyright © 2016 Cunha et al.

  1. A mixed group II/group III twintron in the Euglena gracilis chloroplast ribosomal protein S3 gene: evidence for intron insertion during gene evolution.

    PubMed Central

    Copertino, D W; Christopher, D A; Hallick, R B

    1991-01-01

    The splicing of a 409 nucleotide intron from the Euglena gracilis chloroplast ribosomal protein S3 gene (rps3) was examined by cDNA cloning and sequencing, and northern hybridization. Based on the characterization of a partially spliced pre-mRNA, the intron was characterized as a 'mixed' twintron, composed of a 311 nucleotide group II intron internal to a 98 nucleotide group III intron. Twintron excision is via a 2-step sequential splicing pathway, with removal of the internal group II intron preceding excision of the external group III intron. Based on secondary structural analysis of the twintron, we propose that group III introns may represent highly degenerate versions of group II introns. The existence of twintrons is interpreted as evidence that group II introns were inserted during the evolution of Euglena chloroplast genes from a common ancestor with eubacteria, archaebacteria, cyanobacteria, and other chloroplasts. Images PMID:1721702

  2. The myotonic dystrophy kinase 3{prime}-untranslated region and its effect on gene expression

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ang, C.W.Y.; Sabourin, L.A.; Narang, M.A.

    1994-09-01

    Myotonic dystrophy (DM) is an autosomal dominant neuromuscular disease involving the expansion of an unstable CTG repeat in the 3{prime}-untranslated (3{prime}-UTR) region of the DM kinase (DMK) gene. Increased levels of mRNA in congenital compared to normal tissue have been shown, suggesting elevated DMK levels may be responsible for the disease phenotype. To study the effect of the DMK 3{prime}UTR on gene expression, a reporter gene system was constructed using the constitutive CMV promoter with the chloramphenicol acetyl transferase (CAT) open reading frame and the DMK 3{prime}UTR containing from 5 repeats up to 90 repeats. Transient transfection into a rhabdomyosarcomamore » cell line shows a three-fold increase in CAT activity from constructs containing a wildtype 3{prime}UTR (5 and 20 repeats) compared to a control construct containing only a poly(A) signal. Reporter constructs with repeats in the protomutation (50 repeats) and mutation (90 repeats) range show a greater than 10-fold increase over control CAT activity. These results suggest the presence of elements in the DMK 3{prime}UTR capable of conferring increased gene expression. We are currently investigating cell-specific activity of the constructs and conducting deletion mapping to identify regulatory elements in the 3{prime}-UTR.« less

  3. Identification of a second murine interleukin-11 receptor alpha-chain gene (IL11Ra2) with a restricted pattern of expression.

    PubMed

    Robb, L; Hilton, D J; Brook-Carter, P T; Begley, C G

    1997-03-15

    The interleukin-11 receptor alpha-chain, a member of the hematopoietin receptor superfamily, forms, together with gp130, a functional high-affinity receptor complex for interleukin 11. We, and others, reported the cloning of the murine interleukin 11 receptor alpha-chain cDNA (IL11Ra) and recently described the structure of the IL11Ra locus. We also described the presence of a second IL11Ra-like locus in some mouse strains. In this study we report that the second locus, designated IL11Ra2, encodes an mRNA species. The transcript was 99% identical to the IL11Ra transcript in the coding and 3'-untranslated region, but had a different 5'-untranslated region. The complete genomic organization of the IL11Ra2 locus is presented, and the two loci are shown to be located on a 200-kb NaeI genomic fragment. Comparison of the expression pattern of the IL11Ra and IL11Ra2 genes using an RT-PCR restriction fragment length polymorphism strategy revealed that while the expression of IL11Ra was widespread, expression of IL11Ra2 was restricted to testis, lymph node, and thymus.

  4. Replacement of the yeast TRP4 3' untranslated region by a hammerhead ribozyme results in a stable and efficiently exported mRNA that lacks a poly(A) tail.

    PubMed Central

    Düvel, Katrin; Valerius, Oliver; Mangus, David A; Jacobson, Allan; Braus, Gerhard H

    2002-01-01

    The mRNA poly(A) tail serves different purposes, including the facilitation of nuclear export, mRNA stabilization, efficient translation, and, finally, specific degradation. The posttranscriptional addition of a poly(A) tail depends on sequence motifs in the 3' untranslated region (3' UTR) of the mRNA and a complex trans-acting protein machinery. In this study, we have replaced the 3' UTR of the yeast TRP4 gene with sequences encoding a hammerhead ribozyme that efficiently cleaves itself in vivo. Expression of the TRP4-ribozyme allele resulted in the accumulation of a nonpolyadenylated mRNA. Cells expressing the TRP4-ribozyme mRNA showed a reduced growth rate due to a reduction in Trp4p enzyme activity. The reduction in enzyme activity was not caused by inefficient mRNA export from the nucleus or mRNA destabilization. Rather, analyses of mRNA association with polyribosomes indicate that translation of the ribozyme-containing mRNA is impaired. This translational defect allows sufficient synthesis of Trp4p to support growth of trp4 cells, but is, nevertheless, of such magnitude as to activate the general control network of amino acid biosynthesis. PMID:12003493

  5. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    PubMed

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  6. Role of the Pepino mosaic virus 3'-untranslated region elements in negative-strand RNA synthesis in vitro.

    PubMed

    Osman, Toba A M; Olsthoorn, René C L; Livieratos, Ioannis C

    2014-09-22

    Pepino mosaic virus (PepMV) is a mechanically-transmitted positive-strand RNA potexvirus, with a 6410 nt long single-stranded (ss) RNA genome flanked by a 5'-methylguanosine cap and a 3' poly-A tail. Computer-assisted folding of the 64 nt long PepMV 3'-untranslated region (UTR) resulted in the prediction of three stem-loop structures (hp1, hp2, and hp3 in the 3'-5' direction). The importance of these structures and/or sequences for promotion of negative-strand RNA synthesis and binding to the RNA dependent RNA polymerase (RdRp) was tested in vitro using a specific RdRp assay. Hp1, which is highly variable among different PepMV isolates, appeared dispensable for negative-strand synthesis. Hp2, which is characterized by a large U-rich loop, tolerated base-pair changes in its stem as long as they maintained the stem integrity but was very sensitive to changes in the U-rich loop. Hp3, which harbours the conserved potexvirus ACUUAA hexamer motif, was essential for template activity. Template-RNA polymerase binding competition experiments showed that the ACUUAA sequence represents a high-affinity RdRp binding element. Copyright © 2014 Elsevier B.V. All rights reserved.

  7. Dehydration stress extends mRNA 3′ untranslated regions with noncoding RNA functions in Arabidopsis

    PubMed Central

    Sun, Hai-Xi; Li, Yan; Niu, Qi-Wen; Chua, Nam-Hai

    2017-01-01

    The 3′ untranslated regions (3′ UTRs) of mRNAs play important roles in the regulation of mRNA localization, translation, and stability. Alternative cleavage and polyadenylation (APA) generates mRNAs with different 3′ UTRs, but the involvement of this process in stress response has not yet been clarified. Here, we report that a subset of stress-related genes exhibits 3′ UTR extensions of their mRNAs during dehydration stress. These extended 3′ UTRs have characteristics of long noncoding RNAs and likely do not interact with miRNAs. Functional studies using T-DNA insertion mutants reveal that they can act as antisense transcripts to repress expression levels of sense genes from the opposite strand or can activate the transcription or lead to read-through transcription of their downstream genes. Further analysis suggests that transcripts with 3′ UTR extensions have weaker poly(A) signals than those without 3′ UTR extensions. Finally, we show that their biogenesis is partially dependent on a trans-acting factor FPA. Taken together, we report that dehydration stress could induce transcript 3′ UTR extensions and elucidate a novel function for these stress-induced 3′ UTR extensions as long noncoding RNAs in the regulation of their neighboring genes. PMID:28522613

  8. Sequencing and Characterization of Novel PII Signaling Protein Gene in Microalga Haematococcus pluvialis.

    PubMed

    Ma, Ruijuan; Li, Yan; Lu, Yinghua

    2017-10-11

    The PII signaling protein is a key protein for controlling nitrogen assimilatory reactions in most organisms, but little information is reported on PII proteins of green microalga Haematococcus pluvialis . Since H. pluvialis cells can produce a large amount of astaxanthin upon nitrogen starvation, its PII protein may represent an important factor on elevated production of Haematococcus astaxanthin. This study identified and isolated the coding gene (Hp GLB1 ) from this microalga. The full-length of Hp GLB1 was 1222 bp, including 621 bp coding sequence (CDS), 103 bp 5' untranslated region (5' UTR), and 498 bp 3' untranslated region (3' UTR). The CDS could encode a protein with 206 amino acids (HpPII). Its calculated molecular weight (Mw) was 22.4 kDa and the theoretical isoelectric point was 9.53. When H. pluvialis cells were exposed to nitrogen starvation, the Hp GLB1 expression was increased 2.46 times in 48 h, concomitant with the raise of astaxanthin content. This study also used phylogenetic analysis to prove that HpPII was homogeneous to the PII proteins of other green microalgae. The results formed a fundamental basis for the future study on HpPII, for its potential physiological function in Haematococcus astaxanthin biosysthesis.

  9. Post-Transcriptional Regulation of the Human Mu-Opioid Receptor (MOR) by Morphine-Induced RNA Binding Proteins hnRNP K and PCBP1

    PubMed Central

    Song, Kyu Young; Choi, Hack Sun; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H.

    2016-01-01

    Expression of the mu-opioid receptor (MOR) protein is controlled by extensive transcriptional and post-transcriptional processing. MOR gene expression has previously been shown to be altered by a post-transcriptional mechanism involving the MOR mRNA untranslated region (UTR). Here, we demonstrate for the first time the role of heterogeneous nuclear ribonucleic acids (hnRNA)-binding protein (hnRNP) K and poly(C)-binding protein 1 (PCBP1) as post-transcriptional inducers in MOR gene regulation. In the absence of morphine, a significant level of MOR mRNA is sustained in its resting state and partitions in the translationally inactive polysomal fraction. Morphine stimulation activates the downstream targets hnRNP K and PCPB1 and induces partitioning of the MOR mRNA to the translationally active fraction. Using reporter and ligand binding assays, as well as RNA EMSA, we reveal potential RNP binding sites located in the 5′-untranslated region of human MOR mRNA. In addition, we also found that morphine-induced RNPs could regulate MOR expression. Our results establish the role of hnRNP K and PCPB1 in the translational control of morphine-induced MOR expression in human neuroblastoma (NMB) cells as well as cells stably expressing MOR (NMB1). PMID:27292014

  10. Expression of CD44 3'-untranslated region regulates endogenous microRNA functions in tumorigenesis and angiogenesis.

    PubMed

    Jeyapalan, Zina; Deng, Zhaoqun; Shatseva, Tatiana; Fang, Ling; He, Chengyan; Yang, Burton B

    2011-04-01

    The non-coding 3'-untranslated region (UTR) plays an important role in the regulation of microRNA (miRNA) functions, since it can bind and inactivate multiple miRNAs. Here, we show the 3'-UTR of CD44 is able to antagonize cytoplasmic miRNAs, and result in the increased translation of CD44 and downstream target mRNA, CDC42. A series of cell function assays in the human breast cancer cell line, MT-1, have shown that the CD44 3'-UTR inhibits proliferation, colony formation and tumor growth. Furthermore, it modulated endothelial cell activities, favored angiogenesis, induced tumor cell apoptosis and increased sensitivity to Docetaxel. These results are due to the interaction of the CD44 3'-UTR with multiple miRNAs. Computational algorithms have predicted three miRNAs, miR-216a, miR-330 and miR-608, can bind to both the CD44 and CDC42 3'-UTRs. This was confirmed with luciferase assays, western blotting and immunohistochemical staining and correlated with a series of siRNA assays. Thus, the non-coding CD44 3'-UTR serves as a competitor for miRNA binding and subsequently inactivates miRNA functions, by freeing the target mRNAs from being repressed.

  11. Expression of CD44 3′-untranslated region regulates endogenous microRNA functions in tumorigenesis and angiogenesis

    PubMed Central

    Jeyapalan, Zina; Deng, Zhaoqun; Shatseva, Tatiana; Fang, Ling; He, Chengyan; Yang, Burton B.

    2011-01-01

    The non-coding 3′-untranslated region (UTR) plays an important role in the regulation of microRNA (miRNA) functions, since it can bind and inactivate multiple miRNAs. Here, we show the 3′-UTR of CD44 is able to antagonize cytoplasmic miRNAs, and result in the increased translation of CD44 and downstream target mRNA, CDC42. A series of cell function assays in the human breast cancer cell line, MT-1, have shown that the CD44 3′-UTR inhibits proliferation, colony formation and tumor growth. Furthermore, it modulated endothelial cell activities, favored angiogenesis, induced tumor cell apoptosis and increased sensitivity to Docetaxel. These results are due to the interaction of the CD44 3′-UTR with multiple miRNAs. Computational algorithms have predicted three miRNAs, miR-216a, miR-330 and miR-608, can bind to both the CD44 and CDC42 3′-UTRs. This was confirmed with luciferase assays, western blotting and immunohistochemical staining and correlated with a series of siRNA assays. Thus, the non-coding CD44 3′-UTR serves as a competitor for miRNA binding and subsequently inactivates miRNA functions, by freeing the target mRNAs from being repressed. PMID:21149267

  12. Polymorphisms at the 3' untranslated region of SLC11A1 gene are associated with protection to Brucella infection in goats.

    PubMed

    Iacoboni, Paola A; Hasenauer, Flavia C; Caffaro, M Eugenia; Gaido, Analia; Rossetto, Cristina; Neumann, Roberto D; Salatin, Antonio; Bertoni, Emiliano; Poli, Mario A; Rossetti, Carlos A

    2014-08-15

    Goats are susceptible to brucellosis and the detection of Brucella-infected animals is carried out by serological tests. In other ruminant species, polymorphisms in microsatellites (Ms) of 3' untranslated region (3'UTR) of the solute carrier family 11 member A1 (SLC11A1) gene were associated with resistance to Brucella abortus infection. Goats present two polymorphic Ms at the 3'UTR end of SLC11A1 gene, called regions A and B. Here, we evaluated if polymorphisms in regions A and/or B are associated with Brucella infection in goats. Serum (for the detection of Brucella-specific antibodies) and hair samples (for DNA isolation and structure analysis of the SLC11A1 gene) were randomly collected from 229 adult native goats from the northwest of Argentina. Serological status was evaluated by buffer plate antigen test (BPAT) complemented by the fluorescent polarization assay (FPA), and the genotype of the 3'UTR of the SLC11A1 gene was determined by capillary electrophoresis and confirmed by sequence analysis. Polymorphisms in regions A and B of the 3'UTR SLC11A1 gene were found statistically significant associated with protection to Brucella infection. Specifically, the association study indicates statistical significance of the allele A15 and B7/B7 genotype with absence of Brucella-specific antibodies (p=0.0003 and 0.0088, respectively). These data open a promising opportunity for limiting goat brucellosis through selective breeding of animals based on genetic markers associated with natural resistance to B. melitensis infection. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. Bacteriophage 5' untranslated regions for control of plastid transgene expression.

    PubMed

    Yang, Huijun; Gray, Benjamin N; Ahner, Beth A; Hanson, Maureen R

    2013-02-01

    Expression of foreign proteins from transgenes incorporated into plastid genomes requires regulatory sequences that can be recognized by the plastid transcription and translation machinery. Translation signals harbored by the 5' untranslated region (UTR) of plastid transcripts can profoundly affect the level of accumulation of proteins expressed from chimeric transgenes. Both endogenous 5' UTRs and the bacteriophage T7 gene 10 (T7g10) 5' UTR have been found to be effective in combination with particular coding regions to mediate high-level expression of foreign proteins. We investigated whether two other bacteriophage 5' UTRs could be utilized in plastid transgenes by fusing them to the aadA (aminoglycoside-3'-adenyltransferase) coding region that is commonly used as a selectable marker in plastid transformation. Transplastomic plants containing either the T7g1.3 or T4g23 5' UTRs fused to Myc-epitope-tagged aadA were successfully obtained, demonstrating the ability of these 5' UTRs to regulate gene expression in plastids. Placing the Thermobifida fusca cel6A gene under the control of the T7g1.3 or T4g23 5' UTRs, along with a tetC downstream box, resulted in poor expression of the cellulase in contrast with high-level accumulation while using the T7g10 5' UTR. However, transplastomic plants with the bacteriophage 5' UTRs controlling the aadA coding region exhibited fewer undesired recombinant species than plants containing the same marker gene regulated by the Nicotiana tabacum psbA 5' UTR. Furthermore, expression of the T7g1.3 and T4g23 5' UTR::aadA fusions downstream of the cel6A gene provided sufficient spectinomycin resistance to allow selection of homoplasmic transgenic plants and had no effect on Cel6A accumulation.

  14. microRNA-122 target sites in the hepatitis C virus RNA NS5B coding region and 3' untranslated region: function in replication and influence of RNA secondary structure.

    PubMed

    Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael

    2017-02-01

    We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.

  15. Endothelin-1 expression is strongly repressed by AU-rich elements in the 3′-untranslated region of the gene

    PubMed Central

    2004-01-01

    The regulation of the synthesis of the endothelial-derived vasoconstrictor ET-1 (endothelin-1) is a complex process that occurs mainly at the mRNA level. Transcription of the gene accounts for an important part of the regulation of expression, as already described for different modulators such as the cytokine TGF-β (transforming growth factor-β). However, very little is known about mechanisms governing ET-1 expression at the post-transcriptional level. The aim of the present study was to investigate the regulation of the ET-1 expression at this level. Since the 3′-UTR (3′-untranslated region) of mRNAs commonly contains genetic determinants for the post-transcriptional control of gene expression, we focused on the potential role of the 3′-UTR of ET-1 mRNA. Experiments performed with luciferase reporter constructs containing the 3′-UTR showed that this region exerts a potent destabilizing effect. Deletional analyses allowed us to locate this activity within a region at positions 924–1127. Some (but not all) of the AREs (AU-rich elements) present in this region were found to be essential for this mRNA-destabilizing activity. We also present evidence that cytosolic proteins from endothelial cells interact specifically with these RNA elements, and that a close correlation exists between the ability of the AREs to destabilize ET-1 mRNA and the binding of proteins to these elements. Our results are compatible with the existence of a strong repressional control of ET-1 expression mediated by destabilization of the mRNA exerted through the interaction of specific cytosolic proteins with AREs present in the 3′-UTR of the gene. PMID:15595926

  16. Recent mobility of plastid encoded group II introns and twintrons in five strains of the unicellular red alga Porphyridium

    PubMed Central

    Perrineau, Marie-Mathilde; Price, Dana C.; Mohr, Georg

    2015-01-01

    Group II introns are closely linked to eukaryote evolution because nuclear spliceosomal introns and the small RNAs associated with the spliceosome are thought to trace their ancient origins to these mobile elements. Therefore, elucidating how group II introns move, and how they lose mobility can potentially shed light on fundamental aspects of eukaryote biology. To this end, we studied five strains of the unicellular red alga Porphyridium purpureum that surprisingly contain 42 group II introns in their plastid genomes. We focused on a subset of these introns that encode mobility-conferring intron-encoded proteins (IEPs) and found them to be distributed among the strains in a lineage-specific manner. The reverse transcriptase and maturase domains were present in all lineages but the DNA endonuclease domain was deleted in vertically inherited introns, demonstrating a key step in the loss of mobility. P. purpureum plastid intron RNAs had a classic group IIB secondary structure despite variability in the DIII and DVI domains. We report for the first time the presence of twintrons (introns-within-introns, derived from the same mobile element) in Rhodophyta. The P. purpureum IEPs and their mobile introns provide a valuable model for the study of mobile retroelements in eukaryotes and offer promise for biotechnological applications. PMID:26157604

  17. Genetic Manipulation of Lactococcus lactis by Using Targeted Group II Introns: Generation of Stable Insertions without Selection

    PubMed Central

    Frazier, Courtney L.; San Filippo, Joseph; Lambowitz, Alan M.; Mills, David A.

    2003-01-01

    Despite their commercial importance, there are relatively few facile methods for genomic manipulation of the lactic acid bacteria. Here, the lactococcal group II intron, Ll.ltrB, was targeted to insert efficiently into genes encoding malate decarboxylase (mleS) and tetracycline resistance (tetM) within the Lactococcus lactis genome. Integrants were readily identified and maintained in the absence of a selectable marker. Since splicing of the Ll.ltrB intron depends on the intron-encoded protein, targeted invasion with an intron lacking the intron open reading frame disrupted TetM and MleS function, and MleS activity could be partially restored by expressing the intron-encoded protein in trans. Restoration of splicing from intron variants lacking the intron-encoded protein illustrates how targeted group II introns could be used for conditional expression of any gene. Furthermore, the modified Ll.ltrB intron was used to separately deliver a phage resistance gene (abiD) and a tetracycline resistance marker (tetM) into mleS, without the need for selection to drive the integration or to maintain the integrant. Our findings demonstrate the utility of targeted group II introns as a potential food-grade mechanism for delivery of industrially important traits into the genomes of lactococci. PMID:12571038

  18. Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15)(q27;q11.2) associated with Prader-Willi syndrome

    PubMed Central

    Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, RJ McKinlay; Francke, Uta

    2005-01-01

    Background Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). Methods We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Results Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. Conclusion As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype. PMID:15877813

  19. Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15)(q27;q11.2) associated with Prader-Willi syndrome.

    PubMed

    Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, R J McKinlay; Francke, Uta

    2005-05-06

    Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype.

  20. Evolution of introns in the archaeal world.

    PubMed

    Tocchini-Valentini, Giuseppe D; Fruscoloni, Paolo; Tocchini-Valentini, Glauco P

    2011-03-22

    The self-splicing group I introns are removed by an autocatalytic mechanism that involves a series of transesterification reactions. They require RNA binding proteins to act as chaperones to correctly fold the RNA into an active intermediate structure in vivo. Pre-tRNA introns in Bacteria and in higher eukaryote plastids are typical examples of self-splicing group I introns. By contrast, two striking features characterize RNA splicing in the archaeal world. First, self-splicing group I introns cannot be found, to this date, in that kingdom. Second, the RNA splicing scenario in Archaea is uniform: All introns, whether in pre-tRNA or elsewhere, are removed by tRNA splicing endonucleases. We suggest that in Archaea, the protein recruited for splicing is the preexisting tRNA splicing endonuclease and that this enzyme, together with the ligase, takes over the task of intron removal in a more efficient fashion than the ribozyme. The extinction of group I introns in Archaea would then be a consequence of recruitment of the tRNA splicing endonuclease. We deal here with comparative genome analysis, focusing specifically on the integration of introns into genes coding for 23S rRNA molecules, and how this newly acquired intron has to be removed to regenerate a functional RNA molecule. We show that all known oligomeric structures of the endonuclease can recognize and cleave a ribosomal intron, even when the endonuclease derives from a strain lacking rRNA introns. The persistence of group I introns in mitochondria and chloroplasts would be explained by the inaccessibility of these introns to the endonuclease.

  1. Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.

    PubMed

    Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel

    2004-10-01

    Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.

  2. A SNP uncoupling Mina expression from the TGFβ signaling pathway.

    PubMed

    Lian, Shang L; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori; Bix, Mark

    2018-03-01

    Mina is a JmjC family 2-oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell-type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1-region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1-region SNPs perturbs a Mina cis-regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus-spanning 26-kilobase genomic interval. We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c-but not C57Bl/6 allele-abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. © 2017 The Authors. Immunity, Inflammation and Disease Published by John Wiley & Sons Ltd.

  3. A SNP uncoupling Mina expression from the TGFβ signaling pathway

    PubMed Central

    Lian, Shang L.; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori

    2017-01-01

    Abstract Introduction Mina is a JmjC family 2‐oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell‐type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1‐region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Methods Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1‐region SNPs perturbs a Mina cis‐regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus‐spanning 26‐kilobase genomic interval. Results We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c—but not C57Bl/6 allele—abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Conclusions Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. PMID:28967702

  4. Nanoparticle-mediated rhodopsin cDNA but not intron-containing DNA delivery causes transgene silencing in a rhodopsin knockout model.

    PubMed

    Zheng, Min; Mitra, Rajendra N; Filonov, Nazar A; Han, Zongchao

    2016-03-01

    Previously, we compared the efficacy of nanoparticle (NP)-mediated intron-containing rhodopsin (sgRho) vs. intronless cDNA in ameliorating retinal disease phenotypes in a rhodopsin knockout (RKO) mouse model of retinitis pigmentosa. We showed that NP-mediated sgRho delivery achieved long-term expression and phenotypic improvement in RKO mice, but not NP housing cDNA. However, the protein level of the NP-sgRho construct was only 5-10% of wild-type at 8 mo postinjection. To have a better understanding of the reduced levels of long-term expression of the vectors, in the present study, we evaluated the epigenetic changes of subretinal delivering NP-cDNA vs. NP-sgRho in the RKO mouse eyes. Following the administration, DNA methylation and histone status of specific regions (bacteria plasmid backbone, promoter, rhodopsin gene, and scaffold/matrix attachment region) of the vectors were evaluated at various time points. We documented that epigenetic transgene silencing occurred in vector-mediated gene transfer, which were caused by the plasmid backbone and the cDNA of the transgene, but not the intron-containing transgene. No toxicity or inflammation was found in the treated eyes. Our results suggest that cDNA of the rhodopsin transgene and bacteria backbone interfered with the host defense mechanism of DNA methylation-mediated transgene silencing through heterochromatin-associated modifications. © FASEB.

  5. Mutation Spectrum of the ABCA4 Gene in a Greek Cohort with Stargardt Disease: Identification of Novel Mutations and Evidence of Three Prevalent Mutated Alleles

    PubMed Central

    Vassiliki, Kokkinou; George, Koutsodontis; Polixeni, Stamatiou; Christoforos, Giatzakis; Minas, Aslanides Ioannis; Stavrenia, Koukoula; Ioannis, Datseris

    2018-01-01

    Aim To evaluate the frequency and pattern of disease-associated mutations of ABCA4 gene among Greek patients with presumed Stargardt disease (STGD1). Materials and Methods A total of 59 patients were analyzed for ABCA4 mutations using the ABCR400 microarray and PCR-based sequencing of all coding exons and flanking intronic regions. MLPA analysis as well as sequencing of two regions in introns 30 and 36 reported earlier to harbor deep intronic disease-associated variants was used in 4 selected cases. Results An overall detection rate of at least one mutant allele was achieved in 52 of the 59 patients (88.1%). Direct sequencing improved significantly the complete characterization rate, that is, identification of two mutations compared to the microarray analysis (93.1% versus 50%). In total, 40 distinct potentially disease-causing variants of the ABCA4 gene were detected, including six previously unreported potentially pathogenic variants. Among the disease-causing variants, in this cohort, the most frequent was c.5714+5G>A representing 16.1%, while p.Gly1961Glu and p.Leu541Pro represented 15.2% and 8.5%, respectively. Conclusions By using a combination of methods, we completely molecularly diagnosed 48 of the 59 patients studied. In addition, we identified six previously unreported, potentially pathogenic ABCA4 mutations. PMID:29854428

  6. Evolutionary and biogeographical implications of degraded LAGLIDADG endonuclease functionality and group I intron occurrence in stony corals (Scleractinia) and mushroom corals (Corallimorpharia).

    PubMed

    Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas

    2017-01-01

    Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.

  7. Regions of conservation and divergence in the 3' untranslated sequences of genomic RNA from Ross River virus isolates.

    PubMed

    Faragher, S G; Dalgarno, L

    1986-07-20

    The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.

  8. Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin

    ERIC Educational Resources Information Center

    Offner, Susan

    2010-01-01

    The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.

  9. Bacterial group II introns: not just splicing.

    PubMed

    Toro, Nicolás; Jiménez-Zurdo, José Ignacio; García-Rodríguez, Fernando Manuel

    2007-04-01

    Group II introns are both catalytic RNAs (ribozymes) and mobile retroelements that were discovered almost 14 years ago. It has been suggested that eukaryotic mRNA introns might have originated from the group II introns present in the alphaproteobacterial progenitor of the mitochondria. Bacterial group II introns are of considerable interest not only because of their evolutionary significance, but also because they could potentially be used as tools for genetic manipulation in biotechnology and for gene therapy. This review summarizes what is known about the splicing mechanisms and mobility of bacterial group II introns, and describes the recent development of group II intron-based gene-targetting methods. Bacterial group II intron diversity, evolutionary relationships, and behaviour in bacteria are also discussed.

  10. Structure of a group II intron in complex with its reverse transcriptase.

    PubMed

    Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei

    2016-06-01

    Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.

  11. Ancient nature of alternative splicing and functions of introns

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, Kemin; Salamov, Asaf; Kuo, Alan

    Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but intronsmore » retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.« less

  12. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution.

    PubMed

    Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V

    2003-09-02

    Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.

  13. A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes

    PubMed Central

    Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.

    2011-01-01

    Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348

  14. Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria.

    PubMed

    Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

    2006-09-14

    The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis.

  15. Spliceosomal Intron Insertions in Genome Compacted Ray-Finned Fishes as Evident from Phylogeny of MC Receptors, Also Supported by a Few Other GPCRs

    PubMed Central

    Sinha, Rahul; Goyal, Pankaj; Grapputo, Alessandro

    2011-01-01

    Background Insertions of spliceosomal introns are very rare events during evolution of vertebrates and the mechanisms governing creation of novel intron(s) remain obscure. Largely, gene structures of melanocortin (MC) receptors are characterized by intron-less architecture. However, recently a few exceptions have been reported in some fishes. This warrants a systematic survey of MC receptors for understanding intron insertion events during vertebrate evolution. Methodology/Principal Findings We have compiled an extended list of MC receptors from different vertebrate genomes with variations in fishes. Notably, the closely linked MC2Rs and MC5Rs from a group of ray-finned fishes have three and one intron insertion(s), respectively, with conserved positions and intron phase. In both genes, one novel insertion was in the highly conserved DRY motif at the end of helix TM3. Further, the proto-splice site MAG↑R is maintained at intron insertion sites in these two genes. However, the orthologs of these receptors from zebrafish and tetrapods are intron-less, suggesting these introns are simultaneously created in selected fishes. Surprisingly, these novel introns are traceable only in four fish genomes. We found that these fish genomes are severely compacted after the separation from zebrafish. Furthermore, we also report novel intron insertions in P2Y receptors and in CHRM3. Finally, we report ultrasmall introns in MC2R genes from selected fishes. Conclusions/Significance The current repository of MC receptors illustrates that fishes have no MC3R ortholog. MC2R, MC5R, P2Y receptors and CHRM3 have novel intron insertions only in ray-finned fishes that underwent genome compaction. These receptors share one intron at an identical position suggestive of being inserted contemporaneously. In addition to repetitive elements, genome compaction is now believed to be a new hallmark that promotes intron insertions, as it requires rapid DNA breakage and subsequent repair processes to gain back normal functionality. PMID:21850219

  16. Optimization of a novel biophysical model using large scale in vivo antisense hybridization data displays improved prediction capabilities of structurally accessible RNA regions

    PubMed Central

    Vazquez-Anderson, Jorge; Mihailovic, Mia K.; Baldridge, Kevin C.; Reyes, Kristofer G.; Haning, Katie; Cho, Seung Hee; Amador, Paul; Powell, Warren B.

    2017-01-01

    Abstract Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA–RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA–RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA–mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs. PMID:28334800

  17. A CT-rich haplotype in intron 4 of SNCA confers risk for Lewy body pathology in Alzheimer’s disease and affects SNCA expression

    PubMed Central

    Lutz, Michael W.; Saul, Robert; Linnertz, Colton; Glenn, Omolara-Chinue; Roses, Allen D.; Chiba-Falek, Ornit

    2015-01-01

    INTRODUCTION We recently showed that tagging-SNPs across the SNCA locus were significantly associated with increased risk for LB pathology in AD cases. However, the actual genetic variant(s) that underlie the observed associations remain elusive. METHODS We used a bioinformatics algorithm to catalogue Structural-Variants in a region of SNCA-intron4, followed by phased-sequencing. We performed a genetic-association analysis in autopsy series of LBV/AD cases compared with AD-only controls. We investigated the biological functions by expression analysis using temporal-cortex samples. RESULTS We identified four distinct haplotypes within a highly-polymorphic-low-complexity CT-rich region. We showed that a specific haplotype conferred risk to develop LBV/AD. We demonstrated that the CT-rich site acts as an enhancer element, where the risk haplotype was significantly associated with elevated levels of SNCA-mRNA. DISCUSSION We have discovered a novel haplotype in a CT-rich region in SNCA that contributes to LB pathology in AD patients, possibly via cis-regulation of the gene expression. PMID:26079410

  18. Identification and analysis of host proteins that interact with the 3'-untranslated region of tick-borne encephalitis virus genomic RNA.

    PubMed

    Muto, Memi; Kamitani, Wataru; Sakai, Mizuki; Hirano, Minato; Kobayashi, Shintaro; Kariwa, Hiroaki; Yoshii, Kentaro

    2018-04-02

    Tick-borne encephalitis virus (TBEV) causes severe neurological disease, but the pathogenetic mechanism is unclear. The conformational structure of the 3'-untranslated region (UTR) of TBEV is associated with its virulence. We tried to identify host proteins interacting with the 3'-UTR of TBEV. Cellular proteins of HEK293T cells were co-precipitated with biotinylated RNAs of the 3'-UTR of low- and high-virulence TBEV strains and subjected to mass spectrometry analysis. Fifteen host proteins were found to bind to the 3'-UTR of TBEV, four of which-cold shock domain containing-E1 (CSDE1), spermatid perinuclear RNA binding protein (STRBP), fragile X mental retardation protein (FMRP), and interleukin enhancer binding factor 3 (ILF3)-bound specifically to that of the low-virulence strain. An RNA immunoprecipitation and pull-down assay confirmed the interactions of the complete 3'-UTRs of TBEV genomic RNA with CSDE1, FMRP, and ILF3. Partial deletion of the stem loop (SL) 3 to SL 5 structure of the variable region of the 3'-UTR did not affect interactions with the host proteins, but the interactions were markedly suppressed by deletion of the complete SL 3, 4, and 5 structures, as in the high-virulence TBEV strain. Further analysis of the roles of host proteins in the neurologic pathogenicity of TBEV is warranted. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. The RNA helicase RHAU (DHX36) suppresses expression of the transcription factor PITX1.

    PubMed

    Booy, Evan P; Howard, Ryan; Marushchak, Oksana; Ariyo, Emmanuel O; Meier, Markus; Novakowski, Stefanie K; Deo, Soumya R; Dzananovic, Edis; Stetefeld, Jörg; McKenna, Sean A

    2014-03-01

    RNA Helicase associated with AU-rich element (RHAU) (DHX36) is a DEAH (Aspartic acid, Glumatic Acid, Alanine, Histidine)-box RNA helicase that can bind and unwind G4-quadruplexes in DNA and RNA. To detect novel RNA targets of RHAU, we performed an RNA co-immunoprecipitation screen and identified the PITX1 messenger RNA (mRNA) as specifically and highly enriched. PITX1 is a homeobox transcription factor with roles in both development and cancer. Primary sequence analysis identified three probable quadruplexes within the 3'-untranslated region of the PITX1 mRNA. Each of these sequences, when isolated, forms stable quadruplex structures that interact with RHAU. We provide evidence that these quadruplexes exist in the endogenous mRNA; however, we discovered that RHAU is tethered to the mRNA via an alternative non-quadruplex-forming region. RHAU knockdown by small interfering RNA results in significant increases in PITX1 protein levels with only marginal changes in mRNA, suggesting a role for RHAU in translational regulation. Involvement of components of the microRNA machinery is supported by similar and non-additive increases in PITX1 protein expression on Dicer and combined RHAU/Dicer knockdown. We also demonstrate a requirement of argonaute-2, a key RNA-induced silencing complex component, to mediate RHAU-dependent changes in PITX1 protein levels. These results demonstrate a novel role for RHAU in microRNA-mediated translational regulation at a quadruplex-containing 3'-untranslated region.

  20. Ultra-Deep Sequencing Analysis of the Hepatitis A Virus 5'-Untranslated Region among Cases of the Same Outbreak from a Single Source

    PubMed Central

    Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu

    2014-01-01

    Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287

  1. Multiple recent horizontal transfers of the cox1 intron in Solanaceae and extended co-conversion of flanking exons

    PubMed Central

    2011-01-01

    Background The most frequent case of horizontal transfer in plants involves a group I intron in the mitochondrial gene cox1, which has been acquired via some 80 separate plant-to-plant transfer events among 833 diverse angiosperms examined. This homing intron encodes an endonuclease thought to promote the intron's promiscuous behavior. A promising experimental approach to study endonuclease activity and intron transmission involves somatic cell hybridization, which in plants leads to mitochondrial fusion and genome recombination. However, the cox1 intron has not yet been found in the ideal group for plant somatic genetics - the Solanaceae. We therefore undertook an extensive survey of this family to find members with the intron and to learn more about the evolutionary history of this exceptionally mobile genetic element. Results Although 409 of the 426 species of Solanaceae examined lack the cox1 intron, it is uniformly present in three phylogenetically disjunct clades. Despite strong overall incongruence of cox1 intron phylogeny with angiosperm phylogeny, two of these clades possess nearly identical intron sequences and are monophyletic in intron phylogeny. These two clades, and possibly the third also, contain a co-conversion tract (CCT) downstream of the intron that is extended relative to all previously recognized CCTs in angiosperm cox1. Re-examination of all published cox1 genes uncovered additional cases of extended co-conversion and identified a rare case of putative intron loss, accompanied by full retention of the CCT. Conclusions We infer that the cox1 intron was separately and recently acquired by at least three different lineages of Solanaceae. The striking identity of the intron and CCT from two of these lineages suggests that one of these three intron captures may have occurred by a within-family transfer event. This is consistent with previous evidence that horizontal transfer in plants is biased towards phylogenetically local events. The discovery of extended co-conversion suggests that other cox1 conversions may be longer than realized but obscured by the exceptional conservation of plant mitochondrial sequences. Our findings provide further support for the rampant-transfer model of cox1 intron evolution and recommend the Solanaceae as a model system for the experimental analysis of cox1 intron transfer in plants. PMID:21943226

  2. CYCLIN-DEPENDENT KINASE G1 Is Associated with the Spliceosome to Regulate CALLOSE SYNTHASE5 Splicing and Pollen Wall Formation in Arabidopsis[C][W][OA

    PubMed Central

    Huang, Xue-Yong; Niu, Jin; Sun, Ming-Xi; Zhu, Jun; Gao, Ju-Fang; Yang, Jun; Zhou, Que; Yang, Zhong-Nan

    2013-01-01

    Arabidopsis thaliana CYCLIN-DEPEDENT KINASE G1 (CDKG1) belongs to the family of cyclin-dependent protein kinases that were originally characterized as cell cycle regulators in eukaryotes. Here, we report that CDKG1 regulates pre-mRNA splicing of CALLOSE SYNTHASE5 (CalS5) and, therefore, pollen wall formation. The knockout mutant cdkg1 exhibits reduced male fertility with impaired callose synthesis and abnormal pollen wall formation. The sixth intron in CalS5 pre-mRNA, a rare type of intron with a GC 5′ splice site, is abnormally spliced in cdkg1. RNA immunoprecipitation analysis suggests that CDKG1 is associated with this intron. CDKG1 contains N-terminal Ser/Arg (RS) motifs and interacts with splicing factor Arginine/Serine-Rich Zinc Knuckle-Containing Protein33 (RSZ33) through its RS region to regulate proper splicing. CDKG1 and RS-containing Zinc Finger Protein22 (SRZ22), a splicing factor interacting with RSZ33 and U1 small nuclear ribonucleoprotein particle (snRNP) component U1-70k, colocalize in nuclear speckles and reside in the same complex. We propose that CDKG1 is recruited to U1 snRNP through RSZ33 to facilitate the splicing of the sixth intron of CalS5. PMID:23404887

  3. Partial androgen insensitivity syndrome caused by a deep intronic mutation creating an alternative splice acceptor site of the AR gene.

    PubMed

    Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu

    2018-02-02

    Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.

  4. RNA Splicing in a New Rhabdovirus from Culex Mosquitoes▿†

    PubMed Central

    Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko

    2011-01-01

    Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae. PMID:21507977

  5. RNA splicing in a new rhabdovirus from Culex mosquitoes.

    PubMed

    Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko

    2011-07-01

    Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae.

  6. A pipeline of programs for collecting and analyzing group II intron retroelement sequences from GenBank

    PubMed Central

    2013-01-01

    Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548

  7. Localized Retroprocessing as a Model of Intron Loss in the Plant Mitochondrial Genome

    PubMed Central

    Cuenca, Argelia; Ross, T. Gregory; Graham, Sean W.; Barrett, Craig F.; Davis, Jerrold I.; Seberg, Ole; Petersen, Gitte

    2016-01-01

    Loss of introns in plant mitochondrial genes is commonly explained by retroprocessing. Under this model, an mRNA is reverse transcribed and integrated back into the genome, simultaneously affecting the contents of introns and edited sites. To evaluate the extent to which retroprocessing explains intron loss, we analyzed patterns of intron content and predicted RNA editing for whole mitochondrial genomes of 30 species in the monocot order Alismatales. In this group, we found an unusually high degree of variation in the intron content, even expanding the hitherto known variation among angiosperms. Some species have lost some two-third of the cis-spliced introns. We found a strong correlation between intron content and editing frequency, and detected 27 events in which intron loss is consistent with the presence of nucleotides in an edited state, supporting retroprocessing. However, we also detected seven cases of intron loss not readily being explained by retroprocession. Our analyses are also not consistent with the entire length of a fully processed cDNA copy being integrated into the genome, but instead indicate that retroprocessing usually occurs for only part of the gene. In some cases, several rounds of retroprocessing may explain intron loss in genes completely devoid of introns. A number of taxa retroprocessing seem to be very common and a possibly ongoing process. It affects the entire mitochondrial genome. PMID:27435795

  8. Novel p53 tumour suppressor mutations in cases of spindle cell sarcoma, pleomorphic sarcoma and fibrosarcoma in cats.

    PubMed

    Mayr, B; Reifinger, M; Alton, K; Schaffner, G

    1998-06-01

    Twenty feline neoplasms were sequenced in the region from exons 5 to 8 for the presence of tumour suppressor gene p53 mutations. In a spindle cell sarcoma of the bladder, a missense mutation (codon 164 AAG-->GAG, lysine-->glutamic acid) in exon 5 was detected. In a pleomorphic sarcoma, a 23 bp deletion involving the splicing junction between intron 5 and exon 6 was observed. In a fibrosarcoma, a 6 bp deletion of p53 covering 2 bp of exon 7 and 4 bp of intron 7, including the splicing junction, was found. The study demonstrates three new p53 mutations in different types of sarcomas in cats.

  9. Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

    PubMed

    Montandon, P E; Vasserot, A; Stutz, E

    1986-01-01

    We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.

  10. Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria

    PubMed Central

    Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée

    2006-01-01

    Background The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). Results A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Conclusion Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis. PMID:16972986

  11. Exon definition as a potential negative force against intron losses in evolution.

    PubMed

    Niu, Deng-Ke

    2008-11-13

    Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Using exons as the units of splicing-site recognition, exon definition constrains the length of exons. An intron-loss event results in fusion of flanking exons and thus a larger exon. The large size of the newborn exon may cause splicing errors, i.e., exon skipping, if the splicing of pre-mRNAs is initiated by exon definition. By contrast, if the splicing of pre-mRNAs is initiated by intron definition, intron loss does not matter. Exon definition may thus be a selective force against intron loss. An organism with a high frequency of exon definition is expected to experience a low rate of intron loss throughout evolution and have a high density of spliceosomal introns. The majority of spliceosomal introns in vertebrates may be maintained during evolution not because of potential functions, but because of their splicing mechanism (i.e., exon definition). Further research is required to determine whether exon definition is a negative force in maintaining the high intron density of vertebrates. This article was reviewed by Dr. Scott W. Roy (nominated by Dr. John Logsdon), Dr.Eugene V. Koonin, and Dr. Igor B. Rogozin (nominated by Dr. Mikhail Gelfand). For the full reviews,please go to the Reviewers' comments section.

  12. The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive changes during the evolution of the Zygnematales

    PubMed Central

    Turmel, Monique; Otis, Christian; Lemieux, Claude

    2005-01-01

    Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178

  13. Comparative Analysis of the Base Compositions of the Pre-mRNA 3′ Cleaved-Off Region and the mRNA 3′ Untranslated Region Relative to the Genomic Base Composition in Animals and Plants

    PubMed Central

    Li, Xiu-Qing

    2014-01-01

    The precursor messenger RNA (pre-mRNA) three-prime cleaved-off region (3′COR) and the mRNA three-prime untranslated region (3′UTR) play critical roles in regulating gene expression. The differences in base composition between these regions and the corresponding genomes are still largely uncharacterized in animals and plants. In this study, the base compositions of non-redundant 3′CORs and 3′UTRs were compared with the corresponding whole genomes of eleven animals, four dicotyledonous plants, and three monocotyledonous (cereal) plants. Among the four bases (A, C, G, and U for adenine, cytosine, guanine, and uracil, respectively), U (which corresponds to T, for thymine, in DNA) was the most frequent, A the second most frequent, G the third most frequent, and C the least frequent in most of the species in both the 3′COR and 3′UTR regions. In comparison with the whole genomes, in both regions the U content was usually the most overrepresented (particularly in the monocotyledonous plants), and the C content was the most underrepresented. The order obtained for the species groups, when ranked from high to low according to the U contents in the 3′COR and 3′UTR was as follows: dicotyledonous plants, monocotyledonous plants, non-mammal animals, and mammals. In contrast, the genomic T content was highest in dicotyledonous plants, lowest in monocotyledonous plants, and intermediate in animals. These results suggest the following: 1) there is a mechanism operating in both animals and plants which is biased toward U and against C in the 3′COR and 3′UTR; 2) the 3′UTR and 3′COR, as functional units, minimized the difference between dicotyledonous and monocotyledonous plants, while the dicotyledonous and monocotyledonous genomes evolved into two extreme groups in terms of base composition. PMID:24941005

  14. Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

    PubMed Central

    Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

    1996-01-01

    Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327

  15. Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

    PubMed

    Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

    1996-12-01

    Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.

  16. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting

    PubMed Central

    Piazza, Carol Lyn; Smith, Dorie

    2018-01-01

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149

  17. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.

    PubMed

    Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene

    2018-06-15

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.

  18. Evolution of the tRNALeu (UAA) Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc

    PubMed Central

    Kaasalainen, Ulla; Olsson, Sanna; Rikkinen, Jouko

    2015-01-01

    The group I intron interrupting the tRNALeu UAA gene (trnL) is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies. PMID:26098760

  19. Evolution of the tRNALeu (UAA) Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc.

    PubMed

    Kaasalainen, Ulla; Olsson, Sanna; Rikkinen, Jouko

    2015-01-01

    The group I intron interrupting the tRNALeu UAA gene (trnL) is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies.

  20. Chromosomal localization and partial genomic structure of the human peroxisome proliferator activated receptor-gamma (hPPAR gamma) gene.

    PubMed

    Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R

    1997-04-28

    We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.

  1. Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martin, L.H.; Calabi, F.; Lefebvre, F.A.

    1987-12-01

    The CD1 human antigens are a family of at least three components, CD1a, CD1b, and CD1c, that are characteristic of the cortical stage of thymocyte maturation. CD1a was originally named HTA1 or T6 and thought to be the human equivalent of mouse Tla. The genes coding for all three have not been identified by transfection into mouse cells. The transfectants express the surface antigens that can then be recognized by the corresponding cluster of monoclonal antibodies used to define the three members of CD1. The full sequence of the genomic DNA is described for all three. The intron-exon structure ofmore » CD1a is deduced by comparison with a near-full-length cDNA clone. Similar structures are proposed for the other two, largely based on sequence homology. An unusually long 5'-untranslated exon (280 bases long) is highly conserved between the three genes, suggesting an important but unknown function. CD1c has a duplicated form of this exon that is thought to be spliced out. The major homology between the three antigens is in the ..beta../sub 2/-microglobulin-binding-domain. The general relatedness to major histocompatibility complex class I and class II molecules is significant but low, with no section of higher homology to mouse Tla.« less

  2. Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

    PubMed

    Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

    2009-07-01

    Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (

  3. Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence

    PubMed Central

    Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.

    2009-01-01

    Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168

  4. Characterization of a marsupial sperm protamine gene and its transcripts from the North American opossum (Didelphis marsupialis).

    PubMed

    Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H

    1993-07-01

    A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.

  5. Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges

    PubMed Central

    Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena

    2012-01-01

    Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015

  6. Structural and functional characterization of ribosomal protein gene introns in sponges.

    PubMed

    Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena

    2012-01-01

    Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.

  7. Exon–intron organization of genes in the slime mold Physarum polycephalum

    PubMed Central

    Trzcinska-Danielewicz, Joanna; Fronk, Jan

    2000-01-01

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858

  8. SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.

    PubMed

    Trzcinska-Danielewicz, J; Fronk, J

    2000-09-15

    The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.

  9. Evolution of Mhc-DRB introns: implications for the origin of primates.

    PubMed

    Kupfermann, H; Satta, Y; Takahata, N; Tichy, H; Klein, J

    1999-06-01

    Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders-Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera.

  10. Molecular gene organisation and secondary structure of the mitochondrial large subunit ribosomal RNA from the cultivated Basidiomycota Agrocybe aegerita: a 13 kb gene possessing six unusual nucleotide extensions and eight introns.

    PubMed

    Gonzalez, P; Barroso, G; Labarère, J

    1999-04-01

    The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A.aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.

  11. Cross-talk between PRMT1-mediated methylation and ubiquitylation on RBM15 controls RNA splicing.

    PubMed

    Zhang, Li; Tran, Ngoc-Tung; Su, Hairui; Wang, Rui; Lu, Yuheng; Tang, Haiping; Aoyagi, Sayura; Guo, Ailan; Khodadadi-Jamayran, Alireza; Zhou, Dewang; Qian, Kun; Hricik, Todd; Côté, Jocelyn; Han, Xiaosi; Zhou, Wenping; Laha, Suparna; Abdel-Wahab, Omar; Levine, Ross L; Raffel, Glen; Liu, Yanyan; Chen, Dongquan; Li, Haitao; Townes, Tim; Wang, Hengbin; Deng, Haiteng; Zheng, Y George; Leslie, Christina; Luo, Minkui; Zhao, Xinyang

    2015-11-17

    RBM15, an RNA binding protein, determines cell-fate specification of many tissues including blood. We demonstrate that RBM15 is methylated by protein arginine methyltransferase 1 (PRMT1) at residue R578, leading to its degradation via ubiquitylation by an E3 ligase (CNOT4). Overexpression of PRMT1 in acute megakaryocytic leukemia cell lines blocks megakaryocyte terminal differentiation by downregulation of RBM15 protein level. Restoring RBM15 protein level rescues megakaryocyte terminal differentiation blocked by PRMT1 overexpression. At the molecular level, RBM15 binds to pre-messenger RNA intronic regions of genes important for megakaryopoiesis such as GATA1, RUNX1, TAL1 and c-MPL. Furthermore, preferential binding of RBM15 to specific intronic regions recruits the splicing factor SF3B1 to the same sites for alternative splicing. Therefore, PRMT1 regulates alternative RNA splicing via reducing RBM15 protein concentration. Targeting PRMT1 may be a curative therapy to restore megakaryocyte differentiation for acute megakaryocytic leukemia.

  12. Association of ESR1 gene tagging SNPs with breast cancer risk

    PubMed Central

    Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia

    2009-01-01

    We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777

  13. Relationship of the Interaction Between Two Quantitative Trait Loci with γ-Globin Expression in β-Thalassemia Intermedia Patients.

    PubMed

    NickAria, Shiva; Haghpanah, Sezaneh; Ramzi, Mani; Karimi, Mehran

    2018-05-10

    Globin switching is a significant factor on blood hemoglobin (Hb) level but its molecular mechanisms have not yet been identified, however, several quantitative trait loci (QTL) and polymorphisms involved regions on chromosomes 2p, 6q, 8q and X account for variation in the γ-globin expression level. We studied the effect of interaction between a region on intron six of the TOX gene, chromosome 8q (chr8q) and XmnI locus on the γ-globin promoter, chr11p on γ-globin expression in 150 β-thalassemia intermedia (β-TI) patients, evaluated by statistical interaction analysis. Our results showed a significant interaction between one QTL on intron six of the TOX gene (rs9693712) and XmnI locus that effect γ-globin expression. Interchromosomal interaction mediates through transcriptional machanisms to preserve true genome architectural features, chromosomes localization and DNA bending. This interaction can be a part of the unknown molecular mechanism of globin switching and regulation of gene expression.

  14. Forks in the tracks: Group II introns, spliceosomes, telomeres and beyond.

    PubMed

    Agrawal, Rajendra Kumar; Wang, Hong-Wei; Belfort, Marlene

    2016-12-01

    Group II introns are large catalytic RNAs that form a ribonucleoprotein (RNP) complex by binding to an intron-encoded protein (IEP). The IEP, which facilitates both RNA splicing and intron mobility, has multiple activities including reverse transcriptase. Recent structures of a group II intron RNP complex and of IEPs from diverse bacteria fuel arguments that group II introns are ancestrally related to eukaryotic spliceosomes as well as to telomerase and viruses. Furthermore, recent structural studies of various functional states of the spliceosome allow us to draw parallels between the group II intron RNP and the spliceosome. Here we present an overview of these studies, with an emphasis on the structure of the IEPs in their isolated and RNA-bound states and on their evolutionary relatedness. In addition, we address the conundrum of the free, albeit truncated IEPs forming dimers, whereas the IEP bound to the intron ribozyme is a monomer in the mature RNP. Future studies needed to resolve some of the outstanding issues related to group II intron RNP function and dynamics are also discussed.

  15. Identification, characterization and functional analysis of regulatory region of nanos gene from half-smooth tongue sole (Cynoglossus semilaevis).

    PubMed

    Huang, Jinqiang; Li, Yongjuan; Shao, Changwei; Wang, Na; Chen, Songlin

    2017-06-20

    The nanos gene encodes an RNA-binding zinc finger protein, which is required in the development and maintenance of germ cells. However, there is very limited information about nanos in flatfish, which impedes its application in fish breeding. In this study, we report the molecular cloning, characterization and functional analysis of the 3'-untranslated region of the nanos gene (Csnanos) from half-smooth tongue sole (Cynoglossus semilaevis), which is an economically important flatfish in China. The 1233-bp cDNA sequence, 1709-bp genomic sequence and flanking sequences (2.8-kb 5'- and 1.6-kb 3'-flanking regions) of Csnanos were cloned and characterized. Sequence analysis revealed that CsNanos shares low homology with Nanos in other species, but the zinc finger domain of CsNanos is highly similar. Phylogenetic analysis indicated that CsNanos belongs to the Nanos2 subfamily. Csnanos expression was widely detected in various tissues, but the expression level was higher in testis and ovary. During early development and sex differentiation, Csnanos expression exhibited a clear sexually dimorphic pattern, suggesting its different roles in the migration and differentiation of primordial germ cells (PGCs). Higher expression levels of Csnanos mRNA in normal females and males than in neomales indicated that the nanos gene may play key roles in maintaining the differentiation of gonad. Moreover, medaka PGCs were successfully labeled by the microinjection of synthesized mRNA consisting of green fluorescence protein and the 3'-untranslated region of Csnanos. These findings provide new insights into nanos gene expression and function, and lay the foundation for further study of PGC development and applications in tongue sole breeding. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Various mutations compensate for a deleterious lacZα insert in the replication enhancer of M13 bacteriophage

    PubMed Central

    Zygiel, Emily M.; Noren, Karen A.; Adamkiewicz, Marta A.; Aprile, Richard J.; Bowditch, Heather K.; Carroll, Christine L.; Cerezo, Maria Abigail S.; Dagher, Adelle M.; Hebert, Courtney R.; Hebert, Lauren E.; Mahame, Gloria M.; Milne, Stephanie C.; Silvestri, Kelly M.; Sutherland, Sara E.; Sylvia, Alexandria M.; Taveira, Caitlyn N.; VanValkenburgh, David J.; Noren, Christopher J.

    2017-01-01

    M13 and other members of the Ff class of filamentous bacteriophages have been extensively employed in myriad applications. The Ph.D. series of phage-displayed peptide libraries were constructed from the M13-based vector M13KE. As a direct descendent of M13mp19, M13KE contains the lacZα insert in the intergenic region between genes IV and II, where it interrupts the replication enhancer of the (+) strand origin. Phage carrying this 816-nucleotide insert are viable, but propagate in E. coli at a reduced rate compared to wild-type M13 phage, presumably due to a replication defect caused by the insert. We have previously reported thirteen compensatory mutations in the 5’-untranslated region of gene II, which encodes the replication initiator protein gIIp. Here we report several additional mutations in M13KE that restore a wild-type propagation rate. Several clones from constrained-loop variable peptide libraries were found to have ejected the majority of lacZα gene in order to reconstruct the replication enhancer, albeit with a small scar. In addition, new point mutations in the gene II 5’-untranslated region or the gene IV coding sequence have been spontaneously observed or synthetically engineered. Through phage propagation assays, we demonstrate that all these genetic modifications compensate for the replication defect in M13KE and restore the wild-type propagation rate. We discuss the mechanisms by which the insertion and ejection of the lacZα gene, as well as the mutations in the regulatory region of gene II, influence the efficiency of replication initiation at the (+) strand origin. We also examine the presence and relevance of fast-propagating mutants in phage-displayed peptide libraries. PMID:28445507

  17. Various mutations compensate for a deleterious lacZα insert in the replication enhancer of M13 bacteriophage.

    PubMed

    Zygiel, Emily M; Noren, Karen A; Adamkiewicz, Marta A; Aprile, Richard J; Bowditch, Heather K; Carroll, Christine L; Cerezo, Maria Abigail S; Dagher, Adelle M; Hebert, Courtney R; Hebert, Lauren E; Mahame, Gloria M; Milne, Stephanie C; Silvestri, Kelly M; Sutherland, Sara E; Sylvia, Alexandria M; Taveira, Caitlyn N; VanValkenburgh, David J; Noren, Christopher J; Hall, Marilena Fitzsimons

    2017-01-01

    M13 and other members of the Ff class of filamentous bacteriophages have been extensively employed in myriad applications. The Ph.D. series of phage-displayed peptide libraries were constructed from the M13-based vector M13KE. As a direct descendent of M13mp19, M13KE contains the lacZα insert in the intergenic region between genes IV and II, where it interrupts the replication enhancer of the (+) strand origin. Phage carrying this 816-nucleotide insert are viable, but propagate in E. coli at a reduced rate compared to wild-type M13 phage, presumably due to a replication defect caused by the insert. We have previously reported thirteen compensatory mutations in the 5'-untranslated region of gene II, which encodes the replication initiator protein gIIp. Here we report several additional mutations in M13KE that restore a wild-type propagation rate. Several clones from constrained-loop variable peptide libraries were found to have ejected the majority of lacZα gene in order to reconstruct the replication enhancer, albeit with a small scar. In addition, new point mutations in the gene II 5'-untranslated region or the gene IV coding sequence have been spontaneously observed or synthetically engineered. Through phage propagation assays, we demonstrate that all these genetic modifications compensate for the replication defect in M13KE and restore the wild-type propagation rate. We discuss the mechanisms by which the insertion and ejection of the lacZα gene, as well as the mutations in the regulatory region of gene II, influence the efficiency of replication initiation at the (+) strand origin. We also examine the presence and relevance of fast-propagating mutants in phage-displayed peptide libraries.

  18. Elements in the murine c-mos messenger RNA 5'-untranslated region repress translation of downstream coding sequences.

    PubMed

    Steel, L F; Telly, D L; Leonard, J; Rice, B A; Monks, B; Sawicki, J A

    1996-10-01

    Murine c-mos transcripts isolated from testes have 5'-untranslated regions (5'UTRs) of approximately 300 nucleotides with a series of four overlapping open reading frames (ORFs) upstream of the AUG codon that initiates the Mos ORF. Ovarian c-mos transcripts have shorter 5'UTRs (70-80 nucleotides) and contain only 1-2 of the upstream ORFs (uORFs). To test whether these 5'UTRs affect translational efficiency, we have constructed plasmids for the expression of chimeric transcripts with a mos-derived 5'UTR fused to the Escherichia coli beta-galactosidase coding region. Translational efficiency has been evaluated by measuring beta-galactosidase activity NIH3T3 cells transiently transfected with these plasmids and with plasmids where various mutations have been introduced into the 5'UTR. We show that the 5'UTR characteristic of testis-specific c-mos mRNA strongly represses translation relative to the translation of transcripts that contain a 5'UTR derived from beta-globin mRNA, and this is mainly due to the four uORFs. Each of the four upstream AUG triplets can be recognized as a start site for translation, and no single uAUG dominates the repressive effect. The uORFs repress translation by a mechanism that is not affected by the amino acid sequence in the COOH-terminal region of the uORF-encoded peptides. The very short uORF (AUGUGA) present in ovary-specific transcripts does not repress translation. Staining of testis sections from transgenic mice carrying chimeric beta-galactosidase transgene constructs, which contain a mos 5'UTR with or without the uATGs, suggests that the uORFs can dramatically change the pattern of expression in spermatogenic cells.

  19. RNA Sequencing of the Exercise Transcriptome in Equine Athletes

    PubMed Central

    Verini-Supplizi, Andrea; Barcaccia, Gianni; Albiero, Alessandro; D'Angelo, Michela; Campagna, Davide; Valle, Giorgio; Felicetti, Michela; Silvestrelli, Maurizio; Cappelli, Katia

    2013-01-01

    The horse is an optimal model organism for studying the genomic response to exercise-induced stress, due to its natural aptitude for athletic performance and the relative homogeneity of its genetic and environmental backgrounds. Here, we applied RNA-sequencing analysis through the use of SOLiD technology in an experimental framework centered on exercise-induced stress during endurance races in equine athletes. We monitored the transcriptional landscape by comparing gene expression levels between animals at rest and after competition. Overall, we observed a shift from coding to non-coding regions, suggesting that the stress response involves the differential expression of not annotated regions. Notably, we observed significant post-race increases of reads that correspond to repeats, especially the intergenic and intronic L1 and L2 transposable elements. We also observed increased expression of the antisense strands compared to the sense strands in intronic and regulatory regions (1 kb up- and downstream) of the genes, suggesting that antisense transcription could be one of the main mechanisms for transposon regulation in the horse under stress conditions. We identified a large number of transcripts corresponding to intergenic and intronic regions putatively associated with new transcriptional elements. Gene expression and pathway analysis allowed us to identify several biological processes and molecular functions that may be involved with exercise-induced stress. Ontology clustering reflected mechanisms that are already known to be stress activated (e.g., chemokine-type cytokines, Toll-like receptors, and kinases), as well as “nucleic acid binding” and “signal transduction activity” functions. There was also a general and transient decrease in the global rates of protein synthesis, which would be expected after strenuous global stress. In sum, our network analysis points toward the involvement of specific gene clusters in equine exercise-induced stress, including those involved in inflammation, cell signaling, and immune interactions. PMID:24391776

  20. [Polymorphism in the Serotonin Transporter Gene (SLC6A4) and Emotional Bipolar Disorder in Two Regional Mental Health Centers from the Eje Cafetero (Colombia)].

    PubMed

    Ramos, Lucero Rengifo; Arias, Duverney Gaviria; Salazar, Liliana Salazar; Vélez, Juan Pablo; Pardo, Stella Lozano

    2012-03-01

    The indel polymorphisms in the promoting region and the 2(nd) intron polymorphisms in the serotonin transporter gene (SLC6A4) have been associated to bipolar disorder 1 (BD1) in several population studies. The objective was to analyze the genotypic and allelic frequencies in both gene regions in a study of cases and controls with individuals from Risaralda and Quindío (Colombia) so as to establish possible associations to BD1, and compare results with previous and similar studies. 133 patients and 120 controls were studied. L and S indel polymorphisms in the promoting region were analyzed by PCR, together with VNTR STin2.10 and STin 2.12 VNTRs polymorphisms in the 2(nd) intron of the SL-C6A4 gene Genotypic and allelic frequencies for the S and L polymorphisms were similar both in cases and controls. However, the LL genotype was significantly increased both in BD1 population (OR=1.89; CI95%=1.1-3.68), and when discriminated by gender. This particular genotype in general population is OR=2.22; IC95%=1.04-5.66 for women, and OR=1.62; IC 95%=0.71-4.39 for men. No significant genotypic and allelic differences were found for VNTR STin2.10 and STin 2.12. polymorphisms. No association was found between polymorphisms of 5-HTTLPR polymorphisms and the 2(nd) intron of the serotonin transporting gene in general patients with BD1, nor when compared by gender. Our results are similar to those reported for Caucasian populations and differ from those of Asian and Brazilian populations. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.

Top