Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.
Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel
2004-10-01
Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T
1993-09-01
Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.
Olsson, Sanna; Kaasalainen, Ulla; Rikkinen, Jouko
2012-02-01
In this study we reconstruct the structural evolution of the hyper-variable P6b region of the group I trnLeu intron in a monophyletic group of lichen-symbiotic Nostoc strains and establish it as a useful marker in the phylogenetic analysis of these organisms. The studied cyanobacteria occur as photosynthetic and/or nitrogen-fixing symbionts in lichen species of the diverse Nephroma guild. Phylogenetic analyses and secondary structure reconstructions are used to improve the understanding of the replication mechanisms in the P6b stem-loop and to explain the observed distribution patterns of indels. The variants of the P6b region in the Nostoc clade studied consist of different combinations of five sequence modules. The distribution of indels together with the ancestral character reconstruction performed enables the interpretation of the evolution of each sequence module. Our results indicate that the indel events are usually associated with single nucleotide changes in the P6b region and have occurred several times independently. In spite of their homoplasy, they provide phylogenetic information for closely related taxa. Thus we recognize that features of the P6b region can be used as molecular markers for species identification and phylogenetic studies involving symbiotic Nostoc cyanobacteria.
Kim, Hyoung Tae; Kim, Ki-Joong
2014-01-01
Comparative analyses of complete chloroplast (cp) DNA sequences within a species may provide clues to understand the population dynamics and colonization histories of plant species. Equisetum arvense (Equisetaceae) is a widely distributed fern species in northeastern Asia, Europe, and North America. The complete cp DNA sequences from Asian and American E. arvense individuals were compared in this study. The Asian E. arvense cp genome was 583 bp shorter than that of the American E. arvense. In total, 159 indels were observed between two individuals, most of which were concentrated on the hypervariable trnY-trnE intergenic spacer (IGS) in the large single-copy (LSC) region of the cp genome. This IGS region held a series of 19 bp repeating units. The numbers of the 19 bp repeat unit were responsible for 78% of the total length difference between the two cp genomes. Furthermore, only other closely related species of Equisetum also show the hypervariable nature of the trnY-trnE IGS. By contrast, only a single indel was observed in the gene coding regions: the ycf1 gene showed 24 bp differences between the two continental individuals due to a single tandem-repeat indel. A total of 165 single-nucleotide polymorphisms (SNPs) were recorded between the two cp genomes. Of these, 52 SNPs (31.5%) were distributed in coding regions, 13 SNPs (7.9%) were in introns, and 100 SNPs (60.6%) were in intergenic spacers (IGS). The overall difference between the Asian and American E. arvense cp genomes was 0.12%. Despite the relatively high genetic diversity between Asian and American E. arvense, the two populations are recognized as a single species based on their high morphological similarity. This indicated that the two regional populations have been in morphological stasis. PMID:25157804
Genetic examination of the putative skull of Jan Kochanowski reveals its female sex
Kupiec, Tomasz; Branicki, Wojciech
2011-01-01
We report the results of genetic examination of the putative skull of Jan Kochanowski (1530-1584), a great Polish renaissance poet. The skull was retrieved in 1791 by historian Tadeusz Czacki from the Kochanowski family tomb and became the property of the Czartoryskis Museum in Krakow. An anthropological study in 1926 questioned its male origin, which raised doubts about its authenticity. Our report presents genetic evidence that resolves this dispute. From the sole tooth we obtained a sufficient amount of DNA to perform the analysis of nuclear markers. The analysis of the sex-informative part of intron 1 in amelogenin, genotyped using AmpFiSTR® NGM PCR Amplification Kit and Powerplex® ESI17 Kit human identification systems, revealed the female origin of the tooth. The female origin was further confirmed by the analysis of a portion of amelogenin intron 2, a microsatellite marker located on the X chromosome, as well as by a lack of signal from Y chromosomal microsatellite markers and the sex-determining region Y marker. Data obtained for two hypervariable regions, HVI and HVII, in mitochondrial DNA showed that mtDNA haplotype was relatively frequent among contemporary Europeans. The analysis of a set of single nucleotide polymorphisms relevant for prediction of the iris color indicated an 87% probability that the woman had hazel or brown eye color. PMID:21674838
Genetic examination of the putative skull of Jan Kochanowski reveals its female sex.
Kupiec, Tomasz; Branicki, Wojciech
2011-06-01
We report the results of genetic examination of the putative skull of Jan Kochanowski (1530-1584), a great Polish renaissance poet. The skull was retrieved in 1791 by historian Tadeusz Czacki from the Kochanowski family tomb and became the property of the Czartoryskis Museum in Krakow. An anthropological study in 1926 questioned its male origin, which raised doubts about its authenticity. Our report presents genetic evidence that resolves this dispute. From the sole tooth we obtained a sufficient amount of DNA to perform the analysis of nuclear markers. The analysis of the sex-informative part of intron 1 in amelogenin, genotyped using AmpFiSTR® NGM PCR Amplification Kit and Powerplex® ESI17 Kit human identification systems, revealed the female origin of the tooth. The female origin was further confirmed by the analysis of a portion of amelogenin intron 2, a microsatellite marker located on the X chromosome, as well as by a lack of signal from Y chromosomal microsatellite markers and the sex-determining region Y marker. Data obtained for two hypervariable regions, HVI and HVII, in mitochondrial DNA showed that mtDNA haplotype was relatively frequent among contemporary Europeans. The analysis of a set of single nucleotide polymorphisms relevant for prediction of the iris color indicated an 87% probability that the woman had hazel or brown eye color.
Frequencies of VNTR and RFLP polymorphisms associated with factor VIII gene in Singapore
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fong, I.; Lai, P.S.; Ouah, T.C.
1994-09-01
The allelic frequency of any polymorphism within a population determines its usefulness for genetic counselling. This is important in populations of non-Caucasian origin as RFLPs may significantly differ among ethnic groups. We report a study of five intragenic polymorphisms in factor VIII gene carried out in Singapore. The three PCR-based RFLP markers studied were Intron 18/Bcl I, Intron 19/Hind III and Intron 22/Xba I. In an analysis of 148 unrelated normal X chromosomes, the allele frequencies were found to be A1 = 0.18, A2 = 0.82 (Bcl I RFLP), A1 = 0.80, A2 = 0.20 (Hind III RFLP) and A1more » = 0.58, and A2 = 0.42 (Xba I RFLP). The heterozygosity rates of 74 females analyzed separately were 31%, 32% and 84.2%, respectively. Linkage disequilibrium was also observed to some degree between Bcl I and Hind III polymorphism in our population. We have also analyzed a sequence polymorphism in Intron 7 using hybridization with radioactive-labelled {sup 32}P allele-specific oligonucleotide probes. This polymorphism was not very polymorphic in our population with only 2% of 117 individuals analyzed being informative. However, the use of a hypervariable dinucleotide repeat sequence (VNTR) in Intron 13 showed that 25 of our of 27 (93%) females were heterozygous. Allele frequencies ranged from 1 to 55 %. We conclude that a viable strategy for molecular analysis of Hemophilia A families in our population should include the use of Intron 18/Bcl I and Intron 22/Xba I RFLP markers and the Intron 13 VNTR marker.« less
Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng
2016-01-01
Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng
2016-01-01
Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros ‘Jinzaoshi’ were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. ‘Jinzaoshi’, support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423
Williams, R R; Hassan-Walker, A F; Lavender, F L; Morgan, M; Faik, P; Ragoussis, J
2001-05-16
Minisatellites are tandemly repeated DNA sequences found throughout the genomes of all eukaryotes. They are regions often prone to instability and hence hypervariability; thus repeat unit sequence is generally not conserved beyond closely related species. We have studied the minisatellite located in intron 9 of the human glucose phosphate isomerase (GPI) gene (also known as neuroleukin, autocrine motility factor, maturation and differentiation factor) and have found, by Zoo blotting coupled with PCR amplification and DNA sequencing, that similar repeat units are present in seven other species of mammal. There is also evidence for the presence of the minisatellite in chicken. The repeat unit does not appear to be present at any other locus in these genomes. Minisatellite DNA has been reported to be involved in recombination activity, control of gene expression of nearby gene(s) (both transcriptional and translational), whilst others form protein coding regions. The high level of conservation exhibited by the GPI minisatellite, coupled with the unique location, strongly suggests a functional role. Our results from transient and stable transfections using luciferase reporter constructs have shown that the GPI minisatellite region can act to increase transcription from the SV40 promoter, CMV promoter and the human GPI promoter.
PRRSV strain VR-2332 Nsp2 deletion mutants attenuate clinical symptoms in swine
USDA-ARS?s Scientific Manuscript database
PRRSV nonstructural protein 2 (nsp2) contains a N-terminal cysteine proteinase (PL2) domain, a middle hypervariable region and C-terminal putative transmembrane domain. Prior studies had shown that as much as 403 amino acids could be removed from the hypervariable region without losing virus viabil...
Javan, Gulnaz T; Finley, Sheree J; Smith, Tasia; Miller, Joselyn; Wilkinson, Jeremy E
2017-01-01
Human thanatomicrobiome studies have established that an abundant number of putrefactive bacteria within internal organs of decaying bodies are obligate anaerobes, Clostridium spp. These microorganisms have been implicated as etiological agents in potentially life-threatening infections; notwithstanding, the scale and trajectory of these microbes after death have not been elucidated. We performed phylogenetic surveys of thanatomicrobiome signatures of cadavers' internal organs to compare the microbial diversity between the 16S rRNA gene V4 hypervariable region and V3-4 conjoined regions from livers and spleens of 45 cadavers undergoing forensic microbiological studies. Phylogenetic analyses of 16S rRNA gene sequences revealed that the V4 region had a significantly higher mean Chao1 richness within the total microbiome data. Permutational multivariate analysis of variance statistical tests, based on unweighted UniFrac distances, demonstrated that taxa compositions were significantly different between V4 and V3-4 hypervariable regions ( p < 0.001). Of note, we present the first study, using the largest cohort of criminal cases to date, that two hypervariable regions show discriminatory power for human postmortem microbial diversity. In conclusion, here we propose the impact of hypervariable region selection for the 16S rRNA gene in differentiating thanatomicrobiomic profiles to provide empirical data to explain a unique concept, the Postmortem Clostridium Effect.
Molecular characterization of the canine mitochondrial DNA control region for forensic applications.
Eichmann, Cordula; Parson, Walther
2007-09-01
The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.
Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.
2017-01-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Mulder, Kevin P; Cortazar-Chinarro, Maria; Harris, D James; Crottini, Angelica; Campbell Grant, Evan H; Fleischer, Robert C; Savage, Anna E
2017-11-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Carreno, R A; Barta, J R
1998-11-01
The small subunit ribosomal RNA (SSU rRNA) genes of hippoboscid (Ornithoica vicina Walker) and tabanid (Chrysops niger Macquart) Diptera were sequenced to determine their phylogenetic position within the order and to determine whether or not extensive hypervariable regions in this gene are widespread in the Diptera. A parsimony analysis of an alignment containing 8 dipteran sequences produced a single most parsimonious tree that placed O. vicina as sister group to Drosophila melanogaster Meigen. The tabanid Chrysops niger was sister group to the asilomorphan taxa, and the sister group to the Brachycera was a Tipula sp. although this relationship was not supported by bootstrap analysis. The hippoboscid and tabanid sequences contain extensive hypervariable regions in the V2, V4, V6, and V7 regions as do other Diptera. When these regions of the alignment were excluded from the phylogenetic analysis, a single most parsimonious tree was found. This tree had an identical overall topology to the tree obtained from the total data set. The hypervariable regions in parts of the dipteran SSU rRNA genes were more extensive in the nematocerous dipteran sequences used in this study than in the other dipteran representatives; these hypervariable regions may be of more utility in inferring relationship among species and subspecies than at the suprageneric level.
NASA Astrophysics Data System (ADS)
Zhao, Xiaoqing; Li, Hong; Bao, Tonglaga; Ying, Zhiqiang
2012-09-01
Many experiment evidences showed that sequence structures of introns and intron loss/gain can influence gene expression, but current mechanisms did not refer to the functions of post-spliced introns directly. We propose that postspliced introns play their functions in gene expression by interacting with their mRNA sequences and the interaction is characterized by the matched segments between introns and their CDS. In this study, we investigated the interaction characters with length series by improved Smith-Waterman local alignment software for the ribosomal protein genes in C. elegans and D. melanogaster. Our results showed that RF values of five intron groups are significantly high in the central non-conserved region and very low in 5'-end and 3'-end splicing region. It is interesting that the number of the optimal matched regions gradually increases with intron length. Distributions of the optimal matched regions are different for five intron groups. Our study revealed that there are more interaction regions between longer introns and their CDS than shorter, and it provides a positive pattern for regulating the gene expression.
Barb, Jennifer J; Oler, Andrew J; Kim, Hyung-Suk; Chalmers, Natalia; Wallen, Gwenyth R; Cashion, Ann; Munson, Peter J; Ames, Nancy J
2016-01-01
There is much speculation on which hypervariable region provides the highest bacterial specificity in 16S rRNA sequencing. The optimum solution to prevent bias and to obtain a comprehensive view of complex bacterial communities would be to sequence the entire 16S rRNA gene; however, this is not possible with second generation standard library design and short-read next-generation sequencing technology. This paper examines a new process using seven hypervariable or V regions of the 16S rRNA (six amplicons: V2, V3, V4, V6-7, V8, and V9) processed simultaneously on the Ion Torrent Personal Genome Machine (Life Technologies, Grand Island, NY). Four mock samples were amplified using the 16S Ion Metagenomics Kit™ (Life Technologies) and their sequencing data is subjected to a novel analytical pipeline. Results are presented at family and genus level. The Kullback-Leibler divergence (DKL), a measure of the departure of the computed from the nominal bacterial distribution in the mock samples, was used to infer which region performed best at the family and genus levels. Three different hypervariable regions, V2, V4, and V6-7, produced the lowest divergence compared to the known mock sample. The V9 region gave the highest (worst) average DKL while the V4 gave the lowest (best) average DKL. In addition to having a high DKL, the V9 region in both the forward and reverse directions performed the worst finding only 17% and 53% of the known family level and 12% and 47% of the genus level bacteria, while results from the forward and reverse V4 region identified all 17 family level bacteria. The results of our analysis have shown that our sequencing methods using 6 hypervariable regions of the 16S rRNA and subsequent analysis is valid. This method also allowed for the assessment of how well each of the variable regions might perform simultaneously. Our findings will provide the basis for future work intended to assess microbial abundance at different time points throughout a clinical protocol.
Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y
1981-01-01
The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776
Yu, Zhongtang; García-González, Rubén; Schanbacher, Floyd L.; Morrison, Mark
2008-01-01
Different hypervariable (V) regions of the archaeal 16S rRNA gene (rrs) were compared systematically to establish a preferred V region(s) for use in Archaea-specific PCR-denaturing gradient gel electrophoresis (DGGE). The PCR products of the V3 region produced the most informative DGGE profiles and permitted identification of common methanogens from rumen samples from sheep. This study also showed that different methanogens might be detected when different V regions are targeted by PCR-DGGE. Dietary fat appeared to transiently stimulate Methanosphaera stadtmanae but inhibit Methanobrevibacter sp. strain AbM4 in rumen samples. PMID:18083874
Remarkable sequence conservation of the last intron in the PKD1 gene.
Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P
2003-10-01
The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
Diniz, Fabio M; Maclean, Norman; Ogawa, Masayoshi; Cintra, Israel H A; Bentzen, Paul
2005-01-01
Atlantic spiny lobsters support major fisheries in northeastern Brazilian waters and in the Caribbean Sea. To avoid reduction in diversity and elimination of distinct stocks, understanding their population dynamics, including structuring of populations and genetic diversity, is critical. We here explore the potential of using the hypervariable domain in the control region of the mitochondrial DNA as a genetic marker to characterize population subdivision in spiny lobsters, using Panulirus argus as the species model. The primers designed on the neighboring conserved genes have amplified the entire control region (approx. 780 bases) of P. argus and other closely related species. Average nucleotide and haplotype diversity within P. argus were found to be high, and population structuring was hypothesized. The data suggest a division of P. argus into genetically different phylogeographic groups. The hypervariable domain seems to be useful for determining genetic differentiation of geographically distinct stocks of P. argus and other Atlantic spiny lobsters.
Individual specific DNA fingerprints from a hypervariable region probe: alpha-globin 3'HVR.
Fowler, S J; Gill, P; Werrett, D J; Higgs, D R
1988-06-01
A probe detecting a hypervariable region (HVR) 3' to the alpha globin locus on chromosome 16 has been used to produce DNA fingerprints. Segregation analysis has revealed multiple, randomly dispersed DNA fragments inherited in a Mendelian fashion with minimal allelism and linkage. The fingerprints are highly polymorphic (probability of chance association between random individuals much less than 10(-14]. The probe is, therefore, a powerful discriminating tool: it is envisaged that this probe will have forensic applications, including paternity cases, and will be informative in linkage analysis.
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei
2014-09-10
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Sharma, Swarkar; Saha, Anjana; Rai, Ekta; Bhat, Audesh; Bamezai, Ramesh
2005-01-01
We have analysed the hypervariable regions (HVR I and II) of human mitochondrial DNA (mtDNA) in individuals from Uttar Pradesh (UP), Bihar (BI) and Punjab (PUNJ), belonging to the Indo-European linguistic group, and from South India (SI), that have their linguistic roots in Dravidian language. Our analysis revealed the presence of known and novel mutations in both hypervariable regions in the studied population groups. Median joining network analyses based on mtDNA showed extensive overlap in mtDNA lineages despite the extensive cultural and linguistic diversity. MDS plot analysis based on Fst distances suggested increased maternal genetic proximity for the studied population groups compared with other world populations. Mismatch distribution curves, respective neighbour joining trees and other statistical analyses showed that there were significant expansions. The study revealed an ancient common ancestry for the studied population groups, most probably through common founder female lineage(s), and also indicated that human migrations occurred (maybe across and within the Indian subcontinent) even after the initial phase of female migration to India.
Phylogenetic analysis of Tibetan mastiffs based on mitochondrial hypervariable region I.
Ren, Zhanjun; Chen, Huiling; Yang, Xuejiao; Zhang, Chengdong
2017-03-01
Recently, the number of Tibetan mastiffs, which is a precious germplasm resource and cultural heritage, is decreasing sharply. Therefore, the genetic diversity of Tibetan mastiffs needs to be studied to clarify its phylogenetics relationships and lay the foundation for resource protection, rational development and utilization of Tibetan mastiffs. We sequenced hypervariable region I of mitochondrial DNA (mtDNA) of 110 individuals from Tibet region and Gansu province. A total of 12 polymorphic sites were identified which defined eight haplotypes of which H4 and H8 were unique to Tibetan population with H8 being identified first. The haplotype diversity (Hd: 0.808), nucleotide diversity (Pi: 0.603%), the average number of nucleotide difference (K: 3.917) of Tibetan mastiffs from Gansu were higher than those from Tibet region (Hd: 0.794; Pi: 0.589%; K: 3.831), which revealed higher genetic diversity in Gansu. In terms of total population, the genetic variation was low. The median-joining network and phylogenetic tree based on the mtDNA hypervariable region I showed that Tibetan mastiffs originated from grey wolves, as the other domestic dogs and had different history of maternal origin. The mismatch distribution analysis and neutrality tests indicated that Tibetan mastiffs were in genetic equilibrium or in a population decline.
Yuasa, Isao; Jin, Feng; Harihara, Shinji; Matsusue, Aya; Fujihara, Junko; Takeshita, Haruo; Akane, Atsushi; Umetsu, Kazuo; Saitou, Naruya; Chattopadhyay, Prasanta K
2013-09-01
Previous studies of four populations revealed that a hypervariable short tandem repeat (iSTR) in intron 7 of the human complement factor I (CFI) gene on chromosome 4q was unique, with 17 possible East Asian-specific group H alleles observed at relatively high frequencies. To develop a deeper anthropological and forensic understanding of iSTR, 1161 additional individuals from 11 Asian populations were investigated. Group H alleles of iSTR and c.1217A allele of a SNP in exon 11 of the CFI gene were associated with each other and were almost entirely confined to East Asian populations. Han Chinese in Changsha, southern China, showed the highest frequency for East Asian-specific group H alleles (0.201) among 15 populations. Group H alleles were observed to decrease gradually from south to north in 11 East Asian populations. This expansion of group H alleles provides evidence that southern China and Southeast Asia are a hotspot of Asian diversity and a genetic reservoir of Asians after they entered East Asia. The expected heterozygosity values of iSTR ranged from 0.927 in Thais to 0.874 in Oroqens, higher than those of an STR in the fibrinogen alpha chain (FGA) gene on chromosome 4q. Thus, iSTR is a useful marker for anthropological and forensic genetics. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
NASA Technical Reports Server (NTRS)
He, X. M.; Ruker, F.; Casale, E.; Carter, D. C.
1992-01-01
The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 degrees. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
NASA Technical Reports Server (NTRS)
He, Xiao M.; Rueker, Florian; Casale, Elena; Carter, Daniel C.
1992-01-01
The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 deg. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buffalo, Cosmo Z.; Bahn-Suh, Adrian J.; Hirakis, Sophia P.
No vaccine exists against group A Streptococcus (GAS), a leading cause of worldwide morbidity and mortality. A severe hurdle is the hypervariability of its major antigen, the M protein, with >200 different M types known. Neutralizing antibodies typically recognize M protein hypervariable regions (HVRs) and confer narrow protection. In stark contrast, human C4b-binding protein (C4BP), which is recruited to the GAS surface to block phagocytic killing, interacts with a remarkably large number of M protein HVRs (apparently ~90%). Such broad recognition is rare, and we discovered a unique mechanism for this through the structure determination of four sequence-diverse M proteinsmore » in complexes with C4BP. The structures revealed a uniform and tolerant ‘reading head’ in C4BP, which detected conserved sequence patterns hidden within hypervariability. Our results open up possibilities for rational therapies that target the M–C4BP interaction, and also inform a path towards vaccine design.« less
Verma, Kapil; Sharma, Sapna; Sharma, Arun; Dalal, Jyoti; Bhardwaj, Tapeshwar
2018-06-01
Genetic variations among humans occur both within and among populations and range from single nucleotide changes to multiple-nucleotide variants. These multiple-nucleotide variants are useful for studying the relationships among individuals or various population groups. The study of human genetic variations can help scientists understand how different population groups are biologically related to one another. Sequence analysis of hypervariable regions of human mitochondrial DNA (mtDNA) has been successfully used for the genetic characterization of different population groups for forensic purposes. It is well established that different ethnic or population groups differ significantly in their mtDNA distributions. In the last decade, very little research has been conducted on mtDNA variations in the Indian population, although such data would be useful for elucidating the history of human population expansion across the world. Moreover, forensic studies on mtDNA variations in the Indian subcontinent are also scarce, particularly in the northern part of India. In this report, variations in the hypervariable regions of mtDNA were analyzed in the Yadav population of Haryana. Different molecular diversity indices were computed. Further, the obtained haplotypes were classified into different haplogroups and the phylogenetic relationship between different haplogroups was inferred.
Bell, Courtnee R; Wilkinson, Jeremy E; Robertson, Boakai K; Javan, Gulnaz T
2018-05-10
Recent studies have revealed distinct thanatomicrobiome (microbiome of death) signatures in human body sites after death. Thanatomicrobiome studies suggest that microbial succession after death may have the potential to reveal important postmortem biomarkers for the identification of time of death. We surveyed the postmortem microbiomes of cardiac tissues from ten corpses with varying times of death (6-58 h) using amplicon-based sequencing of the 16S rRNA gene' V1-2 and V4 hypervariable regions. The results demonstrated that amplicons had statistically significant (p <0.05) sex-dependent changes. Clostridium sp., Pseudomonas sp., Pantoea sp., and Streptococcus sp. had the highest enrichment for both V1-2 and V4 regions. Interestingly, the results also show that V4 amplicons had higher abundance of Clostridium sp. and Pseudomonas sp. in female hearts compared to males. Additionally, Streptococcus sp. was solely found in male heart samples. The distinction between sexes was further supported by Principle Coordinate Analysis, which revealed microbes in female hearts formed a distinctive cluster separate from male cadavers for both hypervariable regions. This study provides data that demonstrates that two hypervariable regions show discriminatory power for sex differences in postmortem heart samples. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalchman, M.; Lin, B.; Nasir, J.
1994-09-01
The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less
Schuster, Astrid; Lopez, Jose V; Becking, Leontine E; Kelly, Michelle; Pomponi, Shirley A; Wörheide, Gert; Erpenbeck, Dirk; Cárdenas, Paco
2017-03-20
Mitochondrial introns intermit coding regions of genes and feature characteristic secondary structures and splicing mechanisms. In metazoans, mitochondrial introns have only been detected in sponges, cnidarians, placozoans and one annelid species. Within demosponges, group I and group II introns are present in six families. Based on different insertion sites within the cox1 gene and secondary structures, four types of group I and two types of group II introns are known, which can harbor up to three encoding homing endonuclease genes (HEG) of the LAGLIDADG family (group I) and/or reverse transcriptase (group II). However, only little is known about sponge intron mobility, transmission, and origin due to the lack of a comprehensive dataset. We analyzed the largest dataset on sponge mitochondrial group I introns to date: 95 specimens, from 11 different sponge genera which provided novel insights into the evolution of group I introns. For the first time group I introns were detected in four genera of the sponge family Scleritodermidae (Scleritoderma, Microscleroderma, Aciculites, Setidium). We demonstrated that group I introns in sponges aggregate in the most conserved regions of cox1. We showed that co-occurrence of two introns in cox1 is unique among metazoans, but not uncommon in sponges. However, this combination always associates an active intron with a degenerating one. Earlier hypotheses of HGT were confirmed and for the first time VGT and secondary losses of introns conclusively demonstrated. This study validates the subclass Spirophorina (Tetractinellida) as an intron hotspot in sponges. Our analyses confirm that most sponge group I introns probably originated from fungi. DNA barcoding is discussed and the application of alternative primers suggested.
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M
1994-01-01
Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917
Intriguing Balancing Selection on the Intron 5 Region of LMBR1 in Human Population
He, Fang; Wu, Dong-Dong; Kong, Qing-Peng; Zhang, Ya-Ping
2008-01-01
Background The intron 5 of gene LMBR1 is the cis-acting regulatory module for the sonic hedgehog (SHH) gene. Mutation in this non-coding region is associated with preaxial polydactyly, and may play crucial roles in the evolution of limb and skeletal system. Methodology/Principal Findings We sequenced a region of the LMBR1 gene intron 5 in East Asian human population, and found a significant deviation of Tajima's D statistics from neutrality taking human population growth into account. Data from HapMap also demonstrated extended linkage disequilibrium in the region in East Asian and European population, and significantly low degree of genetic differentiation among human populations. Conclusion/Significance We proposed that the intron 5 of LMBR1 was presumably subject to balancing selection during the evolution of modern human. PMID:18698406
Extensive intron gain in the ancestor of placental mammals
2011-01-01
Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The findings of this comprehensive study slightly challenge the current view on the evolutionary stasis in intron dynamics during the last 100 - 200 My. Domesticated genes could constitute an excellent system on which to analyse the mechanisms of intron gain in placental mammals. Reviewers: this article was reviewed by Dan Graur, Eugene V. Koonin and Jürgen Brosius. PMID:22112745
Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T
2011-05-01
Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.
High-Resolution Melting (HRM) of Hypervariable Mitochondrial DNA Regions for Forensic Science.
Dos Santos Rocha, Alípio; de Amorim, Isis Salviano Soares; Simão, Tatiana de Almeida; da Fonseca, Adenilson de Souza; Garrido, Rodrigo Grazinoli; Mencalha, Andre Luiz
2018-03-01
Forensic strategies commonly are proceeding by analysis of short tandem repeats (STRs); however, new additional strategies have been proposed for forensic science. Thus, this article standardized the high-resolution melting (HRM) of DNA for forensic analyzes. For HRM, mitochondrial DNA (mtDNA) from eight individuals were extracted from mucosa swabs by DNAzol reagent, samples were amplified by PCR and submitted to HRM analysis to identify differences in hypervariable (HV) regions I and II. To confirm HRM, all PCR products were DNA sequencing. The data suggest that is possible discriminate DNA from different samples by HRM curves. Also, uncommon dual-dissociation was identified in a single PCR product, increasing HRM analyzes by evaluation of melting peaks. Thus, HRM is accurate and useful to screening small differences in HVI and HVII regions from mtDNA and increase the efficiency of laboratory routines based on forensic genetics. © 2017 American Academy of Forensic Sciences.
Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature
Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela
2003-01-01
The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896
Genome skimming identifies polymorphism in tern populations and species
2012-01-01
Background Terns (Charadriiformes: Sterninae) are a lineage of cosmopolitan shorebirds with a disputed evolutionary history that comprises several species of conservation concern. As a non-model system in genetics, previous study has left most of the nuclear genome unexplored, and population-level studies are limited to only 15% of the world's species of terns and noddies. Screening of polymorphic nuclear sequence markers is needed to enhance genetic resolution because of supposed low mitochondrial mutation rate, documentation of nuclear insertion of hypervariable mitochondrial regions, and limited success of microsatellite enrichment in terns. Here, we investigated the phylogenetic and population genetic utility for terns and relatives of a variety of nuclear markers previously developed for other birds and spanning the nuclear genome. Markers displaying a variety of mutation rates from both the nuclear and mitochondrial genome were tested and prioritized according to optimal cross-species amplification and extent of genetic polymorphism between (1) the main tern clades and (2) individual Royal Terns (Thalasseus maxima) breeding on the US East Coast. Results Results from this genome skimming effort yielded four new nuclear sequence-based markers for tern phylogenetics and 11 intra-specific polymorphic markers. Further, comparison between the two genomes indicated a phylogenetic conflict at the base of terns, involving the inclusion (mitochondrial) or exclusion (nuclear) of the Angel Tern (Gygis alba). Although limited mitochondrial variation was confirmed, both nuclear markers and a short tandem repeat in the mitochondrial control region indicated the presence of considerable genetic variation in Royal Terns at a regional scale. Conclusions These data document the value of intronic markers to the study of terns and allies. We expect that these and additional markers attained through next-generation sequencing methods will accurately map the genetic origin and species history of this group of birds. PMID:22333071
Analyzing Population Genetics Using the Mitochondrial Control Region and Bioinformatics
ERIC Educational Resources Information Center
Sato, Takumi; Phillips, Bonnie; Latourelle, Sandra M.; Elwess, Nancy L.
2010-01-01
The 14-base pair hypervariable region in mitochondrial DNA (mtDNA) of Asian populations, specifically Japanese and Chinese students at Plattsburgh State University, was examined. Previous research on this 14-base pair region showed it to be susceptible to mutations and as a result indicated direct correlation with specific ethnic populations.…
Allix-Béguec, Caroline; Wahl, Céline; Hanekom, Madeleine; Nikolayevskyy, Vladyslav; Drobniewski, Francis; Maeda, Shinji; Campos-Herrero, Isolina; Mokrousov, Igor; Niemann, Stefan; Kontsevaya, Irina; Rastogi, Nalin; Samper, Sofia; Sng, Li-Hwei; Warren, Robin M.
2014-01-01
Mycobacterium tuberculosis Beijing strains represent targets of special importance for molecular surveillance of tuberculosis (TB), especially because they are associated with spread of multidrug resistance in some world regions. Standard 24-locus mycobacterial interspersed repetitive-unit–variable-number tandem-repeat (MIRU-VNTR) typing lacks resolution power for accurately discriminating closely related clones that often compose Beijing strain populations. Therefore, we evaluated a set of 7 additional, hypervariable MIRU-VNTR loci for better resolution and tracing of such strains, using a collection of 535 Beijing isolates from six world regions where these strains are known to be prevalent. The typeability and interlaboratory reproducibility of these hypervariable loci were lower than those of the 24 standard loci. Three loci (2163a, 3155, and 3336) were excluded because of their redundant variability and/or more frequent noninterpretable results compared to the 4 other markers. The use of the remaining 4-locus set (1982, 3232, 3820, and 4120) increased the number of types by 52% (from 223 to 340) and reduced the clustering rate from 58.3 to 36.6%, when combined with the use of the standard 24-locus set. Known major clonal complexes/24-locus-based clusters were all subdivided, although the degree of subdivision varied depending on the complex. Only five single-locus variations were detected among the hypervariable loci of an additional panel of 92 isolates, representing 15 years of clonal spread of a single Beijing strain in a geographically restricted setting. On this calibrated basis, we propose this 4-locus set as a consensus for subtyping Beijing clonal complexes and clusters, after standard typing. PMID:24172154
Allix-Béguec, Caroline; Wahl, Céline; Hanekom, Madeleine; Nikolayevskyy, Vladyslav; Drobniewski, Francis; Maeda, Shinji; Campos-Herrero, Isolina; Mokrousov, Igor; Niemann, Stefan; Kontsevaya, Irina; Rastogi, Nalin; Samper, Sofia; Sng, Li-Hwei; Warren, Robin M; Supply, Philip
2014-01-01
Mycobacterium tuberculosis Beijing strains represent targets of special importance for molecular surveillance of tuberculosis (TB), especially because they are associated with spread of multidrug resistance in some world regions. Standard 24-locus mycobacterial interspersed repetitive-unit-variable-number tandem-repeat (MIRU-VNTR) typing lacks resolution power for accurately discriminating closely related clones that often compose Beijing strain populations. Therefore, we evaluated a set of 7 additional, hypervariable MIRU-VNTR loci for better resolution and tracing of such strains, using a collection of 535 Beijing isolates from six world regions where these strains are known to be prevalent. The typeability and interlaboratory reproducibility of these hypervariable loci were lower than those of the 24 standard loci. Three loci (2163a, 3155, and 3336) were excluded because of their redundant variability and/or more frequent noninterpretable results compared to the 4 other markers. The use of the remaining 4-locus set (1982, 3232, 3820, and 4120) increased the number of types by 52% (from 223 to 340) and reduced the clustering rate from 58.3 to 36.6%, when combined with the use of the standard 24-locus set. Known major clonal complexes/24-locus-based clusters were all subdivided, although the degree of subdivision varied depending on the complex. Only five single-locus variations were detected among the hypervariable loci of an additional panel of 92 isolates, representing 15 years of clonal spread of a single Beijing strain in a geographically restricted setting. On this calibrated basis, we propose this 4-locus set as a consensus for subtyping Beijing clonal complexes and clusters, after standard typing.
Zeng, Xian-Chun; Nie, Yao; Luo, Xuesong; Wu, Shifen; Shi, Wanxia; Zhang, Lei; Liu, Yichen; Cao, Hanjun; Yang, Ye; Zhou, Jianping
2013-03-01
The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss. Copyright © 2012 Elsevier Inc. All rights reserved.
Neuhaus, H; Link, G
1987-01-01
The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
The skin microbiome in psoriatic arthritis: methodology development and pilot data.
Castelino, Madhura; Eyre, Stephen; Moat, John; Fox, Graeme; Martin, Paul; Ijaz, Umer; Quince, Christopher; Ho, Pauline; Upton, Mathew; Barton, Anne
2015-02-26
Skin microbiota are likely to be important in the development of conditions such as psoriatic arthritis. Profiling the bacterial community in the psosriatic plaques will contribute to our understanding of the role of the skin microbiome in these conditions. The aim of this work was to determine the optimum study design for work on the skin microbiome with use of the MiSeq platform. The objectives were to compare data generated from two platforms for two primer pairs in a low density mock bacterial community. DNA was obtained from two low density mock communities of 11 diverse bacterial strains (with and without human DNA supplementation) and from swabs taken from the skin of four healthy volunteers. The DNA was amplified with primer pairs covering hypervariable regions of the 16S rRNA gene: primers 63F and 519R (V1-V3), and 347F and 803R (V3-V4). The resultant libraries were indexed for the MiSeq and Roche454 platforms and sequenced. Both datasets were de-noised, cleaned of chimeras, and analysed by use of QIIME software (version 1.8.0). No significant difference in the diversity indices at the phylum and the genus level between the platforms was seen. Comparison of the diversity indices for the mock community data for the two primer pairs demonstrated that the V3-V4 hypervariable region had significantly better capture of bacterial diversity than did the V1-V3 region. Amplification with the same primer pairs showed strong concordance within each platform (98·9-99·8%), with negligible effect of spiked human DNA contamination. Comparison at the family level classification between samples processed on the MiSeq and Roche454 platforms using the V3-V4 hypervariable region also showed a high level of concordance (87%), although less so for the V1-V3 primers (10%). The pilot data from healthy volunteers were similar. Results obtained from the V3-V4 16S rRNA hypervariable region, sequencing on the MiSeq and Roche454 platforms, were concordant between replicates, and between each other. These findings suggest that the MiSeq platform, and these primers, is a comparable method for determining skin microbiota to the widely used Roche454 methodology. NIHR Manchester Musculoskeletal Biomedical Research Unit. Copyright © 2015 Elsevier Ltd. All rights reserved.
Characterization of the molecular basis of group II intron RNA recognition by CRS1-CRM domains.
Keren, Ido; Klipcan, Liron; Bezawork-Geleta, Ayenachew; Kolton, Max; Shaya, Felix; Ostersetzer-Biran, Oren
2008-08-22
CRM (chloroplast RNA splicing and ribosome maturation) is a recently recognized RNA-binding domain of ancient origin that has been retained in eukaryotic genomes only within the plant lineage. Whereas in bacteria CRM domains exist as single domain proteins involved in ribosome maturation, in plants they are found in a family of proteins that contain between one and four repeats. Several members of this family with multiple CRM domains have been shown to be required for the splicing of specific plastidic group II introns. Detailed biochemical analysis of one of these factors in maize, CRS1, demonstrated its high affinity and specific binding to the single group II intron whose splicing it facilitates, the plastid-encoded atpF intron RNA. Through its association with two intronic regions, CRS1 guides the folding of atpF intron RNA into its predicted "catalytically active" form. To understand how multiple CRM domains cooperate to achieve high affinity sequence-specific binding to RNA, we analyzed the RNA binding affinity and specificity associated with each individual CRM domain in CRS1; whereas CRM3 bound tightly to the RNA, CRM1 associated specifically with a unique region found within atpF intron domain I. CRM2, which demonstrated only low binding affinity, also seems to form specific interactions with regions localized to domains I, III, and IV. We further show that CRM domains share structural similarities and RNA binding characteristics with the well known RNA recognition motif domain.
Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F
2014-10-01
Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.
Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F
2014-01-01
Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785
BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES
Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...
Swain, Timothy D
2018-01-01
The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.
Tooley, Paul W; Bandyopadhyay, Ranajit; Carras, Marie M; Pazoutová, Sylvie
2006-04-01
Isolates of Claviceps causing ergot on sorghum in India were analysed by AFLP analysis, and by analysis of DNA sequences of the EF-1alpha gene intron 4 and beta-tubulin gene intron 3 region. Of 89 isolates assayed from six states in India, four were determined to be C. sorghi, and the rest C. africana. A relatively low level of genetic diversity was observed within the Indian C. africana population. No evidence of genetic exchange between C. africana and C. sorghi was observed in either AFLP or DNA sequence analysis. Phylogenetic analysis was conducted using DNA sequences from 14 different Claviceps species. A multigene phylogeny based on the EF-1alpha gene intron 4, the beta-tubulin gene intron 3 region, and rDNA showed that C. sorghi grouped most closely with C. gigantea and C. africana. Although the Claviceps species we analysed were closely related, they colonize hosts that are taxonomically very distinct suggesting that there is no direct coevolution of Claviceps with its hosts.
Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA
Howe, Kenneth James; Ares, Manuel
1997-01-01
Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473
Insertion of a self-splicing intron into the mtDNA of atriploblastic animal
DOE Office of Scientific and Technical Information (OSTI.GOV)
Valles, Y.; Halanych, K.; Boore, J.L.
2006-04-14
Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less
Ecker, Simone; Chen, Lu; Pancaldi, Vera; Bagger, Frederik O; Fernández, José María; Carrillo de Santa Pau, Enrique; Juan, David; Mann, Alice L; Watt, Stephen; Casale, Francesco Paolo; Sidiropoulos, Nikos; Rapin, Nicolas; Merkel, Angelika; Stunnenberg, Hendrik G; Stegle, Oliver; Frontini, Mattia; Downes, Kate; Pastinen, Tomi; Kuijpers, Taco W; Rico, Daniel; Valencia, Alfonso; Beck, Stephan; Soranzo, Nicole; Paul, Dirk S
2017-01-26
A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14 + CD16 - monocytes, CD66b + CD16 + neutrophils, and CD4 + CD45RA + naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability .
Tian, Ying; Wang, Genjie; Hu, Qingzhu; Xiao, Xichun; Chen, Shuxia
2018-04-01
The AML1/ETO onco-fusion protein is crucial for the genesis of t(8;21) acute myeloid leukemia (AML) and is well documented as a transcriptional repressor through dominant-negative effect. However, little is known about the transactivation mechanism of AML1/ETO. Through large cohort of patient's expression level data analysis and a series of experimental validation, we report here that AML1/ETO transactivates c-KIT expression through directly binding to and mediating the long-range interaction between the promoter and intronic enhancer regions of c-KIT. Gene expression analyses verify that c-KIT expression is significantly high in t(8;21) AML. Further ChIP-seq analysis and motif scanning identify two regulatory regions located in the promoter and intronic enhancer region of c-KIT, respectively. Both regions are enriched by co-factors of AML1/ETO, such as AML1, CEBPe, c-Jun, and c-Fos. Further luciferase reporter assays show that AML1/ETO trans-activates c-KIT promoter activity through directly recognizing the AML1 motif and the co-existence of co-factors. The induction of c-KIT promoter activity is reinforced with the existence of intronic enhancer region. Furthermore, ChIP-3C-qPCR assays verify that AML1/ETO mediates the formation of DNA-looping between the c-KIT promoter and intronic enhancer region through the long-range interaction. Collectively, our data uncover a novel transcriptional activity mechanism of AML1/ETO and enrich our knowledge of the onco-fusion protein mediated transcription regulation. © 2017 Wiley Periodicals, Inc.
Isolation and identification of gene-specific microRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2006-01-01
Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.
Isolation and identification of gene-specific microRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2013-01-01
Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.
Beta-globin LCR and intron elements cooperate and direct spatial reorganization for gene therapy.
Buzina, Alla; Lo, Mandy Y M; Moffett, Angela; Hotta, Akitsu; Fussner, Eden; Bharadwaj, Rikki R; Pasceri, Peter; Garcia-Martinez, J Victor; Bazett-Jones, David P; Ellis, James
2008-04-11
The Locus Control Region (LCR) requires intronic elements within beta-globin transgenes to direct high level expression at all ectopic integration sites. However, these essential intronic elements cannot be transmitted through retrovirus vectors and their deletion may compromise the therapeutic potential for gene therapy. Here, we systematically regenerate functional beta-globin intron 2 elements that rescue LCR activity directed by 5'HS3. Evaluation in transgenic mice demonstrates that an Oct-1 binding site and an enhancer in the intron cooperate to increase expression levels from LCR globin transgenes. Replacement of the intronic AT-rich region with the Igmu 3'MAR rescues LCR activity in single copy transgenic mice. Importantly, a combination of the Oct-1 site, Igmu 3'MAR and intronic enhancer in the BGT158 cassette directs more consistent levels of expression in transgenic mice. By introducing intron-modified transgenes into the same genomic integration site in erythroid cells, we show that BGT158 has the greatest transcriptional induction. 3D DNA FISH establishes that induction stimulates this small 5'HS3 containing transgene and the endogenous locus to spatially reorganize towards more central locations in erythroid nuclei. Electron Spectroscopic Imaging (ESI) of chromatin fibers demonstrates that ultrastructural heterochromatin is primarily perinuclear and does not reorganize. Finally, we transmit intron-modified globin transgenes through insulated self-inactivating (SIN) lentivirus vectors into erythroid cells. We show efficient transfer and robust mRNA and protein expression by the BGT158 vector, and virus titer improvements mediated by the modified intron 2 in the presence of an LCR cassette composed of 5'HS2-4. Our results have important implications for the mechanism of LCR activity at ectopic integration sites. The modified transgenes are the first to transfer intronic elements that potentiate LCR activity and are designed to facilitate correction of hemoglobinopathies using single copy vectors.
Splicing of a group II intron involved in the conjugative transfer of pRS01 in lactococci.
Mills, D A; McKay, L L; Dunny, G M
1996-06-01
Analysis of a region involved in the conjugative transfer of the lactococcal conjugative element pRS01 has revealed a bacteria] group II intron. Splicing of this lactococcal intron (designated Ll.ltrB) in vivo resulted in the ligation of two exon messages (ltrBE1 and ltrBE2) which encoded a putative conjugative relaxase essential for the transfer of pRS01. Like many group II introns, the Ll.ltrB intron possessed an open reading frame (ltrA) with homology to reverse transcriptases. Remarkably, sequence analysis of ltrA suggested a greater similarity to open reading frames encoded by eukaryotic mitochondrial group II introns than to those identified to date from other bacteria. Several insertional mutations within ltrA resulted in plasmids exhibiting a conjugative transfer-deficient phenotype. These results provide the first direct evidence for splicing of a prokaryotic group II intron in vivo and suggest that conjugative transfer is a mechanism for group II intron dissemination in bacteria.
The complete plastid genome sequence of Eustrephus latifolius (Asparagaceae: Lomandroideae).
Kim, Hyoung Tae; Kim, Jung Sung; Kim, Joo-Hwan
2016-01-01
The complete chloroplast (cp) genome sequence of Eustrephus latifolius was firstly determined in subfamily Lomandriodeae of family Asparagaceae. It was 159,736 bp and contained a large single copy region (82,403 bp) and a small single copy region (13,607 bp) which were separated by two inverted repeat regions (31,863 bp). In total, 132 genes were identified and they were consisted of 83 coding genes, 8 rRNA genes, 38 tRNA genes, 3 pseudogenes. rpl23 and clpP were pseudogenes due to sequence deletions. Among 23 genes containing introns, rps12 and ycf3 contained two introns and the rest had just one intron. The intact ycf68 was identified within an intron of trnI-GAU. The amino acid sequence was almost identical with Phoenix dactylifera in Aracales. Ycf1 of E. latifolius was completely located in IR. It was similar to cp genome structure of Lemna minor, Spirodela polyrhiza, Wolffiella lingulata, Wolffia australiana in Alismatales.
Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki
2013-01-01
Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621
Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki
2013-04-01
Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense made a clade with L. scabrum from Japan, Taiwan and, Hong Kong; L. javense which is widely distributed in East and Southeast Asia, and L. pachyphyllum and L. seemannii restricted to the Hawaii Islands, as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands made a monophylic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. A percentage of the parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.
Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F
1985-01-01
Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512
Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco
2014-01-01
Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895
Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco
2014-01-01
Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.
Mechanisms Used for Genomic Proliferation by Thermophilic Group II Introns
Mohr, Georg; Ghanem, Eman; Lambowitz, Alan M.
2010-01-01
Mobile group II introns, which are found in bacterial and organellar genomes, are site-specific retroelments hypothesized to be evolutionary ancestors of spliceosomal introns and retrotransposons in higher organisms. Most bacteria, however, contain no more than one or a few group II introns, making it unclear how introns could have proliferated to higher copy numbers in eukaryotic genomes. An exception is the thermophilic cyanobacterium Thermosynechococcus elongatus, which contains 28 closely related copies of a group II intron, constituting ∼1.3% of the genome. Here, by using a combination of bioinformatics and mobility assays at different temperatures, we identified mechanisms that contribute to the proliferation of T. elongatus group II introns. These mechanisms include divergence of DNA target specificity to avoid target site saturation; adaptation of some intron-encoded reverse transcriptases to splice and mobilize multiple degenerate introns that do not encode reverse transcriptases, leading to a common splicing apparatus; and preferential insertion within other mobile introns or insertion elements, which provide new unoccupied sites in expanding non-essential DNA regions. Additionally, unlike mesophilic group II introns, the thermophilic T. elongatus introns rely on elevated temperatures to help promote DNA strand separation, enabling access to a larger number of DNA target sites by base pairing of the intron RNA, with minimal constraint from the reverse transcriptase. Our results provide insight into group II intron proliferation mechanisms and show that higher temperatures, which are thought to have prevailed on Earth during the emergence of eukaryotes, favor intron proliferation by increasing the accessibility of DNA target sites. We also identify actively mobile thermophilic introns, which may be useful for structural studies, gene targeting in thermophiles, and as a source of thermostable reverse transcriptases. PMID:20543989
Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong
2012-05-01
This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
Haut, Donald D.; Pintel, D. J.
1998-01-01
Alternative splicing of pre-mRNAs plays a critical role in maximizing the coding capacity of the small parvovirus genome. The small-intron region of minute virus of mice (MVM) pre-mRNAs undergoes an unusual pattern of overlapping alternative splicing—using two donors (D1 and D2) and two acceptors (A1 and A2) within a region of 120 nucleotides—that determines the steady-state ratios of the various viral mRNAs. In this report, we show that the determinants that govern excision of the small intron are complex and are also required for efficient definition of the upstream exon. For the MVM small intron in its natural context, the two donors appear to compete for the splicing machinery: the position of D1 favors its usage, while the primary sequence of D2 must be more like the consensus sequence than is D1 to be used efficiently. We have genetically defined the branch points that are used for generation of the major and minor spliced forms and show that recognition of components of the small-intron acceptors is likely to be the dominant determinant in alternative small-intron excision. We have also identified a G-rich intronic enhancer sequence within the small intron that is essential for splicing of the minor form (D2 to A2) but not the major form (D1 to A1) of MVM mRNAs and is required for efficient definition of the upstream NS2-specific exon. In its natural context, the small intron appears to be excised by a mechanism consistent with intron definition. When the MVM small intron is expanded, various parameters of its excision are altered, indicating that critical cis-acting signals are context dependent. Relative use of the donors and acceptors is altered, and the upstream NS2-specific exon is no longer efficiently defined. The fact that definition of the upstream NS2-specific exon can be achieved by the MVM small intron in its natural context, but not when it is expanded, suggests that the multiple determinants that govern definition and excision of the small intron are required, in concert, for upstream exon definition. Our data are consistent with a model in which alternative splicing of the MVM P4-generated pre-mRNAs is governed by a hybrid of intron- and exon-defining mechanisms. PMID:9499034
Isolation and Identification of Gene-Specific MicroRNAs.
Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao
2018-01-01
Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.
Single nucleotide polymorphisms in the Mycobacterium bovis genome resolve phylogenetic relationships
USDA-ARS?s Scientific Manuscript database
Mycobacterium bovis isolates carry restricted allelic variation yet exhibit a range of disease phenotypes and host preferences. Conventional genotyping methods target small hyper-variable regions of their genome and provide anonymous biallelic information insufficient to develop phylogeny. To resolv...
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-05-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome.
Branchpoint selection in the splicing of U12-dependent introns in vitro.
McConnell, Timothy S; Cho, Soo-Jin; Frilander, Mikko J; Steitz, Joan A
2002-01-01
In metazoans, splicing of introns from pre-mRNAs can occur by two pathways: the major U2-dependent or the minor U12-dependent pathways. Whereas the U2-dependent pathway has been well characterized, much about the U12-dependent pathway remains to be discovered. Most of the information regarding U12-type introns has come from in vitro studies of a very few known introns of this class. To expand our understanding of U12-type splicing, especially to test the hypothesis that the simple base-pairing mechanism between the intron and U12 snRNA defines the branchpoint of U12-dependent introns, additional in vitro splicing substrates were created from three putative U12-type introns: the third intron of the Xenopus RPL1 a gene (XRP), the sixth intron of the Xenopus TFIIS.oA gene (XTF), and the first intron of the human Sm E gene (SME). In vitro splicing in HeLa nuclear extract confirmed U12-dependent splicing of each of these introns. Surprisingly, branchpoint mapping of the XRP splicing intermediate shows use of the upstream rather than the downstream of two consecutive adenosines within the branchpoint sequence (BPS), contrary to the prediction based on alignment with the sixth intron of human P120, a U12-dependent intron whose branch site was previously determined. Also, in the SME intron, the position of the branchpoint A residue within the region base paired with U12 differs from that in P120 and XTF. Analysis of these three additional introns therefore rules out simple models for branchpoint selection by the U12-type spliceosome. PMID:12022225
Voorbij, Annemarie M W Y; van Steenbeek, Frank G; Vos-Loohuis, Manon; Martens, Ellen E C P; Hanson-Nilsson, Jeanette M; van Oost, Bernard A; Kooistra, Hans S; Leegwater, Peter A
2011-01-01
Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism.
Voorbij, Annemarie M. W. Y.; van Steenbeek, Frank G.; Vos-Loohuis, Manon; Martens, Ellen E. C. P.; Hanson-Nilsson, Jeanette M.; van Oost, Bernard A.; Kooistra, Hans S.; Leegwater, Peter A.
2011-01-01
Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism. PMID:22132174
Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude
2013-01-01
Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications of this interesting observation for trans-splicing of group I introns. PMID:24386369
Oliveira, Patrícia; Sanges, Remo; Huntsman, David; Stupka, Elia; Oliveira, Carla
2012-01-01
Cadherins are cell–cell adhesion proteins essential for the maintenance of tissue architecture and integrity, and their impairment is often associated with human cancer. Knowledge regarding regulatory mechanisms associated with cadherin misexpression in cancer is scarce. Specific features of the intronic-structure and intronic-based regulatory mechanisms in the cadherin superfamily are unidentified. This study aims at systematically characterizing the intronic portion of cadherin superfamily members and the identification of intronic regions constituting putative targets/triggers of regulation, using a bioinformatic approach and biological data mining. Our study demonstrates that the cadherin superfamily genes harbour specific characteristics in comparison to all non-cadherin genes, both from the genomic and transcriptional standpoints. Cadherin superfamily genes display higher average total intron number and significantly longer introns than other genes and across the entire vertebrate lineage. Moreover, in the human genome, we observed an uncommon high frequency of MIR (mammalian-wide interspersed repeats) and MaLR (mammalian-wide interspersed repeats, a subtype of LTR) regulatory-associated repetitive elements at 5′-located introns, concomitantly with increased de novo intronic transcription. Using this approach, we identified cadherin intronic-specific sites that may constitute novel targets/triggers of cadherin superfamily expression regulation. These findings pinpoint the need to identify mechanisms affecting particularly MIR and MaLR elements located in introns 2 and 3 of human cadherin genes, possibly important in the expression modulation of this superfamily in homeostasis and cancer. PMID:22317972
Cenik, Can; Chua, Hon Nian; Zhang, Hui; Tarnawsky, Stefan P.; Akef, Abdalla; Derti, Adnan; Tasan, Murat; Moore, Melissa J.; Palazzo, Alexander F.; Roth, Frederick P.
2011-01-01
In higher eukaryotes, messenger RNAs (mRNAs) are exported from the nucleus to the cytoplasm via factors deposited near the 5′ end of the transcript during splicing. The signal sequence coding region (SSCR) can support an alternative mRNA export (ALREX) pathway that does not require splicing. However, most SSCR–containing genes also have introns, so the interplay between these export mechanisms remains unclear. Here we support a model in which the furthest upstream element in a given transcript, be it an intron or an ALREX–promoting SSCR, dictates the mRNA export pathway used. We also experimentally demonstrate that nuclear-encoded mitochondrial genes can use the ALREX pathway. Thus, ALREX can also be supported by nucleotide signals within mitochondrial-targeting sequence coding regions (MSCRs). Finally, we identified and experimentally verified novel motifs associated with the ALREX pathway that are shared by both SSCRs and MSCRs. Our results show strong correlation between 5′ untranslated region (5′UTR) intron presence/absence and sequence features at the beginning of the coding region. They also suggest that genes encoding secretory and mitochondrial proteins share a common regulatory mechanism at the level of mRNA export. PMID:21533221
Evolution of ribonuclease in relation to polypeptide folding mechanisms.
NASA Technical Reports Server (NTRS)
Barnard, E. A.; Cohen, M. S.; Gold, M. H.; Kim, J.-K.
1972-01-01
Comparisons of the N-terminal region of pancreatic RNAase in seven species are presented, taking into account cow, bison, deer, rat, pig, kangaroo, and turtle. The available limited evidence on hypervariable regions indicates that there is still an evolutionary constraint on them. It is proposed that there is a selection pressure acting on all regions of a protein sequence in evolution. Mutations that tend to obstruct the folding process can lead to various intensities of selection pressure.
Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.
Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia
2009-06-16
A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode Initiative. The large scale investigation of Ascomycota mitochondrial introns performed through this tool, allowing to exclude the introns-rich sequences from the barcode candidates exploration, could be the first step towards a mitochondrial barcoding strategy for these organisms, similar to the standard approach employed in metazoans.
Acosta, Jose Luis; Hernández-Mondragón, Alma Cristal; Correa-Acosta, Laura Carolina; Cazañas-Padilla, Sandra Nathaly; Chávez-Florencio, Berenice; Ramírez-Vega, Elvia Yamilet; Monge-Cázares, Tulia; Aguilar-Salinas, Carlos A; Tusié-Luna, Teresa; Del Bosque-Plata, Laura
2016-05-26
Genetic variations of the TCF7L2 gene are associated with the development of Type 2 diabetes (T2D). The associated mutations have demonstrated an adaptive role in some human populations, but no studies have determined the impact of evolutionary forces on genetic diversity in indigenous populations from Mexico. Here, we sequenced and analyzed the variation of the TCF7L2 gene in three Amerindian populations and compared the results with whole-exon-sequencing of Mestizo populations from Sigma and the 1000 Genomes Project to assess the roles of selection and recombination in diversity. The diversity in the indigenous populations was biased to intronic regions. Most of the variation was low frequency. Only mutations rs77961654 and rs61724286 were located on exon 15. We did not observe variation in intronic region 4-6 in any of the three indigenous populations. In addition, we identified peaks of selective sweeps in the mestizo samples from the Sigma Project within this region. By replicating the analysis of association with T2D between case-controls from the Sigma Project, we determined that T2D was most highly associated with the rs7903146 risk allele and to a lesser extent with the other six variants. All associated markers were located in intronic region 4-6, and their r(2) values of linkage disequilibrium were significantly higher in the Mexican population than in Africans from the 1000 Genomes Project. We observed reticulations in both the haplotypes network analysis from seven marker associates and the neighborNet tree based on 6061 markers in the TCF7L2 gene identified from all samples of the 1000 Genomes Project. Finally, we identified two recombination hotspots in the upstream region and 3' end of the TCF7L2 gene. The lack of diversity in intronic region 4-6 in Indigenous populations could be an effect of selective sweeps generated by the selection of neighboring rare variants at T2D-associated mutations. The survivors' variants make the intronic region 4-6 the area of the greatest population differentiation within the TCF7L2 gene. The abundance of selective peak sweeps in the downstream region of the TCF7L2 gene suggests that the TCF7L2 gene is part of a region that is in constant recombination between populations.
X-linked hypophosphatemia attributable to pseudoexons of the PHEX gene.
Christie, P T; Harding, B; Nesbit, M A; Whyte, M P; Thakker, R V
2001-08-01
X-linked hypophosphatemia is commonly caused by mutations of the coding region of PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). However, such PHEX mutations are not detected in approximately one third of X-linked hypophosphatemia patients who may harbor defects in the noncoding or intronic regions. We have therefore investigated 11 unrelated X-linked hypophosphatemia patients in whom coding region mutations had been excluded, for intronic mutations that may lead to mRNA splicing abnormalities, by the use of lymphoblastoid RNA and RT-PCRs. One X-linked hypophosphatemia patient was found to have 3 abnormally large transcripts, resulting from 51-bp, 100-bp, and 170-bp insertions, all of which would lead to missense peptides and premature termination codons. The origin of these transcripts was a mutation (g to t) at position +1268 of intron 7, which resulted in the occurrence of a high quality novel donor splice site (ggaagg to gtaagg). Splicing between this novel donor splice site and 3 preexisting, but normally silent, acceptor splice sites within intron 7 resulted in the occurrences of the 3 pseudoexons. This represents the first report of PHEX pseudoexons and reveals further the diversity of genetic abnormalities causing X-linked hypophosphatemia.
Two distinct promoters drive transcription of the human D1A dopamine receptor gene.
Lee, S H; Minowa, M T; Mouradian, M M
1996-10-11
The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.
Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.
2017-01-01
We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent
2012-01-01
Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126
Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments
NASA Astrophysics Data System (ADS)
Regoli, Massimo
2010-01-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences
NASA Astrophysics Data System (ADS)
Regoli, Massimo
2009-02-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Automated antibody structure prediction using Accelrys tools: Results and best practices
Fasnacht, Marc; Butenhof, Ken; Goupil-Lamy, Anne; Hernandez-Guzman, Francisco; Huang, Hongwei; Yan, Lisa
2014-01-01
We describe the methodology and results from our participation in the second Antibody Modeling Assessment experiment. During the experiment we predicted the structure of eleven unpublished antibody Fv fragments. Our prediction methods centered on template-based modeling; potential templates were selected from an antibody database based on their sequence similarity to the target in the framework regions. Depending on the quality of the templates, we constructed models of the antibody framework regions either using a single, chimeric or multiple template approach. The hypervariable loop regions in the initial models were rebuilt by grafting the corresponding regions from suitable templates onto the model. For the H3 loop region, we further refined models using ab initio methods. The final models were subjected to constrained energy minimization to resolve severe local structural problems. The analysis of the models submitted show that Accelrys tools allow for the construction of quite accurate models for the framework and the canonical CDR regions, with RMSDs to the X-ray structure on average below 1 Å for most of these regions. The results show that accurate prediction of the H3 hypervariable loops remains a challenge. Furthermore, model quality assessment of the submitted models show that the models are of quite high quality, with local geometry assessment scores similar to that of the target X-ray structures. Proteins 2014; 82:1583–1598. © 2014 The Authors. Proteins published by Wiley Periodicals, Inc. PMID:24833271
Hua, Yimin; Vickers, Timothy A.; Okunola, Hazeem L.; Bennett, C. Frank; Krainer, Adrian R.
2008-01-01
survival of motor neuron 2, centromeric (SMN2) is a gene that modifies the severity of spinal muscular atrophy (SMA), a motor-neuron disease that is the leading genetic cause of infant mortality. Increasing inclusion of SMN2 exon 7, which is predominantly skipped, holds promise to treat or possibly cure SMA; one practical strategy is the disruption of splicing silencers that impair exon 7 recognition. By using an antisense oligonucleotide (ASO)-tiling method, we systematically screened the proximal intronic regions flanking exon 7 and identified two intronic splicing silencers (ISSs): one in intron 6 and a recently described one in intron 7. We analyzed the intron 7 ISS by mutagenesis, coupled with splicing assays, RNA-affinity chromatography, and protein overexpression, and found two tandem hnRNP A1/A2 motifs within the ISS that are responsible for its inhibitory character. Mutations in these two motifs, or ASOs that block them, promote very efficient exon 7 inclusion. We screened 31 ASOs in this region and selected two optimal ones to test in human SMN2 transgenic mice. Both ASOs strongly increased hSMN2 exon 7 inclusion in the liver and kidney of the transgenic animals. Our results show that the high-resolution ASO-tiling approach can identify cis-elements that modulate splicing positively or negatively. Most importantly, our results highlight the therapeutic potential of some of these ASOs in the context of SMA. PMID:18371932
Genomic structure of two ras family genes in the slime mold Physarum polycephalum.
Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan
2002-08-01
Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.
Forensic Analysis of Canine DNA Samples in the Undergraduate Biochemistry Laboratory
ERIC Educational Resources Information Center
Carson, Tobin M.; Bradley, Sharonda Q.; Fekete, Brenda L.; Millard, Julie T.; LaRiviere, Frederick J.
2009-01-01
Recent advances in canine genomics have allowed the development of highly distinguishing methods of analysis for both nuclear and mitochondrial DNA. We describe a laboratory exercise suitable for an undergraduate biochemistry course in which the polymerase chain reaction is used to amplify hypervariable regions of DNA from dog hair and saliva…
Parallel Loss of Plastid Introns and Their Maturase in the Genus Cuscuta
McNeal, Joel R.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Leebens-Mack, Jim; dePamphilis, Claude W.
2009-01-01
Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta. PMID:19543388
Parallel loss of plastid introns and their maturase in the genus Cuscuta.
McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W
2009-06-19
Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fennelly, J.; Laval, S.; Wright, E.
1996-04-01
We have identified a genomic locus (DXYH1) that is polymorphic and hypervariable within the CBA/H colony. Using a panel of C57BL/6 x Mus spretus backcross offspring, it was mapped to the distal end of the X chromosome. Pseudoautosomal inheritance was demonstrated through three generations of CBA/H x CBA/H and CBA/H x C57BL/6 crosses and confirmed through linkage to the Sxr locus in X/Y Sxr x 3H1 crosses. Meiotic recombination frequencies place DXYH1 {approximately}28% into the pseudoautosomal region from the boundary. The de novo generation of CBA/H variant DXYH1 restriction fragment length polymorphisms during spermatogenesis is suggestive of the germline instabilitymore » associated with hypermutable human minisatellites. The absence of DXY1-related sequences in Mus spretus provides DNA sequence evidence to support the observed failure of X-Y pairing during meiosis and consequent hybrid infertility in C57BL/6 x Mus spretus male F1 offspring. 19 refs., 4 figs.« less
VNTR alleles associated with the {alpha}-globin locus are haplotype and population related
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martinson, J.J.; Clegg, J.B.; Boyce, A.J.
1994-09-01
The human {alpha}-globin complex contains several polymorphic restriction-enzyme sites (i.e., RFLPs) linked to form haplotypes and is flanked by two hypervariable VNTR loci, the 5{prime} hypervariable region (HVR) and the more highly polymorphic 3{prime}HVR. Using a combination of RFLP analysis and PCR, the authors have characterized the 5{prime}HVR and 3{prime}HVR alleles associated with the {alpha}-globin haplotypes of 133 chromosomes, and they here show that specific {alpha}-globin haplotypes are each associated with discrete subsets of the alleles observed at these two VNTR loci. This statistically highly significant association is observed over a region spanning {approximately} 100 kb. With the exception ofmore » closely related haplotypes, different haplotypes do not share identically sized 3{prime}HVR alleles. Earlier studies have shown that {alpha}-globin haplotype distributions differ between populations; the current findings also reveal extensive population substructure in the repertoire of {alpha}-globin VNTRs. If similar features are characteristic of other VNTR loci, this will have important implications for forensic and anthropological studies. 42 refs., 5 figs., 5 tabs.« less
Carter, James R; Keith, James H; Fraser, Tresa S; Dawson, James L; Kucharski, Cheryl A; Horne, Kate M; Higgs, Stephen; Fraser, Malcolm J
2014-06-13
Approximately 100 million confirmed infections and 20,000 deaths are caused by Dengue virus (DENV) outbreaks annually. Global warming and rapid dispersal have resulted in DENV epidemics in formally non-endemic regions. Currently no consistently effective preventive measures for DENV exist, prompting development of transgenic and paratransgenic vector control approaches. Production of transgenic mosquitoes refractory for virus infection and/or transmission is contingent upon defining antiviral genes that have low probability for allowing escape mutations, and are equally effective against multiple serotypes. Previously we demonstrated the effectiveness of an anti-viral group I intron targeting U143 of the DENV genome in mediating trans-splicing and expression of a marker gene with the capsid coding domain. In this report we examine the effectiveness of coupling expression of ΔN Bax to trans-splicing U143 intron activity as a means of suppressing DENV infection of mosquito cells. Targeting the conserved DENV circularization sequence (CS) by U143 intron trans-splicing activity appends a 3' exon RNA encoding ΔN Bax to the capsid coding region of the genomic RNA, resulting in a chimeric protein that induces premature cell death upon infection. TCID50-IFA analyses demonstrate an enhancement of DENV suppression for all DENV serotypes tested over the identical group I intron coupled with the non-apoptotic inducing firefly luciferase as the 3' exon. These cumulative results confirm the increased effectiveness of this αDENV-U143-ΔN Bax group I intron as a sequence specific antiviral that should be useful for suppression of DENV in transgenic mosquitoes. Annexin V staining, caspase 3 assays, and DNA ladder observations confirm DCA-ΔN Bax fusion protein expression induces apoptotic cell death. This report confirms the relative effectiveness of an anti-DENV group I intron coupled to an apoptosis-inducing ΔN Bax 3' exon that trans-splices conserved sequences of the 5' CS region of all DENV serotypes and induces apoptotic cell death upon infection. Our results confirm coupling the targeted ribozyme capabilities of the group I intron with the generation of an apoptosis-inducing transcript increases the effectiveness of infection suppression, improving the prospects of this unique approach as a means of inducing transgenic refractoriness in mosquitoes for all serotypes of this important disease.
Mitochondrial Group II Introns, Cytochrome c Oxidase, and Senescence in Podospora anserina†
Begel, Odile; Boulay, Jocelyne; Albert, Beatrice; Dufour, Eric; Sainsard-Chanet, Annie
1999-01-01
Podospora anserina is a filamentous fungus with a limited life span. It expresses a degenerative syndrome called senescence, which is always associated with the accumulation of circular molecules (senDNAs) containing specific regions of the mitochondrial chromosome. A mobile group II intron (α) has been thought to play a prominent role in this syndrome. Intron α is the first intron of the cytochrome c oxidase subunit I gene (COX1). Mitochondrial mutants that escape the senescence process are missing this intron, as well as the first exon of the COX1 gene. We describe here the first mutant of P. anserina that has the α sequence precisely deleted and whose cytochrome c oxidase activity is identical to that of wild-type cells. The integration site of the intron is slightly modified, and this change prevents efficient homing of intron α. We show here that this mutant displays a senescence syndrome similar to that of the wild type and that its life span is increased about twofold. The introduction of a related group II intron into the mitochondrial genome of the mutant does not restore the wild-type life span. These data clearly demonstrate that intron α is not the specific senescence factor but rather an accelerator or amplifier of the senescence process. They emphasize the role that intron α plays in the instability of the mitochondrial chromosome and the link between this instability and longevity. Our results strongly support the idea that in Podospora, “immortality” can be acquired not by the absence of intron α but rather by the lack of active cytochrome c oxidase. PMID:10330149
2010-01-01
Comparison of 18S rDNA gene sequences is a very promising method for identification and classification of living organisms. Molecular identification and discrimination of different Dunaliella species were carried out based on the size of 18S rDNA gene and, number and position of introns in the gene. Three types of 18S rDNA structure have already been reported: the gene with a size of ~1770 bp lacking any intron, with a size of ~2170 bp consisting one intron near 5' terminus, and with a size of ~2570 bp harbouring two introns near 5' and 3' termini. Hereby, we report a new 18S rDNA gene arrangement in terms of intron localization and nucleotide sequence in a Dunaliella isolated from Iranian salt lakes (ABRIINW-M1/2). PCR amplification with genus-specific primers resulted in production of a ~2170 bp DNA band, which is similar to that of D. salina 18S rDNA gene containing only one intron near 5' terminus. Whilst, sequence composition of the gene revealed the lack of any intron near 5' terminus in our isolate. Furthermore, another alteration was observed due to the presence of a 440 bp DNA fragment near 3' terminus. Accordingly, 18S rDNA gene of the isolate is clearly different from those of D. salina and any other Dunaliella species reported so far. Moreover, analysis of ITS region sequence showed the diversity of this region compared to the previously reported species. 18S rDNA and ITS sequences of our isolate were submitted with accesion numbers of EU678868 and EU927373 in NCBI database, respectively. The optimum growth rate of this isolate occured at the salinity level of 1 M NaCl. The maximum carotenoid content under stress condition of intense light (400 μmol photon m-2 s-1), high salinity (4 M NaCl) and deficiency of nitrate and phosphate nutritions reached to 240 ng/cell after 15 days. PMID:20377865
Jayashree, B; Jagadeesh, V T; Hoisington, D
2008-05-01
The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.
Mollusk genes encoding lysine tRNA (UUU) contain introns.
Matsuo, M; Abe, Y; Saruta, Y; Okada, N
1995-11-20
New intron-containing genes encoding tRNAs were discovered when genomic DNA isolated from various animal species was amplified by the polymerase chain reaction (PCR) with primers based on sequences of rabbit tRNA(Lys). From sequencing analysis of the products of PCR, we found that introns are present in several genes encoding tRNA(Lys) in mollusks, such as Loligo bleekeri (squid) and Octopus vulgaris (octopus). These introns were specific to genes encoding tRNA(Lys)(CUU) and were not present in genes encoding tRNA(Lys)(CUU). In addition, the sequences of the introns were different from one another. To confirm the results of our initial experiments, we isolated and sequenced genes encoding tRNA(Lys)(CUU) and tRNA(Lys)(UUU). The gene for tRNA(Lys)(UUU) from squid contained an intron, whose sequence was the same as that identified by PCR, and the gene formed a cluster with a corresponding pseudogene. Several DNA regions of 2.1 kb containing this cluster appeared to be tandemly arrayed in the squid genome. By contrast, the gene encoding tRNA(Lys)(CUU) did not contain an intron, as shown also by PCR. The tRNA(Lys)(UUU) that corresponded to the analyzed gene was isolated and characterized. The present study provides the first example of an intron-containing gene encoding a tRNA in mollusks and suggests the universality of introns in such genes in higher eukaryotes.
McAllister, Jane; Casino, Carmela; Davidson, Fiona; Power, Joan; Lawlor, Emer; Yap, Peng Lee; Simmonds, Peter; Smith, Donald B.
1998-01-01
The long-term evolution of the hepatitis C virus hypervariable region (HVR) and flanking regions of the E1 and E2 envelope proteins have been studied in a cohort of women infected from a common source of anti-D immunoglobulin. Whereas virus sequences in the infectious source were relatively homogeneous, distinct HVR variants were observed in each anti-D recipient, indicating that this region can evolve in multiple directions from the same point. Where HVR variants with dissimilar sequences were present in a single individual, the frequency of synonymous substitution in the flanking regions suggested that the lineages diverged more than a decade previously. Even where a single major HVR variant was present in an infected individual, this lineage was usually several years old. Multiple lineages can therefore coexist during long periods of chronic infection without replacement. The characteristics of amino acid substitution in the HVR were not consistent with the random accumulation of mutations and imply that amino acid replacement in the HVR was strongly constrained. Another variable region of E2 centered on codon 60 shows similar constraints, while HVR2 was relatively unconstrained. Several of these features are difficult to explain if a neutralizing immune response against the HVR is the only selective force operating on E2. The impact of PCR artifacts such as nucleotide misincorporation and the shuffling of dissimilar templates is discussed. PMID:9573256
Rath, Matthias; Jenssen, Sönke E; Schwefel, Konrad; Spiegler, Stefanie; Kleimeier, Dana; Sperling, Christian; Kaderali, Lars; Felbor, Ute
2017-09-01
Cerebral cavernous malformations (CCM) are vascular lesions of the central nervous system that can cause headaches, seizures and hemorrhagic stroke. Disease-associated mutations have been identified in three genes: CCM1/KRIT1, CCM2 and CCM3/PDCD10. The precise proportion of deep-intronic variants in these genes and their clinical relevance is yet unknown. Here, a long-range PCR (LR-PCR) approach for target enrichment of the entire genomic regions of the three genes was combined with next generation sequencing (NGS) to screen for coding and non-coding variants. NGS detected all six CCM1/KRIT1, two CCM2 and four CCM3/PDCD10 mutations that had previously been identified by Sanger sequencing. Two of the pathogenic variants presented here are novel. Additionally, 20 stringently selected CCM index cases that had remained mutation-negative after conventional sequencing and exclusion of copy number variations were screened for deep-intronic mutations. The combination of bioinformatics filtering and transcript analyses did not reveal any deep-intronic splice mutations in these cases. Our results demonstrate that target enrichment by LR-PCR combined with NGS can be used for a comprehensive analysis of the entire genomic regions of the CCM genes in a research context. However, its clinical utility is limited as deep-intronic splice mutations in CCM1/KRIT1, CCM2 and CCM3/PDCD10 seem to be rather rare. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Barnea-Yizhar, Ofer; Ram, Sigal; Kovalev, Ekaterina; Azriel, Aviva; Rand, Ulfert; Nakayama, Manabu; Hauser, Hansjörg; Gepstein, Lior; Levi, Ben-Zion
2016-01-01
Interferon Regulatory Factor-8 (IRF-8) serves as a key factor in the hierarchical differentiation towards monocyte/dendritic cell lineages. While much insight has been accumulated into the mechanisms essential for its hematopoietic specific expression, the mode of restricting IRF-8 expression in non-hematopoietic cells is still unknown. Here we show that the repression of IRF-8 expression in restrictive cells is mediated by its 3rd intron. Removal of this intron alleviates the repression of Bacterial Artificial Chromosome (BAC) IRF-8 reporter gene in these cells. Fine deletion analysis points to conserved regions within this intron mediating its restricted expression. Further, the intron alone selectively initiates gene silencing only in expression-restrictive cells. Characterization of this intron’s properties points to its role as an initiator of sustainable gene silencing inducing chromatin condensation with suppressive histone modifications. This intronic element cannot silence episomal transgene expression underlining a strict chromatin-dependent silencing mechanism. We validated this chromatin-state specificity of IRF-8 intron upon in-vitro differentiation of induced pluripotent stem cells (iPSCs) into cardiomyocytes. Taken together, the IRF-8 3rd intron is sufficient and necessary to initiate gene silencing in non-hematopoietic cells, highlighting its role as a nucleation core for repressed chromatin during differentiation. PMID:27257682
2011-01-01
Background One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this family has also been associated with human disease such as multiple sclerosis (MS) and schizophrenia. It is not known whether this broad expression of HERV-W elements represents transcriptional leakage or specific transcription initiated from the retroviral promoter in the long terminal repeat (LTR) region. Therefore, potential influences of genomic context, structure and orientation on the expression levels of individual HERV-W elements in normal human tissues were systematically investigated. Results Whereas intronic HERV-W elements with a pseudogene structure exhibited a strong anti-sense orientation bias, intronic elements with a proviral structure and solo LTRs did not. Although a highly variable expression across tissues and elements was observed, systematic effects of context, structure and orientation were also observed. Elements located in intronic regions appeared to be expressed at higher levels than elements located in intergenic regions. Intronic elements with proviral structures were expressed at higher levels than those elements bearing hallmarks of processed pseudogenes or solo LTRs. Relative to their corresponding genes, intronic elements integrated on the sense strand appeared to be transcribed at higher levels than those integrated on the anti-sense strand. Moreover, the expression of proviral elements appeared to be independent from that of their corresponding genes. Conclusions Intronic HERV-W provirus integrations on the sense strand appear to have elicited a weaker negative selection than pseudogene integrations of transcripts from such elements. Our current findings suggest that the previously observed diversified and tissue-specific expression of elements in the HERV-W family is the result of both directed transcription (involving both the LTR and internal sequence) and leaky transcription of HERV-W elements in normal human tissues. PMID:21226900
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M
2007-03-01
The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.
USDA-ARS?s Scientific Manuscript database
Helminths, including GI nematodes, colonize > 1/3 of the world’s population and have evolved with humans and their microbiome. Parasites inherently regulate the host immune response to ensure their survival through mechanisms that dampen host inflammation. These unique properties of nematodes have b...
Skin Microbiome Surveys Are Strongly Influenced by Experimental Design.
Meisel, Jacquelyn S; Hannigan, Geoffrey D; Tyldsley, Amanda S; SanMiguel, Adam J; Hodkinson, Brendan P; Zheng, Qi; Grice, Elizabeth A
2016-05-01
Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provides more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e., gastrointestinal) and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource and cost intensive, provides evidence of a community's functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This study highlights the importance of experimental design for downstream results in skin microbiome surveys. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Skin microbiome surveys are strongly influenced by experimental design
Meisel, Jacquelyn S.; Hannigan, Geoffrey D.; Tyldsley, Amanda S.; SanMiguel, Adam J.; Hodkinson, Brendan P.; Zheng, Qi; Grice, Elizabeth A.
2016-01-01
Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provide more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e. gastrointestinal), and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource- and cost-intensive, provides evidence of a community’s functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This work highlights the importance of experimental design for downstream results in skin microbiome surveys. PMID:26829039
Low-copy nuclear primers and ycf1 primers in Cactaceae.
Franck, Alan R; Cochrane, Bruce J; Garey, James R
2012-10-01
To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). • Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). • The isi, nhx, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.
Bio—Cryptography: A Possible Coding Role for RNA Redundancy
NASA Astrophysics Data System (ADS)
Regoli, M.
2009-03-01
The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-06-04
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
Bottomless barrel-sponge species in the Indo-Pacific?
Setiawan, Edwin; Voogd, Nicole J De; Wörheide, Gert; Erpenbeck, Dirk
2016-07-06
The use of nuclear markers, in addition to traditional mitochondrial markers, helps to clarify hidden patterns of genetic structure in natural populations (Palumbi & Baker, 1994). This is particularly evident among demosponges that possess slow mitochondrial evolutionary rates compared to Bilateria, where nuclear intron markers can aid in the understanding of shallow level phylogenetic relationships (Shearer et al., 2002). Ideally, these nuclear markers (i) are evolutionary well-conserved across different lineages, (ii) produce amplicons holding a number of sites with sufficient variability to answer the relevant phylogenetic question, (iii) derive from single copy genes (see review in Zhang & Hewitt, 2003). A popular method to amplify intron markers uses EPIC (Exon-Primed, Intron-Crossing) primers that anneal to the more conserved flanking exon regions and subsequently bridge the intron during amplification (Palumbi & Baker, 1994).
Paulsrud, Per; Lindblad, Peter
1998-01-01
We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083
Deep intronic GPR143 mutation in a Japanese family with ocular albinism
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-01-01
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease. PMID:26061757
Deep intronic GPR143 mutation in a Japanese family with ocular albinism.
Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei
2015-06-10
Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.
Nadjar-Boger, Elisabeth; Funkenstein, Bruria
2011-02-01
Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we described the cloning and sequence analysis of MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare. Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.
Keene, Keith L.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Leak, Tennille S.; Perlegas, Peter S.; Langefeld, Carl D.; Herrington, David M.; Freedman, Barry I.; Rich, Stephen S.; Bowden, Donald W.; Sale, Michèle M.
2009-01-01
We previously investigated the estrogen receptor α gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and type 2 diabetes and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and End Stage Renal Disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a χ2 statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P< 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P=0.000291, adjusted P=0.0289), and intron 4 SNPs rs1569788 (nominal P=0.000754, adjusted P=0.0278) and rs9340969 (nominal P=0.00109, adjusted P=0.0467) remained significant at experimentwise error rate (EER) P<0.05 for the dominant class of tests. Twenty-three of the thirty-one associated SNPs cluster within the intron 4-intron 6 region. Gender stratification revealed nominal evidence for association with 35 SNPs in females (352 cases; 306 controls) and seven SNPs in males (225 cases; 290 controls). We have identified a novel region of the ESR1 gene that may contain important functional polymorphisms in relation to susceptibility to T2DM and/or diabetic nephropathy. PMID:18305958
Keene, Keith L; Mychaleckyj, Josyf C; Smith, Shelly G; Leak, Tennille S; Perlegas, Peter S; Langefeld, Carl D; Herrington, David M; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M
2008-05-01
We previously investigated the estrogen receptor alpha gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and T2DM and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and end stage renal disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a chi(2) statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P < 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P = 0.000291, adjusted P = 0.0289), and intron 4 SNPs rs1569788 (nominal P = 0.000754, adjusted P = 0.0278) and rs9340969 (nominal P = 0.00109, adjusted P = 0.0467) remained significant at experimentwise error rate (EER) P = 0.05 for the dominant class of tests. Twenty-three of the thirty-one associated SNPs cluster within the intron 4-intron 6 regions. Gender stratification revealed nominal evidence for association with 35 SNPs in females (352 cases; 306 controls) and seven SNPs in males (225 cases; 290 controls). We have identified a novel region of the ESR1 gene that may contain important functional polymorphisms in relation to susceptibility to T2DM and/or diabetic nephropathy.
Etang, Josiane; Vicente, Jose L; Nwane, Philippe; Chouaibou, Mouhamadou; Morlais, Isabelle; Do Rosario, Virgilio E; Simard, Frederic; Awono-Ambene, Parfait; Toto, Jean Claude; Pinto, Joao
2009-07-01
Sequence variation at the intron-1 of the voltage-gated sodium channel gene in Anopheles gambiae M- and S-forms from Cameroon was assessed to explore the number of mutational events originating knockdown resistance (kdr) alleles. Mosquitoes were sampled between December 2005 and June 2006 from three geographical areas: (i) Magba in the western region; (ii) Loum, Tiko, Douala, Kribi, and Campo along the Atlantic coast; and (iii) Bertoua, in the eastern continental plateau. Both 1014S and 1014F kdr alleles were found in the S-form with overall frequencies of 14% and 42% respectively. Only the 1014F allele was found in the M-form at lower frequency (11%). Analysis of a 455 bp region of intron-1 upstream the kdr locus revealed four independent mutation events originating kdr alleles, here named MS1 -1014F, S1-1014S and S2-1014S kdr-intron-1 haplotypes in S-form and MS3-1014F kdr-intron-1 haplotype in the M-form. Furthermore, there was evidence for mutual introgression of kdr 1014F allele between the two molecular forms, MS1 and MS3 being widely shared by them. Although no M/S hybrid was observed in analysed samples, this wide distribution of haplotypes MS1 and MS3 suggests inter-form hybridizing at significant level and emphasizes the rapid diffusion of the kdr alleles in Africa. The mosaic of genetic events found in Cameroon is representative of the situation in the West-Central African region and highlights the importance of evaluating the spatial and temporal evolution of kdr alleles for a better management of insecticide resistance.
Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J
2006-11-01
Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumar, Abhishek, E-mail: akumar@bot.uni-kiel.de; Bhandari, Anita; Sarde, Sandeep J.
Highlights: • C1 inhibitors of fishes have two Ig domains fused in the N-terminal end. • Spliceosomal introns gain in two Ig domains of selected ray-finned fishes. • C1 inhibitors gene is maintained from 450 MY on the same locus. • C1 inhibitors gene is missing in frog and lampreys. • C1 inhibitors of tetrapod and fishes differ in the RCL region. - Abstract: C1 inhibitor (C1IN) is a multi-facet serine protease inhibitor in the plasma cascades, inhibiting several proteases, notably, regulates both complement and contact system activation. Despite huge advancements in the understanding of C1IN based on biochemical propertiesmore » and its roles in the plasma cascades, the phylogenetic history of C1IN remains uncharacterized. To date, there is no comprehensive study illustrating the phylogenetic history of C1IN. Herein, we explored phylogenetic history of C1IN gene in vertebrates. Fishes have C1IN with two immunoglobulin like domains attached in the N-terminal region. The RCL regions of CIIN from fishes and tetrapod genomes have variations at the positions P2 and P1′. Gene structures of C1IN gene from selected ray-finned fishes varied in the Ig domain region with creation of novel intron splitting exon Im2 into Im2a and Im2b. This intron is limited to ray-finned fishes with genome size reduced below 1 Gb. Hence, we suggest that genome compaction and associated double-strand break repairs are behind this intron gain. This study reveals the evolutionary history of C1IN and confirmed that this gene remains the same locus for ∼450 MY in 52 vertebrates analysed, but it is not found in frogs and lampreys.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Proia, R.L.
1988-03-01
Lysosomal {beta}-hexosaminidase is composed of two structurally similar chains, {alpha} and {beta}, that are the products of different genes. Mutations in either gene causing {beta}-hexosaminidase deficiency result in the lysosomal storage disease GM2-gangliosidosis. To enable the investigation of the molecular lesions in this disorder and to study the evolutionary relationship between the {alpha} and {beta} chains, the {beta}-chain gene was isolated, and its organization was characterized. The {beta}-chain coding region is divided into 14 exons distributed over {approx}40 kilobases of DNA. Comparison with the {alpha}-chain gene revealed that 12 of the 13 introns interrupt the coding regions at homologous positions.more » This extensive sharing of intron placement demonstrates that the {alpha} and {beta} chains evolved by way of the duplication of a common ancestor.« less
Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection.
Meyers, B C; Shen, K A; Rohani, P; Gaut, B S; Michelmore, R W
1998-01-01
Disease resistance genes in plants are often found in complex multigene families. The largest known cluster of disease resistance specificities in lettuce contains the RGC2 family of genes. We compared the sequences of nine full-length genomic copies of RGC2 representing the diversity in the cluster to determine the structure of genes within this family and to examine the evolution of its members. The transcribed regions range from at least 7.0 to 13.1 kb, and the cDNAs contain deduced open reading frames of approximately 5. 5 kb. The predicted RGC2 proteins contain a nucleotide binding site and irregular leucine-rich repeats (LRRs) that are characteristic of resistance genes cloned from other species. Unique features of the RGC2 gene products include a bipartite LRR region with >40 repeats. At least eight members of this family are transcribed. The level of sequence diversity between family members varied in different regions of the gene. The ratio of nonsynonymous (Ka) to synonymous (Ks) nucleotide substitutions was lowest in the region encoding the nucleotide binding site, which is the presumed effector domain of the protein. The LRR-encoding region showed an alternating pattern of conservation and hypervariability. This alternating pattern of variation was also found in all comparisons within families of resistance genes cloned from other species. The Ka /Ks ratios indicate that diversifying selection has resulted in increased variation at these codons. The patterns of variation support the predicted structure of LRR regions with solvent-exposed hypervariable residues that are potentially involved in binding pathogen-derived ligands. PMID:9811792
Historically low mitochondrial DNA diversity in koalas (Phascolarctos cinereus)
2012-01-01
Background The koala (Phascolarctos cinereus) is an arboreal marsupial that was historically widespread across eastern Australia until the end of the 19th century when it suffered a steep population decline. Hunting for the fur trade, habitat conversion, and disease contributed to a precipitous reduction in koala population size during the late 1800s and early 1900s. To examine the effects of these reductions in population size on koala genetic diversity, we sequenced part of the hypervariable region of mitochondrial DNA (mtDNA) in koala museum specimens collected in the 19th and 20th centuries, hypothesizing that the historical samples would exhibit greater genetic diversity. Results The mtDNA haplotypes present in historical museum samples were identical to haplotypes found in modern koala populations, and no novel haplotypes were detected. Rarefaction analyses suggested that the mtDNA genetic diversity present in the museum samples was similar to that of modern koalas. Conclusions Low mtDNA diversity may have been present in koala populations prior to recent population declines. When considering management strategies, low genetic diversity of the mtDNA hypervariable region may not indicate recent inbreeding or founder events but may reflect an older historical pattern for koalas. PMID:23095716
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-01-01
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Evaluation of non-coding variation in GLUT1 deficiency.
Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S
2016-12-01
Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Kumari, Priyanka; Singh, Subodh Kumar; Raman, Rajiva
2018-06-05
Genome-wide linkage analysis and whole genome sequencing in a Van der Woude syndrome (VWS) family revealed that the SNP, rs539075, within intron 2 of the cadherin 2 gene (CDH2) co-segregated with the disease phenotype. A study with nonsyndromic cleft lip with or without cleft palate (NSCL ± P) cases (N = 292) and controls (N = 287) established association of this SNP with NSCL ± P as a risk factor. RT-PCR based expression analysis of the SNP-harbouring region of intron 2 of CDH2 in the clefted lip and/or palate tissues of 16 patients revealed that the mutant allele expressed in all those individuals having it (hetero-/homozygous), whereas the wild type allele expressed in <50% of the samples in which it was present. The intronic transcript was also present in the prospective lip and palate region of 13.5 dpc mouse embryo, detected by RNA in situ hybridization and RT-PCR. These results including the in silico, characterization of the ~200 nt-intronic transcript showed that conformationally it fits best with noncoding small RNA, possibly a precursor of miRNA. Its function in the orofacial organogenesis remains to be elucidated which will enable us to define the role of this mutant ncRNA in the clefting of lip and palate. Copyright © 2018 Elsevier B.V. All rights reserved.
Zeng, Fan-chun; Gao, Cheng-wen; Gao, Li-zhi
2016-01-01
The complete chloroplast genome sequence of American bird pepper (Capsicum annuum var. glabriusculum) is reported and characterized in this study. The genome size is 156,612 bp, containing a pair of inverted repeats (IRs) of 25,776 bp separated by a large single-copy region of 87,213 bp and a small single-copy region of 17,851 bp. The chloroplast genome harbors 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes, and 37 tRNA genes. A total of 18 of these genes are duplicated in the inverted repeat regions, 16 genes contain 1 intron, and 2 genes and one ycf have 2 introns.
Mitochondrial control-region sequence variation in aboriginal Australians.
van Holst Pellekaan, S; Frommer, M; Sved, J; Boettcher, B
1998-01-01
The mitochondrial D-loop hypervariable segment 1 (mt HVS1) between nucleotides 15997 and 16377 has been examined in aboriginal Australian people from the Darling River region of New South Wales (riverine) and from Yuendumu in central Australia (desert). Forty-seven unique HVS1 types were identified, varying at 49 nucleotide positions. Pairwise analysis by calculation of BEPPI (between population proportion index) reveals statistically significant structure in the populations, although some identical HVS1 types are seen in the two contrasting regions. mt HVS1 types may reflect more-ancient distributions than do linguistic diversity and other culturally distinguishing attributes. Comparison with sequences from five published global studies reveals that these Australians demonstrate greatest divergence from some Africans, least from Papua New Guinea highlanders, and only slightly more from some Pacific groups (Indonesian, Asian, Samoan, and coastal Papua New Guinea), although the HVS1 types vary at different nucleotide sites. Construction of a median network, displaying three main groups, suggests that several hypervariable nucleotide sites within the HVS1 are likely to have undergone mutation independently, making phylogenetic comparison with global samples by conventional methods difficult. Specific nucleotide-site variants are major separators in median networks constructed from Australian HVS1 types alone and for one global selection. The distribution of these, requiring extended study, suggests that they may be signatures of different groups of prehistoric colonizers into Australia, for which the time of colonization remains elusive. PMID:9463317
Geiss, K T; Abbas, G M; Makaroff, C A
1994-04-01
The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.
Phylogenetic Diversity of Koala Retrovirus within a Wild Koala Population.
Chappell, K J; Brealey, J C; Amarilla, A A; Watterson, D; Hulse, L; Palmieri, C; Johnston, S D; Holmes, E C; Meers, J; Young, P R
2017-02-01
Koala populations are in serious decline across many areas of mainland Australia, with infectious disease a contributing factor. Koala retrovirus (KoRV) is a gammaretrovirus present in most wild koala populations and captive colonies. Five subtypes of KoRV (A to E) have been identified based on amino acid sequence divergence in a hypervariable region of the receptor binding domain of the envelope protein. However, analysis of viral genetic diversity has been conducted primarily on KoRV in captive koalas housed in zoos in Japan, the United States, and Germany. Wild koalas within Australia have not been comparably assessed. Here we report a detailed analysis of KoRV genetic diversity in samples collected from 18 wild koalas from southeast Queensland. By employing deep sequencing we identified 108 novel KoRV envelope sequences and determined their phylogenetic diversity. Genetic diversity in KoRV was abundant and fell into three major groups; two comprised the previously identified subtypes A and B, while the third contained the remaining hypervariable region subtypes (C, D, and E) as well as four hypervariable region subtypes that we newly define here (F, G, H, and I). In addition to the ubiquitous presence of KoRV-A, which may represent an exclusively endogenous variant, subtypes B, D, and F were found to be at high prevalence, while subtypes G, H, and I were present in a smaller number of animals. Koala retrovirus (KoRV) is thought to be a significant contributor to koala disease and population decline across mainland Australia. This study is the first to determine KoRV subtype prevalence among a wild koala population, and it significantly expands the total number of KoRV sequences available, providing a more precise picture of genetic diversity. This understanding of KoRV subtype prevalence and genetic diversity will be important for conservation efforts attempting to limit the spread of KoRV. Furthermore, KoRV is one of the only retroviruses shown to exist in both endogenous (transmitted vertically to offspring in the germ line DNA) and exogenous (horizontally transmitted between infected individuals) forms, a division of fundamental evolutionary importance. Copyright © 2017 American Society for Microbiology.
Yoshihara, Keisuke; Le, Minh Nhat; Nagasawa, Koo; Tsukagoshi, Hiroyuki; Nguyen, Hien Anh; Toizumi, Michiko; Moriuchi, Hiroyuki; Hashizume, Masahiro; Ariyoshi, Koya; Dang, Duc Anh; Kimura, Hirokazu; Yoshida, Lay-Myint
2016-11-01
We performed molecular evolutionary analyses of the G gene C-terminal 3rd hypervariable region of RSV-A genotypes NA1 and ON1 strains from the paediatric acute respiratory infection patients in central Vietnam during the 2010-2012 study period. Time-scaled phylogenetic analyses were performed using Bayesian Markov Chain Monte Carlo (MCMC) method, and pairwise distances (p-distances) were calculated. Bayesian Skyline Plot (BSP) was constructed to analyze the time-trend relative genetic diversity of central Vietnam RSV-A strains. We also estimated the N-glycosylation sites within G gene hypervariable region. Amino acid substitutions under positive and negative selection pressure were examined using Conservative Single Likelihood Ancestor Counting (SLAC), Fixed Effects Likelihood (FEL), Internal Fixed Effects Likelihood (IFEL) and Mixed Effects Model for Episodic Diversifying Selection (MEME) models. The majority of central Vietnam ON1 strains detected in 2012 were classified into lineage 1 with few positively selected substitutions. As for the Vietnamese NA1 strains, four lineages were circulating during the study period with a few positive selection sites. Shifting patterns of the predominantly circulating NA1 lineage were observed in each year during the investigation period. Median p-distance of central Vietnam NA1 strains was wider (p-distance=0.028) than that of ON1 (p-distance=0.012). The molecular evolutionary rate of central Vietnam ON1 strains was estimated to be 2.55×10 -2 (substitutions/site/year) and was faster than NA1 (7.12×10 -3 (substitutions/site/year)). Interestingly, the evolutionary rates of both genotypes ON1 and NA1 strains from central Vietnam were faster than the global strains respectively. Furthermore, the shifts of N-glycosylation pattern within the G gene 3rd hypervariable region of Vietnamese NA1 strains were observed in each year. BSP analysis indicated the rapid growth of RSV-A effective population size in early 2012. These results suggested that the molecular evolution of RSV-A G gene detected in central Vietnam was fast with unique evolutionary dynamics. Copyright © 2016 Elsevier B.V. All rights reserved.
In vitro mapping of Myotonic Dystrophy (DM) gene promoter
DOE Office of Scientific and Technical Information (OSTI.GOV)
Storbeck, C.J.; Sabourin, L.; Baird, S.
1994-09-01
The Myotonic Dystrophy Kinase (DMK) gene has been cloned and shared homology to serine/threonine protein kinases. Overexpression of this gene in stably transfected mouse myoblasts has been shown to inhibit fusion into myotubes while myoblasts stably transfected with an antisense construct show increased fusion potential. These experiments, along with data showing that the DM gene is highly expressed in muscle have highlighted the possibility of DMK being involved in myogenesis. The promoter region of the DM gene lacks a consensus TATA box and CAAT box, but harbours numerous transcription binding sites. Clones containing extended 5{prime} upstream sequences (UPS) of DMKmore » only weakly drive the reporter gene chloramphenicol acetyl transferase (CAT) when transfected into C2C12 mouse myoblasts. However, four E-boxes are present in the first intron of the DM gene and transient assays show increased expression of the CAT gene when the first intron is present downstream of these 5{prime} UPS in an orientation dependent manner. Comparison between mouse and human sequence reveals that the regions in the first intron where the E-boxes are located are highly conserved. The mapping of the promoter and the importance of the first intron in the control of DMK expression will be presented.« less
Korotkova, Nadja; Borsch, Thomas; Quandt, Dietmar; Taylor, Nigel P; Müller, Kai F; Barthlott, Wilhelm
2011-09-01
The Cactaceae are a major New World plant family and popular in horticulture. Still, taxonomic units and species limits have been difficult to define, and molecular phylogenetic studies so far have yielded largely unresolved trees, so relationships within Cactaceae remain insufficiently understood. This study focuses on the predominantly epiphytic tribe Rhipsalideae and evaluates the utility of a spectrum of plastid genomic regions. • We present a phylogenetic study including 52 of the 53 Rhipsalideae species and all the infraspecific taxa. Seven regions (trnK intron, matK, rbcL, rps3-rpl16, rpl16 intron, psbA-trnH, trnQ-rps16), ca. 5600 nucleotides (nt) were sequenced per sample. The regions used were evaluated for their phylogenetic performance and performance in DNA-based species recognition based on operational taxonomic units (OTUs) defined beforehand. • The Rhipsalideae are monophyletic and contain five clades that correspond to the genera Rhipsalis, Lepismium, Schlumbergera, Hatiora, and Rhipsalidopsis. The species-level tree was well resolved and supported; the rpl16 and trnK introns yielded the best phylogenetic signal. Although the psbA-trnH and trnQ-rps16 spacers were the most successful individual regions for OTU identification, their success rate did not significantly exceed 70%. The highest OTU identification rate of 97% was found using the combination of psbA-trnH, rps3-rpl16, trnK intron, and trnQ-rps16 as a minimum possible marker length (ca. 1660 nt). • The phylogenetic performance of a marker is not determined by the level of sequence variability, and species discrimination power does not necessarily correlate with phylogenetic utility.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Conrad, R.; Thomas, J.; Spieth, J.
In nematodes, the RNA products of some genes are trans-spliced to a 22-nucleotide spliced leader (SL), while the RNA products of other genes are not. In Caenorhabditis elegans, there are two SLs, Sl1 and SL2, donated by two distinct small nuclear ribonucleoprotein particles in a process functionally quite similar to nuclear intron removal. The authors demonstrate here that it is possible to convert a non-trans-spliced gene into a trans-spliced gene by placement of an intron missing only the 5[prime] splice site into the 5[prime] untranslated region. Stable transgenic strains were isolated expressing a gene in which 69 nucleotides of amore » vit-5 intron, including the 3[prime] splice site, were inserted into the 5[prime] untranslated region of a vit-2/vit-6 fusion gene. The RNA product of this gene was examined by primer extension and PCR amplification. Although the vit-2/vit-6 transgene product is not normally trans-spliced, the majority of transcripts from this altered gene were trans-spliced to SL1. They termed the region of a trans-spliced mRNA precursor between the 5[prime] end and the first 3[prime] splice site an 'outrun'. The results suggest that if a transcript begins with intronlike sequence followed by a 3[prime] splice site, this alone may constitute an outrun and be sufficient to demarcate a transcript as a trans-splice acceptor. These findings leave open the possibility that specific sequences are required to increase the efficiency of trans-splicing.« less
Chalker, Victoria J; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe; Erles, Kerstin
2012-06-01
The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010.
Chalker, Victoria J.; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe
2012-01-01
The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010. PMID:22495558
The whole chloroplast genome of wild rice (Oryza australiensis).
Wu, Zhiqiang; Ge, Song
2016-01-01
The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224 bp, exhibiting a typical circular structure including a pair of 25,776 bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212 bp and a small single-copy region (SSC) of 12,470 bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30t RNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Development of Plant Gene Vectors for Tissue-Specific Expression Using GFP as a Reporter Gene
NASA Technical Reports Server (NTRS)
Jackson, Jacquelyn; Egnin, Marceline; Xue, Qi-Han; Prakash, C. S.
1997-01-01
Reporter genes are widely employed in plant molecular biology research to analyze gene expression and to identify promoters. Gus (UidA) is currently the most popular reporter gene but its detection requires a destructive assay. The use of jellyfish green fluorescent protein (GFP) gene from Aequorea Victoria holds promise for noninvasive detection of in vivo gene expression. To study how various plant promoters are expressed in sweet potato (Ipomoea batatas), we are transcriptionally fusing the intron-modified (mGFP) or synthetic (modified for codon-usage) GFP coding regions to these promoters: double cauliflower mosaic virus 35S (CaMV 35S) with AMV translational enhancer, ubiquitin7-intron-ubiquitin coding region (ubi7-intron-UQ) and sporaminA. A few of these vectors have been constructed and introduced into E. coli DH5a and Agrobacterium tumefaciens EHA105. Transient expression studies are underway using protoplast-electroporation and particle bombardment of leaf tissues.
Hardy, C M; Clark-Walker, G D
1991-07-01
The cytochrome oxidase subunit 1 gene (COX1) in K. lactis K8 mtDNA spans 8,826 bp and contains five exons (termed E1-E5) totalling 1,602 bp that show 88% nucleotide base matching and 91% amino acid homology to the equivalent gene in S. cerevisiae. The four introns (termed K1 cox1.1-1.4) contain open reading frames encoding proteins of 786, 333, 319 and 395 amino acids respectively that potentially encode maturase enzymes. The first intron belongs to group II whereas the remaining three are group I type B. Introns K1 cox1.1, 1.3, and 1.4 are found at identical locations to introns Sc cox1.2, 1.5 a, and 1.5 b respectively from S. cerevisiae. Horizontal transfer of an intron between recent progenitors of K. lactis and S. cerevisiae is suggested by the observation that K1 cox1.1 and Sc cox1.2 show 96% base matching. Sequence comparisons between K1 cox1.3/Sc cox1.5 a and K1 cox1.4/Sc cox1.5 b suggest that these introns are likely to have been present in the ancestral COX1 gene of these yeasts. Intron K1 cox1.2 is not found in S. cerevisiae and appears at an unique location in K. lactis. A feature of the DNA sequences of the group I introns K1 cox1.2, 1.3, and 1.4 is the presence of 11 GC-rich clusters inserted into both coding and noncoding regions. Immediately downstream of the COX1 gene is the ATPase subunit 8 gene (A8) that shows 82.6% base matching to its counterpart in S. cerevisiae mtDNA.
Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika
2017-01-01
Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III introns are degenerated group II introns and evolved later. PMID:28852596
The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.
Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye
2016-07-01
The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, has been studied in this article. The genome size is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) are separated by a large single copy region (LSC, 82,459 bp) and a small single-copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. In these genes, 15 genes contained 1 intron and 3 genes comprised of 2 introns.
The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.
Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo
2016-05-01
The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicine was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of 129 individual genes, 13 genes encoded one intron and three genes have two introns.
NASA Technical Reports Server (NTRS)
Breault, D. T.; Lichtler, A. C.; Rowe, D. W.
1997-01-01
Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes
López-Garriga, Juan; Cadilla, Carmen L.
2016-01-01
The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
2010-01-01
Background Molecular characterization of collagen-VI related myopathies currently relies on standard sequencing, which yields a detection rate approximating 75-79% in Ullrich congenital muscular dystrophy (UCMD) and 60-65% in Bethlem myopathy (BM) patients as PCR-based techniques tend to miss gross genomic rearrangements as well as copy number variations (CNVs) in both the coding sequence and intronic regions. Methods We have designed a custom oligonucleotide CGH array in order to investigate the presence of CNVs in the coding and non-coding regions of COL6A1, A2, A3, A5 and A6 genes and a group of genes functionally related to collagen VI. A cohort of 12 patients with UCMD/BM negative at sequencing analysis and 2 subjects carrying a single COL6 mutation whose clinical phenotype was not explicable by inheritance were selected and the occurrence of allelic and genetic heterogeneity explored. Results A deletion within intron 1A of the COL6A2 gene, occurring in compound heterozygosity with a small deletion in exon 28, previously detected by routine sequencing, was identified in a BM patient. RNA studies showed monoallelic transcription of the COL6A2 gene, thus elucidating the functional effect of the intronic deletion. No pathogenic mutations were identified in the remaining analyzed patients, either within COL6A genes, or in genes functionally related to collagen VI. Conclusions Our custom CGH array may represent a useful complementary diagnostic tool, especially in recessive forms of the disease, when only one mutant allele is detected by standard sequencing. The intronic deletion we identified represents the first example of a pure intronic mutation in COL6A genes. PMID:20302629
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Biaoyang; Nasir, J.; Kalchman, M.A.
1995-02-10
We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less
Thermal Unfolding Simulations of Bacterial Flagellin: Insight into its Refolding Before Assembly
Chng, Choon-Peng; Kitao, Akio
2008-01-01
Flagellin is the subunit of the bacterial filament, the micrometer-long propeller of a bacterial flagellum. The protein is believed to undergo unfolding for transport through the channel of the filament and to refold in a chamber at the end of the channel before being assembled into the growing filament. We report a thermal unfolding simulation study of S. typhimurium flagellin in aqueous solution as an attempt to gain atomic-level insight into the refolding process. Each molecule comprises two filament-core domains {D0, D1} and two hypervariable-region domains {D2, D3}. D2 can be separated into subdomains D2a and D2b. We observed a similar unfolding order of the domains as reported in experimental thermal denaturation. D2a and D3 exhibited high thermal stability and contained persistent three-stranded β-sheets in the denatured state which could serve as folding cores to guide refolding. A recent mutagenesis study on flagellin stability seems to suggest the importance of the folding cores. Using crude size estimates, our data suggests that the chamber might be large enough for either denatured hypervariable-region domains or filament-core domains, but not whole flagellin; this implicates a two-staged refolding process. PMID:18263660
Saxena, S; Saxena, V K; Tomar, S; Sapcota, D; Gonmei, G
2016-06-01
A comparative analysis of caecum and crop microbiota of chick, grower and adult stages of Indian indigenous chickens was conducted to investigate the role of the microbiota of the gastrointestinal tract, which play an important role in host performance, health and immunity. High-throughput Illumina sequencing was performed for V3, V4 and V4-V6 hypervariable regions of the 16S rRNA gene. M5RNA and M5NR databases under MG-RAST were used for metagenomic datasets annotation. In the crop, Firmicutes (~78%) and Proteobacteria (~16%) were the predominant phyla whereas in the caecum, Firmicutes (~50%), Bacteroidetes (~29%) and Actinobacteria (~10%) were predominant. The Shannon-Wiener diversity index suggested that sample richness and diversity increased as the chicken aged. For the first time, the presence of Lactobacillus species such as L. frumenti, L. antri, L. mucosae in the chicken crop along with Kineococcus radiotolerans, Desulfohalobium retbaense and L. jensenii in the caecum are reported. Many of these bacterial species have been found to be involved in immune response modulation and disease prevention in pigs and humans. The gut microbiome of the indigenous chicken was enriched with microbes having probiotic potential which might be essential for their adaptability.
Sacco, James; Ruplin, Andrew; Skonieczny, Paul; Ohman, Michael
2017-01-01
In humans, reduced activity of the enzyme monoamine oxidase type A (MAOA) due to genetic polymorphisms within the MAOA gene leads to increased brain neurotransmitter levels associated with aggression. In order to study MAOA genetic diversity in dogs, we designed a preliminary study whose objectives were to identify novel alleles in functionally important regions of the canine MAOA gene, and to investigate whether the frequencies of these polymorphisms varied between five broad breed groups (ancient, herding, mastiff, modern European, and mountain). Fifty dogs representing these five breed groups were sequenced. A total of eleven polymorphisms were found. Seven were single nucleotide polymorphisms (SNPs; two exonic, two intronic and three in the promoter), while four were repeat intronic variations. The most polymorphic loci were repeat regions in introns 1, 2 (7 alleles) and 10 (3 alleles), while the exonic and the promoter regions were highly conserved. Comparison of the allele frequencies of certain microsatellite polymorphisms among the breed groups indicated a decreasing or increasing trend in the number of repeats at different microsatellite loci, as well as the highest genetic diversity for the ancient breeds and the lowest for the most recent mountain breeds, perhaps attributable to canine domestication and recent breed formation. While a specific promoter SNP (-212A > G) is rare in the dog, it is the major allele in wolves. Replacement of this ancestral allele in domestic dogs may lead to the deletion of heat shock factor binding sites on the MAOA promoter. Dogs exhibit significant variation in certain intronic regions of the MAOA gene, while the coding and promoter regions are well-conserved. Distinct genetic differences were observed between breed groups. Further studies are now required to establish whether such polymorphisms are associated in any way with MAOA level and canine behaviour including aggression.
Campos, W N; Massaro, J D; Martinelli, A L C; Halliwell, J A; Marsh, S G E; Mendes-Junior, C T; Donadi, E A
2017-10-01
The HFE molecule controls iron uptake from gut, and defects in the molecule have been associated with iron overload, particularly in hereditary hemochromatosis. The HFE gene including both coding and boundary intronic regions were sequenced in 304 Brazilian individuals, encompassing healthy individuals and patients exhibiting hereditary or acquired iron overload. Six sites of variation were detected: (1) H63D C>G in exon 2, (2) IVS2 (+4) T>C in intron 2, (3) a C>G transversion in intron 3, (4) C282Y G>A in exon 4, (5) IVS4 (-44) T>C in intron 4, and (6) a new guanine deletion (G>del) in intron 5, which were used for haplotype inference. Nine HFE alleles were detected and six of these were officially named on the basis of the HLA Nomenclature, defined by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System, and published via the IPD-IMGT/HLA website. Four alleles, HFE*001, *002, *003, and *004 exhibited variation within their exon sequences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Suh, E R; Waring, R B
1990-01-01
It has been proposed that recognition of the 3' splice site in many group I introns involves base pairing between the start of the 3' exon and a region of the intron known as the internal guide sequence (R. W. Davies, R. B. Waring, J. Ray, T. A. Brown, and C. Scazzocchio, Nature [London] 300:719-724, 1982). We have examined this hypothesis, using the self-splicing rRNA intron from Tetrahymena thermophila. Mutations in the 3' exon that weaken this proposed pairing increased use of a downstream cryptic 3' splice site. Compensatory mutations in the guide sequence that restore this pairing resulted in even stronger selection of the normal 3' splice site. These changes in 3' splice site usage were more pronounced in the background of a mutation (414A) which resulted in an adenine instead of a guanine being the last base of the intron. These results show that the proposed pairing (P10) plays an important role in ensuring that cryptic 3' splice sites are selected against. Surprisingly, the 414A mutation alone did not result in activation of the cryptic 3' splice site. Images PMID:2342465
Doran, Mark; du Plessis, Daniel G; Ghadiali, Eric J; Mann, David M A; Pickering-Brown, Stuart; Larner, Andrew J
2007-10-01
Frontotemporal dementia with parkinsonism linked to chromosome 17 (FTDP-17) owing to the tau intron 10 + 16 mutation usually occurs with a prototypical frontotemporal dementia phenotype with prominent disinhibition and affective disturbances. To report a new FTDP-17 pedigree with the tau intron 10 + 16 mutation demonstrating a clinical phenotype suggestive of Alzheimer disease. Case reports. Regional neuroscience centers in northwest England. Patients We examined 4 members of a kindred in which 8 individuals were affected in 3 generations. All 4 patients reported memory difficulty. Marked anomia was also present, but behavioral disturbances were conspicuously absent in the early stages of disease. All patients had an initial clinical diagnosis of Alzheimer disease. No mutations were found in the presenilin or amyloid precursor protein genes. Pathologic examination of the proband showed features typical of FTDP-17, and tau gene analysis showed the intron 10 + 16 mutation. This pedigree illustrates the phenotypic variability of tau intron 10 + 16 mutations. In pedigrees with a clinical diagnosis of Alzheimer disease but without presenilin or amyloid precursor protein gene mutations, tau gene mutations may be found.
Efficiency of introns from various origins in fish cells.
Bétancourt, O H; Attal, J; Théron, M C; Puissant, C; Houdebine, L M
1993-06-01
Several vectors containing (1) regulatory regions from Rous sarcoma virus (RSV), human cytomegalovirus (CMV), and herpes simplex thymidine kinase (TK); (2) introns from early or late SV40 genes and from trout growth hormone gene (tGH); (3) chloramphenicol acetyltransferase gene (CAT); and (4) transcription terminators from SV40 were transfected into carp EPC cells, salmon CHSE cells, tilapia TO2 cells, quail QT6 cells, and hamster CHO cells. CAT activity was measured in extracts from several cell lines 3 days after transfection and in the fish EPC stable clones. The CMV and RSV promoters were the most potent in all cell types. The intron from late SV40 genes (VP1 intron) worked properly in QT6 and CHO cells but not in EPC and very weakly in TO2 cells. The tGH intron was efficient in all cell types but preferentially in fish cells. The small t intron from SV40 was processed in all cell types. The small t and, to a lesser extent, the tGH introns amplified expression of cat gene in stable clones, in comparison to the transiently transfected cells. These results indicate that elements from mammalian genes may not be properly recognized by the fish cellular machinery and in an unpredictable manner. This finding suggests that vectors prepared to express foreign genes in transfected cultured fish cells and transgenic fish should preferably contain DNA sequences from fish genes or, alternatively, those sequences from mammalian genes that have been previously proved to be compatible with the fish cellular machinery.
Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd
2017-01-26
The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
Crystal structure of group II intron domain 1 reveals a template for RNA assembly
Zhao, Chen; Rajashankar, Kanagalaghatta R.; Marcia, Marco; ...
2015-10-26
Although the importance of large noncoding RNAs is increasingly appreciated, our understanding of their structures and architectural dynamics remains limited. In particular, we know little about RNA folding intermediates and how they facilitate the productive assembly of RNA tertiary structures. In this paper, we report the crystal structure of an obligate intermediate that is required during the earliest stages of group II intron folding. Composed of domain 1 from the Oceanobacillus iheyensis group II intron (266 nucleotides), this intermediate retains native-like features but adopts a compact conformation in which the active site cleft is closed. Transition between this closed andmore » the open (native) conformation is achieved through discrete rotations of hinge motifs in two regions of the molecule. Finally, the open state is then stabilized by sequential docking of downstream intron domains, suggesting a 'first come, first folded' strategy that may represent a generalizable pathway for assembly of large RNA and ribonucleoprotein structures.« less
2011-01-01
Introduction The Toll-like receptor 7 (TLR7) gene, encoded on human chromosome Xp22.3, is crucial for type I interferon production. A recent multicenter study in East Asian populations, comprising Chinese, Korean and Japanese participants, identified an association of a TLR7 single-nucleotide polymorphism (SNP) located in the 3' untranslated region (3' UTR), rs3853839, with systemic lupus erythematosus (SLE), especially in males, although some difference was observed among the tested populations. To test whether additional polymorphisms contribute to SLE in Japanese, we systematically analyzed the association of TLR7 with SLE in a Japanese female population. Methods A case-control association study was conducted on eight tag SNPs in the TLR7 region, including rs3853839, in 344 Japanese females with SLE and 274 healthy female controls. Results In addition to rs3853839, two SNPs in intron 2, rs179019 and rs179010, which were in moderate linkage disequilibrium with each other (r2 = 0.53), showed an association with SLE (rs179019: P = 0.016, odds ratio (OR) 2.02, 95% confidence interval (95% CI) 1.15 to 3.54; rs179010: P = 0.018, OR 1.75, 95% CI 1.10 to 2.80 (both under the recessive model)). Conditional logistic regression analysis revealed that the association of the intronic SNPs and the 3' UTR SNP remained significant after we adjusted them for each other. When only the patients and controls carrying the risk genotypes at the 3' UTR SNPpositionwere analyzed, the risk of SLE was significantly increased when the individuals also carried the risk genotypes at both of the intronic SNPs (P = 0.0043, OR 2.45, 95% CI 1.31 to 4.60). Furthermore, the haplotype containing the intronic risk alleles in addition to the 3' UTR risk allele was associated with SLE under the recessive model (P = 0.016, OR 2.37, 95% CI 1.17 to 4.80), but other haplotypes were not associated with SLE. Conclusions The TLR7 intronic SNPs rs179019 and rs179010 are associated with SLE independently of the 3' UTR SNP rs3853839 in Japanese women. Our findings support a role of TLR7 in predisposition for SLE in Asian populations. PMID:21396113
Positive selection of digestive Cys proteases in herbivorous Coleoptera.
Vorster, Juan; Rasoolizadeh, Asieh; Goulet, Marie-Claire; Cloutier, Conrad; Sainsbury, Frank; Michaud, Dominique
2015-10-01
Positive selection is thought to contribute to the functional diversification of insect-inducible protease inhibitors in plants in response to selective pressures exerted by the digestive proteases of their herbivorous enemies. Here we assessed whether a reciprocal evolutionary process takes place on the insect side, and whether ingestion of a positively selected plant inhibitor may translate into a measurable rebalancing of midgut proteases in vivo. Midgut Cys proteases of herbivorous Coleoptera, including the major pest Colorado potato beetle (Leptinotarsa decemlineata), were first compared using a codon-based evolutionary model to look for the occurrence of hypervariable, positively selected amino acid sites among the tested sequences. Hypervariable sites were found, distributed within -or close to- amino acid regions interacting with Cys-type inhibitors of the plant cystatin protein family. A close examination of L. decemlineata sequences indicated a link between their assignment to protease functional families and amino acid identity at positively selected sites. A function-diversifying role for positive selection was further suggested empirically by in vitro protease assays and a shotgun proteomic analysis of L. decemlineata Cys proteases showing a differential rebalancing of protease functional family complements in larvae fed single variants of a model cystatin mutated at positively selected amino acid sites. These data confirm overall the occurrence of hypervariable, positively selected amino acid sites in herbivorous Coleoptera digestive Cys proteases. They also support the idea of an adaptive role for positive selection, useful to generate functionally diverse proteases in insect herbivores ingesting functionally diverse, rapidly evolving dietary cystatins. Copyright © 2015 Elsevier Ltd. All rights reserved.
The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).
Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu
2017-05-01
The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. In these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The cpDNA AT content of S. hexandrum cpDNA is 61.5%.
The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).
Choi, Kyoung Su; Park, SeonJoo
2016-09-01
The complete chloroplast (cp) genome sequence of the Euonymus japonicus, the first sequenced of the genus Euonymus, was reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat region (IR), which were separated by small single copy (SSC) region and large single copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen genes contain intron of E. japonicus, of which three genes (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.
Population genetic structure and conservation of marbled murrelets (Brachyramphus marmoratus)
Friesen, Vicki L.; Birt, T.P.; Piatt, John F.; Golightly, R.T.; Newman, S.H.; Hebert, P.N.; Congdon, B.C.; Gissing, G.
2005-01-01
Marbled murrelets (Brachyramphus marmoratus) are coastal seabirds that nest from California to the Aleutian Islands. They are declining and considered threatened in several regions. We compared variation in the mitochondrial control region, four nuclear introns and three microsatellite loci among 194 murrelets from throughout their range except Washington and Oregon. Significant population genetic structure was found: nine private control region haplotypes and three private intron alleles occurred at high frequency in the Aleutians and California; global estimates of FST or ??ST and most pairwise estimates involving the Aleutians and/or California were significant; and marked isolation-by-distance was found. Given the available samples, murrelets appear to comprise five genetic management units: (1) western Aleutian Islands, (2) central Aleutian Islands, (3) mainland Alaska and British Columbia, (4) northern California, and (5) central California.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paule Roth, M.; Malfroy, L.; Offer, C.
1995-07-20
Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequencesmore » within 3 introns. Another Alu element is located in the 3{prime}-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5{prime} of the transcription start and 1214 nucleotides 3{prime} of the poly(A) addition sites were also sequenced. The 5{prime}-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA){sub n} and tetranucleotide (TAAA){sub n} repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.« less
Structure and polymorphism of the mouse prion protein gene.
Westaway, D; Cooper, C; Turner, S; Da Costa, M; Carlson, G A; Prusiner, S B
1994-01-01
Missense mutations in the prion protein (PrP) gene, overexpression of the cellular isoform of PrP (PrPC), and infection with prions containing the scrapie isoform of PrP (PrPSc) all cause neurodegenerative disease. To understand better the physiology and expression of PrPC, we retrieved mouse PrP gene (Prn-p) yeast artificial chromosome (YAC), cosmid, phage, and cDNA clones. Physical mapping positions Prn-p approximately 300 kb from ecotropic virus integration site number 4 (Evi-4), compatible with failure to detect recombination between Prn-p and Evi-4 in genetic crosses. The Prn-pa allele encompasses three exons, with exons 1 and 2 encoding the mRNA 5' untranslated region. Exon 2 has no equivalent in the Syrian hamster and human PrP genes. The Prn-pb gene shares this intron/exon structure but harbors an approximately 6-kb deletion within intron 2. While the Prn-pb open reading frame encodes two amino acid substitutions linked to prolonged scrapie incubation periods, a deletion of intron 2 sequences also characterizes inbred strains such as RIII/S and MOLF/Ei with shorter incubation periods, making a relationship between intron 2 size and scrapie pathogenesis unlikely. The promoter regions of a and b Prn-p alleles include consensus Sp1 and AP-1 sites, as well as other conserved motifs which may represent binding sites for as yet unidentified transcription factors. Images PMID:7912827
A Study of the 5S Ribosomal RNAs of the Vibrionaceae
1984-01-01
codon (UAA, UAG, or UGA) TBE Tris-borate-EDTA buffer ug microgram, i.e., 10-’ gram 6 ul microliter. iJe., 10- 6 liter UPG unweighted pair-group UPGMA ...Psy~ww~w .......................... .. 4.------------------ 0 IC 5b. The UPGMA , or UPS average linkage, dendrogram resulting from the...cluster, and the V. damsela - Q. anguillarus doublet are identical to that predicted by UPGMA analysis. C. CONSERVED AND HYPERVARIABLE REGIONS As
Reduced mutation rate in exons due to differential mismatch repair
Mularoni, Loris; Muiños, Ferran; Gonzalez-Perez, Abel; López-Bigas, Núria
2017-01-01
While recent studies have revealed higher than anticipated heterogeneity of mutation rate across genomic regions, mutations in exons and introns are assumed to be generated at the same rate. Here we find fewer somatic mutations in exons than expected based on their sequence content, and demonstrate that this is not due to purifying selection. Moreover, we show that it is caused by higher mismatch repair activity in exonic than in intronic regions. Our findings have important implications for our understanding of mutational and DNA repair processes, our knowledge of the evolution of eukaryotic genes, and practical ramifications for the study of the evolution of both tumors and species. PMID:29106418
Filichkin, Sergei A.; Hamilton, Michael; Dharmawardhana, Palitha D.; Singh, Sunil K.; Sullivan, Christopher; Ben-Hur, Asa; Reddy, Anireddy S. N.; Jaiswal, Pankaj
2018-01-01
Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion of genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen millions of novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns – a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses. PMID:29483921
Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.
Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat
2016-09-01
CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.
Liu, Feng; Melton, James T; Bi, Yuping
2017-10-01
To further understand the trends in the evolution of mitochondrial genomes (mitogenomes or mtDNAs) in the Ulvophyceae, the mitogenomes of two separate thalli of Ulva pertusa were sequenced. Two U. pertusa mitogenomes (Up1 and Up2) were 69,333 bp and 64,602 bp in length. These mitogenomes shared two ribosomal RNAs (rRNAs), 28 transfer RNAs (tRNAs), 29 protein-coding genes, and 12 open reading frames. The 4.7 kb difference in size was attributed to variation in intron content and tandem repeat regions. A total of six introns were present in the smaller U. pertusa mtDNA (Up2), while the larger mtDNA (Up1) had eight. The larger mtDNA had two additional group II introns in two genes (cox1 and cox2) and tandem duplication mutations in noncoding regions. Our results showed the first case of intraspecific variation in chlorophytan mitogenomes and provided further genomic data for the undersampled Ulvophyceae. © 2017 Phycological Society of America.
Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk
Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.
2008-01-01
Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408
Scieglinska, D; Widłak, W; Konopka, W; Poutanen, M; Rahman, N; Huhtaniemi, I; Krawczyk, Z
2001-01-01
The rat Hst70 gene and its mouse counterpart Hsp70.2 belong to the family of Hsp70 heat shock genes and are specifically expressed in male germ cells. Previous studies regarding the structure of the 5' region of the transcription unit of these genes as well as localization of the 'cis' elements conferring their testis-specific expression gave contradictory results [Widlak, Markkula, Krawczyk, Kananen and Huhtaniemi (1995) Biochim. Biophys. Acta 1264, 191-200; Dix, Rosario-Herrle, Gotoh, Mori, Goulding, Barret and Eddy (1996) Dev. Biol. 174, 310-321]. In the present paper we solve these controversies and show that the 5' untranslated region (UTR) of the Hst70 gene contains an intron which is localized similar to that of the mouse Hsp70.2 gene. Reverse transcriptase-mediated PCR, Northern blotting and RNase protection analysis revealed that the transcription initiation of both genes starts at two main distant sites, and one of them is localized within the intron. As a result two populations of Hst70 gene transcripts with similar sizes but different 5' UTR structures can be detected in total testicular RNA. Functional analysis of the Hst70 gene promoter in transgenic mice and transient transfection assays proved that the DNA fragment of approx. 360 bp localized upstream of the ATG transcription start codon is the minimal promoter required for testis-specific expression of the HST70/chloramphenicol acetyltransferase transgene. These experiments also suggest that the expression of the gene may depend on 'cis' regulatory elements localized within exon 1 and the intron sequences. PMID:11563976
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis
D’Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; He, Hong; Li, Shibo; Hejtmancik, James F.; Sieving, Paul A.; Wang, Xinjing
2013-01-01
Purpose X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4–5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Methods Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Results Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5′ region of the RS1 gene (including the promoter) through intron 1 (c.(−35)-1723_c.51+2664del4472). The exon 4–5 deletion spans introns 3 to intron 5 (c.185–1020_c.522+1844del5764). Conclusions Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes. PMID:24227916
Characterization of novel RS1 exonic deletions in juvenile X-linked retinoschisis.
D'Souza, Leera; Cukras, Catherine; Antolik, Christian; Craig, Candice; Lee, Ji-Yun; He, Hong; Li, Shibo; Smaoui, Nizar; Hejtmancik, James F; Sieving, Paul A; Wang, Xinjing
2013-01-01
X-linked juvenile retinoschisis (XLRS) is a vitreoretinal dystrophy characterized by schisis (splitting) of the inner layers of the neuroretina. Mutations within the retinoschisis (RS1) gene are responsible for this disease. The mutation spectrum consists of amino acid substitutions, splice site variations, small indels, and larger genomic deletions. Clinically, genomic deletions are rarely reported. Here, we characterize two novel full exonic deletions: one encompassing exon 1 and the other spanning exons 4-5 of the RS1 gene. We also report the clinical findings in these patients with XLRS with two different exonic deletions. Unrelated XLRS men and boys and their mothers (if available) were enrolled for molecular genetics evaluation. The patients also underwent ophthalmologic examination and in some cases electroretinogram (ERG) recording. All the exons and the flanking intronic regions of the RS1 gene were analyzed with direct sequencing. Two patients with exonic deletions were further evaluated with array comparative genomic hybridization to define the scope of the genomic aberrations. After the deleted genomic region was identified, primer walking followed by direct sequencing was used to determine the exact breakpoints. Two novel exonic deletions of the RS1 gene were identified: one including exon 1 and the other spanning exons 4 and 5. The exon 1 deletion extends from the 5' region of the RS1 gene (including the promoter) through intron 1 (c.(-35)-1723_c.51+2664del4472). The exon 4-5 deletion spans introns 3 to intron 5 (c.185-1020_c.522+1844del5764). Here we report two novel exonic deletions within the RS1 gene locus. We have also described the clinical presentations and hypothesized the genomic mechanisms underlying these schisis phenotypes.
Design of retrovirus vectors for transfer and expression of the human. beta. -globin gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, A.D.; Bender, M.A.; Harris, E.A.S.
1988-11-01
Regulated expression of the human ..beta..-globin gene has been demonstrated in cultured murine erythroleukemia cells and in mice after retrovirus-mediated gene transfer. However, the low titer of recombinant viruses described to date results in relatively inefficient gene transfer, which limits their usefulness for animal studies and for potential gene therapy in humans for diseases involving defective ..beta..-globin genes. The authors found regions that interfered with virus production within intron 2 of the ..beta..-globin gene and on both sides of the gene. The flanking regions could be removed, but intron 2 was required for ..beta..-globin expression. Inclusion of ..beta..-globin introns necessitatesmore » an antisense orientation of the gene within the retrovirus vector. However, they found no effect of the antisense ..beta..-globin transcription on virus production. A region downstream of the ..beta..-globin gene that stimulates expression of the gene in transgenic mice was included in the viruses without detrimental effects on virus titer. Virus titers of over 10/sup 6/ CFU/ml were obtained with the final vector design, which retained the ability to direct regulated expression of human ..beta..-globin in murine erythroleukemia cells. The vector also allowed transfer and expression of the human ..beta..-globin gene in hematopoietic cells (CFU-S cells) in mice.« less
Gu, Cuihua; Tembrock, Luke R.; Johnson, Nels G.; Simmons, Mark P.; Wu, Zhiqiang
2016-01-01
Lagerstroemia (crape myrtle) is an important plant genus used in ornamental horticulture in temperate regions worldwide. As such, numerous hybrids have been developed. However, DNA sequence resources and genome information for Lagerstroemia are limited, hindering evolutionary inferences regarding interspecific relationships. We report the complete plastid genome of Lagerstroemia fauriei. To our knowledge, this is the first reported whole plastid genome within Lythraceae. This genome is 152,440 bp in length with 38% GC content and consists of two single-copy regions separated by a pair of 25,793 bp inverted repeats. The large single copy and the small single copy regions span 83,921 bp and 16,933 bp, respectively. The genome contains 129 genes, including 17 located in each inverted repeat. Phylogenetic analysis of genera sampled from Geraniaceae, Myrtaceae, and Onagraceae corroborated the sister relationship between Lythraceae and Onagraceae. The plastid genomes of L. fauriei and several other Lythraceae species lack the rpl2 intron, which indicating an early loss of this intron within the Lythraceae lineage. The plastid genome of L. fauriei provides a much needed genetic resource for further phylogenetic research in Lagerstroemia and Lythraceae. Highly variable markers were identified for application in phylogenetic, barcoding and conservation genetic applications. PMID:26950701
Bonen, Linda; Boer, Poppo H.; Gray, Michael W.
1984-01-01
We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
NASA Technical Reports Server (NTRS)
Kretsinger, R. H.; Nakayama, S.
1993-01-01
In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Phase distribution of spliceosomal introns: implications for intron origin
Nguyen, Hung D; Yoshihama, Maki; Kenmochi, Naoya
2006-01-01
Background The origin of spliceosomal introns is the central subject of the introns-early versus introns-late debate. The distribution of intron phases is non-uniform, with an excess of phase-0 introns. Introns-early explains this by speculating that a fraction of present-day introns were present between minigenes in the progenote and therefore must lie in phase-0. In contrast, introns-late predicts that the nonuniformity of intron phase distribution reflects the nonrandomness of intron insertions. Results In this paper, we tested the two theories using analyses of intron phase distribution. We inferred the evolution of intron phase distribution from a dataset of 684 gene orthologs from seven eukaryotes using a maximum likelihood method. We also tested whether the observed intron phase distributions from 10 eukaryotes can be explained by intron insertions on a genome-wide scale. In contrast to the prediction of introns-early, the inferred evolution of intron phase distribution showed that the proportion of phase-0 introns increased over evolution. Consistent with introns-late, the observed intron phase distributions matched those predicted by an intron insertion model quite well. Conclusion Our results strongly support the introns-late hypothesis of the origin of spliceosomal introns. PMID:16959043
Creekmore, Amy L; Ziegler, Yvonne S; Bonéy, Jamie L; Nardulli, Ann M
2007-03-15
We have used a chromatin immunoprecipitation (ChIP)-based cloning strategy to isolate and identify genes associated with estrogen receptor alpha (ERalpha) in MCF-7 human breast cancer cells. One of the gene regions isolated was a 288bp fragment from the ninth intron of the breast cancer 1 associated ring domain (BARD1) gene. We demonstrated that ERalpha associated with this region of the endogenous BARD 1 gene in MCF-7 cells, that ERalpha bound to three of five ERE half sites located in the 288bp BARD1 region, and that this 288bp BARD1 region conferred estrogen responsiveness to a heterologous promoter. Importantly, treatment of MCF-7 cells with estrogen increased BARD1 mRNA and protein levels. These findings demonstrate that ChIP cloning strategies can be utilized to successfully isolate regulatory regions that are far removed from the transcription start site and assist in identifying cis elements involved in conferring estrogen responsiveness.
Kung, M H; Lee, Y J; Hsu, J T; Huang, M C; Ju, Y T
2015-06-01
Goat β-casein (CSN2) promoter has been extensively used to derive expression of recombinant therapeutic protein in transgenic goats; however, little direct evidence exists for signaling molecules and the cis-elements of goat CSN2 promoter in response to lactogenic hormone stimulation in goat mammary epithelial cells. Here, we use an immortalized caprine mammary epithelial cell line (CMC) to search for evidence of the above. Serial 5'-flanking regions deleted of promoter and intron 1 in goat CSN2 (-4,047 to +2,054) driven by firefly luciferase reporter gene were constructed and applied to measure promoter activity in CMC. The intron 1 region (+393 to +501) significantly decreased basal activity of the promoter. This finding contradicts other studies of the role of intron 1. The signal transducer and activator of transcription (STAT)5a played a significant role in activating promoter activity by prolactin stimulation. Hydrocortisone enhanced and prolonged the activity of STAT5a and promoter in CMC, but was independent of the glucocorticoid receptor response element. The minimum length of the CSN2 promoter segment in response to lactogenic stimulation was confirmed by 5' serial deletions. A cis-element located from -300 to -90 in proximal goat CSN2 promoter that is absent in bovine and human CSN2 promoter was newly identified. We demonstrated the presence of a STAT5a binding site (-102 to -82) and preservation of the guanosine nucleotide at position -90 based on responses to the presence of lactogenic hormone using internal deletions and point mutations of the predicted STAT5a binding site, and chromatin immunoprecipitation assay. Together, these findings demonstrate that the proximal -300 bp of goat CSN2 promoter containing the STAT5a binding site (-102 to -82) is the response element for lactogenic hormone stimulation. Additionally, intron 1 may be required for tissue or developmental stage-specific expression in mammary gland. The role of the far-distal regions of goat CSN2 promoter in high-level lactogenic hormone induction and specific expression require further examination. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean
2012-01-01
Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675
Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.
2013-01-01
Background Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195
Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J
2013-01-01
Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.
Lazzarato, F; Franceschinis, G; Botta, M; Cordero, F; Calogero, R A
2004-11-01
RRE allows the extraction of non-coding regions surrounding a coding sequence [i.e. gene upstream region, 5'-untranslated region (5'-UTR), introns, 3'-UTR, downstream region] from annotated genomic datasets available at NCBI. RRE parser and web-based interface are accessible at http://www.bioinformatica.unito.it/bioinformatics/rre/rre.html
Associations of polymorphisms in the Pit-1 gene with growth and carcass traits in Angus beef cattle.
Zhao, Q; Davis, M E; Hines, H C
2004-08-01
The Pit-1 gene was studied as a candidate for genetic markers of growth and carcass traits. Angus beef cattle that were divergently selected for high- or low-blood serum IGF-I concentration were used in this study. The single-strand conformation polymorphism method was used to identify polymorphism in the Pit-1 gene including regions from intron 2 to exon 6. Two polymorphisms, Pit1I3H (HinfI) and Pit1I3NL (NlaIII), were detected in intron 3 of the Pit-1 gene. One polymorphism, Pit1I4N (BstNI), was found in intron 4, and a single nucleotide polymorphism, Pit1I5, was found in intron 5. The previously reported polymorphism in exon 6, Pit1E6H (HinfI), was also studied in 416 Angus beef cattle. Associations of the polymorphisms with growth traits, carcass traits, and IGF-I concentration were analyzed using a general linear model procedure. No significant associations were observed between these polymorphisms and growth and carcass traits.
Felício, V; Ramalho, A S; Igreja, S; Amaral, M D
2017-03-01
Even with advent of next generation sequencing complete sequencing of large disease-associated genes and intronic regions is economically not feasible. This is the case of cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible for cystic fibrosis (CF). Yet, to confirm a CF diagnosis, proof of CFTR dysfunction needs to be obtained, namely by the identification of two disease-causing mutations. Moreover, with the advent of mutation-based therapies, genotyping is an essential tool for CF disease management. There is, however, still an unmet need to genotype CF patients by fast, comprehensive and cost-effective approaches, especially in populations with high genetic heterogeneity (and low p.F508del incidence), where CF is now emerging with new diagnosis dilemmas (Brazil, Asia, etc). Herein, we report an innovative mRNA-based approach to identify CFTR mutations in the complete coding and intronic regions. We applied this protocol to genotype individuals with a suspicion of CF and only one or no CFTR mutations identified by routine methods. It successfully detected multiple intronic mutations unlikely to be detected by CFTR exon sequencing. We conclude that this is a rapid, robust and inexpensive method to detect any CFTR coding/intronic mutation (including rare ones) that can be easily used either as primary approach or after routine DNA analysis. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Pombert, Jean-François; Lemieux, Claude; Turmel, Monique
2006-01-01
Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375
A novel multi-variant epitope ensemble vaccine against avian leukosis virus subgroup J.
Wang, Xiaoyu; Zhou, Defang; Wang, Guihua; Huang, Libo; Zheng, Qiankun; Li, Chengui; Cheng, Ziqiang
2017-12-04
The hypervariable antigenicity and immunosuppressive features of avian leukosis virus subgroup J (ALV-J) has led to great challenges to develop effective vaccines. Epitope vaccine will be a perspective trend. Previously, we identified a variant antigenic neutralizing epitope in hypervariable region 1 (hr1) of ALV-J, N-LRDFIA/E/TKWKS/GDDL/HLIRPYVNQS-C. BLAST analysis showed that the mutation of A, E, T and H in this epitope cover 79% of all ALV-J strains. Base on this data, we designed a multi-variant epitope ensemble vaccine comprising the four mutation variants linked with glycine and serine. The recombinant multi-variant epitope gene was expressed in Escherichia coli BL21. The expressed protein of the variant multi-variant epitope gene can react with positive sera and monoclonal antibodies of ALV-J, while cannot react with ALV-J negative sera. The multi-variant epitope vaccine that conjugated Freund's adjuvant complete/incomplete showed high immunogenicity that reached the titer of 1:64,000 at 42 days post immunization and maintained the immune period for at least 126 days in SPF chickens. Further, we demonstrated that the antibody induced by the variant multi-variant ensemble epitope vaccine recognized and neutralized different ALV-J strains (NX0101, TA1, WS1, BZ1224 and BZ4). Protection experiment that was evaluated by clinical symptom, viral shedding, weight gain, gross and histopathology showed 100% chickens that inoculated the multi-epitope vaccine were well protected against ALV-J challenge. The result shows a promising multi-variant epitope ensemble vaccine against hypervariable viruses in animals. Copyright © 2017 Elsevier Ltd. All rights reserved.
Meher, J K; Meher, P K; Dash, G N; Raval, M K
2012-01-01
The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Chen, Huizhong; Li, Xin-Liang; Blum, David L; Ximenes, Eduardo A; Ljungdahl, Lars G
2003-01-01
A cDNA, designated celF, encoding a cellulase (CelF) was isolated from the anaerobic fungus Orpinomyces PC-2. The open reading frame contains regions coding for a signal peptide, a carbohydrate-binding module (CBM), a linker, and a catalytic domain. The catalytic domain was homologous to those of CelA and CelC of the same fungus and to that of the Neocallimastix patriciarum CELA, but CelF lacks a docking domain, characteristic for enzymes of cellulosomes. It was also homologous to the cellobiohydrolase IIs and endoglucanases of aerobic organisms. The gene has a 111-bp intron, located within the CBM-coding region. Some biochemical properties of the purified recombinant enzyme are described.
Genetic variations in the MCT1 (SLC16A1) gene in the Chinese population of Singapore.
Lean, Choo Bee; Lee, Edmund Jon Deoon
2009-01-01
MCT1(SLC16A1) is the first member of the monocarboxylate transporter (MCT) and its family is involved in the transportation of metabolically important monocarboxylates such as lactate, pyruvate, acetate and ketone bodies. This study identifies genetic variations in SLC16A1 in the ethnic Chinese group of the Singaporean population (n=95). The promoter, coding region and exon-intron junctions of the SLC16A1 gene encoding the MCT1 transporter were screened for genetic variation in the study population by DNA sequencing. Seven genetic variations of SLC16A1, including 4 novel ones, were found: 2 in the promoter region, 2 in the coding exons (both nonsynonymous variations), 2 in the 3' untranslated region (3'UTR) and 1 in the intron. Of the two mutations detected in the promoter region, the -363-855T>C is a novel mutation. The 1282G>A (Val(428)Ile) is a novel SNP and was found as heterozygotic in 4 subjects. The 1470T>A (Asp(490)Glu) was found to be a common polymorphism in this study. Lastly, IVS3-17A>C in intron 3 and 2258 (755)A>G in 3'UTR are novel mutations found to be common polymorphisms in the local Chinese population. To our knowledge, this is the first report of a comprehensive analysis on the MCT1 gene in any population.
Phylogeography of Barbary macaques (Macaca sylvanus) and the origin of the Gibraltar colony.
Modolo, Lara; Salzburger, Walter; Martin, Robert D
2005-05-17
The Barbary macaque (Macaca sylvanus) is the earliest offshoot of the genus Macaca and the only extant African representative, all other species being Asiatic. Once distributed throughout North Africa, M. sylvanus is now restricted to isolated forest fragments in Algeria and Morocco. The species is threatened; the maximum total wild population size is estimated at 10,000 individuals. Relationships among surviving wild subpopulations in Algeria (96 samples) and Morocco (116 samples) were examined by using 468-bp sequences from hypervariable region I of the mitochondrial DNA control region. Twenty-four different haplotypes were identified, differing by 1-26 mutational steps (0.2-5.6%) and 1 insertion. With one exception (attributable to secondary introduction in coastal Morocco), Algerian and Moroccan haplotypes are clearly distinct. However, whereas Moroccan subpopulations show little divergence in hypervariable region I sequences and little correspondence with geographical distribution, there is a deep division between two main subpopulations in Algeria and one marked secondary division, with haplotypes generally matching geographical distribution. Accepting an origin of the genus Macaca of 5.5 million years ago, the Moroccan population and the two main Algerian subpopulations diverged approximately 1.6 million years ago. Distinction between Moroccan and Algerian haplotypes permitted analysis of the origin of the Gibraltar colony of Barbary macaques (68 samples; 30% of the population). It is generally held that the present Gibraltar population descended from a dozen individuals imported during World War II. However, the Gibraltar sample was found to include Algerian and Moroccan haplotypes separated by at least 16 mutational steps, revealing a dual origin of the founding females.
Phylogeography of Barbary macaques (Macaca sylvanus) and the origin of the Gibraltar colony
Modolo, Lara; Salzburger, Walter; Martin, Robert D.
2005-01-01
The Barbary macaque (Macaca sylvanus) is the earliest offshoot of the genus Macaca and the only extant African representative, all other species being Asiatic. Once distributed throughout North Africa, M. sylvanus is now restricted to isolated forest fragments in Algeria and Morocco. The species is threatened; the maximum total wild population size is estimated at 10,000 individuals. Relationships among surviving wild subpopulations in Algeria (96 samples) and Morocco (116 samples) were examined by using 468-bp sequences from hypervariable region I of the mitochondrial DNA control region. Twenty-four different haplotypes were identified, differing by 1-26 mutational steps (0.2-5.6%) and 1 insertion. With one exception (attributable to secondary introduction in coastal Morocco), Algerian and Moroccan haplotypes are clearly distinct. However, whereas Moroccan subpopulations show little divergence in hypervariable region I sequences and little correspondence with geographical distribution, there is a deep division between two main subpopulations in Algeria and one marked secondary division, with haplotypes generally matching geographical distribution. Accepting an origin of the genus Macaca of 5.5 million years ago, the Moroccan population and the two main Algerian subpopulations diverged ≈1.6 million years ago. Distinction between Moroccan and Algerian haplotypes permitted analysis of the origin of the Gibraltar colony of Barbary macaques (68 samples; 30% of the population). It is generally held that the present Gibraltar population descended from a dozen individuals imported during World War II. However, the Gibraltar sample was found to include Algerian and Moroccan haplotypes separated by at least 16 mutational steps, revealing a dual origin of the founding females. PMID:15870193
A homozygous mutation in the stem II domain of RNU4ATAC causes typical Roifman syndrome.
Dinur Schejter, Yael; Ovadia, Adi; Alexandrova, Roumiana; Thiruvahindrapuram, Bhooma; Pereira, Sergio L; Manson, David E; Vincent, Ajoy; Merico, Daniele; Roifman, Chaim M
2017-01-01
Roifman syndrome (OMIM# 616651) is a complex syndrome encompassing skeletal dysplasia, immunodeficiency, retinal dystrophy and developmental delay, and is caused by compound heterozygous mutations involving the Stem II region and one of the other domains of the RNU4ATAC gene. This small nuclear RNA gene is essential for minor intron splicing. The Canadian Centre for Primary Immunodeficiency Registry and Repository were used to derive patient information as well as tissues. Utilising RNA sequencing methodologies, we analysed samples from patients with Roifman syndrome and assessed intron retention. We demonstrate that a homozygous mutation in Stem II is sufficient to cause the full spectrum of features associated with typical Roifman syndrome. Further, we demonstrate the same pattern of aberration in minor intron retention as found in cases with compound heterozygous mutations.
Is “Junk” DNA Mostly Intron DNA?
Wong, Gane Ka-Shu; Passey, Douglas A.; Huang, Ying-zong; Yang, Zhiyong; Yu, Jun
2000-01-01
Among higher eukaryotes, very little of the genome codes for protein. What is in the rest of the genome, or the “junk” DNA, that, in Homo sapiens, is estimated to be almost 97% of the genome? Is it possible that much of this “junk” is intron DNA? This is not a question that can be answered just by looking at the published data, even from the finished genomes. One cannot assume that there are no genes in a sequenced region, just because no genes were annotated. We introduce another approach to this problem, based on an analysis of the cDNA-to-genomic alignments, in all of the complete or nearly-complete genomes from the multicellular organisms. Our conclusion is that, in animals but not in plants, most of the “junk” is intron DNA. PMID:11076852
Freitas, F Zanolli; Bertolini, M C
2004-12-01
Glycogen synthase, an enzyme involved in glycogen biosynthesis, is regulated by phosphorylation and by the allosteric ligand glucose-6-phosphate (G6P). In addition, enzyme levels can be regulated by changes in gene expression. We recently cloned a cDNA for glycogen synthase ( gsn) from Neurospora crassa, and showed that gsn transcription decreased when cells were exposed to heat shock (shifted from 30 degrees C to 45 degrees C). In order to understand the mechanisms that control gsn expression, we isolated the gene, including its 5' and 3' flanking regions, from the genome of N. crassa. An ORF of approximately 2.4 kb was identified, which is interrupted by four small introns (II-V). Intron I (482 bp) is located in the 5'UTR region. Three putative Transcription Initiation Sites (TISs) were mapped, one of which lies downstream of a canonical TATA-box sequence (5'-TGTATAAA-3'). Analysis of the 5'-flanking region revealed the presence of putative transcription factor-binding sites, including Heat Shock Elements (HSEs) and STress Responsive Elements (STREs). The possible involvement of these motifs in the negative regulation of gsn transcription was investigated using Electrophoretic Mobility Shift Assays (EMSA) with nuclear extracts of N. crassa mycelium obtained before and after heat shock, and DNA fragments encompassing HSE and STRE elements from the 5'-flanking region. While elements within the promoter region are involved in transcription under heat shock, elements in the 5'UTR intron may participate in transcription during vegetative growth. The results thus suggest that N. crassa possesses trans -acting elements that interact with the 5'-flanking region to regulate gsn transcription during heat shock and vegetative growth.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Ge, H; Noble, J; Colgan, J; Manley, J L
1990-01-01
We have studied splicing of the polyoma virus early region pre-mRNA in vitro. This RNA is alternatively spliced in vivo to produce mRNA encoding the large, middle-sized (MTAg), and small (StAg) tumor antigens. Our primary interest was to learn how the 48-nucleotide StAg intron is excised, because the length of this intron is significantly less than the apparent minimum established for mammalian introns. Although the products of all three splices are detected in vitro, characterization of the pathway and sequence requirements of StAg splicing suggests that splicing factors interact with the precursor RNA in an unexpected way to catalyze removal of this intron. Specifically, StAg splicing uses either of two lariat branch points, one of which is located only 4 nucleotides from the 3' splice site. Furthermore, the StAg splice absolutely requires that the alternative MTAg 3' splice site, located 14 nucleotides downstream of the StAg 3' splice site, be intact. Insertion mutations that increase or decrease the quality of the MTAg pyrimidine stretch enhance or repress StAg as well as MTAg splicing, and a single-base change in the MTAg AG splice acceptor totally blocks both splices. These results demonstrate the ability of two 3' splice sites to cooperate with each other to bring about removal of a single intron. Images PMID:2159146
Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.
Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi
2007-12-01
The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.
Benito-Sanz, Sara; Belinchon-Martínez, Alberta; Aza-Carmona, Miriam; de la Torre, Carolina; Huber, Celine; González-Casado, Isabel; Ross, Judith L; Thomas, N Simon; Zinn, Andrew R; Cormier-Daire, Valerie; Heath, Karen E
2017-02-01
Short stature homeobox gene (SHOX) is located in the pseudoautosomal region 1 of the sex chromosomes. It encodes a transcription factor implicated in the skeletal growth. Point mutations, deletions or duplications of SHOX or its transcriptional regulatory elements are associated with two skeletal dysplasias, Léri-Weill dyschondrosteosis (LWD) and Langer mesomelic dysplasia (LMD), as well as in a small proportion of idiopathic short stature (ISS) individuals. We have identified a total of 15 partial SHOX deletions and 13 partial SHOX duplications in LWD, LMD and ISS patients referred for routine SHOX diagnostics during a 10 year period (2004-2014). Subsequently, we characterized these alterations using MLPA (multiplex ligation-dependent probe amplification assay), fine-tiling array CGH (comparative genomic hybridation) and breakpoint PCR. Nearly half of the alterations have a distal or proximal breakpoint in intron 3. Evaluation of our data and that in the literature reveals that although partial deletions and duplications only account for a small fraction of SHOX alterations, intron 3 appears to be a breakpoint hotspot, with alterations arising by non-allelic homologous recombination, non-homologous end joining or other complex mechanisms.
MAPT as a predisposing gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population
Fang, Pu; Xu, Wenyuan; Wu, Chengsi; Zhu, Min; Li, Xiaobing; Hong, Daojun
2013-01-01
A previous study of European Caucasian patients with sporadic amyotrophic lateral sclerosis demonstrated that a polymorphism in the microtubule-associated protein Tau (MAPT) gene was significantly associated with sporadic amyotrophic lateral sclerosis pathogenesis. Here, we tested this association in 107 sporadic amyotrophic lateral sclerosis patients and 100 healthy controls from the Chinese Han population. We screened the mutation-susceptible regions of MAPT – the 3' and 5' untranslated regions as well as introns 9, 10, 11, and 12 – by direct sequencing, and identified 33 genetic variations. Two of these, 105788 A > G in intron 9 and 123972 T > A in intron 11, were not present in the control group. The age of onset in patients with the 105788 A > G and/or the 123972 T > A variant was younger than that in patients without either genetic variation. Moreover, the pa-tients with a genetic variation were more prone to bulbar palsy and breathing difficulties than those with the wild-type genotype. This led to a shorter survival period in patients with a MAPT genetic variant. Our study suggests that the MAPT gene is a potential risk gene for sporadic amyotrophic lateral sclerosis in the Chinese Han population. PMID:25206632
Klahre, U; Hemmings-Mieszczak, M; Filipowicz, W
1995-06-01
We have previously characterized nuclear cDNA clones encoding two RNA binding proteins, CP-RBP30 and CP-RBP-31, which are targeted to chloroplasts in Nicotiana plumbaginifolia. In this report we describe the analysis of the 3'-untranslated regions (3'-UTRs) in 22 CP-RBP30 and 8 CP-RBP31 clones which reveals that mRNAs encoding both proteins have a very complex polyadenylation pattern. Fourteen distinct poly(A) sites were identified among CP-RBP30 clones and four sites among the CP-RBP31 clones. The authenticity of the sites was confirmed by RNase A/T1 mapping of N. plumbaginifolia RNA. CP-RBP30 provides an extreme example of the heterogeneity known to be a feature of mRNA polyadenylation in higher plants. Using PCR we have demonstrated that CP-RBP genes in N. plumbaginifolia and N. sylvestris, in addition to the previously described introns interrupting the coding region, contain an intron located in the 3' non-coding part of the gene. In the case of the CP-RBP31, we have identified one polyadenylation event occurring in this intron.
Wickramasinghe, Susiji; Yatawara, Lalani; Nagataki, Mitsuru; Agatsuma, Takeshi
2016-10-01
To determine exon/intron organization of the Toxocara canis (T. canis) AK (TCAK) and to test green and black tea and several other chemicals against the activity of recombinant TCAK in the guanidino-specific region by site-directed mutants. Amplification of genomic DNA fragments containing introns was carried out by PCRs. The open-reading frame (1200 bp) of TCAK (wild type) was cloned into the BamH1/SalI site of pMAL-c2X. The maltose-binding protein-TCAK fusion protein was expressed in Escherichia coli TB1 cells. The purity of the expressed enzyme was verified by SDS-PAGE. Mutations were introduced into the guanidino-specific region and other areas of pMAL/TCAK by PCR. Enzyme activity was measured with an NADH-linked assay at 25 °C for the forward reaction (phosphagen synthesis). Arginine kinase in T. canis has a seven-exon/six-intron gene structure. The lengths of the introns ranged from 542 bp to 2 500 bp. All introns begin with gt and end with ag. Furthermore, we measured the enzyme activity of site-directed mutants of the recombinant TCAK. The K m value of the mutant (Alanine to Serine) decreased indicating a higher affinity for substrate arginine than the wild-type. The K m value of the mutant (Serine to Glycine) increased to 0.19 mM. The K m value (0.19 mM) of the double mutant (Alanine-Serine to Serine-Glycine) was slightly greater than in the wild-type (0.12 mM). In addition, several other chemicals were tested; including plant extract Azadiracta indica (A. indica), an aminoglycoside antibiotic (aminosidine), a citrus flavonoid glycoside (rutin) and a commercially available catechin mixture against TCAK. Green and black tea (1:10 dilution) produced 15% and 25% inhibition of TCAK, respectively. The extract of A. indica produced 5% inhibition of TCAK. Moreover, green and black tea produced a non-competitive type of inhibition and A. indica produced a mixed-type of inhibition on TCAK. Arginine kinase in T. canis has a seven-exon/six-intron gene structure. However, further studies are needed to identify a specific compound within the extract causing the inhibitory effect and also to determine the molecular mechanisms behind inhibition of arginine kinase in T. canis. Copyright © 2016 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacobson, D.P.; Schmeling, P.; Sommer, S.S.
Alternating purine and pyrimidine repeats (RY(i)) are an abundant source of polymorphism. The subset with long tandem repeats of GT or AC (GT(i)) have been studied extensively, but cryptic RY(i) (i.e., no single tandem repeat predominates) have received little attention. The factor IX gene has a polymorphic cryptic RY(i) of 142-216 bp. Previously, there were four known polymorphic alleles, of the form AB, A[sub 2]B, A[sub 2]B[sub 2], and A[sub 3]B[sub 2], where A = (GT)(AC)[sub 3](AT)[sub 3](GT)(AT)[sub 4] and B = A with an additional 3' AT dinucleotide. To further characterize this locus, the authors examined more than 1,700more » additional human chromosomes and determined the sequences of the homologous sites in orangutans and chimpanzees. The novel alleles found in humans expand the repertoire of A/B alleles to A[sub 0-4]B[sub 1] and A[sub 1-3]B[sub 2]. The A[sub n]B[sub 2] series are abundant in Caucasians but are absent in blacks and Asians. Conversely, the A[sub 0]B[sub 1] allele is common in blacks but is not found in more than 1,700 Caucasian chromosomes. The data are compatible with a model in which recombination is more frequent than polymerase slippage at this locus. In orangutans, the RY(i) is present, but the sequence is markedly different. An A/B-type of pattern was discerned in which B differs from A by an additional six (AT) dinucleotides at the 3' end. In chimpanzees, the size of the RY(i) locus was greatly expanded, and the sequence showed a novel pattern of hypervariability in which there are many tandem repeats of the form (GT)[sub n](AC)[sub 0](AT)[sub p](GT)[sub q](AT)[sub s], where n, o, p, q, and s are different integers. The sequences of the factor IX intron 1 cryptic RY(i) in three primates provide perspective on the range of possible patterns of polymorphism. Analysis of the patterns suggests how the RY(i) can be conserved during evolution, while the precise sequence varies. 25 refs., 5 figs., 3 tabs.« less
Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.
Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T
2006-03-01
The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.
2013-01-01
Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC. Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation. PMID:24238219
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Group I introns are widespread in archaea.
Nawrocki, Eric P; Jones, Thomas A; Eddy, Sean R
2018-05-18
Group I catalytic introns have been found in bacterial, viral, organellar, and some eukaryotic genomes, but not in archaea. All known archaeal introns are bulge-helix-bulge (BHB) introns, with the exception of a few group II introns. It has been proposed that BHB introns arose from extinct group I intron ancestors, much like eukaryotic spliceosomal introns are thought to have descended from group II introns. However, group I introns have little sequence conservation, making them difficult to detect with standard sequence similarity searches. Taking advantage of recent improvements in a computational homology search method that accounts for both conserved sequence and RNA secondary structure, we have identified 39 group I introns in a wide range of archaeal phyla, including examples of group I introns and BHB introns in the same host gene.
Koonin, Eugene V
2006-01-01
Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971
Jansen, Robert K.; Wojciechowski, Martin F.; Sanniyasi, Elumalai; Lee, Seung-Bum; Daniell, Henry
2008-01-01
Chickpea (Cicer arietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319 bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5′exon, similar to other legumes. Two genes have lost their introns, one in the 3′exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate “IR-lacking clade” (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering. PMID:18638561
Reduced DNA methylation of FKBP5 in Cushing's syndrome.
Resmini, Eugenia; Santos, Alicia; Aulinas, Anna; Webb, Susan M; Vives-Gilabert, Yolanda; Cox, Olivia; Wand, Gary; Lee, Richard S
2016-12-01
FKBP5 encodes a co-chaperone of HSP90 protein that regulates intracellular glucocorticoid receptor sensitivity. When it is bound to the glucocorticoid receptor complex, cortisol binds with lower affinity to glucocorticoid receptor. Cushing's syndrome is associated with memory deficits, smaller hippocampal volumes, and wide range of cognitive impairments. We aimed at evaluating blood DNA methylation of FKBP5 and its relationship with memory and hippocampal volumes in Cushing's syndrome patients. Polymorphism rs1360780 in FKBP5 has also been assessed to determine whether genetic variations can also govern CpG methylation. Thirty-two Cushing's syndrome patients and 32 matched controls underwent memory tests, 3-Tesla MRI of the brain, and DNA extraction from total leukocytes. DNA samples were bisulfite treated, PCR amplified, and pyrosequenced to assess a total of 41CpG-dinucleotides in the introns 1, 2, 5, and 7 of FKBP5. Significantly lower intronic FKBP5 DNA methylation in CS patients compared to controls was observed in ten CpG-dinucleotides. DNA methylation at these CpGs correlated with left and right HV (Intron-2-Region-2-CpG-3: LHV, r = 0.73, p = 0.02; RHV, r = 0.58, p = 0.03). Cured and active CS patients showed both lower methylation of intron 2 (92.37, 91.8, and 93.34 %, respectively, p = 0.03 for both) and of intron 7 (77.08, 73.74, and 79.71 %, respectively, p = 0.02 and p < 0.01) than controls. Twenty-two subjects had the CC genotype, 34 had the TC genotype, and eight had the TT genotype. Lower average DNA methylation in intron 7 was observed in the TT subjects compared to CC (72.5vs. 79.5 %, p = 0.02) and to TC (72.5 vs. 79.0 %, p = 0.03). Our data demonstrate, for the first time, a reduction of intronic DNA methylation of FKBP5 in CS patients.
Wu, Baojun; Hao, Weilong
2014-04-16
Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tay, J.S.H.; Liu, Y.; Low, P.S.
A length polymorphism at the 5{prime} untranslated region of exon 1 and an RFLP (Dde I) in intron 5 (nt 160) of the ATIII gene were amplified by polymerase chain reaction with primers of published sequences. DNA fragments were size-fractionated by agarose gel electrophoresis (3% NuSieve and 1% Seakem GTG) and photographed over a UV transilluminator. A strong linkage disequilibrium was observed between these two polymorphisms of the ATIII gene in the Chinese ({chi}{sup 2} = 63.7; {triangle} 0.42, P < 0.001). The estimated frequencies of the three haplotypes were found to be 0.37 for SD+, 0.40 for LD+ andmore » 0.23 for LD-.« less
Evaluation of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy.
Stabej, Polona; Leegwater, Peter A; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; van Oost, Bernard A
2005-03-01
To evaluate the role of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy (DCM). 6 dogs with DCM, including 2 Doberman Pinschers, 2 Newfoundlands, and 2 Great Danes. All dogs had clinical signs of congestive heart failure, and a diagnosis of DCM was made on the basis of echocardiographic findings. Blood samples were collected from each dog, and genomic DNA was isolated by a salt extraction method. Specific oligonucleotides were designed to amplify the promoter, exon 1, the 5'-part of exon 2 including the complete coding region, and part of intron 1 of the canine phospholamban gene via polymerase chain reaction procedures. These regions were screened for mutations in DNA obtained from the 6 dogs with DCM. No mutations were identified in the promoter, 5' untranslated region, part of intron 1, part of the 3' untranslated region, and the complete coding region of the phospholamban gene in dogs with DCM. Results indicate that mutations in the phospholamban gene are not a frequent cause of DCM in Doberman Pinschers, Newfoundlands, and Great Danes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abraitiene, Asta; US Department of Agriculture, Agricultural Research Service, Molecular Plant Pathology Laboratory, Room 214 Building 004 BARC-West, 10300 Baltimore Avenue, Beltsville, MD 20705; Zhao Yan
Transient expression of engineered reporter RNAs encoding an intron-containing green fluorescent protein (GFP) from a Potato virus X-based expression vector previously demonstrated the nuclear targeting capability of the 359 nucleotide Potato spindle tuber viroid (PSTVd) RNA genome. To further delimit the putative nuclear-targeting signal, PSTVd subgenomic fragments were embedded within the intron, and recombinant reporter RNAs were inoculated onto Nicotiana benthamiana plants. Appearance of green fluorescence in leaf tissue inoculated with PSTVd-fragment-containing constructs indicated shuttling of the RNA into the nucleus by fragments as short as 80 nucleotides in length. Plant-to-plant variation in the timing of intron removal and subsequentmore » GFP fluorescence was observed; however, earliest and most abundant GFP expression was obtained with constructs containing the conserved hairpin I palindrome structure and embedded upper central conserved region. Our results suggest that this conserved sequence and/or the stem-loop structure it forms is sufficient for import of PSTVd into the nucleus.« less
Deng, Ke-Jun; Yang, Zu-Jun; Liu, Cheng; Zhao, Wei; Liu, Chang; Feng, Juan; Ren, Zheng-Long
2007-03-01
Genetic characterization of 9 populations of Rhodiola crenulata, R. fastigiata and R. sachalinensis (Crassulaceae) species from Sichuan and Jilin Provinces of China, was investigated using the conserved primer of nad7 intron 2. All PCR products about 800 bp long were shorter than other Crassulaceae plants, which were used as molecular markers to identify the Rhodiola species. The sequence of the products indicated that total exon of 53 bp and intron of 738 bp exhibit only 9 nucleotide variations. Blasting the nad7 sequences to GenBank and the phylogenetic analysis showed that the sequence of Rhodiola species was clusted independently, and the length was smaller than all the registered sequences of higher plants. The result suggests that the Rhiodola species had a unique sequence in this gene region, which might be related to the special growth condition.
Structural characterization of the FKHR gene and its rearrangement in alveolar rhabdomyosarcoma.
Davis, R J; Bennicelli, J L; Macina, R A; Nycum, L M; Biegel, J A; Barr, F G
1995-12-01
The FKHR gene, which contains a forkhead DNA-binding motif, is fused to either PAX3 or PAX7 by the t(2;13) or t(1;13) translocation in alveolar rhabdomyosarcoma,respectively. These tumors express chimeric transcripts encoding the N-terminal portion of either PAX protein fused to the C-terminal portion of FKHR. To understand the structural basis and functional consequences of these translocations, we characterized the wild-type FKHR gene and its rearrangement in alveolar rhabdomyosarcomas. By isolating and analyzing phage, cosmid and YAC clones, we determined that FKHR consists of three exons spanning 140 kb and that several highly similar loci are present in other genomic regions. Exon 1 encodes the N-terminus of the forkhead domain and is embedded within demethylated CpG island. RNA analyses reveal FKHR transcripts initiate from a TATA-less promoter within this island. Exon 2 encodes the C-terminus of the forkhead domain and a transcription activation domain, whereas exon 3 encodes a large 3' untranslated region. The intron 1-exon 2 boundary precisely matches the FHKR fusion point in the chimeric transcripts found in alveolar rhabdomyosarcomas. Using pulsed-field and fluorescence in situ hybridization analyses, we demonstrate that the 130kb FKHR intron 1 is rearranged in t(2;13)-containing alveolar rhabdomyosarcomas. Our findings indicate that FKHR intron 1 provides a large target for DNA rearrangemnt. Rearrangement of this intron with PAX3 produces two important functional consequences: in-frame fusion of N-terminal PAX3 sequences to the FKHR transcriptional activation domain and disruption of the FKHR DNA binding domain.
Svanella-Dumas, Laurence; Candresse, Thierry; Hullé, Maurice; Marais, Armelle
2013-01-01
A systematic search for viral infection was performed in the isolated Kerguelen Islands, using a range of polyvalent genus-specific PCR assays. Barley yellow dwarf virus (BYDV) was detected in both introduced and native grasses such as Poa cookii. The geographical distribution of BYDV and its prevalence in P. cookii were analyzed using samples collected from various sites of the archipelago. We estimate the average prevalence of BYDV to be 24.9% in P. cookii, with significant variability between sites. BYDV genetic diversity was assessed using sequence information from two genomic regions: the P3 open reading frame (ORF) (encoding the coat protein) and the hypervariable P6 ORF region. The phylogenetic analysis in the P3 region showed that BYDV sequences segregate into three major lineages, the most frequent of which (Ker-I cluster) showed close homology with BYDV-PAV-I isolates and had very low intra-lineage diversity (0.6%). A similarly low diversity was also recorded in the hypervariable P6 region, suggesting that Ker-I isolates derive from the recent introduction of BYDV-PAV-I. Divergence time estimation suggests that BYDV-PAV-I was likely introduced in the Kerguelen environment at the same time frame as its aphid vector, Rhopalosiphum padi, whose distribution shows good overlap with that of BYDV-Ker-I. The two other lineages show more than 22% amino acid divergence in the P3 region with other known species in the BYDV species complex, indicating that they represent distinct BYDV species. Using species-specific amplification primers, the distribution of these novel species was analyzed. The high prevalence of BYDV on native Poaceae and the presence of the vector R. padi, raises the question of its impact on the vulnerable plant communities of this remote ecosystem. PMID:23825645
Identification of human short introns
Abebrese, Emmanuel L.; Arnold, Zachary R.; Armstrong, Katharine; Burns, Lindsay; Day, R. Thomas; Hsu, Daniel G.; Jarrell, Katherine; Luo, Yi; Mugayo, Daphine
2017-01-01
Canonical pre-mRNA splicing requires snRNPs and associated splicing factors to excise conserved intronic sequences, with a minimum intron length required for efficient splicing. Non-canonical splicing–intron excision without the spliceosome–has been documented; most notably, some tRNAs and the XBP1 mRNA contain short introns that are not removed by the spliceosome. There have been some efforts to identify additional short introns, but little is known about how many short introns are processed from mRNAs. Here, we report an approach to identify RNA short introns from RNA-Seq data, discriminating against small genomic deletions. We identify hundreds of short introns conserved among multiple human cell lines. These short introns are often alternatively spliced and are found in a variety of RNAs–both mRNAs and lncRNAs. Short intron splicing efficiency is increased by secondary structure, and we detect both canonical and non-canonical short introns. In many cases, splicing of these short introns from mRNAs is predicted to alter the reading frame and change protein output. Our findings imply that standard gene prediction models which often assume a lower limit for intron size fail to predict short introns effectively. We conclude that short introns are abundant in the human transcriptome, and short intron splicing represents an added layer to mRNA regulation. PMID:28520720
Zhao, Yan; Yi, Zhenzhen; Gentekaki, Eleni; Zhan, Aibin; Al-Farraj, Saleh A; Song, Weibo
2016-01-01
Ciliates comprise a highly diverse protozoan lineage inhabiting all biotopes and playing crucial roles in regulating microbial food webs. Nevertheless, subtle morphological differences and tiny sizes hinder proper species identification for many ciliates. Here, we use the species-rich taxon Frontonia and employ both nuclear and mitochondrial loci. We attempt to assess the level of genetic diversity and evaluate the potential of each marker in delineating species of Frontonia. Morphological features and ecological characteristics are also integrated into genetic results, in an attempt to resolve conflicts of species identification based on morphological and molecular methods. Our studies reveal: (1) the mitochondrial cox1 gene, nuclear ITS1 and ITS2 as well as the hypervariable D2 region of LSU rDNA are promising candidates for species delineation; (2) the cox1 gene provides the best resolution for analyses below the species level; (3) the V2 and V4 hypervariable regions of SSU rDNA, and D1 of LSU rDNA as well as the 5.8S rDNA gene do not show distinct barcoding gap due to overlap between intra- and inter-specific genetic divergences; (4) morphological character-based analysis shows promise for delimitation of Frontonia species; and (5) all gene markers and character-based analyses demonstrate that the genus Frontonia consists of three groups and monophyly of the genus Frontonia is questionable. Copyright © 2015 Elsevier Inc. All rights reserved.
Hibino, Akinobu; Taniguchi, Kiyosu; Zaraket, Hassan; Shobugawa, Yugo; Matsui, Tamano; Suzuki, Hiroshi
2018-01-01
We investigated the genetic diversity, the circulation patterns, and risk for hospital admission of human respiratory syncytial virus (HRSV) strains in Japan between 2012 through 2015. During the study period, 744 HRSV-positive cases were identified by rapid diagnostic test. Of these, 572 samples were positive by real-time PCR; 400 (69.9%) were HRSV-A, and 172 (30.1%) were HRSV-B. HRSV-A and -B alternated as the dominant strain in the subsequent seasons. Phylogenetic tree analysis of the second hyper-variable region of the G protein classified the HRSV-A specimens into NA1 (n = 242) and ON1 (n = 114) genotypes and the HRSV-B specimens into BA9 (n = 60), and BA10 (n = 27). The ON1 genotype, containing a 72-nucleotide duplication in the G protein’s second hyper-variable region, was first detected in the 2012–2013 season but it predominated and replaced the older NA1 HRSV-A in the 2014–2015 season, which also coincided with a record number of HRSV cases reported to the National Infectious Disease Surveillance in Japan. The risk of hospitalization was 6.9 times higher for the ON1 genotype compared to NA1. In conclusion, our data showed that the emergence and predominance of the relatively new ON1 genotype in Japan was associated with a record high number of cases and increased risk for hospitalization. PMID:29377949
Lupini, Caterina; Giovanardi, Davide; Pesente, Patrizia; Bonci, Michela; Felice, Viviana; Rossi, Giulia; Morandini, Emilio; Cecchinato, Mattia; Catelli, Elena
2016-08-01
A distinctive infectious bursal disease (IBD) virus genotype (ITA) was detected in IBD-live vaccinated broilers in Italy without clinical signs of IBD. It was isolated in specific-pathogen-free eggs and molecularly characterized in the hypervariable region of the virus protein (VP) 2. Phylogenetic analysis showed that ITA strains clustered separately from other homologous reference sequences of IBDVs, either classical or very virulent, retrieved from GenBank or previously reported in Italy, and from vaccine strains. The new genotype shows peculiar molecular characteristics in key positions of the VP2 hypervariable region, which affect charged or potentially glycosylated amino acids virtually associated with important changes in virus properties. Characterization of 41 IBDV strains detected in Italy between 2013 and 2014 showed that ITA is emergent in densely populated poultry areas of Italy, being 68% of the IBDV detections made during routine diagnostic activity over a two-year period, in spite of the immunity induced by large-scale vaccination. Four very virulent strains (DV86) and one classical strain (HPR2), together with eight vaccine strains, were also detected. The currently available epidemiological and clinical data do not allow the degree of pathogenicity of the ITA genotype to be defined. Only in vivo experimental pathogenicity studies conducted in secure isolation conditions, through the evaluation of clinical signs and macro/microscopic lesions, will clarify conclusively the virulence of the new Italian genotype.
Alvarenga, Janaína Sousa Campos; Ligeiro, Carla Maia; Gontijo, Célia Maria Ferreira; Cortes, Sofia; Campino, Lenea; Vago, Annamaria Ravara; Melo, Maria Norma
2012-01-01
Background Visceral Leishmaniasis (VL) caused by species from the Leishmania donovani complex is the most severe form of the disease, lethal if untreated. VL caused by Leishmania infantum is a zoonosis with an increasing number of human cases and millions of dogs infected in the Old and the New World. In this study, L. infantum (syn. L.chagasi) strains were isolated from human and canine VL cases. The strains were obtained from endemic areas from Brazil and Portugal and their genetic polymorphism was ascertained using the LSSP-PCR (Low-Stringency Single Specific Primer PCR) technique for analyzing the kinetoplastid DNA (kDNA) minicircles hypervariable region. Principal Findings KDNA genetic signatures obtained by minicircle LSSP-PCR analysis of forty L. infantum strains allowed the grouping of strains in several clades. Furthermore, LSSP-PCR profiles of L. infantum subpopulations were closely related to the host origin (human or canine). To our knowledge this is the first study which used this technique to compare genetic polymorphisms among strains of L. infantum originated from both the Old and the New World. Conclusions LSSP-PCR profiles obtained by analysis of L. infantum kDNA hypervariable region of parasites isolated from human cases and infected dogs from Brazil and Portugal exhibited a genetic correlation among isolates originated from the same reservoir, human or canine. However, no association has been detected among the kDNA signatures and the geographical origin of L. infantum strains. PMID:22912862
Muhle, Rebecca A.; Adjalley, Sophie; Falkard, Brie; Nkrumah, Louis J.; Muhle, Michael E.; Fidock, David A.
2009-01-01
Questions surround the mechanism of mutually exclusive expression by which Plasmodium falciparum mediates activation and silencing of var genes. These encode PfEMP1 proteins, which function as cytoadherent and immunomodulatory molecules at the surface of parasitized erythrocytes. Current evidence suggests that promoter silencing by var introns might play a key role in var gene regulation. To evaluate the impact of cis-acting regulatory regions on var silencing, we generated P. falciparum lines in which luciferase was placed under the control of an UpsA var promoter. By utilizing the Bxb1 integrase system, these reporter cassettes were targeted to a genomic region that was not in apposition to var sub-telomeric domains. This eliminated possible effects from surrounding telomeric elements and removed the variability inherent in episomal systems. Studies with highly synchronized parasites revealed that the UpsA element possessed minimal activity in comparison with a heterologous (hrp3) promoter. This may well result from the integrated UpsA promoter being largely silenced by the neighboring cg6 promoter. Our analyses also revealed that the DownsA 3’ untranslated region further decreased the luciferase activity from both cassettes, whereas the var A intron repressed the UpsA promoter specifically. By applying multivariate analysis over the entire cell cycle, we confirmed the significance of these cis-elements and found the parasite stage to be the major factor regulating UpsA promoter activity. Additionally, we observed that the UpsA promoter was capable of nucleating reversible silencing that spread to a downstream promoter. We believe these studies are the first to analyze promoter activity of Group A var genes which have been implicated in severe malaria, and support the model that var introns can further suppress var expression. These data also suggest an important suppressive role for the DownsA terminator. Our findings imply the existence of multiple levels of var gene regulation in addition to intrinsic promoter-dependent silencing. PMID:19463825
Intermediate introns in nuclear genes of euglenids - are they a distinct type?
Milanowski, Rafał; Gumińska, Natalia; Karnkowska, Anna; Ishikawa, Takao; Zakryś, Bożena
2016-02-29
Nuclear genes of euglenids contain two major types of introns: conventional spliceosomal and nonconventional introns. The latter are characterized by variable non-canonical borders, RNA secondary structure that brings intron ends together, and an unknown mechanism of removal. Some researchers also distinguish intermediate introns, which combine features of both types. They form a stable RNA secondary structure and are classified into two subtypes depending on whether they contain one (intermediate/nonconventional subtype) or both (conventional/intermediate subtype) canonical spliceosomal borders. However, it has been also postulated that most introns classified as intermediate could simply be special cases of conventional or nonconventional introns. Sequences of tubB, hsp90 and gapC genes from six strains of Euglena agilis were obtained. They contain four, six, and two or three introns, respectively (the third intron in the gapC gene is unique for just one strain). Conventional introns were present at three positions: two in the tubB gene (at one position conventional/intermediate introns were also found) and one in the gapC gene. Nonconventional introns are present at ten positions: two in the tubB gene (at one position intermediate/nonconventional introns were also found), six in hsp90 (at four positions intermediate/nonconventional introns were also found), and two in the gapC gene. Sequence and RNA secondary structure analyses of nonconventional introns confirmed that their most strongly conserved elements are base pairing nucleotides at positions +4, +5 and +6/ -8, -7 and -6 (in most introns CAG/CTG nucleotides were observed). It was also confirmed that the presence of the 5' GT/C end in intermediate/nonconventional introns is not the result of kinship with conventional introns, but is due to evolutionary pressure to preserve the purine at the 5' end. However, an example of a nonconventional intron with GC-AG ends was shown, suggesting the possibility of intron type conversion between nonconventional and conventional. Furthermore, an analysis of conventional introns revealed that the ability to form a stable RNA secondary structure by some introns is probably not a result of their relationship with nonconventional introns. It was also shown that acquisition of new nonconventional introns is an ongoing process and can be observed at the level of a single species. In the recently acquired intron in the gapC gene an extended direct repeats at the intron-exon junctions are present, suggesting that double-strand break repair process could be the source of new nonconventional introns.
Splicing-Related Features of Introns Serve to Propel Evolution
Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang
2013-01-01
The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505
Introns: The Functional Benefits of Introns in Genomes.
Jo, Bong-Seok; Choi, Sun Shim
2015-12-01
The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.
Limited MHC class I intron 2 repertoire variation in bonobos.
de Groot, Natasja G; Heijmans, Corrine M C; Helsen, Philippe; Otting, Nel; Pereboom, Zjef; Stevens, Jeroen M G; Bontrop, Ronald E
2017-10-01
Common chimpanzees (Pan troglodytes) experienced a selective sweep, probably caused by a SIV-like virus, which targeted their MHC class I repertoire. Based on MHC class I intron 2 data analyses, this selective sweep took place about 2-3 million years ago. As a consequence, common chimpanzees have a skewed MHC class I repertoire that is enriched for allotypes that are able to recognise conserved regions of the SIV proteome. The bonobo (Pan paniscus) shared an ancestor with common chimpanzees approximately 1.5 to 2 million years ago. To investigate whether the signature of this selective sweep is also detectable in bonobos, the MHC class I gene repertoire of two bonobo panels comprising in total 29 animals was investigated by Sanger sequencing. We identified 14 Papa-A, 20 Papa-B and 11 Papa-C alleles, of which eight, five and eight alleles, respectively, have not been reported previously. Within this pool of MHC class I variation, we recovered only 2 Papa-A, 3 Papa-B and 6 Papa-C intron 2 sequences. As compared to humans, bonobos appear to have an even more diminished MHC class I intron 2 lineage repertoire than common chimpanzees. This supports the notion that the selective sweep may have predated the speciation of common chimpanzees and bonobos. The further reduction of the MHC class I intron 2 lineage repertoire observed in bonobos as compared to the common chimpanzee may be explained by a founding effect or other subsequent selective processes.
Sharma, K; Hair-Bejo, M; Omar, A R; Aini, I
2005-01-01
Two Infectious bursal disease virus (IBDV) isolates, NP1SSH and NP2K were obtained from a severe infectious bursal disease (IBD) outbreak in Nepal in 2002. The hypervariable (HV) region of VP2 gene (1326 bp) of the isolates was generated by RT-PCR and sequenced. The obtained nucleotide sequences were compared with those of twenty other IBDV isolates/strains. Phylogenetic analysis based on this comparison revealed that NP1SSH and NP2K clustered with very virulent (vv) IBDV strains of serotype 1. In contrast, classical, Australian classical and attenuated strains of serotype 1 and avirulent IBDV strains of serotype 2 formed a different cluster. The deduced amino acid sequences of the two isolates showed a 98.3% identity with each other and 97.1% and 98.3% identities, respectively with very virulent IBDV (vvIBDV) isolates/strains. Three amino acids substitutions at positions 300 (E-->A), 308 (I-->F) and 334 (A-->P) within the HV region were common for both the isolates. The amino acids substitutions at positions 27 (S-->T), 28 (I-->T), 31 (D-->A), 36 (H-->Y), 135 (E-->G), 223 (G-->S), 225 (V-->I), 351 (L-->I), 352 (V-->E) and 399 (I-->S) for NP1SSH and at position 438 (I-->S) for NP2K were unique and differed from other IBDV isolates/strains. NP1SSH and NP2K showed highest similarity (97.8%) with the BD399 strain from Bangladesh as compared with other vvIBDV isolates/strains. We conclude that the NP1SSH and NP2K isolates of IBDV from Nepal represent vvIBDV of serotype 1.
Gustafsson, Mattias C U; Lannergård, Jonas; Nilsson, O Rickard; Kristensen, Bodil M; Olsen, John E; Harris, Claire L; Ufret-Vincenty, Rafael L; Stålhammar-Carlemalm, Margaretha; Lindahl, Gunnar
2013-01-01
Many pathogens express a surface protein that binds the human complement regulator factor H (FH), as first described for Streptococcus pyogenes and the antiphagocytic M6 protein. It is commonly assumed that FH recruited to an M protein enhances virulence by protecting the bacteria against complement deposition and phagocytosis, but the role of FH-binding in S. pyogenes pathogenesis has remained unclear and controversial. Here, we studied seven purified M proteins for ability to bind FH and found that FH binds to the M5, M6 and M18 proteins but not the M1, M3, M4 and M22 proteins. Extensive immunochemical analysis indicated that FH binds solely to the hypervariable region (HVR) of an M protein, suggesting that selection has favored the ability of certain HVRs to bind FH. These FH-binding HVRs could be studied as isolated polypeptides that retain ability to bind FH, implying that an FH-binding HVR represents a distinct ligand-binding domain. The isolated HVRs specifically interacted with FH among all human serum proteins, interacted with the same region in FH and showed species specificity, but exhibited little or no antigenic cross-reactivity. Although these findings suggested that FH recruited to an M protein promotes virulence, studies in transgenic mice did not demonstrate a role for bound FH during acute infection. Moreover, phagocytosis tests indicated that ability to bind FH is neither sufficient nor necessary for S. pyogenes to resist killing in whole human blood. While these data shed new light on the HVR of M proteins, they suggest that FH-binding may affect S. pyogenes virulence by mechanisms not assessed in currently used model systems.
Kristensen, Bodil M.; Olsen, John E.; Harris, Claire L.; Ufret-Vincenty, Rafael L.; Stålhammar-Carlemalm, Margaretha; Lindahl, Gunnar
2013-01-01
Many pathogens express a surface protein that binds the human complement regulator factor H (FH), as first described for Streptococcus pyogenes and the antiphagocytic M6 protein. It is commonly assumed that FH recruited to an M protein enhances virulence by protecting the bacteria against complement deposition and phagocytosis, but the role of FH-binding in S. pyogenes pathogenesis has remained unclear and controversial. Here, we studied seven purified M proteins for ability to bind FH and found that FH binds to the M5, M6 and M18 proteins but not the M1, M3, M4 and M22 proteins. Extensive immunochemical analysis indicated that FH binds solely to the hypervariable region (HVR) of an M protein, suggesting that selection has favored the ability of certain HVRs to bind FH. These FH-binding HVRs could be studied as isolated polypeptides that retain ability to bind FH, implying that an FH-binding HVR represents a distinct ligand-binding domain. The isolated HVRs specifically interacted with FH among all human serum proteins, interacted with the same region in FH and showed species specificity, but exhibited little or no antigenic cross-reactivity. Although these findings suggested that FH recruited to an M protein promotes virulence, studies in transgenic mice did not demonstrate a role for bound FH during acute infection. Moreover, phagocytosis tests indicated that ability to bind FH is neither sufficient nor necessary for S. pyogenes to resist killing in whole human blood. While these data shed new light on the HVR of M proteins, they suggest that FH-binding may affect S. pyogenes virulence by mechanisms not assessed in currently used model systems. PMID:23637608
Lu, Shaoyong; Banerjee, Avik; Jang, Hyunbum; Zhang, Jian; Gaponenko, Vadim; Nussinov, Ruth
2015-01-01
K-Ras4B, a frequently mutated oncogene in cancer, plays an essential role in cell growth, differentiation, and survival. Its C-terminal membrane-associated hypervariable region (HVR) is required for full biological activity. In the active GTP-bound state, the HVR interacts with acidic plasma membrane (PM) headgroups, whereas the farnesyl anchors in the membrane; in the inactive GDP-bound state, the HVR may interact with both the PM and the catalytic domain at the effector binding region, obstructing signaling and nucleotide exchange. Here, using molecular dynamics simulations and NMR, we aim to figure out the effects of nucleotides (GTP and GDP) and frequent (G12C, G12D, G12V, G13D, and Q61H) and infrequent (E37K and R164Q) oncogenic mutations on full-length K-Ras4B. The mutations are away from or directly at the HVR switch I/effector binding site. Our results suggest that full-length wild-type GDP-bound K-Ras4B (K-Ras4BWT-GDP) is in an intrinsically autoinhibited state via tight HVR-catalytic domain interactions. The looser association in K-Ras4BWT-GTP may release the HVR. Some of the oncogenic mutations weaken the HVR-catalytic domain association in the K-Ras4B-GDP/-GTP bound states, which may facilitate the HVR disassociation in a nucleotide-independent manner, thereby up-regulating oncogenic Ras signaling. Thus, our results suggest that mutations can exert their effects in more than one way, abolishing GTP hydrolysis and facilitating effector binding. PMID:26453300
Harvala, Heli; Wiman, Åsa; Wallensten, Anders; Zakikhany, Katherina; Englund, Hélène; Brytting, Maria
2016-02-15
It is increasingly difficult to differentiate measles viruses (MeVs) relating to certain outbreaks on the basis of the nucleoprotein (N) gene sequence only, as the diversity of circulating MeV strains has decreased. We studied genomic regions that could provide better molecular discrimination between epidemiologically linked and unlinked MeV variants identified in Sweden during 2013-2014. The hemagglutinin (H) gene and hypervariable region between the fusion and matrix genes (MF-HVR) from 53 MeV-positive samples were amplified and sequenced. Data on phylogenetic clustering of MeVs on the basis of N, H, and MF-HVR sequences were compared to epidemiological data. MeVs were genotyped: 27 were B3, and 26 were D8. One genotype B3 cluster based on the N gene sequence contained epidemiologically unrelated viruses from 4 outbreaks, whereas analysis of H and MF-HVR sequences separated them into phylogenetic clusters consistent with the epidemiological data. Similarly, the single cluster of viruses with a genotype D8 N gene could be divided into the 5 outbreak groups on the basis of the phylogeny of MF-HVR sequences. A detailed picture of MeV circulation with more-defined links between outbreaks was obtained by sequencing the H gene and MF-HVR. Further identification and better genetic characterization of MeVs internationally is essential in identifying sources and routes of MeV spread within and beyond Europe in the elimination end game. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Recurrent Loss of Specific Introns during Angiosperm Evolution
Wang, Hao; Devos, Katrien M.; Bennetzen, Jeffrey L.
2014-01-01
Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. PMID:25474210
Identification of Genetic Elements Associated with EPSPS Gene Amplification
Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.
2013-01-01
Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434
Molecular differentiation of Russian wild ginseng using mitochondrial nad7 intron 3 region.
Li, Guisheng; Cui, Yan; Wang, Hongtao; Kwon, Woo-Saeng; Yang, Deok-Chun
2017-07-01
Cultivated ginseng is often introduced as a substitute and adulterant of Russian wild ginseng due to its lower cost or misidentification caused by similarity in appearance with wild ginseng. The aim of this study is to develop a simple and reliable method to differentiate Russian wild ginseng from cultivated ginseng. The mitochondrial NADH dehydrogenase subunit 7 ( nad 7) intron 3 regions of Russian wild ginseng and Chinese cultivated ginseng were analyzed. Based on the multiple sequence alignment result, a specific primer for Russian wild ginseng was designed by introducing additional mismatch and allele-specific polymerase chain reaction (PCR) was performed for identification of wild ginseng. Real-time allele-specific PCR with endpoint analysis was used for validation of the developed Russian wild ginseng single nucleotide polymorphism (SNP) marker. An SNP site specific to Russian wild ginseng was exploited by multiple alignments of mitochondrial nad 7 intron 3 regions of different ginseng samples. With the SNP-based specific primer, Russian wild ginseng was successfully discriminated from Chinese and Korean cultivated ginseng samples by allele-specific PCR. The reliability and specificity of the SNP marker was validated by checking 20 individuals of Russian wild ginseng samples with real-time allele-specific PCR assay. An effective DNA method for molecular discrimination of Russian wild ginseng from Chinese and Korean cultivated ginseng was developed. The established real-time allele-specific PCR was simple and reliable, and the present method should be a crucial complement of chemical analysis for authentication of Russian wild ginseng.
Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya
2018-05-01
Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 GDAP1-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.
Trans-activation of the Tetrahymena group I intron ribozyme via a non-native RNA-RNA interaction.
Ikawa, Y; Shiraishi, H; Inoue, T
1999-01-01
The peripheral P2.1 domain of the Tetrahymena group I intron ribozyme has been shown to be non-essential for splicing. We found, however, that separately prepared P2.1 RNA efficiently accelerates the 3' splice-site-specific hydrolysis reaction of a mutant ribozyme lacking both P2.1 and its upstream region in trans. We report here the unusual properties of this trans-activation. Compensatory mutational analysis revealed that non-native long-range base-pairings between the loop region of P2.1 RNA and L5c region of the mutant ribozyme are needed for the activation in spite of the fact that P2.1 forms base-pairings with P9.1 in the Tetrahymena ribozyme. The trans -activation depends on the non-native RNA-RNA interaction together with the higher order structure of P2.1 RNA. This activation is unique among the known trans-activations that utilize native tertiary interactions or RNA chaperons. PMID:10075996
Kote-Jarai, Zsofia; Saunders, Edward J.; Leongamornlert, Daniel A.; Tymrakiewicz, Malgorzata; Dadaev, Tokhir; Jugurnauth-Little, Sarah; Ross-Adams, Helen; Al Olama, Ali Amin; Benlloch, Sara; Halim, Silvia; Russel, Roslin; Dunning, Alison M.; Luccarini, Craig; Dennis, Joe; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Muir, Ken; Giles, Graham G.; Severi, Gianluca; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Lindstrom, Sara; Kraft, Peter; Hunter, David J.; Gapstur, Susan; Chanock, Stephen; Berndt, Sonja I.; Albanes, Demetrius; Andriole, Gerald; Schleutker, Johanna; Weischer, Maren; Canzian, Federico; Riboli, Elio; Key, Tim J.; Travis, Ruth C.; Campa, Daniele; Ingles, Sue A.; John, Esther M.; Hayes, Richard B.; Pharoah, Paul; Khaw, Kay-Tee; Stanford, Janet L.; Ostrander, Elaine A.; Signorello, Lisa B.; Thibodeau, Stephen N.; Schaid, Dan; Maier, Christiane; Vogel, Walther; Kibel, Adam S.; Cybulski, Cezary; Lubinski, Jan; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y.; Kaneva, Radka; Batra, Jyotsna; Spurdle, Amanda; Clements, Judith A.; Teixeira, Manuel R.; Govindasami, Koveela; Guy, Michelle; Wilkinson, Rosemary A.; Sawyer, Emma J.; Morgan, Angela; Dicks, Ed; Baynes, Caroline; Conroy, Don; Bojesen, Stig E.; Kaaks, Rudolf; Vincent, Daniel; Bacot, François; Tessier, Daniel C.; Easton, Douglas F.; Eeles, Rosalind A.
2013-01-01
Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease. PMID:23535824
Genetic Variation among Major Human Geographic Groups Supports a Peculiar Evolutionary Trend in PAX9
Paixão-Côrtes, Vanessa R.; Meyer, Diogo; Pereira, Tiago V.; Mazières, Stéphane; Elion, Jacques; Krishnamoorthy, Rajagopal; Zago, Marco A.; Silva, Wilson A.; Salzano, Francisco M.; Bortolini, Maria Cátira
2011-01-01
A total of 172 persons from nine South Amerindian, three African and one Eskimo populations were studied in relation to the Paired box gene 9 (PAX9) exon 3 (138 base pairs) as well as its 5′and 3′flanking intronic segments (232 bp and 220 bp, respectively) and integrated with the information available for the same genetic region from individuals of different geographical origins. Nine mutations were scored in exon 3 and six in its flanking regions; four of them are new South American tribe-specific singletons. Exon3 nucleotide diversity is several orders of magnitude higher than its intronic regions. Additionally, a set of variants in the PAX9 and 101 other genes related with dentition can define at least some dental morphological differences between Sub-Saharan Africans and non-Africans, probably associated with adaptations after the modern human exodus from Africa. Exon 3 of PAX9 could be a good molecular example of how evolvability works. PMID:21298044
High variability of mitochondrial gene order among fungi.
Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni
2014-02-01
From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.
Origin and evolution of spliceosomal introns
2012-01-01
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section. PMID:22507701
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.
Edwards, Scott V; Cloutier, Alison; Baker, Allan J
2017-11-01
Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics
Cloutier, Alison; Baker, Allan J.
2017-01-01
Abstract Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. PMID:28637293
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-09-19
To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G planctonica and 262,888-bp G sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2016-01-01
Abstract To probe organelle genome evolution in the Ulvales/Ulotrichales clade, the newly sequenced chloroplast and mitochondrial genomes of Gloeotilopsis planctonica and Gloeotilopsis sarcinoidea (Ulotrichales) were compared with those of Pseudendoclonium akinetum (Ulotrichales) and of the few other green algae previously sampled in the Ulvophyceae. At 105,236 bp, the G. planctonica mitochondrial DNA (mtDNA) is the largest mitochondrial genome reported so far among chlorophytes, whereas the 221,431-bp G. planctonica and 262,888-bp G. sarcinoidea chloroplast DNAs (cpDNAs) are the largest chloroplast genomes analyzed among the Ulvophyceae. Gains of non-coding sequences largely account for the expansion of these genomes. Both Gloeotilopsis cpDNAs lack the inverted repeat (IR) typically found in green plants, indicating that two independent IR losses occurred in the Ulvales/Ulotrichales. Our comparison of the Pseudendoclonium and Gloeotilopsis cpDNAs offered clues regarding the mechanism of IR loss in the Ulotrichales, suggesting that internal sequences from the rDNA operon were differentially lost from the two original IR copies during this process. Our analyses also unveiled a number of genetic novelties. Short mtDNA fragments were discovered in two distinct regions of the G. sarcinoidea cpDNA, providing the first evidence for intracellular inter-organelle gene migration in green algae. We identified for the first time in green algal organelles, group II introns with LAGLIDADG ORFs as well as group II introns inserted into untranslated gene regions. We discovered many group II introns occupying sites not previously documented for the chloroplast genome and demonstrated that a number of them arose by intragenomic proliferation, most likely through retrohoming. PMID:27503298
Patterns and rates of intron divergence between humans and chimpanzees
Gazave, Elodie; Marqués-Bonet, Tomàs; Fernando, Olga; Charlesworth, Brian; Navarro, Arcadi
2007-01-01
Background Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results. Results We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns). Conclusion We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints. PMID:17309804
Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium.
Ma, Ming-Yue; Che, Xun-Ru; Porceddu, Andrea; Niu, Deng-Ke
2015-12-18
Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5' end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss.
Use of synthetic oligonucleotide DNA probes for the identification of Bacteroides gingivalis.
Moncla, B J; Braham, P; Dix, K; Watanabe, S; Schwartz, D
1990-01-01
Six different oligonucleotide probes complementary to the hypervariable regions of 16S rRNA of Bacteroides gingivalis were tested for specificity and sensitivity against 77 field strains of B. gingivalis and 105 strains of 12 other Bacteroides species. The data demonstrated that these probes were very specific (range, 0.85 to 1.00) and sensitive (1.00). Some limited cross-reactions with other Bacteroides species were observed. Four of these probes should be useful for rapid detection and identification of B. gingivalis. Images PMID:1690217
Phylogenetic analysis of Sicilian goats reveals a new mtDNA lineage.
Sardina, M T; Ballester, M; Marmi, J; Finocchiaro, R; van Kaam, J B C H M; Portolano, B; Folch, J M
2006-08-01
The mitochondrial hypervariable region 1 (HVR1) sequence of 67 goats belonging to the Girgentana, Maltese and Derivata di Siria breeds was partially sequenced in order to present the first phylogenetic characterization of Sicilian goat breeds. These sequences were compared with published sequences of Indian and Pakistani domestic goats and wild goats. Mitochondrial lineage A was observed in most of the Sicilian goats. However, three Girgentana haplotypes were highly divergent from the Capra hircus clade, indicating that a new mtDNA lineage in domestic goats was found.
Analysis of nonuniformity in intron phase distribution.
Fedorov, A; Suboch, G; Bujakov, M; Fedorova, L
1992-01-01
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling. PMID:1598214
Tissue- and case-specific retention of intron 40 in mature dystrophin mRNA.
Nishida, Atsushi; Minegishi, Maki; Takeuchi, Atsuko; Niba, Emma Tabe Eko; Awano, Hiroyuki; Lee, Tomoko; Iijima, Kazumoto; Takeshima, Yasuhiro; Matsuo, Masafumi
2015-06-01
The dystrophin gene, which is mutated in Duchenne muscular dystrophy (DMD), comprises 79 exons that show multiple alternative splicing events. Intron retention, a type of alternative splicing, may control gene expression. We examined intron retention in dystrophin introns by reverse-transcription PCR from skeletal muscle, focusing on the nine shortest (all <1000 bp), because these are more likely to be retained. Only one, intron 40, was retained in mRNA; sequencing revealed insertion of a complete intron 40 (851 nt) between exons 40 and 41. The intron 40 retention product accounted for 1.2% of the total product but had a premature stop codon at the fifth intronic codon. Intron 40 retention was most strongly observed in the kidney (36.6%) and was not obtained from the fetal liver, lung, spleen or placenta. This indicated that intron retention is a tissue-specific event whose level varies among tissues. In two DMD patients, intron 40 retention was observed in one patient but not in the other. Examination of splicing regulatory factors revealed that intron 40 had the highest guanine-cytosine content of all examined introns in a 30-nt segment at its 3' end. Further studies are needed to clarify the biological role of intron 40-retained dystrophin mRNA.
Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji
2014-01-01
Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084
Genetic Origins of Lactase Persistence and the Spread of Pastoralism in Africa
Ranciaro, Alessia; Campbell, Michael C.; Hirbo, Jibril B.; Ko, Wen-Ya; Froment, Alain; Anagnostou, Paolo; Kotze, Maritha J.; Ibrahim, Muntaser; Nyambo, Thomas; Omar, Sabah A.; Tishkoff, Sarah A.
2014-01-01
In humans, the ability to digest lactose, the sugar in milk, declines after weaning because of decreasing levels of the enzyme lactase-phlorizin hydrolase, encoded by LCT. However, some individuals maintain high enzyme amounts and are able to digest lactose into adulthood (i.e., they have the lactase-persistence [LP] trait). It is thought that selection has played a major role in maintaining this genetically determined phenotypic trait in different human populations that practice pastoralism. To identify variants associated with the LP trait and to study its evolutionary history in Africa, we sequenced MCM6 introns 9 and 13 and ∼2 kb of the LCT promoter region in 819 individuals from 63 African populations and in 154 non-Africans from nine populations. We also genotyped four microsatellites in an ∼198 kb region in a subset of 252 individuals to reconstruct the origin and spread of LP-associated variants in Africa. Additionally, we examined the association between LP and genetic variability at candidate regulatory regions in 513 individuals from eastern Africa. Our analyses confirmed the association between the LP trait and three common variants in intron 13 (C-14010, G-13907, and G-13915). Furthermore, we identified two additional LP-associated SNPs in intron 13 and the promoter region (G-12962 and T-956, respectively). Using neutrality tests based on the allele frequency spectrum and long-range linkage disequilibrium, we detected strong signatures of recent positive selection in eastern African populations and the Fulani from central Africa. In addition, haplotype analysis supported an eastern African origin of the C-14010 LP-associated mutation in southern Africa. PMID:24630847
Three distinct modes of intron dynamics in the evolution of eukaryotes.
Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V
2007-07-01
Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.
Humphreys, Isla; Fleming, Vicki; Fabris, Paolo; Parker, Joe; Schulenberg, Bodo; Brown, Anthony; Demetriou, Charis; Gaudieri, Silvana; Pfafferott, Katja; Lucas, Michaela; Collier, Jane; Huang, Kuan-Hsiang Gary; Pybus, Oliver G.; Klenerman, Paul; Barnes, Eleanor
2009-01-01
Hepatitis C virus subtype 3a is a highly prevalent and globally distributed strain that is often associated with infection via injection drug use. This subtype exhibits particular phenotypic characteristics. In spite of this, detailed genetic analysis of this subtype has rarely been performed. We performed full-length viral sequence analysis in 18 patients with chronic HCV subtype 3a infection and assessed genomic viral variability in comparison to other HCV subtypes. Two novel regions of intragenotypic hypervariability within the envelope protein E2, of HCV genotype 3a, were identified. We named these regions HVR495 and HVR575. They consisted of flanking conserved hydrophobic amino acids and central variable residues. A 5-amino-acid insertion found only in genotype 3a and a putative glycosylation site is contained within HVR575. Evolutionary analysis of E2 showed that positively selected sites within genotype 3a infection were largely restricted to HVR1, HVR495, and HVR575. Further analysis of clonal viral populations within single hosts showed that viral variation within HVR495 and HVR575 were subject to intrahost positive selecting forces. Longitudinal analysis of four patients with acute HCV subtype 3a infection sampled at multiple time points showed that positively selected mutations within HVR495 and HVR575 arose early during primary infection. HVR495 and HVR575 were not present in HCV subtypes 1a, 1b, 2a, or 6a. Some variability that was not subject to positive selection was present in subtype 4a HVR575. Further defining the functional significance of these regions may have important implications for genotype 3a E2 virus-receptor interactions and for vaccine studies that aim to induce cross-reactive anti-E2 antibodies. PMID:19740991
Genetic characterization of Common Eiders breeding in the Yukon-Kuskokwim Delta, Alaska
Sonsthagen, Sarah A.; Talbot, Sandra L.; McCracken, Kevin G.
2007-01-01
We assessed population genetic subdivision among four colonies of Common Eiders (Somateria mollissima v-nigrum) breeding in the Yukon-Kuskokwim Delta (YKD), Alaska, using microsatellite genotypes and DNA sequences with differing modes of inheritance. Significant, albeit low, levels of genetic differentiation were observed between mainland populations and Kigigak Island for nuclear intron lamin A and mitochondrial DNA (mtDNA) control region. Intercolony variation in haplotypic frequencies also was observed at mtDNA. Positive growth signatures assayed from microsatellites, nuclear introns, and mtDNA indicate recent colonization of the YKD, and may explain the low levels of structuring observed. Gene flow estimates based on microsatellites, nuclear introns, and mtDNA suggest asymmetrical gene flow between mainland colonies and Kigigak Island, with more individuals on average dispersing from mainland populations to Kigigak Island than vice versa. The directionality of gene flow observed may be explained by the colonization of the YKD from northern glacial refugia or by YKD metapopulation dynamics.
Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.
2016-01-01
Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175
Intron open reading frames as mobile elements and evolution of a group I intron.
Sellem, C H; Belcour, L
1997-05-01
Group I introns are proposed to have become mobile following the acquisition of open reading frames (ORFs) that encode highly specific DNA endonucleases. This proposal implies that intron ORFs could behave as autonomously mobile entities. This was supported by abundant circumstantial evidence but no experiment of ORF transfer from an ORF-containing intron to its ORF-less counterpart has been described. In this paper we present such experiments, which demonstrate the efficient mobility of the mitochondrial nad1-i4-orf1 between two Podospora strains. The homing of this mobile ORF was accompanied by a bidirectional co-conversion that did not systematically involve the whole intron sequence. Orf1 acquisition would be the most recent step in the evolution of the nad1-i4 intron, which has resulted in many strains of Podospora having an intron with two ORFs (biorfic) and four splicing pathways. We show that two of the splicing events that operate in this biorfic intron, as evidenced by PCR experiments, are generated by a 5'-alternative splice site, which is most probably a remnant of the monoorfic ancestral form of the intron. We propose a sequential evolution model that is consistent with the four organizations of the corresponding nad1 locus that we found among various species of the Pyrenomycete family; these organizations consist of no intron, an intron alone, a monoorfic intron, and a biorfic intron.
Methylation of an alpha-foetoprotein gene intragenic site modulates gene activity.
Opdecamp, K; Rivière, M; Molné, M; Szpirer, J; Szpirer, C
1992-01-01
By comparing the methylation pattern of Mspl/Hpall sites in the 5' region of the mouse alpha-foetoprotein (AFP) gene of different cells (hepatoma cells, foetal and adult liver, fibroblasts), we found a correlation between gene expression and unmethylation of a site located in the first intron of the gene. Other sites did not show this correlation. In transfection experiments of unmethylated and methylated AFP-CAT chimeric constructions, we then showed that methylation of the intronic site negatively modulates expression of CAT activity. We also found that a DNA segment centered on this site binds nuclear proteins; however methylation did not affect protein binding. Images PMID:1371343
Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier
2013-01-01
Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner.
Chen, Xi; Crupper, Scott S
2016-09-01
Gypsum caves found throughout the Red Hills of Kansas have the state's most diverse and largest population of cave-roosting bats. White-nose syndrome (WNS), a disease caused by the fungus Pseudogymnoascus destructans, which threatens all temperate bat species, has not been previously detected in the gypsum caves as this disease moves westward from the eastern United States. Cave soil was obtained from the gypsum caves, and using the polymerase chain reaction, a 624-nucleotide DNA fragment specific to the Type 1 intron-internal transcribed spacer region of the 18S rRNA gene from Pseudogymnoascus species was amplified. Subsequent cloning and DNA sequencing indicated P. destructans DNA was present, along with 26 uncharacterized Pseudogymnoascus DNA variants. However, no evidence of WNS was observed in bat populations residing in these caves.
Kawaguchi, Risa; Kiryu, Hisanori
2016-05-06
RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .
Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul
2013-01-01
Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950
RNA structure in splicing: An evolutionary perspective.
Lin, Chien-Ling; Taggart, Allison J; Fairbrother, William G
2016-09-01
Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.
Aoki, Kimiko; Tanaka, Hiroyuki; Kawahara, Takashi
2018-07-01
The standard method for personal identification and verification of urine samples in doping control is short tandem repeat (STR) analysis using nuclear DNA (nDNA). The DNA concentration of urine is very low and decreases under most conditions used for sample storage; therefore, the amount of DNA from cryopreserved urine samples may be insufficient for STR analysis. We aimed to establish a multiplexed assay for urine mitochondrial DNA typing containing only trace amounts of DNA, particularly for Japanese populations. A multiplexed suspension-array assay using oligo-tagged microspheres (Luminex MagPlex-TAG) was developed to measure C-stretch length in hypervariable region 1 (HV1) and 2 (HV2), five single nucleotide polymorphisms (SNPs), and one polymorphic indel. Based on these SNPs and the indel, the Japanese population can be classified into five major haplogroups (D4, B, M7a, A, D5). The assay was applied to DNA samples from urine cryopreserved for 1 - 1.5 years (n = 63) and fresh blood (n = 150). The assay with blood DNA enabled Japanese subjects to be categorized into 62 types, exhibiting a discriminatory power of 0.960. The detection limit for cryopreserved urine was 0.005 ng of nDNA. Profiling of blood and urine pairs revealed that 5 of 63 pairs showed different C-stretch patterns in HV1 or HV2. The assay described here yields valuable information in terms of the verification of urine sample sources employing only trace amounts of recovered DNA. However, blood cannot be used as a reference sample.
Da Lage, Jean-Luc; Maczkowiak, Frédérique; Cariou, Marie-Louise
2011-01-01
Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that “resets” of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures. PMID:21611157
Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.
2013-01-01
Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170
Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution
Lambowitz, Alan M.; Belfort, Marlene
2015-01-01
SUMMARY This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome’s small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns. PMID:25878921
De novo insertion of an intron into the mammalian sex determining gene, SRY
O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall
1998-01-01
Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
Lee, Hae-Lim; Jansen, Robert K; Chumley, Timothy W; Kim, Ki-Joong
2007-05-01
The chloroplast (cp) DNA sequence of Jasminum nudiflorum (Oleaceae-Jasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4-psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4-trnS-ycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.
Intron-loss evolution of hatching enzyme genes in Teleostei
2010-01-01
Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321
Humanized Antibodies for Antiviral Therapy
NASA Astrophysics Data System (ADS)
Co, Man Sung; Deschamps, Marguerite; Whitley, Richard J.; Queen, Cary
1991-04-01
Antibody therapy holds great promise for the treatment of cancer, autoimmune disorders, and viral infections. Murine monoclonal antibodies are relatively easy to produce but are severely restricted for therapeutic use by their immunogenicity in humans. Production of human monoclonal antibodies has been problematic. Humanized antibodies can be generated by introducing the six hypervariable regions from the heavy and light chains of a murine antibody into a human framework sequence and combining it with human constant regions. We humanized, with the aid of computer modeling, two murine monoclonal antibodies against herpes simplex virus gB and gD glycoproteins. The binding, virus neutralization, and cell protection results all indicate that both humanized antibodies have retained the binding activities and the biological properties of the murine monoclonal antibodies.
Zerbato, Madeleine; Holic, Nathalie; Moniot-Frin, Sophie; Ingrao, Dina; Galy, Anne; Perea, Javier
2013-01-01
Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP) and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner. PMID:23505475
Janbon, Guilhem
2018-01-01
In Cryptococcus neoformans, nearly all genes are interrupted by small introns. In recent years, genome annotation and genetic analysis have illuminated the major roles these introns play in the biology of this pathogenic yeast. Introns are necessary for gene expression and alternative splicing can regulate gene expression in response to environmental cues. In addition, recent studies have revealed that C. neoformans introns help to prevent transposon dissemination and protect genome integrity. These characteristics of cryptococcal introns are probably not unique to Cryptococcus, and this yeast likely can be considered as a model for intron-related studies in fungi.
Sainsard-Chanet, A; Begel, O; Belcour, L
1994-10-07
In the filamentous fungus Podospora anserina, the unavoidable phenomenon of senescence is associated with the amplification of the first intron of the mitochondrial cox1 that accumulates as circular DNA molecules consisting of tandem repeats. This group II intron (cox1-i1 or alpha) is able to transpose and contains an open reading frame with significant amino acid similarity with reverse transcriptases. The generation of these intronic circular DNA molecules, their amplification and their involvement in the senescence process are unresolved questions. We demonstrate here that: (1) another group II intron, the fourth intron of gene cox1, cox1-i4, is also able to give precise DNA end to end junctions; (2) this intronic sequence can be found amplified during senescence, although to a lesser extent than cox1-i1; (3) the amplification of the DNA multimeric cox1-i1 molecules likely does not proceed by autonomous replication; (4) the generation of the DNA intronic circles does not require efficient intron splicing; (5) a DNA double-strand break occurs in vivo at the 3' extremity of the cox1-e1 and cox1-e4 exons preceding the group II introns that form circular DNAs. On the whole, these results show that the ability to form DNA circular molecules is a property of some group II introns and they demonstrate the occurrence of a specific DNA cleavage at or near the integration site of these group II introns. The results strongly suggest that this cleavage is involved in the formation of the group II intronic DNA circles and could also be involved in the phenomenon of group II intron homing.
Germline EMSY sequence alterations in hereditary breast cancer and ovarian cancer families.
Määttä, Kirsi M; Nurminen, Riikka; Kankuri-Tammilehto, Minna; Kallioniemi, Anne; Laasanen, Satu-Leena; Schleutker, Johanna
2017-07-24
BRCA1 and BRCA2 mutations explain approximately one-fifth of the inherited susceptibility in high-risk Finnish hereditary breast and ovarian cancer (HBOC) families. EMSY is located in the breast cancer-associated chromosomal region 11q13. The EMSY gene encodes a BRCA2-interacting protein that has been implicated in DNA damage repair and genomic instability. We analysed the role of germline EMSY variation in breast/ovarian cancer predisposition. The present study describes the first EMSY screening in patients with high familial risk for this disease. Index individuals from 71 high-risk, BRCA1/2-negative HBOC families were screened for germline EMSY sequence alterations in protein coding regions and exon-intron boundaries using Sanger sequencing and TaqMan assays. The identified variants were further screened in 36 Finnish HBOC patients and 904 controls. Moreover, one novel intronic deletion was screened in a cohort of 404 breast cancer patients unselected for family history. Haplotype block structure and the association of haplotypes with breast/ovarian cancer were analysed using Haploview. The functionality of the identified variants was predicted using Haploreg, RegulomeDB, Human Splicing Finder, and Pathogenic-or-Not-Pipeline 2. Altogether, 12 germline EMSY variants were observed. Two alterations were located in the coding region, five alterations were intronic, and five alterations were located in the 3'untranslated region (UTR). Variant frequencies did not significantly differ between cases and controls. The novel variant, c.2709 + 122delT, was detected in 1 out of 107 (0.9%) breast cancer patients, and the carrier showed a bilateral form of the disease. The deletion was absent in 897 controls (OR = 25.28; P = 0.1) and in 404 breast cancer patients unselected for family history. No haplotype was identified to increase the risk of breast/ovarian cancer. Functional analyses suggested that variants, particularly in the 3'UTR, were located within regulatory elements. The novel deletion was predicted to affect splicing regulatory elements. These results suggest that the identified EMSY variants are likely neutral at the population level. However, these variants may contribute to breast/ovarian cancer risk in single families. Additional analyses are warranted for rare novel intronic deletions and the 3'UTR variants predicted to have functional roles.
Ishida, Ken; Kuboshima, Megumi; Morita, Hiroto; Maeda, Hiroshi; Okamoto, Ayako; Takeuchi, Michio; Yamagata, Youhei
2014-01-01
Alternative splicing is thought to be a means for diversification of products by mRNA modification. Although some intron retentions are predicted by transcriptome analysis in Aspergillus oryzae, its physiological significance remains unknown. We found that intron retention occurred occasionally in the serine-type carboxypeptidase gene, ocpG. Analysis under various culture conditions revealed that extracellular nitrogen conditions influence splicing patterns; this suggested that there might be a correlation between splicing efficiency and the necessity of OcpG activity for obtaining a nitrogen source. Since further analysis showed that splicing occurred independently in each intron, we constructed ocpG intron-exchanging strain by interchanging the positions of intron-1 and intron-2. The splicing pattern indicated the probability that ocpG intron retention was affected by the secondary structures of intronic mRNA.
Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA
Eden, E.; Brunak, S.
2004-01-01
Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eipers, P.G.
1992-01-01
The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
Copertino, D W; Christopher, D A; Hallick, R B
1991-01-01
The splicing of a 409 nucleotide intron from the Euglena gracilis chloroplast ribosomal protein S3 gene (rps3) was examined by cDNA cloning and sequencing, and northern hybridization. Based on the characterization of a partially spliced pre-mRNA, the intron was characterized as a 'mixed' twintron, composed of a 311 nucleotide group II intron internal to a 98 nucleotide group III intron. Twintron excision is via a 2-step sequential splicing pathway, with removal of the internal group II intron preceding excision of the external group III intron. Based on secondary structural analysis of the twintron, we propose that group III introns may represent highly degenerate versions of group II introns. The existence of twintrons is interpreted as evidence that group II introns were inserted during the evolution of Euglena chloroplast genes from a common ancestor with eubacteria, archaebacteria, cyanobacteria, and other chloroplasts. Images PMID:1721702
Perrineau, Marie-Mathilde; Price, Dana C.; Mohr, Georg
2015-01-01
Group II introns are closely linked to eukaryote evolution because nuclear spliceosomal introns and the small RNAs associated with the spliceosome are thought to trace their ancient origins to these mobile elements. Therefore, elucidating how group II introns move, and how they lose mobility can potentially shed light on fundamental aspects of eukaryote biology. To this end, we studied five strains of the unicellular red alga Porphyridium purpureum that surprisingly contain 42 group II introns in their plastid genomes. We focused on a subset of these introns that encode mobility-conferring intron-encoded proteins (IEPs) and found them to be distributed among the strains in a lineage-specific manner. The reverse transcriptase and maturase domains were present in all lineages but the DNA endonuclease domain was deleted in vertically inherited introns, demonstrating a key step in the loss of mobility. P. purpureum plastid intron RNAs had a classic group IIB secondary structure despite variability in the DIII and DVI domains. We report for the first time the presence of twintrons (introns-within-introns, derived from the same mobile element) in Rhodophyta. The P. purpureum IEPs and their mobile introns provide a valuable model for the study of mobile retroelements in eukaryotes and offer promise for biotechnological applications. PMID:26157604
Frazier, Courtney L.; San Filippo, Joseph; Lambowitz, Alan M.; Mills, David A.
2003-01-01
Despite their commercial importance, there are relatively few facile methods for genomic manipulation of the lactic acid bacteria. Here, the lactococcal group II intron, Ll.ltrB, was targeted to insert efficiently into genes encoding malate decarboxylase (mleS) and tetracycline resistance (tetM) within the Lactococcus lactis genome. Integrants were readily identified and maintained in the absence of a selectable marker. Since splicing of the Ll.ltrB intron depends on the intron-encoded protein, targeted invasion with an intron lacking the intron open reading frame disrupted TetM and MleS function, and MleS activity could be partially restored by expressing the intron-encoded protein in trans. Restoration of splicing from intron variants lacking the intron-encoded protein illustrates how targeted group II introns could be used for conditional expression of any gene. Furthermore, the modified Ll.ltrB intron was used to separately deliver a phage resistance gene (abiD) and a tetracycline resistance marker (tetM) into mleS, without the need for selection to drive the integration or to maintain the integrant. Our findings demonstrate the utility of targeted group II introns as a potential food-grade mechanism for delivery of industrially important traits into the genomes of lactococci. PMID:12571038
Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, RJ McKinlay; Francke, Uta
2005-01-01
Background Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). Methods We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Results Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. Conclusion As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype. PMID:15877813
Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, R J McKinlay; Francke, Uta
2005-05-06
Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype.
Evolution of introns in the archaeal world.
Tocchini-Valentini, Giuseppe D; Fruscoloni, Paolo; Tocchini-Valentini, Glauco P
2011-03-22
The self-splicing group I introns are removed by an autocatalytic mechanism that involves a series of transesterification reactions. They require RNA binding proteins to act as chaperones to correctly fold the RNA into an active intermediate structure in vivo. Pre-tRNA introns in Bacteria and in higher eukaryote plastids are typical examples of self-splicing group I introns. By contrast, two striking features characterize RNA splicing in the archaeal world. First, self-splicing group I introns cannot be found, to this date, in that kingdom. Second, the RNA splicing scenario in Archaea is uniform: All introns, whether in pre-tRNA or elsewhere, are removed by tRNA splicing endonucleases. We suggest that in Archaea, the protein recruited for splicing is the preexisting tRNA splicing endonuclease and that this enzyme, together with the ligase, takes over the task of intron removal in a more efficient fashion than the ribozyme. The extinction of group I introns in Archaea would then be a consequence of recruitment of the tRNA splicing endonuclease. We deal here with comparative genome analysis, focusing specifically on the integration of introns into genes coding for 23S rRNA molecules, and how this newly acquired intron has to be removed to regenerate a functional RNA molecule. We show that all known oligomeric structures of the endonuclease can recognize and cleave a ribosomal intron, even when the endonuclease derives from a strain lacking rRNA introns. The persistence of group I introns in mitochondria and chloroplasts would be explained by the inaccessibility of these introns to the endonuclease.
A SNP uncoupling Mina expression from the TGFβ signaling pathway.
Lian, Shang L; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori; Bix, Mark
2018-03-01
Mina is a JmjC family 2-oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell-type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1-region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1-region SNPs perturbs a Mina cis-regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus-spanning 26-kilobase genomic interval. We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c-but not C57Bl/6 allele-abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. © 2017 The Authors. Immunity, Inflammation and Disease Published by John Wiley & Sons Ltd.
A SNP uncoupling Mina expression from the TGFβ signaling pathway
Lian, Shang L.; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori
2017-01-01
Abstract Introduction Mina is a JmjC family 2‐oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell‐type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1‐region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Methods Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1‐region SNPs perturbs a Mina cis‐regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus‐spanning 26‐kilobase genomic interval. Results We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c—but not C57Bl/6 allele—abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Conclusions Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. PMID:28967702
Zheng, Min; Mitra, Rajendra N; Filonov, Nazar A; Han, Zongchao
2016-03-01
Previously, we compared the efficacy of nanoparticle (NP)-mediated intron-containing rhodopsin (sgRho) vs. intronless cDNA in ameliorating retinal disease phenotypes in a rhodopsin knockout (RKO) mouse model of retinitis pigmentosa. We showed that NP-mediated sgRho delivery achieved long-term expression and phenotypic improvement in RKO mice, but not NP housing cDNA. However, the protein level of the NP-sgRho construct was only 5-10% of wild-type at 8 mo postinjection. To have a better understanding of the reduced levels of long-term expression of the vectors, in the present study, we evaluated the epigenetic changes of subretinal delivering NP-cDNA vs. NP-sgRho in the RKO mouse eyes. Following the administration, DNA methylation and histone status of specific regions (bacteria plasmid backbone, promoter, rhodopsin gene, and scaffold/matrix attachment region) of the vectors were evaluated at various time points. We documented that epigenetic transgene silencing occurred in vector-mediated gene transfer, which were caused by the plasmid backbone and the cDNA of the transgene, but not the intron-containing transgene. No toxicity or inflammation was found in the treated eyes. Our results suggest that cDNA of the rhodopsin transgene and bacteria backbone interfered with the host defense mechanism of DNA methylation-mediated transgene silencing through heterochromatin-associated modifications. © FASEB.
Vassiliki, Kokkinou; George, Koutsodontis; Polixeni, Stamatiou; Christoforos, Giatzakis; Minas, Aslanides Ioannis; Stavrenia, Koukoula; Ioannis, Datseris
2018-01-01
Aim To evaluate the frequency and pattern of disease-associated mutations of ABCA4 gene among Greek patients with presumed Stargardt disease (STGD1). Materials and Methods A total of 59 patients were analyzed for ABCA4 mutations using the ABCR400 microarray and PCR-based sequencing of all coding exons and flanking intronic regions. MLPA analysis as well as sequencing of two regions in introns 30 and 36 reported earlier to harbor deep intronic disease-associated variants was used in 4 selected cases. Results An overall detection rate of at least one mutant allele was achieved in 52 of the 59 patients (88.1%). Direct sequencing improved significantly the complete characterization rate, that is, identification of two mutations compared to the microarray analysis (93.1% versus 50%). In total, 40 distinct potentially disease-causing variants of the ABCA4 gene were detected, including six previously unreported potentially pathogenic variants. Among the disease-causing variants, in this cohort, the most frequent was c.5714+5G>A representing 16.1%, while p.Gly1961Glu and p.Leu541Pro represented 15.2% and 8.5%, respectively. Conclusions By using a combination of methods, we completely molecularly diagnosed 48 of the 59 patients studied. In addition, we identified six previously unreported, potentially pathogenic ABCA4 mutations. PMID:29854428
Celis, Juan Sebastián; Edgell, David R; Stelbrink, Björn; Wibberg, Daniel; Hauffe, Torsten; Blom, Jochen; Kalinowski, Jörn; Wilke, Thomas
2017-01-01
Group I introns and homing endonuclease genes (HEGs) are mobile genetic elements, capable of invading target sequences in intron-less genomes. LAGLIDADG HEGs are the largest family of endonucleases, playing a key role in the mobility of group I introns in a process known as 'homing'. Group I introns and HEGs are rare in metazoans, and can be mainly found inserted in the COXI gene of some sponges and cnidarians, including stony corals (Scleractinia) and mushroom corals (Corallimorpharia). Vertical and horizontal intron transfer mechanisms have been proposed as explanations for intron occurrence in cnidarians. However, the central role of LAGLIDADG motifs in intron mobility mechanisms remains poorly understood. To resolve questions regarding the evolutionary origin and distribution of group I introns and HEGs in Scleractinia and Corallimorpharia, we examined intron/HEGs sequences within a comprehensive phylogenetic framework. Analyses of LAGLIDADG motif conservation showed a high degree of degradation in complex Scleractinia and Corallimorpharia. Moreover, the two motifs lack the respective acidic residues necessary for metal-ion binding and catalysis, potentially impairing horizontal intron mobility. In contrast, both motifs are highly conserved within robust Scleractinia, indicating a fully functional endonuclease capable of promoting horizontal intron transference. A higher rate of non-synonymous substitutions (Ka) detected in the HEGs of complex Scleractinia and Corallimorpharia suggests degradation of the HEG, whereas lower Ka rates in robust Scleractinia are consistent with a scenario of purifying selection. Molecular-clock analyses and ancestral inference of intron type indicated an earlier intron insertion in complex Scleractinia and Corallimorpharia in comparison to robust Scleractinia. These findings suggest that the lack of horizontal intron transfers in the former two groups is related to an age-dependent degradation of the endonuclease activity. Moreover, they also explain the peculiar geographical patterns of introns in stony and mushroom corals.
Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin
ERIC Educational Resources Information Center
Offner, Susan
2010-01-01
The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.
Bacterial group II introns: not just splicing.
Toro, Nicolás; Jiménez-Zurdo, José Ignacio; García-Rodríguez, Fernando Manuel
2007-04-01
Group II introns are both catalytic RNAs (ribozymes) and mobile retroelements that were discovered almost 14 years ago. It has been suggested that eukaryotic mRNA introns might have originated from the group II introns present in the alphaproteobacterial progenitor of the mitochondria. Bacterial group II introns are of considerable interest not only because of their evolutionary significance, but also because they could potentially be used as tools for genetic manipulation in biotechnology and for gene therapy. This review summarizes what is known about the splicing mechanisms and mobility of bacterial group II introns, and describes the recent development of group II intron-based gene-targetting methods. Bacterial group II intron diversity, evolutionary relationships, and behaviour in bacteria are also discussed.
Structure of a group II intron in complex with its reverse transcriptase.
Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei
2016-06-01
Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.
Ancient nature of alternative splicing and functions of introns
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Kemin; Salamov, Asaf; Kuo, Alan
Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but intronsmore » retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.« less
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
2003-09-02
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Csuros, Miklos; Rogozin, Igor B.; Koonin, Eugene V.
2011-01-01
Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing. PMID:21935348
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria.
Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée
2006-09-14
The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis.
Sinha, Rahul; Goyal, Pankaj; Grapputo, Alessandro
2011-01-01
Background Insertions of spliceosomal introns are very rare events during evolution of vertebrates and the mechanisms governing creation of novel intron(s) remain obscure. Largely, gene structures of melanocortin (MC) receptors are characterized by intron-less architecture. However, recently a few exceptions have been reported in some fishes. This warrants a systematic survey of MC receptors for understanding intron insertion events during vertebrate evolution. Methodology/Principal Findings We have compiled an extended list of MC receptors from different vertebrate genomes with variations in fishes. Notably, the closely linked MC2Rs and MC5Rs from a group of ray-finned fishes have three and one intron insertion(s), respectively, with conserved positions and intron phase. In both genes, one novel insertion was in the highly conserved DRY motif at the end of helix TM3. Further, the proto-splice site MAG↑R is maintained at intron insertion sites in these two genes. However, the orthologs of these receptors from zebrafish and tetrapods are intron-less, suggesting these introns are simultaneously created in selected fishes. Surprisingly, these novel introns are traceable only in four fish genomes. We found that these fish genomes are severely compacted after the separation from zebrafish. Furthermore, we also report novel intron insertions in P2Y receptors and in CHRM3. Finally, we report ultrasmall introns in MC2R genes from selected fishes. Conclusions/Significance The current repository of MC receptors illustrates that fishes have no MC3R ortholog. MC2R, MC5R, P2Y receptors and CHRM3 have novel intron insertions only in ray-finned fishes that underwent genome compaction. These receptors share one intron at an identical position suggestive of being inserted contemporaneously. In addition to repetitive elements, genome compaction is now believed to be a new hallmark that promotes intron insertions, as it requires rapid DNA breakage and subsequent repair processes to gain back normal functionality. PMID:21850219
Vazquez-Anderson, Jorge; Mihailovic, Mia K.; Baldridge, Kevin C.; Reyes, Kristofer G.; Haning, Katie; Cho, Seung Hee; Amador, Paul; Powell, Warren B.
2017-01-01
Abstract Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA–RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA–RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA–mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs. PMID:28334800
Lutz, Michael W.; Saul, Robert; Linnertz, Colton; Glenn, Omolara-Chinue; Roses, Allen D.; Chiba-Falek, Ornit
2015-01-01
INTRODUCTION We recently showed that tagging-SNPs across the SNCA locus were significantly associated with increased risk for LB pathology in AD cases. However, the actual genetic variant(s) that underlie the observed associations remain elusive. METHODS We used a bioinformatics algorithm to catalogue Structural-Variants in a region of SNCA-intron4, followed by phased-sequencing. We performed a genetic-association analysis in autopsy series of LBV/AD cases compared with AD-only controls. We investigated the biological functions by expression analysis using temporal-cortex samples. RESULTS We identified four distinct haplotypes within a highly-polymorphic-low-complexity CT-rich region. We showed that a specific haplotype conferred risk to develop LBV/AD. We demonstrated that the CT-rich site acts as an enhancer element, where the risk haplotype was significantly associated with elevated levels of SNCA-mRNA. DISCUSSION We have discovered a novel haplotype in a CT-rich region in SNCA that contributes to LB pathology in AD patients, possibly via cis-regulation of the gene expression. PMID:26079410
Youssef, Noha; Sheik, Cody S.; Krumholz, Lee R.; Najar, Fares Z.; Roe, Bruce A.; Elshahed, Mostafa S.
2009-01-01
Pyrosequencing-based 16S rRNA gene surveys are increasingly utilized to study highly diverse bacterial communities, with special emphasis on utilizing the large number of sequences obtained (tens to hundreds of thousands) for species richness estimation. However, it is not yet clear how the number of operational taxonomic units (OTUs) and, hence, species richness estimates determined using shorter fragments at different taxonomic cutoffs correlates with the number of OTUs assigned using longer, nearly complete 16S rRNA gene fragments. We constructed a 16S rRNA clone library from an undisturbed tallgrass prairie soil (1,132 clones) and used it to compare species richness estimates obtained using eight pyrosequencing candidate fragments (99 to 361 bp in length) and the nearly full-length fragment. Fragments encompassing the V1 and V2 (V1+V2) region and the V6 region (generated using primer pairs 8F-338R and 967F-1046R) overestimated species richness; fragments encompassing the V3, V7, and V7+V8 hypervariable regions (generated using primer pairs 338F-530R, 1046F-1220R, and 1046F-1392R) underestimated species richness; and fragments encompassing the V4, V5+V6, and V6+V7 regions (generated using primer pairs 530F-805R, 805F-1046R, and 967F-1220R) provided estimates comparable to those obtained with the nearly full-length fragment. These patterns were observed regardless of the alignment method utilized or the parameter used to gauge comparative levels of species richness (number of OTUs observed, slope of scatter plots of pairwise distance values for short and nearly complete fragments, and nonparametric and parametric species richness estimates). Similar results were obtained when analyzing three other datasets derived from soil, adult Zebrafish gut, and basaltic formations in the East Pacific Rise. Regression analysis indicated that these observed discrepancies in species richness estimates within various regions could readily be explained by the proportions of hypervariable, variable, and conserved base pairs within an examined fragment. PMID:19561178
2011-01-01
Background The most frequent case of horizontal transfer in plants involves a group I intron in the mitochondrial gene cox1, which has been acquired via some 80 separate plant-to-plant transfer events among 833 diverse angiosperms examined. This homing intron encodes an endonuclease thought to promote the intron's promiscuous behavior. A promising experimental approach to study endonuclease activity and intron transmission involves somatic cell hybridization, which in plants leads to mitochondrial fusion and genome recombination. However, the cox1 intron has not yet been found in the ideal group for plant somatic genetics - the Solanaceae. We therefore undertook an extensive survey of this family to find members with the intron and to learn more about the evolutionary history of this exceptionally mobile genetic element. Results Although 409 of the 426 species of Solanaceae examined lack the cox1 intron, it is uniformly present in three phylogenetically disjunct clades. Despite strong overall incongruence of cox1 intron phylogeny with angiosperm phylogeny, two of these clades possess nearly identical intron sequences and are monophyletic in intron phylogeny. These two clades, and possibly the third also, contain a co-conversion tract (CCT) downstream of the intron that is extended relative to all previously recognized CCTs in angiosperm cox1. Re-examination of all published cox1 genes uncovered additional cases of extended co-conversion and identified a rare case of putative intron loss, accompanied by full retention of the CCT. Conclusions We infer that the cox1 intron was separately and recently acquired by at least three different lineages of Solanaceae. The striking identity of the intron and CCT from two of these lineages suggests that one of these three intron captures may have occurred by a within-family transfer event. This is consistent with previous evidence that horizontal transfer in plants is biased towards phylogenetically local events. The discovery of extended co-conversion suggests that other cox1 conversions may be longer than realized but obscured by the exceptional conservation of plant mitochondrial sequences. Our findings provide further support for the rampant-transfer model of cox1 intron evolution and recommend the Solanaceae as a model system for the experimental analysis of cox1 intron transfer in plants. PMID:21943226
NASA Astrophysics Data System (ADS)
Millard, Julie T.; Pilon, André M.
2003-04-01
A recent forensic approach for identification of unknown biological samples is mitochondrial DNA (mtDNA) sequencing. We describe a laboratory exercise suitable for an undergraduate biochemistry course in which the polymerase chain reaction is used to amplify a 440 base pair hypervariable region of human mtDNA from a variety of "crime scene" samples (e.g., teeth, hair, nails, cigarettes, envelope flaps, toothbrushes, and chewing gum). Amplification is verified via agarose gel electrophoresis and then samples are subjected to cycle sequencing. Sequence alignments are made via the program CLUSTAL W, allowing students to compare samples and solve the "crime."
Lv, Xu-Cong; Jiang, Ya-Jun; Liu, Jie; Guo, Wei-Ling; Liu, Zhi-Bin; Zhang, Wen; Rao, Ping-Fan; Ni, Li
2017-08-16
Denaturing gradient gel electrophoresis (DGGE) has become a widely used tool to examine microbial community structure. However, when DGGE is applied to evaluate the fungal community of traditional fermentation starters, the choice of hypervariable ribosomal RNA gene regions is still controversial. In the current study, several previously published fungal PCR primer sets were compared and evaluated using PCR-DGGE, with the purpose of screening a suitable primer set to study the fungal community of traditional fermentation starters for Hong Qu glutinous rice wine. Firstly, different primer sets were used to amplify different hypervariable regions from pure fungal cultures. Except NS1/FR1+ and ITS1fGC/ITS4, other primer sets (NL1+/LS2R, NL3A/NL4GC, FF390/FR1+, NS1/GCFung, NS3+/YM951r and ITS1fGC/ITS2r) amplified the target DNA sequences successfully. Secondly, the selected primer sets were further evaluated based on their resolution to distinguish different fungal cultures through DGGE fingerprints. Three primer sets (NL1+/LS2R, NS1/GCFung and ITS1fGC/ITS2r) were finally selected for investigating the fungal community structure of different traditional fermentation starters for Hong Qu glutinous rice wine. The internal transcribed spacer (ITS) region amplified by ITS1fGC/ITS2r, which is more hypervariable than the 18S rRNA gene and 26S rRNA gene, provides an excellent tool to separate amplification products of different fungal species. Results indicated that PCR-DGGE profile using ITS1fGC/ITS2r showed more abundant fungal species than that using NL1+/LS2R and NS1/GCFung. Therefore, ITS1fGC/ITS2r is the most suitable primer set for PCR-DGGE analysis of fungal community structure in traditional fermentation starters for Hong Qu glutinous rice wine. DGGE profiles based on ITS1fGC/ITS2r revealed the presence of twenty-four fungal species in traditional fermentation starter. A significant difference of fungal community can be observed directly from DGGE fingerprints and principal component analysis. The statistical analysis results based on the band intensities of fungal DGGE profile showed that Saccharomyces cerevisiae, Saccharomycopsis fibuligera, Rhizopus oryzae, Monascus purpureus and Aspergillus niger were the dominant fungal species. In conclusion, the comparison of several primer sets for fungal PCR-DGGE would be useful to enrich our knowledge of the fungal community structures associated with traditional fermentation starters, which may facilitate the development of better starter cultures for manufacturing Chinese Hong Qu glutinous rice wine. Copyright © 2017 Elsevier B.V. All rights reserved.
Huang, Xue-Yong; Niu, Jin; Sun, Ming-Xi; Zhu, Jun; Gao, Ju-Fang; Yang, Jun; Zhou, Que; Yang, Zhong-Nan
2013-01-01
Arabidopsis thaliana CYCLIN-DEPEDENT KINASE G1 (CDKG1) belongs to the family of cyclin-dependent protein kinases that were originally characterized as cell cycle regulators in eukaryotes. Here, we report that CDKG1 regulates pre-mRNA splicing of CALLOSE SYNTHASE5 (CalS5) and, therefore, pollen wall formation. The knockout mutant cdkg1 exhibits reduced male fertility with impaired callose synthesis and abnormal pollen wall formation. The sixth intron in CalS5 pre-mRNA, a rare type of intron with a GC 5′ splice site, is abnormally spliced in cdkg1. RNA immunoprecipitation analysis suggests that CDKG1 is associated with this intron. CDKG1 contains N-terminal Ser/Arg (RS) motifs and interacts with splicing factor Arginine/Serine-Rich Zinc Knuckle-Containing Protein33 (RSZ33) through its RS region to regulate proper splicing. CDKG1 and RS-containing Zinc Finger Protein22 (SRZ22), a splicing factor interacting with RSZ33 and U1 small nuclear ribonucleoprotein particle (snRNP) component U1-70k, colocalize in nuclear speckles and reside in the same complex. We propose that CDKG1 is recruited to U1 snRNP through RSZ33 to facilitate the splicing of the sixth intron of CalS5. PMID:23404887
Ono, Hiroyuki; Saitsu, Hirotomo; Horikawa, Reiko; Nakashima, Shinichi; Ohkubo, Yumiko; Yanagi, Kumiko; Nakabayashi, Kazuhiko; Fukami, Maki; Fujisawa, Yasuko; Ogata, Tsutomu
2018-02-02
Although partial androgen insensitivity syndrome (PAIS) is caused by attenuated responsiveness to androgens, androgen receptor gene (AR) mutations on the coding regions and their splice sites have been identified only in <25% of patients with a diagnosis of PAIS. We performed extensive molecular studies including whole exome sequencing in a Japanese family with PAIS, identifying a deep intronic variant beyond the branch site at intron 6 of AR (NM_000044.4:c.2450-42 G > A). This variant created the splice acceptor motif that was accompanied by pyrimidine-rich sequence and two candidate branch sites. Consistent with this, reverse transcriptase (RT)-PCR experiments for cycloheximide-treated lymphoblastoid cell lines revealed a relatively large amount of aberrant mRNA produced by the newly created splice acceptor site and a relatively small amount of wildtype mRNA produced by the normal splice acceptor site. Furthermore, most of the aberrant mRNA was shown to undergo nonsense mediated decay (NMD) and, if a small amount of aberrant mRNA may have escaped NMD, such mRNA was predicted to generate a truncated AR protein missing some functional domains. These findings imply that the deep intronic mutation creating an alternative splice acceptor site resulted in the production of a relatively small amount of wildtype AR mRNA, leading to PAIS.
RNA Splicing in a New Rhabdovirus from Culex Mosquitoes▿†
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-01-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae. PMID:21507977
RNA splicing in a new rhabdovirus from Culex mosquitoes.
Kuwata, Ryusei; Isawa, Haruhiko; Hoshino, Keita; Tsuda, Yoshio; Yanase, Tohru; Sasaki, Toshinori; Kobayashi, Mutsuo; Sawabe, Kyoko
2011-07-01
Among members of the order Mononegavirales, RNA splicing events have been found only in the family Bornaviridae. Here, we report that a new rhabdovirus isolated from the mosquito Culex tritaeniorhynchus replicates in the nuclei of infected cells and requires RNA splicing for viral mRNA maturation. The virus, designated Culex tritaeniorhynchus rhabdovirus (CTRV), shares a similar genome organization with other rhabdoviruses, except for the presence of a putative intron in the coding region for the L protein. Molecular phylogenetic studies indicated that CTRV belongs to the family Rhabdoviridae, but it is yet to be assigned a genus. Electron microscopic analysis revealed that the CTRV virion is extremely elongated, unlike virions of rhabdoviruses, which are generally bullet shaped. Northern hybridization confirmed that a large transcript (approximately 6,500 nucleotides [nt]) from the CTRV L gene was present in the infected cells. Strand-specific reverse transcription-PCR (RT-PCR) analyses identified the intron-exon boundaries and the 76-nt intron sequence, which contains the typical motif for eukaryotic spliceosomal intron-splice donor/acceptor sites (GU-AG), a predicted branch point, and a polypyrimidine tract. In situ hybridization exhibited that viral RNAs are primarily localized in the nucleus of infected cells, indicating that CTRV replicates in the nucleus and is allowed to utilize the host's nuclear splicing machinery. This is the first report of RNA splicing among the members of the family Rhabdoviridae.
2013-01-01
Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548
Localized Retroprocessing as a Model of Intron Loss in the Plant Mitochondrial Genome
Cuenca, Argelia; Ross, T. Gregory; Graham, Sean W.; Barrett, Craig F.; Davis, Jerrold I.; Seberg, Ole; Petersen, Gitte
2016-01-01
Loss of introns in plant mitochondrial genes is commonly explained by retroprocessing. Under this model, an mRNA is reverse transcribed and integrated back into the genome, simultaneously affecting the contents of introns and edited sites. To evaluate the extent to which retroprocessing explains intron loss, we analyzed patterns of intron content and predicted RNA editing for whole mitochondrial genomes of 30 species in the monocot order Alismatales. In this group, we found an unusually high degree of variation in the intron content, even expanding the hitherto known variation among angiosperms. Some species have lost some two-third of the cis-spliced introns. We found a strong correlation between intron content and editing frequency, and detected 27 events in which intron loss is consistent with the presence of nucleotides in an edited state, supporting retroprocessing. However, we also detected seven cases of intron loss not readily being explained by retroprocession. Our analyses are also not consistent with the entire length of a fully processed cDNA copy being integrated into the genome, but instead indicate that retroprocessing usually occurs for only part of the gene. In some cases, several rounds of retroprocessing may explain intron loss in genes completely devoid of introns. A number of taxa retroprocessing seem to be very common and a possibly ongoing process. It affects the entire mitochondrial genome. PMID:27435795
Mayr, B; Reifinger, M; Alton, K; Schaffner, G
1998-06-01
Twenty feline neoplasms were sequenced in the region from exons 5 to 8 for the presence of tumour suppressor gene p53 mutations. In a spindle cell sarcoma of the bladder, a missense mutation (codon 164 AAG-->GAG, lysine-->glutamic acid) in exon 5 was detected. In a pleomorphic sarcoma, a 23 bp deletion involving the splicing junction between intron 5 and exon 6 was observed. In a fibrosarcoma, a 6 bp deletion of p53 covering 2 bp of exon 7 and 4 bp of intron 7, including the splicing junction, was found. The study demonstrates three new p53 mutations in different types of sarcomas in cats.
Montandon, P E; Vasserot, A; Stutz, E
1986-01-01
We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.
Putative cross-kingdom horizontal gene transfer in sponge (Porifera) mitochondria
Rot, Chagai; Goldfarb, Itay; Ilan, Micha; Huchon, Dorothée
2006-01-01
Background The mitochondrial genome of Metazoa is usually a compact molecule without introns. Exceptions to this rule have been reported only in corals and sea anemones (Cnidaria), in which group I introns have been discovered in the cox1 and nad5 genes. Here we show several lines of evidence demonstrating that introns can also be found in the mitochondria of sponges (Porifera). Results A 2,349 bp fragment of the mitochondrial cox1 gene was sequenced from the sponge Tetilla sp. (Spirophorida). This fragment suggests the presence of a 1143 bp intron. Similar to all the cnidarian mitochondrial introns, the putative intron has group I intron characteristics. The intron is present in the cox1 gene and encodes a putative homing endonuclease. In order to establish the distribution of this intron in sponges, the cox1 gene was sequenced from several representatives of the demosponge diversity. The intron was found only in the sponge order Spirophorida. A phylogenetic analysis of the COI protein sequence and of the intron open reading frame suggests that the intron may have been transmitted horizontally from a fungus donor. Conclusion Little is known about sponge-associated fungi, although in the last few years the latter have been frequently isolated from sponges. We suggest that the horizontal gene transfer of a mitochondrial intron was facilitated by a symbiotic relationship between fungus and sponge. Ecological relationships are known to have implications at the genomic level. Here, an ecological relationship between sponge and fungus is suggested based on the genomic analysis. PMID:16972986
Exon definition as a potential negative force against intron losses in evolution.
Niu, Deng-Ke
2008-11-13
Previous studies have indicated that the wide variation in intron density (the number of introns per gene) among different eukaryotes largely reflects varying degrees of intron loss during evolution. The most popular model, which suggests that organisms lose introns through a mechanism in which reverse-transcribed cDNA recombines with the genomic DNA, concerns only one mutational force. Using exons as the units of splicing-site recognition, exon definition constrains the length of exons. An intron-loss event results in fusion of flanking exons and thus a larger exon. The large size of the newborn exon may cause splicing errors, i.e., exon skipping, if the splicing of pre-mRNAs is initiated by exon definition. By contrast, if the splicing of pre-mRNAs is initiated by intron definition, intron loss does not matter. Exon definition may thus be a selective force against intron loss. An organism with a high frequency of exon definition is expected to experience a low rate of intron loss throughout evolution and have a high density of spliceosomal introns. The majority of spliceosomal introns in vertebrates may be maintained during evolution not because of potential functions, but because of their splicing mechanism (i.e., exon definition). Further research is required to determine whether exon definition is a negative force in maintaining the high intron density of vertebrates. This article was reviewed by Dr. Scott W. Roy (nominated by Dr. John Logsdon), Dr.Eugene V. Koonin, and Dr. Igor B. Rogozin (nominated by Dr. Mikhail Gelfand). For the full reviews,please go to the Reviewers' comments section.
Turmel, Monique; Otis, Christian; Lemieux, Claude
2005-01-01
Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting
Piazza, Carol Lyn; Smith, Dorie
2018-01-01
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.
Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene
2018-06-15
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.
Evolution of the tRNALeu (UAA) Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc
Kaasalainen, Ulla; Olsson, Sanna; Rikkinen, Jouko
2015-01-01
The group I intron interrupting the tRNALeu UAA gene (trnL) is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies. PMID:26098760
Evolution of the tRNALeu (UAA) Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc.
Kaasalainen, Ulla; Olsson, Sanna; Rikkinen, Jouko
2015-01-01
The group I intron interrupting the tRNALeu UAA gene (trnL) is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies.
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L
2009-07-01
Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (
Structural features of diverse Pin-II proteinase inhibitor genes from Capsicum annuum.
Mahajan, Neha S; Dewangan, Veena; Lomate, Purushottam R; Joshi, Rakesh S; Mishra, Manasi; Gupta, Vidya S; Giri, Ashok P
2015-02-01
The proteinase inhibitor (PI) genes from Capsicum annuum were characterized with respect to their UTR, introns and promoter elements. The occurrence of PIs with circularly permuted domain organization was evident. Several potato inhibitor II (Pin-II) type proteinase inhibitor (PI) genes have been analyzed from Capsicum annuum (L.) with respect to their differential expression during plant defense response. However, complete gene characterization of any of these C. annuum PIs (CanPIs) has not been carried out so far. Complete gene architectures of a previously identified CanPI-7 (Beads-on-string, Type A) and a member of newly isolated Bracelet type B, CanPI-69 are reported in this study. The 5' UTR (untranslated region), 3'UTR, and intronic sequences of both the CanPI genes were obtained. The genomic sequence of CanPI-7 exhibited, exon 1 (49 base pair, bp) and exon 2 (740 bp) interrupted by a 294-bp long type I intron. We noted the occurrence of three multi-domain PIs (CanPI-69, 70, 71) with circularly permuted domain organization. CanPI-69 was found to possess exon 1 (49 bp), exon 2 (551 bp) and a 584-bp long type I intron. The upstream sequence analysis of CanPI-7 and CanPI-69 predicted various transcription factor-binding sites including TATA and CAAT boxes, hormone-responsive elements (ABRELATERD1, DOFCOREZM, ERELEE4), and a defense-responsive element (WRKY71OS). Binding of transcription factors such as zinc finger motif MADS-box and MYB to the promoter regions was confirmed using electrophoretic mobility shift assay followed by mass spectrometric identification. The 3' UTR analysis for 25 CanPI genes revealed unique/distinct 3' UTR sequence for each gene. Structures of three domain CanPIs of type A and B were predicted and further analyzed for their attributes. This investigation of CanPI gene architecture will enable the better understanding of the genetic elements present in CanPIs.
Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.
2009-01-01
Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168
Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H
1993-07-01
A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.
Berger, C; Berger, B; Parson, W
2012-01-01
In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Especially, canine hairs have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by Phenol/Chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are addressed exemplarily.
Myrach, Till; Zhu, Anting; Witte, Claus-Peter
2017-09-01
Urease is a ubiquitous nickel metalloenzyme. In plants, its activation requires three urease accessory proteins (UAPs), UreD, UreF, and UreG. In bacteria, the UAPs interact with urease and facilitate activation, which involves the channeling of two nickel ions into the active site. So far this process has not been investigated in eukaryotes. Using affinity pulldowns of Strep-tagged UAPs from Arabidopsis and rice transiently expressed in planta , we demonstrate that a urease-UreD-UreF-UreG complex exists in plants and show its stepwise assembly. UreG is crucial for nickel delivery because UreG-dependent urease activation in vitro was observed only with UreG obtained from nickel-sufficient plants. This activation competence could not be generated in vitro by incubation of UreG with nickel, bicarbonate, and GTP. Compared with their bacterial orthologs, plant UreGs possess an N-terminal extension containing a His- and Asp/Glu-rich hypervariable region followed by a highly conserved sequence comprising two potential H X H metal-binding sites. Complementing the ureG-1 mutant of Arabidopsis with N-terminal deletion variants of UreG demonstrated that the hypervariable region has a minor impact on activation efficiency, whereas the conserved region up to the first H X H motif is highly beneficial and up to the second H X H motif strictly required for activation. We also show that urease reaches its full activity several days after nickel becomes available in the leaves, indicating that urease activation is limited by nickel accessibility in vivo Our data uncover the crucial role of UreG for nickel delivery during eukaryotic urease activation, inciting further investigations of the details of this process. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Characterization of a prototype strain of hepatitis E virus.
Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H
1992-01-15
A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.
Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015
Structural and functional characterization of ribosomal protein gene introns in sponges.
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Exon–intron organization of genes in the slime mold Physarum polycephalum
Trzcinska-Danielewicz, Joanna; Fronk, Jan
2000-01-01
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon–intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon–intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon–intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3′-ends. PMID:10982858
SURVEY AND SUMMARY: exon-intron organization of genes in the slime mold Physarum polycephalum.
Trzcinska-Danielewicz, J; Fronk, J
2000-09-15
The slime mold Physarum polycephalum is a morphologically simple organism with a large and complex genome. The exon-intron organization of its genes exhibits features typical for protists and fungi as well as those characteristic for the evolutionarily more advanced species. This indicates that both the taxonomic position as well as the size of the genome shape the exon-intron organization of an organism. The average gene has 3.7 introns which are on average 138 bp, with a rather narrow size distribution. Introns are enriched in AT base pairs by 13% relative to exons. The consensus sequences at exon-intron boundaries resemble those found for other species, with minor differences between short and long introns. A unique feature of P.polycephalum introns is the strong preference for pyrimidines in the coding strand throughout their length, without a particular enrichment at the 3'-ends.
Watanabe, K; Yoshioka, K; Ito, H; Ishigami, M; Takagi, K; Utsunomiya, S; Kobayashi, M; Kishimoto, H; Yano, M; Kakumu, S
1999-11-10
Hypervariable region 1 (HVR1) proteins of hepatitis C virus (HCV) have been reported to react broadly with sera of patients with HCV infection. However, the variability of the broad reactivity of individual HVR1 proteins has not been elucidated. We assessed the reactivity of 25 different HVR1 proteins (genotype 1b) with sera of 81 patients with HCV infection (genotype 1b) by Western blot. HVR1 proteins reacted with 2-60 sera. The number of sera reactive with each HVR1 protein significantly correlated with the number of amino acid residues identical to the consensus sequence defined by Puntoriero et al. (G. Puntoriero, A. Lahm, S. Zucchelli, B. B. Ercole, R. Tafi, M. Penzzanera, M. U. Mondelli, R. Cortese, A. Tramontano, G. Galfre', and A. Nicosia. 1998. EMBO J. 17, 3521-3533. ) (r = 0.561, P < 0.005). The most widely reactive HVR1 protein, 12-22, had a sequence similar to the consensus sequence. The peptide with C-terminal 13-amino-acids sequence of HVR1 protein 12-22 (NH2-CSFTSLFTPGPSQK) was injected into rabbits as an immunogen. The rabbit immune sera reacted with 9 of 25 HVR1 proteins of genotype 1b including HVR1 protein 12-22 and with 3 of 12 proteins of genotype 2a. These results indicate that the HVR1 protein broadly reactive with patients' sera has a sequence similar to the consensus sequence, can induce broadly reactive sera, and could be one of the candidate immunogens in a prophylactic vaccine against HCV. Copyright 1999 Academic Press.
Wang, Li-li; Ma, Bin; Qian, Dun; Pang, Jun; Yao, Ya-li
2013-12-01
To assess the correlation between polymorphisms in the coagulation factor VII (F VII)gene hypervariable region 4 (HVR4)site and risk related to coronary heart disease (CHD)in different ethnic populations, especially the Asian populations. Publications up to April 2013, from CBM, CNKI, Wanfang Database,VIP, PubMed, Cochrane Library and Embase were searched to collect data from case-control studies related to F VII gene HVR4 site and CHD in populations from different ethnicities. Quality of studies was evaluated, available data extracted and both RevMan 5.1 and Stata 11.0 softwares were used for Meta-analysis. Fifteen case-control studies were included, involving 3167 cases with CHD group and 3168 cases in the control group. on this Meta-analysis showed that:a)polymorphism of the F VII gene HVR4 site H7/H6+H5 and CHD, b)H7H7/H6H6 + H7H6 and CHD were both slightly correlated between people with different ethnic backgrounds. However, the H6 allele versus H7+H5 allele and CHD showed different results-a high correlation seen in different ethnic groups. H5 allele versus H6+H7 allele and CHD did not appear significant difference(OR = 1.20, 95%CI:0.76-1.90, P = 0.43). Both F VII gene HVR4 polymorphisms H7 allele and the H7H7 genotype might have served as protective factors for CHD in different ethnic groups, H6 allele might serve as a risk factor for CHD, but H5 allele was likely not to be associated with CHD in different ethnic groups.
Bacterial communities in soil samples from the Mingyong Glacier of southwestern China.
Li, Haoyu; Taj, Muhammad Kamran; Ji, Xiuling; Zhang, Qi; Lin, Liangbing; Zhou, Zhimei; Wei, Yunlin
2017-05-01
The present study was an effort to determine the bacterial diversity of soils in Mingyong Glacier located at the Meili Snow Mountains of southwestern China. Mingyong Glacier has different climatic zones within a very narrow area, and bacterial community diversity in this low temperature area remains largely unknown. In this study, soil samples were collected from four different climatic zones: M11A (dry warm valley), M14 (forest), M15 (grass land), and M16 (glacier zones). Phylogenetic analysis based on 16S rRNA gene V6 hypervariable region showed high bacterial abundance in the glacier. The number of Operational Taxonomic Units ranged from 2.24×10 3 to 5.56×10 3 in soil samples. Statistical analysis of 16S rRNA gene clone libraries results showed that bacterial diversity in zones M11A,M14 and M16 are higher than in zone M15. The bacterial community structures are clearly distinguishable, and phylogenetic analysis showed that the predominant phyla were Proteobacteria, Deinococcus-Thermus, Firmicutes, Actinobacteria, and Nitrospirae in Mingyong Glacier. Seventy-nine different orders from four zones have been isolated. Bacterial diversity and distribution of bacterial communities related to the anthropogenic perturbations in zone (M15) were confirmed by diversity index analysis, and the diversity index of other three zones was satisfactory through this analysis software. The results suggest that bacterial diversity and distribution analyses using bacterial 16S rRNA gene V6 hypervariable region were successful, and bacterial communities in this area not only had the same bacterial phyla compared to other glaciers but also had their own rare species.
2014-01-01
Background Hypervariable region 1 (HVR1) contained within envelope protein 2 (E2) gene is the most variable part of HCV genome and its translation product is a major target for the host immune response. Variability within HVR1 may facilitate evasion of the immune response and could affect treatment outcome. The aim of the study was to analyze the impact of HVR1 heterogeneity employing sensitive ultra-deep sequencing, on the outcome of PEG-IFN-α (pegylated interferon α) and ribavirin treatment. Methods HVR1 sequences were amplified from pretreatment serum samples of 25 patients infected with genotype 1b HCV (12 responders and 13 non-responders) and were subjected to pyrosequencing (GS Junior, 454/Roche). Reads were corrected for sequencing error using ShoRAH software, while population reconstruction was done using three different minimal variant frequency cut-offs of 1%, 2% and 5%. Statistical analysis was done using Mann–Whitney and Fisher’s exact tests. Results Complexity, Shannon entropy, nucleotide diversity per site, genetic distance and the number of genetic substitutions were not significantly different between responders and non-responders, when analyzing viral populations at any of the three frequencies (≥1%, ≥2% and ≥5%). When clonal sample was used to determine pyrosequencing error, 4% of reads were found to be incorrect and the most abundant variant was present at a frequency of 1.48%. Use of ShoRAH reduced the sequencing error to 1%, with the most abundant erroneous variant present at frequency of 0.5%. Conclusions While deep sequencing revealed complex genetic heterogeneity of HVR1 in chronic hepatitis C patients, there was no correlation between treatment outcome and any of the analyzed quasispecies parameters. PMID:25016390
Pyrosequencing as a tool for the identification of common isolates of Mycobacterium sp.
Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W
2005-04-01
Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.
Saleem, Muhammad; Prince, Stephen M.; Rigby, Stephen E. J.; Imran, Muhammad; Patel, Hema; Chan, Hannah; Sanders, Holly; Maiden, Martin C. J.; Feavers, Ian M.; Derrick, Jeremy P.
2013-01-01
FrpB is an outer membrane transporter from Neisseria meningitidis, the causative agent of meningococcal meningitis. It is a member of the TonB-dependent transporter (TBDT) family and is responsible for iron uptake into the periplasm. FrpB is subject to a high degree of antigenic variation, principally through a region of hypervariable sequence exposed at the cell surface. From the crystal structures of two FrpB antigenic variants, we identify a bound ferric ion within the structure which induces structural changes on binding which are consistent with it being the transported substrate. Binding experiments, followed by elemental analysis, verified that FrpB binds Fe3+ with high affinity. EPR spectra of the bound Fe3+ ion confirmed that its chemical environment was consistent with that observed in the crystal structure. Fe3+ binding was reduced or abolished on mutation of the Fe3+-chelating residues. FrpB orthologs were identified in other Gram-negative bacteria which showed absolute conservation of the coordinating residues, suggesting the existence of a specific TBDT sub-family dedicated to the transport of Fe3+. The region of antigenic hypervariability lies in a separate, external sub-domain, whose structure is conserved in both the F3-3 and F5-1 variants, despite their sequence divergence. We conclude that the antigenic sub-domain has arisen separately as a result of immune selection pressure to distract the immune response from the primary transport function. This would enable FrpB to function as a transporter independently of antibody binding, by using the antigenic sub-domain as a ‘molecular decoy’ to distract immune surveillance. PMID:23457610
Saleem, Muhammad; Prince, Stephen M; Rigby, Stephen E J; Imran, Muhammad; Patel, Hema; Chan, Hannah; Sanders, Holly; Maiden, Martin C J; Feavers, Ian M; Derrick, Jeremy P
2013-01-01
FrpB is an outer membrane transporter from Neisseria meningitidis, the causative agent of meningococcal meningitis. It is a member of the TonB-dependent transporter (TBDT) family and is responsible for iron uptake into the periplasm. FrpB is subject to a high degree of antigenic variation, principally through a region of hypervariable sequence exposed at the cell surface. From the crystal structures of two FrpB antigenic variants, we identify a bound ferric ion within the structure which induces structural changes on binding which are consistent with it being the transported substrate. Binding experiments, followed by elemental analysis, verified that FrpB binds Fe(3+) with high affinity. EPR spectra of the bound Fe(3+) ion confirmed that its chemical environment was consistent with that observed in the crystal structure. Fe(3+) binding was reduced or abolished on mutation of the Fe(3+)-chelating residues. FrpB orthologs were identified in other Gram-negative bacteria which showed absolute conservation of the coordinating residues, suggesting the existence of a specific TBDT sub-family dedicated to the transport of Fe(3+). The region of antigenic hypervariability lies in a separate, external sub-domain, whose structure is conserved in both the F3-3 and F5-1 variants, despite their sequence divergence. We conclude that the antigenic sub-domain has arisen separately as a result of immune selection pressure to distract the immune response from the primary transport function. This would enable FrpB to function as a transporter independently of antibody binding, by using the antigenic sub-domain as a 'molecular decoy' to distract immune surveillance.
Evolution of Mhc-DRB introns: implications for the origin of primates.
Kupfermann, H; Satta, Y; Takahata, N; Tichy, H; Klein, J
1999-06-01
Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders-Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera.
Gonzalez, P; Barroso, G; Labarère, J
1999-04-01
The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A.aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.
Cross-talk between PRMT1-mediated methylation and ubiquitylation on RBM15 controls RNA splicing.
Zhang, Li; Tran, Ngoc-Tung; Su, Hairui; Wang, Rui; Lu, Yuheng; Tang, Haiping; Aoyagi, Sayura; Guo, Ailan; Khodadadi-Jamayran, Alireza; Zhou, Dewang; Qian, Kun; Hricik, Todd; Côté, Jocelyn; Han, Xiaosi; Zhou, Wenping; Laha, Suparna; Abdel-Wahab, Omar; Levine, Ross L; Raffel, Glen; Liu, Yanyan; Chen, Dongquan; Li, Haitao; Townes, Tim; Wang, Hengbin; Deng, Haiteng; Zheng, Y George; Leslie, Christina; Luo, Minkui; Zhao, Xinyang
2015-11-17
RBM15, an RNA binding protein, determines cell-fate specification of many tissues including blood. We demonstrate that RBM15 is methylated by protein arginine methyltransferase 1 (PRMT1) at residue R578, leading to its degradation via ubiquitylation by an E3 ligase (CNOT4). Overexpression of PRMT1 in acute megakaryocytic leukemia cell lines blocks megakaryocyte terminal differentiation by downregulation of RBM15 protein level. Restoring RBM15 protein level rescues megakaryocyte terminal differentiation blocked by PRMT1 overexpression. At the molecular level, RBM15 binds to pre-messenger RNA intronic regions of genes important for megakaryopoiesis such as GATA1, RUNX1, TAL1 and c-MPL. Furthermore, preferential binding of RBM15 to specific intronic regions recruits the splicing factor SF3B1 to the same sites for alternative splicing. Therefore, PRMT1 regulates alternative RNA splicing via reducing RBM15 protein concentration. Targeting PRMT1 may be a curative therapy to restore megakaryocyte differentiation for acute megakaryocytic leukemia.
Association of ESR1 gene tagging SNPs with breast cancer risk
Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia
2009-01-01
We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777
NickAria, Shiva; Haghpanah, Sezaneh; Ramzi, Mani; Karimi, Mehran
2018-05-10
Globin switching is a significant factor on blood hemoglobin (Hb) level but its molecular mechanisms have not yet been identified, however, several quantitative trait loci (QTL) and polymorphisms involved regions on chromosomes 2p, 6q, 8q and X account for variation in the γ-globin expression level. We studied the effect of interaction between a region on intron six of the TOX gene, chromosome 8q (chr8q) and XmnI locus on the γ-globin promoter, chr11p on γ-globin expression in 150 β-thalassemia intermedia (β-TI) patients, evaluated by statistical interaction analysis. Our results showed a significant interaction between one QTL on intron six of the TOX gene (rs9693712) and XmnI locus that effect γ-globin expression. Interchromosomal interaction mediates through transcriptional machanisms to preserve true genome architectural features, chromosomes localization and DNA bending. This interaction can be a part of the unknown molecular mechanism of globin switching and regulation of gene expression.
Forks in the tracks: Group II introns, spliceosomes, telomeres and beyond.
Agrawal, Rajendra Kumar; Wang, Hong-Wei; Belfort, Marlene
2016-12-01
Group II introns are large catalytic RNAs that form a ribonucleoprotein (RNP) complex by binding to an intron-encoded protein (IEP). The IEP, which facilitates both RNA splicing and intron mobility, has multiple activities including reverse transcriptase. Recent structures of a group II intron RNP complex and of IEPs from diverse bacteria fuel arguments that group II introns are ancestrally related to eukaryotic spliceosomes as well as to telomerase and viruses. Furthermore, recent structural studies of various functional states of the spliceosome allow us to draw parallels between the group II intron RNP and the spliceosome. Here we present an overview of these studies, with an emphasis on the structure of the IEPs in their isolated and RNA-bound states and on their evolutionary relatedness. In addition, we address the conundrum of the free, albeit truncated IEPs forming dimers, whereas the IEP bound to the intron ribozyme is a monomer in the mature RNP. Future studies needed to resolve some of the outstanding issues related to group II intron RNP function and dynamics are also discussed.
RNA Sequencing of the Exercise Transcriptome in Equine Athletes
Verini-Supplizi, Andrea; Barcaccia, Gianni; Albiero, Alessandro; D'Angelo, Michela; Campagna, Davide; Valle, Giorgio; Felicetti, Michela; Silvestrelli, Maurizio; Cappelli, Katia
2013-01-01
The horse is an optimal model organism for studying the genomic response to exercise-induced stress, due to its natural aptitude for athletic performance and the relative homogeneity of its genetic and environmental backgrounds. Here, we applied RNA-sequencing analysis through the use of SOLiD technology in an experimental framework centered on exercise-induced stress during endurance races in equine athletes. We monitored the transcriptional landscape by comparing gene expression levels between animals at rest and after competition. Overall, we observed a shift from coding to non-coding regions, suggesting that the stress response involves the differential expression of not annotated regions. Notably, we observed significant post-race increases of reads that correspond to repeats, especially the intergenic and intronic L1 and L2 transposable elements. We also observed increased expression of the antisense strands compared to the sense strands in intronic and regulatory regions (1 kb up- and downstream) of the genes, suggesting that antisense transcription could be one of the main mechanisms for transposon regulation in the horse under stress conditions. We identified a large number of transcripts corresponding to intergenic and intronic regions putatively associated with new transcriptional elements. Gene expression and pathway analysis allowed us to identify several biological processes and molecular functions that may be involved with exercise-induced stress. Ontology clustering reflected mechanisms that are already known to be stress activated (e.g., chemokine-type cytokines, Toll-like receptors, and kinases), as well as “nucleic acid binding” and “signal transduction activity” functions. There was also a general and transient decrease in the global rates of protein synthesis, which would be expected after strenuous global stress. In sum, our network analysis points toward the involvement of specific gene clusters in equine exercise-induced stress, including those involved in inflammation, cell signaling, and immune interactions. PMID:24391776
Ramos, Lucero Rengifo; Arias, Duverney Gaviria; Salazar, Liliana Salazar; Vélez, Juan Pablo; Pardo, Stella Lozano
2012-03-01
The indel polymorphisms in the promoting region and the 2(nd) intron polymorphisms in the serotonin transporter gene (SLC6A4) have been associated to bipolar disorder 1 (BD1) in several population studies. The objective was to analyze the genotypic and allelic frequencies in both gene regions in a study of cases and controls with individuals from Risaralda and Quindío (Colombia) so as to establish possible associations to BD1, and compare results with previous and similar studies. 133 patients and 120 controls were studied. L and S indel polymorphisms in the promoting region were analyzed by PCR, together with VNTR STin2.10 and STin 2.12 VNTRs polymorphisms in the 2(nd) intron of the SL-C6A4 gene Genotypic and allelic frequencies for the S and L polymorphisms were similar both in cases and controls. However, the LL genotype was significantly increased both in BD1 population (OR=1.89; CI95%=1.1-3.68), and when discriminated by gender. This particular genotype in general population is OR=2.22; IC95%=1.04-5.66 for women, and OR=1.62; IC 95%=0.71-4.39 for men. No significant genotypic and allelic differences were found for VNTR STin2.10 and STin 2.12. polymorphisms. No association was found between polymorphisms of 5-HTTLPR polymorphisms and the 2(nd) intron of the serotonin transporting gene in general patients with BD1, nor when compared by gender. Our results are similar to those reported for Caucasian populations and differ from those of Asian and Brazilian populations. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Gene structure and evolution of transthyretin in the order Chiroptera.
Khwanmunee, Jiraporn; Leelawatwattana, Ladda; Prapunpoj, Porntip
2016-02-01
Bats are mammals in the order Chiroptera. Although many extensive morphologic and molecular genetics analyses have been attempted, phylogenetic relationships of bats has not been completely resolved. The paraphyly of microbats is of particular controversy that needs to be confirmed. In this study, we attempted to use the nucleotide sequence of transthyretin (TTR) intron 1 to resolve the relationship among bats. To explore its utility, the complete sequences of TTR gene and intron 1 region of bats in Vespertilionidae: genus Eptesicus (Eptesicus fuscus) and genus Myotis (Myotis brandtii, Myotis davidii, and Myotis lucifugus), and Pteropodidae (Pteropus alecto and Pteropus vampyrus) were extracted from the retrieved sequences, whereas those of Rhinoluphus affinis and Scotophilus kuhlii were amplified and sequenced. The derived overall amino sequences of bat TTRs were found to be very similar to those in other eutherians but differed from those in other classes of vertebrates. However, missing of amino acids from N-terminal or C-terminal region was observed. The phylogenetic analysis of amino acid sequences suggested bat and other eutherian TTRs lineal descent from a single most recent common ancestor which differed from those of non-placental mammals and the other classes of vertebrates. The splicing of bat TTR precursor mRNAs was similar to those of other eutherian but different from those of marsupial, bird, reptile and amphibian. Based on TTR intron 1 sequence, the inferred evolutionary relationship within Chiroptera revealed more closely relatedness of R. affinis to megabats than to microbats. Accordingly, the paraphyly of microbats was suggested.
Regulation of the plasma cell transcription factor Blimp-1 gene by Bach2 and Bcl6.
Ochiai, Kyoko; Muto, Akihiko; Tanaka, Hiromu; Takahashi, Shinichiro; Igarashi, Kazuhiko
2008-03-01
B lymphocyte-induced maturation protein 1 (Blimp-1) is a key regulator for plasma cell differentiation. Prior to the terminal differentiation into plasma cells, Blimp-1 expression is suppressed in B cells by transcription repressors BTB and CNC homology 2 (Bach2) and B cell lymphoma 6 (Bcl6). Bach2 binds to the Maf recognition element (MARE) of the promoter upstream region of the Blimp-1 gene (Prdm1) by forming a heterodimer with MafK. Bach2 and Bcl6 were found to interact with each other in B cells. While both Bach2 and Bcl6 possess the BTB domain which mediates protein-protein interactions, they interacted in a BTB-independent manner. Bcl6 is known to repress Prdm1 through a Bcl6 recognition element 1 in the intron 5, in which a putative, evolutionarily conserved MARE was identified. Both repressed the expression of a reporter gene containing the intron 5 region depending on the presence of the respective binding sites in 18-81 pre-B cells. Co-expression of Bach2 and Bcl6 resulted in further repression of the reporter plasmid. Chromatin immunoprecipitation assays showed MafK to bind to the intron MARE in various B cell lines, thus suggesting that it binds as a heterodimer with Bach2. Therefore, the interaction between Bach2 and Bcl6 might be crucial for the proper repression of Prdm1 in B cells.
Yu, Yi; Panhuysen, Carolien; Kranzler, Henry R; Hesselbrock, Victor; Rounsaville, Bruce; Weiss, Roger; Brady, Kathleen; Farrer, Lindsay A; Gelernter, Joel
2006-07-15
We report here a study considering association of alleles and haplotypes at the DOPA decarboxylase (DDC) locus with the DSM-IV diagnosis of nicotine dependence (ND) or a quantitative measure for ND using the Fagerstrom Test for Nicotine Dependence (FTND). We genotyped 18 single nucleotide polymorphisms (SNPs) spanning a region of approximately 210 kb that includes DDC and the genes immediately flanking DDC in 1,590 individuals from 621 families of African-American (AA) or European-American (EA) ancestry. Evidence of association (family-based tests) was observed with several SNPs for both traits (0.0002
NASA Astrophysics Data System (ADS)
Hamid, Nur Athirah Abd; Ismail, Ismanizan
2013-11-01
Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.
Chaw, Shu-Miaw; Walters, Terrence W; Chang, Chien-Chang; Hu, Shu-Hsuan; Chen, Shin-Hsiao
2005-10-01
Phylogenetic relationships among the three families and 12 living genera of cycads were reconstructed by distance and parsimony criteria using three markers: the chloroplast matK gene, the chloroplast trnK intron and the nuclear ITS/5.8S rDNA sequence. All datasets indicate that Cycadaceae (including only the genus Cycas) is remotely related to other cycads, in which Dioon was resolved as the basal-most clade, followed by Bowenia and a clade containing the remaining nine genera. Encephalartos and Lepidozamia are closer to each other than to Macrozamia. The African genus Stangeria is embedded within the New World subfamily Zamiodeae. Therefore, Bowenia is an unlikely sister to Stangeria, contrary to the view that they form the Stangeriaceae. The generic status of Dyerocycas and Chigua is unsupportable as they are paraphyletic with Cycas and the Zamia, respectively. Nonsense mutations in the matK gene and indels in the other two datasets lend evidence to reinforce the above conclusions. According to the phylogenies, the past geography of the genera of cycads and the evolution of character states are hypothesized and discussed. Within the suborder Zamiieae, Stangeria, and the tribe Zamieae evolved significantly faster than other genera. The matK gene and ITS/5.8S region contain more useful information than the trnK intron in addressing phylogeny. Redelimitations of Zamiaceae, Stangeriaceae, subfamily Encephalartoideae and subtribe Macrozamiineae are necessary.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript
Rose, Dominic; Stadler, Peter F.
2011-01-01
Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Wolff, G; Burger, G; Lang, B F; Kück, U
1993-01-01
The mitochondrial DNA from the colourless alga Prototheca wickerhamii contains two mosaic genes as was revealed from complete sequencing of the circular extranuclear genome. The genes for the large subunit of the ribosomal RNA (LSUrRNA) as well as for subunit I of the cytochrome oxidase (coxI) carry two and three intronic sequences respectively. On the basis of their canonical nucleotide sequences they can be classified as group I introns. Phylogenetic comparisons of the coxI protein sequences allow us to conclude that the P.wickerhamii mtDNA is much closer related to higher plant mtDNAs than to those of the chlorophyte alga C.reinhardtii. The comparison of the intron sequences revealed several unusual features: (1) The P.wickerhamii introns are structurally related to mitochondrial introns from various ascomycetous fungi. (2) Phylogenetic analyses indicate a close relationship between fungal and algal intronic sequences. (3) The P. wickerhamii introns are located at positions within the structural genes which can be considered as preferred intron insertion sites in homologous mitochondrial genes from fungi or liverwort. In all cases, the sequences adjacent to the insertion sites are very well conserved over large evolutionary distances. Our finding of highly similar introns in fungi and algae is consistent with the idea that introns have already been present in the bacterial ancestors of present day mitochondria and evolved concomitantly with the organelles. PMID:7680126
Regional spread of HIV-1 M subtype B in middle-aged patients by random env-C2V4 region sequencing
Stürmer, Martin; Zimmermann, Katrin; Fritzsche, Carlos; Reisinger, Emil; Doelken, Gottfried; Berger, Annemarie; Doerr, Hans W.; Eberle, Josef
2010-01-01
A transmission cluster of HIV-1 M:B was identified in 11 patients with a median age of 52 (range 26–65) in North-East Germany by C2V4 region sequencing of the env gene of HIV-1, who—except of one—were not aware of any risky behaviour. The 10 male and 1 female patients deteriorated immunologically, according to their information made available, within 4 years after a putative HIV acquisition. Nucleic acid sequence analysis showed a R5 virus in all patients and in 7 of 11 a crown motif of the V3 loop, GPGSALFTT, which is found rarely. Analysis of formation of this cluster showed that there is still a huge discrepancy between awareness and behaviour regarding HIV transmission in middle-aged patients, and that a local outbreak can be detected by nucleic acid analysis of the hypervariable env region. PMID:20217125
Eichmann, Cordula; Parson, Walther
2008-09-01
The traditional protocol for forensic mitochondrial DNA (mtDNA) analyses involves the amplification and sequencing of the two hypervariable segments HVS-I and HVS-II of the mtDNA control region. The primers usually span fragment sizes of 300-400 bp each region, which may result in weak or failed amplification in highly degraded samples. Here we introduce an improved and more stable approach using shortened amplicons in the fragment range between 144 and 237 bp. Ten such amplicons were required to produce overlapping fragments that cover the entire human mtDNA control region. These were co-amplified in two multiplex polymerase chain reactions and sequenced with the individual amplification primers. The primers were carefully selected to minimize binding on homoplasic and haplogroup-specific sites that would otherwise result in loss of amplification due to mis-priming. The multiplexes have successfully been applied to ancient and forensic samples such as bones and teeth that showed a high degree of degradation.
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Shpakovskiĭ, G V; Lebedenko, E N
1998-01-01
Plasmid pYUK3 bearing the fet5+ gene of Schizosaccharomyces pombe was isolated from a genomic library of the fission yeast, and a detailed physical map of the whole genomic insert (ca. 9.6 Kbp) was constructed. The primary structure of the fet5+ gene and its flanking regions is established. The gene contains a single 45-bp intron in its distal part. A typical TATA-box (TATAAG) was found in the 5'-noncoding region ca. 50 bp upstream of the putative start of transcription, and the 3'-noncoding region contains AT-rich palindromes, which are probably involved in termination of the fet5+ transcription. A previously unidentified gene of Sz. pombe encoding a protein with some similarity to one of the transcriptional activators from the TBP (TATA-binding protein) group of SPT factors of transcription was found in the vicinity of the fet5+ gene. Taking into account that cDNA of the fet5(+)-gene was isolated as a suppressor of the genetic-defect of nuclear RNA polymerases I-III (Bioorg. Khim., 1997, vol. 23, No 3, pp. 234-237), this vicinity may be the first evidence of possible clustering, in the genome of the fission yeast, of genes participating in transcription regulation.
Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.
Craig, Catherine L; Riekel, Christian
2002-12-01
The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differs. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.
Vaughn, J C; Mason, M T; Sper-Whitis, G L; Kuhlman, P; Palmer, J D
1995-11-01
We present phylogenetic evidence that a group I intron in an angiosperm mitochondrial gene arose recently by horizontal transfer from a fungal donor species. A 1,716-bp fragment of the mitochondrial coxI gene from the angiosperm Peperomia polybotrya was amplified via the polymerase chain reaction and sequenced. Comparison to other coxI genes revealed a 966-bp group I intron, which, based on homology with the related yeast coxI intron aI4, potentially encodes a 279-amino-acid site-specific DNA endonuclease. This intron, which is believed to function as a ribozyme during its own splicing, is not present in any of 19 coxI genes examined from other diverse vascular plant species. Phylogenetic analysis of intron origin was carried out using three different tree-generating algorithms, and on a variety of nucleotide and amino acid data sets from the intron and its flanking exon sequences. These analyses show that the Peperomia coxI gene intron and exon sequences are of fundamentally different evolutionary origin. The Peperomia intron is more closely related to several fungal mitochondrial introns, two of which are located at identical positions in coxI, than to identically located coxI introns from the land plant Marchantia and the green alga Prototheca. Conversely, the exon sequence of this gene is, as expected, most closely related to other angiosperm coxI genes. These results, together with evidence suggestive of co-conversion of exonic markers immediately flanking the intron insertion site, lead us to conclude that the Peperomia coxI intron probably arose by horizontal transfer from a fungal donor, using the double-strand-break repair pathway. The donor species may have been one of the symbiotic mycorrhizal fungi that live in close obligate association with most plants.
The evolution of Dscam genes across the arthropods.
Armitage, Sophie A O; Freiburg, Rebecca Y; Kurtz, Joachim; Bravo, Ignacio G
2012-04-13
One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv), which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most common recent ancestor, and how these clusters evolved. Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and supports the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes, a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories. Dscam genes have undergone independent duplication events in the insects and in an arachnid genome, which adds to the more well-known tandem duplications that have taken place within Dscam-hv genes. Therefore, two forms of gene expansion seem to be active within this gene family. The evolutionary history of this dynamic gene family will be further unfolded as genomes of species from more disparate groups become available.
The evolution of Dscam genes across the arthropods
2012-01-01
Background One way of creating phenotypic diversity is through alternative splicing of precursor mRNAs. A gene that has evolved a hypervariable form is Down syndrome cell adhesion molecule (Dscam-hv), which in Drosophila melanogaster can produce thousands of isoforms via mutually exclusive alternative splicing. The extracellular region of this protein is encoded by three variable exon clusters, each containing multiple exon variants. The protein is vital for neuronal wiring where the extreme variability at the somatic level is required for axonal guidance, and it plays a role in immunity where the variability has been hypothesised to relate to recognition of different antigens. Dscam-hv has been found across the Pancrustacea. Additionally, three paralogous non-hypervariable Dscam-like genes have also been described for D. melanogaster. Here we took a bioinformatics approach, building profile Hidden Markov Models to search across species for putative orthologs to the Dscam genes and for hypervariable alternatively spliced exons, and inferring the phylogenetic relationships among them. Our aims were to examine whether Dscam orthologs exist outside the Bilateria, whether the origin of Dscam-hv could lie outside the Pancrustacea, when the Dscam-like orthologs arose, how many alternatively spliced exons of each exon cluster were present in the most common recent ancestor, and how these clusters evolved. Results Our results suggest that the origin of Dscam genes may lie after the split between the Cnidaria and the Bilateria and supports the hypothesis that Dscam-hv originated in the common ancestor of the Pancrustacea. Our phylogeny of Dscam gene family members shows six well-supported clades: five containing Dscam-like genes and one containing all the Dscam-hv genes, a seventh clade contains arachnid putative Dscam genes. Furthermore, the exon clusters appear to have experienced different evolutionary histories. Conclusions Dscam genes have undergone independent duplication events in the insects and in an arachnid genome, which adds to the more well-known tandem duplications that have taken place within Dscam-hv genes. Therefore, two forms of gene expansion seem to be active within this gene family. The evolutionary history of this dynamic gene family will be further unfolded as genomes of species from more disparate groups become available. PMID:22500922
[Detection of factor VIII intron 1 inversion in severe haemophilia A].
Liang, Yan; Yan, Zhen-yu; Yan, Mei; Hua, Bao-lai; Xiao, Bai; Zhao, Yong-qiang; Liu, Jing-zhong
2009-06-01
Screening the intron 1 inversion of factor VIII (FVIII) in the population of severe haemophilia A(HA) in China and performing carrier detection and prenatal diagnosis. Using LD-PCR to detect intron 22 inversions and multiple-PCR within two tubes to intron 1 inversions in severe HA patients. Carrier detection and prenatal diagnosis were performed in affected families. Linkage analysis and DNA sequencing were used to verify these tests. One hundred and eighteen patients were seven diagnosed as intron 22 inversions and 7 were intron 1 inversions out of 247 severe HA patients. The prevalence of the intron 1 inversion in Chinese severe haemophilia A patients was 2.8% (7/247). Six women from family A and 2 from family B were diagnosed as carriers. One fetus from family A was affected fetus. Intron 1 inversion could be detected directly by multiple-PCR within two tubes. This method made the strategy more perfective in carrier and prenatal diagnosis of haemophilia A.
Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability.
Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit
2017-08-17
Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.
Transposition of an intron in yeast mitochondria requires a protein encoded by that intron.
Macreadie, I G; Scott, R M; Zinn, A R; Butow, R A
1985-06-01
The optional 1143 bp intron in the yeast mitochondrial 21S rRNA gene (omega +) is nearly quantitatively inserted in genetic crosses into 21S rRNA alleles that lack it (omega -). The intron contains an open reading frame that can encode a protein of 235 amino acids, but no function has been ascribed to this sequence. We previously found an in vivo double-strand break in omega - DNA at or close to the intron insertion site only in zygotes of omega + X omega - crosses that appears with the same kinetics as intron insertion. We now show that mutations in the intron open reading frame that would alter the translation product simultaneously inhibit nonreciprocal omega recombination and the in vivo double-strand break in omega - DNA. These results provide evidence that the open reading frame encodes a protein required for intron transposition and support the role of the double-strand break in the process.
Imprecise intron losses are less frequent than precise intron losses but are not rare in plants.
Ma, Ming-Yue; Zhu, Tao; Li, Xue-Nan; Lan, Xin-Ran; Liu, Heng-Yuan; Yang, Yu-Fei; Niu, Deng-Ke
2015-05-27
In this study, we identified 19 intron losses, including 11 precise intron losses (PILs), six imprecise intron losses (IILs), one de-exonization, and one exon deletion in tomato and potato, and 17 IILs in Arabidopsis thaliana. Comparative analysis of related genomes confirmed that all of the IILs have been fixed during evolution. Consistent with previous studies, our results indicate that PILs are a major type of intron loss. However, at least in plants, IILs are unlikely to be as rare as previously reported. This article was reviewed by Jun Yu and Zhang Zhang. For complete reviews, see the Reviewers' Reports section.
Organellar maturases: A window into the evolution of the spliceosome.
Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren
2015-09-01
During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.
Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L
2011-08-01
In the present study, we have determined the complete genomic sequence and analysed the intron polymorphism of partial HLA-B and HLA-C alleles in the Chinese Han population. Over 3.0 kb DNA fragments of HLA-B and HLA-C loci were amplified by polymerase chain reaction from partial 5' untranslated region to 3' noncoding region respectively, and then the amplified products were sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and IMGT/HLA database. Two novel alleles of HLA-B*52:01:01:02 and HLA-B*59:01:01:02 were identified, and the complete genomic sequence of HLA-B*52:01:01:01 was firstly reported. Totally 157 and 167 polymorphism positions were found in the full-length genomic sequence of HLA-B and HLA-C loci respectively. Our results suggested that many single nucleotide polymorphisms existed in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.
ENDERSBY, N. M.; HOFFMANN, A. A.; WHITE, V. L.; LOWENSTEIN, S.; RITCHIE, S.; JOHNSON, P. H.; RAPLEY, L. P.; RYAN, P. A.; NAM, V. S.; YEN, N. T.; KITTIYAPONG, P.; WEEKS, A. R.
2009-01-01
The distribution of Aedes aegypti (L.) in Australia is currently restricted to northern Queensland, but it has been more extensive in the past. In this study, we evaluate the genetic structure of Ae. aegypti populations in Australia and Vietnam and consider genetic differentiation between mosquitoes from these areas and those from a population in Thailand. Six microsatellites and two exon primed intron crossing markers were used to assess isolation by distance across all populations and also within the Australian sample. Investigations of founder effects, amount of molecular variation between and within regions and comparison of FST values among Australian and Vietnamese populations were made to assess the scale of movement of Ae. aegypti. Genetic control methods are under development for mosquito vector populations including the dengue vector Ae. aegypti. The success of these control methods will depend on the population structure of the target species including population size and rates of movement among populations. Releases of modified mosquitoes could target local populations that show a high degree of isolation from surrounding populations, potentially allowing new variants to become established in one region with eventual dispersal to other regions. PMID:19769038
Vazquez-Anderson, Jorge; Mihailovic, Mia K; Baldridge, Kevin C; Reyes, Kristofer G; Haning, Katie; Cho, Seung Hee; Amador, Paul; Powell, Warren B; Contreras, Lydia M
2017-05-19
Current approaches to design efficient antisense RNAs (asRNAs) rely primarily on a thermodynamic understanding of RNA-RNA interactions. However, these approaches depend on structure predictions and have limited accuracy, arguably due to overlooking important cellular environment factors. In this work, we develop a biophysical model to describe asRNA-RNA hybridization that incorporates in vivo factors using large-scale experimental hybridization data for three model RNAs: a group I intron, CsrB and a tRNA. A unique element of our model is the estimation of the availability of the target region to interact with a given asRNA using a differential entropic consideration of suboptimal structures. We showcase the utility of this model by evaluating its prediction capabilities in four additional RNAs: a group II intron, Spinach II, 2-MS2 binding domain and glgC 5΄ UTR. Additionally, we demonstrate the applicability of this approach to other bacterial species by predicting sRNA-mRNA binding regions in two newly discovered, though uncharacterized, regulatory RNAs. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun
1996-08-01
The complete 4775-nt cDNA encoding the human serotonin 5-HT{sub 2C} receptor (5-HT{sub 2C}R), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5{prime}-untranslated region and a 2670-nt 3{prime}-untranslated region. By using the cloned 5-HT{sub 2C}R cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT{sub 2C}R gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes.more » In addition, an alternatively spliced 5-HT{sub 2C}R RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio for the short isoform over the 5-HT{sub 2C}R RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT{sub 2C}R gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT{sub 2C}R gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5{prime}-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5{prime}-flanking 5-HT{sub 2C}R DNA directed the efficient expression of a luciferase reported gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that is contains a functional promoter. 69 refs., 8 figs., 1 tab.« less
Polymorphism in the intron 20 of porcine O-linked N-acetylglucosamine transferase
USDA-ARS?s Scientific Manuscript database
Objective: O-linked N-acetylglucosamine (O-GlcNAc) transferase (OGT) catalyzes the addition of O-GlcNAc and GlcNAcylation has extensive crosstalk with phosphorylation to regulate signaling and transcription. Pig OGT is located near the region of chromosome X that affects follicle stimulating hormone...
Schäfer, B; Merlos-Lange, A M; Anderl, C; Welser, F; Zimmer, M; Wolf, K
1991-01-01
In this paper we report the inability of four group I introns in the gene encoding subunit I of cytochrome c oxidase (cox1) and the group II intron in the apocytochrome b gene (cob) to splice autocatalytically. Furthermore we present the characterization of the first cox1 intron in the mutator strain anar-14 and the construction and characterization of strains with intronless mitochondrial genomes. We provide evidence that removal of introns at the DNA level (termed DNA splicing) is dependent on an active RNA maturase. Finally we demonstrate that the absence of introns does not abolish homologous mitochondrial recombination.
Colonization of heterochromatic genes by transposable elements in Drosophila.
Dimitri, Patrizio; Junakovic, Nikolaj; Arcà, Bruno
2003-04-01
As a further step toward understanding transposable element-host genome interactions, we investigated the molecular anatomy of introns from five heterochromatic and 22 euchromatic protein-coding genes of Drosophila melanogaster. A total of 79 kb of intronic sequences from heterochromatic genes and 355 kb of intronic sequences from euchromatic genes have been used in Blast searches against Drosophila transposable elements (TEs). The results show that TE-homologous sequences belonging to 19 different families represent about 50% of intronic DNA from heterochromatic genes. In contrast, only 0.1% of the euchromatic intron DNA exhibits homology to known TEs. Intraspecific and interspecific size polymorphisms of introns were found, which are likely to be associated with changes in TE-related sequences. Together, the enrichment in TEs and the apparent dynamic state of heterochromatic introns suggest that TEs contribute significantly to the evolution of genes located in heterochromatin.
An intron within the 16S ribosomal RNA gene of the archaeon Pyrobaculum aerophilum
NASA Technical Reports Server (NTRS)
Burggraf, S.; Larsen, N.; Woese, C. R.; Stetter, K. O.
1993-01-01
The 16S rRNA genes of Pyrobaculum aerophilum and Pyrobaculum islandicum were amplified by the polymerase chain reaction, and the resulting products were sequenced directly. The two organisms are closely related by this measure (over 98% similar). However, they differ in that the (lone) 16S rRNA gene of Pyrobaculum aerophilum contains a 713-bp intron not seen in the corresponding gene of Pyrobaculum islandicum. To our knowledge, this is the only intron so far reported in the small subunit rRNA gene of a prokaryote. Upon excision the intron is circularized. A secondary structure model of the intron-containing rRNA suggests a splicing mechanism of the same type as that invoked for the tRNA introns of the Archaea and Eucarya and 23S rRNAs of the Archaea. The intron contains an open reading frame whose protein translation shows no certain homology with any known protein sequence.
The group II intron maturase: a reverse transcriptase and splicing factor go hand in hand.
Zhao, Chen; Pyle, Anna Marie
2017-12-01
The splicing of group II introns in vivo requires the assistance of a multifunctional intron encoded protein (IEP, or maturase). Each IEP is also a reverse-transcriptase enzyme that enables group II introns to behave as mobile genetic elements. During splicing or retro-transposition, each group II intron forms a tight, specific complex with its own encoded IEP, resulting in a highly reactive holoenzyme. This review focuses on the structural basis for IEP function, as revealed by recent crystal structures of an IEP reverse transcriptase domain and cryo-EM structures of an IEP-intron complex. These structures explain how the same IEP scaffold is utilized for intron recognition, splicing and reverse transcription, while providing a physical basis for understanding the evolutionary transformation of the IEP into the eukaryotic splicing factor Prp8. Copyright © 2017 Elsevier Ltd. All rights reserved.
Characterization of a prototype strain of hepatitis E virus.
Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H
1992-01-01
A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. Images PMID:1731327
Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M
2014-01-01
A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Genetic analysis and ethnic affinities from two Scytho-Siberian skeletons.
Ricaut, François-Xavier; Keyser-Tracqui, Christine; Cammaert, Laurence; Crubézy, Eric; Ludes, Bertrand
2004-04-01
We extracted DNA from two skeletons belonging to the Sytho-Siberian population, which were excavated from the Sebÿstei site (dating back 2,500 years) in the Altai Republic (Central Asia). Ancient DNA was analyzed by autosomal short tandem repeats (STRs) and by the sequencing of the hypervariable region 1 (HV1) of the mitochondrial DNA (mtDNA) control region. The results showed that these two skeletons were not close relatives. Moreover, their haplogroups were characteristic of Asian populations. Comparison with the haplogroup of 3,523 Asian and American individuals linked one skeleton with a putative ancestral paleo-Asiatic population and the other with Chinese populations. It appears that the genetic study of ancient populations of Central Asia brings important elements to the understanding of human population movements in Asia. Copyright 2003 Wiley-Liss, Inc.
Asakura, Yukari; Barkan, Alice
2007-12-01
The CRM domain is a recently recognized RNA binding domain found in three group II intron splicing factors in chloroplasts, in a bacterial protein that associates with ribosome precursors, and in a family of uncharacterized proteins in plants. To elucidate the functional repertoire of proteins with CRM domains, we studied CFM2 (for CRM Family Member 2), which harbors four CRM domains. RNA coimmunoprecipitation assays showed that CFM2 in maize (Zea mays) chloroplasts is associated with the group I intron in pre-trnL-UAA and group II introns in the ndhA and ycf3 pre-mRNAs. T-DNA insertions in the Arabidopsis thaliana ortholog condition a defective-seed phenotype (strong allele) or chlorophyll-deficient seedlings with impaired splicing of the trnL group I intron and the ndhA, ycf3-int1, and clpP-int2 group II introns (weak alleles). CFM2 and two previously described CRM proteins are bound simultaneously to the ndhA and ycf3-int1 introns and act in a nonredundant fashion to promote their splicing. With these findings, CRM domain proteins are implicated in the activities of three classes of catalytic RNA: group I introns, group II introns, and 23S rRNA.
Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás
2013-01-01
Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns.
Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás
2013-01-01
Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns. PMID:24391881
Spencer, Thomas J.; Biederman, Joseph; Faraone, Stephen V.; Madras, Bertha K.; Bonab, Ali A.; Dougherty, Darin D.; Batchelder, Holly; Clarke, Allison; Fischman, Alan J.
2013-01-01
Background The main aim of this study was to examine the relationship between dopamine transporter (DAT) binding in the striatum in individuals with and without attention-deficit/hyperactivity disorder (ADHD), attending to the 3′-untranslated region of the gene (3′-UTR) and intron8 variable number of tandem repeats (VNTR) polymorphisms of the DAT (SLC6A3) gene. Methods Subjects consisted of 68 psychotropic (including stimulant)-naïve and smoking-naïve volunteers between 18 and 55 years of age (ADHD n = 34; control subjects n = 34). Striatal DAT binding was measured with positron emission tomography with 11C altropane. Genotyping of the two DAT (SLC6A3) 3′-UTR and intron8 VNTRs used standard protocols. Results The gene frequencies of each of the gene polymorphisms assessed did not differ between the ADHD and control groups. The ADHD status (t = 2.99; p < .004) and 3′-UTR of SLC6A3 9 repeat carrier status (t = 2.74; p < .008) were independently and additively associated with increased DAT binding in the caudate. The ADHD status was associated with increased striatal (caudate) DAT binding regardless of 3′-UTR genotype, and 3′-UTR genotype was associated with increased striatal (caudate) DAT binding regardless of ADHD status. In contrast, there were no significant associations between polymorphisms of DAT intron8 or the 3′-UTR-intron8 haplotype with DAT binding. Conclusions The 3′-UTR but not intron8 VNTR genotypes were associated with increased DAT binding in both ADHD patients and healthy control subjects. Both ADHD status and the 3′-UTR polymorphism status had an additive effect on DAT binding. Our findings suggest that an ADHD risk polymorphism (3′-UTR) of SLC6A3 has functional consequences on central nervous system DAT binding in humans. PMID:23273726
Mechanism for DNA transposons to generate introns on genomic scales
Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.
2017-01-01
Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
Yang, Zefeng; Gu, Shiliang; Wang, Xuefeng; Li, Wenjuan; Tang, Zaixiang; Xu, Chenwu
2008-09-01
CPP-like genes are members of a small family which features the existence of two similar Cys-rich domains termed CXC domains in their protein products and are distributed widely in plants and animals but do not exist in yeast. The members of this family in plants play an important role in development of reproductive tissue and control of cell division. To gain insights into how CPP-like genes evolved in plants, we conducted a comparative phylogenetic and molecular evolutionary analysis of the CPP-like gene family in Arabidopsis and rice. The results of phylogeny revealed that both gene loss and species-specific expansion contributed to the evolution of this family in Arabidopsis and rice. Both intron gain and intron loss were observed through intron/exon structure analysis for duplicated genes. Our results also suggested that positive selection was a major force during the evolution of CPP-like genes in plants, and most amino acid residues under positive selection were disproportionately located in the region outside the CXC domains. Further analysis revealed that two CXC domains and sequences connecting them might have coevolved during the long evolutionary period.
A novel intronic mutation in the DDP1 gene in a family with X-linked dystonia-deafness syndrome.
Ezquerra, Mario; Campdelacreu, Jaume; Muñoz, Esteban; Tolosa, Eduardo; Martí, María J
2005-02-01
X-linked dystonia-deafness syndrome (Mohr-Tranebjaerg syndrome) is a rare neurodegenerative disease characterized by hearing loss and dystonia. So far, 7 mutations in the coding region of the DDP1 gene have been described. They consist of frameshift, nonsense, missense mutations or deletions. To investigate the presence of mutations in the DDP1 gene in a family with dystonia-deafness syndrome. Seven members belonging to 2 generations of a family with 2 affected subjects underwent genetic analysis. Mutational screening in the DDP1 gene was made through DNA direct sequencing. We found an intronic mutation in the DDP1 gene. It consists of an A-to-C substitution in the position -23 in reference to the first nucleotide of exon 2 (IVS1-23A>C). The mutation was present in 2 affected men and their respective unaffected mothers, whereas it was absent in the healthy men from this family and in 90 healthy controls. Intronic mutations in the DDP1 gene can also cause X-linked dystonia-deafness syndrome. In our case, the effect of the mutation could be due to a splicing alteration.
Gandhi, Manish J; Pendergrass, Thomas W; Cummings, Carrie C; Ihara, Kenji; Blau, C Anthony; Drachman, Jonathan G
2005-10-01
An 11-year-old girl, presenting with fatigue and bruising, was found to be profoundly pancytopenic. Bone marrow exam and clinical evaluation were consistent with aplastic anemia. Family members were studied as potential stem cell donors, revealing that both younger siblings displayed significant thrombocytopenia, whereas both parents had normal blood counts. We evaluated this pedigree to understand the unusually late presentation of congenital amegakaryocytic thrombocytopenia (CAMT). The coding region and the intron/exon junctions of MPL were sequenced from each family member. Vectors representing each of the mutations were constructed and tested for the ability to support growth of Baf3/Mpl(mutant) cells. All three siblings had elevated thrombopoietin levels. Analysis of genomic DNA demonstrated that each parent had mutations/polymorphisms in a single MPL allele and that each child was a compound heterozygote, having inherited both abnormal alleles. The maternal allele encoded a mutation of the donor splice-junction at the exon-3/intron-3 boundary. A mini-gene construct encoding normal vs mutant versions of the intron-3 donor-site demonstrated that physiologic splicing was significantly reduced in the mutant construct. Mutations that incompletely eliminate Mpl expression/function may result in delayed diagnosis of CAMT and confusion with aplastic anemia.
NASA Astrophysics Data System (ADS)
Li, Shengjie; Bai, Junjie; Wang, Lin
2008-08-01
Myostatin or GDF-8, a member of the transforming growth factor-β (TGF-β) superfamily, has been demonstrated to be a negative regulator of skeletal muscle mass in mammals. In the present study, we obtained a 5.64 kb sequence of myostatin encoding gene and its promoter from largemouth bass ( Micropterus salmoides). The myostatin encoding gene consisted of three exons (488 bp, 371 bp and 1779 bp, respectively) and two introns (390 bp and 855 bp, respectively). The intron-exon boundaries were conservative in comparison with those of mammalian myostatin encoding genes, whereas the size of introns was smaller than that of mammals. Sequence analysis of 1.569 kb of the largemouth bass myostatin gene promoter region revealed that it contained two TATA boxes, one CAAT box and nine putative E-boxes. Putative muscle growth response elements for myocyte enhancer factor 2 (MEF2), serum response factor (SRF), activator protein 1 (AP1), etc., and muscle-specific Mt binding site (MTBF) were also detected. Some of the transcription factor binding sites were conserved among five teleost species. This information will be useful for studying the transcriptional regulation of myostatin in fish.
Bergman, C M; Kreitman, M
2001-08-01
Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu
2013-04-01
The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.
Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M
2013-07-01
Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.
Gonzalez, P; Barroso, G; Labarère, J
1998-10-05
The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.
Reenacting the birth of an intron
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.
2011-07-01
An intron is an extended genomic feature whose function requires multiple constrained positions - donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers - that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half a billion years ago.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nagao, Yoshiro; Sly, W.S.; Batanian, J.R.
1995-08-10
Carbonic anhydrase V (CA V) is expressed in mitochondrial matrix in liver and several other tissues. It is of interest for its putative roles in providing bicarbonate to carbamoyl phosphate synthetase for ureagenesis and to pyruvate carboxylase for gluconeogenesis and its possible importance in explaining certain inherited metabolic disorders with hyperammonemia and hypoglycemia. Following the recent characterization of the cDNA for human CA V, we report the isolation of the human gene from two {lambda} genomic libraries and its characterization. The CA V gene (CA5) is approximately 50 kb long and contains 7 exons and 6 introns. The exon-intron boundariesmore » are found in positions identical to those determined for the previously described CA II, CA III, and CA VII genes. Like the CA VII gene, CA5 does not contain typical TATA and CAAT promoter elements in the 5{prime} flanking region but does contain a TTTAA sequence 147 nucleotides upstream of the initiation codon. CA5 also contains a 12-bp GT-rich segment beginning 13 bp downstream of the polyadenylation signal in the 3{prime} untranslated region of exon 7. FISH analysis allowed CA5 to be assigned to chromosome 16q24.3. An unprocessed pseudogene containing sequence homologous to exons 3-7 and introns 3-6 was also isolated and was assigned by FISH analysis to chromosome 16p11.2-p12. 22 refs., 4 figs., 1 tab.« less
Widespread antisense transcription of Populus genome under drought.
Yuan, Yinan; Chen, Su
2018-06-06
Antisense transcription is widespread in many genomes and plays important regulatory roles in gene expression. The objective of our study was to investigate the extent and functional relevance of antisense transcription in forest trees. We employed Populus, a model tree species, to probe the antisense transcriptional response of tree genome under drought, through stranded RNA-seq analysis. We detected nearly 48% of annotated Populus gene loci with antisense transcripts and 44% of them with co-transcription from both DNA strands. Global distribution of reads pattern across annotated gene regions uncovered that antisense transcription was enriched in untranslated regions while sense reads were predominantly mapped in coding exons. We further detected 1185 drought-responsive sense and antisense gene loci and identified a strong positive correlation between the expression of antisense and sense transcripts. Additionally, we assessed the antisense expression in introns and found a strong correlation between intronic expression and exonic expression, confirming antisense transcription of introns contributes to transcriptional activity of Populus genome under drought. Finally, we functionally characterized drought-responsive sense-antisense transcript pairs through gene ontology analysis and discovered that functional groups including transcription factors and histones were concordantly regulated at both sense and antisense transcriptional level. Overall, our study demonstrated the extensive occurrence of antisense transcripts of Populus genes under drought and provided insights into genome structure, regulation pattern and functional significance of drought-responsive antisense genes in forest trees. Datasets generated in this study serve as a foundation for future genetic analysis to improve our understanding of gene regulation by antisense transcription.
Rumpho, Mary E.; Pochareddy, Sirisha; Worful, Jared M.; Summer, Elizabeth J.; Bhattacharya, Debashish; Pelletreau, Karen N.; Tyler, Mary S.; Lee, Jungho; Manhart, James R.; Soule, Kara M.
2009-01-01
Phosphoribulokinase (PRK), a nuclear-encoded plastid-localized enzyme unique to the photosynthetic carbon reduction (Calvin) cycle, was cloned and characterized from the stramenopile alga Vaucheria litorea. This alga is the source of plastids for the mollusc (sea slug) Elysia chlorotica which enable the animal to survive for months solely by photoautotrophic CO2 fixation. The 1633-bp V. litorea prk gene was cloned and the coding region, found to be interrupted by four introns, encodes a 405-amino acid protein. This protein contains the typical bipartite target sequence expected of nuclear-encoded proteins that are directed to complex (i.e. four membrane-bound) algal plastids. De novo synthesis of PRK and enzyme activity were detected in E. chlorotica in spite of having been starved of V. litorea for several months. Unlike the algal enzyme, PRK in the sea slug did not exhibit redox regulation. Two copies of partial PRK-encoding genes were isolated from both sea slug and aposymbiotic sea slug egg DNA using PCR. Each copy contains the nucleotide region spanning exon 1 and part of exon 2 of V. litorea prk, including the bipartite targeting peptide. However, the larger prk fragment also includes intron 1. The exon and intron sequences of prk in E. chlorotica and V. litorea are nearly identical. These data suggest that PRK is differentially regulated in V. litorea and E. chlorotica and at least a portion of the V. litorea nuclear PRK gene is present in sea slugs that have been starved for several months. PMID:19995736
Pattison, Jillian M.; Posternak, Valeriya; Cole, Michael D.
2016-01-01
It is well established that environmental toxins, such as exposure to arsenic, are risk factors in the development of urinary bladder cancer, yet recent genome-wide association studies (GWAS) provide compelling evidence that there is a strong genetic component associated with disease predisposition. A single nucleotide polymorphism (SNP), rs8102137, was identified on chromosome 19q12, residing 6 kb upstream of the important cell cycle regulator and proto-oncogene, Cyclin E1 (CCNE1). However, the functional role of this variant in bladder cancer predisposition has been unclear since it lies within a non-coding region of the genome. Here, it is demonstrated that bladder cancer cells heterozygous for this SNP exhibit biased allelic expression of CCNE1 with 1.5-fold more transcription occurring from the risk allele. Furthermore, using chromatin immunoprecipitation assays, a novel enhancer element was identified within the first intron of CCNE1 that binds Kruppel-like Factor 5 (KLF5), a known transcriptional activator in bladder cancer. Moreover, the data reveal that the presence of rs200996365, a SNP in high linkage disequilibrium with rs8102137 residing in the center of a KLF5 motif, alters KLF5 binding to this genomic region. Through luciferase assays and CRISPR-Cas9 genome editing, a novel polymorphic intronic regulatory element controlling CCNE1 transcription is characterized. These studies uncover how a cancer-associated polymorphism mechanistically contributes to an increased predisposition for bladder cancer development. Implications A polymorphic KLF5 binding site near the CCNE1 gene explains genetic risk identified through genome wide association studies. PMID:27514407
Arabidopsis Chloroplast Mini-Ribonuclease III Participates in rRNA Maturation and Intron Recycling
Hotto, Amber M.; Castandet, Benoît; Gilet, Laetitia; Higdon, Andrea; Condon, Ciarán; Stern, David B.
2015-01-01
RNase III proteins recognize double-stranded RNA structures and catalyze endoribonucleolytic cleavages that often regulate gene expression. Here, we characterize the functions of RNC3 and RNC4, two Arabidopsis thaliana chloroplast Mini-RNase III-like enzymes sharing 75% amino acid sequence identity. Whereas rnc3 and rnc4 null mutants have no visible phenotype, rnc3/rnc4 (rnc3/4) double mutants are slightly smaller and chlorotic compared with the wild type. In Bacillus subtilis, the RNase Mini-III is integral to 23S rRNA maturation. In Arabidopsis, we observed imprecise maturation of 23S rRNA in the rnc3/4 double mutant, suggesting that exoribonucleases generated staggered ends in the absence of specific Mini-III-catalyzed cleavages. A similar phenotype was found at the 3′ end of the 16S rRNA, and the primary 4.5S rRNA transcript contained 3′ extensions, suggesting that Mini-III catalyzes several processing events of the polycistronic rRNA precursor. The rnc3/4 mutant showed overaccumulation of a noncoding RNA complementary to the 4.5S-5S rRNA intergenic region, and its presence correlated with that of the extended 4.5S rRNA precursor. Finally, we found rnc3/4-specific intron degradation intermediates that are probable substrates for Mini-III and show that B. subtilis Mini-III is also involved in intron regulation. Overall, this study extends our knowledge of the key role of Mini-III in intron and noncoding RNA regulation and provides important insight into plastid rRNA maturation. PMID:25724636
An antisense RNA in a lytic cyanophage links psbA to a gene encoding a homing endonuclease.
Millard, Andrew D; Gierga, Gregor; Clokie, Martha R J; Evans, David J; Hess, Wolfgang R; Scanlan, David J
2010-09-01
Cyanophage genomes frequently possess the psbA gene, encoding the D1 polypeptide of photosystem II. This protein is believed to maintain host photosynthetic capacity during infection and enhance phage fitness under high-light conditions. Although the first documented cyanophage-encoded psbA gene contained a group I intron, this feature has not been widely reported since, despite a plethora of new sequences becoming available. In this study, we show that in cyanophage S-PM2, this intron is spliced during the entire infection cycle. Furthermore, we report the widespread occurrence of psbA introns in marine metagenomic libraries, and with psbA often adjacent to a homing endonuclease (HE). Bioinformatic analysis of the intergenic region between psbA and the adjacent HE gene F-CphI in S-PM2 showed the presence of an antisense RNA (asRNA) connecting these two separate genetic elements. The asRNA is co-regulated with psbA and F-CphI, suggesting its involvement with their expression. Analysis of scaffolds from global ocean survey datasets shows this asRNA to be commonly associated with the 3' end of cyanophage psbA genes, implying that this potential mechanism of regulating marine 'viral' photosynthesis is evolutionarily conserved. Although antisense transcription is commonly found in eukaryotic and increasingly also in prokaryotic organisms, there has been no indication for asRNAs in lytic phages so far. We propose that this asRNA also provides a means of preventing the formation of mobile group I introns within cyanophage psbA genes.
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons
Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil
2012-01-01
Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
Two CRM protein subfamilies cooperate in the splicing of group IIB introns in chloroplasts.
Asakura, Yukari; Bayraktar, Omer Ali; Barkan, Alice
2008-11-01
Chloroplast genomes in angiosperms encode approximately 20 group II introns, approximately half of which are classified as subgroup IIB. The splicing of all but one of the subgroup IIB introns requires a heterodimer containing the peptidyl-tRNA hydrolase homolog CRS2 and one of two closely related proteins, CAF1 or CAF2, that harbor a recently recognized RNA binding domain called the CRM domain. Two CRS2/CAF-dependent introns require, in addition, a CRM domain protein called CFM2 that is only distantly related to CAF1 and CAF2. Here, we show that CFM3, a close relative of CFM2, associates in vivo with those CRS2/CAF-dependent introns that are not CFM2 ligands. Mutant phenotypes in rice and Arabidopsis support a role for CFM3 in the splicing of most of the introns with which it associates. These results show that either CAF1 or CAF2 and either CFM2 or CFM3 simultaneously bind most chloroplast subgroup IIB introns in vivo, and that the CAF and CFM subunits play nonredundant roles in splicing. These results suggest that the expansion of the CRM protein family in plants resulted in two subfamilies that play different roles in group II intron splicing, with further diversification within a subfamily to accommodate multiple intron ligands.
The splicing of tiny introns of Paramecium is controlled by MAGO.
Contreras, Julia; Begley, Victoria; Marsella, Laura; Villalobo, Eduardo
2018-07-15
The exon junction complex (EJC) is a key element of the splicing machinery. The EJC core is composed of eIF4A3, MAGO, Y14 and MLN51. Few accessory proteins, such as CWC22 or UPF3, bind transiently to the EJC. The EJC has been implicated in the control of the splicing of long introns. To ascertain whether the EJC controls the splicing of short introns, we used Paramecium tetraurelia as a model organism, since it has thousands of very tiny introns. To elucidate whether EJC affects intron splicing in P. tetraurelia, we searched for EJC protein-coding genes, and silenced those genes coding for eIF4A3, MAGO and CWC22. We found that P. tetraurelia likely assembles an active EJC with only three of the core proteins, since MLN51 is lacking. Silencing of eIF4A3 or CWC22 genes, but not that of MAGO, caused lethality. Silencing of the MAGO gene caused either an increase, decrease, or no change in intron retention levels of some intron-containing mRNAs used as reporters. We suggest that a fine-tuning expression of EJC genes is required for steady intron removal in P. tetraurelia. Taking into consideration our results and those published by others, we conclude that the EJC controls splicing independently of the intron size. Copyright © 2018 Elsevier B.V. All rights reserved.
Gomes, Felipe E E S; Arantes, Thales D; Fernandes, José A L; Ferreira, Leonardo C; Romero, Héctor; Bosco, Sandra M G; Oliveira, Maria T B; Del Negro, Gilda M B; Theodoro, Raquel C
2018-01-01
Cryptococcosis, one of the most important systemic mycosis in the world, is caused by different genotypes of Cryptococcus neoformans and Cryptococcus gattii , which differ in their ecology, epidemiology, and antifungal susceptibility. Therefore, the search for new molecular markers for genotyping, pathogenicity and drug susceptibility is necessary. Group I introns fulfill the requisites for such task because (i) they are polymorphic sequences; (ii) their self-splicing is inhibited by some drugs; and (iii) their correct splicing under parasitic conditions is indispensable for pathogen survival. Here, we investigated the presence of group I introns in the mitochondrial LSU rRNA gene in 77 Cryptococcus isolates and its possible relation to drug susceptibility. Sequencing revealed two new introns in the LSU rRNA gene. All the introns showed high sequence similarity to other mitochondrial introns from distinct fungi, supporting the hypothesis of an ancient non-allelic invasion. Intron presence was statistically associated with those genotypes reported to be less pathogenic ( p < 0.001). Further virulence assays are needed to confirm this finding. In addition, in vitro antifungal tests indicated that the presence of LSU rRNA introns may influence the minimum inhibitory concentration (MIC) of amphotericin B and 5-fluorocytosine. These findings point to group I introns in the mitochondrial genome of Cryptococcus as potential molecular markers for antifungal resistance, as well as therapeutic targets.
Sultan, Laure D.; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Dietrich, André
2016-01-01
Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. PMID:27760804
Shcherban, Andrey Borisovich; Schichkina, Aleksandra Aleksandrovna; Salina, Elena Artemovna
2016-11-16
Triticum araraticum and Triticum timopheevii are tetraploid species of the Timopheevi group. The former includes both winter and spring forms with a predominance of winter forms, whereas T. timopheevii is considered a spring species. In order to clarify the origin of the spring growth habit in T. timopheevii, allelic variability of the VRN-1 gene was investigated in a set of accessions of both tetraploid species, together with the diploid species Ae. speltoides, presumed donor of the G genome to these tetraploids. The promoter region of the VRN-A1 locus in all studied tetraploid accessions of both T. araraticum and T. timopheevii represents the previously described allele VRN-A1f with a 50 bp deletion near the start codon. Three additional alleles were identified namely, VRN-A1f-del, VRN-A1f-ins and VRN-A1f-del/ins, which contained large mutations in the first (1 st ) intron of VRN-A1. The first allele, carrying a deletion of 2.7 kb in a central part of intron 1, occurred in a few accessions of T. araraticum and no accessions of T. timopheevii. The VRN-A1f-ins allele, containing the insertion of a 0.4 kb MITE element about 0.4 kb upstream from the start of intron 1, and allele VRN-A1f-del/ins having this insertion coupled with a deletion of 2.7 kb are characteristic only for T. timopheevii. Allelic variation at the VRN-G1 locus includes the previously described allele VRN-G1a (with the insertion of a 0.2 kb MITE in the promoter) found in a few accessions of both tetraploid species. We showed that alleles VRN-A1f-del and VRN-G1a have no association with the spring growth habit, while in all accessions of T. timopheevii this habit was associated with the dominant VRN-A1f-ins and VRN-A1f-del/ins alleles. None of the Ae. speltoides accessions included in this study had changes in the promoter or 1 st intron regions of VRN-1 which might confer a spring growth habit. The VRN-1 promoter sequences analyzed herein and downloaded from databases have been used to construct a phylogram to assess the time of divergence of Ae. speltoides in relation to other wheat species. Among accessions of T. araraticum, the preferentially winter predecessor of T. timopheevii, two large mutations were found in both VRN-A1 and VRN-G1 loci (VRN-A1f-del and VRN-G1a) that were found to have no effect on vernalization requirements. Spring tetraploid T. timopheevii had one VRN-1 allele in common for two species (VRN-G1a), and two that were specific (VRN-A1f-ins, VRN-A1f-del/ins). The latter alleles include mutations in the 1 st intron of VRN-A1 and also share a 0.4 kb MITE insertion near the start of intron 1. We suggested that this insertion resulted in a spring growth habit in a progenitor of T. timopheevii which has probably been selected during subsequent domestication. The phylogram constructed on the basis of the VRN-1 promoter sequences confirmed the early divergence (~3.5 MYA) of the ancestor(s) of the B/G genomes from Ae. speltoides.
Chee, Gab-Joo; Takami, Hideto
2011-01-01
Group II introns inserted into genes often undergo splicing at unexpected sites, and participate in the transcription of host genes. We identified five copies of a group II intron, designated Oi.Int, in the genome of an extremely halotolerant and alkaliphilic bacillus, Oceanobacillus iheyensis. The Oi.Int4 differs from the Oi.Int3 at four bases. The ligated exons of the Oi.Int4 could not be detected by RT-PCR assays in vivo or in vitro although group II introns can generally self-splice in vitro without the involvement of an intron-encoded open reading frame (ORF). In the Oi.Int4 mutants with base substitutions within the ORF, ligated exons were detected by in vitro self-splicing. It was clear that the ligation of exons during splicing is affected by the sequence of the intron-encoded ORF since the splice sites corresponded to the joining sites of the intron. In addition, the mutant introns showed unexpected multiple products with alternative 5' splice sites. These findings imply that alternative 5' splicing which causes a functional change of ligated exons presumably has influenced past adaptations of O. iheyensis to various environmental changes.
Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás
2007-01-01
RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161
Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás
2007-01-01
RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature.
Adams, Gerard C; Surve-Iyer, Rupa S; Iezzoni, Amy F
2002-01-01
Leucostoma species that are the causal agents of Cytospora canker of stone and pome fruit trees were studied in detail. DNA sequence of the internal transcribed spacer regions and the 5.8S of the nuclear ribosomal DNA operon (ITS rDNA) supplied sufficient characters to assess the phylogenetic relationships among species of Leucostoma, Valsa, Valsella, and related anamorphs in Cytospora. Parsimony analysis of the aligned sequence divided Cytospora isolates from fruit trees into clades that generally agreed with the morphological species concepts, and with some of the phenetic groupings (PG 1-6) identified previously by isozyme analysis and cultural characteristics. Phylogenetic analysis inferred that isolates of L. persoonii formed two well-resolved clades distinct from isolates of L. cinctum. Phylogenetic analysis of the ITS rDNA, isozyme analysis, and cultural characteristics supported the inference that L. persoonii groups PG 2 and PG 3 were populations of a new species apparently more genetically different from L. persoonii PG 1 than from isolates representative of L. massariana, L. niveum, L. translucens, and Valsella melastoma. The new species, L. parapersoonii, was described. A diverse collection of isolates of L. cinctum, L. persoonii, and L. parapersoonii were examined for genetic variation using restriction fragment length polymorphism (RFLP) analysis of the ITS rDNA and the five prime end of the large subunit of the rDNA (LSU rDNA). HinfI and HpaII endonucleases were each useful in dividing the Leucostoma isolates into RFLP profiles corresponding to the isozyme phenetic groups, PG 1-6. RFLP analysis was more effective than isozyme analysis in uncovering variation among isolates of L. persoonii PG 1, but less effective within L. cinctum populations. Isolates representative of seven of the L. persoonii formae speciales proposed by G. Défago in 1935 were found to be genetically diverse isolates of PG 1. Two large insertions, 415 and 309 nucleotides long, in the small subunit (SSU) of the nuclear rDNA of L. cinctum were identified as Group 1 introns; intron 1 at position 943 and intron 2 at position 1199. The two introns were found to be consistently present in isolates of L. cinctum PG 4 and PG 5 and absent from L. cinctum PG 6 isolates, despite the similarity of the ITS sequence and teleomorph morphology. Intron 1 was of subgroup 1C1 whereas intron 2 was of an unknown subgroup. RFLP patterns and presence/absence of introns were useful characters for expediting the identification of cultures of Leucostoma isolated from stone and pome fruit cankers. RFLP patterns from 13 endonucleases provided an effective method for selecting an array of diverse PG 1 isolates useful in screening plant germplasm for disease-resistance.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture.
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen; Burge, Christopher B
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning ('intron definition') or exon-spanning ('exon definition') pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila , using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60-70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.
Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A
2015-04-01
In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (approx. 14.5 kb), were bidirectionally sequenced for single nucleotide...
Choi, Kyoung Su; Kwak, Myounghai; Lee, Byoungyoon; Park, SeonJoo
2018-01-01
The chloroplast genome of Tetragonia tetragonioides (Aizoaceae; Caryophyllales) was sequenced to provide information for studies on phylogeny and evolution within Caryophyllales. The chloroplast genome of Tetragonia tetragonioides is 149,506 bp in length and includes a pair of inverted repeats (IRs) of 24,769 bp that separate a large single copy (LSC) region of 82,780 bp and a small single copy (SSC) region of 17,188 bp. Comparative analysis of the chloroplast genome showed that Caryphyllales species have lost many genes. In particular, the rpl2 intron and infA gene were not found in T. tetragonioides, and core Caryophyllales lack the rpl2 intron. Phylogenetic analyses were conducted using 55 genes in 16 complete chloroplast genomes. Caryophyllales was found to divide into two clades; core Caryophyllales and noncore Caryophyllales. The genus Tetragonia is closely related to Mesembryanthemum. Comparisons of the synonymous (Ks), nonsynonymous (Ka), and Ka/Ks substitution rates revealed that nonsynonymous substitution rates were lower than synonymous substitution rates and that Ka/Ks rates were less than 1. The findings of the present study suggest that most genes are a purified selection.
Kusumi, J; Tsumura, Y; Yoshimaru, H; Tachida, H
2000-10-01
Nucleotide sequences from four chloroplast genes, the matK, chlL, intergenic spacer (IGS) region between trnL and trnF, and an intron of trnL, were determined from all species of Taxodiaceae and five species of Cupressaceae sensu stricto (s.s.). Phylogenetic trees were constructed using the maximum parsimony and the neighbor-joining methods with Cunninghamia as an outgroup. These analyses provided greater resolution of relationships among genera and higher bootstrap supports for clades compared to previous analyses. Results indicate that Taiwania diverged first, and then Athrotaxis diverged from the remaining genera. Metasequoia, Sequoia, and Sequoiadendron form a clade. Taxodium and Glyptostrobus form a clade, which is the sister to Cryptomeria. Cupressaceae s.s. are derived from within Taxodiaceae, being the most closely related to the Cryptomeria/Taxodium/Glyptostrobus clade. These relationships are consistent with previous morphological groupings and the analyses of molecular data. In addition, we found acceleration of evolutionary rates in Cupressaceae s.s. Possible causes for the acceleration are discussed.
Dong, Chongmei; Vincent, Kate; Sharp, Peter
2009-12-04
TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful tool for reverse genetics, combining traditional chemical mutagenesis with high-throughput PCR-based mutation detection to discover induced mutations that alter protein function. The most popular mutation detection method for TILLING is a mismatch cleavage assay using the endonuclease CelI. For this method, locus-specific PCR is essential. Most wheat genes are present as three similar sequences with high homology in exons and low homology in introns. Locus-specific primers can usually be designed in introns. However, it is sometimes difficult to design locus-specific PCR primers in a conserved region with high homology among the three homoeologous genes, or in a gene lacking introns, or if information on introns is not available. Here we describe a mutation detection method which combines High Resolution Melting (HRM) analysis of mixed PCR amplicons containing three homoeologous gene fragments and sequence analysis using Mutation Surveyor software, aimed at simultaneous detection of mutations in three homoeologous genes. We demonstrate that High Resolution Melting (HRM) analysis can be used in mutation scans in mixed PCR amplicons containing three homoeologous gene fragments. Combining HRM scanning with sequence analysis using Mutation Surveyor is sensitive enough to detect a single nucleotide mutation in the heterozygous state in a mixed PCR amplicon containing three homoeoloci. The method was tested and validated in an EMS (ethylmethane sulfonate)-treated wheat TILLING population, screening mutations in the carboxyl terminal domain of the Starch Synthase II (SSII) gene. Selected identified mutations of interest can be further analysed by cloning to confirm the mutation and determine the genomic origin of the mutation. Polyploidy is common in plants. Conserved regions of a gene often represent functional domains and have high sequence similarity between homoeologous loci. The method described here is a useful alternative to locus-specific based methods for screening mutations in conserved functional domains of homoeologous genes. This method can also be used for SNP (single nucleotide polymorphism) marker development and eco-TILLING in polyploid species.
Melangath, Geetha; Sen, Titash; Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha; Vijayraghavan, Usha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3'ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5'ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5'ss in dtd1+ intron 1 and of an upstream alternative 3'ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5'ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5' ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3'ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast.
Kumar, Rakesh; Bawa, Pushpinder; Srinivasan, Subha
2017-01-01
Budding yeast spliceosomal factors ScSlu7 and ScPrp18 interact and mediate intron 3’ss choice during second step pre-mRNA splicing. The fission yeast genome with abundant multi-intronic transcripts, degenerate splice signals and SR proteins is an apt unicellular fungal model to deduce roles for core spliceosomal factors in alternative splice-site choice, intron retention and to study the cellular implications of regulated splicing. From our custom microarray data we deduce a stringent reproducible subset of S. pombe alternative events. We examined the role of factors SpSlu7 or SpPrp18 for these splice events and investigated the relationship to growth phase and stress. Wild-type log and stationary phase cells showed ats1+ exon 3 skipped and intron 3 retained transcripts. Interestingly the non-consensus 5’ss in ats1+ intron 3 caused SpSlu7 and SpPrp18 dependent intron retention. We validated the use of an alternative 5’ss in dtd1+ intron 1 and of an upstream alternative 3’ss in DUF3074 intron 1. The dtd1+ intron 1 non-canonical 5’ss yielded an alternative mRNA whose levels increased in stationary phase. Utilization of dtd1+ intron 1 sub-optimal 5’ ss required functional SpPrp18 and SpSlu7 while compromise in SpSlu7 function alone hampered the selection of the DUF3074 intron 1 non canonical 3’ss. We analysed the relative abundance of these splice isoforms during mild thermal, oxidative and heavy metal stress and found stress-specific splice patterns for ats1+ and DUF3074 intron 1 some of which were SpSlu7 and SpPrp18 dependent. By studying ats1+ splice isoforms during compromised transcription elongation rates in wild-type, spslu7-2 and spprp18-5 mutant cells we found dynamic and intron context-specific effects in splice-site choice. Our work thus shows the combinatorial effects of splice site strength, core splicing factor functions and transcription elongation kinetics to dictate alternative splice patterns which in turn serve as an additional recourse of gene regulation in fission yeast. PMID:29236736
Lamech, Lilian T.; Mallam, Anna L.; Lambowitz, Alan M.
2014-01-01
The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was “fixed” by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins. PMID:25536042
Lamech, Lilian T; Mallam, Anna L; Lambowitz, Alan M
2014-12-01
The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was "fixed" by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins.
Torrado, Mario; Iglesias, Raquel; Nespereira, Beatriz; Centeno, Alberto; López, Eduardo; Mikhailov, Alexander T
2009-07-01
The cardiac ankyrin repeat domain 1 protein (ANKRD1, also known as CARP) has been extensively characterized with regard to its proposed functions as a cardio-enriched transcriptional co-factor and stress-inducible myofibrillar protein. The present results show the occurrence of alternative splicing by intron retention events in the pig and human ankrd1 gene. In pig heart, ankrd1 is expressed as four alternatively spliced transcripts, three of which have non-excised introns: ankrd1-contained introns 6, 7 and 8 (i.e., ankrd1-i6,7,8), ankrd1-contained introns 7 and 8 (i.e., ankrd1-i7,8), and ankrd1 retained only intron 8 (i.e., ankrd1-i8). In the human heart, two orthologues of porcine intron-retaining ankrd1 variants (i.e., ankrd1-i8 and ankrd1-i7,8) are detected. We demonstrate that these newly-identified intron-retaining ankrd1 transcripts are functionally intact, efficiently translated into protein in vitro and exported to the cytoplasm in cardiomyocytes in vivo. In the piglet heart, both the intronless and intron-retaining ankrd1 mRNAs are co-expressed in a chamber-dependent manner being more abundant in the left as compared to the right myocardium. Our data further indicate co-upregulation of the ankrd1 spliced variants in myocardium in the porcine model of diastolic heart failure. Most significantly, we demonstrate that in vivo forced expression of recombinant intronless ankrd1 markedly increases the levels of intron-retaining ankrd1 variants (but not of the endogenous main transcript) in piglet myocardium, suggesting that ANKRD1 may positively regulate the expression of its own intron-containing RNAs in response to cardiac stress. Overall, our findings demonstrate that in cardiomyocytes ANKRD1 can exist in multiple isoforms which may contribute to the functional diversity of this factor in heart development and disease.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ploos van Amstel, H.; Reitsma, P.H.; van der Logt, C.P.
The human protein S locus on chromosome 3 consists of two protein S genes, PS{alpha} and PS{beta}. Here the authors report the cloning and characterization of both genes. Fifteen exons of the PS{alpha} gene were identified that together code for protein S mRNA as derived from the reported protein S cDNAs. Analysis by primer extension of liver protein S mRNA, however, reveals the presence of two mRNA forms that differ in the length of their 5{prime}-noncoding region. Both transcripts contain a 5{prime}-noncoding region longer than found in the protein S cDNAs. The two products may arise from alternative splicing ofmore » an additional intron in this region or from the usage of two start sites for transcription. The intron-exon organization of the PS{alpha} gene fully supports the hypothesis that the protein S gene is the product of an evolutional assembling process in which gene modules coding for structural/functional protein units also found in other coagulation proteins have been put upstream of the ancestral gene of a steroid hormone binding protein. The PS{beta} gene is identified as a pseudogene. It contains a large variety of detrimental aberrations, viz., the absence of exon I, a splice site mutation, three stop codons, and a frame shift mutation. Overall the two genes PS{alpha} and PS{beta} show between their exonic sequences 96.5% homology. Southern analysis of primate DNA showed that the duplication of the ancestral protein S gene has occurred after the branching of the orangutan from the African apes. A nonsense mutation that is present in the pseudogene of man also could be identified in one of the two protein S genes of both chimpanzee and gorilla. This implicates that silencing of one of the two protein S genes must have taken place before the divergence of the three African apes.« less
Dutta, Debargh; Gunasekera, Devi; Ragni, Margaret V; Pratt, Kathleen P
2016-12-27
The most frequent mutations resulting in hemophilia A are an intron 22 or intron 1 gene inversion, which together cause ∼50% of severe hemophilia A cases. We report a simple and accurate RNA-based assay to detect these mutations in patients and heterozygous carriers. The assays do not require specialized equipment or expensive reagents; therefore, they may provide useful and economic protocols that could be standardized for central laboratory testing. RNA is purified from a blood sample, and reverse transcription nested polymerase chain reaction (RT-NPCR) reactions amplify DNA fragments with the F8 sequence spanning the exon 22 to 23 splice site (intron 22 inversion test) or the exon 1 to 2 splice site (intron 1 inversion test). These sequences will be amplified only from F8 RNA without an intron 22 or intron 1 inversion mutation, respectively. Additional RT-NPCR reactions are then carried out to amplify the inverted sequences extending from F8 exon 19 to the first in-frame stop codon within intron 22 or a chimeric transcript containing F8 exon 1 and the VBP1 gene. These latter 2 products are produced only by individuals with an intron 22 or intron 1 inversion mutation, respectively. The intron 22 inversion mutations may be further classified (eg, as type 1 or type 2, reflecting the specific homologous recombination sites) by the standard DNA-based "inverse-shifting" PCR assay if desired. Efficient Bcl I and T4 DNA ligase enzymes that cleave and ligate DNA in minutes were used, which is a substantial improvement over previous protocols that required overnight incubations. These protocols can accurately detect F8 inversion mutations via same-day testing of patient samples.
Sultan, Laure D; Mileshina, Daria; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Głodowicz, Paweł; Niazi, Adnan Khan; Keren, Ido; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Mower, Jeffrey P; Dietrich, André; Ostersetzer-Biran, Oren
2016-11-01
Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. © 2016 American Society of Plant Biologists. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kerr, J.M.; Fisher, L.W.; Termine, J.D.
The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less
Diehl, William E.; Johnson, Welkin E.; Hunter, Eric
2013-01-01
All genes in the TRIM6/TRIM34/TRIM5/TRIM22 locus are type I interferon inducible, with TRIM5 and TRIM22 possessing antiviral properties. Evolutionary studies involving the TRIM6/34/5/22 locus have predominantly focused on the coding sequence of the genes, finding that TRIM5 and TRIM22 have undergone high rates of both non-synonymous nucleotide replacements and in-frame insertions and deletions. We sought to understand if divergent evolutionary pressures on TRIM6/34/5/22 coding regions have selected for modifications in the non-coding regions of these genes and explore whether such non-coding changes may influence the biological function of these genes. The transcribed genomic regions, including the introns, of TRIM6, TRIM34, TRIM5, and TRIM22 from ten Haplorhini primates and one prosimian species were analyzed for transposable element content. In Haplorhini species, TRIM5 displayed an exaggerated interspecies variability, predominantly resulting from changes in the composition of transposable elements in the large first and fourth introns. Multiple lineage-specific endogenous retroviral long terminal repeats (LTRs) were identified in the first intron of TRIM5 and TRIM22. In the prosimian genome, we identified a duplication of TRIM5 with a concomitant loss of TRIM22. The transposable element content of the prosimian TRIM5 genes appears to largely represent the shared Haplorhini/prosimian ancestral state for this gene. Furthermore, we demonstrated that one such differentially fixed LTR provides for species-specific transcriptional regulation of TRIM22 in response to p53 activation. Our results identify a previously unrecognized source of species-specific variation in the antiviral TRIM genes, which can lead to alterations in their transcriptional regulation. These observations suggest that there has existed long-term pressure for exaptation of retroviral LTRs in the non-coding regions of these genes. This likely resulted from serial viral challenges and provided a mechanism for rapid alteration of transcriptional regulation. To our knowledge, this represents the first report of persistent evolutionary pressure for the capture of retroviral LTR insertions. PMID:23516500
Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts.
1987-07-31
1{ 1. Project Goals: A. To determine the mechanism of tRNA intron processing in the halophilic archaebacteria. B. Characterize and compare the...enzyme(s) responsible for the removal of 5’-flanking sequences from halophilic and sulfur-dependent tRNA gene transcripts. C. Examine the structure and...distribution of tRNA introns in the halophilic archaebacteria. 2. Accomplishments: A. Intron processing mechanism We have succeeded in our primary
Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S
1988-01-01
The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130
Functionality of In vitro Reconstituted Group II Intron RmInt1-Derived Ribonucleoprotein Particles.
Molina-Sánchez, Maria D; García-Rodríguez, Fernando M; Toro, Nicolás
2016-01-01
The functional unit of mobile group II introns is a ribonucleoprotein particle (RNP) consisting of the intron-encoded protein (IEP) and the excised intron RNA. The IEP has reverse transcriptase activity but also promotes RNA splicing, and the RNA-protein complex triggers site-specific DNA insertion by reverse splicing, in a process called retrohoming. In vitro reconstituted ribonucleoprotein complexes from the Lactococcus lactis group II intron Ll.LtrB, which produce a double strand break, have recently been studied as a means of developing group II intron-based gene targeting methods for higher organisms. The Sinorhizobium meliloti group II intron RmInt1 is an efficient mobile retroelement, the dispersal of which appears to be linked to transient single-stranded DNA during replication. The RmInt1IEP lacks the endonuclease domain (En) and cannot cut the bottom strand to generate the 3' end to initiate reverse transcription. We used an Escherichia coli expression system to produce soluble and active RmInt1 IEP and reconstituted RNPs with purified components in vitro . The RNPs generated were functional and reverse-spliced into a single-stranded DNA target. This work constitutes the starting point for the use of group II introns lacking DNA endonuclease domain-derived RNPs for highly specific gene targeting methods.
Functionality of In vitro Reconstituted Group II Intron RmInt1-Derived Ribonucleoprotein Particles
Molina-Sánchez, Maria D.; García-Rodríguez, Fernando M.; Toro, Nicolás
2016-01-01
The functional unit of mobile group II introns is a ribonucleoprotein particle (RNP) consisting of the intron-encoded protein (IEP) and the excised intron RNA. The IEP has reverse transcriptase activity but also promotes RNA splicing, and the RNA-protein complex triggers site-specific DNA insertion by reverse splicing, in a process called retrohoming. In vitro reconstituted ribonucleoprotein complexes from the Lactococcus lactis group II intron Ll.LtrB, which produce a double strand break, have recently been studied as a means of developing group II intron-based gene targeting methods for higher organisms. The Sinorhizobium meliloti group II intron RmInt1 is an efficient mobile retroelement, the dispersal of which appears to be linked to transient single-stranded DNA during replication. The RmInt1IEP lacks the endonuclease domain (En) and cannot cut the bottom strand to generate the 3′ end to initiate reverse transcription. We used an Escherichia coli expression system to produce soluble and active RmInt1 IEP and reconstituted RNPs with purified components in vitro. The RNPs generated were functional and reverse-spliced into a single-stranded DNA target. This work constitutes the starting point for the use of group II introns lacking DNA endonuclease domain-derived RNPs for highly specific gene targeting methods. PMID:27730127
Toyoda, N; Kleinhaus, N; Larsen, P R
1996-06-01
We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
The in vivo use of alternate 3'-splice sites in group I introns.
Sellem, C H; Belcour, L
1994-04-11
Alternative splicing of group I introns has been postulated as a possible mechanism that would ensure the translation of proteins encoded into intronic open reading frames, discontinuous with the upstream exon and lacking an initiation signal. Alternate splice sites were previously depicted according to secondary structures of several group I introns. We present here strong evidence that, in the case of Podospora anserina nad 1-i4 and cox1-i7 mitochondrial introns, alternative splicing events do occur in vivo. Indeed, by PCR experiments we have detected molecules whose sequence is precisely that expected if the predicted alternate 3'-splice sites were used.
Land, language, and loci: mtDNA in Native Americans and the genetic history of Peru.
Lewis, Cecil M; Tito, Raúl Y; Lizárraga, Beatriz; Stone, Anne C
2005-07-01
Despite a long history of complex societies and despite extensive present-day linguistic and ethnic diversity, relatively few populations in Peru have been sampled for population genetic investigations. In order to address questions about the relationships between South American populations and about the extent of correlation between genetic distance, language, and geography in the region, mitochondrial DNA (mtDNA) hypervariable region I sequences and mtDNA haplogroup markers were examined in 33 individuals from the state of Ancash, Peru. These sequences were compared to those from 19 American Indian populations using diversity estimates, AMOVA tests, mismatch distributions, a multidimensional scaling plot, and regressions. The results show correlations between genetics, linguistics, and geographical affinities, with stronger correlations between genetics and language. Additionally, the results suggest a pattern of differential gene flow and drift in western vs. eastern South America, supporting previous mtDNA and Y chromosome investigations. (c) 2004 Wiley-Liss, Inc
Li, Wenli; Terenius, Olle; Hirai, Makoto; Nilsson, Anders S; Faye, Ingrid
2005-01-01
The Chinese oak silk moth Antheraea pernyi is an important silk producer. To understand microbial resistance of this moth, we cloned Hemolin, encoding a multifunctional immune protein belonging to the immunoglobulin superfamily, and examined the expression in gonads and fat body. The ApHemolin amino acid sequence was compared to other Hemolin sequences in order to predict functional sites. Several sites were conserved; among them a phosphate binding site, which according to 3D structure modelling does not appear in neuroglian, the phylogenetically closest related protein. In addition, two conserved KDG sequences in the C-C' loop of immunoglobulin domains 1 and 3, give rise to gamma-turns, which is a common motif in the C'-C'' loop of the hypervariable region L2 in vertebrate immunoglobulins. The comparisons also show variable regions of specific interest for future studies of hemolin and its interaction with microbial entities.
Xu, Feng-Ling; Ding, Mei; Yao, Jun; Shi, Zhang-Sen; Wu, Xue; Zhang, Jing-Jing; Pang, Hao; Xing, Jia-Xin; Xuan, Jin-Feng; Wang, Bao-Jie
2017-01-01
To determine whether mitochondrial DNA (mtDNA) variations are associated with schizophrenia, 313 patients with schizophrenia and 326 unaffected participants of the northern Chinese Han population were included in a prospective study. Single-nucleotide polymorphisms (SNPs) including C5178A, A10398G, G13708A, and C13928G were analyzed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). Hypervariable regions I and II (HVSI and HVSII) were analyzed by sequencing. The results showed that the 4 SNPs and 11 haplotypes, composed of the 4 SNPs, did not differ significantly between patient and control groups. No significant association between haplogroups and the risk of schizophrenia was ascertained after Bonferroni correction. Drawing a conclusion, there was no evidence of an association between mtDNA (the 4 SNPs and the control region) and schizophrenia in the northern Chinese Han population.
DNA analyses of the remains of the Prince Branciforte Barresi family.
Rickards, O; Martínez-Labarga, C; Favaro, M; Frezza, D; Mallegni, F
2001-01-01
The five skeletons found buried in the church of Militello di Catania, Sicily, were tentatively identified by morphological analysis and historical reports as the remains of Prince Branciforte Barresi, two of his children, his brother and another juvenile member of the family (sixteenth and seventeenth centuries). In order to attempt to clarify the degree of relationships of the five skeletons, sex testing and mitochondrial DNA (mtDNA) sequence analysis of the hypervariable segments I and II (HV1 and HV2) of control region were performed. Moreover, the 9 bp-deletion marker of region V (COII/tRNAlys) was examined. Molecular genetic analyses were consistent with historical expectations, although they did not directly demonstrate that these are in fact the remains of the Prince and his relatives, due to the impossibility of obtaining DNA from living maternal relatives of the Prince.
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A; Henriques, Telmo; McCue, Kayla; Burkholder, Adam; Adelman, Karen
2017-01-01
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly low variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing. PMID:29280736
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Pai, Athma A.; Henriques, Telmo; McCue, Kayla; ...
2017-12-27
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pai, Athma A.; Henriques, Telmo; McCue, Kayla
Production of most eukaryotic mRNAs requires splicing of introns from pre-mRNA. The splicing reaction requires definition of splice sites, which are initially recognized in either intron-spanning (‘intron definition’) or exon-spanning (‘exon definition’) pairs. To understand how exon and intron length and splice site recognition mode impact splicing, we measured splicing rates genome-wide in Drosophila, using metabolic labeling/RNA sequencing and new mathematical models to estimate rates. We found that the modal intron length range of 60–70 nt represents a local maximum of splicing rates, but that much longer exon-defined introns are spliced even faster and more accurately. We observed unexpectedly lowmore » variation in splicing rates across introns in the same gene, suggesting the presence of gene-level influences, and we identified multiple gene level variables associated with splicing rate. Together our data suggest that developmental and stress response genes may have preferentially evolved exon definition in order to enhance the rate or accuracy of splicing.« less
Protistan Biogeography: A Snapshot Across a Major Shipping Corridor Spanning Two Oceans.
Pagenkopp Lohan, Katrina M; Fleischer, Robert C; Torchin, Mark E; Ruiz, Gregory M
2017-04-01
Deciphering patterns of protistan taxa is a crucial step for understanding anthropogenic and environmental impacts on biogeography. We characterized and compared protistan communities from environmental samples collected along a major shipping corridor, the Panama Canal, and the Bocas del Toro archipelago. We used metabarcoding with high throughput sequencing (HTS) with the V4 hypervariable region of the ribosomal gene complex (rDNA). We detected many protistan taxa, including a variety of parasitic and toxic taxa. There were 1,296 OTUs shared across all three regions, with an additional 342-1,526 OTUs occurring across two or more regions, suggesting some mixing within the Caribbean and across the Isthmus. In general, this mixing did not impact community similarity, which was primarily distinct across regions. When OTUs identified as gregarines were analyzed separately, most samples grouped by region and communities were distinct across the Canal. Shipping traffic through the Panama Canal could move some taxa across regions; however, different environmental conditions in the two oceans may limit their establishment. Overall our results suggest that contemporary protistan biogeographic patterns are likely caused by a complex combination of factors, including anthropogenic dispersal and environmental tolerance. Published by Elsevier GmbH.
Sans, Monica; Figueiro, Gonzalo; Hidalgo, Pedro C
2012-06-01
Uruguayan population has been considered as of European descent, as its Native populations victims of genocide apparently disappeared in the 19th century. Contradicting this national belief, genetic studies have shown a substantial Native contribution. However, the continuity between prehistoric, historic, and present populations remains unproved. With the aim of adding elements to prove a possible population continuity, we studied a mitochondrial lineage, part of haplogroup C1, analyzing the complete genome of a modern Uruguayan individual and the hypervariable region I (HVRI) in prehistoric, historic, and contemporary individuals. Several individuals carried the mutations that characterize this lineage: two from an archaeological mound located in the east of the country, the Charrúa Indian chief Vaimaca Perú and five individuals from the present population. The lineage was initially characterized by its HVRI sequence, having the four typical C1 mutations and adding 16051G and 16288C; other mutations were also found: 16140C was found in all but the oldest individual, dated 1,610 years BP, while 16209C, 16422C, and 16519C were found only in some individuals. Hypervariable region II showed the typical C1 mutations and 194T. The coding region, analyzed in modern individuals, was characterized by 12378T, while other mutations found were not common to all of them. In summary, we have found and described a new lineage that shows continuity from prehistoric mound builders to the present population, through a representative of the extinct Charrúa Indians. The lineage appeared at least 1,600 years ago and is carried by approximately 0.7% of the modern Uruguayan population. The continuity of the lineage supports alternative perspectives about Uruguayan national identity and the meaning of the genocide, best labeled as ethnocide because of its consequences. It also contributes to the discussion about who the prehistoric mound builders were, and to the origin, at least in the maternal line, of a Charrúa Indian. From a more general perspective, we can conclude that the characteristics, evolution, and expansion of founder haplogroup C in America have not yet been elucidated.
Zhang, Libin; Leibowitz, Michael J; Zhang, Yi
2009-02-18
Self-splicing of group I intron from the 26S rRNA of Candida albicans is essential for maturation of the host RNA. Here, we demonstrated that the co-transcriptional splicing of the intron in vitro was blocked by antisense oligonucleotides (AONs) targeting the P3-P7 core of the intron. The core-targeted AON effectively and specifically inhibited the intron splicing from its host RNA in living C. albicans. Furthermore, flow cytometry experiments showed that the growth inhibition was caused by a fungicidal effect. For the first time, we showed that an AON targeting the ribozyme core folding specifically inhibits the endogenous ribozyme splicing in living cells and specifically kills the intron-containing fungal strains, which sheds light on the development of antifungal drugs in the future.
Bradley, Ian M; Pinto, Ameet J; Guest, Jeremy S
2016-10-01
The use of high-throughput sequencing technologies with the 16S rRNA gene for characterization of bacterial and archaeal communities has become routine. However, the adoption of sequencing methods for eukaryotes has been slow, despite their significance to natural and engineered systems. There are large variations among the target genes used for amplicon sequencing, and for the 18S rRNA gene, there is no consensus on which hypervariable region provides the most suitable representation of diversity. Additionally, it is unclear how much PCR/sequencing bias affects the depiction of community structure using current primers. The present study amplified the V4 and V8-V9 regions from seven microalgal mock communities as well as eukaryotic communities from freshwater, coastal, and wastewater samples to examine the effect of PCR/sequencing bias on community structure and membership. We found that degeneracies on the 3' end of the current V4-specific primers impact read length and mean relative abundance. Furthermore, the PCR/sequencing error is markedly higher for GC-rich members than for communities with balanced GC content. Importantly, the V4 region failed to reliably capture 2 of the 12 mock community members, and the V8-V9 hypervariable region more accurately represents mean relative abundance and alpha and beta diversity. Overall, the V4 and V8-V9 regions show similar community representations over freshwater, coastal, and wastewater environments, but specific samples show markedly different communities. These results indicate that multiple primer sets may be advantageous for gaining a more complete understanding of community structure and highlight the importance of including mock communities composed of species of interest. The quantification of error associated with community representation by amplicon sequencing is a critical challenge that is often ignored. When target genes are amplified using currently available primers, differential amplification efficiencies result in inaccurate estimates of community structure. The extent to which amplification bias affects community representation and the accuracy with which different gene targets represent community structure are not known. As a result, there is no consensus on which region provides the most suitable representation of diversity for eukaryotes. This study determined the accuracy with which commonly used 18S rRNA gene primer sets represent community structure and identified particular biases related to PCR amplification and Illumina MiSeq sequencing in order to more accurately study eukaryotic microbial communities. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
An accurate and efficient experimental approach for characterization of the complex oral microbiota.
Zheng, Wei; Tsompana, Maria; Ruscitto, Angela; Sharma, Ashu; Genco, Robert; Sun, Yijun; Buck, Michael J
2015-10-05
Currently, taxonomic interrogation of microbiota is based on amplification of 16S rRNA gene sequences in clinical and scientific settings. Accurate evaluation of the microbiota depends heavily on the primers used, and genus/species resolution bias can arise with amplification of non-representative genomic regions. The latest Illumina MiSeq sequencing chemistry has extended the read length to 300 bp, enabling deep profiling of large number of samples in a single paired-end reaction at a fraction of the cost. An increasingly large number of researchers have adopted this technology for various microbiome studies targeting the 16S rRNA V3-V4 hypervariable region. To expand the applicability of this powerful platform for further descriptive and functional microbiome studies, we standardized and tested an efficient, reliable, and straightforward workflow for the amplification, library construction, and sequencing of the 16S V1-V3 hypervariable region using the new 2 × 300 MiSeq platform. Our analysis involved 11 subgingival plaque samples from diabetic and non-diabetic human subjects suffering from periodontitis. The efficiency and reliability of our experimental protocol was compared to 16S V3-V4 sequencing data from the same samples. Comparisons were based on measures of observed taxonomic richness and species evenness, along with Procrustes analyses using beta(β)-diversity distance metrics. As an experimental control, we also analyzed a total of eight technical replicates for the V1-V3 and V3-V4 regions from a synthetic community with known bacterial species operon counts. We show that our experimental protocol accurately measures true bacterial community composition. Procrustes analyses based on unweighted UniFrac β-diversity metrics depicted significant correlation between oral bacterial composition for the V1-V3 and V3-V4 regions. However, measures of phylotype richness were higher for the V1-V3 region, suggesting that V1-V3 offers a deeper assessment of population diversity and community ecology for the complex oral microbiota. This study provides researchers with valuable experimental evidence for the selection of appropriate 16S amplicons for future human oral microbiome studies. We expect that the tested 16S V1-V3 framework will be widely applicable to other types of microbiota, allowing robust, time-efficient, and inexpensive examination of thousands of samples for population, phylogenetic, and functional crossectional and longitutidal studies.
Potvin, Eric; Beuret, Laurent; Cadrin-Girard, Jean-François; Carter, Marcelle; Roy, Sophie; Tremblay, Michel; Charron, Jean
2010-11-01
The precise expression of the N-myc proto-oncogene is essential for normal mammalian development, whereas altered N-myc gene regulation is known to be a determinant factor in tumor formation. Using transgenic mouse embryos, we show that N-myc sequences from kb -8.7 to kb +7.2 are sufficient to reproduce the N-myc embryonic expression profile in developing branchial arches and limb buds. These sequences encompass several regulatory elements dispersed throughout the N-myc locus, including an upstream limb bud enhancer, a downstream somite enhancer, a branchial arch enhancer in the second intron, and a negative regulatory element in the first intron. N-myc expression in the limb buds is under the dominant control of the limb bud enhancer. The expression in the branchial arches necessitates the interplay of three regulatory domains. The branchial arch enhancer cooperates with the somite enhancer region to prevent an inhibitory activity contained in the first intron. The characterization of the branchial arch enhancer has revealed a specific role of the transcription factor GATA3 in the regulation of N-myc expression. Together, these data demonstrate that correct N-myc developmental expression is achieved via cooperation of multiple positive and negative regulatory elements.
Schmidt, S; Pericak-Vance, M A; Sawcer, S; Barcellos, L F; Hart, J; Sims, J; Prokop, A M; van der Walt, J; DeLoa, C; Lincoln, R R; Oksenberg, J R; Compston, A; Hauser, S L; Haines, J L; Gregory, S G
2006-07-01
Discrepant findings have been reported regarding an association of the apolipoprotein E (APOE) gene with the clinical course of multiple sclerosis (MS). To resolve these discrepancies, we examined common sequence variation in six candidate genes residing in a 380-kb genomic region surrounding and including the APOE locus for an association with MS severity. We genotyped at least three polymorphisms in each of six candidate genes in 1,540 Caucasian MS families (729 single-case and multiple-case families from the United States, 811 single-case families from the UK). By applying the quantitative transmission/disequilibrium test to a recently proposed MS severity score, the only statistically significant (P=0.003) association with MS severity was found for an intronic variant in the Herpes Virus Entry Mediator-B Gene PVRL2. Additional genotyping extended the association to a 16.6 kb block spanning intron 1 to intron 2 of the gene. Sequencing of PVRL2 failed to identify variants with an obvious functional role. In conclusion, the analysis of a very large data set suggests that genetic polymorphisms in PVRL2 may influence MS severity and supports the possibility that viral factors may contribute to the clinical course of MS, consistent with previous reports.
Damak, Naourez; Abdeljalil, Salma; Taeib, Noomen Hadj; Gargouri, Ali
2015-08-01
The rhg gene encoding a rhamnogalacturonase was isolated from the novel strain A1 of Aspergillus niger. It consists of an ORF of 1.505 kb encoding a putative protein of 446 amino acids with a predicted molecular mass of 47 kDa, belonging to the family 28 of glycosyl hydrolases. The nature and position of amino acids comprising the active site as well as the three-dimensional structure were well conserved between the A. niger CTM10548 and fungal rhamnogalacturonases. The coding region of the rhg gene is interrupted by three short introns of 56 (introns 1 and 3) and 52 (intron 2) bp in length. The comparison of the peptide sequence with A. niger rhg sequences revealed that the A1 rhg should be an endo-rhamnogalacturonases, more homologous to rhg A than rhg B A. niger known enzymes. The comparison of rhg nucleotide sequence from A. niger A1 with rhg A from A. niger shows several base changes. Most of these changes (59 %) are located at the third base of codons suggesting maintaining the same enzyme function. We used the rhamnogalacturonase A from Aspergillus aculeatus as a template to build a structural model of rhg A1 that adopted a right-handed parallel β-helix.
Berthold, Peter; Schmitt, Rüdiger; Mages, Wolfgang
2002-12-01
We have developed a positively selectable marker for the green alga Chlamydomonas reinhardtii using the Streptomyces hygroscopicus aminoglycoside phosphotransferase gene (aph7"). Its expression is controlled by C. reinhardtii regulatory elements, namely, the beta2-tubulin gene promoter in combination with the first intron and the 3' untranslated region of the small subunit of ribulose bisphosphate carboxylase, rbcS2. C. reinhardtii cell-wall deficient and wild-type strains were transformed at rates up to 5 x 10(-5) with two constructs, pHyg3 and pHyg4 (intron-less). Transformants selected on plates with 10 microg/ml hygromycin B exhibited diverse levels of resistance of up to 200 microg/ml that were stably maintained for at least seven months; they contained two to five copies of the construct integrated in their genomes. Transcription of the chimeric aph7" gene, correct splicing of the rbcS2 intron, and polyadenylation of the transcripts have been verified by sequencing of RT-PCR products. Average co-transformation rates using pHyg3 and a second selectable plasmid were about 11%. This advocates the hygromycin-resistance plasmid, pHyg3, as a new versatile tool for the transformation of a broad range of C. reinhardtii strains without the sustained need for using auxotrophic mutants as recipients.
Chen, Fei; Zhou, Yu; Qi, Yingchuan B.; Khivansara, Vishal; Li, Hairi; Chun, Sang Young; Kim, John K.; Fu, Xiang-Dong; Jin, Yishi
2015-01-01
Alternative polyadenylation (APA) is widespread in neuronal development and activity-mediated neural plasticity. However, the underlying molecular mechanisms are largely unknown. We used systematic genetic studies and genome-wide surveys of the transcriptional landscape to identify a context-dependent regulatory pathway controlling APA in the Caenorhabditis elegans nervous system. Loss of function in ssup-72, a Ser5 phosphatase for the RNA polymerase II (Pol II) C-terminal domain (CTD), dampens transcription termination at a strong intronic polyadenylation site (PAS) in unc-44/ankyrin yet promotes termination at the weak intronic PAS of the MAP kinase dlk-1. A nuclear protein, SYDN-1, which regulates neuronal development, antagonizes the function of SSUP-72 and several nuclear polyadenylation factors. This regulatory pathway allows the production of a neuron-specific isoform of unc-44 and an inhibitory isoform of dlk-1. Dysregulation of the unc-44 and dlk-1 mRNA isoforms in sydn-1 mutants impairs neuronal development. Deleting the intronic PAS of unc-44 results in increased pre-mRNA processing of neuronal ankyrin and suppresses sydn-1 mutants. These results reveal a mechanism by which regulation of CTD phosphorylation controls coding region APA in the nervous system. PMID:26588990
Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C
1986-01-01
The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730
2014-01-01
Background In higher plants, inorganic nitrogen is assimilated via the glutamate synthase cycle or GS-GOGAT pathway. GOGAT enzyme occurs in two distinct forms that use NADH (NADH-GOGAT) or Fd (Fd-GOGAT) as electron carriers. The goal of the present study was to characterize wheat Fd-GOGAT genes and to assess the linkage with grain protein content (GPC), an important quantitative trait controlled by multiple genes. Results We report the complete genomic sequences of the three homoeologous A, B and D Fd-GOGAT genes from hexaploid wheat (Triticum aestivum) and their localization and characterization. The gene is comprised of 33 exons and 32 introns for all the three homoeologues genes. The three genes show the same exon/intron number and size, with the only exception of a series of indels in intronic regions. The partial sequence of the Fd-GOGAT gene located on A genome was determined in two durum wheat (Triticum turgidum ssp. durum) cvs Ciccio and Svevo, characterized by different grain protein content. Genomic differences allowed the gene mapping in the centromeric region of chromosome 2A. QTL analysis was conducted in the Svevo×Ciccio RIL mapping population, previously evaluated in 5 different environments. The study co-localized the Fd-GOGAT-A gene with the marker GWM-339, identifying a significant major QTL for GPC. Conclusions The wheat Fd-GOGAT genes are highly conserved; both among the three homoeologous hexaploid wheat genes and in comparison with other plants. In durum wheat, an association was shown between the Fd-GOGAT allele of cv Svevo with increasing GPC - potentially useful in breeding programs. PMID:25099972
Krisinger, J; Jeung, E B; Simmen, R C; Leung, P C
1995-01-01
The expression of Calbindin-D9k (CaBP-9k) in the pig uterus and placenta was measured by Northern blot analysis and reverse transcription polymerase chain reaction (PCR), respectively. Progesterone (P4) administration to ovariectomized pigs decreased CaBP-9k mRNA levels. Expression of endometrial CaBP-9k mRNA was high on pregnancy Days 10-12 and below the detection limit on Days 15 and 18. On Day 60, expression could be detected at low levels. In myometrium and placenta, CaBP-9k mRNA expression was not detectable by Northern analysis using total RNA. Reverse-transcribed RNA from both tissues demonstrated the presence of CaBP-9k transcripts by means of PCR. The partial CaBP-9k gene was amplified by PCR and cloned to determine the sequence of intron A. In contrast to the rat CaBP-9k gene, the pig gene does not contain a functional estrogen response element (ERE) within this region. A similar ERE-like sequence located at the identical location was examined by gel retardation analysis and failed to bind the estradiol receptor. A similar disruption of this ERE-like sequence has been described in the human CaBP-9k gene, which is not expressed at any level in placenta, myometrium, or endometrium. It is concluded that the pig CaBP-9k gene is regulated in these reproductive tissues in a manner distinct from that in rat and human tissues. The regulation is probably due to a regulatory region outside of intron A, which in the rat gene contains the key cis element for uterine expression of the CaBP-9k gene.
van der Klift, Heleen M; Jansen, Anne M L; van der Steenstraten, Niki; Bik, Elsa C; Tops, Carli M J; Devilee, Peter; Wijnen, Juul T
2015-01-01
A subset of DNA variants causes genetic disease through aberrant splicing. Experimental splicing assays, either RT-PCR analyses of patient RNA or functional splicing reporter minigene assays, are required to evaluate the molecular nature of the splice defect. Here, we present minigene assays performed for 17 variants in the consensus splice site regions, 14 exonic variants outside these regions, and two deep intronic variants, all in the DNA mismatch-repair (MMR) genes MLH1, MSH2, MSH6, and PMS2, associated with Lynch syndrome. We also included two deep intronic variants in APC and PKD2. For one variant (MLH1 c.122A>G), our minigene assay and patient RNA analysis could not confirm the previously reported aberrant splicing. The aim of our study was to further investigate the concordance between minigene splicing assays and patient RNA analyses. For 30 variants results from patient RNA analyses were available, either performed by our laboratory or presented in literature. Some variants were deliberately included in this study because they resulted in multiple aberrant transcripts in patient RNA analysis, or caused a splice effect other than the prevalent exon skip. While both methods were completely concordant in the assessment of splice effects, four variants exhibited major differences in aberrant splice patterns. Based on the present and earlier studies, together showing an almost 100% concordance of minigene assays with patient RNA analyses, we discuss the weight given to minigene splicing assays in the current criteria proposed by InSiGHT for clinical classification of MMR variants. PMID:26247049
Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M
2015-05-15
The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
Ruvolo, Vivian; Wang, Eryu; Boyle, Sarah; Swaminathan, Sankar
1998-01-01
The Epstein–Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3′-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport. PMID:9671768
Ruvolo, V; Wang, E; Boyle, S; Swaminathan, S
1998-07-21
The Epstein-Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3'-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport.
Abbasi, Amir A; Minhas, Rashid; Schmidt, Ansgar; Koch, Sabine; Grzeschik, Karl-Heinz
2013-10-01
The zinc finger transcription factor Gli3 is an important mediator of Sonic hedgehog (Shh) signaling. During early embryonic development Gli3 participates in patterning and growth of the central nervous system, face, skeleton, limb, tooth and gut. Precise regulation of the temporal and spatial expression of Gli3 is crucial for the proper specification of these structures in mammals and other vertebrates. Previously we reported a set of human intronic cis-regulators controlling almost the entire known repertoire of endogenous Gli3 expression in mouse neural tube and limbs. However, the genetic underpinning of GLI3 expression in other embryonic domains such as craniofacial structures and internal organs remain elusive. Here we demonstrate in a transgenic mice assay the potential of a subset of human/fish conserved non-coding sequences (CNEs) residing within GLI3 intronic intervals to induce reporter gene expression at known regions of endogenous Gli3 transcription in embryonic domains other than central nervous system (CNS) and limbs. Highly specific reporter expression was observed in craniofacial structures, eye, gut, and genitourinary system. Moreover, the comparison of expression patterns directed by these intronic cis-acting regulatory elements in mouse and zebrafish embryos suggests that in accordance with sequence conservation, the target site specificity of a subset of these elements remains preserved among these two lineages. Taken together with our recent investigations, it is proposed here that during vertebrate evolution the Gli3 expression control acquired multiple, independently acting, intronic enhancers for spatiotemporal patterning of CNS, limbs, craniofacial structures and internal organs. © 2013 The Authors Development, Growth & Differentiation © 2013 Japanese Society of Developmental Biologists.
Three Group-I introns in 18S rDNA of Endosymbiotic Algae of Paramecium bursaria from Japan
NASA Astrophysics Data System (ADS)
Hoshina, Ryo; Kamako, Shin-ichiro; Imamura, Nobutaka
2004-08-01
In the nuclear encoded small subunit ribosomal DNA (18S rDNA) of symbiotic alga of Paramecium bursaria (F36 collected in Japan) possesses three intron-like insertions (Hoshina et al., unpubl. data, 2003). The present study confirmed these exact lengths and insertion sites by reverse transcription-PCR. Two of them were inserted at Escherichia coli 16S rRNA genic position 943 and 1512 that are frequent intron insertion positions, but another insertion position (nearly 1370) was the first finding. Their secondary structures suggested they belong to Group-I intron; one belongs to subgroup IE, others belong to subgroup IC1. Similarity search indicated these introns are ancestral ones.
Selfish DNA: homing endonucleases find a home.
Edgell, David R
2009-02-10
Self-splicing group I introns come in two flavours - those with a homing endonuclease to promote mobility of the intron, and those without an endonuclease. How homing endonucleases and self-splicing introns associate to form a composite selfish genetic element is a question of long-standing interest. Recent work has revealed that a shared characteristic of both introns and endonucleases, the targeting of conserved sequences, may provide the impetus for the evolution of composite mobile genetic elements.
Emblem, Åse; Karlsen, Bård Ove; Evertsen, Jussi; Johansen, Steinar D
2011-11-01
Group I introns are genetic insertion elements that invade host genomes in a wide range of organisms. In metazoans, however, group I introns are extremely rare, so far only identified within mitogenomes of hexacorals and some sponges. We sequenced the complete mitogenome of the cold-water scleractinian coral Lophelia pertusa, the dominating deep sea reef-building coral species in the North Atlantic Ocean. The mitogenome (16,150 bp) has the same gene content but organized in a unique gene order compared to that of other known scleractinian corals. A complex group I intron (6460 bp) inserted in the ND5 gene (position 717) was found to host seven essential mitochondrial protein genes and one ribosomal RNA gene. Phylogenetic analysis supports a vertical inheritance pattern of the ND5-717 intron among hexacoral mitogenomes with no examples of intron loss. Structural assessments of the Lophelia intron revealed an unusual organization that lacks the universally conserved ωG at the 3' end, as well as a highly compact RNA core structure with overlapping ribozyme and protein coding capacities. Based on phylogenetic and structural analyses we reconstructed the evolutionary history of ND5-717, from its ancestral protist origin, through intron loss in some early metazoan lineages, and into a compulsory feature with functional implications in hexacorals. Copyright © 2011 Elsevier Inc. All rights reserved.
Zhou, Lijuan; Powell, Charles A.; Hoffman, Michele T.; Li, Wenbin; Fan, Guocheng; Liu, Bo; Lin, Hong; Duan, Yongping
2011-01-01
“Candidatus Liberibacter asiaticus” is a psyllid-transmitted, phloem-limited alphaproteobacterium and the most prevalent species of “Ca. Liberibacter” associated with a devastating worldwide citrus disease known as huanglongbing (HLB). Two related and hypervariable genes (hyvI and hyvII) were identified in the prophage regions of the Psy62 “Ca. Liberibacter asiaticus” genome. Sequence analyses of the hyvI and hyvII genes in 35 “Ca. Liberibacter asiaticus” DNA isolates collected globally revealed that the hyvI gene contains up to 12 nearly identical tandem repeats (NITRs, 132 bp) and 4 partial repeats, while hyvII contains up to 2 NITRs and 4 partial repeats and shares homology with hyvI. Frequent deletions or insertions of these repeats within the hyvI and hyvII genes were observed, none of which disrupted the open reading frames. Sequence conservation within the individual repeats but an extensive variation in repeat numbers, rearrangement, and the sequences flanking the repeat region indicate the diversity and plasticity of “Ca. Liberibacter asiaticus” bacterial populations in the world. These differences were found not only in samples of distinct geographical origins but also in samples from a single origin and even from a single “Ca. Liberibacter asiaticus”-infected sample. This is the first evidence of different “Ca. Liberibacter asiaticus” populations coexisting in a single HLB-affected sample. The Florida “Ca. Liberibacter asiaticus” isolates contain both hyvI and hyvII, while all other global “Ca. Liberibacter asiaticus” isolates contain either one or the other. Interclade assignments of the putative HyvI and HyvII proteins from Florida isolates with other global isolates in phylogenetic trees imply multiple “Ca. Liberibacter asiaticus” populations in the world and a multisource introduction of the “Ca. Liberibacter asiaticus” bacterium into Florida. PMID:21784907
Gentil, Coline A; Gammill, Hilary S; Luu, Christine T; Mayes, Maureen D; Furst, Dan E; Nelson, J Lee
2017-03-07
Specific HLA class II alleles are associated with systemic sclerosis (SSc) risk, clinical characteristics, and autoantibodies. HLA nomenclature initially developed with antibodies as typing reagents defining DRB1 allele groups. However, alleles from different DRB1 allele groups encode the same third hypervariable region (3rd HVR) sequence, the primary T-cell recognition site, and 3rd HVR charge differences can affect interactions with T cells. We considered 3rd HVR sequences (amino acids 67-74) irrespective of the allele group and analyzed parental inheritance considered according to the 3rd HVR charge, comparing SSc patients with controls. In total, 306 families (121 SSc and 185 controls) were HLA genotyped and parental HLA-haplotype origin was determined. Analysis was conducted according to DRβ1 3rd HVR sequence, charge, and parental inheritance. The distribution of 3rd HVR sequences differed in SSc patients versus controls (p = 0.007), primarily due to an increase of specific DRB1*11 alleles, in accord with previous observations. The 3rd HVR sequences were next analyzed according to charge and parental inheritance. Paternal transmission of DRB1 alleles encoding a +2 charge 3rd HVR was significantly reduced in SSc patients compared with maternal transmission (p = 0.0003, corrected for analysis of four charge categories p = 0.001). To a lesser extent, paternal transmission was increased when charge was 0 (p = 0.021, corrected for multiple comparisons p = 0.084). In contrast, paternal versus maternal inheritance was similar in controls. SSc patients differed from controls when DRB1 alleles were categorized according to 3rd HVR sequences. Skewed parental inheritance was observed in SSc patients but not in controls when the DRβ1 3rd HVR was considered according to charge. These observations suggest that epigenetic modulation of HLA merits investigation in SSc.
Mans, Ben J; Pienaar, Ronel; Ratabane, John; Pule, Boitumelo; Latif, Abdalla A
2016-07-01
Molecular classification and systematics of the Theileria is based on the analysis of the 18S rRNA gene. Reverse line blot or conventional sequencing approaches have disadvantages in the study of 18S rRNA diversity and a next-generation 454 sequencing approach was investigated. The 18S rRNA gene was amplified using RLB primers coupled to 96 unique sequence identifiers (MIDs). Theileria positive samples from African buffalo (672) and cattle (480) from southern Africa were combined in batches of 96 and sequenced using the GS Junior 454 sequencer to produce 825711 informative sequences. Sequences were extracted based on MIDs and analysed to identify Theileria genotypes. Genotypes observed in buffalo and cattle were confirmed in the current study, while no new genotypes were discovered. Genotypes showed specific geographic distributions, most probably linked with vector distributions. Host specificity of buffalo and cattle specific genotypes were confirmed and prevalence data as well as relative parasitemia trends indicate preference for different hosts. Mixed infections are common with African buffalo carrying more genotypes compared to cattle. Associative or exclusion co-infection profiles were observed between genotypes that may have implications for speciation and systematics: specifically that more Theileria species may exist in cattle and buffalo than currently recognized. Analysis of primers used for Theileria parva diagnostics indicate that no new genotypes will be amplified by the current primer sets confirming their specificity. T. parva SNP variants that occur in the 18S rRNA hypervariable region were confirmed. A next generation sequencing approach is useful in obtaining comprehensive knowledge regarding 18S rRNA diversity and prevalence for the Theileria, allowing for the assessment of systematics and diagnostic assays based on the 18S gene. Copyright © 2016 Elsevier GmbH. All rights reserved.
Post-transcriptional regulation mediated by specific neurofilament introns in vivo.
Wang, Chen; Szaro, Ben G
2016-04-01
Neurons regulate genes post-transcriptionally to coordinate the supply of cytoskeletal proteins, such as the medium neurofilament (NEFM), with demand for structural materials in response to extracellular cues encountered by developing axons. By using a method for evaluating functionality of cis-regulatory gene elements in vivo through plasmid injection into Xenopus embryos, we discovered that splicing of a specific nefm intron was required for robust transgene expression, regardless of promoter or cell type. Transgenes utilizing the nefm 3'-UTR but substituting other nefm introns expressed little or no protein owing to defects in handling of the messenger (m)RNA as opposed to transcription or splicing. Post-transcriptional events at multiple steps, but mainly during nucleocytoplasmic export, contributed to these varied levels of protein expression. An intron of the β-globin gene was also able to promote expression in a manner identical to that of the nefm intron, implying a more general preference for certain introns in controlling nefm expression. These results expand our knowledge of intron-mediated gene expression to encompass neurofilaments, indicating an additional layer of complexity in the control of a cytoskeletal gene needed for developing and maintaining healthy axons. © 2016. Published by The Company of Biologists Ltd.
Horiuchi, Keiko; Perez-Cerezales, Serafín; Papasaikas, Panagiotis; Ramos-Ibeas, Priscila; López-Cardona, Angela Patricia; Laguna-Barraza, Ricardo; Fonseca Balvís, Noelia; Pericuesta, Eva; Fernández-González, Raul; Planells, Benjamín; Viera, Alberto; Suja, Jose Angel; Ross, Pablo Juan; Alén, Francisco; Orio, Laura; Rodriguez de Fonseca, Fernando; Pintado, Belén; Valcárcel, Juan; Gutiérrez-Adán, Alfonso
2018-04-03
The U2AF35-like ZRSR1 has been implicated in the recognition of 3' splice site during spliceosome assembly, but ZRSR1 knockout mice do not show abnormal phenotypes. To analyze ZRSR1 function and its precise role in RNA splicing, we generated ZRSR1 mutant mice containing truncating mutations within its RNA-recognition motif. Homozygous mutant mice exhibited severe defects in erythrocytes, muscle stretch, and spermatogenesis, along with germ cell sloughing and apoptosis, ultimately leading to azoospermia and male sterility. Testis RNA sequencing (RNA-seq) analyses revealed increased intron retention of both U2- and U12-type introns, including U12-type intron events in genes with key functions in spermatogenesis and spermatid development. Affected U2 introns were commonly found flanking U12 introns, suggesting functional cross-talk between the two spliceosomes. The splicing and tissue defects observed in mutant mice attributed to ZRSR1 loss of function suggest a physiological role for this factor in U12 intron splicing. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y
1992-01-01
The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
Cordero, David; Peña, Juan B; Saavedra, Carlos
2014-02-01
Studies on the phylogeography of species inhabiting the Mediterranean and the nearby coasts of the NE Atlantic Ocean (MEDAT) have found subdivision and/or phylogeographic structure in one or more of the Atlantic, western Mediterranean and eastern Mediterranean basins. This structure has been explained as the result of past population fragmentation caused by Pleistocene sea level changes and current patterns of marine circulation. However, the increasing use of nuclear markers has revealed that these two factors alone are not enough to explain the phylogeographic patterns, and an additional role has been suggested for endogenous barriers to gene flow or natural selection. In this article we examined the role of these factors in Ruditapes decussatus, a commercial clam species native to MEDAT. A genetic analysis of 11 populations was carried out by examining 6 introns with a PCR-RFLP technique. We found subdivision in three regions: Atlantic (ATL), western Mediterranean plus Tunisia (WMED), and Aegean and Adriatic seas (AEGAD). Two introns (Ech and Tbp) showed alleles that were restricted to AEGAD. Sequencing a subsample of individuals for these introns indicated that AEGAD-specific alleles were separate clades, thus revealing a phylogeographic brake at the WMED-AEGAD boundary. Sequencing of the mitochondrial COI locus confirmed this phylogeographic break. Dating of the AEGAD mitochondrial haplotypes and nuclear alleles with a Bayesian MCMC method revealed that they shared common ancestors in the Pleistocene. These results can be explained in the framework of Pleistocene sea level drops and patterns of gene flow in MEDAT. An additional observation was a lack of differentiation at COI between the ATL and WMED, in sharp contrast with 4 introns that showed clear genetic subdivision. Neutrality tests did not support the hypothesis of a selective sweep acting on mtDNA to explain the contrasting levels of differentiation between mitochondrial and nuclear markers across the ATL-WMED transition, and we argue that the difference between markers is best explained by the existence of an endogenous genetic barrier, rather than by a physical barrier to larval migration alone. Copyright © 2013 Elsevier Inc. All rights reserved.
Kaer, Kristel; Branovets, Jelena; Hallikma, Anni; Nigumann, Pilvi; Speek, Mart
2011-01-01
Background Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown. Methodology/Principal Findings Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3′ ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs) and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals. Conclusions/Significance Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression. PMID:22022525
Berends Sexton, T; Jones, J T; Mullet, J E
1990-05-01
A 6.25 kbp barley plastid DNA region located between psbA and psbD-psbC were sequenced and RNAs produced from this DNA were analyzed. TrnK(UUU), rps16 and trnQ(UUG) were located upstream of psbA. These genes were transcribed from the same DNA strand as psbA and multiple RNAs hybridized to them. TrnK and rsp16 contained introns; a 504 amino acid open reading frame (ORF504) was located within the trnK intron. Between trnQ and psbD-psbC was a 2.24 kbp region encoding psbK, psbI and trnS(GCU). PsbK and psbI are encoded on the same DNA strand as psbD-psbC whereas trnS(GCU) is transcribed from the opposite strand. Two large RNAs accumulate in barley etioplasts which contain psbK, psbI, anti-sense trnS(GCU) and psbD-psbC sequences. Other RNAs encode psbK and psbI only, or psbK only. The divergent trnS(GCU) located upstream of psbD-psbC and a second divergent trnS(UGA) located downstream of psbD-psbC were both expressed. Furthermore, RNA complementary to psbK and psbI mRNA was detected, suggesting that transcription from divergent overlapping transcription units may modulate expression from this DNA region.
Schausi, Diane; Tiffoche, Christophe; Thieulant, Marie-Lise
2003-07-01
We have characterized the intronic promoter of the rat estrogen receptor (ER) alpha gene, responsible for the lactotrope-specific truncated ER product (TERP)-1 isoform expression. Transcriptional regulation was investigated by transient transfections using 5'-deletion constructs. TERP promoter constructs were highly active in MMQ cells, a pure lactotrope cell line, whereas a low basal activity was detected in alphaT3-1 gonadotrope cells or in COS-7 monkey kidney cells. Serial deletion analysis revealed that 1) a minimal -693-bp region encompassing the TATA box is sufficient to allow lactotrope-specific expression; 2) the promoter contains strong positive cis-acting elements both in the distal and proximal regions, and 3) the region spanning the -1698/-1194 region includes repressor elements. Transient transfection studies, EMSAs, and gel shifts demonstrated that estrogen activates the TERP promoter via an estrogen-responsive element (ERE1) located within the proximal region. Mutation of ERE1 site completely abolishes the estradiol-dependent transcription, indicating that ERE1 site is sufficient to confer estrogen responsiveness to TERP promoter. In addition, ERalpha action was synergized by transfection of the pituitary-specific factor Pit-1. EMSAs showed that a single Pit-1 DNA binding element in the vicinity of the TATA box is sufficient to confer response by the TERP promoter. In conclusion, we demonstrated, for the first time, that TERP promoter regulation involves ERE and Pit-1 cis-elements and corresponding trans-acting factors, which could play a role in the physiological changes that occur in TERP-1 transcription in lactotrope cells.
Ryu, Dongchan; Ryu, Jihye; Lee, Chaeyoung
2016-05-01
A genome-wide association study (GWAS) was conducted to examine genetic associations of common autosomal nucleotide variants with sex in a Korean population with 4183 males and 4659 females. Nine genetic association signals were identified in four intragenic and five intergenic regions (P<5 × 10(-8)). Further analysis with an independent data set confirmed two intragenic association signals in the genes encoding protein phosphatase 1, regulatory subunit 12B (PPP1R12B, intron 12, rs1819043) and dynein, axonemal, heavy chain 11 (DNAH11, intron 61, rs10255013), which are directly involved in the reproductive system. This study revealed autosomal genetic variants associated with sex ratio by GWAS for the first time. This implies that genetic variants in proximity to the association signals may influence sex-specific selection and contribute to sex ratio variation. Further studies are required to reveal the mechanisms underlying sex-specific selection.
Circular RNAs and hereditary bone diseases.
Zhai, Naixiang; Lu, Yanqin; Wang, Yanzhou; Ren, Xiuzhi; Han, Jinxiang
2018-02-01
Circular RNA (circRNA) is a non-linear form of RNA derived from exonic, intronic, and exon-intron gene regions. circRNAs are characterized by covalent closed loops, highly stable nuclease resistance, and specific expression in species and developmental stages. CircRNA molecules have been identified as playing roles in the regulation of cell transcription, transcriptional expression after translation, interactions with microRNAs, and protein coding. A high stability and tissue- and disease-specific expression allow circRNAs to serve as potential biomarkers both for diseases and prognosis. CircRNAs function in bone remodeling by directly participating in bone-related signaling pathways and by forming the circRNA-miRNA-mRNA axis. Studies have seldom reported on the low incidence of circRNAs in genetic bone disorders. The current study reviews the characteristics of circRNAs and recent research on their role in rare hereditary bone diseases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ferrari, S.; Finelli, P.; Rocchi, M.
The human genome contains a large number of sequences related to the cDNA for High Mobility Group 1 protein (HMG1), which so far has hampered the cloning and mapping of the active HMG1 gene. We show that the human HMG1 gene contains introns, while the HMG1-related sequences do not and most likely are retrotransposed pseudogenes. We identified eight YACs from the ICI and CEPH libraries that contain the human HMG1 gene. The HMG1 gene is similar in structure to the previously characterized murine homologue and maps to human chromosome 13 and q12, as determined by in situ hybridization. The mousemore » Hmg1 gene maps to the telomeric region of murine Chromosome 5, which is syntenic to the human 13q12 band. 18 refs., 3 figs.« less
Invariant U2 snRNA nucleotides form a stem loop to recognize the intron early in splicing
Perriman, Rhonda; Ares, Manuel
2010-01-01
U2 snRNA-intron branchpoint pairing is a critical step in pre-mRNA recognition by the splicing apparatus, but the mechanism by which these two RNAs engage each other is unknown. Here we identify a new U2 snRNA structure, the branchpoint interaction stem-loop (BSL), that presents the U2 nucleotides that will contact the intron. We provide evidence that the BSL forms prior to interaction with the intron, and is disrupted by the DExD/H protein Prp5p during engagement of the snRNA with the intron. In vitro splicing complex assembly in a BSL-destabilized mutant extract suggests that the BSL is required at a previously unrecognized step between commitment complex and prespliceosome formation. The extreme evolutionary conservation of the BSL suggests it represents an ancient structural solution to the problem of intron branchpoint recognition by dynamic RNA elements that must serve multiple functions at other times during splicing. PMID:20471947
Major clades of Agaricales: a multilocus phylogenetic overview.
P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Lodge Hughes; Richard W. Kerrigan; Michelle T. Seidl; Aanen; Matthew Duur K. DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett
2006-01-01
An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...
Gene organization and alternative splicing of human prohormone convertase PC8.
Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T
1998-01-01
The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuppens, H.; Marynen, P.; Cassiman, J.J.
1993-12-01
The authors have previously shown that about 85% of the mutations in 194 Belgian cystic fibrosis alleles could be detected by a reverse dot-blot assay. In the present study, 50 Belgian chromosomes were analyzed for mutations in the cystic fibrosis transmembrane conductance regulator gene by means of direct solid phase automatic sequencing of PCR products of individual exons. Twenty-six disease mutations and 14 polymorphisms were found. Twelve of these mutations and 3 polymorphisms were not described before. With the exception of one mutant allele carrying two mutations, these mutations were the only mutations found in the complete coding region andmore » their exon/intron boundaries. The total sensitivity of mutant CF alleles that could be identified was 98.5%. Given the heterogeneity of these mutations, most of them very rare, CFTR mutation screening still remains rather complex in the population, and population screening, whether desirable or not, does not appear to be technically feasible with the methods currently available. 24 refs., 1 fig., 2 tabs.« less
Dynamics of DNA methylomes underlie oyster development
Sourdaine, Pascal; Guo, Ximing; Favrel, Pascal
2017-01-01
DNA methylation is a critical epigenetic regulator of development in mammals and social insects, but its significance in development outside these groups is not understood. Here we investigated the genome-wide dynamics of DNA methylation in a mollusc model, the oyster Crassostrea gigas, from the egg to the completion of organogenesis. Large-scale methylation maps reveal that the oyster genome displays a succession of methylated and non methylated regions, which persist throughout development. Differentially methylated regions (DMRs) are strongly regulated during cleavage and metamorphosis. The distribution and levels of methylated DNA within genomic features (exons, introns, promoters, repeats and transposons) show different developmental lansdscapes marked by a strong increase in the methylation of exons against introns after metamorphosis. Kinetics of methylation in gene-bodies correlate to their transcription regulation and to distinct functional gene clusters, and DMRs at cleavage and metamorphosis bear the genes functionally related to these steps, respectively. This study shows that DNA methylome dynamics underlie development through transcription regulation in the oyster, a lophotrochozoan species. To our knowledge, this is the first demonstration of such epigenetic regulation outside vertebrates and ecdysozoan models, bringing new insights into the evolution and the epigenetic regulation of developmental processes. PMID:28594821
Dynamics of DNA methylomes underlie oyster development.
Riviere, Guillaume; He, Yan; Tecchio, Samuele; Crowell, Elizabeth; Gras, Michaël; Sourdaine, Pascal; Guo, Ximing; Favrel, Pascal
2017-06-01
DNA methylation is a critical epigenetic regulator of development in mammals and social insects, but its significance in development outside these groups is not understood. Here we investigated the genome-wide dynamics of DNA methylation in a mollusc model, the oyster Crassostrea gigas, from the egg to the completion of organogenesis. Large-scale methylation maps reveal that the oyster genome displays a succession of methylated and non methylated regions, which persist throughout development. Differentially methylated regions (DMRs) are strongly regulated during cleavage and metamorphosis. The distribution and levels of methylated DNA within genomic features (exons, introns, promoters, repeats and transposons) show different developmental lansdscapes marked by a strong increase in the methylation of exons against introns after metamorphosis. Kinetics of methylation in gene-bodies correlate to their transcription regulation and to distinct functional gene clusters, and DMRs at cleavage and metamorphosis bear the genes functionally related to these steps, respectively. This study shows that DNA methylome dynamics underlie development through transcription regulation in the oyster, a lophotrochozoan species. To our knowledge, this is the first demonstration of such epigenetic regulation outside vertebrates and ecdysozoan models, bringing new insights into the evolution and the epigenetic regulation of developmental processes.
Cruts, M; Backhovens, H; Van Gassen, G; Theuns, J; Wang, S Y; Wehnert, A; van Duijn, C M; Karlsson, T; Hofman, A; Adolfsson, R
1995-10-13
Linkage analysis studies have indicated that the chromosome band 14q24.3 harbours a major gene for familial early-onset Alzheimer's disease (AD). Recently we localized the chromosome 14 AD gene (AD3) in the 6.4 cM interval between the markers D14S289 and D14S61. We mapped the gene encoding dihydrolipoyl succinyltransferase (DLST), the E2k component of human alpha-ketoglutarate dehydrogenase complex (KGDHC), in the AD3 candidate region using yeast artificial chromosomes (YACs). The DLST gene is a candidate for the AD3 gene since deficiencies in KGDHC activity have been observed in brain tissue and fibroblasts of AD patients. The 15 exons and the promoter region of the DLST gene were analysed for mutations in chromosome 14 linked AD cases and in two series of unrelated early-onset AD cases (onset age < 55 years). Sequence variations in intronic sequences (introns 3, 5 and 10) or silent mutations in exonic sequences (exons 8 and 14) were identified. However, no AD related mutations were observed, suggesting that the DLST gene is not the chromosome 14 AD3 gene.
Jaramillo-Correa, J P; Bousquet, J; Beaulieu, J; Isabel, N; Perron, M; Bouillé, M
2003-05-01
Primers previously developed to amplify specific non-coding regions of the mitochondrial genome in Angiosperms, and new primers for additional non-coding mtDNA regions, were tested for their ability to direct DNA amplification in 12 conifer taxa and to detect sequence-tagged-site (STS) polymorphisms within and among eight species in Picea. Out of 12 primer pairs, nine were successful at amplifying mtDNA in most of the taxa surveyed. In conifers, indels and substitutions were observed for several loci, allowing them to distinguish between families, genera and, in some cases, between species within genera. In Picea, interspecific polymorphism was detected for four loci, while intraspecific variation was observed for three of the mtDNA regions studied. One of these (SSU rRNA V1 region) exhibited indel polymorphisms, and the two others ( nad1 intron b/c and nad5 intron1) revealed restriction differences after digestion with Sau3AI (PCR-RFLP). A fourth locus, the nad4L- orf25 intergenic region, showed a multibanding pattern for most of the spruce species, suggesting a possible gene duplication. Maternal inheritance, expected for mtDNA in conifers, was observed for all polymorphic markers except the intergenic region nad4L- orf25. Pooling of the variation observed with the remaining three markers resulted in two to six different mtDNA haplotypes within the different species of Picea. Evidence for intra-genomic recombination was observed in at least two taxa. Thus, these mitotypes are likely to be more informative than single-locus haplotypes. They should be particularly useful for the study of biogeography and the dynamics of hybrid zones.
Distinctive archaebacterial species associated with anaerobic rumen protozoan Entodinium caudatum.
Tóthová, T; Piknová, M; Kisidayová, S; Javorský, P; Pristas, P
2008-01-01
The diversity of archaebacteria associated with anaerobic rumen protozoan Entodinium caudatum in long term in vitro culture was investigated by denaturing gradient gel electrophoresis (DGGE) analysis of hypervariable V3 region of archaebacterial 16S rRNA gene. PCR was accomplished directly from DNA extracted from a single protozoal cell and from total community genomic DNA and the obtained fingerprints were compared. The analysis indicated the presence of a solitary intensive band present in Entodinium caudatum single cell DNA, which had no counterparts in the profile from total DNA. The identity of archaebacterium represented by this band was determined by sequence analysis which showed that the sequence fell to the cluster of ciliate symbiotic methanogens identified recently by 16S gene library approach.
PIECE 2.0: an update for the plant gene structure comparison and evolution database
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparision and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron-exon organization and provides valuable insights into the evolution of gene structure in ...
Functional comparison of three transformer gene introns regulating conditional female lethality
USDA-ARS?s Scientific Manuscript database
The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...
Choosing and Using Introns in Molecular Phylogenetics
Creer, Simon
2007-01-01
Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984
Ślipiko, Monika; Buczkowska-Chmielewska, Katarzyna; Bączkiewicz, Alina; Szczecińska, Monika; Sawicki, Jakub
2017-01-01
Liverwort mitogenomes are considered to be evolutionarily stable. A comparative analysis of four Calypogeia species revealed differences compared to previously sequenced liverwort mitogenomes. Such differences involve unexpected structural changes in the two genes, cox1 and atp1, which have lost three and two introns, respectively. The group I introns in the cox1 gene are proposed to have been lost by two-step localized retroprocessing, whereas one-step retroprocessing could be responsible for the disappearance of the group II introns in the atp1 gene. These cases represent the first identified losses of introns in mitogenomes of leafy liverworts (Jungermanniopsida) contrasting the stability of mitochondrial gene order with certain changes in the gene content and intron set in liverworts. PMID:29257096
RNA chaperone StpA loosens interactions of the tertiary structure in the td group I intron in vivo
Waldsich, Christina; Grossberger, Rupert; Schroeder, Renée
2002-01-01
Efficient splicing of the td group I intron in vivo is dependent on the ribosome. In the absence of translation, the pre-mRNA is trapped in nonnative-splicing-incompetent conformations. Alternatively, folding of the pre-mRNA can be promoted by the RNA chaperone StpA or by the group I intron-specific splicing factor Cyt-18. To understand the mechanism of action of RNA chaperones, we probed the impact of StpA on the structure of the td intron in vivo. Our data suggest that StpA loosens tertiary interactions. The most prominent structural change was the opening of the base triples, which are involved in the correct orientation of the two major intron core domains. In line with the destabilizing activity of StpA, splicing of mutant introns with a reduced structural stability is sensitive to StpA. In contrast, Cyt-18 strengthens tertiary contacts, thereby rescuing splicing of structurally compromised td mutants in vivo. Our data provide direct evidence for protein-induced conformational changes within catalytic RNA in vivo. Whereas StpA resolves tertiary contacts enabling the RNA to refold, Cyt-18 contributes to the overall compactness of the td intron in vivo. PMID:12208852
Lunghi, Matteo; Spano, Furio; Magini, Alessandro; Emiliani, Carla; Carruthers, Vern B; Di Cristina, Manlio
2016-02-01
Apicomplexan parasites including Toxoplasma gondii and Plasmodium species have complex life cycles that include multiple hosts and differentiation through several morphologically distinct stages requiring marked changes in gene expression. This review highlights emerging evidence implicating regulation of mRNA splicing as a mechanism to prime these parasites for rapid gene expression upon differentiation. We summarize the most important insights in alternative splicing including its role in regulating gene expression by decreasing mRNA abundance via 'Regulated Unproductive Splicing and Translation'. As a related but less well-understood mechanism, we discuss also our recent work suggesting a role for intron retention for precluding translation of stage specific isoforms of T. gondii glycolytic enzymes. We additionally provide new evidence that intron retention might be a widespread mechanism during parasite differentiation. Supporting this notion, recent genome-wide analysis of Toxoplasma and Plasmodium suggests intron retention is more pervasive than heretofore thought. These findings parallel recent emergence of intron retention being more prevalent in mammals than previously believed, thereby adding to the established roles in plants, fungi and unicellular eukaryotes. Deeper mechanistic studies of intron retention will provide important insight into its role in regulating gene expression in apicomplexan parasites and more general in eukaryotic organisms.
Dool, Serena E; Puechmaille, Sébastien J; Dietz, Christian; Juste, Javier; Ibáñez, Carlos; Hulva, Pavel; Roué, Stéphane G; Petit, Eric J; Jones, Gareth; Russo, Danilo; Toffoli, Roberto; Viglino, Andrea; Martinoli, Adriano; Rossiter, Stephen J; Teeling, Emma C
2013-08-01
The demographic history of Rhinolophus hipposideros (lesser horseshoe bat) was reconstructed across its European, North African and Middle-Eastern distribution prior to, during and following the most recent glaciations by generating and analysing a multimarker data set. This data set consisted of an X-linked nuclear intron (Bgn; 543 bp), mitochondrial DNA (cytb-tRNA-control region; 1630 bp) and eight variable microsatellite loci for up to 373 individuals from 86 localities. Using this data set of diverse markers, it was possible to determine the species' demography at three temporal stages. Nuclear intron data revealed early colonization into Europe from the east, which pre-dates the Quaternary glaciations. The mtDNA data supported multiple glacial refugia across the Mediterranean, the largest of which were found in the Ibero-Maghreb region and an eastern location (Anatolia/Middle East)-that were used by R. hipposideros during the most recent glacial cycles. Finally, microsatellites provided the most recent information on these species' movements since the Last Glacial Maximum and suggested that lineages that had diverged into glacial refugia, such as in the Ibero-Maghreb region, have remained isolated. These findings should be used to inform future conservation management strategies for R. hipposideros and show the power of using a multimarker data set for phylogeographic studies. © 2013 John Wiley & Sons Ltd.
2011-01-01
Background The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions Our study represents the first phylogenetic analyses based on LCNGs in Pyrus. Ancient and recent duplications lead to a complex structure of Adh outparalogs and inparalogs in Pyrus and Malus, resulting in neofunctionalization, nonfunctionalization and possible subfunctionalization. Among all investigated orthologs, LFY2int2-N is the best nuclear marker for phylogenetic reconstruction of Pyrus due to suitable sequence divergence and the absence of lineage sorting. PMID:21917170
Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M
2014-01-13
Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.
Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong
2009-03-31
The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of Megaleranthis chloroplast genomes are substantially different from that of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to an expanded IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus evidence 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions, respectively. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate the positional effects of the evolutionary modes of chloroplast genes. The genes evidencing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on the two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of the Megaleranthis to the Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculiforia to Trollius chosenensis Ohwi.
Compositional correlations in the chicken genome.
Musto, H; Romero, H; Zavala, A; Bernardi, G
1999-09-01
This paper analyses the compositional correlations that hold in the chicken genome. Significant linear correlations were found among the regions studied-coding sequences (and their first, second, and third codon positions), flanking regions (5' and 3'), and introns-as is the case in the human genome. We found that these compositional correlations are not limited to global GC levels but even extend to individual bases. Furthermore, an analysis of 1037 coding sequences has confirmed a correlation among GC(3), GC(2), and GC(1). The implications of these results are discussed.
The complete chloroplast genome sequence of Dendrobium officinale.
Yang, Pei; Zhou, Hong; Qian, Jun; Xu, Haibin; Shao, Qingsong; Li, Yonghua; Yao, Hui
2016-01-01
The complete chloroplast sequence of Dendrobium officinale, an endangered and economically important traditional Chinese medicine, was reported and characterized. The genome size is 152,018 bp, with 37.5% GC content. A pair of inverted repeats (IRs) of 26,284 bp are separated by a large single-copy region (LSC, 84,944 bp) and a small single-copy region (SSC, 14,506 bp). The complete cp DNA contains 83 protein-coding genes, 39 tRNA genes and 8 rRNA genes. Fourteen genes contained one or two introns.
Characterizing the strand-specific distribution of non-CpG methylation in human pluripotent cells.
Guo, Weilong; Chung, Wen-Yu; Qian, Minping; Pellegrini, Matteo; Zhang, Michael Q
2014-03-01
DNA methylation is an important defense and regulatory mechanism. In mammals, most DNA methylation occurs at CpG sites, and asymmetric non-CpG methylation has only been detected at appreciable levels in a few cell types. We are the first to systematically study the strand-specific distribution of non-CpG methylation. With the divide-and-compare strategy, we show that CHG and CHH methylation are not intrinsically different in human embryonic stem cells (ESCs) and induced pluripotent stem cells (iPSCs). We also find that non-CpG methylation is skewed between the two strands in introns, especially at intron boundaries and in highly expressed genes. Controlling for the proximal sequences of non-CpG sites, we show that the skew of non-CpG methylation in introns is mainly guided by sequence skew. By studying subgroups of transposable elements, we also found that non-CpG methylation is distributed in a strand-specific manner in both short interspersed nuclear elements (SINE) and long interspersed nuclear elements (LINE), but not in long terminal repeats (LTR). Finally, we show that on the antisense strand of Alus, a non-CpG site just downstream of the A-box is highly methylated. Together, the divide-and-compare strategy leads us to identify regions with strand-specific distributions of non-CpG methylation in humans.
Complement receptor 1 variants confer protection from severe malaria in Odisha, India.
Panda, Aditya K; Panda, Madhumita; Tripathy, Rina; Pattanaik, Sarit S; Ravindran, Balachandran; Das, Bidyut K
2012-01-01
In Plasmodium falciparum infection, complement receptor-1 (CR1) on erythrocyte's surface and ABO blood group play important roles in formation of rosettes which are presumed to be contributory in the pathogenesis of severe malaria. Although several studies have attempted to determine the association of CR1 polymorphisms with severe malaria, observations remain inconsistent. Therefore, a case control study and meta-analysis was performed to address this issue. Common CR1 polymorphisms (intron 27 and exon 22) and blood group were typed in 353 cases of severe malaria (SM) [97 cerebral malaria (CM), 129 multi-organ dysfunction (MOD), 127 non-cerebral severe malaria (NCSM)], 141 un-complicated malaria and 100 healthy controls from an endemic region of Odisha, India. Relevant publications for meta-analysis were searched from the database. The homozygous polymorphisms of CR1 intron 27 and exon 22 (TT and GG) and alleles (T and G) that are associated with low expression of CR1 on red blood cells, conferred significant protection against CM, MOD and malaria deaths. Combined analysis showed significant association of blood group B/intron 27-AA/exon 22-AA with susceptibility to SM (CM and MOD). Meta-analysis revealed that the CR1 exon 22 low expression polymorphism is significantly associated with protection against severe malaria. The results of the present study demonstrate that common CR1 variants significantly protect against severe malaria in an endemic area.
Lambert, Marie-Pierre; Terrone, Sophie; Giraud, Guillaume; Benoit-Pilven, Clara; Cluet, David; Combaret, Valérie; Mortreux, Franck; Auboeuf, Didier; Bourgeois, Cyril F
2018-06-21
The Repressor Element 1-silencing transcription factor (REST) represses a number of neuronal genes in non-neuronal cells or in undifferentiated neural progenitors. Here, we report that the DEAD box RNA helicase DDX17 controls important REST-related processes that are critical during the early phases of neuronal differentiation. First, DDX17 associates with REST, promotes its binding to the promoter of a subset of REST-targeted genes and co-regulates REST transcriptional repression activity. During neuronal differentiation, we observed a downregulation of DDX17 along with that of the REST complex that contributes to the activation of neuronal genes. Second, DDX17 and its paralog DDX5 regulate the expression of several proneural microRNAs that are known to target the REST complex during neurogenesis, including miR-26a/b that are also direct regulators of DDX17 expression. In this context, we propose a new mechanism by which RNA helicases can control the biogenesis of intronic miRNAs. We show that the processing of the miR-26a2 precursor is dependent on RNA helicases, owing to an intronic regulatory region that negatively impacts on both miRNA processing and splicing of its host intron. Our work places DDX17 in the heart of a pathway involving REST and miRNAs that allows neuronal gene repression.