Analysis of 16S-23S rRNA intergenic spacer regions of Vibrio cholerae and Vibrio mimicus.
Chun, J; Huq, A; Colwell, R R
1999-05-01
Vibrio cholerae identification based on molecular sequence data has been hampered by a lack of sequence variation from the closely related Vibrio mimicus. The two species share many genes coding for proteins, such as ctxAB, and show almost identical 16S DNA coding for rRNA (rDNA) sequences. Primers targeting conserved sequences flanking the 3' end of the 16S and the 5' end of the 23S rDNAs were used to amplify the 16S-23S rRNA intergenic spacer regions of V. cholerae and V. mimicus. Two major (ca. 580 and 500 bp) and one minor (ca. 750 bp) amplicons were consistently generated for both species, and their sequences were determined. The largest fragment contains three tRNA genes (tDNAs) coding for tRNAGlu, tRNALys, and tRNAVal, which has not previously been found in bacteria examined to date. The 580-bp amplicon contained tDNAIle and tDNAAla, whereas the 500-bp fragment had single tDNA coding either tRNAGlu or tRNAAla. Little variation, i.e., 0 to 0.4%, was found among V. cholerae O1 classical, O1 El Tor, and O139 epidemic strains. Slightly more variation was found against the non-O1/non-O139 serotypes (ca. 1% difference) and V. mimicus (2 to 3% difference). A pair of oligonucleotide primers were designed, based on the region differentiating all of V. cholerae strains from V. mimicus. The PCR system developed was subsequently evaluated by using representatives of V. cholerae from environmental and clinical sources, and of other taxa, including V. mimicus. This study provides the first molecular tool for identifying the species V. cholerae.
Mayán, Maria D
2013-01-01
Three RNA polymerases coexist in the ribosomal DNA of Saccharomyces cerevisiae. RNAP-I transcribes the 35S rRNA, RNAP-III transcribes the 5S rRNA and RNAP-II is found in both intergenic non-coding regions. Previously, we demonstrated that RNAP-II molecules bound to the intergenic non-coding regions (IGS) of the ribosomal locus are mainly found in a stalled conformation, and the stalled polymerase mediates chromatin interactions, which isolate RNAP-I from the RNAP-III transcriptional domain. Besides, RNAP-II transcribes both IGS regions at low levels, using different cryptic promoters. This report demonstrates that RNAP-II also transcribes two sequences located in the 5'- and 3'-ends of the 35S rRNA gene that overlap with the sequences of the 35S rRNA precursor transcribed by RNAP-I. The sequence located at the promoter region of RNAP-I, called the p-RNA transcript, binds to the transcription termination-related protein, Reb1p, while the T-RNA sequence, located in the termination sites of RNAP-I gene, contains the stem-loop recognized by Rtn1p, which is necessary for proper termination of RNAP-I. Because of their location, these small RNAs may play a key role in the initiation and termination of RNAP-I transcription. To correctly synthesize proteins, eukaryotic cells may retain a mechanism that connects the three main polymerases. This report suggests that cryptic transcription by RNAP-II may be required for normal transcription by RNAP-I in the ribosomal locus of S. cerevisiae. Copyright © 2012 John Wiley & Sons, Ltd.
RNA processing in Neurospora crassa mitochondria: use of transfer RNA sequences as signals.
Breitenberger, C A; Browning, K S; Alzner-DeWeerd, B; RajBhandary, U L
1985-01-01
We have used RNA gel transfer hybridization, S1 nuclease mapping and primer extension to analyze transcripts derived from several genes in Neurospora crassa mitochondria. The transcripts studied include those for cytochrome oxidase subunit III, 17S rRNA and an unidentified open reading frame. In all three cases, initial transcripts are long, include tRNA sequences, and are subsequently processed to generate the mature RNAs. We find that endpoints of the most abundant transcripts generally coincide with those of tRNA sequences. We therefore conclude that tRNA sequences in long transcripts act as primary signals for RNA processing in N. crassa mitochondria. The situation is somewhat analogous to that observed in mammalian mitochondrial systems. The difference, however, is that in mammalian mitochondria, noncoding spacers between tRNA, rRNA and protein genes are very short and in many cases non-existent, allowing no room for intergenic RNA processing signals whereas, in N. crassa mtDNA, intergenic non-coding sequences are usually several hundred nucleotides long and contain highly conserved GC-rich palindromic sequences. Since these GC-rich palindromic sequences are retained in the processed mature RNAs, we conclude that they do not serve as signals for RNA processing. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:2990893
Sharwood, Robert E.; Hotto, Amber M.; Bollenbach, Thomas J.; Stern, David B.
2011-01-01
Post-transcriptional regulation in the chloroplast is exerted by nucleus-encoded ribonucleases and RNA-binding proteins. One of these ribonucleases is RNR1, a 3′-to-5′ exoribonuclease of the RNase II family. We have previously shown that Arabidopsis rnr1-null mutants exhibit specific abnormalities in the expression of the rRNA operon, including the accumulation of precursor 23S, 16S, and 4.5S species and a concomitant decrease in the mature species. 5S rRNA transcripts, however, accumulate to a very low level in both precursor and mature forms, suggesting that they are unstable in the rnr1 background. Here we demonstrate that rnr1 plants overaccumulate an antisense RNA, AS5, that is complementary to the 5S rRNA, its intergenic spacer, and the downstream trnR gene, which encodes tRNAArg, raising the possibility that AS5 destabilizes 5S rRNA or its precursor and/or blocks rRNA maturation. To investigate this, we used an in vitro system that supports 5S rRNA and trnR processing. We show that AS5 inhibits 5S rRNA maturation from a 5S-trnR precursor, and shorter versions of AS5 demonstrate that inhibition requires intergenic sequences. To test whether the sense and antisense RNAs form double-stranded regions in vitro, treatment with the single-strand-specific mung bean nuclease was used. These results suggest that 5S–AS5 duplexes interfere with a sense-strand secondary structure near the endonucleolytic cleavage site downstream from the 5S rRNA coding region. We hypothesize that these duplexes are degraded by a dsRNA-specific ribonuclease in vivo, contributing to the 5S rRNA deficiency observed in rnr1. PMID:21148395
Li, Weijun; Wang, Zongqing; Che, Yanli
2017-11-12
In this study, the complete mitochondrial genome of Cryptocercus meridianus was sequenced. The circular mitochondrial genome is 15,322 bp in size and contains 13 protein-coding genes, two ribosomal RNA genes (12S rRNA and 16S rRNA), 22 transfer RNA genes, and one D-loop region. We compare the mitogenome of C. meridianus with that of C. relictus and C. kyebangensis . The base composition of the whole genome was 45.20%, 9.74%, 16.06%, and 29.00% for A, G, C, and T, respectively; it shows a high AT content (74.2%), similar to the mitogenomes of C. relictus and C. kyebangensis . The protein-coding genes are initiated with typical mitochondrial start codons except for cox1 with TTG. The gene order of the C. meridianus mitogenome differs from the typical insect pattern for the translocation of tRNA-Ser AGN , while the mitogenomes of the other two Cryptocercus species, C. relictus and C. kyebangensis , are consistent with the typical insect pattern. There are two very long non-coding intergenic regions lying on both sides of the rearranged gene tRNA-Ser AGN . The phylogenetic relationships were constructed based on the nucleotide sequence of 13 protein-coding genes and two ribosomal RNA genes. The mitogenome of C. meridianus is the first representative of the order Blattodea that demonstrates rearrangement, and it will contribute to the further study of the phylogeny and evolution of the genus Cryptocercus and related taxa.
Lessard, Laurent; Liu, Michelle; Marzese, Diego M.; Wang, Hongwei; Chong, Kelly; Kawas, Neal; Donovan, Nicholas C; Kiyohara, Eiji; Hsu, Sandy; Nelson, Nellie; Izraely, Sivan; Sagi-Assif, Orit; Witz, Isaac P; Ma, Xiao-Jun; Luo, Yuling; Hoon, Dave SB
2015-01-01
In recent years, considerable advances have been made in the characterization of protein-coding alterations involved in the pathogenesis of melanoma. However, despite their growing implication in cancer, little is known about the role of long non-coding RNAs in melanoma progression. We hypothesized that copy number alterations of intergenic non-protein coding domains could help identify long intergenic non-coding RNAs (lincRNAs) associated with metastatic cutaneous melanoma. Among several candidates, our approach uncovered the chromosome 6p22.3 CASC15 lincRNA locus as a frequently gained genomic segment in metastatic melanoma tumors and cell lines. The locus was actively transcribed in metastatic melanoma cells, and up-regulation of CASC15 expression was associated with metastatic progression to brain metastasis in a mouse xenograft model. In clinical specimens, CASC15 levels increased during melanoma progression and were independent predictors of disease recurrence in a cohort of 141 patients with AJCC stage III lymph node metastasis. Moreover, siRNA knockdown experiments revealed that CASC15 regulates melanoma cell phenotype switching between proliferative and invasive states. Accordingly, CASC15 levels correlated with known gene signatures corresponding to melanoma proliferative and invasive phenotypes. These findings support a key role for CASC15 in metastatic melanoma. PMID:26016895
Shiao, Yih-Horng; Lupascu, Sorin T; Gu, Yuhan D; Kasprzak, Wojciech; Hwang, Christopher J; Fields, Janet R; Leighty, Robert M; Quiñones, Octavio; Shapiro, Bruce A; Alvord, W Gregory; Anderson, Lucy M
2009-10-19
Ribosomal RNA (rRNA) is a central regulator of cell growth and may control cancer development. A cis noncoding rRNA (nc-rRNA) upstream from the 45S rRNA transcription start site has recently been implicated in control of rRNA transcription in mouse fibroblasts. We investigated whether a similar nc-rRNA might be expressed in human cancer epithelial cells, and related to any genomic characteristics. Using quantitative rRNA measurement, we demonstrated that a nc-rRNA is transcribed in human lung epithelial and lung cancer cells, starting from approximately -1000 nucleotides upstream of the rRNA transcription start site (+1) and extending at least to +203. This nc-rRNA was significantly more abundant in the majority of lung cancer cell lines, relative to a nontransformed lung epithelial cell line. Its abundance correlated negatively with total 45S rRNA in 12 of 13 cell lines (P = 0.014). During sequence analysis from -388 to +306, we observed diverse, frequent intercopy single nucleotide polymorphisms (SNPs) in rRNA, with a frequency greater than predicted by chance at 12 sites. A SNP at +139 (U/C) in the 5' leader sequence varied among the cell lines and correlated negatively with level of the nc-rRNA (P = 0.014). Modelling of the secondary structure of the rRNA 5'-leader sequence indicated a small increase in structural stability due to the +139 U/C SNP and a minor shift in local configuration occurrences. The results demonstrate occurrence of a sense nc-rRNA in human lung epithelial and cancer cells, and imply a role in regulation of the rRNA gene, which may be affected by a +139 SNP in the 5' leader sequence of the primary rRNA transcript.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.
Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing
2016-12-01
Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Harrison, Nigel; Dickinson, Matthew
2008-08-01
Phytoplasma phylogenetics has focused primarily on sequences of the non-coding 16S rRNA gene and the 16S-23S rRNA intergenic spacer region (16-23S ISR), and primers that enable amplification of these regions from all phytoplasmas by PCR are well established. In this study, primers based on the secA gene have been developed into a semi-nested PCR assay that results in a sequence of the expected size (about 480 bp) from all 34 phytoplasmas examined, including strains representative of 12 16Sr groups. Phylogenetic analysis of secA gene sequences showed similar clustering of phytoplasmas when compared with clusters resolved by similar sequence analyses of a 16-23S ISR-23S rRNA gene contig or of the 16S rRNA gene alone. The main differences between trees were in the branch lengths, which were elongated in the 16-23S ISR-23S rRNA gene tree when compared with the 16S rRNA gene tree and elongated still further in the secA gene tree, despite this being a shorter sequence. The improved resolution in the secA gene-derived phylogenetic tree resulted in the 16SrII group splitting into two distinct clusters, while phytoplasmas associated with coconut lethal yellowing-type diseases split into three distinct groups, thereby supporting past proposals that they represent different candidate species within 'Candidatus Phytoplasma'. The ability to differentiate 16Sr groups and subgroups by virtual RFLP analysis of secA gene sequences suggests that this gene may provide an informative alternative molecular marker for pathogen identification and diagnosis of phytoplasma diseases.
Cryptic tRNAs in chaetognath mitochondrial genomes.
Barthélémy, Roxane-Marie; Seligmann, Hervé
2016-06-01
The chaetognaths constitute a small and enigmatic phylum of little marine invertebrates. Both nuclear and mitochondrial genomes have numerous originalities, some phylum-specific. Until recently, their mitogenomes seemed containing only one tRNA gene (trnMet), but a recent study found in two chaetognath mitogenomes two and four tRNA genes. Moreover, apparently two conspecific mitogenomes have different tRNA gene numbers (one and two). Reanalyses by tRNAscan-SE and ARWEN softwares of the five available complete chaetognath mitogenomes suggest numerous additional tRNA genes from different types. Their total number never reaches the 22 found in most other invertebrates using that genetic code. Predicted error compensation between codon-anticodon mismatch and tRNA misacylation suggests translational activity by tRNAs predicted solely according to secondary structure for tRNAs predicted by tRNAscan-SE, not ARWEN. Numbers of predicted stop-suppressor (antitermination) tRNAs coevolve with predicted overlapping, frameshifted protein coding genes including stop codons. Sequence alignments in secondary structure prediction with non-chaetognath tRNAs suggest that the most likely functional tRNAs are in intergenic regions, as regular mt-tRNAs. Due to usually short intergenic regions, generally tRNA sequences partially overlap with flanking genes. Some tRNA pairs seem templated by sense-antisense strands. Moreover, 16S rRNA genes, but not 12S rRNAs, appear as tRNA nurseries, as previously suggested for multifunctional ribosomal-like protogenomes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons
Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.
2017-01-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
Chen, Zhi-Teng; Du, Yu-Zhou
2015-03-01
The complete mitochondrial genome of the stonefly, Sweltsa longistyla Wu (Plecoptera: Chloroperlidae), was sequenced in this study. The mitogenome of S. longistyla is 16,151bp and contains 37 genes including 13 protein-coding genes (PCGs), 22 tRNA genes, two rRNA genes, and a large non-coding region. S. longistyla, Pteronarcys princeps Banks, Kamimuria wangi Du and Cryptoperla stilifera Sivec belong to the Plecoptera, and the gene order and orientation of their mitogenomes were similar. The overall AT content for the four stoneflies was below 72%, and the AT content of tRNA genes was above 69%. The four genomes were compact and contained only 65-127bp of non-coding intergenic DNAs. Overlapping nucleotides existed in all four genomes and ranged from 24 (P. princeps) to 178bp (K. wangi). There was a 7-bp motif ('ATGATAA') of overlapping DNA and an 8-bp motif (AAGCCTTA) conserved in three stonefly species (P. princeps, K. wangi and C. stilifera). The control regions of four stoneflies contained a stem-loop structure. Four conserved sequence blocks (CSBs) were present in the A+T-rich regions of all four stoneflies. Copyright © 2014 Elsevier B.V. All rights reserved.
Jaramillo-Correa, J P; Bousquet, J; Beaulieu, J; Isabel, N; Perron, M; Bouillé, M
2003-05-01
Primers previously developed to amplify specific non-coding regions of the mitochondrial genome in Angiosperms, and new primers for additional non-coding mtDNA regions, were tested for their ability to direct DNA amplification in 12 conifer taxa and to detect sequence-tagged-site (STS) polymorphisms within and among eight species in Picea. Out of 12 primer pairs, nine were successful at amplifying mtDNA in most of the taxa surveyed. In conifers, indels and substitutions were observed for several loci, allowing them to distinguish between families, genera and, in some cases, between species within genera. In Picea, interspecific polymorphism was detected for four loci, while intraspecific variation was observed for three of the mtDNA regions studied. One of these (SSU rRNA V1 region) exhibited indel polymorphisms, and the two others ( nad1 intron b/c and nad5 intron1) revealed restriction differences after digestion with Sau3AI (PCR-RFLP). A fourth locus, the nad4L- orf25 intergenic region, showed a multibanding pattern for most of the spruce species, suggesting a possible gene duplication. Maternal inheritance, expected for mtDNA in conifers, was observed for all polymorphic markers except the intergenic region nad4L- orf25. Pooling of the variation observed with the remaining three markers resulted in two to six different mtDNA haplotypes within the different species of Picea. Evidence for intra-genomic recombination was observed in at least two taxa. Thus, these mitotypes are likely to be more informative than single-locus haplotypes. They should be particularly useful for the study of biogeography and the dynamics of hybrid zones.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.
2004-05-01
We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha withinmore » the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins gives a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.« less
Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M
2017-04-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Wang, Deguo; Liu, Yanhong
2015-05-26
Streptococcus dysgalactiae, Streptococcus uberis and Streptococcus agalactiae are the three main pathogens causing bovine mastitis, with great losses to the dairy industry. Rapid and specific loop-mediated isothermal amplification methods (LAMP) for identification and differentiation of these three pathogens are not available. With the 16S rRNA gene and 16S-23S rRNA intergenic spacers as targets, four sets of LAMP primers were designed for identification and differentiation of S. dysgalactiae, S. uberis and S. agalactiae. The detection limit of all four LAMP primer sets were 0.1 pg DNA template per reaction, the LAMP method with 16S rRNA gene and 16S-23S rRNA intergenic spacers as the targets can differentiate the three pathogens, which is potentially useful in epidemiological studies.
McKee, B. D.; Habera, L.; Vrana, J. A.
1992-01-01
In Drosophila melanogaster males, X-Y meiotic chromosome pairing is mediated by the nucleolus organizers (NOs) which are located in the X heterochromatin (Xh) and near the Y centromere. Deficiencies for Xh disrupt X-Y meiotic pairing and cause high frequencies of X-Y nondisjunction. Insertion of cloned rRNA genes on an Xh(-) chromosome partially restores normal X-Y pairing and disjunction. To map the sequences within an inserted, X-linked rRNA gene responsible for stimulating X-Y pairing, partial deletions were generated by P element-mediated destabilization of the insert. Complete deletions of the rRNA transcription unit did not interfere with the ability to stimulate X-Y pairing as long as most of the intergenic spacer (IGS) remained. Within groups of deletions that lacked the entire transcription unit and differed only in length of residual IGS material, pairing ability was proportional to the dose of 240-bp intergenic spacer repeats. Deletions of the complete rRNA transcription unit or of the 28S sequences alone blocked nucleolus formation, as determined by binding of an antinucleolar antibody, yet did not interfere with pairing ability, suggesting that X-Y pairing may not be mechanistically related to nucleolus formation. A model for achiasmatic pairing in Drosophila males based upon the combined action of topoisomerase I and a strand transferase is proposed. PMID:1330825
Zhao, A; Guo, A; Liu, Z; Pape, L
1997-01-01
The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S.pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S.cerevisiae and Klyveromyces lactis REB1 proteins is uninterrupted and thus S.pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pomber DNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang
2016-05-01
We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% of A, 26.74% of T, 20.48% of C and 8.28% of G. All the protein-coding genes start with ATG initiation codon except ATP6 with ATA and ND4 with TTG, and 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. The tandem repeat sequences are observed in COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution are different from the typical vertebrates. The absence of D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is essential regulatory element for the initiation of transcription and replication.
USDA-ARS?s Scientific Manuscript database
The phylogenetic utility of sequence variation from five chloroplast DNA intergenic spacer (IGS) regions: trnT-trnF, psbA-trnH, atpB-rbcL, trnV-16S rRNA, and trnS-trnfM was examined in the genus Juglans. A total of seventeen taxa representing the four sections within Juglans and an outgroup taxon, ...
Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.
Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P
2010-08-01
Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.
Jheng, Cheng-Fong; Chen, Tien-Chih; Lin, Jhong-Yi; Chen, Ting-Chieh; Wu, Wen-Luan; Chang, Ching-Chun
2012-07-01
The chloroplast genome of Phalaenopsis equestris was determined and compared to those of Phalaenopsis aphrodite and Oncidium Gower Ramsey in Orchidaceae. The chloroplast genome of P. equestris is 148,959 bp, and a pair of inverted repeats (25,846 bp) separates the genome into large single-copy (85,967 bp) and small single-copy (11,300 bp) regions. The genome encodes 109 genes, including 4 rRNA, 30 tRNA and 75 protein-coding genes, but loses four ndh genes (ndhA, E, F and H) and seven other ndh genes are pseudogenes. The rate of inter-species variation between the two moth orchids was 0.74% (1107 sites) for single nucleotide substitution and 0.24% for insertions (161 sites; 1388 bp) and deletions (189 sites; 1393 bp). The IR regions have a lower rate of nucleotide substitution (3.5-5.8-fold) and indels (4.3-7.1-fold) than single-copy regions. The intergenic spacers are the most divergent, and based on the length variation of the three intergenic spacers, 11 native Phalaenopsis orchids could be successfully distinguished. The coding genes, IR junction and RNA editing sites are relatively more conserved between the two moth orchids than between those of Phalaenopsis and Oncidium spp. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Cicala, Francesco; Moore, James D; Cáceres-Martínez, Jorge; Del Río-Portilla, Miguel A; Hernández-Rodríguez, Mónica; Vásquez-Yeomans, Rebeca; Rocha-Olivares, Axayácatl
2018-05-01
Withering syndrome (WS) is a chronic wasting disease affecting abalone species attributed to the pathogen Candidatus Xenohaliotis californiensis (CXc). Wild populations of blue (Haliotis fulgens) and yellow (H. corrugata) abalone have experienced unusual mortality rates since 2009 off the peninsula of Baja California and WS has been hypothesized as a possible cause. Currently, little information is available about the genetic diversity of CXc and particularly the possible existence of strains differing in pathogenicity. In a recent phylogenetic analysis, we characterized five coding genes from this rickettsial pathogen. Here, we analyze those genes and two additional intergenic non-coding regions following multi-locus sequence typing (MLST) and multi-spacer typing (MST) approaches to assess the genetic variability of CXc and its relationship with blue, yellow and red (H. rufescens) abalone. Moreover, we used 16S rRNA pyrosequencing reads from gut microbiomes of blue and yellow abalone to complete the genetic characterization of this prokaryote. The presence of CXc was investigated in more than 150 abalone of the three species; furthermore, a total of 385 DNA sequences and 7117 16S rRNA reads from Candidatus Xenohaliotis californiensis were used to evaluate its population genetic structure. Our findings suggest the absence of polymorphism in the DNA sequences of analyzed loci and the presence of a single lineage of CXc infecting abalone from California (USA) and Baja California (Mexico). We posit that the absence of genetic variably in this marine rickettsia may be the result of evolutionary and ecological processes. Copyright © 2018 Elsevier Inc. All rights reserved.
SUMIYAMA, KENTA; MIYAKE, TSUTOMU; GRIMWOOD, JANE; STUART, ANDREW; DICKSON, MARK; SCHMUTZ, JEREMY; RUDDLE, FRANK H.; MYERS, RICHARD M.; AMEMIYA, CHRIS T.
2013-01-01
The mammalian Dlx3 and Dlx4 genes are configured as a bigene cluster, and their respective expression patterns are controlled temporally and spatially by cis-elements that largely reside within the intergenic region of the cluster. Previous work revealed that there are conspicuously conserved elements within the intergenic region of the Dlx3–4 bigene clusters of mouse and human. In this paper we have extended these analyses to include 12 additional mammalian taxa (including a marsupial and a monotreme) in order to better define the nature and molecular evolutionary trends of the coding and non-coding functional elements among morphologically divergent mammals. Dlx3–4 regions were fully sequenced from 12 divergent taxa of interest. We identified three theria-specific amino acid replacements in homeodomain of Dlx4 gene that functions in placenta. Sequence analyses of constrained nucleotide sites in the intergenic non-coding region showed that many of the intergenic conserved elements are highly conserved and have evolved slowly within the mammals. In contrast, a branchial arch/craniofacial enhancer I37-2 exhibited accelerated evolution at the branch between the monotreme and therian common ancestor despite being highly conserved among therian species. Functional analysis of I37-2 in transgenic mice has shown that the equivalent region of the platypus fails to drive transcriptional activity in branchial arches. These observations, taken together with our molecular evolutionary data, suggest that theria-specific episodic changes in the I37-2 element may have contributed to craniofacial innovation at the base of the mammalian lineage. PMID:22951979
Diagnosis of clinical samples spotted on FTA cards using PCR-based methods.
Jamjoom, Manal; Sultan, Amal H
2009-04-01
The broad clinical presentation of Leishmaniasis makes the diagnosis of current and past cases of this disease rather difficult. Differential diagnosis is important because diseases caused by other aetiologies and a clinical spectrum similar to that of leishmaniasis (e.g. leprosy, skin cancers and tuberculosis for CL; malaria and schistosomiasis for VL) are often present in endemic areas of endemicity. Presently, a variety of methods have been developed and tested to aid the identification and diagnosis of Leishmania. The advent of the PCR technology has opened new channels for the diagnosis of leishmaniasis in a variety of clinical materials. PCR is a simple, rapid procedure that has been adapted for diagnosis of leishmaniasis. A range of tools is currently available for the diagnosis and identification of leishmaniasis and Leishmania species, respectively. However, none of these diagnostic tools are examined and tested using samples spotted on FTA cards. Three different PCR-based approaches were examined including: kDNA minicircle, Leishmania 18S rRNA gene and PCR-RFLP of Intergenic region of ribosomal protein. PCR primers were designed that sit within the coding sequences of genes (relatively well conserved) but which amplify across the intervening intergenic sequence (relatively variable). These were used in PCR-RFLP on reference isolates of 10 of the most important Leishmania species: L. donovani, L. infantum, L. major & L. tropica. Digestion of PCR products with restriction enzymes produced species-specific restriction patterns allowed discrimination of reference isolates. The kDNA minicircle primers are highly sensitive in diagnosis of both bone marrow and skin smears from FTA cards. Leishmania 18S rRNA gene conserved region is sensitive in identification of bone marrow smear but less sensitive in diagnosing skin smears. The intergenic nested PCR-RFLP using P5 & P6 as well as P1 & P2 newly designed primers showed high level of reproducibility and sensitivity. Though, it was less sensitive than kDNA minicircle primers, but easily discriminated between Leishmania species.
NASA Astrophysics Data System (ADS)
Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.
2018-09-01
This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.
Robinett, C C; O'Connor, A; Dunaway, M
1997-01-01
We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359
Laura K Muller; Jeffrey M. Lorch; Daniel L. Lindner; Michael O' Connor; Andrea Gargas; David S. Blehert
2013-01-01
The fungus Geomyces destructans is the causative agent of white-nose syndrome (WNS), a disease that has killed millions of North American hibernating bats. We describe a real-time TaqMan PCR test that detects DNA from G. destructans by targeting a portion of the multicopy intergenic spacer region of the rRNA gene complex. The...
Panangala, V S; van Santen, V L; Shoemaker, C A; Klesius, P H
2005-01-01
To analyse interspecies and intraspecies differences based on the 16S-23S rRNA intergenic spacer region (ISR) sequences of the fish pathogens Edwardsiella ictaluri and Edwardsiella tarda. The 16S-23S rRNA spacer regions of 19 Edw. ictaluri and four Edw. tarda isolates from four geographical regions were amplified by PCR with primers complementary to conserved sequences within the flanking 16S-23S rRNA coding sequences. Two products were generated from all isolates, without interspecies or intraspecific size polymorphisms. Sequence analysis of the amplified fragments revealed a smaller ISR of 350 bp, which contained a gene for tRNA(Glu), and a larger ISR of 441 bp, which contained genes for tRNA(Ile) and tRNA(Ala). The sequences of the smaller ISR of different Edw. ictaluri isolates were essentially identical to each other. Partial sequences of larger ISR from several Edw. ictaluri isolates also revealed no differences from the one complete Edw. ictaluri large ISR sequence obtained. The sequences of the smaller ISR of Edw. tarda were 97% identical to the Edw. ictaluri smaller ISR and the larger ISR were 96-98% identical to the Edw. ictaluri larger ISR sequence. The Edw. tarda isolates displayed limited ISR sequence heterogeneity, with > or =97% sequence identity among isolates for both small and large ISR. There is a high degree of size and sequence similarity of 16S-23S ISR both among isolates within Edw. ictaluri and Edw. tarda species and between the two species. Our results confirm a close genetic relationship between Edw. ictaluri and Edw. tarda and the relative homogeneity of Edw. ictaluri isolates compared with Edw. tarda isolates. Because no differences were found in ISR sequences among Edw. ictaluri isolates, sequence analysis of the ISR will not be useful to distinguish isolates of Edw. ictaluri. However, we identified restriction sites that differ between ISR sequences of Edw. ictaluri and Edw. tarda, which will be useful in distinguishing the two species.
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Romero, J; García-Varela, M; Laclette, J P; Espejo, R T
2002-11-01
To explore the bacterial microbiota in Chilean oyster (Tiostrea chilensis), a molecular approach that permits detection of different bacteria, independently of their capacity to grow in culture media, was used. Bacterial diversity was assessed by analysis of both the 16S rDNA and the 16S-23S intergenic region, obtained by PCR amplifications of DNA extracted from depurated oysters. RFLP of the PCR amplified 16S rDNA showed a prevailing pattern in most of the individuals analyzed, indicating that a few bacterial species were relatively abundant and common in oysters. Cloning and sequencing of the 16S rDNA with the prevailing RFLP pattern indicated that this rRNA was most closely related to Arcobacter spp. However, analysis by the size of the amplified 16S-23S rRNA intergenic regions revealed not Arcobacter spp. but Staphylococcus spp. related bacteria as a major and common component in oyster. These different results may be caused by the absence of target for one of the primers employed for amplification of the intergenic region. Neither of the two bacteria species found in large abundance was recovered after culturing under aerobic, anaerobic, or microaerophilic conditions. This result, however, is expected because the number of bacteria recovered after cultivation was less than 0.01% of the total. All together, these observations suggest that Arcobacter-related strains are probably abundant and common in the Chilean oyster bacterial microbiota.
Sequences in the intergenic spacer influence RNA Pol I transcription from the human rRNA promoter
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, W.M.; Sylvester, J.E.
1994-09-01
In most eucaryotic species, ribosomal genes are tandemly repeated about 100-5000 times per haploid genome. The 43 Kb human rDNA repeat consists of a 13 Kb coding region for the 18S, 5.8S, 28S ribosomal RNAs (rRNAs) and transcribed spacers separated by a 30 Kb intergenic spacer. For species such as frog, mouse and rat, sequences in the intergenic spacer other than the gene promoter have been shown to modulate transcription of the ribosomal gene. These sequences are spacer promoters, enhancers and the terminator for spacer transcription. We are addressing whether the human ribosomal gene promoter is similarly influenced. In-vitro transcriptionmore » run-off assays have revealed that the 4.5 kb region (CBE), directly upstream of the gene promoter, has cis-stimulation and trans-competition properties. This suggests that the CBE fragment contains an enhancer(s) for ribosomal gene transcription. Further experiments have shown that a fragment ({approximately}1.6 kb) within the CBE fragment also has trans-competition function. Deletion subclones of this region are being tested to delineate the exact sequences responsible for these modulating activities. Previous sequence analysis and functional studies have revealed that CBE contains regions of DNA capable of adopting alternative structures such as bent DNA, Z-DNA, and triple-stranded DNA. Whether these structures are required for modulating transcription remains to be determined as does the specific DNA-protein interaction involved.« less
Knief, Claudia; Frances, Lisa; Cantet, Franck; Vorholt, Julia A.
2008-01-01
Bacteria of the genus Methylobacterium are widespread in the environment, but their ecological role in ecosystems, such as the plant phyllosphere, is not very well understood. To gain better insight into the distribution of different Methylobacterium species in diverse ecosystems, a rapid and specific cultivation-independent method for detection of these organisms and analysis of their community structure is needed. Therefore, 16S rRNA gene-targeted primers specific for this genus were designed and evaluated. These primers were used in PCR in combination with a reverse primer that binds to the tRNAAla gene, which is located upstream of the 23S rRNA gene in the 16S-23S intergenic spacer (IGS). PCR products that were of different lengths were obtained due to the length heterogeneity of the IGS of different Methylobacterium species. This length variation allowed generation of fingerprints of Methylobacterium communities in environmental samples by automated ribosomal intergenic spacer analysis. The Methylobacterium communities on leaves of different plant species in a natural field were compared using this method. The new method allows rapid comparisons of Methylobacterium communities and is thus a useful tool to study Methylobacterium communities in different ecosystems. PMID:18263752
Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han
2016-05-11
Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.
Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C
1999-08-05
The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16613bp, is presented in this report. The genome is not a specific length because of the presence of the variable numbers of tandem repeats, 5'-CGTGCGTACA in the displacement loop (D-loop). Genes responsible for 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. For evaluating the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than that between either of these species and fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.
Genome-wide survey by ChIP-seq reveals YY1 regulation of lincRNAs in skeletal myogenesis
Lu, Leina; Sun, Kun; Chen, Xiaona; Zhao, Yu; Wang, Lijun; Zhou, Liang; Sun, Hao; Wang, Huating
2013-01-01
Skeletal muscle differentiation is orchestrated by a network of transcription factors, epigenetic regulators, and non-coding RNAs. The transcription factor Yin Yang 1 (YY1) silences multiple target genes in myoblasts (MBs) by recruiting Ezh2 (Enhancer of Zeste Homologue2). To elucidate genome-wide YY1 binding in MBs, we performed chromatin immunoprecipitation (ChIP)-seq and found 1820 specific binding sites in MBs with a large portion residing in intergenic regions. Detailed analysis demonstrated that YY1 acts as an activator for many loci in addition to its known repressor function. No significant co-occupancy was found between YY1 and Ezh2, suggesting an additional Ezh2-independent function for YY1 in MBs. Further analysis of intergenic binding sites showed that YY1 potentially regulates dozens of large intergenic non-coding RNAs (lincRNAs), whose function in myogenesis is underexplored. We characterized a novel muscle-associated lincRNA (Yam-1) that is positively regulated by YY1. Yam-1 is downregulated upon differentiation and acts as an inhibitor of myogenesis. We demonstrated that Yam-1 functions through in cis regulation of miR-715, which in turn targets Wnt7b. Our findings not only provide the first genome-wide picture of YY1 association in muscle cells, but also uncover the functional role of lincRNA Yam-1. PMID:23942234
Cremonesi, P; Zottola, T; Locatelli, C; Pollera, C; Castiglioni, B; Scaccabarozzi, L; Moroni, P
2013-01-01
Staphylococcus aureus is an important human and animal pathogen, and is regarded as an important cause of intramammary infection (IMI) in ruminants. Staphylococcus aureus genetic variability and virulence factors have been well studied in veterinary medicine, especially in cows as support for control and management of IMI. The aim of the present study was to genotype 71 Staph. aureus isolates from the bulk tank and foremilk of water buffaloes (n=40) and from udder tissue (n=7) and foremilk (n=24) from small ruminants. The method used was previously applied to bovine Staph. aureus and is based on the amplification of the 16S-23S rRNA intergenic spacer region. The technique applied was able to identify different Staph. aureus genotypes isolated from dairy species other than the bovine species, and cluster the genotypes according to species and herds. Virulence gene distribution was consistent with genotype differentiation. The isolates were also characterized through determination of the presence of 19 virulence-associated genes by specific PCR. Enterotoxins A, C, D, G, I, J, and L were associated with Staph. aureus isolates from buffaloes, whereas enterotoxins C and L were linked to small ruminants. Genes coding for methicillin resistance, Panton-Valentine leukocidin, exfoliative toxins A and B, and enterotoxins B, E, and H were undetected. These findings indicate that RNA template-specific PCR is a valid technique for typing Staph. aureus from buffaloes and small ruminants and is a useful tool for understanding udder infection epidemiology. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Liu, Feng; Pang, Shaojun; Luo, Minbo
2016-01-01
Sargassum fusiforme (Harvey) Setchell (=Hizikia fusiformis (Harvey) Okamura) is one of the most important economic seaweeds for mariculture in China. In this study, we present the complete mitochondrial genome of S. fusiforme. The genome is 34,696 bp in length with circular organization, encoding the standard set of three ribosomal RNA genes (rRNA), 25 transfer RNA genes (tRNA), 35 protein-coding genes, and two conserved open reading frames (ORFs). Its total AT content is 62.47%, lower than other brown algae except Pylaiella littoralis. The mitogenome carries 1571 bp of intergenic region constituting 4.53% of the genome, and 13 pairs of overlapping genes with the overlap size from 1 to 90 bp. The phylogenetic analyses based on 35 protein-coding genes reveal that S. fusiforme has a closer evolutionary relationship with Sargassum muticum than Sargassum horneri, indicating Hizikia are not distinct evolutionary entity and should be reduced to synonymy with Sargassum.
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).
Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang
2016-07-01
The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Song, Y; Kato, N; Liu, C; Matsumiya, Y; Kato, H; Watanabe, K
2000-06-15
Rapid and reliable two-step multiplex polymerase chain reaction (PCR) assays were established to identify human intestinal lactobacilli; a multiplex PCR was used for grouping of lactobacilli with a mixture of group-specific primers followed by four multiplex PCR assays with four sorts of species-specific primer mixtures for identification at the species level. Primers used were designed from nucleotide sequences of the 16S-23S rRNA intergenic spacer region and its flanking 23S rRNA gene of members of the genus Lactobacillus which are commonly isolated from human stool specimens: Lactobacillus acidophilus, Lactobacillus crispatus, Lactobacillus delbrueckii (ssp. bulgaricus and ssp. lactis), Lactobacillus fermentum, Lactobacillus gasseri, Lactobacillus jensenii, Lactobacillus paracasei (ssp. paracasei and ssp. tolerans), Lactobacillus plantarum, Lactobacillus reuteri, Lactobacillus rhamnosus and Lactobacillus salivarius (ssp. salicinius and ssp. salivarius). The established two-step multiplex PCR assays were applied to the identification of 84 Lactobacillus strains isolated from human stool specimens and the PCR results were consistent with the results from the DNA-DNA hybridization assay. These results suggest that the multiplex PCR system established in this study is a simple, rapid and reliable method for the identification of common Lactobacillus isolates from human stool samples.
The conservation and signatures of lincRNAs in Marek’s disease of chicken
USDA-ARS?s Scientific Manuscript database
Long intergenic non-coding RNAs (lincRNAs) associated with a number of cancers and other diseases have been identified in mammals, but they are still formidable to be comprehensively identified and characterized. Marek’s disease (MD) is a T cell lymphoma of chickens induced by Marek’s disease virus ...
The conservation and signatures of lincRNAs in Marek’s disease of chicken
USDA-ARS?s Scientific Manuscript database
Long intergenic non-coding RNAs (lincRNAs) associated with a number of cancers and other diseases have been identified in mammals, but they are still formidable to be comprehensively identified and characterized in chicken. Marek’s disease (MD) is a T cell lymphoma of chickens induced by Marek’s dis...
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.
VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C
2015-11-26
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Zou, Cheng; Li, Jingxuan; Luo, Wenzhe; Li, Long; Hu, An; Fu, Yuhua; Hou, Ye; Li, Changchun
2017-08-18
Long intergenic non-coding RNAs (lincRNAs) play essential roles in numerous biological processes and are widely studied. The skeletal muscle is an important tissue that plays an essential role in individual movement ability. However, lincRNAs in pig skeletal muscles are largely undiscovered and their biological functions remain elusive. In this study, we assembled transcriptomes using RNA-seq data published in previous studies of our laboratory group and identified 323 lincRNAs in porcine leg muscle. We found that these lincRNAs have shorter transcript length, fewer exons and lower expression level than protein-coding genes. Gene ontology and pathway analyses indicated that many potential target genes (PTGs) of lincRNAs were involved in skeletal-muscle-related processes, such as muscle contraction and muscle system process. Combined our previous studies, we found a potential regulatory mechanism in which the promoter methylation of lincRNAs can negatively regulate lincRNA expression and then positively regulate PTG expression, which can finally result in abnormal phenotypes of cloned piglets through a certain unknown pathway. This work detailed a number of lincRNAs and their target genes involved in skeletal muscle growth and development and can facilitate future studies on their roles in skeletal muscle growth and development.
[Structural organization of 5S ribosomal DNA of Rosa rugosa].
Tynkevych, Iu O; Volkov, R A
2014-01-01
In order to clarify molecular organization of the genomic region encoding 5S rRNA in diploid species Rosa rugosa several 5S rDNA repeated units were cloned and sequenced. Analysis of the obtained sequences revealed that only one length variant of 5S rDNA repeated units, which contains intact promoter elements in the intergenic spacer region (IGS) and appears to be transcriptionally active is present in the genome. Additionally, a limited number of 5S rDNA pseudogenes lacking a portion of coding sequence and the complete IGS was detected. A high level of sequence similarity (from 93.7 to 97.5%) between the IGS of major 5S rDNA variants of East Asian R. rugosa and North American R. nitida was found indicating comparatively recent divergence of these species.
Garcia, S; Kovařík, A
2013-01-01
In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S–5.8S–26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S–18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S–5.8S–26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants. PMID:23512008
Garcia, S; Kovařík, A
2013-07-01
In higher eukaryotes, the 5S rRNA genes occur in tandem units and are arranged either separately (S-type arrangement) or linked to other repeated genes, in most cases to rDNA locus encoding 18S-5.8S-26S genes (L-type arrangement). Here we used Southern blot hybridisation, PCR and sequencing approaches to analyse genomic organisation of rRNA genes in all large gymnosperm groups, including Coniferales, Ginkgoales, Gnetales and Cycadales. The data are provided for 27 species (21 genera). The 5S units linked to the 35S rDNA units occur in some but not all Gnetales, Coniferales and in Ginkgo (∼30% of the species analysed), while the remaining exhibit separate organisation. The linked 5S rRNA genes may occur as single-copy insertions or as short tandems embedded in the 26S-18S rDNA intergenic spacer (IGS). The 5S transcript may be encoded by the same (Ginkgo, Ephedra) or opposite (Podocarpus) DNA strand as the 18S-5.8S-26S genes. In addition, pseudogenised 5S copies were also found in some IGS types. Both L- and S-type units have been largely homogenised across the genomes. Phylogenetic relationships based on the comparison of 5S coding sequences suggest that the 5S genes independently inserted IGS at least three times in the course of gymnosperm evolution. Frequent transpositions and rearrangements of basic units indicate relatively relaxed selection pressures imposed on genomic organisation of 5S genes in plants.
Walter, J.; Tannock, G. W.; Tilsala-Timisjarvi, A.; Rodtong, S.; Loach, D. M.; Munro, K.; Alatossava, T.
2000-01-01
Denaturing gradient gel electrophoresis (DGGE) of DNA fragments obtained by PCR amplification of the V2-V3 region of the 16S rRNA gene was used to detect the presence of Lactobacillus species in the stomach contents of mice. Lactobacillus isolates cultured from human and porcine gastrointestinal samples were identified to the species level by using a combination of DGGE and species-specific PCR primers that targeted 16S-23S rRNA intergenic spacer region or 16S rRNA gene sequences. The identifications obtained by this approach were confirmed by sequencing the V2-V3 region of the 16S rRNA gene and by a BLAST search of the GenBank database. PMID:10618239
Pan-cancer transcriptomic analysis associates long non-coding RNAs with key mutational driver events
Ashouri, Arghavan; Sayin, Volkan I.; Van den Eynden, Jimmy; Singh, Simranjit X.; Papagiannakopoulos, Thales; Larsson, Erik
2016-01-01
Thousands of long non-coding RNAs (lncRNAs) lie interspersed with coding genes across the genome, and a small subset has been implicated as downstream effectors in oncogenic pathways. Here we make use of transcriptome and exome sequencing data from thousands of tumours across 19 cancer types, to identify lncRNAs that are induced or repressed in relation to somatic mutations in key oncogenic driver genes. Our screen confirms known coding and non-coding effectors and also associates many new lncRNAs to relevant pathways. The associations are often highly reproducible across cancer types, and while many lncRNAs are co-expressed with their protein-coding hosts or neighbours, some are intergenic and independent. We highlight lncRNAs with possible functions downstream of the tumour suppressor TP53 and the master antioxidant transcription factor NFE2L2. Our study provides a comprehensive overview of lncRNA transcriptional alterations in relation to key driver mutational events in human cancers. PMID:28959951
Walworth, Nathan G.; Pfreundt, Ulrike; Nelson, William C.; ...
2015-04-07
Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance due to its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20-25% less than the average for other cyanobacteria and non-pathogenic, free-living bacteria. We use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus both in culture and in situ. Transcriptome analysis indicates that 86% of the non-coding spacemore » is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and two fold higher than that found in the gene dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium ncRNA secondary structures were predicted between most culture and metagenomic sequences lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Our data suggest that transposition of selfish DNA, low effective population size, and high fidelity replication allowed the unusual ‘inflation’ of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
2010-01-01
Background Natural accessions of Arabidopsis thaliana are characterized by a high level of phenotypic variation that can be used to investigate the extent and mode of selection on the primary metabolic traits. A collection of 54 A. thaliana natural accession-derived lines were subjected to deep genotyping through Single Feature Polymorphism (SFP) detection via genomic DNA hybridization to Arabidopsis Tiling 1.0 Arrays for the detection of selective sweeps, and identification of associations between sweep regions and growth-related metabolic traits. Results A total of 1,072,557 high-quality SFPs were detected and indications for 3,943 deletions and 1,007 duplications were obtained. A significantly lower than expected SFP frequency was observed in protein-, rRNA-, and tRNA-coding regions and in non-repetitive intergenic regions, while pseudogenes, transposons, and non-coding RNA genes are enriched with SFPs. Gene families involved in plant defence or in signalling were identified as highly polymorphic, while several other families including transcription factors are depleted of SFPs. 198 significant associations between metabolic genes and 9 metabolic and growth-related phenotypic traits were detected with annotation hinting at the nature of the relationship. Five significant selective sweep regions were also detected of which one associated significantly with a metabolic trait. Conclusions We generated a high density polymorphism map for 54 A. thaliana accessions that highlights the variability of resistance genes across geographic ranges and used it to identify selective sweeps and associations between metabolic genes and metabolic phenotypes. Several associations show a clear biological relationship, while many remain requiring further investigation. PMID:20302660
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.
Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel
2013-09-01
RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Etebari, Kayvan; Furlong, Michael J.; Asgari, Sassan
2015-01-01
Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells. PMID:26411386
Mizrahi-Aviv, Ela; Mills, David; Benzioni, Aliza; Bar-Zvi, Dudy
2005-03-01
Chloroplast metabolism is rapidly affected by salt stress. Photosynthesis is one of the first processes known to be affected by salinity. Here, we report that salinity inhibits chloroplast post-transcriptional RNA processing. A differentially expressed 680-bp cDNA, containing the 3' sequence of 16S rRNA, transcribed intergenic spacer, exon 1 and intron of tRNA(Ile), was isolated by differential display reverse transcriptase PCR from salt-grown jojoba (Simmondsia chinesis) shoot cultures. Northern blot analysis indicated that although most rRNA appears to be fully processed, partially processed chloroplast 16S rRNA accumulates in salt-grown cultures. Thus, salinity appears to decrease the processing of the rrn transcript. The possible effect of this decreased processing on physiological processes is, as yet, unknown.
Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng
2016-01-01
Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng
2016-01-01
Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros ‘Jinzaoshi’ were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. ‘Jinzaoshi’, support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423
Ettoumi, Besma; Chouchane, Habib; Guesmi, Amel; Mahjoubi, Mouna; Brusetti, Lorenzo; Neifar, Mohamed; Borin, Sara; Daffonchio, Daniele; Cherif, Ameur
2016-01-01
In the present study, the ecological distribution of marine Actinobacteria isolated from seamount and non-seamount stations in the Tyrrhenian Sea was investigated. A collection of 110 isolates was analyzed by Automated Ribosomal Intergenic Spacer Analysis (ARISA) and 16S rRNA gene sequencing of representatives for each ARISA haplotype (n=49). Phylogenetic analysis of 16S rRNA sequences showed a wide diversity of marine isolates and clustered the strains into 11 different genera, Janibacter, Rhodococcus, Arthrobacter, Kocuria, Dietzia, Curtobacterium, Micrococcus, Citricoccus, Brevibacterium, Brachybacterium and Nocardioides. Interestingly, Janibacter limosus was the most encountered species particularly in seamounts stations, suggesting that it represents an endemic species of this particular ecosystem. The application of BOX-PCR fingerprinting on J. limosus sub-collection (n=22), allowed their separation into seven distinct BOX-genotypes suggesting a high intraspecific microdiversity among the collection. Furthermore, by screening the biotechnological potential of selected actinobacterial strains, J. limosus was shown to exhibit the most important biosurfactant activity. Our overall data indicates that Janibacter is a major and active component of seamounts in the Tyrrhenian Sea adapted to low nutrient ecological niche. Copyright © 2016 Elsevier GmbH. All rights reserved.
Origin, evolution, and biogeography of Juglans: a phylogenetic perspective
USDA-ARS?s Scientific Manuscript database
Phylogenetic analyses of extant Juglans (Juglandaceae) using five cpDNA intergenic spacer (IGS) sequences (trnT-trnF, psbA-trnH, atpB-rbcL, trnV-16S rRNA, and trnS-trnfM) were performed to elucidate the origin, diversification, historical biogeography, and evolutionary relationships within the genus...
The cyanobiont in an Azolla fern is neither Anabaena nor Nostoc.
Baker, Judith A; Entsch, Barrie; McKay, David B
2003-12-05
The cyanobacterial symbionts in the fern Azolla have generally been ascribed to either the Anabaena or Nostoc genera. By using comparisons of the sequences of the phycocyanin intergenic spacer and a fragment of the 16S rRNA, we found that the cyanobiont from an Azolla belongs to neither of these genera.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
DOE Office of Scientific and Technical Information (OSTI.GOV)
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...
2015-11-11
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
Wang, Jinyan; Yang, Yuwen; Jin, Lamei; Ling, Xitie; Liu, Tingli; Chen, Tianzi; Ji, Yinghua; Yu, Wengui; Zhang, Baolong
2018-06-04
Long Noncoding-RNAs (LncRNAs) are known to be involved in some biological processes, but their roles in plant-virus interactions remain largely unexplored. While circular RNAs (circRNAs) have been studied in animals, there has yet to be extensive research on them in a plant system, especially in tomato-tomato yellow leaf curl virus (TYLCV) interaction. In this study, RNA transcripts from the susceptible tomato line JS-CT-9210 either infected with TYLCV or untreated, were sequenced in a pair-end strand-specific manner using ribo-zero rRNA removal library method. A total of 2056 lncRNAs including 1767 long intergenic non-coding RNA (lincRNAs) and 289 long non-coding natural antisense transcripts (lncNATs) were obtained. The expression patterns in lncRNAs were similar in susceptible tomato plants between control check (CK) and TYLCV infected samples. Our analysis suggested that lncRNAs likely played a role in a variety of functions, including plant hormone signaling, protein processing in the endoplasmic reticulum, RNA transport, ribosome function, photosynthesis, glulathione metabolism, and plant-pathogen interactions. Using virus-induced gene silencing (VIGS) analysis, we found that reduced expression of the lncRNA S-slylnc0957 resulted in enhanced resistance to TYLCV in susceptible tomato plants. Moreover, we identified 184 circRNAs candidates using the CircRNA Identifier (CIRI) software, of which 32 circRNAs were specifically expressed in untreated samples and 83 circRNAs in TYLCV samples. Approximately 62% of these circRNAs were derived from exons. We validated the circRNAs by both PCR and Sanger sequencing using divergent primers, and found that most of circRNAs were derived from the exons of protein coding genes. The silencing of these circRNAs parent genes resulted in decreased TYLCV virus accumulation. In this study, we identified novel lncRNAs and circRNAs using bioinformatic approaches and showed that these RNAs function as negative regulators of TYLCV infection. Moreover, the expression patterns of lncRNAs in susceptible tomato plants were different from that of resistant tomato plants, while exonic circRNAs expression positively associated with their respective protein coding genes. This work provides a foundation for elaborating the novel roles of lncRNAs and circRNAs in susceptible tomatoes following TYLCV infection.
Characteristics and significance of intergenic polyadenylated RNA transcription in Arabidopsis.
Moghe, Gaurav D; Lehti-Shiu, Melissa D; Seddon, Alex E; Yin, Shan; Chen, Yani; Juntawong, Piyada; Brandizzi, Federica; Bailey-Serres, Julia; Shiu, Shin-Han
2013-01-01
The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.
Jayakodi, Murukarthick; Jung, Je Won; Park, Doori; Ahn, Young-Joon; Lee, Sang-Choon; Shin, Sang-Yoon; Shin, Chanseok; Yang, Tae-Jin; Kwon, Hyung Wook
2015-09-04
Long non-coding RNAs (lncRNAs) are a class of RNAs that do not encode proteins. Recently, lncRNAs have gained special attention for their roles in various biological process and diseases. In an attempt to identify long intergenic non-coding RNAs (lincRNAs) and their possible involvement in honey bee development and diseases, we analyzed RNA-seq datasets generated from Asian honey bee (Apis cerana) and western honey bee (Apis mellifera). We identified 2470 lincRNAs with an average length of 1011 bp from A. cerana and 1514 lincRNAs with an average length of 790 bp in A. mellifera. Comparative analysis revealed that 5 % of the total lincRNAs derived from both species are unique in each species. Our comparative digital gene expression analysis revealed a high degree of tissue-specific expression among the seven major tissues of honey bee, different from mRNA expression patterns. A total of 863 (57 %) and 464 (18 %) lincRNAs showed tissue-dependent expression in A. mellifera and A. cerana, respectively, most preferentially in ovary and fat body tissues. Importantly, we identified 11 lincRNAs that are specifically regulated upon viral infection in honey bees, and 10 of them appear to play roles during infection with various viruses. This study provides the first comprehensive set of lincRNAs for honey bees and opens the door to discover lincRNAs associated with biological and hormone signaling pathways as well as various diseases of honey bee.
Benthic bacterial diversity in submerged sinkhole ecosystems.
Nold, Stephen C; Pangborn, Joseph B; Zajack, Heidi A; Kendall, Scott T; Rediske, Richard R; Biddanda, Bopaiah A
2010-01-01
Physicochemical characterization, automated ribosomal intergenic spacer analysis (ARISA) community profiling, and 16S rRNA gene sequencing approaches were used to study bacterial communities inhabiting submerged Lake Huron sinkholes inundated with hypoxic, sulfate-rich groundwater. Photosynthetic cyanobacterial mats on the sediment surface were dominated by Phormidium autumnale, while deeper, organically rich sediments contained diverse and active bacterial communities.
NASA Astrophysics Data System (ADS)
Mackiewicz, P.; Gierlik, A.; Kowalczuk, M.; Szczepanik, D.; Dudek, M. R.; Cebrat, S.
1999-12-01
We have analysed protein coding and intergenic sequences in the Borrelia burgdorferi (the Lyme disease bacterium) genome using different kinds of DNA walks. Genes occupying the leading strand of DNA have significantly different nucleotide composition from genes occupying the lagging strand. Nucleotide compositional bias of the two DNA strands reflects the aminoacid composition of proteins. 96% of genes coding for ribosomal proteins lie on the leading DNA strand, which suggests that the positions of these as well as other genes are non-random. In the B. burgdorferi genome, the asymmetry in intergenic DNA sequences is lower than the asymmetry in the third positions in codons. All these characters of the B. burgdorferi genome suggest that both replication-associated mutational pressure and recombination mechanisms have established the specific structure of the genome and now any recombination leading to inversion of a gene in respect to the direction of replication is forbidden. This property of the genome allows us to assume that it is in a steady state, which enables us to fix some parameters for simulations of DNA evolution.
Mitochondrial genome evolution in the Saccharomyces sensu stricto complex.
Ruan, Jiangxing; Cheng, Jian; Zhang, Tongcun; Jiang, Huifeng
2017-01-01
Exploring the evolutionary patterns of mitochondrial genomes is important for our understanding of the Saccharomyces sensu stricto (SSS) group, which is a model system for genomic evolution and ecological analysis. In this study, we first obtained the complete mitochondrial sequences of two important species, Saccharomyces mikatae and Saccharomyces kudriavzevii. We then compared the mitochondrial genomes in the SSS group with those of close relatives, and found that the non-coding regions evolved rapidly, including dramatic expansion of intergenic regions, fast evolution of introns and almost 20-fold higher rearrangement rates than those of the nuclear genomes. However, the coding regions, and especially the protein-coding genes, are more conserved than those in the nuclear genomes of the SSS group. The different evolutionary patterns of coding and non-coding regions in the mitochondrial and nuclear genomes may be related to the origin of the aerobic fermentation lifestyle in this group. Our analysis thus provides novel insights into the evolution of mitochondrial genomes.
Kapranov, Philipp; St Laurent, Georges; Raz, Tal; Ozsolak, Fatih; Reynolds, C Patrick; Sorensen, Poul H B; Reaman, Gregory; Milos, Patrice; Arceci, Robert J; Thompson, John F; Triche, Timothy J
2010-12-21
Discovery that the transcriptional output of the human genome is far more complex than predicted by the current set of protein-coding annotations and that most RNAs produced do not appear to encode proteins has transformed our understanding of genome complexity and suggests new paradigms of genome regulation. However, the fraction of all cellular RNA whose function we do not understand and the fraction of the genome that is utilized to produce that RNA remain controversial. This is not simply a bookkeeping issue because the degree to which this un-annotated transcription is present has important implications with respect to its biologic function and to the general architecture of genome regulation. For example, efforts to elucidate how non-coding RNAs (ncRNAs) regulate genome function will be compromised if that class of RNAs is dismissed as simply 'transcriptional noise'. We show that the relative mass of RNA whose function and/or structure we do not understand (the so called 'dark matter' RNAs), as a proportion of all non-ribosomal, non-mitochondrial human RNA (mt-RNA), can be greater than that of protein-encoding transcripts. This observation is obscured in studies that focus only on polyA-selected RNA, a method that enriches for protein coding RNAs and at the same time discards the vast majority of RNA prior to analysis. We further show the presence of a large number of very long, abundantly-transcribed regions (100's of kb) in intergenic space and further show that expression of these regions is associated with neoplastic transformation. These overlap some regions found previously in normal human embryonic tissues and raises an interesting hypothesis as to the function of these ncRNAs in both early development and neoplastic transformation. We conclude that 'dark matter' RNA can constitute the majority of non-ribosomal, non-mitochondrial-RNA and a significant fraction arises from numerous very long, intergenic transcribed regions that could be involved in neoplastic transformation.
Benthic Bacterial Diversity in Submerged Sinkhole Ecosystems▿ †
Nold, Stephen C.; Pangborn, Joseph B.; Zajack, Heidi A.; Kendall, Scott T.; Rediske, Richard R.; Biddanda, Bopaiah A.
2010-01-01
Physicochemical characterization, automated ribosomal intergenic spacer analysis (ARISA) community profiling, and 16S rRNA gene sequencing approaches were used to study bacterial communities inhabiting submerged Lake Huron sinkholes inundated with hypoxic, sulfate-rich groundwater. Photosynthetic cyanobacterial mats on the sediment surface were dominated by Phormidium autumnale, while deeper, organically rich sediments contained diverse and active bacterial communities. PMID:19880643
Inácio, Vera; Rocheta, Margarida; Morais-Cecílio, Leonor
2014-01-01
The 35S ribosomal DNA (rDNA) units, repeated in tandem at one or more chromosomal loci, are separated by an intergenic spacer (IGS) containing functional elements involved in the regulation of transcription of downstream rRNA genes. In the present work, we have compared the IGS molecular organizations in two divergent species of Fagaceae, Fagus sylvatica and Quercus suber, aiming to comprehend the evolution of the IGS sequences within the family. Self- and cross-hybridization FISH was done on representative species of the Fagaceae. The IGS length variability and the methylation level of 18 and 25S rRNA genes were assessed in representatives of three genera of this family: Fagus, Quercus and Castanea. The intergenic spacers in Beech and Cork Oak showed similar overall organizations comprising putative functional elements needed for rRNA gene activity and containing a non-transcribed spacer (NTS), a promoter region, and a 5′-external transcribed spacer. In the NTS: the sub-repeats structure in Beech is more organized than in Cork Oak, sharing some short motifs which results in the lowest sequence similarity of the entire IGS; the AT-rich region differed in both spacers by a GC-rich block inserted in Cork Oak. The 5′-ETS is the region with the higher similarity, having nonetheless different lengths. FISH with the NTS-5′-ETS revealed fainter signals in cross-hybridization in agreement with the divergence between genera. The diversity of IGS lengths revealed variants from ∼2 kb in Fagus, and Quercus up to 5.3 kb in Castanea, and a lack of correlation between the number of variants and the number of rDNA loci in several species. Methylation of 25S Bam HI site was confirmed in all species and detected for the first time in the 18S of Q. suber and Q. faginea. These results provide important clues for the evolutionary trends of the rDNA 25S-18S IGS in the Fagaceae family. PMID:24893289
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Yongyan; Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi; Ai, Zhiying
2013-10-15
Embryonic stem cells (ESCs) can proliferate indefinitely in vitro and differentiate into cells of all three germ layers. These unique properties make them exceptionally valuable for drug discovery and regenerative medicine. However, the practical application of ESCs is limited because it is difficult to derive and culture ESCs. It has been demonstrated that CHIR99021 (CHIR) promotes self-renewal and enhances the derivation efficiency of mouse (m)ESCs. However, the downstream targets of CHIR are not fully understood. In this study, we identified CHIR-regulated genes in mESCs using microarray analysis. Our microarray data demonstrated that CHIR not only influenced the Wnt/β-catenin pathway bymore » stabilizing β-catenin, but also modulated several other pluripotency-related signaling pathways such as TGF-β, Notch and MAPK signaling pathways. More detailed analysis demonstrated that CHIR inhibited Nodal signaling, while activating bone morphogenetic protein signaling in mESCs. In addition, we found that pluripotency-maintaining transcription factors were up-regulated by CHIR, while several developmental-related genes were down-regulated. Furthermore, we found that CHIR altered the expression of epigenetic regulatory genes and long intergenic non-coding RNAs. Quantitative real-time PCR results were consistent with microarray data, suggesting that CHIR alters the expression pattern of protein-encoding genes (especially transcription factors), epigenetic regulatory genes and non-coding RNAs to establish a relatively stable pluripotency-maintaining network. - Highlights: • Combined use of CHIR with LIF promotes self-renewal of J1 mESCs. • CHIR-regulated genes are involved in multiple pathways. • CHIR inhibits Nodal signaling and promotes Bmp4 expression to activate BMP signaling. • Expression of epigenetic regulatory genes and lincRNAs is altered by CHIR.« less
Dalmay, Tamas
2018-01-01
RNA interference (RNAi) is a complex and highly conserved regulatory mechanism mediated via small RNAs (sRNAs). Recent technical advances in high throughput sequencing have enabled an increasingly detailed analysis of sRNA abundances and profiles in specific body parts and tissues. This enables investigations of the localized roles of microRNAs (miRNAs) and small interfering RNAs (siRNAs). However, variation in the proportions of non-coding RNAs in the samples being compared can hinder these analyses. Specific tissues may vary significantly in the proportions of fragments of longer non-coding RNAs (such as ribosomal RNA or transfer RNA) present, potentially reflecting tissue-specific differences in biological functions. For example, in Drosophila, some tissues contain a highly abundant 30nt rRNA fragment (the 2S rRNA) as well as abundant 5’ and 3’ terminal rRNA fragments. These can pose difficulties for the construction of sRNA libraries as they can swamp the sequencing space and obscure sRNA abundances. Here we addressed this problem and present a modified “rRNA blocking” protocol for the construction of high-definition (HD) adapter sRNA libraries, in D. melanogaster reproductive tissues. The results showed that 2S rRNAs targeted by blocking oligos were reduced from >80% to < 0.01% total reads. In addition, the use of multiple rRNA blocking oligos to bind the most abundant rRNA fragments allowed us to reveal the underlying sRNA populations at increased resolution. Side-by-side comparisons of sequencing libraries of blocked and non-blocked samples revealed that rRNA blocking did not change the miRNA populations present, but instead enhanced their abundances. We suggest that this rRNA blocking procedure offers the potential to improve the in-depth analysis of differentially expressed sRNAs within and across different tissues. PMID:29474379
Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila
2010-07-16
Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Kwon, Hyuk-Sang; Yang, Eun-Hee; Yeon, Seung-Woo; Kang, Byoung-Hwa; Kim, Tae-Yong
2004-10-15
This study aimed to develop a novel multiplex polymerase chain reaction (PCR) primer set for the identification of seven probiotic Lactobacillus species such as Lactobacillus acidophilus, Lactobacillus delbrueckii, Lactobacillus casei, Lactobacillus gasseri, Lactobacillus plantarum, Lactobacillus reuteri and Lactobacillus rhamnosus. The primer set, comprising of seven specific and two conserved primers, was derived from the integrated sequences of 16S and 23S rRNA genes and their rRNA intergenic spacer region of each species. It was able to identify the seven target species with 93.6% accuracy, which exceeds that of the general biochemical methods. The phylogenetic analyses, using 16S rDNA sequences of the probiotic isolates, also provided further support that the results from the multiplex PCR assay were trustworthy. Taken together, we suggest that the multiplex primer set is an efficient tool for simple, rapid and reliable identification of seven Lactobacillus species.
2018-01-01
FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants.
Vieira, Lucas Maciel; Grativol, Clicia; Thiebaut, Flavia; Carvalho, Thais G; Hardoim, Pablo R; Hemerly, Adriana; Lifschitz, Sergio; Ferreira, Paulo Cavalcanti Gomes; Walter, Maria Emilia M T
2017-03-04
Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane ( Saccharum spp.) and in maize ( Zea mays ). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms.
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
Vieira, Lucas Maciel; Grativol, Clicia; Thiebaut, Flavia; Carvalho, Thais G.; Hardoim, Pablo R.; Hemerly, Adriana; Lifschitz, Sergio; Ferreira, Paulo Cavalcanti Gomes; Walter, Maria Emilia M. T.
2017-01-01
Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane (Saccharum spp.) and in maize (Zea mays). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms. PMID:29657283
The complete mitochondrial genome of Chrysopa pallens (Insecta, Neuroptera, Chrysopidae).
He, Kun; Chen, Zhe; Yu, Dan-Na; Zhang, Jia-Yong
2012-10-01
The complete mitochondrial genome of Chrysopa pallens (Neuroptera, Chrysopidae) was sequenced. It consists of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA (rRNA) genes, and a control region (AT-rich region). The total length of C. pallens mitogenome is 16,723 bp with 79.5% AT content, and the length of control region is 1905 bp with 89.1% AT content. The non-coding regions of C. pallens include control region between 12S rRNA and trnI genes, and a 75-bp space region between trnI and trnQ genes.
Hia, Fabian; Chionh, Yok Hian; Pang, Yan Ling Joy; DeMott, Michael S; McBee, Megan E; Dedon, Peter C
2015-03-11
A major challenge in the study of mycobacterial RNA biology is the lack of a comprehensive RNA isolation method that overcomes the unusual cell wall to faithfully yield the full spectrum of non-coding RNA (ncRNA) species. Here, we describe a simple and robust procedure optimized for the isolation of total ncRNA, including 5S, 16S and 23S ribosomal RNA (rRNA) and tRNA, from mycobacteria, using Mycobacterium bovis BCG to illustrate the method. Based on a combination of mechanical disruption and liquid and solid-phase technologies, the method produces all major species of ncRNA in high yield and with high integrity, enabling direct chemical and sequence analysis of the ncRNA species. The reproducibility of the method with BCG was evident in bioanalyzer electrophoretic analysis of isolated RNA, which revealed quantitatively significant differences in the ncRNA profiles of exponentially growing and non-replicating hypoxic bacilli. The method also overcame an historical inconsistency in 5S rRNA isolation, with direct sequencing revealing a novel post-transcriptional processing of 5S rRNA to its functional form and with chemical analysis revealing seven post-transcriptional ribonucleoside modifications in the 5S rRNA. This optimized RNA isolation procedure thus provides a means to more rigorously explore the biology of ncRNA species in mycobacteria. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Chen, Z. Jeffrey; Pikaard, Craig S.
1997-01-01
Nucleolar dominance is an epigenetic phenomenon that describes the formation of nucleoli around rRNA genes inherited from only one parent in the progeny of an interspecific hybrid. Despite numerous cytogenetic studies, little is known about nucleolar dominance at the level of rRNA gene expression in plants. We used S1 nuclease protection and primer extension assays to define nucleolar dominance at a molecular level in the plant genus Brassica. rRNA transcription start sites were mapped in three diploids and in three allotetraploids (amphidiploids) and one allohexaploid species derived from these diploid progenitors. rRNA transcripts of only one progenitor were detected in vegetative tissues of each polyploid. Dominance was independent of maternal effect, ploidy, or rRNA gene dosage. Natural and newly synthesized amphidiploids yielded the same results, arguing against substantial evolutionary effects. The hypothesis that nucleolar dominance in plants is correlated with physical characteristics of rRNA gene intergenic spacers is not supported in Brassica. Furthermore, in Brassica napus, rRNA genes silenced in vegetative tissues were found to be expressed in all floral organs, including sepals and petals, arguing against the hypothesis that passage through meiosis is needed to reactivate suppressed genes. Instead, the transition of inflorescence to floral meristem appears to be a developmental stage when silenced genes can be derepressed. PMID:9096413
Kakou, Bidénam; Angers, Bernard; Glémet, Hélène
2016-03-01
The intergenic spacer (IGS) is located between ribosomal RNA (rRNA) gene copies. Within the IGS, regulatory elements for rRNA gene transcription are found, as well as a varying number of other repetitive elements that are at the root of IGS length heterogeneity. This heterogeneity has been shown to have a functional significance through its effect on growth rate. Here, we present the structural organization of yellow perch (Perca flavescens) IGS based on its entire sequence, as well as the IGS length variation within a natural population. Yellow perch IGS structure has four discrete regions containing tandem repeat elements. For three of these regions, no specific length class was detected as allele size was seemingly normally distributed. However, for one repeat region, PCR amplification uncovered the presence of two distinctive IGS variants representing a length difference of 1116 bp. This repeat region was also devoid of any CpG sites despite a high GC content. Balanced selection may be holding the alleles in the population and would account for the high diversity of length variants observed for adjacent regions. Our study is an important precursor for further work aiming to assess the role of IGS length variation in influencing growth rate in fish.
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.
Zhang, Chun-Ting; Wang, Ju; Zhang, Ren
2002-02-01
The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F
2007-03-01
In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
Microbial Analysis of Bite Marks by Sequence Comparison of Streptococcal DNA
Kennedy, Darnell M.; Stanton, Jo-Ann L.; García, José A.; Mason, Chris; Rand, Christy J.; Kieser, Jules A.; Tompkins, Geoffrey R.
2012-01-01
Bite mark injuries often feature in violent crimes. Conventional morphometric methods for the forensic analysis of bite marks involve elements of subjective interpretation that threaten the credibility of this field. Human DNA recovered from bite marks has the highest evidentiary value, however recovery can be compromised by salivary components. This study assessed the feasibility of matching bacterial DNA sequences amplified from experimental bite marks to those obtained from the teeth responsible, with the aim of evaluating the capability of three genomic regions of streptococcal DNA to discriminate between participant samples. Bite mark and teeth swabs were collected from 16 participants. Bacterial DNA was extracted to provide the template for PCR primers specific for streptococcal 16S ribosomal RNA (16S rRNA) gene, 16S–23S intergenic spacer (ITS) and RNA polymerase beta subunit (rpoB). High throughput sequencing (GS FLX 454), followed by stringent quality filtering, generated reads from bite marks for comparison to those generated from teeth samples. For all three regions, the greatest overlaps of identical reads were between bite mark samples and the corresponding teeth samples. The average proportions of reads identical between bite mark and corresponding teeth samples were 0.31, 0.41 and 0.31, and for non-corresponding samples were 0.11, 0.20 and 0.016, for 16S rRNA, ITS and rpoB, respectively. The probabilities of correctly distinguishing matching and non-matching teeth samples were 0.92 for ITS, 0.99 for 16S rRNA and 1.0 for rpoB. These findings strongly support the tenet that bacterial DNA amplified from bite marks and teeth can provide corroborating information in the identification of assailants. PMID:23284761
Reclassification of Borrelia spp. isolated in South Korea using Multilocus Sequence Typing.
Park, Kyung-Hee; Choi, Yeon-Joo; Kim, Jeoungyeon; Park, Hye-Jin; Song, Dayoung; Jang, Won-Jong
2018-05-31
Using Borrelia isolated from South Korea, we evaluated by MLST and three intergenic genes (16S rRNA, ospA, and 5S-23S IGS) typing to analyze the relationship between host and vector and molecular background. Using the MLST analysis, we identified B. afzelii, B. yangtzensis, B. garinii, and B. bavariensis. This study was first report of the identification of B. yangtzensis using the MLST in South Korea.
Milky hemolymph syndrome (MHS) in spiny lobsters, penaeid shrimp and crabs.
Nunan, Linda M; Poulos, Bonnie T; Navarro, Solangel; Redman, Rita M; Lightner, Donald V
2010-09-02
Black tiger shrimp Penaeus monodon, European shore crab Carcinus maenas and spiny lobster Panulirus spp. can be affected by milky hemolymph syndrome (MHS). Four rickettsia-like bacteria (RLB) isolates of MHS originating from 5 geographical areas have been identified to date. The histopathology of the disease was characterized and a multiplex PCR assay was developed for detection of the 4 bacterial isolates. The 16S rRNA gene and 16-23S rRNA intergenic spacer region (ISR) were used to examine the phylogeny of the MHS isolates. Although the pathology of this disease appears similar in the various different hosts, sequencing and examination of the phylogenetic relationships reveal 4 distinct RLB involved in the infection process.
Schroeder, H; Hoeltken, A M; Fladung, M
2012-03-01
Within the genus Populus several species belonging to different sections are cross-compatible. Hence, high numbers of interspecies hybrids occur naturally and, additionally, have been artificially produced in huge breeding programmes during the last 100 years. Therefore, determination of a single poplar species, used for the production of 'multi-species hybrids' is often difficult, and represents a great challenge for the use of molecular markers in species identification. Within this study, over 20 chloroplast regions, both intergenic spacers and coding regions, have been tested for their ability to differentiate different poplar species using 23 already published barcoding primer combinations and 17 newly designed primer combinations. About half of the published barcoding primers yielded amplification products, whereas the new primers designed on the basis of the total sequenced cpDNA genome of Populus trichocarpa Torr. & Gray yielded much higher amplification success. Intergenic spacers were found to be more variable than coding regions within the genus Populus. The highest discrimination power of Populus species was found in the combination of two intergenic spacers (trnG-psbK, psbK-psbl) and the coding region rpoC. In barcoding projects, the coding regions matK and rbcL are often recommended, but within the genus Populus they only show moderate variability and are not efficient in species discrimination. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
Ferchichi, M; Valcheva, R; Prévost, H; Onno, B; Dousset, X
2008-06-01
Species-specific primers targeting the 16S-23S ribosomal DNA (rDNA) intergenic spacer region (ISR) were designed to rapidly discriminate between Lactobacillus mindensis, Lactobacillus panis, Lactobacillus paralimentarius, Lactobacillus pontis and Lactobacillus frumenti species recently isolated from French sourdough. The 16S-23S ISRs were amplified using primers 16S/p2 and 23S/p7, which anneal to positions 1388-1406 of the 16S rRNA gene and to positions 207-189 of the 23S rRNA gene respectively, Escherichia coli numbering (GenBank accession number V00331). Clone libraries of the resulting amplicons were constructed using a pCR2.1 TA cloning kit and sequenced. Species-specific primers were designed based on the sequences obtained and were used to amplify the 16S-23S ISR in the Lactobacillus species considered. For all of them, two PCR amplicons, designated as small ISR (S-ISR) and large ISR (L-ISR), were obtained. The L-ISR is composed of the corresponding S-ISR, interrupted by a sequence containing tRNA(Ile) and tRNA(Ala) genes. Based on these sequences, species-specific primers were designed and proved to identify accurately the species considered among 30 reference Lactobacillus species tested. Designed species-specific primers enable a rapid and accurate identification of L. mindensis, L. paralimentarius, L. panis, L. pontis and L. frumenti species among other lactobacilli. The proposed method provides a powerful and convenient means of rapidly identifying some sourdough lactobacilli, which could be of help in large starter culture surveys.
Korde, Asawari; Rosselot, Jessica M.; Donze, David
2014-01-01
The major function of eukaryotic RNA polymerase III is to transcribe transfer RNA, 5S ribosomal RNA, and other small non-protein-coding RNA molecules. Assembly of the RNA polymerase III complex on chromosomal DNA requires the sequential binding of transcription factor complexes TFIIIC and TFIIIB. Recent evidence has suggested that in addition to producing RNA transcripts, chromatin-assembled RNA polymerase III complexes may mediate additional nuclear functions that include chromatin boundary, nucleosome phasing, and general genome organization activities. This study provides evidence of another such “extratranscriptional” activity of assembled RNA polymerase III complexes, which is the ability to block progression of intergenic RNA polymerase II transcription. We demonstrate that the RNA polymerase III complex bound to the tRNA gene upstream of the Saccharomyces cerevisiae ATG31 gene protects the ATG31 promoter against readthrough transcriptional interference from the upstream noncoding intergenic SUT467 transcription unit. This protection is predominately mediated by binding of the TFIIIB complex. When TFIIIB binding to this tRNA gene is weakened, an extended SUT467–ATG31 readthrough transcript is produced, resulting in compromised ATG31 translation. Since the ATG31 gene product is required for autophagy, strains expressing the readthrough transcript exhibit defective autophagy induction and reduced fitness under autophagy-inducing nitrogen starvation conditions. Given the recent discovery of widespread pervasive transcription in all forms of life, protection of neighboring genes from intergenic transcriptional interference may be a key extratranscriptional function of assembled RNA polymerase III complexes and possibly other DNA binding proteins. PMID:24336746
Wen, Dong-Yue; Lin, Peng; Pang, Yu-Yan; Chen, Gang; He, Yun; Dang, Yi-Wu; Yang, Hong
2018-05-05
BACKGROUND Long non-coding RNAs (lncRNAs) have a role in physiological and pathological processes, including cancer. The aim of this study was to investigate the expression of the long intergenic non-protein coding RNA 665 (LINC00665) gene and the cell cycle in hepatocellular carcinoma (HCC) using database analysis including The Cancer Genome Atlas (TCGA), the Gene Expression Omnibus (GEO), and quantitative real-time polymerase chain reaction (qPCR). MATERIAL AND METHODS Expression levels of LINC00665 were compared between human tissue samples of HCC and adjacent normal liver, clinicopathological correlations were made using TCGA and the GEO, and qPCR was performed to validate the findings. Other public databases were searched for other genes associated with LINC00665 expression, including The Atlas of Noncoding RNAs in Cancer (TANRIC), the Multi Experiment Matrix (MEM), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and protein-protein interaction (PPI) networks. RESULTS Overexpression of LINC00665 in patients with HCC was significantly associated with gender, tumor grade, stage, and tumor cell type. Overexpression of LINC00665 in patients with HCC was significantly associated with overall survival (OS) (HR=1.47795%; CI: 1.046-2.086). Bioinformatics analysis identified 469 related genes and further analysis supported a hypothesis that LINC00665 regulates pathways in the cell cycle to facilitate the development and progression of HCC through ten identified core genes: CDK1, BUB1B, BUB1, PLK1, CCNB2, CCNB1, CDC20, ESPL1, MAD2L1, and CCNA2. CONCLUSIONS Overexpression of the lncRNA, LINC00665 may be involved in the regulation of cell cycle pathways in HCC through ten identified hub genes.
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach
Laurent, Georges St.; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J.L.; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R.R.; Nicolas, Estelle; McCaffrey, Timothy A.; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp
2016-01-01
Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlincRNAs genes likely function in cis to activate nearby genes. This effect while most pronounced in closely spaced vlincRNA–gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlincRNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. PMID:27001520
2010-01-01
Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
Arabidopsis Chloroplast Mini-Ribonuclease III Participates in rRNA Maturation and Intron Recycling
Hotto, Amber M.; Castandet, Benoît; Gilet, Laetitia; Higdon, Andrea; Condon, Ciarán; Stern, David B.
2015-01-01
RNase III proteins recognize double-stranded RNA structures and catalyze endoribonucleolytic cleavages that often regulate gene expression. Here, we characterize the functions of RNC3 and RNC4, two Arabidopsis thaliana chloroplast Mini-RNase III-like enzymes sharing 75% amino acid sequence identity. Whereas rnc3 and rnc4 null mutants have no visible phenotype, rnc3/rnc4 (rnc3/4) double mutants are slightly smaller and chlorotic compared with the wild type. In Bacillus subtilis, the RNase Mini-III is integral to 23S rRNA maturation. In Arabidopsis, we observed imprecise maturation of 23S rRNA in the rnc3/4 double mutant, suggesting that exoribonucleases generated staggered ends in the absence of specific Mini-III-catalyzed cleavages. A similar phenotype was found at the 3′ end of the 16S rRNA, and the primary 4.5S rRNA transcript contained 3′ extensions, suggesting that Mini-III catalyzes several processing events of the polycistronic rRNA precursor. The rnc3/4 mutant showed overaccumulation of a noncoding RNA complementary to the 4.5S-5S rRNA intergenic region, and its presence correlated with that of the extended 4.5S rRNA precursor. Finally, we found rnc3/4-specific intron degradation intermediates that are probable substrates for Mini-III and show that B. subtilis Mini-III is also involved in intron regulation. Overall, this study extends our knowledge of the key role of Mini-III in intron and noncoding RNA regulation and provides important insight into plastid rRNA maturation. PMID:25724636
Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.
2000-01-01
In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Competing endogenous RNA network crosstalk reveals novel molecular markers in colorectal cancer.
Samir, Nehal; Matboli, Marwa; El-Tayeb, Hanaa; El-Tawdi, Ahmed; Hassan, Mohmed K; Waly, Amr; El-Akkad, Hesham A E; Ramadan, Mohamed G; Al-Belkini, Tarek N; El-Khamisy, Sherif; El-Asmar, Farid
2018-05-08
The competing endogenous RNA networks play a pivotal role in cancer diagnosis and progression. Novel properstrategies for early detection of colorectal cancer (CRC) are strongly needed. We investigated a novel CRC-specific RNA-based integrated competing endogenous network composed of lethal3 malignant brain tumor like1 (L3MBTL1) gene, long non-coding intergenic RNA- (lncRNA RP11-909B2.1) and homo sapiens microRNA-595 (hsa-miRNA-595) using in silico data analysis. RT-qPCR-based validation of the network was achieved in serum of 70 patients with CRC, 40 patients with benign colorectal neoplasm, and 20 healthy controls. Moreover, in cancer tissues of 20 of the 70 CRC cases were involved in the study. The expression of RNA-based biomarker network in both CRC and adjacent non-tumor tissues and their correlation with the serum levels of this network members was investigated. Lastly, the expression levels of the chosen ceRNA was verified in CRC cell line. Our results revealed that the three RNAs-based biomarker network (long non-coding intergenic RNA-[lncRNA RP11-909B2.1], Homo sapiens microRNA-595 [hsa-miRNA-595], and L3MBTL1 mRNA), had high sensitivity and specificity for discriminating CRC from healthy controls and also from benign colorectal neoplasm. The data suggest that among these three RNAs, serum lncRNA RP11-909B2.1 could be a promising independent prognostic factors in CRC. The circulatory RNA based biomarker panel can act as potential biomarker for CRC diagnosis and prognosis. © 2018 Wiley Periodicals, Inc.
Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria.
Thorpe, Harry A; Bayliss, Sion C; Sheppard, Samuel K; Feil, Edward J
2018-04-01
The concept of the "pan-genome," which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent ("switched") intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb).
Homolka, David; Ivanek, Robert; Forejt, Jiri; Jansa, Petr
2011-02-14
Tight regulation of testicular gene expression is a prerequisite for male reproductive success, while differentiation of gene activity in spermatogenesis is important during speciation. Thus, comparison of testicular transcriptomes between closely related species can reveal unique regulatory patterns and shed light on evolutionary constraints separating the species. Here, we compared testicular transcriptomes of two closely related mouse species, Mus musculus and Mus spretus, which diverged more than one million years ago. We analyzed testicular expression using tiling arrays overlapping Chromosomes 2, X, Y and mitochondrial genome. An excess of differentially regulated non-coding RNAs was found on Chromosome 2 including the intronic antisense RNAs, intergenic RNAs and premature forms of Piwi-interacting RNAs (piRNAs). Moreover, striking difference was found in the expression of X-linked G6pdx gene, the parental gene of the autosomal retrogene G6pd2. The prevalence of non-coding RNAs among differentially expressed transcripts indicates their role in species-specific regulation of spermatogenesis. The postmeiotic expression of G6pdx in Mus spretus points towards the continuous evolution of X-chromosome silencing and provides an example of expression change accompanying the out-of-the X-chromosomal retroposition.
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach.
St Laurent, Georges; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J L; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R R; Nicolas, Estelle; McCaffrey, Timothy A; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp
2016-04-20
Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
HOTAIR: An Oncogenic Long Non-Coding RNA in Human Cancer.
Tang, Qing; Hann, Swei Sunny
2018-05-24
Long non-coding RNAs (LncRNAs) represent a novel class of noncoding RNAs that are longer than 200 nucleotides without protein-coding potential and function as novel master regulators in various human diseases, including cancer. Accumulating evidence shows that lncRNAs are dysregulated and implicated in various aspects of cellular homeostasis, such as proliferation, apoptosis, mobility, invasion, metastasis, chromatin remodeling, gene transcription, and post-transcriptional processing. However, the mechanisms by which lncRNAs regulate various biological functions in human diseases have yet to be determined. HOX antisense intergenic RNA (HOTAIR) is a recently discovered lncRNA and plays a critical role in various areas of cancer, such as proliferation, survival, migration, drug resistance, and genomic stability. In this review, we briefly introduce the concept, identification, and biological functions of HOTAIR. We then describe the involvement of HOTAIR that has been associated with tumorigenesis, growth, invasion, cancer stem cell differentiation, metastasis, and drug resistance in cancer. We also discuss emerging insights into the role of HOTAIR as potential biomarkers and therapeutic targets for novel treatment paradigms in cancer. © 2018 The Author(s). Published by S. Karger AG, Basel.
Kouvelis, Vassili N; Ghikas, Dimitri V; Typas, Milton A
2004-10-01
The mitochondrial genome (mtDNA) of the entomopathogenic fungus Lecanicillium muscarium (synonym Verticillium lecanii) with a total size of 24,499-bp has been analyzed. So far, it is the smallest known mitochondrial genome among Pezizomycotina, with an extremely compact gene organization and only one group-I intron in its large ribosomal RNA (rnl) gene. It contains the 14 typical genes coding for proteins related to oxidative phosphorylation, the two rRNA genes, one intronic ORF coding for a possible ribosomal protein (rps), and a set of 25 tRNA genes which recognize codons for all amino acids, except alanine and cysteine. All genes are transcribed from the same DNA strand. Gene order comparison with all available complete fungal mtDNAs-representatives of all four Phyla are included-revealed some characteristic common features like uninterrupted gene pairs, overlapping genes, and extremely variable intergenic regions, that can all be exploited for the study of fungal mitochondrial genomes. Moreover, a minimum common mtDNA gene order could be detected, in two units, for all known Sordariomycetes namely nad1-nad4-atp8-atp6 and rns-cox3-rnl, which can be extended in Hypocreales, to nad4L-nad5-cob-cox1-nad1-nad4-atp8-atp6 and rns-cox3-rnl nad2-nad3, respectively. Phylogenetic analysis of all fungal mtDNA essential protein-coding genes as one unit, clearly demonstrated the superiority of small genome (mtDNA) over single gene comparisons.
2010-01-01
Background The identification of non-coding transcripts in human, mouse, and Escherichia coli has revealed their widespread occurrence and functional importance in both eukaryotic and prokaryotic life. In prokaryotes, studies have shown that non-coding transcripts participate in a broad range of cellular functions like gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Streptococcus pneumoniae (pneumococcus), an obligate human respiratory pathogen responsible for significant worldwide morbidity and mortality. Tiling microarrays enable genome wide mRNA profiling as well as identification of novel transcripts at a high-resolution. Results Here, we describe a high-resolution transcription map of the S. pneumoniae clinical isolate TIGR4 using genomic tiling arrays. Our results indicate that approximately 66% of the genome is expressed under our experimental conditions. We identified a total of 50 non-coding small RNAs (sRNAs) from the intergenic regions, of which 36 had no predicted function. Half of the identified sRNA sequences were found to be unique to S. pneumoniae genome. We identified eight overrepresented sequence motifs among sRNA sequences that correspond to sRNAs in different functional categories. Tiling arrays also identified approximately 202 operon structures in the genome. Conclusions In summary, the pneumococcal operon structures and novel sRNAs identified in this study enhance our understanding of the complexity and extent of the pneumococcal 'expressed' genome. Furthermore, the results of this study open up new avenues of research for understanding the complex RNA regulatory network governing S. pneumoniae physiology and virulence. PMID:20525227
Veldman, G M; Klootwijk, J; van Heerikhuizen, H; Planta, R J
1981-01-01
We have determined the nucleotide sequence of part of a cloned yeast ribosomal RNA operon extending from the 5.8S RNA gene downstream into the 5' -terminal region of the 26S RNA gene. We mapped the pertinent processing sites, viz. the 5' end of 26S rRNA and the 3'ends of 5.8S rRNA and its immediate precursor, 7S RNA. At the 3' end of 7S RNA we find the sequence UCGUUU which is very similar to the type I consensus sequence UCAUUA/U present at the 3' ends of 17S, 5.8S and 26S rRNA as well as 18S precursor rRNA in yeast. At the 5' end of the 26S RNA gene we find a sequence of thirteen nucleotides which is homologous to the type II sequence present at the 5' termini of both the 17S and the 5.8S RNA gene. These findings further support the suggestion put forward earlier (G.M. Veldman et al. (1980) Nucl. Acids Res. 8, 2907-2920) that both consensus sequences are involved in the recognition of precursor rRNA by the processing nuclease(s). We discuss a model for the processing of yeast rRNA in which a processing enzyme sequentially recognizes several combinations of a type I and a type II consensus sequence. We also describe the existence of a significant base complementarity between sequences in the 5' -terminal region of 26S rRNA and the 3' -terminal region of 5.8S rRNA. We suggest that base pairing between these sequences contributes to the binding between 5.8S and 26S rRNA. Images PMID:7312619
Cho, Otomi; Sugita, Takashi
2016-12-01
As DNA sequences of the intergenic spacer (IGS) region in the rRNA gene show remarkable intraspecies diversity compared with the small subunit, large subunit, and internal transcribed spacer region, the IGS region has been used as an epidemiological tool in studies on Malassezia globosa and M. restricta, which are responsible for the exacerbation of atopic dermatitis (AD) and seborrheic dermatitis (SD). However, the IGS regions of M. sympodialis and M. dermatis obtained from the skin of patients with AD and SD, as well as healthy subjects, lacked sequence diversity. Of the 105 M. sympodialis strains and the 40 M. dermatis strains, the sequences of 103 (98.1 %) and 39 (97.5 %), respectively, were identical. Thus, given the lack of intraspecies diversity in the IGS regions of M. sympodialis and M. dermatis, studies of the diversity of these species should be performed using appropriate genes and not the IGS.
Muller, Laura K.; Lorch, Jeffrey M.; Lindner, Daniel L.; O'Connor, Michael; Gargas, Andrea; Blehert, David S.
2013-01-01
The fungus Geomyces destructans is the causative agent of white-nose syndrome (WNS), a disease that has killed millions of North American hibernating bats. We describe a real-time TaqMan PCR test that detects DNA from G. destructans by targeting a portion of the multicopy intergenic spacer region of the rRNA gene complex. The test is highly sensitive, consistently detecting as little as 3.3 fg of genomic DNA from G. destructans. The real-time PCR test specifically amplified genomic DNA from G. destructans but did not amplify target sequence from 54 closely related fungal isolates (including 43 Geomyces spp. isolates) associated with bats. The test was further qualified by analyzing DNA extracted from 91 bat wing skin samples, and PCR results matched histopathology findings. These data indicate the real-time TaqMan PCR method described herein is a sensitive, specific, and rapid test to detect DNA from G. destructans and provides a valuable tool for WNS diagnostics and research.
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.
Borodovsky, M; Rudd, K E; Koonin, E V
1994-01-01
The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. Images PMID:7984428
Chalker, Victoria J; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe; Erles, Kerstin
2012-06-01
The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010.
Chalker, Victoria J.; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe
2012-01-01
The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010. PMID:22495558
The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer.
Chakravarty, Dimple; Sboner, Andrea; Nair, Sujit S; Giannopoulou, Eugenia; Li, Ruohan; Hennig, Sven; Mosquera, Juan Miguel; Pauwels, Jonathan; Park, Kyung; Kossai, Myriam; MacDonald, Theresa Y; Fontugne, Jacqueline; Erho, Nicholas; Vergara, Ismael A; Ghadessi, Mercedeh; Davicioni, Elai; Jenkins, Robert B; Palanisamy, Nallasivam; Chen, Zhengming; Nakagawa, Shinichi; Hirose, Tetsuro; Bander, Neil H; Beltran, Himisha; Fox, Archa H; Elemento, Olivier; Rubin, Mark A
2014-11-21
The androgen receptor (AR) plays a central role in establishing an oncogenic cascade that drives prostate cancer progression. Some prostate cancers escape androgen dependence and are often associated with an aggressive phenotype. The oestrogen receptor alpha (ERα) is expressed in prostate cancers, independent of AR status. However, the role of ERα remains elusive. Using a combination of chromatin immunoprecipitation (ChIP) and RNA-sequencing data, we identified an ERα-specific non-coding transcriptome signature. Among putatively ERα-regulated intergenic long non-coding RNAs (lncRNAs), we identified nuclear enriched abundant transcript 1 (NEAT1) as the most significantly overexpressed lncRNA in prostate cancer. Analysis of two large clinical cohorts also revealed that NEAT1 expression is associated with prostate cancer progression. Prostate cancer cells expressing high levels of NEAT1 were recalcitrant to androgen or AR antagonists. Finally, we provide evidence that NEAT1 drives oncogenic growth by altering the epigenetic landscape of target gene promoters to favour transcription.
The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer
Chakravarty, Dimple; Sboner, Andrea; Nair, Sujit S.; Giannopoulou, Eugenia; Li, Ruohan; Hennig, Sven; Mosquera, Juan Miguel; Pauwels, Jonathan; Park, Kyung; Kossai, Myriam; MacDonald, Theresa Y.; Fontugne, Jacqueline; Erho, Nicholas; Vergara, Ismael A.; Ghadessi, Mercedeh; Davicioni, Elai; Jenkins, Robert B.; Palanisamy, Nallasivam; Chen, Zhengming; Nakagawa, Shinichi; Hirose, Tetsuro; Bander, Neil H.; Beltran, Himisha; Fox, Archa H.; Elemento, Olivier; Rubin, Mark A.
2014-01-01
The androgen receptor (AR) plays a central role in establishing an oncogenic cascade that drives prostate cancer progression. Some prostate cancers escape androgen dependence and are often associated with an aggressive phenotype. The oestrogen receptor alpha (ERα) is expressed in prostate cancers, independent of AR status. However, the role of ERα remains elusive. Using a combination of chromatin immunoprecipitation (ChIP) and RNA-sequencing data, we identified an ERα-specific non-coding transcriptome signature. Among putatively ERα-regulated intergenic long non-coding RNAs (lncRNAs), we identified nuclear enriched abundant transcript 1 (NEAT1) as the most significantly overexpressed lncRNA in prostate cancer. Analysis of two large clinical cohorts also revealed that NEAT1 expression is associated with prostate cancer progression. Prostate cancer cells expressing high levels of NEAT1 were recalcitrant to androgen or AR antagonists. Finally, we provide evidence that NEAT1 drives oncogenic growth by altering the epigenetic landscape of target gene promoters to favour transcription. PMID:25415230
Bråte, Jon; Adamski, Marcin; Neumann, Ralf S; Shalchian-Tabrizi, Kamran; Adamska, Maja
2015-12-22
Long non-coding RNAs (lncRNAs) play important regulatory roles during animal development, and it has been hypothesized that an RNA-based gene regulation was important for the evolution of developmental complexity in animals. However, most studies of lncRNA gene regulation have been performed using model animal species, and very little is known about this type of gene regulation in non-bilaterians. We have therefore analysed RNA-Seq data derived from a comprehensive set of embryogenesis stages in the calcareous sponge Sycon ciliatum and identified hundreds of developmentally expressed intergenic lncRNAs (lincRNAs) in this species. In situ hybridization of selected lincRNAs revealed dynamic spatial and temporal expression during embryonic development. More than 600 lincRNAs constitute integral parts of differentially expressed gene modules, which also contain known developmental regulatory genes, e.g. transcription factors and signalling molecules. This study provides insights into the non-coding gene repertoire of one of the earliest evolved animal lineages, and suggests that RNA-based gene regulation was probably present in the last common ancestor of animals. © 2015 The Authors.
Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era
2014-01-01
Background Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462
Yeates, Christine; Saunders, Aaron M; Crocetti, Gregory R; Blackall, Linda L
2003-05-01
The 23S rRNA-targeted probes GAM42a and BET42a provided equivocal results with the uncultured gammaproteobacterium 'Candidatus Competibacter phosphatis' where some cells bound GAM42a and other cells bound BET42a in fluorescence in situ hybridization (FISH) experiments. Probes GAM42a and BET42a span positions 1027-1043 in the 23S rRNA and differ from each other by one nucleotide at position 1033. Clone libraries were prepared from PCR products spanning the 16S rRNA genes, intergenic spacer region and 23S rRNA genes from two mixed cultures enriched in 'Candidatus C. phosphatis'. With individual clone inserts, the 16S rDNA portion was used to confirm the source organism as 'Candidatus C. phosphatis' and the 23S rDNA portion was used to determine the sequence of the GAM42a/BET42a probe target region. Of the 19 clones sequenced, 8 had the GAM42a probe target (T at position 1033) and 11 had G at position 1033, the only mismatch with GAM42a. However, none of the clones had the BET42a probe target (A at 1033). Non-canonical base-pairing between the 23S rRNA of 'Candidatus C. phosphatis' with G at position 1033 and GAM42a (G-A) or BET42a (G-T) is likely to explain the probing anomalies. A probe (GAM42_C1033) was optimized for use in FISH, targeting cells with G at position 1033, and was found to highlight not only some 'Candidatus C. phosphatis' cells, but also other bacteria. This demonstrates that there are bacteria in addition to 'Candidatus C. phosphatis' with the GAM42_C1033 probe target and not the BET42a or GAM42a probe target.
Junk DNA and the long non-coding RNA twist in cancer genetics
Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A
2015-01-01
The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNA in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNA, often via interaction with proteins, functions in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphism (SNP) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839
Bacteremia due to Moraxella atlantae in a cancer patient.
De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario
2002-07-01
A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the MORAXELLACEAE: After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed.
Bacteremia Due to Moraxella atlantae in a Cancer Patient
De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario
2002-01-01
A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the Moraxellaceae. After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed. PMID:12089312
Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod
2016-04-01
Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Homolka, David; Ivanek, Robert; Forejt, Jiri; Jansa, Petr
2011-01-01
Background Tight regulation of testicular gene expression is a prerequisite for male reproductive success, while differentiation of gene activity in spermatogenesis is important during speciation. Thus, comparison of testicular transcriptomes between closely related species can reveal unique regulatory patterns and shed light on evolutionary constraints separating the species. Methodology/Principal Findings Here, we compared testicular transcriptomes of two closely related mouse species, Mus musculus and Mus spretus, which diverged more than one million years ago. We analyzed testicular expression using tiling arrays overlapping Chromosomes 2, X, Y and mitochondrial genome. An excess of differentially regulated non-coding RNAs was found on Chromosome 2 including the intronic antisense RNAs, intergenic RNAs and premature forms of Piwi-interacting RNAs (piRNAs). Moreover, striking difference was found in the expression of X-linked G6pdx gene, the parental gene of the autosomal retrogene G6pd2. Conclusions/Significance The prevalence of non-coding RNAs among differentially expressed transcripts indicates their role in species-specific regulation of spermatogenesis. The postmeiotic expression of G6pdx in Mus spretus points towards the continuous evolution of X-chromosome silencing and provides an example of expression change accompanying the out-of-the X-chromosomal retroposition. PMID:21347268
del Val, Coral; Rivas, Elena; Torres-Quesada, Omar; Toro, Nicolás; Jiménez-Zurdo, José I
2007-01-01
Bacterial small non-coding RNAs (sRNAs) are being recognized as novel widespread regulators of gene expression in response to environmental signals. Here, we present the first search for sRNA-encoding genes in the nitrogen-fixing endosymbiont Sinorhizobium meliloti, performed by a genome-wide computational analysis of its intergenic regions. Comparative sequence data from eight related α-proteobacteria were obtained, and the interspecies pairwise alignments were scored with the programs eQRNA and RNAz as complementary predictive tools to identify conserved and stable secondary structures corresponding to putative non-coding RNAs. Northern experiments confirmed that eight of the predicted loci, selected among the original 32 candidates as most probable sRNA genes, expressed small transcripts. This result supports the combined use of eQRNA and RNAz as a robust strategy to identify novel sRNAs in bacteria. Furthermore, seven of the transcripts accumulated differentially in free-living and symbiotic conditions. Experimental mapping of the 5′-ends of the detected transcripts revealed that their encoding genes are organized in autonomous transcription units with recognizable promoter and, in most cases, termination signatures. These findings suggest novel regulatory functions for sRNAs related to the interactions of α-proteobacteria with their eukaryotic hosts. PMID:17971083
Stranded Whole Transcriptome RNA-Seq for All RNA Types
Yan, Pearlly X.; Fang, Fang; Buechlein, Aaron; Ford, James B.; Tang, Haixu; Huang, Tim H.; Burow, Matthew E.; Liu, Yunlong; Rusch, Douglas B.
2015-01-01
Stranded whole transcriptome RNA-Seq described in this unit captures quantitative expression data for all types of RNA including, but not limited to miRNA (microRNA), piRNA (Piwi-interacting RNA), snoRNA (small nucleolar RNA), lincRNA (large non-coding intergenic RNA), SRP RNA (signal recognition particle RNA), tRNA (transfer RNA), mtRNA (mitochondrial RNA) and mRNA (messenger RNA). The size and nature of these types of RNA are irrelevant to the approach described here. Barcoded libraries for multiplexing on the Illumina platform are generated with this approach but it can be applied to other platforms with a few modifications. PMID:25599667
Insertion of a self-splicing intron into the mtDNA of atriploblastic animal
DOE Office of Scientific and Technical Information (OSTI.GOV)
Valles, Y.; Halanych, K.; Boore, J.L.
2006-04-14
Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must have an intron to be maintained in an environment where 'extra chromosomes' are usually selected against?« less
Petrova, Olga E.; Garcia-Alcalde, Fernando; Zampaloni, Claudia; Sauer, Karin
2017-01-01
Global transcriptomic analysis via RNA-seq is often hampered by the high abundance of ribosomal (r)RNA in bacterial cells. To remove rRNA and enrich coding sequences, subtractive hybridization procedures have become the approach of choice prior to RNA-seq, with their efficiency varying in a manner dependent on sample type and composition. Yet, despite an increasing number of RNA-seq studies, comparative evaluation of bacterial rRNA depletion methods has remained limited. Moreover, no such study has utilized RNA derived from bacterial biofilms, which have potentially higher rRNA:mRNA ratios and higher rRNA carryover during RNA-seq analysis. Presently, we evaluated the efficiency of three subtractive hybridization-based kits in depleting rRNA from samples derived from biofilm, as well as planktonic cells of the opportunistic human pathogen Pseudomonas aeruginosa. Our results indicated different rRNA removal efficiency for the three procedures, with the Ribo-Zero kit yielding the highest degree of rRNA depletion, which translated into enhanced enrichment of non-rRNA transcripts and increased depth of RNA-seq coverage. The results indicated that, in addition to improving RNA-seq sensitivity, efficient rRNA removal enhanced detection of low abundance transcripts via qPCR. Finally, we demonstrate that the Ribo-Zero kit also exhibited the highest efficiency when P. aeruginosa/Staphylococcus aureus co-culture RNA samples were tested. PMID:28117413
Comparison of simple sequence repeats in 19 Archaea.
Trivedi, S
2006-12-05
All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
Rapid Differentiation and In Situ Detection of 16 Sourdough Lactobacillus Species by Multiplex PCR
Settanni, Luca; van Sinderen, Douwe; Rossi, Jone; Corsetti, Aldo
2005-01-01
A two-step multiplex PCR-based method was designed for the rapid detection of 16 species of lactobacilli known to be commonly present in sourdough. The first step of multiplex PCR was developed with a mixture of group-specific primers, while the second step included three multiplex PCR assays with a mixture of species-specific primers. Primers were derived from sequences that specify the 16S rRNA, the 16S-23S rRNA intergenic spacer region, and part of the 23S rRNA gene. The primer pairs designed were shown to exclusively amplify the targeted rrn operon fragment of the corresponding species. Due to the reliability of simultaneously identifying Lactobacillus plantarum, Lactobacillus pentosus, and Lactobacillus paraplantarum, a previously described multiplex PCR method employing recA gene-derived primers was included in the multiplex PCR system. The combination of a newly developed, quick bacterial DNA extraction method from sourdough and this multiplex PCR assay allows the rapid in situ detection of several sourdough-associated lactobacilli, including the recently described species Lactobacillus rossii, and thus represents a very useful alternative to culture-based methodologies. PMID:15933001
Diversity and evolution of the emerging Pandoraviridae family.
Legendre, Matthieu; Fabre, Elisabeth; Poirot, Olivier; Jeudy, Sandra; Lartigue, Audrey; Alempic, Jean-Marie; Beucher, Laure; Philippe, Nadège; Bertaux, Lionel; Christo-Foroux, Eugène; Labadie, Karine; Couté, Yohann; Abergel, Chantal; Claverie, Jean-Michel
2018-06-11
With DNA genomes reaching 2.5 Mb packed in particles of bacterium-like shape and dimension, the first two Acanthamoeba-infecting pandoraviruses remained up to now the most complex viruses since their discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes.
Hah, Nasun; Danko, Charles G.; Core, Leighton; Waterfall, Joshua J.; Siepel, Adam; Lis, John T.; Kraus, W. Lee
2011-01-01
Summary We report the immediate effects of estrogen signaling on the transcriptome of breast cancer cells using Global Run-On and sequencing (GRO-seq). The data were analyzed using a new bioinformatic approach that allowed us to identify transcripts directly from the GRO-seq data. We found that estrogen signaling directly regulates a strikingly large fraction of the transcriptome in a rapid, robust, and unexpectedly transient manner. In addition to protein coding genes, estrogen regulates the distribution and activity of all three RNA polymerases, and virtually every class of non-coding RNA that has been described to date. We also identified a large number of previously undetected estrogen-regulated intergenic transcripts, many of which are found proximal to estrogen receptor binding sites. Collectively, our results provide the most comprehensive measurement of the primary and immediate estrogen effects to date and a resource for understanding rapid signal-dependent transcription in other systems. PMID:21549415
Explaining the disease phenotype of intergenic SNP through predicted long range regulation
Chen, Jingqi; Tian, Weidong
2016-01-01
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978
Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.
2017-01-01
Abstract The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401
Leloire, Audrey; Dhennin, Véronique; Coumoul, Xavier; Yengo, Loïc; Froguel, Philippe
2017-01-01
Bisphenol A (BPA) exposure has been suspected to be associated with deleterious effects on health including obesity and metabolically-linked diseases. Although bisphenols F (BPF) and S (BPS) are BPA structural analogs commonly used in many marketed products as a replacement for BPA, only sparse toxicological data are available yet. Our objective was to comprehensively characterize bisphenols gene targets in a human primary adipocyte model, in order to determine whether they may induce cellular dysfunction, using chronic exposure at two concentrations: a “low-dose” similar to the dose usually encountered in human biological fluids and a higher dose. Therefore, BPA, BPF and BPS have been added at 10 nM or 10 μM during the differentiation of human primary adipocytes from subcutaneous fat of three non-diabetic Caucasian female patients. Gene expression (mRNA/lncRNA) arrays and microRNA arrays, have been used to assess coding and non-coding RNA changes. We detected significantly deregulated mRNA/lncRNA and miRNA at low and high doses. Enrichment in “cancer” and “organismal injury and abnormalities” related pathways was found in response to the three products. Some long intergenic non-coding RNAs and small nucleolar RNAs were differentially expressed suggesting that bisphenols may also activate multiple cellular processes and epigenetic modifications. The analysis of upstream regulators of deregulated genes highlighted hormones or hormone-like chemicals suggesting that BPS and BPF can be suspected to interfere, just like BPA, with hormonal regulation and have to be considered as endocrine disruptors. All these results suggest that as BPA, its substitutes BPS and BPF should be used with the same restrictions. PMID:28628672
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sugai, Akihiro; Sato, Hiroki; Yoneda, Misako; Kai, Chieko
2017-08-01
The regulation of transcription during Nipah virus (NiV) replication is poorly understood. Using a bicistronic minigenome system, we investigated the involvement of non-coding regions (NCRs) in the transcriptional re-initiation efficiency of NiV RNA polymerase. Reporter assays revealed that attenuation of NiV gene expression was not constant at each gene junction, and that the attenuating property was controlled by the 3' NCR. However, this regulation was independent of the gene-end, gene-start and intergenic regions. Northern blot analysis indicated that regulation of viral gene expression by the phosphoprotein (P) and large protein (L) 3' NCRs occurred at the transcription level. We identified uridine-rich tracts within the L 3' NCR that are similar to gene-end signals. These gene-end-like sequences were recognized as weak transcription termination signals by the viral RNA polymerase, thereby reducing downstream gene transcription. Thus, we suggest that NiV has a unique mechanism of transcriptional regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Zhang, Zeng-Wang; Chen, Jia-Jun; Xia, Shi-Hui; Zhao, Hua; Yang, Jun-Bo; Zhang, Hao; He, Bin; Jiao, Jun; Zhan, Bo-Tao; Sun, Cheng-Cao
2018-04-15
Growing evidence shows that long non-coding RNAs (lncRNAs) have been wildly verified to modulate multiple tumorigenesis, especially lung adenocarcinoma. In present study, we aim to investigate the role of lncRNA LINC00319 in the lung adenocarcinoma carcinogenesis. We observed that increased expression of LINC00319 in lung adenocarcinoma tissues and cells in comparison to their corresponding controls. Moreover, the aberrant overexpression of LINC00319 indicated the poor prognosis of lung adenocarcinoma patients. Silence of LINC00319 was able to repress lung adenocarcinoma cell growth in vitro. Rescue assay was performed to further confirm that LINC00319 contributed to lung adenocarcinoma progression by regulating miR-450b-5p/EZH2 signal pathway. Taken together, our study discovered the oncogenic role of LINC00319 in clinical specimens and cellular experiments, showing the potential LINC00319/miR-450b-5p/EZH2 pathway. This results and findings provide a novel insight for lung adenocarcinoma tumorigenesis. Copyright © 2018. Published by Elsevier B.V.
Variation of 45S rDNA intergenic spacers in Arabidopsis thaliana.
Havlová, Kateřina; Dvořáčková, Martina; Peiro, Ramon; Abia, David; Mozgová, Iva; Vansáčová, Lenka; Gutierrez, Crisanto; Fajkus, Jiří
2016-11-01
Approximately seven hundred 45S rRNA genes (rDNA) in the Arabidopsis thaliana genome are organised in two 4 Mbp-long arrays of tandem repeats arranged in head-to-tail fashion separated by an intergenic spacer (IGS). These arrays make up 5 % of the A. thaliana genome. IGS are rapidly evolving sequences and frequent rearrangements inside the rDNA loci have generated considerable interspecific and even intra-individual variability which allows to distinguish among otherwise highly conserved rRNA genes. The IGS has not been comprehensively described despite its potential importance in regulation of rDNA transcription and replication. Here we describe the detailed sequence variation in the complete IGS of A. thaliana WT plants and provide the reference/consensus IGS sequence, as well as genomic DNA analysis. We further investigate mutants dysfunctional in chromatin assembly factor-1 (CAF-1) (fas1 and fas2 mutants), which are known to have a reduced number of rDNA copies, and plant lines with restored CAF-1 function (segregated from a fas1xfas2 genetic background) showing major rDNA rearrangements. The systematic rDNA loss in CAF-1 mutants leads to the decreased variability of the IGS and to the occurrence of distinct IGS variants. We present for the first time a comprehensive and representative set of complete IGS sequences, obtained by conventional cloning and by Pacific Biosciences sequencing. Our data expands the knowledge of the A. thaliana IGS sequence arrangement and variability, which has not been available in full and in detail until now. This is also the first study combining IGS sequencing data with RFLP analysis of genomic DNA.
Pringle, Märit; Bergsten, Christer; Fernström, Lise-Lotte; Höök, Helena; Johansson, Karl-Erik
2008-10-20
Digital dermatitis in cattle is an emerging infectious disease. Ulcerative lesions are typically located on the plantar skin between the heel bulbs and adjacent to the coronet. Spirochetes of the genus Treponema are found in high numbers in the lesions and are likely to be involved in the pathogenesis. The aim of this study was to obtain pure cultures of spirochetes from cattle with digital dermatitis and to describe them further. Tissue samples and swabs from active digital dermatitis lesions were used for culturing. Pure isolates were subjected to, molecular typing through 16S rRNA gene sequencing, pulsed-field gel electrophoresis (PFGE), random amplified polymorphic DNA (RAPD) and an intergenic spacer PCR developed for Treponema spp. as well as API-ZYM and antimicrobial susceptibility tests. The antimicrobial agents used were tiamulin, valnemulin, tylosin, aivlosin, lincomycin and doxycycline. Seven spirochete isolates from five herds were obtained. Both 16S rRNA gene sequences, which were identical except for three polymorphic nucleotide positions, and the intergenic spacer PCR indicated that all isolates were of one yet unnamed species, most closely related to Treponema phagedenis. The enzymatic profile and antimicrobial susceptibility pattern were also similar for all isolates. However it was possible to separate the isolates through their PFGE and RAPD banding pattern. This is the first report on isolation of a Treponema sp. from cattle with digital dermatitis in Scandinavia. The phylotype isolated has previously been cultured from samples from cattle in the USA and the UK and is closely related to T. phagedenis. While very similar, the isolates in this study were possible to differentiate through PFGE and RAPD indicating that these methods are suitable for subtyping of this phylotype. No antimicrobial resistance could be detected among the tested isolates.
Yang, Huirong; Zhang, Jia-En; Guo, Jing; Deng, Zhixin; Luo, Hao; Luo, Mingzhu; Zhao, Benliang
2016-05-01
We present the complete mitochondrial genome of the Achatina fulica in this study. The results show that the mitochondrial genome is 15,057 bp in length, which is comprised of 13 protein-coding genes, 2 rRNA genes, 21 tRNA genes. The nucleotide compositions of the light strand are 35.47% of A, 27.97% of T 19.46% of C, and 17.10% of G. Except the ND3, 7 tRNA, ATP6, ATP8, COX3 and 12S-rRNA on the light strand, the rest are encoded on the heavy strand. Five types of inferred initiation codons are ATA (ND1, ND5), GTG (ND6), ATG (COX3, COX2), ATT (ND4) and TTG (COX1, ND2, ND3, ND4L, ATP6, ATP8, Cytb), and 3 types of inferred termination codons are T (COX3, ND2), TAA (ND1, ND4L, ND5, ND6, ATP6), and TAG (ND3, ND4, COX1, COX2, Cytb, ATP8). There are 24 intergenic spacers and 6 gene overlaps. The tandem repeat sequence (total 52 bp) of (AATAATT)n is observed in 16S-rRNA. Gene arrangement and distribution are inconsistent with the typical vertebrates.
Mycoplasmas hyorhinis in different regions of cuba. diagnosis
Lobo, Evelyn; Poveda, Carlos; Gupta, Rakesh; Suarez, Alejandro; Hernández, Yenney; Ramírez, Ana; Poveda, José B.
2011-01-01
M. hyorhinis is considered one of the etiological agents of arthritis in sucking pigs, but recently as seen, some strains can produce pneumonia that could not be distinguished from the mycoplasmosis caused by M. hyopneumoniae. The study was conducted to research the presence of Mycoplasma hyorhinis (M. hyorhinis ) in different regions of the country from exudates of pig lungs with typical EP lesions. Exudates from 280 pig lungs with typical EP lesions were studied using molecular techniques such as PCR, real time PCR and amplification of the 16S-23S rRNA. It was detected that the 66% of the samples studied resulted positive to M. hyorhinis, and the presence of this species was detected in all the provinces. Amplification and studies on the intergenic region 16S-23S of M. hyorhinis rRNA demonstrated the existing variability among strains of a same species. This study is the first report on M. hyorhinis detection in Cuba. PMID:24031686
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-11-11
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF.
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-01-01
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF. Images PMID:7501455
Bustamante, Carlos; Ovenden, Jennifer R
2016-01-01
The silver gemfish Rexea solandri is an important economic resource but Vulnerable to overfishing in Australian waters. The complete mitochondrial genome sequence is described from 1.6 million reads obtained via next generation sequencing. The total length of the mitogenome is 16,350 bp comprising 2 rRNA, 13 protein-coding genes, 22 tRNA and 2 non-coding regions. The mitogenome sequence was validated against sequences of PCR fragments and BLAST queries of Genbank. Gene order was equivalent to that found in marine fishes.
Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R
2006-01-01
As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639
Foox, Jonathan; Brugler, Mercer; Siddall, Mark Edward; Rodríguez, Estefanía
2016-07-01
Six complete and three partial actiniarian mitochondrial genomes were amplified in two semi-circles using long-range PCR and pyrosequenced in a single run on a 454 GS Junior, doubling the number of complete mitogenomes available within the order. Typical metazoan mtDNA features included circularity, 13 protein-coding genes, 2 ribosomal RNA genes, and length ranging from 17,498 to 19,727 bp. Several typical anthozoan mitochondrial genome features were also observed including the presence of only two transfer RNA genes, elevated A + T richness ranging from 54.9 to 62.4%, large intergenic regions, and group 1 introns interrupting NADH dehydrogenase subunit 5 and cytochrome c oxidase subunit I, the latter of which possesses a homing endonuclease gene. Within the sea anemone Alicia sansibarensis, we report the first mitochondrial gene order rearrangement within the Actiniaria, as well as putative novel non-canonical protein-coding genes. Phylogenetic analyses of all 13 protein-coding and 2 ribosomal genes largely corroborated current hypotheses of sea anemone interrelatedness, with a few lower-level differences.
Identification of Small RNAs in Desulfovibrio vulgaris Hildenborough
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burns, Andrew; Joachimiak, Marcin; Deutschbauer, Adam
2010-05-17
Desulfovibrio vulgaris is an anaerobic sulfate-reducing bacterium capable of facilitating the removal of toxic metals such as uranium from contaminated sites via reduction. As such, it is essential to understand the intricate regulatory cascades involved in how D. vulgaris and its relatives respond to stressors in such sites. One approach is the identification and analysis of small non-coding RNAs (sRNAs); molecules ranging in size from 20-200 nucleotides that predominantly affect gene regulation by binding to complementary mRNA in an anti-sense fashion and therefore provide an immediate regulatory response. To identify sRNAs in D. vulgaris, a bacterium that does not possessmore » an annotated hfq gene, RNA was pooled from stationary and exponential phases, nitrate exposure, and biofilm conditions. The subsequent RNA was size fractionated, modified, and converted to cDNA for high throughput transcriptomic deep sequencing. A computational approach to identify sRNAs via the alignment of seven separate Desulfovibrio genomes was also performed. From the deep sequencing analysis, 2,296 reads between 20 and 250 nt were identified with expression above genome background. Analysis of those reads limited the number of candidates to ~;;87 intergenic, while ~;;140 appeared to be antisense to annotated open reading frames (ORFs). Further BLAST analysis of the intergenic candidates and other Desulfovibrio genomes indicated that eight candidates were likely portions of ORFs not previously annotated in the D. vulgaris genome. Comparison of the intergenic and antisense data sets to the bioinformatical predicted candidates, resulted in ~;;54 common candidates. Current approaches using Northern analysis and qRT-PCR are being used toverify expression of the candidates and to further develop the role these sRNAs play in D. vulgaris regulation.« less
Comparison of Ultra-Conserved Elements in Drosophilids and Vertebrates
Makunin, Igor V.; Shloma, Viktor V.; Stephen, Stuart J.; Pheasant, Michael; Belyakin, Stepan N.
2013-01-01
Metazoan genomes contain many ultra-conserved elements (UCEs), long sequences identical between distant species. In this study we identified UCEs in drosophilid and vertebrate species with a similar level of phylogenetic divergence measured at protein-coding regions, and demonstrated that both the length and number of UCEs are larger in vertebrates. The proportion of non-exonic UCEs declines in distant drosophilids whilst an opposite trend was observed in vertebrates. We generated a set of 2,126 Sophophora UCEs by merging elements identified in several drosophila species and compared these to the eutherian UCEs identified in placental mammals. In contrast to vertebrates, the Sophophora UCEs are depleted around transcription start sites. Analysis of 52,954 P-element, piggyBac and Minos insertions in the D. melanogaster genome revealed depletion of the P-element and piggyBac insertions in and around the Sophophora UCEs. We examined eleven fly strains with transposon insertions into the intergenic UCEs and identified associated phenotypes in five strains. Four insertions behave as recessive lethals, and in one case we observed a suppression of the marker gene within the transgene, presumably by silenced chromatin around the integration site. To confirm the lethality is caused by integration of transposons we performed a phenotype rescue experiment for two stocks and demonstrated that the excision of the transposons from the intergenic UCEs restores viability. Sequencing of DNA after the transposon excision in one fly strain with the restored viability revealed a 47 bp insertion at the original transposon integration site suggesting that the nature of the mutation is important for the appearance of the phenotype. Our results suggest that the UCEs in flies and vertebrates have both common and distinct features, and demonstrate that a significant proportion of intergenic drosophila UCEs are sensitive to disruption. PMID:24349264
Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn
2015-04-22
In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.
Lakshmikumaran, M; Negi, M S
1994-03-01
Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
Adenovirus and mycoplasma infection in an ornate box turtle (Terrapene ornata ornata) in Hungary.
Farkas, Szilvia L; Gál, János
2009-07-02
A female, adult ornate box turtle (Terrapene ornata ornata) with fatty liver was submitted for virologic examination in Hungary. Signs of an adenovirus infection including degeneration of the liver cells, enlarged nuclei and intranuclear inclusion bodies were detected by light microscopic examination. The presence of an adenovirus was later confirmed by obtaining partial sequence data from the adenoviral DNA-dependent DNA-polymerase. Phylogenetic analyses revealed that this novel chelonian adenovirus was distinct from previously described reptilian adenoviruses, not belonging to any of the recognized genera of the family Adenoviridae. As a part of the routine diagnostic procedure for chelonians the detection of herpes-, rana- and iridoviruses together with Mycoplasma spp. was attempted. Amplicons were generated by a general mycoplasma polymerase chain reaction (PCR) targeting the 16S/23S ribosomal RNA (rRNA) intergenic spacer region, as well as, a specific Mycoplasma agassizii PCR targeting the 16S rRNA gene. Based on the analyses of partial sequences of the 16S rRNA gene, the Mycoplasma sp. of the ornate box turtle seemed to be identical with the recently described eastern box turtle (Terrapene carolina carolina) Mycoplasma sp. This is the first report of a novel chelonian adenovirus and a mycoplasma infection in an ornate box turtle (T. ornata ornata) in Europe.
Raele, D A; Galante, D; Pugliese, N; La Salandra, G; Lomuto, M; Cafiero, M Assunta
2018-05-01
The poultry red mite (PRM), Dermanyssus gallinae, is a nonburrowing haematophagous nest-dwelling ectoparasite of birds; occasionally it bites humans, inducing dermatitis. The possibility that this parasite may also be involved in transmission of pathogens is an additional concern. We investigated the presence of zoonotic agents in PRMs from bird nests and pets, and related them to urban outbreaks of dermatitis. A total of 98 PRMs from 12 outbreaks of PRM dermatitis that occurred in Italian cities from 2001 to 2017 were molecularly investigated for detection of Coxiella spp. (16S rRNA), Chlamydophila spp. (16S rRNA), Rickettsia spp. (17 kDa protein - encoding gene), Borrelia burgdorferi sensu lato ( groEL gene) and Bartonella spp. (16S-23S rRNA intergenic spacer). Of the 12 tested mite pools, one was positive for Coxiella burnetii (100% identity) and two for B. burgdorferi sensu lato (99% with Borrelia afzelii ). For the first time, the presence of B. burgdorferi sensu lato and C. burnetii is reported in PRMs from urban areas. Birds, mainly pigeons, can harbour both pathogens. Therefore, birds and their nest-dwelling PRMs may play a role in the epidemiology of these infections.
Abdollahi-Arpanahi, Rostam; Morota, Gota; Valente, Bruno D; Kranis, Andreas; Rosa, Guilherme J M; Gianola, Daniel
2016-02-03
Genome-wide association studies in humans have found enrichment of trait-associated single nucleotide polymorphisms (SNPs) in coding regions of the genome and depletion of these in intergenic regions. However, a recent release of the ENCyclopedia of DNA elements showed that ~80 % of the human genome has a biochemical function. Similar studies on the chicken genome are lacking, thus assessing the relative contribution of its genic and non-genic regions to variation is relevant for biological studies and genetic improvement of chicken populations. A dataset including 1351 birds that were genotyped with the 600K Affymetrix platform was used. We partitioned SNPs according to genome annotation data into six classes to characterize the relative contribution of genic and non-genic regions to genetic variation as well as their predictive power using all available quality-filtered SNPs. Target traits were body weight, ultrasound measurement of breast muscle and hen house egg production in broiler chickens. Six genomic regions were considered: intergenic regions, introns, missense, synonymous, 5' and 3' untranslated regions, and regions that are located 5 kb upstream and downstream of coding genes. Genomic relationship matrices were constructed for each genomic region and fitted in the models, separately or simultaneously. Kernel-based ridge regression was used to estimate variance components and assess predictive ability. Contribution of each class of genomic regions to dominance variance was also considered. Variance component estimates indicated that all genomic regions contributed to marked additive genetic variation and that the class of synonymous regions tended to have the greatest contribution. The marked dominance genetic variation explained by each class of genomic regions was similar and negligible (~0.05). In terms of prediction mean-square error, the whole-genome approach showed the best predictive ability. All genic and non-genic regions contributed to phenotypic variation for the three traits studied. Overall, the contribution of additive genetic variance to the total genetic variance was much greater than that of dominance variance. Our results show that all genomic regions are important for the prediction of the targeted traits, and the whole-genome approach was reaffirmed as the best tool for genome-enabled prediction of quantitative traits.
Blaiotta, Giuseppe; Pepe, Olimpia; Mauriello, Gianluigi; Villani, Francesco; Andolfi, Rosamaria; Moschetti, Giancarlo
2002-12-01
The intergenic spacer region (ISR) between the 16S and 23S rRNA genes was tested as a tool for differentiating lactococci commonly isolated in a dairy environment. 17 reference strains, representing 11 different species belonging to the genera Lactococcus, Streptococcus, Lactobacillus, Enterococcus and Leuconostoc, and 127 wild streptococcal strains isolated during the whole fermentation process of "Fior di Latte" cheese were analyzed. After 16S-23S rDNA ISR amplification by PCR, species or genus-specific patterns were obtained for most of the reference strains tested. Moreover, results obtained after nucleotide analysis show that the 16S-23S rDNA ISR sequences vary greatly, in size and sequence, among Lactococcus garvieae, Lactococcus raffinolactis, Lactococcus lactis as well as other streptococci from dairy environments. Because of the high degree of inter-specific polymorphism observed, 16S-23S rDNA ISR can be considered a good potential target for selecting species-specific molecular assays, such as PCR primer or probes, for a rapid and extremely reliable differentiation of dairy lactococcal isolates.
Nielsen, O F; Carin, M; Westergaard, O
1984-01-01
In isolated nucleoli from Tetrahymena thermophila, low concentrations of the intercalating agent proflavine inhibit both transcription termination and splicing of the rRNA precursor. Proflavine also exerts an in vivo effect on the process of transcription termination under conditions, where the growth rate is only slightly reduced. Thus, approximately 40% of the rRNA precursor molecules, accumulated in nucleoli during 60 min of treatment with the drug, are longer than the normal 35S rRNA precursor. R-Loop mapping of these longer precursor molecules isolated after 30 and 60 min of incubation demonstrates that the RNA polymerases have a 50 fold lower elongation rate in the spacer region than in the coding region. Proflavine in the given concentration is found to have no significant effect on the splicing of properly terminated precursor molecules. In contrast, none of the longer non-terminated molecules are found to be spliced. These results indicate that proflavine primarily affects the process of transcription termination and that the splicing event is inhibited due to the improper termination of the precursor molecule. Images PMID:6694912
Boeri, Eduardo J.; Wanke, María M.; Madariaga, María J.; Teijeiro, María L.; Elena, Sebastian A.; Trangoni, Marcos D.
2018-01-01
Aim: This study aimed to compare the sensitivity (S), specificity (Sp), and positive likelihood ratios (LR+) of four polymerase chain reaction (PCR) assays for the detection of Brucella spp. in dog’s clinical samples. Materials and Methods: A total of 595 samples of whole blood, urine, and genital fluids were evaluated between October 2014 and November 2016. To compare PCR assays, the gold standard was defined using a combination of different serological and microbiological test. Bacterial isolation from urine and blood cultures was carried out. Serological methods such as rapid slide agglutination test, indirect enzyme-linked immunosorbent assay, agar gel immunodiffusion test, and buffered plate antigen test were performed. Four genes were evaluated: (i) The gene coding for the BCSP31 protein, (ii) the ribosomal gene coding for the 16S-23S intergenic spacer region, (iii) the gene coding for porins omp2a/omp2b, and (iv) the gene coding for the insertion sequence IS711. Results: The results obtained were as follows: (1) For the primers that amplify the gene coding for the BCSP31 protein: S: 45.64% (confidence interval [CI] 39.81-51.46), Sp: 95.62% (CI 93.13-98.12), and LR+: 10.43 (CI 6.04-18); (2) for the primers that amplify the ribosomal gene of the 16S-23S rDNA intergenic spacer region: S: 69.80% (CI 64.42-75.18), Sp: 95.62 % (CI 93.13-98.12), and LR+: 11.52 (CI 7.31-18.13); (3) for the primers that amplify the omp2a and omp2b genes: S: 39.26% (CI 33.55-44.97), Sp: 97.31% (CI 95.30-99.32), and LR+ 14.58 (CI 7.25-29.29); and (4) for the primers that amplify the insertion sequence IS711: S: 22.82% (CI 17.89 - 27.75), Sp: 99.66% (CI 98.84-100), and LR+ 67.77 (CI 9.47-484.89). Conclusion: We concluded that the gene coding for the 16S-23S rDNA intergenic spacer region was the one that best detected Brucella spp. in canine clinical samples. PMID:29657404
Explaining the disease phenotype of intergenic SNP through predicted long range regulation.
Chen, Jingqi; Tian, Weidong
2016-10-14
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Gurfield, Nikos; Grewal, Saran; Cua, Lynnie S; Torres, Pedro J; Kelley, Scott T
2017-01-01
The Pacific coast tick, Dermacentor occidentalis Marx, is found throughout California and can harbor agents that cause human diseases such as anaplasmosis, ehrlichiosis, tularemia, Rocky Mountain spotted fever and rickettsiosis 364D. Previous studies have demonstrated that nonpathogenic endosymbiotic bacteria can interfere with Rickettsia co-infections in other tick species. We hypothesized that within D. occidentalis ticks, interference may exist between different nonpathogenic endosymbiotic or nonendosymbiotic bacteria and Spotted Fever group Rickettsia (SFGR). Using PCR amplification and sequencing of the romp A gene and intergenic region we identified a cohort of SFGR-infected and non-infected D. occidentalis ticks collected from San Diego County. We then amplified a partial segment of the 16S rRNA gene and used next-generation sequencing to elucidate the microbiomes and levels of co-infection in the ticks. The SFGR R. philipii str. 364D and R. rhipicephali were detected in 2.3% and 8.2% of the ticks, respectively, via romp A sequencing. Interestingly, next generation sequencing revealed an inverse relationship between the number of Francisella- like endosymbiont (FLE) 16S rRNA sequences and Rickettsia 16S rRNA sequences within individual ticks that is consistent with partial interference between FLE and SFGR infecting ticks. After excluding the Rickettsia and FLE endosymbionts from the analysis, there was a small but significant difference in microbial community diversity and a pattern of geographic isolation by distance between collection locales. In addition, male ticks had a greater diversity of bacteria than female ticks and ticks that weren't infected with SFGR had similar microbiomes to canine skin microbiomes. Although experimental studies are required for confirmation, our findings are consistent with the hypothesis that FLEs and, to a lesser extent, other bacteria, interfere with the ability of D. occidentalis to be infected with certain SFGR. The results also raise interesting possibilities about the effects of putative vertebrate hosts on the tick microbiome.
Circular non-coding RNA ANRIL modulates ribosomal RNA maturation and atherosclerosis in humans
Holdt, Lesca M.; Stahringer, Anika; Sass, Kristina; Pichler, Garwin; Kulak, Nils A.; Wilfert, Wolfgang; Kohlmaier, Alexander; Herbst, Andreas; Northoff, Bernd H.; Nicolaou, Alexandros; Gäbel, Gabor; Beutner, Frank; Scholz, Markus; Thiery, Joachim; Musunuru, Kiran; Krohn, Knut; Mann, Matthias; Teupser, Daniel
2016-01-01
Circular RNAs (circRNAs) are broadly expressed in eukaryotic cells, but their molecular mechanism in human disease remains obscure. Here we show that circular antisense non-coding RNA in the INK4 locus (circANRIL), which is transcribed at a locus of atherosclerotic cardiovascular disease on chromosome 9p21, confers atheroprotection by controlling ribosomal RNA (rRNA) maturation and modulating pathways of atherogenesis. CircANRIL binds to pescadillo homologue 1 (PES1), an essential 60S-preribosomal assembly factor, thereby impairing exonuclease-mediated pre-rRNA processing and ribosome biogenesis in vascular smooth muscle cells and macrophages. As a consequence, circANRIL induces nucleolar stress and p53 activation, resulting in the induction of apoptosis and inhibition of proliferation, which are key cell functions in atherosclerosis. Collectively, these findings identify circANRIL as a prototype of a circRNA regulating ribosome biogenesis and conferring atheroprotection, thereby showing that circularization of long non-coding RNAs may alter RNA function and protect from human disease. PMID:27539542
Wang, Jiajia; Li, Hu; Dai, Renhuai
2017-12-01
Here, we describe the first complete mitochondrial genome (mitogenome) sequence of the leafhopper Taharana fasciana (Coelidiinae). The mitogenome sequence contains 15,161 bp with an A + T content of 77.9%. It includes 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding (A + T-rich) region; in addition, a repeat region is also present (GenBank accession no. KY886913). These genes/regions are in the same order as in the inferred insect ancestral mitogenome. All protein-coding genes have ATN as the start codon, and TAA or single T as the stop codons, except the gene ND3, which ends with TAG. Furthermore, we predicted the secondary structures of the rRNAs in T. fasciana. Six domains (domain III is absent in arthropods) and 41 helices were predicted for 16S rRNA, and 12S rRNA comprised three structural domains and 24 helices. Phylogenetic tree analysis confirmed that T. fasciana and other members of the Cicadellidae are clustered into a clade, and it identified the relationships among the subfamilies Deltocephalinae, Coelidiinae, Idiocerinae, Cicadellinae, and Typhlocybinae.
Novel variants of the 5S rRNA genes in Eruca sativa.
Singh, K; Bhatia, S; Lakshmikumaran, M
1994-02-01
The 5S ribosomal RNA (rRNA) genes of Eruca sativa were cloned and characterized. They are organized into clusters of tandemly repeated units. Each repeat unit consists of a 119-bp coding region followed by a noncoding spacer region that separates it from the coding region of the next repeat unit. Our study reports novel gene variants of the 5S rRNA genes in plants. Two families of the 5S rDNA, the 0.5-kb size family and the 1-kb size family, coexist in the E. sativa genome. The 0.5-kb size family consists of the 5S rRNA genes (S4) that have coding regions similar to those of other reported plant 5S rDNA sequences, whereas the 1-kb size family consists of the 5S rRNA gene variants (S1) that exist as 1-kb BamHI tandem repeats. S1 is made up of two variant units (V1 and V2) of 5S rDNA where the BamHI site between the two units is mutated. Sequence heterogeneity among S4, V1, and V2 units exists throughout the sequence and is not limited to the noncoding spacer region only. The coding regions of V1 and V2 show approximately 20% dissimilarity to the coding regions of S4 and other reported plant 5S rDNA sequences. Such a large variation in the coding regions of the 5S rDNA units within the same plant species has been observed for the first time. Restriction site variation is observed between the two size classes of 5S rDNA in E. sativa.(ABSTRACT TRUNCATED AT 250 WORDS)
Coupled transcription and processing of mouse ribosomal RNA in a cell-free system.
Mishima, Y; Mitsuma, T; Ogata, K
1985-01-01
An in vitro processing system of mouse rRNA was achieved using an RNA polymerase I-specific transcription system, (S100) and recombinant plasmids consisting of mouse rRNA gene (rDNA) segments containing the transcription initiation and 5'-terminal region of 18S (or 41S) rRNA. Pulse-chase experiments showed that a specific processing occurred with transcripts of the plasmid DNAs when the direction of transcription was the correct orientation relative to the 18S rRNA coding sequence, but not with transcripts of the DNA templates in which this coding sequence was in the opposite orientation. From the S1 nuclease protection analyses, we concluded that there are several steps of endonucleolytic cleavage including one 105 nucleotides upstream from the 5' end of 18S rRNA. Intermediates cleaved at this site were identified in in vivo processing of rRNA. This result indicates that endonucleolytic cleavage takes place 105 nucleotides upstream from the 5' terminus of 18S rRNA prior to the formation of mature 18S rRNA. Trimming or cleavage of the 105 nucleotides may be involved in the formation of the 5' terminus of mature 18S rRNA. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 6. PMID:3004977
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction.
Thiebaut, Flávia; Rojas, Cristian A; Grativol, Clícia; Calixto, Edmundo P da R; Motta, Mariana R; Ballesteros, Helkin G F; Peixoto, Barbara; de Lima, Berenice N S; Vieira, Lucas M; Walter, Maria Emilia; de Armas, Elvismary M; Entenza, Júlio O P; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S; Ferreira, Paulo C G
2017-12-20
Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae . Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae , while the siRNAs were repressed in the presence of A. avenae . Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408-a copper-microRNA-was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction
Grativol, Clícia; Motta, Mariana R.; Ballesteros, Helkin G. F.; Peixoto, Barbara; Vieira, Lucas M.; Walter, Maria Emilia; de Armas, Elvismary M.; Entenza, Júlio O. P.; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S.
2017-01-01
Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly. PMID:29657296
Landscape of somatic mutations in 560 breast cancer whole-genome sequences
Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; ...
2016-05-02
Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Landscape of somatic mutations in 560 breast cancer whole-genome sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nik-Zainal, Serena; Davies, Helen; Staaf, Johan
Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Landscape of somatic mutations in 560 breast cancer whole genome sequences
Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; Ramakrishna, Manasa; Glodzik, Dominik; Zou, Xueqing; Martincorena, Inigo; Alexandrov, Ludmil B.; Martin, Sancha; Wedge, David C.; Van Loo, Peter; Ju, Young Seok; Smid, Marcel; Brinkman, Arie B; Morganella, Sandro; Aure, Miriam R.; Lingjærde, Ole Christian; Langerød, Anita; Ringnér, Markus; Ahn, Sung-Min; Boyault, Sandrine; Brock, Jane E.; Broeks, Annegien; Butler, Adam; Desmedt, Christine; Dirix, Luc; Dronov, Serge; Fatima, Aquila; Foekens, John A.; Gerstung, Moritz; Hooijer, Gerrit KJ; Jang, Se Jin; Jones, David R.; Kim, Hyung-Yong; King, Tari A.; Krishnamurthy, Savitri; Lee, Hee Jin; Lee, Jeong-Yeon; Li, Yilong; McLaren, Stuart; Menzies, Andrew; Mustonen, Ville; O’Meara, Sarah; Pauporté, Iris; Pivot, Xavier; Purdie, Colin A.; Raine, Keiran; Ramakrishnan, Kamna; Rodríguez-González, F. Germán; Romieu, Gilles; Sieuwerts, Anieta M.; Simpson, Peter T; Shepherd, Rebecca; Stebbings, Lucy; Stefansson, Olafur A; Teague, Jon; Tommasi, Stefania; Treilleux, Isabelle; Van den Eynden, Gert G.; Vermeulen, Peter; Vincent-Salomon, Anne; Yates, Lucy; Caldas, Carlos; van’t Veer, Laura; Tutt, Andrew; Knappskog, Stian; Tan, Benita Kiat Tee; Jonkers, Jos; Borg, Åke; Ueno, Naoto T; Sotiriou, Christos; Viari, Alain; Futreal, P. Andrew; Campbell, Peter J; Span, Paul N.; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E.; Thompson, Alastair M.; Birney, Ewan; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W.M.; Børresen-Dale, Anne-Lise; Richardson, Andrea L.; Kong, Gu; Thomas, Gilles; Stratton, Michael R.
2016-01-01
We analysed whole genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. 93 protein-coding cancer genes carried likely driver mutations. Some non-coding regions exhibited high mutation frequencies but most have distinctive structural features probably causing elevated mutation rates and do not harbour driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed 12 base substitution and six rearrangement signatures. Three rearrangement signatures, characterised by tandem duplications or deletions, appear associated with defective homologous recombination based DNA repair: one with deficient BRCA1 function; another with deficient BRCA1 or BRCA2 function; the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operative, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer. PMID:27135926
Beauparlant, Marc A; Drouin, Guy
2014-02-01
Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.
The Evolution of Dark Matter in the Mitogenome of Seed Beetles
Sayadi, Ahmed; Immonen, Elina; Tellgren-Roth, Christian
2017-01-01
Abstract Animal mitogenomes are generally thought of as being economic and optimized for rapid replication and transcription. We use long-read sequencing technology to assemble the remarkable mitogenomes of four species of seed beetles. These are the largest circular mitogenomes ever assembled in insects, ranging from 24,496 to 26,613 bp in total length, and are exceptional in that some 40% consists of non-coding DNA. The size expansion is due to two very long intergenic spacers (LIGSs), rich in tandem repeats. The two LIGSs are present in all species but vary greatly in length (114–10,408 bp), show very low sequence similarity, divergent tandem repeat motifs, a very high AT content and concerted length evolution. The LIGSs have been retained for at least some 45 my but must have undergone repeated reductions and expansions, despite strong purifying selection on protein coding mtDNA genes. The LIGSs are located in two intergenic sites where a few recent studies of insects have also reported shorter LIGSs (>200 bp). These sites may represent spaces that tolerate neutral repeat array expansions or, alternatively, the LIGSs may function to allow a more economic translational machinery. Mitochondrial respiration in adult seed beetles is based almost exclusively on fatty acids, which reduces the need for building complex I of the oxidative phosphorylation pathway (NADH dehydrogenase). One possibility is thus that the LIGSs may allow depressed transcription of NAD genes. RNA sequencing showed that LIGSs are partly transcribed and transcriptional profiling suggested that all seven mtDNA NAD genes indeed show low levels of transcription and co-regulation of transcription across sexes and tissues. PMID:29048527
Gong, Chenguang; Li, Zhizhong; Ramanujan, Krishnan; Clay, Ieuan; Zhang, Yunyu; Lemire-Brachat, Sophie; Glass, David J
2015-07-27
Increasing evidence suggests that long non-coding RNAs (LncRNAs) represent a new class of regulators of stem cells. However, the roles of LncRNAs in stem cell maintenance and myogenesis remain largely unexamined. For this study, hundreds of intergenic LncRNAs were identified that are expressed in myoblasts and regulated during differentiation. One of these LncRNAs, termed LncMyoD, is encoded next to the Myod gene and is directly activated by MyoD during myoblast differentiation. Knockdown of LncMyoD strongly inhibits terminal muscle differentiation, largely due to a failure to exit the cell cycle. LncMyoD directly binds to IGF2-mRNA-binding protein 2 (IMP2) and negatively regulates IMP2-mediated translation of proliferation genes such as N-Ras and c-Myc. While the RNA sequence of LncMyoD is not well conserved between human and mouse, its locus, gene structure, and function are preserved. The MyoD-LncMyoD-IMP2 pathway elucidates a mechanism as to how MyoD blocks proliferation to create a permissive state for differentiation. Copyright © 2015 Elsevier Inc. All rights reserved.
Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn
2015-01-01
In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393
Perspectives of Long Non-Coding RNAs in Cancer Diagnostics
Reis, Eduardo M.; Verjovski-Almeida, Sergio
2012-01-01
Long non-coding RNAs (lncRNAs) transcribed from intergenic and intronic regions of the human genome constitute a broad class of cellular transcripts that are under intensive investigation. While only a handful of lncRNAs have been characterized, their involvement in fundamental cellular processes that control gene expression highlights a central role in cell homeostasis. Not surprisingly, aberrant expression of regulatory lncRNAs has been increasingly documented in different types of cancer, where they can mediate both oncogenic or tumor suppressor effects. Interaction with chromatin remodeling complexes that promote silencing of specific genes or modulation of splicing factor proteins seem to be two general modes of lncRNA regulation, but it is conceivable that additional mechanisms of action are yet to be unveiled. LncRNAs show greater tissue specificity compared to protein-coding mRNAs making them attractive in the search of novel diagnostics/prognostics cancer biomarkers in body fluid samples. In fact, lncRNA prostate cancer antigen 3 can be detected in urine samples and has been shown to improve diagnosis of prostate cancer. We suggest that an unbiased screening of the presence of RNAs in easily accessible body fluids such as serum and urine might reveal novel circulating lncRNAs as potential biomarkers in many types of cancer. Annotation and functional characterization of the lncRNA complement of the cancer transcriptome will conceivably provide new venues for early diagnosis and treatment of the disease. PMID:22408643
Dynamic gene expression response to altered gravity in human T cells.
Thiel, Cora S; Hauschild, Swantje; Huge, Andreas; Tauber, Svantje; Lauber, Beatrice A; Polzer, Jennifer; Paulsen, Katrin; Lier, Hartwin; Engelmann, Frank; Schmitz, Burkhard; Schütte, Andreas; Layer, Liliana E; Ullrich, Oliver
2017-07-12
We investigated the dynamics of immediate and initial gene expression response to different gravitational environments in human Jurkat T lymphocytic cells and compared expression profiles to identify potential gravity-regulated genes and adaptation processes. We used the Affymetrix GeneChip® Human Transcriptome Array 2.0 containing 44,699 protein coding genes and 22,829 non-protein coding genes and performed the experiments during a parabolic flight and a suborbital ballistic rocket mission to cross-validate gravity-regulated gene expression through independent research platforms and different sets of control experiments to exclude other factors than alteration of gravity. We found that gene expression in human T cells rapidly responded to altered gravity in the time frame of 20 s and 5 min. The initial response to microgravity involved mostly regulatory RNAs. We identified three gravity-regulated genes which could be cross-validated in both completely independent experiment missions: ATP6V1A/D, a vacuolar H + -ATPase (V-ATPase) responsible for acidification during bone resorption, IGHD3-3/IGHD3-10, diversity genes of the immunoglobulin heavy-chain locus participating in V(D)J recombination, and LINC00837, a long intergenic non-protein coding RNA. Due to the extensive and rapid alteration of gene expression associated with regulatory RNAs, we conclude that human cells are equipped with a robust and efficient adaptation potential when challenged with altered gravitational environments.
Gaitán-Espitia, Juan Diego; Nespolo, Roberto F.; Opazo, Juan C.
2013-01-01
The complete sequences of three mitochondrial genomes from the land snail Cornu aspersum were determined. The mitogenome has a length of 14050 bp, and it encodes 13 protein-coding genes, 22 transfer RNA genes and two ribosomal RNA genes. It also includes nine small intergene spacers, and a large AT-rich intergenic spacer. The intra-specific divergence analysis revealed that COX1 has the lower genetic differentiation, while the most divergent genes were NADH1, NADH3 and NADH4. With the exception of Euhadra herklotsi, the structural comparisons showed the same gene order within the family Helicidae, and nearly identical gene organization to that found in order Pulmonata. Phylogenetic reconstruction recovered Basommatophora as polyphyletic group, whereas Eupulmonata and Pulmonata as paraphyletic groups. Bayesian and Maximum Likelihood analyses showed that C. aspersum is a close relative of Cepaea nemoralis, and with the other Helicidae species form a sister group of Albinaria caerulea, supporting the monophyly of the Stylommatophora clade. PMID:23826260
Phytoplasma-specific PCR primers based on sequences of the 16S-23S rRNA spacer region.
Smart, C D; Schneider, B; Blomquist, C L; Guerra, L J; Harrison, N A; Ahrens, U; Lorenz, K H; Seemüller, E; Kirkpatrick, B C
1996-01-01
In order to develop a diagnostic tool to identify phytoplasmas and classify them according to their phylogenetic group, we took advantage of the sequence diversity of the 16S-23S intergenic spacer regions (SRs) of phytoplasmas. Ten PCR primers were developed from the SR sequences and were shown to amplify in a group-specific fashion. For some groups of phytoplasmas, such as elm yellows, ash yellows, and pear decline, the SR primer was paired with a specific primer from within the 16S rRNA gene. Each of these primer pairs was specific for a specific phytoplasma group, and they did not produce PCR products of the correct size from any other phytoplasma group. One primer was designed to anneal within the conserved tRNA(Ile) and, when paired with a universal primer, amplified all phytoplasmas tested. None of the primers produced PCR amplification products of the correct size from healthy plant DNA. These primers can serve as effective tools for identifying particular phytoplasmas in field samples. PMID:8702291
Rapid Detection of the Chlamydiaceae and Other Families in the Order Chlamydiales: Three PCR Tests
Everett, Karin D. E.; Hornung, Linda J.; Andersen, Arthur A.
1999-01-01
Few identification methods will rapidly or specifically detect all bacteria in the order Chlamydiales, family Chlamydiaceae. In this study, three PCR tests based on sequence data from over 48 chlamydial strains were developed for identification of these bacteria. Two tests exclusively recognized the Chlamydiaceae: a multiplex test targeting the ompA gene and the rRNA intergenic spacer and a TaqMan test targeting the 23S ribosomal DNA. The multiplex test was able to detect as few as 200 inclusion-forming units (IFU), while the TaqMan test could detect 2 IFU. The amplicons produced in these tests ranged from 132 to 320 bp in length. The third test, targeting the 23S rRNA gene, produced a 600-bp amplicon from strains belonging to several families in the order Chlamydiales. Direct sequence analysis of this amplicon has facilitated the identification of new chlamydial strains. These three tests permit ready identification of chlamydiae for diagnostic and epidemiologic study. The specificity of these tests indicates that they might also be used to identify chlamydiae without culture or isolation. PMID:9986815
Thresher: an improved algorithm for peak height thresholding of microbial community profiles.
Starke, Verena; Steele, Andrew
2014-11-15
This article presents Thresher, an improved technique for finding peak height thresholds for automated rRNA intergenic spacer analysis (ARISA) profiles. We argue that thresholds must be sample dependent, taking community richness into account. In most previous fragment analyses, a common threshold is applied to all samples simultaneously, ignoring richness variations among samples and thereby compromising cross-sample comparison. Our technique solves this problem, and at the same time provides a robust method for outlier rejection, selecting for removal any replicate pairs that are not valid replicates. Thresholds are calculated individually for each replicate in a pair, and separately for each sample. The thresholds are selected to be the ones that minimize the dissimilarity between the replicates after thresholding. If a choice of threshold results in the two replicates in a pair failing a quantitative test of similarity, either that threshold or that sample must be rejected. We compare thresholded ARISA results with sequencing results, and demonstrate that the Thresher algorithm outperforms conventional thresholding techniques. The software is implemented in R, and the code is available at http://verenastarke.wordpress.com or by contacting the author. vstarke@ciw.edu or http://verenastarke.wordpress.com Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Mapping of ribosomal 23S ribosomal RNA modifications in Clostridium sporogenes.
Kirpekar, Finn; Hansen, Lykke H; Mundus, Julie; Tryggedsson, Stine; Teixeira Dos Santos, Patrícia; Ntokou, Eleni; Vester, Birte
2018-06-27
All organisms contain RNA modifications in their ribosomal RNA (rRNA), but the importance, positions and exact function of these are still not fully elucidated. Various functions such as stabilising structures, controlling ribosome assembly and facilitating interactions have been suggested and in some cases substantiated. Bacterial rRNA contains much fewer modifications than eukaryotic rRNA. The rRNA modification patterns in bacteria differ from each other, but too few organisms have been mapped to draw general conclusions. This study maps 23S ribosomal RNA modifications in Clostridium sporogenes that can be characterised as a non-toxin producing Clostridium botulinum. Clostridia are able to sporulate and thereby survive harsh conditions, and are in general considered to be resilient to antibiotics. Selected regions of the 23S rRNA were investigated by mass spectrometry and by primer extension analysis to pinpoint modified sites and the nature of the modifications. Apparently, C. sporogenes 23S rRNA contains few modifications compared to other investigated bacteria. No modifications were identified in domain II and III of 23S rRNA. Three modifications were identified in domain IV, all of which have also been found in other organisms. Two unusual modifications were identified in domain V, methylated dihydrouridine at position U2449 and dihydrouridine at position U2500 (Escherichia coli numbering), in addition to four previously known modified positions. The enzymes responsible for the modifications were searched for in the C. sporogenes genome using BLAST with characterised enzymes as query. The search identified genes potentially coding for RNA modifying enzymes responsible for most of the found modifications.
Gawor, Jan; Grzesiak, Jakub; Sasin-Kurowska, Joanna; Borsuk, Piotr; Gromadka, Robert; Górniak, Dorota; Świątecki, Aleksander; Aleksandrzak-Piekarczyk, Tamara; Zdanowski, Marek K
2016-07-01
Polaromonas is one of the most abundant genera found on glacier surfaces, yet its ecology remains poorly described. Investigations made to date point towards a uniform distribution of Polaromonas phylotypes across the globe. We compared 43 Polaromonas isolates obtained from surfaces of Arctic and Antarctic glaciers to address this issue. 16S rRNA gene sequences, intergenic transcribed spacers (ITS) and metabolic fingerprinting showed great differences between hemispheres but also between neighboring glaciers. Phylogenetic distance between Arctic and Antarctic isolates indicated separate species. The Arctic group clustered similarly, when constructing dendrograms based on 16S rRNA gene and ITS sequences, as well as metabolic traits. The Antarctic strains, although almost identical considering 16S rRNA genes, diverged into 2 groups based on the ITS sequences and metabolic traits, suggesting recent niche separation. Certain phenotypic traits pointed towards cell adaptation to specific conditions on a particular glacier, like varying pH levels. Collected data suggest, that seeding of glacial surfaces with Polaromonas cells transported by various means, is of greater efficiency on local than global scales. Selection mechanisms present of glacial surfaces reduce the deposited Polaromonas diversity, causing subsequent adaptation to prevailing environmental conditions. Furthermore, interactions with other supraglacial microbiota, like algae cells may drive postselectional niche separation and microevolution within the Polaromonas genus.
2013-01-01
Background Lactobacillus jensenii, L. iners, L. crispatus and L. gasseri are the most frequently occurring lactobacilli in the vagina. However, the native species vary widely according to the studied population. The present study was performed to genetically determine the identity of Lactobacillus strains present in the vaginal discharge of healthy and bacterial vaginosis (BV) intermediate Mexican women. Methods In a prospective study, 31 strains preliminarily identified as Lactobacillus species were isolated from 21 samples collected from 105 non-pregnant Mexican women. The samples were classified into groups according to the Nugent score criteria proposed for detection of BV: normal (N), intermediate (I) and bacterial vaginosis (BV). We examined the isolates using culture-based methods as well as molecular analysis of the V1–V3 regions of the 16S rRNA gene. Enterobacterial repetitive intergenic consensus (ERIC) sequence analysis was performed to reject clones. Results Clinical isolates (25/31) were classified into four groups based on sequencing and analysis of the 16S rRNA gene: L. acidophilus (14/25), L. reuteri (6/25), L. casei (4/25) and L. buchneri (1/25). The remaining six isolates were presumptively identified as Enterococcus species. Within the L. acidophilus group, L. gasseri was the most frequently isolated species, followed by L. jensenii and L. crispatus. L. fermentum, L. rhamnosus and L. brevis were also isolated, and were placed in the L. reuteri, L. casei and L. buchneri groups, respectively. ERIC profile analysis showed intraspecific variability amongst the L. gasseri and L. fermentum species. Conclusions These findings agree with previous studies showing that L. crispatus, L. gasseri and L. jensenii are consistently present in the healthy vaginal ecosystem. Additional species or phylotypes were detected in the vaginal microbiota of the non-pregnant Mexican (Hispanic-mestizo) population, and thus, these results further our understanding of vaginal lactobacilli colonisation and richness in this particular population. PMID:23617246
Pringle, Märit; Bergsten, Christer; Fernström, Lise-Lotte; Höök, Helena; Johansson, Karl-Erik
2008-01-01
Background Digital dermatitis in cattle is an emerging infectious disease. Ulcerative lesions are typically located on the plantar skin between the heel bulbs and adjacent to the coronet. Spirochetes of the genus Treponema are found in high numbers in the lesions and are likely to be involved in the pathogenesis. The aim of this study was to obtain pure cultures of spirochetes from cattle with digital dermatitis and to describe them further. Methods Tissue samples and swabs from active digital dermatitis lesions were used for culturing. Pure isolates were subjected to, molecular typing through 16S rRNA gene sequencing, pulsed-field gel electrophoresis (PFGE), random amplified polymorphic DNA (RAPD) and an intergenic spacer PCR developed for Treponema spp. as well as API-ZYM and antimicrobial susceptibility tests. The antimicrobial agents used were tiamulin, valnemulin, tylosin, aivlosin, lincomycin and doxycycline. Results Seven spirochete isolates from five herds were obtained. Both 16S rRNA gene sequences, which were identical except for three polymorphic nucleotide positions, and the intergenic spacer PCR indicated that all isolates were of one yet unnamed species, most closely related to Treponema phagedenis. The enzymatic profile and antimicrobial susceptibility pattern were also similar for all isolates. However it was possible to separate the isolates through their PFGE and RAPD banding pattern. Conclusion This is the first report on isolation of a Treponema sp. from cattle with digital dermatitis in Scandinavia. The phylotype isolated has previously been cultured from samples from cattle in the USA and the UK and is closely related to T. phagedenis. While very similar, the isolates in this study were possible to differentiate through PFGE and RAPD indicating that these methods are suitable for subtyping of this phylotype. No antimicrobial resistance could be detected among the tested isolates. PMID:18937826
Nedelcu, Aurora M.; Lee, Robert W.; Lemieux, Claude; Gray, Michael W.; Burger, Gertraud
2000-01-01
Two distinct mitochondrial genome types have been described among the green algal lineages investigated to date: a reduced–derived, Chlamydomonas-like type and an ancestral, Prototheca-like type. To determine if this unexpected dichotomy is real or is due to insufficient or biased sampling and to define trends in the evolution of the green algal mitochondrial genome, we sequenced and analyzed the mitochondrial DNA (mtDNA) of Scenedesmus obliquus. This genome is 42,919 bp in size and encodes 42 conserved genes (i.e., large and small subunit rRNA genes, 27 tRNA and 13 respiratory protein-coding genes), four additional free-standing open reading frames with no known homologs, and an intronic reading frame with endonuclease/maturase similarity. No 5S rRNA or ribosomal protein-coding genes have been identified in Scenedesmus mtDNA. The standard protein-coding genes feature a deviant genetic code characterized by the use of UAG (normally a stop codon) to specify leucine, and the unprecedented use of UCA (normally a serine codon) as a signal for termination of translation. The mitochondrial genome of Scenedesmus combines features of both green algal mitochondrial genome types: the presence of a more complex set of protein-coding and tRNA genes is shared with the ancestral type, whereas the lack of 5S rRNA and ribosomal protein-coding genes as well as the presence of fragmented and scrambled rRNA genes are shared with the reduced–derived type of mitochondrial genome organization. Furthermore, the gene content and the fragmentation pattern of the rRNA genes suggest that this genome represents an intermediate stage in the evolutionary process of mitochondrial genome streamlining in green algae. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF204057.] PMID:10854413
Bélanger-Lépine, Frédérique; Leung, Christelle; Glémet, Hélène; Angers, Bernard
2018-01-01
The ribosomal intergenic spacer (IGS), responsible for the rate of transcription of rRNA genes, is associated with the growth and fecundity of individuals. A previous study of IGS length variants in a yellow perch (Perca flavescens) population revealed the presence of two predominant alleles differing by 1 kb due to variation in the number of repeat units. This study aims to assess whether length variation of IGS is the result of selection in natural populations. Length variation of IGS and 11 neutral microsatellite loci were assessed in geographically distant yellow perch populations. Most populations displayed the very same IGS alleles; they did not differ in frequencies among populations and the F ST was not significantly different from zero. In contrast, diversity at microsatellite loci was high and differed among populations (F ST = 0.18). Selection test based on F ST identified IGS as a significant outlier from neutral expectations for population differentiation. Heterozygote excess was also detected in one specific cohort, suggesting temporal variation in the selection regime. While the exact mechanism remains to be specified, together the results of this study support the contention that balancing selection is acting to maintain two distinct IGS alleles in natural fish populations.
Willkomm, Dagmar K.; Minnerup, Jens; Hüttenhofer, Alexander; Hartmann, Roland K.
2005-01-01
By an experimental RNomics approach, we have generated a cDNA library from small RNAs expressed from the genome of the hyperthermophilic bacterium Aquifex aeolicus. The library included RNAs that were antisense to mRNAs and tRNAs as well as RNAs encoded in intergenic regions. Substantial steady-state levels in A.aeolicus cells were confirmed for several of the cloned RNAs by northern blot analysis. The most abundant intergenic RNA of the library was identified as the 6S RNA homolog of A.aeolicus. Although shorter in size (150 nt) than its γ-proteobacterial homologs (∼185 nt), it is predicted to have the most stable structure among known 6S RNAs. As in the γ-proteobacteria, the A.aeolicus 6S RNA gene (ssrS) is located immediately upstream of the ygfA gene encoding a widely conserved 5-formyltetrahydrofolate cyclo-ligase. We identifed novel 6S RNA candidates within the γ-proteobacteria but were unable to identify reasonable 6S RNA candidates in other bacterial branches, utilizing mfold analyses of the region immediately upstream of ygfA combined with 6S RNA blastn searches. By RACE experiments, we mapped the major transcription initiation site of A.aeolicus 6S RNA primary transcripts, located within the pheT gene preceding ygfA, as well as three processing sites. PMID:15814812
2012-01-01
Background Tandemly arranged nuclear ribosomal DNA (rDNA), encoding 18S, 5.8S and 26S ribosomal RNA (rRNA), exhibit concerted evolution, a pattern thought to result from the homogenisation of rDNA arrays. However rDNA homogeneity at the single nucleotide polymorphism (SNP) level has not been detailed in organisms with more than a few hundred copies of the rDNA unit. Here we study rDNA complexity in species with arrays consisting of thousands of units. Methods We examined homogeneity of genic (18S) and non-coding internally transcribed spacer (ITS1) regions of rDNA using Roche 454 and/or Illumina platforms in four angiosperm species, Nicotiana sylvestris, N. tomentosiformis, N. otophora and N. kawakamii. We compared the data with Southern blot hybridisation revealing the structure of intergenic spacer (IGS) sequences and with the number and distribution of rDNA loci. Results and Conclusions In all four species the intragenomic homogeneity of the 18S gene was high; a single ribotype makes up over 90% of the genes. However greater variation was observed in the ITS1 region, particularly in species with two or more rDNA loci, where >55% of rDNA units were a single ribotype, with the second most abundant variant accounted for >18% of units. IGS heterogeneity was high in all species. The increased number of ribotypes in ITS1 compared with 18S sequences may reflect rounds of incomplete homogenisation with strong selection for functional genic regions and relaxed selection on ITS1 variants. The relationship between the number of ITS1 ribotypes and the number of rDNA loci leads us to propose that rDNA evolution and complexity is influenced by locus number and/or amplification of orphaned rDNA units at new chromosomal locations. PMID:23259460
Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius
2018-03-20
Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress-response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts ( trans -NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment.
Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius
2018-01-01
Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress–response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts (trans-NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment. PMID:29558449
Bhattacharya, D; Surek, B; Rüsing, M; Damberger, S; Melkonian, M
1994-01-01
Group I introns are found in organellar genomes, in the genomes of eubacteria and phages, and in nuclear-encoded rRNAs. The origin and distribution of nuclear-encoded rRNA group I introns are not understood. To elucidate their evolutionary relationships, we analyzed diverse nuclear-encoded small-subunit rRNA group I introns including nine sequences from the green-algal order Zygnematales (Charophyceae). Phylogenetic analyses of group I introns and rRNA coding regions suggest that lateral transfers have occurred in the evolutionary history of group I introns and that, after transfer, some of these elements may form stable components of the host-cell nuclear genomes. The Zygnematales introns, which share a common insertion site (position 1506 relative to the Escherichia coli small-subunit rRNA), form one subfamily of group I introns that has, after its origin, been inherited through common ancestry. Since the first Zygnematales appear in the middle Devonian within the fossil record, the "1506" group I intron presumably has been a stable component of the Zygnematales small-subunit rRNA coding region for 350-400 million years. PMID:7937917
Complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus).
Li, Linmiao; Li, Min; Wu, Zhengjun; Chen, Jinping
2015-01-01
We have characterized the complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus) and described its organization in this study. The total length of C. sphinx complete mitochondrial genome was 16,895 bp with the base composition of 32.54% A, 14.05% G, 25.82% T and 27.59% C. The complete mitochondrial genome included 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes (12S rRNA and 16S rRNA) and 1 control region (D-loop). The control region was 1435 bp long with the sequence CATACG repeat 64 times. Three protein-coding genes (ND1, COI and ND4) were ended with incomplete stop codon TA or T.
Kim, Sangkyu; Welsh, David A; Myers, Leann; Cherry, Katie E; Wyckoff, Jennifer; Jazwinski, S Michal
2015-02-28
We have completed a genome-wide linkage scan for healthy aging using data collected from a family study, followed by fine-mapping by association in a separate population, the first such attempt reported. The family cohort consisted of parents of age 90 or above and their children ranging in age from 50 to 80. As a quantitative measure of healthy aging, we used a frailty index, called FI34, based on 34 health and function variables. The linkage scan found a single significant linkage peak on chromosome 12. Using an independent cohort of unrelated nonagenarians, we carried out a fine-scale association mapping of the region suggestive of linkage and identified three sites associated with healthy aging. These healthy-aging sites (HASs) are located in intergenic regions at 12q13-14. HAS-1 has been previously associated with multiple diseases, and an enhancer was recently mapped and experimentally validated within the site. HAS-2 is a previously uncharacterized site possessing genomic features suggestive of enhancer activity. HAS-3 contains features associated with Polycomb repression. The HASs also contain variants associated with exceptional longevity, based on a separate analysis. Our results provide insight into functional genomic networks involving non-coding regulatory elements that are involved in healthy aging and longevity.
Kim, Sangkyu; Welsh, David A.; Myers, Leann; Cherry, Katie E.; Wyckoff, Jennifer; Jazwinski, S. Michal
2015-01-01
We have completed a genome-wide linkage scan for healthy aging using data collected from a family study, followed by fine-mapping by association in a separate population, the first such attempt reported. The family cohort consisted of parents of age 90 or above and their children ranging in age from 50 to 80. As a quantitative measure of healthy aging, we used a frailty index, called FI34, based on 34 health and function variables. The linkage scan found a single significant linkage peak on chromosome 12. Using an independent cohort of unrelated nonagenarians, we carried out a fine-scale association mapping of the region suggestive of linkage and identified three sites associated with healthy aging. These healthy-aging sites (HASs) are located in intergenic regions at 12q13–14. HAS-1 has been previously associated with multiple diseases, and an enhancer was recently mapped and experimentally validated within the site. HAS-2 is a previously uncharacterized site possessing genomic features suggestive of enhancer activity. HAS-3 contains features associated with Polycomb repression. The HASs also contain variants associated with exceptional longevity, based on a separate analysis. Our results provide insight into functional genomic networks involving non-coding regulatory elements that are involved in healthy aging and longevity. PMID:25682868
Mallatt, Jon; Craig, Catherine Waggoner; Yoder, Matthew J
2010-04-01
This study (1) uses nearly complete rRNA-gene sequences from across Metazoa (197 taxa) to reconstruct animal phylogeny; (2) presents a highly annotated, manual alignment of these sequences with special reference to rRNA features including paired sites (http://purl.oclc.org/NET/rRNA/Metazoan_alignment) and (3) tests, after eliminating as few disruptive, rogue sequences as possible, if a likelihood framework can recover the main metazoan clades. We found that systematic elimination of approximately 6% of the sequences, including the divergent or unstably placed sequences of cephalopods, arrowworm, symphylan and pauropod myriapods, and of myzostomid and nemertodermatid worms, led to a tree that supported Ecdysozoa, Lophotrochozoa, Protostomia, and Bilateria. Deuterostomia, however, was never recovered, because the rRNA of urochordates goes (nonsignificantly) near the base of the Bilateria. Counterintuitively, when we modeled the evolution of the paired sites, phylogenetic resolution was not increased over traditional tree-building models that assume all sites in rRNA evolve independently. The rRNA genes of non-bilaterians contain a higher % AT than do those of most bilaterians. The rRNA genes of Acoela and Myzostomida were found to be secondarily shortened, AT-enriched, and highly modified, throwing some doubt on the location of these worms at the base of Bilateria in the rRNA tree--especially myzostomids, which other evidence suggests are annelids instead. Other findings are marsupial-with-placental mammals, arrowworms in Ecdysozoa (well supported here but contradicted by morphology), and Placozoa as sister to Cnidaria. Finally, despite the difficulties, the rRNA-gene trees are in strong concordance with trees derived from multiple protein-coding genes in supporting the new animal phylogeny. (c) 2009 Elsevier Inc. All rights reserved.
Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex.
Konermann, Silvana; Brigham, Mark D; Trevino, Alexandro E; Joung, Julia; Abudayyeh, Omar O; Barcena, Clea; Hsu, Patrick D; Habib, Naomi; Gootenberg, Jonathan S; Nishimasu, Hiroshi; Nureki, Osamu; Zhang, Feng
2015-01-29
Systematic interrogation of gene function requires the ability to perturb gene expression in a robust and generalizable manner. Here we describe structure-guided engineering of a CRISPR-Cas9 complex to mediate efficient transcriptional activation at endogenous genomic loci. We used these engineered Cas9 activation complexes to investigate single-guide RNA (sgRNA) targeting rules for effective transcriptional activation, to demonstrate multiplexed activation of ten genes simultaneously, and to upregulate long intergenic non-coding RNA (lincRNA) transcripts. We also synthesized a library consisting of 70,290 guides targeting all human RefSeq coding isoforms to screen for genes that, upon activation, confer resistance to a BRAF inhibitor. The top hits included genes previously shown to be able to confer resistance, and novel candidates were validated using individual sgRNA and complementary DNA overexpression. A gene expression signature based on the top screening hits correlated with markers of BRAF inhibitor resistance in cell lines and patient-derived samples. These results collectively demonstrate the potential of Cas9-based activators as a powerful genetic perturbation technology.
Ge, Wei; Wang, Shan-He; Sun, Bing; Zhang, Yue-Lang; Shen, Wei; Khatib, Hasan; Wang, Xin
2018-06-12
The role of melatonin in promoting the yield of Cashmere goat wool has been demonstrated for decades though there remains a lack of knowledge regarding melatonin mediated hair follicle growth. Recent studies have demonstrated that long non-coding RNAs (lncRNAs) are widely transcribed in the genome and play ubiquitous roles in regulating biological processes. However, the role of lncRNAs in regulating melatonin mediated hair follicle growth remains unclear. In this study, we established an in vitro Cashmere goat secondary hair follicle culture system, and demonstrated that 500 ng/L melatonin exposure promoted hair follicle fiber growth. Based on long intergenic RNA sequencing, we demonstrated that melatonin promoted hair follicle elongation via regulating genes involved in focal adhesion and extracellular matrix receptor pathways and further cis predicting of lncRNAs targeted genes indicated that melatonin mediated lncRNAs mainly targeted vascular smooth muscle contraction and signaling pathways regulating the pluripotency of stem cells. We proposed that melatonin exposure not only perturbed key signals secreted from hair follicle stem cells to regulate hair follicle development, but also mediated lncRNAs mainly targeted to pathways involved in the microvascular system and extracellular matrix, which constitute the highly orchestrated microenvironment for hair follicle stem cell. Taken together, our findings here provide a profound view of lncRNAs in regulating Cashmere goat hair follicle circadian rhythms and broaden our knowledge on melatonin mediated hair follicle morphological changes.
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Complete mitochondrial genome of the Tyto longimembris (Strigiformes: Tytonidae).
Xu, Peng; Li, Yankuo; Miao, Lujun; Xie, Guangyong; Huang, Yan
2016-07-01
The complete mitochondrial genome of Tyto longimembris has been determined in this study. It is 18,466 bp in length and consists of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes and a non-coding control region (D-loop). The overall base composition of the heavy strand of the T. longimembris mitochondrial genome is A: 30.1%, T: 23.5%, C: 31.8% and G: 14.6%. The structure of control region should be characterized by a region containing tandem repeats as two definitely separated clusters of tandem repeats were found. This study provided an important data set for phylogenetic and taxonomic analyses of Tyto species.
Garcia, Sònia; Panero, José L; Siroky, Jiri; Kovarik, Ales
2010-08-16
In flowering plants and animals the most common ribosomal RNA genes (rDNA) organisation is that in which 35S (encoding 18S-5.8S-26S rRNA) and 5S genes are physically separated occupying different chromosomal loci. However, recent observations established that both genes have been unified to a single 35S-5S unit in the genus Artemisia (Asteraceae), a genomic arrangement typical of primitive eukaryotes such as yeast, among others. Here we aim to reveal the origin, distribution and mechanisms leading to the linked organisation of rDNA in the Asteraceae by analysing unit structure (PCR, Southern blot, sequencing), gene copy number (quantitative PCR) and chromosomal position (FISH) of 5S and 35S rRNA genes in approximately 200 species representing the family diversity and other closely related groups. Dominant linked rDNA genotype was found within three large groups in subfamily Asteroideae: tribe Anthemideae (93% of the studied cases), tribe Gnaphalieae (100%) and in the "Heliantheae alliance" (23%). The remaining five tribes of the Asteroideae displayed canonical non linked arrangement of rDNA, as did the other groups in the Asteraceae. Nevertheless, low copy linked genes were identified among several species that amplified unlinked units. The conserved position of functional 5S insertions downstream from the 26S gene suggests a unique, perhaps retrotransposon-mediated integration event at the base of subfamily Asteroideae. Further evolution likely involved divergence of 26S-5S intergenic spacers, amplification and homogenisation of units across the chromosomes and concomitant elimination of unlinked arrays. However, the opposite trend, from linked towards unlinked arrangement was also surmised in few species indicating possible reversibility of these processes. Our results indicate that nearly 25% of Asteraceae species may have evolved unusual linked arrangement of rRNA genes. Thus, in plants, fundamental changes in intrinsic structure of rDNA units, their copy number and chromosomal organisation may occur within relatively short evolutionary time. We hypothesize that the 5S gene integration within the 35S unit might have repeatedly occurred during plant evolution, and probably once in Asteraceae.
2010-01-01
Background In flowering plants and animals the most common ribosomal RNA genes (rDNA) organisation is that in which 35S (encoding 18S-5.8S-26S rRNA) and 5S genes are physically separated occupying different chromosomal loci. However, recent observations established that both genes have been unified to a single 35S-5S unit in the genus Artemisia (Asteraceae), a genomic arrangement typical of primitive eukaryotes such as yeast, among others. Here we aim to reveal the origin, distribution and mechanisms leading to the linked organisation of rDNA in the Asteraceae by analysing unit structure (PCR, Southern blot, sequencing), gene copy number (quantitative PCR) and chromosomal position (FISH) of 5S and 35S rRNA genes in ~200 species representing the family diversity and other closely related groups. Results Dominant linked rDNA genotype was found within three large groups in subfamily Asteroideae: tribe Anthemideae (93% of the studied cases), tribe Gnaphalieae (100%) and in the "Heliantheae alliance" (23%). The remaining five tribes of the Asteroideae displayed canonical non linked arrangement of rDNA, as did the other groups in the Asteraceae. Nevertheless, low copy linked genes were identified among several species that amplified unlinked units. The conserved position of functional 5S insertions downstream from the 26S gene suggests a unique, perhaps retrotransposon-mediated integration event at the base of subfamily Asteroideae. Further evolution likely involved divergence of 26S-5S intergenic spacers, amplification and homogenisation of units across the chromosomes and concomitant elimination of unlinked arrays. However, the opposite trend, from linked towards unlinked arrangement was also surmised in few species indicating possible reversibility of these processes. Conclusions Our results indicate that nearly 25% of Asteraceae species may have evolved unusual linked arrangement of rRNA genes. Thus, in plants, fundamental changes in intrinsic structure of rDNA units, their copy number and chromosomal organisation may occur within relatively short evolutionary time. We hypothesize that the 5S gene integration within the 35S unit might have repeatedly occurred during plant evolution, and probably once in Asteraceae. PMID:20712858
Diversity of acetic acid bacteria present in healthy grapes from the Canary Islands.
Valera, Maria José; Laich, Federico; González, Sara S; Torija, Maria Jesús; Mateo, Estibaliz; Mas, Albert
2011-11-15
The identification of acetic acid bacteria (AAB) from sound grapes from the Canary Islands is reported in the present study. No direct recovery of bacteria was possible in the most commonly used medium, so microvinifications were performed on grapes from Tenerife, La Palma and Lanzarote islands. Up to 396 AAB were isolated from those microvinifications and identified by 16S rRNA gene sequencing and phylogenetic analysis. With this method, Acetobacter pasteurianus, Acetobacter tropicalis, Gluconobacter japonicus and Gluconacetobacter saccharivorans were identified. However, no discrimination between the closely related species Acetobacter malorum and Acetobacter cerevisiae was possible. As previously described, 16S-23S rRNA gene internal transcribed spacer (ITS) region phylogenetic analysis was required to classify isolates as one of those species. These two species were the most frequently occurring, accounting for more than 60% of the isolates. For typing the AAB isolates, both the Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR and (GTG)5-PCR techniques gave similar resolution. A total of 60 profiles were identified. Thirteen of these profiles were found in more than one vineyard, and only one profile was found on two different islands (Tenerife and La Palma). Copyright © 2011 Elsevier B.V. All rights reserved.
Porphyromonas loveana sp. nov., isolated from the oral cavity of Australian marsupials.
Bird, Philip S; Trott, Darren J; Mikkelsen, Deirdre; Milinovich, Gabriel J; Hillman, Kristine M; Burrell, Paul C; Blackall, Linda L
2016-10-01
An obligatory anaerobic, Gram-stain-negative coccobacillus with black-pigmented colonies was isolated from the oral cavity of selected Australian marsupial species. Phenotypic and molecular criteria showed that this bacterium was a distinct species within the genus Porphyromonas, and was closely related to Porphyromonas gingivalis and Porphyromonas gulae. This putative novel species and P. gulae could be differentiated from P. gingivalis by catalase activity. Further characterization by multi-locus enzyme electrophoresis of glutamate dehydrogenase and malate dehydrogenase enzyme mobility and matrix-assisted laser desorption ionization time-of-flight MS showed that this putative novel species could be differentiated phenotypically from P. gingivalis and P. gulae. Definitive identification by 16S rRNA gene sequencing showed that this bacterium belonged to a unique monophyletic lineage, phylogenetically distinct from P. gingivalis (94.9 % similarity) and P. gulae (95.5 %). This also was supported by 16S-23S rRNA intergenic spacer region and glutamate dehydrogenase gene sequencing. A new species epithet, Porphyromonas loveana sp. nov., is proposed for this bacterium, with DSM 28520T (=NCTC 13658T=UQD444T=MRK101T), isolated from a musky rat kangaroo, as the type strain.
Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps
Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz
2015-01-01
In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450
Organisation of the plant genome in chromosomes.
Heslop-Harrison, J S Pat; Schwarzacher, Trude
2011-04-01
The plant genome is organized into chromosomes that provide the structure for the genetic linkage groups and allow faithful replication, transcription and transmission of the hereditary information. Genome sizes in plants are remarkably diverse, with a 2350-fold range from 63 to 149,000 Mb, divided into n=2 to n= approximately 600 chromosomes. Despite this huge range, structural features of chromosomes like centromeres, telomeres and chromatin packaging are well-conserved. The smallest genomes consist of mostly coding and regulatory DNA sequences present in low copy, along with highly repeated rDNA (rRNA genes and intergenic spacers), centromeric and telomeric repetitive DNA and some transposable elements. The larger genomes have similar numbers of genes, with abundant tandemly repeated sequence motifs, and transposable elements alone represent more than half the DNA present. Chromosomes evolve by fission, fusion, duplication and insertion events, allowing evolution of chromosome size and chromosome number. A combination of sequence analysis, genetic mapping and molecular cytogenetic methods with comparative analysis, all only becoming widely available in the 21st century, is elucidating the exact nature of the chromosome evolution events at all timescales, from the base of the plant kingdom, to intraspecific or hybridization events associated with recent plant breeding. As well as being of fundamental interest, understanding and exploiting evolutionary mechanisms in plant genomes is likely to be a key to crop development for food production. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Evolution of the unspliced transcriptome.
Engelhardt, Jan; Stadler, Peter F
2015-08-20
Despite their abundance, unspliced EST data have received little attention as a source of information on non-coding RNAs. Very little is know, therefore, about the genomic distribution of unspliced non-coding transcripts and their relationship with the much better studied regularly spliced products. In particular, their evolution has remained virtually unstudied. We systematically study the evidence on unspliced transcripts available in EST annotation tracks for human and mouse, comprising 104,980 and 66,109 unspliced EST clusters, respectively. Roughly one third of these are located totally inside introns of known genes (TINs) and another third overlaps exonic regions (PINs). Eleven percent are "intergenic", far away from any annotated gene. Direct evidence for the independent transcription of many PINs and TINs is obtained from CAGE tag and chromatin data. We predict more than 2000 3'UTR-associated RNA candidates for each human and mouse. Fifteen to twenty percent of the unspliced EST cluster are conserved between human and mouse. With the exception of TINs, the sequences of unspliced EST clusters evolve significantly slower than genomic background. Furthermore, like spliced lincRNAs, they show highly tissue-specific expression patterns. Unspliced long non-coding RNAs are an important, rapidly evolving, component of mammalian transcriptomes. Their analysis is complicated by their preferential association with complex transcribed loci that usually also harbor a plethora of spliced transcripts. Unspliced EST data, although typically disregarded in transcriptome analysis, can be used to gain insights into this rarely investigated transcriptome component. The frequently postulated connection between lack of splicing and nuclear retention and the surprising overlap of chromatin-associated transcripts suggests that this class of transcripts might be involved in chromatin organization and possibly other mechanisms of epigenetic control.
Ruiz Esparza-Garrido, Ruth; Rodríguez-Corona, Juan Manuel; López-Aguilar, Javier Enrique; Rodríguez-Florido, Marco Antonio; Velázquez-Wong, Ana Claudia; Viedma-Rodríguez, Rubí; Salamanca-Gómez, Fabio; Velázquez-Flores, Miguel Ángel
2017-10-01
Expression changes for long non-coding RNAs (lncRNAs) have been identified in adult glioblastoma multiforme (GBM) and in a mixture of adult and pediatric astrocytoma. Since adult and pediatric astrocytomas are molecularly different, the mixture of both could mask specific features in each. We determined the global expression patterns of lncRNAs and messenger RNA (mRNAs) in pediatric astrocytoma of different histological grades. Transcript expression changes were determined with an HTA 2.0 array. lncRNA interactions with microRNAs and mRNAs were predicted by using an algorithm and the LncTar tool, respectively. Interactomes were constructed with the HIPPIE database and visualized with the Cytoscape platform. The array showed expression changes in 156 and 207 lncRNAs in tumors (versus the control) and in pediatric GBM (versus low-grade astrocytoma), respectively. Predictions identified lncRNAs that have putative microRNA binding sites, which might suggest that they function as sponges in these tumors. Also, lncRNAs were shown to interact with many mRNAs, such as Pleckstrin homology-like domain, family A, member 1 (PHLDA1) and sulfatase 2 (SULF2). For example, qPCR found long intergenic non-coding RNA regulator of reprogramming (linc-RoR) expression levels upregulated in pediatric GBM when they were compared with control tissues or with low-grade tumors. Meanwhile, PHLDA1 and ELAV-like RNA binding protein 1 (ELAV1) showed expression changes in tumors relative to the control. Our data showed many lncRNAs with expression changes in pediatric astrocytoma, which might be involved in the regulation of different signaling pathways.
Atkinson, Sophie; Marguerat, Samuel; Bitton, Danny; Bachand, Francois; Rodriguez-Lopez, Maria; Rallis, Charalampos; Lemay, Jean-Francois; Cotobal, Cristina; Malecki, Michal; Smialowski, Pawel; Mata, Juan; Korber, Philipp; Bahler, Jurg
2018-06-18
Long non-coding RNAs (lncRNAs), which are longer than 200 nucleotides but often unstable, contribute a substantial and diverse portion to pervasive non-coding transcriptomes. Most lncRNAs are poorly annotated and understood, although several play important roles in gene regulation and diseases. Here we systematically uncover and analyse lncRNAs in Schizosaccharomyces pombe. Based on RNA-seq data from twelve RNA-processing mutants and nine physiological conditions, we identify 5775 novel lncRNAs, nearly 4-times the previously annotated lncRNAs. The expression of most lncRNAs becomes strongly induced under the genetic and physiological perturbations, most notably during late meiosis. Most lncRNAs are cryptic and suppressed by three RNA-processing pathways: the nuclear exosome, cytoplasmic exonuclease, and RNAi. Double-mutant analyses reveal substantial coordination and redundancy among these pathways. We classify lncRNAs by their dominant pathway into cryptic unstable transcripts (CUTs), Xrn1-sensitive unstable transcripts (XUTs), and Dicer-sensitive unstable transcripts (DUTs). XUTs and DUTs are enriched for antisense lncRNAs, while CUTs are often bidirectional and actively translated. The cytoplasmic exonuclease, along with RNAi, dampens the expression of thousands of lncRNAs and mRNAs that become induced during meiosis. Antisense lncRNA expression mostly negatively correlates with sense mRNA expression in the physiological, but not the genetic conditions. Intergenic and bidirectional lncRNAs emerge from nucleosome-depleted regions, upstream of positioned nucleosomes. Our results highlight both similarities and differences to lncRNA regulation in budding yeast. This broad survey of the lncRNA repertoire and characteristics in S. pombe, and the interwoven regulatory pathways that target lncRNAs, provides a rich framework for their further functional analyses. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Nelson, Leigh A; Cameron, Stephen L; Yeates, David K
2011-10-01
The monogeneric family Fergusoninidae consists of gall-forming flies that, together with Fergusobia (Tylenchida: Neotylenchidae) nematodes, form the only known mutualistic association between insects and nematodes. In this study, the entire 16,000 bp mitochondrial genome of Fergusonina taylori Nelson and Yeates was sequenced. The circular genome contains one encoding region including 27 genes and one non-coding A+T-rich region. The arrangement of the protein-coding, ribosomal RNA (rRNA) and transfer RNA (tRNA) genes was the same as that found in the ancestral insect. Nucleotide composition is highly A+T biased. All of the protein initiation codons are ATN, except for nad1 which begins with TTT. All 22 tRNA anticodons of F. taylori match those observed in Drosophila yakuba, and all form the typical cloverleaf structure except for tRNA-Ser((AGN)) which lacks a dihydrouridine (DHU) arm. Secondary structural features of the rRNA genes of Fergusonina are similar to those proposed for other insects, with minor modifications. The mitochondrial genome of Fergusonina presented here may prove valuable for resolving the sister group to the Fergusoninidae, and expands the available mtDNA data sources for acalyptrates overall.
[Novel bidirectional promoter from human genome].
Orekhova, A S; Sverdlova, P S; Spirin, P V; Leonova, O G; Popenko, V I; Prasolov, V S; Rubtsov, P M
2011-01-01
In human and other mammalian genomes a number of closely linked gene pairs transcribed in opposite directions are found. According to bioinformatic analysis up to 10% of human genes are arranged in this way. In present work the fragment of human genome was cloned that separates genes localized at 2p13.1 and oriented "head-to-head", coding for hypothetical proteins with unknown functions--CCDC (Coiled Coil Domain Containing) 142 and TTC (TetraTricopeptide repeat Containing) 31. Intergenic CCDC142-TTC31 region overlaps with CpG-island and contains a number of potential binding sites for transcription factors. This fragment functions as bidirectional promoter in the system ofluciferase reporter gene expression upon transfection of human embryonic kidney (HEK293) cells. The vectors containing genes of two fluorescent proteins--green (EGFP) and red (DsRed2) in opposite orientations separated by the fragment of CCDC142-TTC31 intergenic region were constructed. In HEK293 cells transfected with these vectors simultaneous expression of two fluorescent proteins is observed. Truncated versions of intergenic region were obtained and their promoter activity measured. Minimal promoter fragment contains elements Inr, BRE, DPE characteristic for TATA-less promoters. Thus, from the human genome the novel bidirectional promoter was cloned that can be used for simultaneous constitutive expression of two genes in human cells.
Kazakoff, Stephen H.; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T.; Gresshoff, Peter M.
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® ‘Second Generation DNA Sequencing (2GS)’ and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites. PMID:23272141
Kazakoff, Stephen H; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T; Gresshoff, Peter M
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® 'Second Generation DNA Sequencing (2GS)' and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites.
Yao, Yao; Wang, Rui; Lu, Jun Kun; Wang, En Tao; Chen, Wen Xin
2014-01-01
The nodulation of Erythrophleum fordii has been recorded recently, but its microsymbionts have never been studied. To investigate the diversity and biogeography of rhizobia associated with this leguminous evergreen tree, root nodules were collected from the southern subtropical region of China. A total of 166 bacterial isolates were obtained from the nodules and characterized. In a PCR-based restriction fragment length polymorphism (RFLP) analysis of ribosomal intergenic sequences, the isolates were classified into 22 types within the genus Bradyrhizobium. Sequence analysis of 16S rRNA, ribosomal intergenic spacer (IGS), and the housekeeping genes recA and glnII classified the isolates into four groups: the Bradyrhizobium elkanii and Bradyrhizobium pachyrhizi groups, comprising the dominant symbionts, Bradyrhizobium yuanmingense, and an unclassified group comprising the minor symbionts. The nodC and nifH phylogenetic trees defined five or six lineages among the isolates, which was largely consistent with the definition of genomic species. The phylogenetic results and evolutionary analysis demonstrated that mutation and vertical transmission of genes were the principal processes for the divergent evolution of Bradyrhizobium species associated with E. fordii, while lateral transfer and recombination of housekeeping and symbiotic genes were rare. The distribution of the dominant rhizobial populations was affected by soil pH and effective phosphorus. This is the first report to characterize E. fordii rhizobia. PMID:25085491
RNA Sequencing of the Exercise Transcriptome in Equine Athletes
Verini-Supplizi, Andrea; Barcaccia, Gianni; Albiero, Alessandro; D'Angelo, Michela; Campagna, Davide; Valle, Giorgio; Felicetti, Michela; Silvestrelli, Maurizio; Cappelli, Katia
2013-01-01
The horse is an optimal model organism for studying the genomic response to exercise-induced stress, due to its natural aptitude for athletic performance and the relative homogeneity of its genetic and environmental backgrounds. Here, we applied RNA-sequencing analysis through the use of SOLiD technology in an experimental framework centered on exercise-induced stress during endurance races in equine athletes. We monitored the transcriptional landscape by comparing gene expression levels between animals at rest and after competition. Overall, we observed a shift from coding to non-coding regions, suggesting that the stress response involves the differential expression of not annotated regions. Notably, we observed significant post-race increases of reads that correspond to repeats, especially the intergenic and intronic L1 and L2 transposable elements. We also observed increased expression of the antisense strands compared to the sense strands in intronic and regulatory regions (1 kb up- and downstream) of the genes, suggesting that antisense transcription could be one of the main mechanisms for transposon regulation in the horse under stress conditions. We identified a large number of transcripts corresponding to intergenic and intronic regions putatively associated with new transcriptional elements. Gene expression and pathway analysis allowed us to identify several biological processes and molecular functions that may be involved with exercise-induced stress. Ontology clustering reflected mechanisms that are already known to be stress activated (e.g., chemokine-type cytokines, Toll-like receptors, and kinases), as well as “nucleic acid binding” and “signal transduction activity” functions. There was also a general and transient decrease in the global rates of protein synthesis, which would be expected after strenuous global stress. In sum, our network analysis points toward the involvement of specific gene clusters in equine exercise-induced stress, including those involved in inflammation, cell signaling, and immune interactions. PMID:24391776
Zhang, Hong-Li; Ye, Fei
2017-01-01
Praying mantises are a diverse group of predatory insects. Although some Mantodea mitogenomes have been reported, a comprehensive comparative and evolutionary genomic study is lacking for this group. In the present study, four new mitogenomes were sequenced, annotated, and compared to the previously published mitogenomes of other Mantodea species. Most Mantodea mitogenomes share a typical set of mitochondrial genes and a putative control region (CR). Additionally, and most intriguingly, another large non-coding region (LNC) was detected between trnM and ND2 in all six Paramantini mitogenomes examined. The main section in this common region of Paramantini may have initially originated from the corresponding control region for each species, whereas sequence differences between the LNCs and CRs and phylogenetic analyses indicate that LNC and CR are largely independently evolving. Namely, the LNC (the duplicated CR) may have subsequently degenerated during evolution. Furthermore, evidence suggests that special intergenic gaps have been introduced in some species through gene rearrangement and duplication. These gaps are actually the original abutting sequences of migrated or duplicated genes. Some gaps (G5 and G6) are homologous to the 5' and 3' surrounding regions of the duplicated gene in the original gene order, and another specific gap (G7) has tandem repeats. We analysed the phylogenetic relationships of fifteen Mantodea species using 37 concatenated mitochondrial genes and detected several synapomorphies unique to species in some clades. PMID:28367101
FunGene: the functional gene pipeline and repository.
Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R
2013-01-01
Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
A Molecular Portrait of De Novo Genes in Yeasts.
Vakirlis, Nikolaos; Hebert, Alex S; Opulente, Dana A; Achaz, Guillaume; Hittinger, Chris Todd; Fischer, Gilles; Coon, Joshua J; Lafontaine, Ingrid
2018-03-01
New genes, with novel protein functions, can evolve "from scratch" out of intergenic sequences. These de novo genes can integrate the cell's genetic network and drive important phenotypic innovations. Therefore, identifying de novo genes and understanding how the transition from noncoding to coding occurs are key problems in evolutionary biology. However, identifying de novo genes is a difficult task, hampered by the presence of remote homologs, fast evolving sequences and erroneously annotated protein coding genes. To overcome these limitations, we developed a procedure that handles the usual pitfalls in de novo gene identification and predicted the emergence of 703 de novo gene candidates in 15 yeast species from 2 genera whose phylogeny spans at least 100 million years of evolution. We validated 85 candidates by proteomic data, providing new translation evidence for 25 of them through mass spectrometry experiments. We also unambiguously identified the mutations that enabled the transition from noncoding to coding for 30 Saccharomyces de novo genes. We established that de novo gene origination is a widespread phenomenon in yeasts, only a few being ultimately maintained by selection. We also found that de novo genes preferentially emerge next to divergent promoters in GC-rich intergenic regions where the probability of finding a fortuitous and transcribed ORF is the highest. Finally, we found a more than 3-fold enrichment of de novo genes at recombination hot spots, which are GC-rich and nucleosome-free regions, suggesting that meiotic recombination contributes to de novo gene emergence in yeasts.
Hotto, Amber M; Huston, Zoe E; Stern, David B
2010-09-29
The roles of non-coding RNAs in regulating gene expression have been extensively studied in both prokaryotes and eukaryotes, however few reports exist as to their roles in organellar gene regulation. Evidence for accumulation of natural antisense RNAs (asRNAs) in chloroplasts comes from the expressed sequence tag database and cDNA libraries, while functional data have been largely obtained from artificial asRNAs. In this study, we used Nicotiana tabacum to investigate the effect on sense strand transcripts of overexpressing a natural chloroplast asRNA, AS5, which is complementary to the region which encodes the 5S rRNA and tRNAArg. AS5-overexpressing (AS5ox) plants obtained by chloroplast transformation exhibited slower growth and slightly pale green leaves. Analysis of AS5 transcripts revealed four distinct species in wild-type (WT) and AS5ox plants, and additional AS5ox-specific products. Of the corresponding sense strand transcripts, tRNAArg overaccumulated several-fold in transgenic plants whereas 5S rRNA was unaffected. However, run-on transcription showed that the 5S-trnR region was transcribed four-fold more in the AS5ox plants compared to WT, indicating that overexpression of AS5 was associated with decreased stability of 5S rRNA. In addition, polysome analysis of the transformants showed less 5S rRNA and rbcL mRNA associated with ribosomes. Our results suggest that AS5 can modulate 5S rRNA levels, giving it the potential to affect Chloroplast translation and plant growth. More globally, overexpression of asRNAs via chloroplast transformation may be a useful strategy for defining their functions.
Salisbury, Joseph P; Sîrbulescu, Ruxandra F; Moran, Benjamin M; Auclair, Jared R; Zupanc, Günther K H; Agar, Jeffrey N
2015-03-11
The brown ghost knifefish (Apteronotus leptorhynchus) is a weakly electric teleost fish of particular interest as a versatile model system for a variety of research areas in neuroscience and biology. The comprehensive information available on the neurophysiology and neuroanatomy of this organism has enabled significant advances in such areas as the study of the neural basis of behavior, the development of adult-born neurons in the central nervous system and their involvement in the regeneration of nervous tissue, as well as brain aging and senescence. Despite substantial scientific interest in this species, no genomic resources are currently available. Here, we report the de novo assembly and annotation of the A. leptorhynchus transcriptome. After evaluating several trimming and transcript reconstruction strategies, de novo assembly using Trinity uncovered 42,459 unique contigs containing at least a partial protein-coding sequence based on alignment to a reference set of known Actinopterygii sequences. As many as 11,847 of these contigs contained full or near-full length protein sequences, providing broad coverage of the proteome. A variety of non-coding RNA sequences were also identified and annotated, including conserved long intergenic non-coding RNA and other long non-coding RNA observed previously to be expressed in adult zebrafish (Danio rerio) brain, as well as a variety of miRNA, snRNA, and snoRNA. Shotgun proteomics confirmed translation of open reading frames from over 2,000 transcripts, including alternative splice variants. Assignment of tandem mass spectra was greatly improved by use of the assembly compared to databases of sequences from closely related organisms. The assembly and raw reads have been deposited at DDBJ/EMBL/GenBank under the accession number GBKR00000000. Tandem mass spectrometry data is available via ProteomeXchange with identifier PXD001285. Presented here is the first release of an annotated de novo transcriptome assembly from Apteronotus leptorhynchus, providing a broad overview of RNA expressed in central nervous system tissue. The assembly, which includes substantial coverage of a wide variety of both protein coding and non-coding transcripts, will allow the development of better tools to understand the mechanisms underlying unique characteristics of the knifefish model system, such as their tremendous regenerative capacity and negligible brain senescence.
A long and abundant non-coding RNA in Lactobacillus salivarius.
Cousin, Fabien J; Lynch, Denise B; Chuat, Victoria; Bourin, Maxence J B; Casey, Pat G; Dalmasso, Marion; Harris, Hugh M B; McCann, Angela; O'Toole, Paul W
2017-09-01
Lactobacillus salivarius , found in the intestinal microbiota of humans and animals, is studied as an example of the sub-dominant intestinal commensals that may impart benefits upon their host. Strains typically harbour at least one megaplasmid that encodes functions contributing to contingency metabolism and environmental adaptation. RNA sequencing (RNA-seq)transcriptomic analysis of L. salivarius strain UCC118 identified the presence of a novel unusually abundant long non-coding RNA (lncRNA) encoded by the megaplasmid, and which represented more than 75 % of the total RNA-seq reads after depletion of rRNA species. The expression level of this 520 nt lncRNA in L. salivarius UCC118 exceeded that of the 16S rRNA, it accumulated during growth, was very stable over time and was also expressed during intestinal transit in a mouse. This lncRNA sequence is specific to the L. salivarius species; however, among 45 L . salivarius genomes analysed, not all (only 34) harboured the sequence for the lncRNA. This lncRNA was produced in 27 tested L. salivarius strains, but at strain-specific expression levels. High-level lncRNA expression correlated with high megaplasmid copy number. Transcriptome analysis of a deletion mutant lacking this lncRNA identified altered expression levels of genes in a number of pathways, but a definitive function of this new lncRNA was not identified. This lncRNA presents distinctive and unique properties, and suggests potential basic and applied scientific developments of this phenomenon.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Siddaramappa, Shivakumara; Delano, Susana; Green, Lance D.
2012-01-01
Dehalogenimonas lykanthroporepellens is the type species of the genus Dehalogenimonas, which belongs to a deeply branching lineage within the phylum Chloroflexi. This strictly anaerobic, mesophilic, non spore forming, Gram negative staining bacterium was first isolated from chlorinated solvent contaminated groundwater at a Superfund site located near Baton Rouge, Louisiana, USA. D. lykanthroporepellens was of interest for genome sequencing for two reasons: (a) its unusual ability to couple growth with reductive dechlorination of environmentally important polychlorinated aliphatic alkanes and (b) its phylogenetic position distant from previously sequenced bacteria. The 1,686,510 bp circular chromosome of strain BL-DC-9{sup T} contains 1,720 predicted proteinmore » coding genes, 47 tRNA genes, a single large subunit rRNA (23S-5S) locus, and a single, orphan, small unit rRNA (16S) locus.« less
Regulation of nucleolus assembly by non-coding RNA polymerase II transcripts
Caudron-Herger, Maïwen; Pankert, Teresa; Rippe, Karsten
2016-01-01
ABSTRACT The nucleolus is a nuclear subcompartment for tightly regulated rRNA production and ribosome subunit biogenesis. It also acts as a cellular stress sensor and can release enriched factors in response to cellular stimuli. Accordingly, the content and structure of the nucleolus change dynamically, which is particularly evident during cell cycle progression: the nucleolus completely disassembles during mitosis and reassembles in interphase. Although the mechanisms that drive nucleolar (re)organization have been the subject of a number of studies, they are only partly understood. Recently, we identified Alu element-containing RNA polymerase II transcripts (aluRNAs) as important for nucleolar structure and rRNA synthesis. Integrating these findings with studies on the liquid droplet-like nature of the nucleolus leads us to propose a model on how RNA polymerase II transcripts could regulate the assembly of the nucleolus in response to external stimuli and during cell cycle progression. PMID:27416361
Regulation of nucleolus assembly by non-coding RNA polymerase II transcripts.
Caudron-Herger, Maïwen; Pankert, Teresa; Rippe, Karsten
2016-05-03
The nucleolus is a nuclear subcompartment for tightly regulated rRNA production and ribosome subunit biogenesis. It also acts as a cellular stress sensor and can release enriched factors in response to cellular stimuli. Accordingly, the content and structure of the nucleolus change dynamically, which is particularly evident during cell cycle progression: the nucleolus completely disassembles during mitosis and reassembles in interphase. Although the mechanisms that drive nucleolar (re)organization have been the subject of a number of studies, they are only partly understood. Recently, we identified Alu element-containing RNA polymerase II transcripts (aluRNAs) as important for nucleolar structure and rRNA synthesis. Integrating these findings with studies on the liquid droplet-like nature of the nucleolus leads us to propose a model on how RNA polymerase II transcripts could regulate the assembly of the nucleolus in response to external stimuli and during cell cycle progression.
The complete mitochondrial genome of the bagarius yarrelli from honghe river
NASA Astrophysics Data System (ADS)
Du, M.; Zhou, C. J.; Niu, B. Z.; Liu, Y. H.; Li, N.; Ai, J. L.; Xu, G. L.
2016-08-01
The total length of mitochondrial DNA sequence of the Bagarius yarrelli from the Honghe river of China is determined in this paper. The total length of the circular molecule is 16524 base pair which denoted a similar gene order to that of the other bony fishes, which include a non-coding control region, a replicated origin, two ribosome RNA (rRNA) genes, 22 transfer RNA (tRNA) genes as well as 13 protein-coding genes. Its whole base constitution is 31.4% for A, 26.9% for C, 15.7% for G and 26.0% for T, with an A+T bias of 57.4%. Those mitochondrial data would contribute to further study molecular evolution and population genetics of this species.
The complete mitochondrial genome sequence of the Datong yak (Bos grunniens).
Wu, Xiaoyun; Chu, Min; Liang, Chunnian; Ding, Xuezhi; Guo, Xian; Bao, Pengjia; Yan, Ping
2016-01-01
Datong yak is a famous artificially cultivated breed in China. In the present work, we report the complete mitochondrial genome sequence of Datong yak for the first time. The total length of the mitogenome is 16,323 bp long, containing 13 protein-coding genes, 22 tRNA genes, two rRNA genes and one non-coding region (D-loop region). The gene order of Datong yak mitogenome is identical to that observed in most other vertebrates. The overall base composition is 33.71% A, 25.8.0% C, 13.21% G and 27.27% T, with an A + T content of 60.98%. The complete mitogenome sequence information of Datong yak can provide useful data for further studies on molecular breeding and taxonomic status.
Characterization of the complete mitochondrial genome sequence of Gannan yak (Bos grunniens).
Wu, Xiaoyun; Ding, Xuezhi; Chu, Min; Guo, Xian; Bao, Pengjia; Liang, Chunnian; Yan, Ping
2016-01-01
Gannan yak is the native breed of Gansu province in China. In this work, the complete mitochondrial genome sequence of Gannan yak was determined for the first time. The total length of the mitogenome is 16,322 bp long, with the base composition of 33.74% A, 25.84% T, 13.18% C, and 27.24% G. It contained 13 protein-coding genes, 22 tRNA genes, two rRNA genes and one non-coding region (D-loop region). The gene order of Gannan yak mitogenome is identical to that observed in most other vertebrates. The complete mitogenome sequence information of Gannan yak can provide useful data for further studies on protection of genetic resources and phylogenetic relationships within Bos grunniens.
Complete mitochondrial genome of Chuanzhong black goat in southwest of China (Capra hircus).
Huang, Yong-Fu; Chen, Li-Peng; Zhao, Yong-Ju; Zhang, Hao; Na, Ri-Su; Zhao, Zhong-Quan; Zhang, Jia-Hua; Jiang, Cao-De; Ma, Yue-Hui; Sun, Ya-Wang; E, Guang-Xin
2016-09-01
The Chuanzhong black goat (Capra hircus) is a breed native to southwest of China. Its complete mitochondrial genome is 16,641 nt in length, consisting of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a non-coding control region. As in other mammals, most mitochondrial genes are encoded on the heavy strand, except for ND6 and eight tRNA genes, which are encoded on the light strand. Its overall base composition is A: 33.5%, T: 27.3%, C: 26.1%, and G: 13.1%. The complete mitogenome of the Chinese indigenous breed of goat could provide a basic data for further phylogenetics analysis.
Zhang, Yanan; Song, Tao; Pan, Tao; Sun, Xiaonan; Sun, Zhonglou; Qian, Lifu; Zhang, Baowei
2016-07-01
The complete sequence of the mitochondrial genome was determined for Asio flammeus, which is distributed widely in geography. The length of the complete mitochondrial genome was 18,966 bp, containing 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes (PCGs), and 1 non-coding region (D-loop). All the genes were distributed on the H-strand, except for the ND6 subunit gene and eight tRNA genes which were encoded on the L-strand. The D-loop of A. flammeus contained many tandem repeats of varying lengths and repeat numbers. The molecular-based phylogeny showed that our species acted as the sister group to A. capensis and the supported Asio was the monophyletic group.
Non-contiguous finished genome sequence and description of Collinsella massiliensis sp. nov.
Padmanabhan, Roshan; Dubourg, Gregory; Nguyen, Thi-Thien; Couderc, Carine; Rossi-Tamisier, Morgane; Caputo, Aurelia; Raoult, Didier; Fournier, Pierre-Edouard
2014-01-01
Collinsella massiliensis strain GD3T is the type strain of Collinsella massiliensis sp. nov., a new species within the genus Collinsella. This strain, whose genome is described here, was isolated from the fecal flora of a 53-year-old French Caucasoid woman who had been admitted to intensive care unit for Guillain-Barré syndrome. Collinsella massiliensis is a Gram-positive, obligate anaerobic, non motile and non sporulating bacillus. Here, we describe the features of this organism, together with the complete genome sequence and annotation. The genome is 2,319,586 bp long (1 chromosome, no plasmid), exhibits a G+C content of 65.8% and contains 2,003 protein-coding and 54 RNA genes, including 1 rRNA operon. PMID:25197489
Non-contiguous finished genome sequence and description of Collinsella massiliensis sp. nov.
Padmanabhan, Roshan; Dubourg, Gregory; Nguyen, Thi-Thien; Couderc, Carine; Rossi-Tamisier, Morgane; Caputo, Aurelia; Raoult, Didier; Fournier, Pierre-Edouard
2014-06-15
Collinsella massiliensis strain GD3(T) is the type strain of Collinsella massiliensis sp. nov., a new species within the genus Collinsella. This strain, whose genome is described here, was isolated from the fecal flora of a 53-year-old French Caucasoid woman who had been admitted to intensive care unit for Guillain-Barré syndrome. Collinsella massiliensis is a Gram-positive, obligate anaerobic, non motile and non sporulating bacillus. Here, we describe the features of this organism, together with the complete genome sequence and annotation. The genome is 2,319,586 bp long (1 chromosome, no plasmid), exhibits a G+C content of 65.8% and contains 2,003 protein-coding and 54 RNA genes, including 1 rRNA operon.
RNomics in Drosophila melanogaster: identification of 66 candidates for novel non-messenger RNAs
Yuan, Guozhong; Klämbt, Christian; Bachellerie, Jean-Pierre; Brosius, Jürgen; Hüttenhofer, Alexander
2003-01-01
By generating a specialised cDNA library from four different developmental stages of Drosophila melanogaster, we have identified 66 candidates for small non-messenger RNAs (snmRNAs) and have confirmed their expression by northern blot analysis. Thirteen of them were expressed at certain stages of D.melanogaster development, only. Thirty-five species belong to the class of small nucleolar RNAs (snoRNAs), divided into 15 members from the C/D subclass and 20 members from the H/ACA subclass, which mostly guide 2′-O-methylation and pseudouridylation, respectively, of rRNA and snRNAs. These also include two outstanding C/D snoRNAs, U3 and U14, both functioning as pre-rRNA chaperones. Surprisingly, the sequence of the Drosophila U14 snoRNA reflects a major change of function of this snoRNA in Diptera relative to yeast and vertebrates. Among the 22 snmRNAs lacking known sequence and structure motifs, five were located in intergenic regions, two in introns, five in untranslated regions of mRNAs, eight were derived from open reading frames, and two were transcribed opposite to an intron. Interestingly, detection of two RNA species from this group implies that certain snmRNA species are processed from alternatively spliced pre-mRNAs. Surprisingly, a few snmRNA sequences could not be found on the published D.melanogaster genome, which might suggest that more snmRNA genes (as well as mRNAs) are hidden in unsequenced regions of the genome. PMID:12736298
Machado, Ana; Bordalo, Adriano A
2014-08-01
Potable water is a resource out of reach for millions worldwide, and the available water may be chemically and microbiologically compromised. This is particularly acute in Africa, where water-networks may be non-existent or restricted to a small fraction of the urban population, as in the case of Guinea-Bissau, West Africa. This study was carried out seasonally in Bolama (11°N), where unprotected hand-dug wells with acidic water are the sole source of water for the population. We inspected the free-living bacterial community dynamics by automated rRNA intergenic spacer analyses, quantitative polymerase chain reaction and cloning approaches. The results revealed a clear seasonal shift in bacterial assemblage composition and microbial abundance within the same sampling site. Temperature, pH and turbidity, together with the infiltration and percolation of surface water, which takes place in the wet season, seemed to be the driving factors in the shaping and selection of the bacterial community and deterioration of water quality. Analysis of 16S rDNA sequences revealed several potential pathogenic bacteria and uncultured bacteria associated with water and sediments, corroborating the importance of a culture-independent approach in drinking water monitoring. Copyright © 2014. Published by Elsevier B.V.
Kasalický, Vojtěch; Jezbera, Jan; Hahn, Martin W.; Šimek, Karel
2013-01-01
Bacteria of the genus Limnohabitans, more precisely the R-BT lineage, have a prominent role in freshwater bacterioplankton communities due to their high rates of substrate uptake and growth, growth on algal-derived substrates and high mortality rates from bacterivory. Moreover, due to their generally larger mean cell volume, compared to typical bacterioplankton cells, they contribute over-proportionally to total bacterioplankton biomass. Here we present genetic, morphological and ecophysiological properties of 35 bacterial strains affiliated with the Limnohabitans genus newly isolated from 11 non-acidic European freshwater habitats. The low genetic diversity indicated by the previous studies using the ribosomal SSU gene highly contrasted with the surprisingly rich morphologies and different patterns in substrate utilization of isolated strains. Therefore, the intergenic spacer between 16S and 23S rRNA genes was successfully tested as a fine-scale marker to delineate individual lineages and even genotypes. For further studies, we propose the division of the Limnohabitans genus into five lineages (provisionally named as LimA, LimB, LimC, LimD and LimE) and also additional sublineages within the most diversified lineage LimC. Such a delineation is supported by the morphology of isolated strains which predetermine large differences in their ecology. PMID:23505469
Molecular characterization of Banana streak virus isolate from Musa Acuminata in China.
Zhuang, Jun; Wang, Jian-Hua; Zhang, Xin; Liu, Zhi-Xin
2011-12-01
Banana streak virus (BSV), a member of genus Badnavirus, is a causal agent of banana streak disease throughout the world. The genetic diversity of BSVs from different regions of banana plantations has previously been investigated, but there are relatively few reports of the genetic characteristic of episomal (non-integrated) BSV genomes isolated from China. Here, the complete genome, a total of 7722bp (GenBank accession number DQ092436), of an isolate of Banana streak virus (BSV) on cultivar Cavendish (BSAcYNV) in Yunnan, China was determined. The genome organises in the typical manner of badnaviruses. The intergenic region of genomic DNA contains a large stem-loop, which may contribute to the ribosome shift into the following open reading frames (ORFs). The coding region of BSAcYNV consists of three overlapping ORFs, ORF1 with a non-AUG start codon and ORF2 encoding two small proteins are individually involved in viral movement and ORF3 encodes a polyprotein. Besides the complete genome, a defective genome lacking the whole RNA leader region and a majority of ORF1 and which encompasses 6525bp was also isolated and sequenced from this BSV DNA reservoir in infected banana plants. Sequence analyses showed that BSAcYNV has closest similarity in terms of genome organization and the coding assignments with an BSV isolate from Vietnam (BSAcVNV). The corresponding coding regions shared identities of 88% and -95% at nucleotide and amino acid levels, respectively. Phylogenetic analysis also indicated BSAcYNV shared the closest geographical evolutionary relationship to BSAcVNV among sequenced banana streak badnaviruses.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).
Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai
2014-12-01
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Eamsobhana, Praphathip; Suana, I. Wayan
2016-01-01
Bactrocera latifrons is a serious pest of solanaceous fruits and Bactrocera umbrosa is a pest of Artocarpus fruits, while Bactrocera melastomatos infests the fruit of Melastomataceae. They are members of the subgenus Bactrocera. We report here the complete mitochondrial genome of these fruit flies determined by next-generation sequencing and their phylogeny with other taxa of the subgenus Bactrocera. The whole mitogenomes of these three species possessed 37 genes namely, 13 protein-coding genes (PCGs), 2 rRNA and 22 tRNA genes. The mitogenome of B. latifrons (15,977 bp) was longer than those of B. melastomatos (15,954 bp) and B. umbrosa (15,898 bp). This difference can be attributed to the size of the intergenic spacers (283 bp in B. latifrons, 261 bp in B. melastomatos, and 211 bp in B. umbrosa). Most of the PCGs in the three species have an identical start codon, except for atp8 (adenosine triphosphate synthase protein 8), which had an ATG instead of GTG in B. umbrosa, whilst the nad3 (NADH dehydrogenase subunit 3) and nad6 (NADH dehydrogenase subunit 6) genes were characterized by an ATC instead of ATT in B. melastomatos. The three species had identical stop codon for the respective PCGs. In B. latifrons and B. melastomatos, the TΨC (thymidine-pseudouridine-cytidine)-loop was absent in trnF (phenylalanine) and DHU (dihydrouracil)-loop was absent in trnS1 (serine S1). In B. umbrosa, trnN (asparagine), trnC (cysteine) and trnF lacked the TψC-loop, while trnS1 lacked the DHU-stem. Molecular phylogeny based on 13 PCGs was in general concordant with 15 mitochondrial genes (13 PCGs and 2 rRNA genes), with B. latifrons and B. umbrosa forming a sister group basal to the other species of the subgenus Bactrocera which was monophyletic. The whole mitogenomes will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26840430
Oxidative stress damages rRNA inside the ribosome and differentially affects the catalytic center
Willi, Jessica; Küpfer, Pascal; Evéquoz, Damien; Fernandez, Guillermo; Polacek, Norbert
2018-01-01
Abstract Intracellular levels of reactive oxygen species (ROS) increase as a consequence of oxidative stress and represent a major source of damage to biomolecules. Due to its high cellular abundance RNA is more frequently the target for oxidative damage than DNA. Nevertheless the functional consequences of damage on stable RNA are poorly understood. Using a genome-wide approach, based on 8-oxo-guanosine immunoprecipitation, we present evidence that the most abundant non-coding RNA in a cell, the ribosomal RNA (rRNA), is target for oxidative nucleobase damage by ROS. Subjecting ribosomes to oxidative stress, we demonstrate that oxidized 23S rRNA inhibits the ribosome during protein biosynthesis. Placing single oxidized nucleobases at specific position within the ribosome's catalytic center by atomic mutagenesis resulted in markedly different functional outcomes. While some active site nucleobases tolerated oxidative damage well, oxidation at others had detrimental effects on protein synthesis by inhibiting different sub-steps of the ribosomal elongation cycle. Our data provide molecular insight into the biological consequences of RNA oxidation in one of the most central cellular enzymes and reveal mechanistic insight on the role of individual active site nucleobases during translation. PMID:29309687
Zhao, Zhongliang; Dammert, Marcel A; Hoppe, Sven; Bierhoff, Holger; Grummt, Ingrid
2016-09-30
Attenuation of ribosome biogenesis in suboptimal growth environments is crucial for cellular homeostasis and genetic integrity. Here, we show that shutdown of rRNA synthesis in response to elevated temperature is brought about by mechanisms that target both the RNA polymerase I (Pol I) transcription machinery and the epigenetic signature of the rDNA promoter. Upon heat shock, the basal transcription factor TIF-IA is inactivated by inhibition of CK2-dependent phosphorylations at Ser170/172. Attenuation of pre-rRNA synthesis in response to heat stress is accompanied by upregulation of PAPAS, a long non-coding RNA (lncRNA) that is transcribed in antisense orientation to pre-rRNA. PAPAS interacts with CHD4, the adenosine triphosphatase subunit of NuRD, leading to deacetylation of histones and movement of the promoter-bound nucleosome into a position that is refractory to transcription initiation. The results exemplify how stress-induced inactivation of TIF-IA and lncRNA-dependent changes of chromatin structure ensure repression of rRNA synthesis in response to thermo-stress. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Bayesian variable selection for post-analytic interrogation of susceptibility loci.
Chen, Siying; Nunez, Sara; Reilly, Muredach P; Foulkes, Andrea S
2017-06-01
Understanding the complex interplay among protein coding genes and regulatory elements requires rigorous interrogation with analytic tools designed for discerning the relative contributions of overlapping genomic regions. To this aim, we offer a novel application of Bayesian variable selection (BVS) for classifying genomic class level associations using existing large meta-analysis summary level resources. This approach is applied using the expectation maximization variable selection (EMVS) algorithm to typed and imputed SNPs across 502 protein coding genes (PCGs) and 220 long intergenic non-coding RNAs (lncRNAs) that overlap 45 known loci for coronary artery disease (CAD) using publicly available Global Lipids Gentics Consortium (GLGC) (Teslovich et al., 2010; Willer et al., 2013) meta-analysis summary statistics for low-density lipoprotein cholesterol (LDL-C). The analysis reveals 33 PCGs and three lncRNAs across 11 loci with >50% posterior probabilities for inclusion in an additive model of association. The findings are consistent with previous reports, while providing some new insight into the architecture of LDL-cholesterol to be investigated further. As genomic taxonomies continue to evolve, additional classes such as enhancer elements and splicing regions, can easily be layered into the proposed analysis framework. Moreover, application of this approach to alternative publicly available meta-analysis resources, or more generally as a post-analytic strategy to further interrogate regions that are identified through single point analysis, is straightforward. All coding examples are implemented in R version 3.2.1 and provided as supplemental material. © 2016, The International Biometric Society.
MXD1 localizes in the nucleolus, binds UBF and impairs rRNA synthesis.
Lafita-Navarro, Maria Del Carmen; Blanco, Rosa; Mata-Garrido, Jorge; Liaño-Pons, Judit; Tapia, Olga; García-Gutiérrez, Lucía; García-Alegría, Eva; Berciano, María T; Lafarga, Miguel; León, Javier
2016-10-25
MXD1 is a protein that interacts with MAX, to form a repressive transcription factor. MXD1-MAX binds E-boxes. MXD1-MAX antagonizes the transcriptional activity of the MYC oncoprotein in most models. It has been reported that MYC overexpression leads to augmented RNA synthesis and ribosome biogenesis, which is a relevant activity in MYC-mediated tumorigenesis. Here we describe that MXD1, but not MYC or MNT, localizes to the nucleolus in a wide array of cell lines derived from different tissues (carcinoma, leukemia) as well as in embryonic stem cells. MXD1 also localizes in the nucleolus of primary tissue cells as neurons and Sertoli cells. The nucleolar localization of MXD1 was confirmed by co-localization with UBF. Co-immunoprecipitation experiments showed that MXD1 interacted with UBF and proximity ligase assays revealed that this interaction takes place in the nucleolus. Furthermore, chromatin immunoprecipitation assays showed that MXD1 was bound in the transcribed rDNA chromatin, where it co-localizes with UBF, but also in the ribosomal intergenic regions. The MXD1 involvement in rRNA synthesis was also suggested by the nucleolar segregation upon rRNA synthesis inhibition by actinomycin D. Silencing of MXD1 with siRNAs resulted in increased synthesis of pre-rRNA while enforced MXD1 expression reduces it. The results suggest a new role for MXD1, which is the control of ribosome biogenesis. This new MXD1 function would be important to curb MYC activity in tumor cells.
MXD1 localizes in the nucleolus, binds UBF and impairs rRNA synthesis
Lafita-Navarro, Maria del Carmen; Blanco, Rosa; Mata-Garrido, Jorge; Liaño-Pons, Judit; Tapia, Olga; García-Gutiérrez, Lucía; García-Alegría, Eva; Berciano, María T.; Lafarga, Miguel; León, Javier
2016-01-01
MXD1 is a protein that interacts with MAX, to form a repressive transcription factor. MXD1-MAX binds E-boxes. MXD1-MAX antagonizes the transcriptional activity of the MYC oncoprotein in most models. It has been reported that MYC overexpression leads to augmented RNA synthesis and ribosome biogenesis, which is a relevant activity in MYC-mediated tumorigenesis. Here we describe that MXD1, but not MYC or MNT, localizes to the nucleolus in a wide array of cell lines derived from different tissues (carcinoma, leukemia) as well as in embryonic stem cells. MXD1 also localizes in the nucleolus of primary tissue cells as neurons and Sertoli cells. The nucleolar localization of MXD1 was confirmed by co-localization with UBF. Co-immunoprecipitation experiments showed that MXD1 interacted with UBF and proximity ligase assays revealed that this interaction takes place in the nucleolus. Furthermore, chromatin immunoprecipitation assays showed that MXD1 was bound in the transcribed rDNA chromatin, where it co-localizes with UBF, but also in the ribosomal intergenic regions. The MXD1 involvement in rRNA synthesis was also suggested by the nucleolar segregation upon rRNA synthesis inhibition by actinomycin D. Silencing of MXD1 with siRNAs resulted in increased synthesis of pre-rRNA while enforced MXD1 expression reduces it. The results suggest a new role for MXD1, which is the control of ribosome biogenesis. This new MXD1 function would be important to curb MYC activity in tumor cells. PMID:27588501
Hobbs, A A; Rosen, J M
1982-01-01
The complete sequences of rat alpha- and gamma-casein mRNAs have been determined. The 1402-nucleotide alpha- and 864-nucleotide gamma-casein mRNAs both encode 15 amino acid signal peptides and mature proteins of 269 and 164 residues, respectively. Considerable homology between the 5' non-coding regions, and the regions encoding the signal peptides and the phosphorylation sites, in these mRNAs as compared to several other rodent casein mRNAs, was observed. Significant homology was also detected between rat alpha- and bovine alpha s1-casein. Comparison of the rodent and bovine sequences suggests that the caseins evolved at about the time of the appearance of the primitive mammals. This may have occurred by intragenic duplication of a nucleotide sequence encoding a primitive phosphorylation site, -(Ser)n-Glu-Glu-, and intergenic duplication resulting in the small casein multigene family. A unique feature of the rat alpha-casein sequence is an insertion in the coding region containing 10 repeated elements of 18 nucleotides each. This insertion appears to have occurred 7-12 million years ago, just prior to the divergence of rat and mouse. Images PMID:6298707
Non-contiguous finished genome sequence and description of Oceanobacillus massiliensis sp. nov.
Roux, Véronique; Million, Matthieu; Robert, Catherine; Magne, Alix; Raoult, Didier
2013-01-01
Oceanobacillus massiliensis strain N’DiopT sp. nov. is the type strain of O. massiliensis sp. nov., a new species within the genus Oceanobacillus. This strain, whose genome is described here, was isolated from the fecal flora of a healthy patient. O. massiliensis is an aerobic rod. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,532,675 bp long genome contains 3,519 protein-coding genes and 72 RNA genes, including between 6 and 8 rRNA operons. PMID:24976893
Kim, Hyoung Tae; Kim, Ki-Joong
2014-01-01
Comparative analyses of complete chloroplast (cp) DNA sequences within a species may provide clues to understand the population dynamics and colonization histories of plant species. Equisetum arvense (Equisetaceae) is a widely distributed fern species in northeastern Asia, Europe, and North America. The complete cp DNA sequences from Asian and American E. arvense individuals were compared in this study. The Asian E. arvense cp genome was 583 bp shorter than that of the American E. arvense. In total, 159 indels were observed between two individuals, most of which were concentrated on the hypervariable trnY-trnE intergenic spacer (IGS) in the large single-copy (LSC) region of the cp genome. This IGS region held a series of 19 bp repeating units. The numbers of the 19 bp repeat unit were responsible for 78% of the total length difference between the two cp genomes. Furthermore, only other closely related species of Equisetum also show the hypervariable nature of the trnY-trnE IGS. By contrast, only a single indel was observed in the gene coding regions: the ycf1 gene showed 24 bp differences between the two continental individuals due to a single tandem-repeat indel. A total of 165 single-nucleotide polymorphisms (SNPs) were recorded between the two cp genomes. Of these, 52 SNPs (31.5%) were distributed in coding regions, 13 SNPs (7.9%) were in introns, and 100 SNPs (60.6%) were in intergenic spacers (IGS). The overall difference between the Asian and American E. arvense cp genomes was 0.12%. Despite the relatively high genetic diversity between Asian and American E. arvense, the two populations are recognized as a single species based on their high morphological similarity. This indicated that the two regional populations have been in morphological stasis. PMID:25157804
Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein–protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches. PMID:27803687
Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein-protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhengqiu, C.; Penaflor, C.; Kuehl, J.V.
2006-06-01
The magnoliids represent the largest basal angiosperm clade with four orders, 19 families and 8,500 species. Although several recent angiosperm molecular phylogenies have supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence has resulted in phylogenies supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. This is one of the most important remaining issues concerning relationships among basal angiosperms. We sequenced the chloroplast genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales),more » and Piper (Piperales), and used these data in combination with 32 other completed angiosperm chloroplast genomes to assess phylogenetic relationships among magnoliids. The Drimys and Piper chloroplast genomes are nearly identical in size at 160,606 and 160,624 bp, respectively. The genomes include a pair of inverted repeats of 26,649 bp (Drimys) and 27,039 (Piper), separated by a small single copy region of 18,621 (Drimys) and 18,878 (Piper) and a large single copy region of 88,685 bp (Drimys) and 87,666 bp (Piper). The gene order of both taxa is nearly identical to many other unrearranged angiosperm chloroplast genomes, including Calycanthus, the other published magnoliid genome. Comparisons of angiosperm chloroplast genomes indicate that GC content is not uniformly distributed across the genome. Overall GC content ranges from 34-39%, and coding regions have a substantially higher GC content than non-coding regions (both intergenic spacers and introns). Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Across the genome, GC content is highest in the inverted repeat due to the presence of rRNA genes and lowest in the small single copy region where most NADH genes are located. Phylogenetic analyses using maximum parsimony and maximum likelihood methods were performed on DNA sequences of 61 protein-coding genes. Trees from both analyses provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. The phylogenies also provided moderate to strong support for the basal position of Amborella, and a sister relationship of magnoliids to a clade that includes monocots and eudicots. The complete sequences of three magnoliid chloroplast genomes provide new data from the largest basal angiosperm clade. Evolutionary comparisons of these new genome sequences, combined with other published angiosperm genome, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.« less
Molecular diversity of the rumen microbiome of Norwegian reindeer on natural summer pasture.
Sundset, Monica A; Edwards, Joan E; Cheng, Yan Fen; Senosiain, Roberto S; Fraile, Maria N; Northwood, Korinne S; Praesteng, Kirsti E; Glad, Trine; Mathiesen, Svein D; Wright, André-Denis G
2009-02-01
The molecular diversity of the rumen microbiome was investigated in five semi-domesticated adult female Norwegian reindeer (Rangifer tarandus tarandus) grazing on natural summer pastures on the coast of northern Norway (71.00 degrees N, 25.30 degrees E). Mean population densities (numbers per gram wet weight) of methanogenic archaea, rumen bacteria and ciliate protozoa, estimated using quantitative real-time polymerase chain reaction (PCR), were 3.17x10(9), 5.17x10(11) and 4.02x10(7), respectively. Molecular diversity of rumen methanogens was revealed using a 16S rRNA gene library (54 clones) constructed using pooled PCR products from the whole rumen contents of the five individual reindeer. Based upon a similarity criterion of <97%, a total of 19 distinct operational taxonomic units (OTUs) were identified, nine of which are potential new species. The 16S rRNA sequences generated from the reindeer rumen exhibited a high degree of sequence similarity to methanogens affiliated with the families Methanobacteriaceae (14 OTUs) and Methanosarcinaceae (one OTU). Four of the OTUs detected belonged to a group of uncultivated archaea previously found in domestic ruminants and thought to be dominant in the rumen together with Methanobrevibacter spp. Denaturing gradient gel electrophoresis profiling of the rumen bacterial 16S rRNA gene and the protozoal 18S rRNA gene indicated a high degree of animal variation, although some bands were common to all individuals. Automated ribosomal intergenic spacer analysis (ARISA) profiling of the ruminal Neocallimastigales population indicated that the reindeer are likely to contain more than one type of anaerobic fungus. The ARISA profile from one animal was distinct from the other four. This is the first molecular investigation of the ruminal methanogenic archaea in reindeer, revealing higher numbers than expected based on methane emission data available. Also, many of the reindeer archaeal 16S rRNA gene sequences were similar to those reported in domesticated ruminants in Australia, Canada, China, New Zealand and Venezuela, supporting previous findings that there seems to be no host type or geographical effect on the methanogenic archaea community structure in ruminants.
Characterization of Treponema spp. isolates from pigs with ear necrosis and shoulder ulcers.
Svartström, Olov; Karlsson, Frida; Fellström, Claes; Pringle, Märit
2013-10-25
Ear necrosis and shoulder ulcers in pigs are animal welfare problems and ethical issues that can cause economic losses for producers. Spirochetes have been observed microscopically in scrapings from pig ulcers since the early 1900s, but have until recently not been cultured and therefore not characterized. In this study, 12 Treponema spp. isolates were acquired from porcine ear necrosis, shoulder ulcers and gingiva. DNA analysis of the 16S rRNA-tRNA(Ile) intergenic spacer region (ISR2) or the 16S rRNA gene revealed relatedness to oral treponemes found in dogs and humans. All isolates except one aligned into two clusters, Treponema pedis and Treponema sp. OMZ 840-like. The 16S rRNA gene of the remaining isolate shared 99% nucleotide identity with Treponema parvum. Genetic fingerprinting of the isolates was performed through random amplification of polymorphic DNA (RAPD). In addition, the isolates were characterized by biochemical tests, including api(®)ZYM, tryptophanase and hippuricase activity, and by testing the antimicrobial susceptibility to tiamulin, valnemulin, tylosin, tylvalosin, lincomycin and doxycycline using broth dilution. All isolates except two showed unique RAPD fingerprints, whereas metabolic activity tests could not differentiate between the isolates. The MICs of all antimicrobial agents tested were low. Copyright © 2013 Elsevier B.V. All rights reserved.
Schmidt-Chanasit, Jonas; Bialonski, Alexandra; Heinemann, Patrick; Ulrich, Rainer G; Günther, Stephan; Rabenau, Holger F; Doerr, Hans Wilhelm
2010-07-01
Recently two different herpes simplex virus type 2 (HSV-2) clades (A and B) were described on DNA sequence data of the glycoprotein E (gE), G (gG) and I (gI) genes. To type the circulating HSV-2 wild-type strains in Germany by a novel approach and to monitor potential changes in the molecular epidemiology between 1997 and 2008. A total of 64 clinical HSV-2 isolates were analyzed by a novel approach using the DNA sequences of the complete open reading frames of glycoprotein B (gB) and gG. Recombination analysis of the gB and gG gene sequences was performed to reveal intragenic recombinants. Based on the phylogenetic analysis of the gB coding DNA sequence 8 of 64 (12%) isolates were classified as clade A strains and 56 of 64 (88%) isolates were classified as clade B strains. Analysis of the gG coding DNA sequence classified 4 (6%) isolates as clade A strains and 60 (94%) isolates as clade B strains. In comparison, the 8 isolates classified as clade A strains using the gB sequence data were classified as clade B strains when using the gG coding DNA sequence, suggesting intergenic recombination events. Intragenic recombination events were not detected. The first molecular survey of clinical HSV-2 isolates from Germany demonstrated the circulation of clade A and B strains and of intergenic recombinants over a period of 12 years. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Rekadwad, Bhagwan N; Khobragade, Chandrahasya N
2016-06-01
Microbiologists are routinely engaged isolation, identification and comparison of isolated bacteria for their novelty. 16S rRNA sequences of Bacillus pumilus were retrieved from NCBI repository and generated QR codes for sequences (FASTA format and full Gene Bank information). 16SrRNA were used to generate quick response (QR) codes of Bacillus pumilus isolated from Lonar Crator Lake (19° 58' N; 76° 31' E), India. Bacillus pumilus 16S rRNA gene sequences were used to generate CGR, FCGR and PCA. These can be used for visual comparison and evaluation respectively. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. This generated digital data helps to evaluate and compare any Bacillus pumilus strain, minimizes laboratory efforts and avoid misinterpretation of the species.
Ou, Jing; Liu, Jin-Bo; Yao, Fu-Jiao; Wang, Xin-Guo; Wei, Zhao-Ming
2016-01-01
Flour beetles of the genus Tribolium are all pests of stored products and cause severe economic losses every year. The American black flour beetle Tribolium audax is one of the important pest species of flour beetle, and it is also an important quarantine insect. Here we sequenced and characterized the complete mitochondrial genome of T. audax, which was intercepted by Huangpu Custom in maize from America. The complete circular mitochondrial genome (mitogenome) of T. audax was 15,924 bp in length, containing 37 typical coding genes and one non-coding AT-rich region. The mitogenome of T. audax exhibits a gene arrangement and content identical to the most common type in insects. All protein coding genes (PCGs) are start with a typical ATN initiation codon, except for the cox1, which use AAC as its start codon instead of ATN. Eleven genes use standard complete termination codon (nine TAA, two TAG), whereas the nad4 and nad5 genes end with single T. Except for trnS1 (AGN), all tRNA genes display typical secondary cloverleaf structures as those of other insects. The sizes of the large and small ribosomal RNA genes are 1288 and 780 bp, respectively. The AT content of the AT-rich region is 81.36%. The 5 bp conserved motif TACTA was found in the intergenic region between trnS2 (UCN) and nad1.
An, Shi-Qi; Febrer, Melanie; McCarthy, Yvonne; Tang, Dong-Jie; Clissold, Leah; Kaithakottil, Gemy; Swarbreck, David; Tang, Ji-Liang; Rogers, Jane; Dow, J Maxwell; Ryan, Robert P
2013-01-01
The bacterium Xanthomonas campestris is an economically important pathogen of many crop species and a model for the study of bacterial phytopathogenesis. In X. campestris, a regulatory system mediated by the signal molecule DSF controls virulence to plants. The synthesis and recognition of the DSF signal depends upon different Rpf proteins. DSF signal generation requires RpfF whereas signal perception and transduction depends upon a system comprising the sensor RpfC and regulator RpfG. Here we have addressed the action and role of Rpf/DSF signalling in phytopathogenesis by high-resolution transcriptional analysis coupled to functional genomics. We detected transcripts for many genes that were unidentified by previous computational analysis of the genome sequence. Novel transcribed regions included intergenic transcripts predicted as coding or non-coding as well as those that were antisense to coding sequences. In total, mutation of rpfF, rpfG and rpfC led to alteration in transcript levels (more than fourfold) of approximately 480 genes. The regulatory influence of RpfF and RpfC demonstrated considerable overlap. Contrary to expectation, the regulatory influence of RpfC and RpfG had limited overlap, indicating complexities of the Rpf signalling system. Importantly, functional analysis revealed over 160 new virulence factors within the group of Rpf-regulated genes. PMID:23617851
Cross-site comparison of ribosomal depletion kits for Illumina RNAseq library construction.
Herbert, Zachary T; Kershner, Jamie P; Butty, Vincent L; Thimmapuram, Jyothi; Choudhari, Sulbha; Alekseyev, Yuriy O; Fan, Jun; Podnar, Jessica W; Wilcox, Edward; Gipson, Jenny; Gillaspy, Allison; Jepsen, Kristen; BonDurant, Sandra Splinter; Morris, Krystalynne; Berkeley, Maura; LeClerc, Ashley; Simpson, Stephen D; Sommerville, Gary; Grimmett, Leslie; Adams, Marie; Levine, Stuart S
2018-03-15
Ribosomal RNA (rRNA) comprises at least 90% of total RNA extracted from mammalian tissue or cell line samples. Informative transcriptional profiling using massively parallel sequencing technologies requires either enrichment of mature poly-adenylated transcripts or targeted depletion of the rRNA fraction. The latter method is of particular interest because it is compatible with degraded samples such as those extracted from FFPE and also captures transcripts that are not poly-adenylated such as some non-coding RNAs. Here we provide a cross-site study that evaluates the performance of ribosomal RNA removal kits from Illumina, Takara/Clontech, Kapa Biosystems, Lexogen, New England Biolabs and Qiagen on intact and degraded RNA samples. We find that all of the kits are capable of performing significant ribosomal depletion, though there are differences in their ease of use. All kits were able to remove ribosomal RNA to below 20% with intact RNA and identify ~ 14,000 protein coding genes from the Universal Human Reference RNA sample at >1FPKM. Analysis of differentially detected genes between kits suggests that transcript length may be a key factor in library production efficiency. These results provide a roadmap for labs on the strengths of each of these methods and how best to utilize them.
Zhai, Bintao; Niu, Qingli; Yang, Jifei; Liu, Zhijie; Liu, Junlong; Yin, Hong; Zeng, Qiaoying
2017-02-01
Lyme disease caused by Borrelia burgdorferi sensu lato (s.l.) is a common disease of domestic animals and wildlife worldwide. Sika deer is first-grade state-protected wildlife animals in China and have economic consequences for humans. It is reported that sika deer may serve as an important reservoir host for several species of B. burgdorferi s.l. and may transmit these species to humans and animals. However, little is known about the presence of Borrelia pathogens in sika deer in China. In this study, the existence and prevalence of Borrelia sp. in sika deer from four regions of Jilin Province in China was assessed. Seventy-one blood samples of sika deer were collected and tested by nested-PCRs based on 16S ribosomal RNA (16S rRNA), outer surface protein A (OspA), flagenllin (fla), and 5S-23S rRNA intergenic spacer (5S-23S rRNA) genes of B. burgdorferi s.l. Six (8.45%) samples were positive for Borrelia sp. based on sequences of 4 genes. The positive samples were detected 18 for 16S rRNA, 10 for OspA, 16 for fla and 6 for 5S-23S, with the positive rates 25.35% (95% CI=3.8-35.6), 14.08% (95% CI=3.0-21.6), 22.54% (95% CI=4.3-36.9) and 8.45% (95% CI=1.7-22.9), respectively. Sequence analysis of the positive PCR products revealed that the partial 4 genes sequences in this study were all most similar to the sequences of B. garinii and B. burgdorferi sensu stricto (s.s.), no other Borrelia genospecies were found. This is the first report of Borrelia pathogens in sika deer in China. The findings in this study indicated that sika deer as potential natural host and may spread Lyme disease pathogen to animals, ticks, and even humans. Copyright © 2016. Published by Elsevier B.V.
Wangkheimayum, Jayalaxmi; Paul, Deepjyoti; Dhar, Debadatta; Nepram, Rajlakshmi; Chetri, Shiela; Bhowmik, Deepshikha; Chakravarty, Atanu
2017-01-01
ABSTRACT The methylation of a ribosomal target leads to a high level of resistance to all clinically relevant aminoglycoside antibiotics, so early detection of these resistance determinants will help to reduce the incidence of treatment failures as well as lessen the dissemination rate. Here, we characterized different 16S rRNA methyltransferases responsible for aminoglycoside resistance and their epidemiological background in clinical isolates of Enterobacteriaceae in a tertiary referral hospital in India. All aminoglycoside-resistant isolates were screened for different 16S rRNA methyltransferases by PCR assay, and incompatibility typing of the conjugable plasmid harboring resistance genes was performed by PCR-based replicon typing. An assay for the stability and elimination of these resistance plasmids was performed. The coexistence of extended-spectrum β-lactamases and metallo-β-lactamases was also detected, and the heterogeneity of these isolates was determined by enterobacterial repetitive intergenic consensus PCR. The PCR assay revealed the presence of armA, rmtA, rmtB, rmtC, and rmtD in single and multiple combinations, and these were carried by a diverse group of Inc plasmids. Plasmids harboring these resistance determinants were highly stable and maintained until the 55th serial passage, but SDS treatment could easily eliminate the plasmids harboring the resistance determinants. The coexistence of blaTEM, blaPER, blaGES, and blaSHV, as well as blaVIM and blaNDM, within these isolates was also detected. Strains with different clonal patterns of aminoglycoside resistance were found to spread in this hospital setting. We observed that the 16S rRNA methyltransferase genes were encoded within different Inc plasmid types, suggesting diverse origins and sources of acquisition. Therefore, the present study is of epidemiological importance and can have a role in infection control policy in hospital settings. PMID:28320725
Litter Breakdown and Microbial Succession on Two Submerged Leaf Species in a Small Forested Stream
Newman, Molli M.; Liles, Mark R.; Feminella, Jack W.
2015-01-01
Microbial succession during leaf breakdown was investigated in a small forested stream in west-central Georgia, USA, using multiple culture-independent techniques. Red maple (Acer rubrum) and water oak (Quercus nigra) leaf litter were incubated in situ for 128 days, and litter breakdown was quantified by ash-free dry mass (AFDM) method and microbial assemblage composition using phospholipid fatty acid analysis (PLFA), ribosomal intergenic spacer analysis (RISA), denaturing gradient gel electrophoresis (DGGE), and bar-coded next-generation sequencing of 16S rRNA gene amplicons. Leaf breakdown was faster for red maple than water oak. PLFA revealed a significant time effect on microbial lipid profiles for both leaf species. Microbial assemblages on maple contained a higher relative abundance of bacterial lipids than oak, and oak microbial assemblages contained higher relative abundance of fungal lipids than maple. RISA showed that incubation time was more important in structuring bacterial assemblages than leaf physicochemistry. DGGE profiles revealed high variability in bacterial assemblages over time, and sequencing of DGGE-resolved amplicons indicated several taxa present on degrading litter. Next-generation sequencing revealed temporal shifts in dominant taxa within the phylum Proteobacteria, whereas γ-Proteobacteria dominated pre-immersion and α- and β-Proteobacteria dominated after 1 month of instream incubation; the latter groups contain taxa that are predicted to be capable of using organic material to fuel further breakdown. Our results suggest that incubation time is more important than leaf species physicochemistry in influencing leaf litter microbial assemblage composition, and indicate the need for investigation into seasonal and temporal dynamics of leaf litter microbial assemblage succession. PMID:26098687
Biophysical Constraints Arising from Compositional Context in Synthetic Gene Networks.
Yeung, Enoch; Dy, Aaron J; Martin, Kyle B; Ng, Andrew H; Del Vecchio, Domitilla; Beck, James L; Collins, James J; Murray, Richard M
2017-07-26
Synthetic gene expression is highly sensitive to intragenic compositional context (promoter structure, spacing regions between promoter and coding sequences, and ribosome binding sites). However, much less is known about the effects of intergenic compositional context (spatial arrangement and orientation of entire genes on DNA) on expression levels in synthetic gene networks. We compare expression of induced genes arranged in convergent, divergent, or tandem orientations. Induction of convergent genes yielded up to 400% higher expression, greater ultrasensitivity, and dynamic range than divergent- or tandem-oriented genes. Orientation affects gene expression whether one or both genes are induced. We postulate that transcriptional interference in divergent and tandem genes, mediated by supercoiling, can explain differences in expression and validate this hypothesis through modeling and in vitro supercoiling relaxation experiments. Treatment with gyrase abrogated intergenic context effects, bringing expression levels within 30% of each other. We rebuilt the toggle switch with convergent genes, taking advantage of supercoiling effects to improve threshold detection and switch stability. Copyright © 2017 Elsevier Inc. All rights reserved.
Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2010-01-01
Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…
Rekadwad, Bhagwan N; Khobragade, Chandrahasya N
2016-03-01
16S rRNA sequences of morphologically and biochemically identified 21 thermophilic bacteria isolated from Unkeshwar hot springs (19°85'N and 78°25'E), Dist. Nanded (India) has been deposited in NCBI repository. The 16S rRNA gene sequences were used to generate QR codes for sequences (FASTA format and full Gene Bank information). Diversity among the isolates is compared with known isolates and evaluated using CGR, FCGR and PCA i.e. visual comparison and evaluation respectively. Considerable biodiversity was observed among the identified bacteria isolated from Unkeshwar hot springs. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/.
Khan, Mohammad S.; Sadat, Syed U.; Jan, Asad; Munir, Iqbal
2017-01-01
Transgenic Brassica napus harboring the synthetic chitinase (NiC) gene exhibits broad-spectrum antifungal resistance. As the rhizosphere microorganisms play an important role in element cycling and nutrient transformation, therefore, biosafety assessment of NiC containing transgenic plants on soil ecosystem is a regulatory requirement. The current study is designed to evaluate the impact of NiC gene on the rhizosphere enzyme activities and microbial community structure. The transgenic lines with the synthetic chitinase gene (NiC) showed resistance to Alternaria brassicicola, a common disease causing fungal pathogen. The rhizosphere enzyme analysis showed no significant difference in the activities of fivesoil enzymes: alkalyine phosphomonoestarase, arylsulphatase, β-glucosidase, urease and sucrase between the transgenic and non-transgenic lines of B. napus varieties, Durr-e-NIFA (DN) and Abasyne-95 (AB-95). However, varietal differences were observed based on the analysis of molecular variance. Some individual enzymes were significantly different in the transgenic lines from those of non-transgenic but the results were not reproducible in the second trail and thus were considered as environmental effect. Genotypic diversity of soil microbes through 16S–23S rRNA intergenic spacer region amplification was conducted to evaluate the potential impact of the transgene. No significant diversity (4% for bacteria and 12% for fungal) between soil microbes of NiC B. napus and the non-transgenic lines was found. However, significant varietal differences were observed between DN and AB-95 with 79% for bacterial and 54% for fungal diversity. We conclude that the NiC B. napus lines may not affect the microbial enzyme activities and community structure of the rhizosphere soil. Varietal differences might be responsible for minor changes in the tested parameters. PMID:28791039
In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome
2013-01-01
Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783
Nyaku, Seloame T; Sripathi, Venkateswara R; Kantety, Ramesh V; Gu, Yong Q; Lawrence, Kathy; Sharma, Govind C
2013-01-01
The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and because of its stable persistence through generations it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in few metazoan organisms. More frequently it has mostly been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene.
Nyaku, Seloame T.; Sripathi, Venkateswara R.; Kantety, Ramesh V.; Gu, Yong Q.; Lawrence, Kathy; Sharma, Govind C.
2013-01-01
The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and because of its stable persistence through generations it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in few metazoan organisms. More frequently it has mostly been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene. PMID:23593343
De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y
1992-01-01
The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
McFrederick, Quinn S; Vuong, Hoang Q; Rothman, Jason A
2018-06-01
Gram-stain-positive, rod-shaped, non-spore forming bacteria have been isolated from flowers and the guts of adult wild bees in the families Megachilidae and Halictidae. Phylogenetic analysis of the 16S rRNA gene indicated that these bacteria belong to the genus Lactobacillus, and are most closely related to the honey-bee associated bacteria Lactobacillus kunkeei (97.0 % sequence similarity) and Lactobacillus apinorum (97.0 % sequence similarity). Phylogenetic analyses of 16S rRNA genes and six single-copy protein coding genes, in situ and in silico DNA-DNA hybridization, and fatty-acid profiling differentiates the newly isolated bacteria as three novel Lactobacillus species: Lactobacillus micheneri sp. nov. with the type strain Hlig3 T (=DSM 104126 T ,=NRRL B-65473 T ), Lactobacillus timberlakei with the type strain HV_12 T (=DSM 104128 T ,=NRRL B-65472 T ), and Lactobacillus quenuiae sp. nov. with the type strain HV_6 T (=DSM 104127 T ,=NRRL B-65474 T ).
Fuentes, Eduardo N; Zuloaga, Rodrigo; Nardocci, Gino; Fernandez de la Reguera, Catalina; Simonet, Nicolas; Fumeron, Robinson; Valdes, Juan Antonio; Molina, Alfredo; Alvarez, Marco
2014-01-01
Ribosomal biogenesis controls cellular growth in living organisms, with the rate-limiting step of this process being the transcription of ribosomal DNA (rDNA). Considering that epigenetic mechanisms allow an organism to respond to environmental changes, the expression in muscle of several molecules that regulate epigenetic rRNA synthesis, as well as rDNA transcription, were evaluated during the seasonal acclimatization of the carp. First, the nucleotide sequences encoding the components forming the NoRC (ttf-I, tip5) and eNoSC (sirt1, nml, suv39h1), two chromatin remodeling complexes that silence rRNA synthesis, as well as the sequence of ubf1, a key regulator of rDNA transcription, were obtained. Subsequently the transcriptional regulation of the aforementioned molecules, and other key molecules involved in rRNA synthesis (mh2a1, mh2a2, h2a.z, h2a.z.7, nuc, p80), was assessed. The carp sequences for TTF-I, TIP5, SIRT1, NML, SUV39H1, and UBF1 showed a high conservation of domains and key amino acids in comparison with other fish and higher vertebrates. The mRNA contents in muscle for ttf-I, tip5, sirt1, nml, suv39h1, mh2a1, mh2a.z, and nuc were up-regulated during winter in comparison with summer, whereas the mRNA levels of mh2a2, ubf1, and p80 were down-regulated. Also, the contents of molecules involved in processing the rRNA (snoRNAs) and pRNA, a stabilizer of NoRC complex, were analyzed, finding that these non-coding RNAs were not affected by seasonal acclimatization. These results suggest that variations in the expression of rRNA and the molecules that epigenetically regulate its synthesis are contributing to the muscle plasticity induced by seasonal acclimatization in carp. Copyright © 2014 Elsevier Inc. All rights reserved.
Free-living and captive turtles and tortoises as carriers of new Chlamydia spp.
Niemczuk, Krzysztof; Zaręba, Kinga; Zając, Magdalena; Laroucau, Karine; Szymańska-Czerwińska, Monika
2017-01-01
A variety of Chlamydia species belonging to the Chlamydiaceae family have been reported in reptilian hosts but scarce data about their occurrence in turtles and tortoises are available. In this study, research was conducted to acquire information on invasive alien species (IAS) of turtles and indigenous turtles and tortoises, living both free and in captivity, as possible reservoirs of Chlamydiaceae. Analysis of specimens (pharyngeal and cloacal swabs and tissues) from 204 turtles and tortoises revealed an overall Chlamydiaceae prevalence of 18.3% and 28.6% among free-living and captive animals respectively, with variable levels of shedding. Further testing conducted with a species-specific real-time PCR and microarray test was unsuccessful. Subsequently sequencing was applied to genotype the Chlamydiaceae-positive samples. Almost the full lengths of the 16S rRNA and ompA genes as well as the 16S-23S intergenic spacer (IGS) and 23S rRNA domain I were obtained for 14, 20 and 8 specimens respectively. Phylogenetic analysis of 16S rRNA amplicons revealed two distinct branches. Group 1 (10 specimens), specific to freshwater turtles and reported here for the first time, was most closely related to Chlamydia (C.) pneumoniae strains and the newly described Candidatus C. sanzinia. Group 2 (four specimens), detected in Testudo spp. samples, showed highest homology to C. pecorum strains but formed a separate sub-branch. Finally, molecular analysis conducted on positive samples together with their geographical distribution in places distant from each other strongly suggest that Group 1 specimens correspond to a new species in the Chlamydiaceae family. In-depth studies of Chlamydia spp. from turtles and tortoises are needed to further characterise these atypical strains and address arising questions about their pathogenicity and zoonotic potential. PMID:28950002
Free-living and captive turtles and tortoises as carriers of new Chlamydia spp.
Mitura, Agata; Niemczuk, Krzysztof; Zaręba, Kinga; Zając, Magdalena; Laroucau, Karine; Szymańska-Czerwińska, Monika
2017-01-01
A variety of Chlamydia species belonging to the Chlamydiaceae family have been reported in reptilian hosts but scarce data about their occurrence in turtles and tortoises are available. In this study, research was conducted to acquire information on invasive alien species (IAS) of turtles and indigenous turtles and tortoises, living both free and in captivity, as possible reservoirs of Chlamydiaceae. Analysis of specimens (pharyngeal and cloacal swabs and tissues) from 204 turtles and tortoises revealed an overall Chlamydiaceae prevalence of 18.3% and 28.6% among free-living and captive animals respectively, with variable levels of shedding. Further testing conducted with a species-specific real-time PCR and microarray test was unsuccessful. Subsequently sequencing was applied to genotype the Chlamydiaceae-positive samples. Almost the full lengths of the 16S rRNA and ompA genes as well as the 16S-23S intergenic spacer (IGS) and 23S rRNA domain I were obtained for 14, 20 and 8 specimens respectively. Phylogenetic analysis of 16S rRNA amplicons revealed two distinct branches. Group 1 (10 specimens), specific to freshwater turtles and reported here for the first time, was most closely related to Chlamydia (C.) pneumoniae strains and the newly described Candidatus C. sanzinia. Group 2 (four specimens), detected in Testudo spp. samples, showed highest homology to C. pecorum strains but formed a separate sub-branch. Finally, molecular analysis conducted on positive samples together with their geographical distribution in places distant from each other strongly suggest that Group 1 specimens correspond to a new species in the Chlamydiaceae family. In-depth studies of Chlamydia spp. from turtles and tortoises are needed to further characterise these atypical strains and address arising questions about their pathogenicity and zoonotic potential.
Liu, Wendy Y Y; Ridgway, Hayley J; James, Trevor K; James, Euan K; Chen, Wen-Ming; Sprent, Janet I; Young, J Peter W; Andrews, Mitchell
2014-10-01
The South African invasive legume Dipogon lignosus (Phaseoleae) produces nodules with both determinate and indeterminate characteristics in New Zealand (NZ) soils. Ten bacterial isolates produced functional nodules on D. lignosus. The 16S ribosomal RNA (rRNA) gene sequences identified one isolate as Bradyrhizobium sp., one isolate as Rhizobium sp. and eight isolates as Burkholderia sp. The Bradyrhizobium sp. and Rhizobium sp. 16S rRNA sequences were identical to those of strains previously isolated from crop plants and may have originated from inocula used on crops. Both 16S rRNA and DNA recombinase A (recA) gene sequences placed the eight Burkholderia isolates separate from previously described Burkholderia rhizobial species. However, the isolates showed a very close relationship to Burkholderia rhizobial strains isolated from South African plants with respect to their nitrogenase iron protein (nifH), N-acyltransferase nodulation protein A (nodA) and N-acetylglucosaminyl transferase nodulation protein C (nodC) gene sequences. Gene sequences and enterobacterial repetitive intergenic consensus (ERIC) PCR and repetitive element palindromic PCR (rep-PCR) banding patterns indicated that the eight Burkholderia isolates separated into five clones of one strain and three of another. One strain was tested and shown to produce functional nodules on a range of South African plants previously reported to be nodulated by Burkholderia tuberum STM678(T) which was isolated from the Cape Region. Thus, evidence is strong that the Burkholderia strains isolated here originated in South Africa and were somehow transported with the plants from their native habitat to NZ. It is possible that the strains are of a new species capable of nodulating legumes.
Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin
2003-06-01
A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon its nucleotide sequence of the SH gene 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and the L protein were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe Am9 A substrains). Copyright 2003 Wiley-Liss, Inc.
A sensible technique to detect mollicutes impurities in human cells cultured in GMP condition.
Ugolotti, Elisabetta; Vanni, Irene
2014-01-01
In therapeutic trials the use of manipulated cell cultures for clinical applications is often required. Mollicutes microorganism contamination of tissue cultures is a major problem because it can determine various and severe alterations in cellular function. Thus methods able to detect and trace cell cultures with Mollicutes contamination are needed in the monitoring of cells grown under good manufacturing practice conditions, and cell lines in continuous culture must be tested at regular intervals. We here describe a multiplex quantitative polymerase chain reaction assay able to detect contaminant Mollicutes species in a single-tube reaction through analysis of 16S-23S rRNA intergenic spacer regions and Tuf and P1 cytoadhesin genes. The method shows a sensitivity, specificity, and robustness comparable with the culture and the indicator cell culture as required by the European Pharmacopoeia guidelines and was validated following International Conference on Harmonization guidelines and Food and Drug Administration requirements.
Sakaridis, I; Soultos, N; Dovas, C I; Papavergou, E; Ambrosiadis, I; Koidis, P
2012-02-01
This study was conducted to isolate psychrotrophic lactic acid bacteria (LAB) from chicken carcasses with inhibitory activity against strains of Salmonella spp. and Listeria monocytogenes. A total of 100 broiler samples were examined for the presence of LAB. Ninety-two LAB isolates that showed antimicrobial effects against Salmonella spp. and L. monocytogenes were further analysed to examine their LAB (Gram-positive, catalase negative, oxidase negative) and psychrotrophic characteristics (ability to grow at 7 °C). Fifty isolates were further selected and identified initially using standard biochemical tests in miniature (Micro-kits API CH 50) and then by sequencing of the 16s-23s rRNA gene boundary region (Intergenic Spacer Region). By molecular identification, these isolates were classified into 5 different LAB species: Lactobacillus salivarius, Lactobacillus reuteri, Lactobacillus johnsonii, Pediococcus acidilactici, and Lactobacillus paralimentarius. None of the isolates produced tyramine or histamine. Copyright © 2011 Elsevier Ltd. All rights reserved.
Non-contiguous finished genome sequence and description of Alistipes timonensis sp. nov.
Lagier, Jean-Christophe; Armougom, Fabrice; Mishra, Ajay Kumar; Nguyen, Thi-Tien; Raoult, Didier; Fournier, Pierre-Edouard
2012-01-01
Alistipes timonensis strain JC136T sp. nov. is the type strain of A. timonensis sp. nov., a new species within the genus Alistipes. This strain, whose genome is described here, was isolated from the fecal flora of a healthy patient. A. timonensis is an obligate anaerobic rod. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,497,779 bp long genome (one chromosome but no plasmid) contains 2,742 protein-coding and 50 RNA genes, including three rRNA genes. PMID:23408657
Rekadwad, Bhagwan N.; Khobragade, Chandrahasya N.
2015-01-01
16S rRNA sequences of morphologically and biochemically identified 21 thermophilic bacteria isolated from Unkeshwar hot springs (19°85′N and 78°25′E), Dist. Nanded (India) has been deposited in NCBI repository. The 16S rRNA gene sequences were used to generate QR codes for sequences (FASTA format and full Gene Bank information). Diversity among the isolates is compared with known isolates and evaluated using CGR, FCGR and PCA i.e. visual comparison and evaluation respectively. Considerable biodiversity was observed among the identified bacteria isolated from Unkeshwar hot springs. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. PMID:26793757
Wu, Qiong; Xiang, Shihao; Ma, Jiali; Hui, Pingping; Wang, Ting; Meng, Wenying; Shi, Min; Wang, Yugang
2018-06-01
Long non-coding RNA (lncRNA) is responsible for a diverse range of cellular functions, such as transcriptional and translational regulation and variance in gene expression. The lncRNA CASC15 (cancer susceptibility candidate 15) is a long intergenic non-coding RNA (lincRNA) locus in chromosome 6p22.3. Previous research shows that lncRNA CASC15 is implicated in the biological behaviors of several cancers such as neuroblastoma and melanoma. Here, we aimed to explore in detail how CASC15 contributes to the growth of gastric cancer (GC). As predicted, the expression of CASC15 was enriched in GC tissues and cell lines as compared with healthy tissues and cells using qRT-PCR. The Kaplan-Meier method was used to demonstrate that high expression of CASC15 is linked to a poor prognosis for patients suffering from GC. Additionally, functional experiments proved that the down- or up-regulation of CASC15 inhibited or facilitated cell proliferation via the induction of cell cycle arrest and apoptosis, and also suppressed or accelerated cell migration and invasion by affecting the progression of the epithelial-to-mesenchymal transition (EMT). In vivo experiments showed that the knockdown of CASC15 lessened the tumor volume and weight and influenced the EMT process. This was confirmed by western blot assays and immunohistochemistry, indicating impaired metastatic ability in nude mice. CASC15 involvement in the tumorigenesis of GC occurs when CASC15 interacts with EZH2 and WDR5 to modulate CDKN1A in nucleus. Additionally, the knockdown of CASC15 triggered the silencing of ZEB1 in cytoplasm, which was shown to be associated with the competitive binding of CASC15 to miR-33a-5p. © 2018 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.
Comprehensive Analysis of Genome Rearrangements in Eight Human Malignant Tumor Tissues
Wang, Chong
2016-01-01
Carcinogenesis is a complex multifactorial, multistage process, but the precise mechanisms are not well understood. In this study, we performed a genome-wide analysis of the copy number variation (CNV), breakpoint region (BPR) and fragile sites in 2,737 tumor samples from eight tumor entities and in 432 normal samples. CNV detection and BPR identification revealed that BPRs tended to accumulate in specific genomic regions in tumor samples whereas being dispersed genome-wide in the normal samples. Hotspots were observed, at which segments with similar alteration in copy number were overlapped along with BPRs adjacently clustered. Evaluation of BPR occurrence frequency showed that at least one was detected in about and more than 15% of samples for each tumor entity while BPRs were maximal in 12% of the normal samples. 127 of 2,716 tumor-relevant BPRs (termed ‘common BPRs’) exhibited also a noticeable occurrence frequency in the normal samples. Colocalization assessment identified 20,077 CNV-affecting genes and 169 of these being known tumor-related genes. The most noteworthy genes are KIAA0513 important for immunologic, synaptic and apoptotic signal pathways, intergenic non-coding RNA RP11-115C21.2 possibly acting as oncogene or tumor suppressor by changing the structure of chromatin, and ADAM32 likely importance in cancer cell proliferation and progression by ectodomain-shedding of diverse growth factors, and the well-known tumor suppressor gene p53. The BPR distributions indicate that CNV mutations are likely non-random in tumor genomes. The marked recurrence of BPRs at specific regions supports common progression mechanisms in tumors. The presence of hotspots together with common BPRs, despite its small group size, imply a relation between fragile sites and cancer-gene alteration. Our data further suggest that both protein-coding and non-coding genes possessing a range of biological functions might play a causative or functional role in tumor biology. This research enhances our understanding of the mechanisms for tumorigenesis and progression. PMID:27391163
Kawaguchi, Risa; Kiryu, Hisanori
2016-05-06
RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .
Dash, Hirak R; Das, Surajit
2016-04-01
Both point and non-point sources increase the pollution status of mercury and increase the population of mercury-resistant marine bacteria (MRMB). They can be targeted as the indicator organism to access marine mercury pollution, besides utilization in bioremediation. Thus, sediment and water samples were collected for 2 years (2010-2012) along Odisha coast of Bay of Bengal, India. Mercury content of the study sites varied from 0.47 to 0.99 ppb irrespective of the seasons of sampling. A strong positive correlation was observed between mercury content and MRMB population (P < 0.05) suggesting the utilization of these bacteria to assess the level of mercury pollution in the marine environment. Seventy-eight percent of the MRMB isolates were under the phylum Firmicutes, and 36 and 31% of them could resist mercury by mer operon-mediated volatilization and mercury biosorption, respectively. In addition, most of the isolates could resist a number of antibiotics and toxic metals. All the MRMB isolates possess the potential of growth and survival at cardinal pH (4-8), temperature (25-37 °C), and salinity (5-35 psu). Enterobacteria repetitive intergenic consensus (ERIC) and repetitive element palindromic PCR (REP-PCR) produced fingerprints corroborating the results of 16S rRNA gene sequencing. Fourier transform infrared (FTIR) spectral analysis also revealed strain-level speciation and phylogenetic relationships.
Epigenetic regulation of TTF-I-mediated promoter–terminator interactions of rRNA genes
Németh, Attila; Guibert, Sylvain; Tiwari, Vijay Kumar; Ohlsson, Rolf; Längst, Gernot
2008-01-01
Ribosomal RNA synthesis is the eukaryotic cell's main transcriptional activity, but little is known about the chromatin domain organization and epigenetics of actively transcribed rRNA genes. Here, we show epigenetic and spatial organization of mouse rRNA genes at the molecular level. TTF-I-binding sites subdivide the rRNA transcription unit into functional chromatin domains and sharply delimit transcription factor occupancy. H2A.Z-containing nucleosomes occupy the spacer promoter next to a newly characterized TTF-I-binding site. The spacer and the promoter proximal TTF-I-binding sites demarcate the enhancer. DNA from both the enhancer and the coding region is hypomethylated in actively transcribed repeats. 3C analysis revealed an interaction between promoter and terminator regions, which brings the beginning and end of active rRNA genes into close contact. Reporter assays show that TTF-I mediates this interaction, thereby linking topology and epigenetic regulation of the rRNA genes. PMID:18354495
Rekadwad, Bhagwan N.; Khobragade, Chandrahasya N.
2016-01-01
Microbiologists are routinely engaged isolation, identification and comparison of isolated bacteria for their novelty. 16S rRNA sequences of Bacillus pumilus were retrieved from NCBI repository and generated QR codes for sequences (FASTA format and full Gene Bank information). 16SrRNA were used to generate quick response (QR) codes of Bacillus pumilus isolated from Lonar Crator Lake (19° 58′ N; 76° 31′ E), India. Bacillus pumilus 16S rRNA gene sequences were used to generate CGR, FCGR and PCA. These can be used for visual comparison and evaluation respectively. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. This generated digital data helps to evaluate and compare any Bacillus pumilus strain, minimizes laboratory efforts and avoid misinterpretation of the species. PMID:27141529
Yoon, Kwang Bae; Kim, Ji Young; Park, Yung Chul
2016-05-01
We describe the characteristics of complete mitogenome of C. brachyotis in this article. The complete mitogenome of C. brachyotis is 16,701 bp long with a total base composition of 32.4% A, 25.7% T, 27.7% C and 14.2% G. The mitogenome consists of 13 protein-coding genes (11,408 bp), (KM659865) two rRNA (12S rRNA and 16S rRNA) genes (2,539 bp), 22 tRNA genes (1518 bp) and one control region (1239 bp).
Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing
2011-05-01
The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
Ye, Yafei; Yang, Shengnan; Han, Yanping; Sun, Jingjing; Xv, Lijuan; Wu, Lina; Wang, Yongfeng; Ming, Liang
2018-06-21
Long intergenic non-coding RNA Linc00472 has been considered as a tumor suppressor in some cancers. However, the function and mechanism of Linc00472 in colorectal cancer has not been well elucidated. In this study, we found that Linc00472 was down-regulated in colorectal cancer tissues and cells. Elevated Linc00472 expression suppressed proliferation and induced apoptosis in colorectal cancer cells. Moreover, Linc00472 acted as a competing endogenous RNA (ceRNA) of miR-196a to release programmed cell death 4 (PDCD4). Furthermore, miR-196a overexpression or PDCD4 knockdown reversed Linc00472-mediated proliferation inhibition and apoptosis induction in colorectal cancer cells. Ectopic Linc00472 expression hindered tumor growth in vivo . Our study demonstrated that Linc00472 suppressed proliferation and induced apoptosis through up-regulating PDCD4 by decoying miR-196a, which may be an effective therapeutic target for colorectal cancer.
Cheewachaiwit, S; Warin, N; Phuangrat, B; Rukpratanporn, S; Gajanandana, O; Balatero, C H; Chatchawankanphanich, O
2017-07-01
Overall, 244 samples of cucurbit crops with yellowing symptoms and selected weed species, from 15 provinces in Thailand, were screened by RT-PCR using primers Polero-CP-F and Polero-CP-R. A total of 160 samples (~66%) were infected by poleroviruses. Analysis of a 1.4 kb region covering the 3' RNA-dependent RNA polymerase (RdRp) gene, the intergenic non-coding region (iNCR), and the coat protein (CP), showed that four poleroviruses, namely, cucurbit aphid-borne yellows virus (CABYV), luffa aphid-borne yellows virus (LABYV), melon aphid-borne yellows virus (MABYV) and suakwa aphid-borne yellows virus (SABYV) were associated with the yellowing symptoms in cucurbit crops. Further analyses indicated presence of putative recombinant viruses referred to as CABYV-R and SABYV-R. CABYV-R was derived from the recombination between MABYV and the common strain of CABYV (CABYV-C). SABYV-R was derived from the recombination of MABYV and SABYV.
How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity
NASA Technical Reports Server (NTRS)
Fox, G. E.; Wisotzkey, J. D.; Jurtshuk, P. Jr
1992-01-01
16S rRNA (genes coding for rRNA) sequence comparisons were conducted with the following three psychrophilic strains: Bacillus globisporus W25T (T = type strain) and Bacillus psychrophilus W16AT, and W5. These strains exhibited more than 99.5% sequence identity and within experimental uncertainty could be regarded as identical. Their close taxonomic relationship was further documented by phenotypic similarities. In contrast, previously published DNA-DNA hybridization results have convincingly established that these strains do not belong to the same species if current standards are used. These results emphasize the important point that effective identity of 16S rRNA sequences is not necessarily a sufficient criterion to guarantee species identity. Thus, although 16S rRNA sequences can be used routinely to distinguish and establish relationships between genera and well-resolved species, very recently diverged species may not be recognizable.
Evidence of birth-and-death evolution of 5S rRNA gene in Channa species (Teleostei, Perciformes).
Barman, Anindya Sundar; Singh, Mamta; Singh, Rajeev Kumar; Lal, Kuldeep Kumar
2016-12-01
In higher eukaryotes, minor rDNA family codes for 5S rRNA that is arranged in tandem arrays and comprises of a highly conserved 120 bp long coding sequence with a variable non-transcribed spacer (NTS). Initially the 5S rDNA repeats are considered to be evolved by the process of concerted evolution. But some recent reports, including teleost fishes suggested that evolution of 5S rDNA repeat does not fit into the concerted evolution model and evolution of 5S rDNA family may be explained by a birth-and-death evolution model. In order to study the mode of evolution of 5S rDNA repeats in Perciformes fish species, nucleotide sequence and molecular organization of five species of genus Channa were analyzed in the present study. Molecular analyses revealed several variants of 5S rDNA repeats (four types of NTS) and networks created by a neighbor net algorithm for each type of sequences (I, II, III and IV) did not show a clear clustering in species specific manner. The stable secondary structure is predicted and upstream and downstream conserved regulatory elements were characterized. Sequence analyses also shown the presence of two putative pseudogenes in Channa marulius. Present study supported that 5S rDNA repeats in genus Channa were evolved under the process of birth-and-death.
Gould, Virginia C; Okazaki, Aki; Howe, Robin A; Avison, Matthew B
2004-08-01
To determine the level of variation in the smeDEF efflux pump and smeT transcriptional regulator genes among three defined 16S rRNA sequence subgroups of clinical Stenotrophomonas maltophilia isolates. smeDEF sequencing used a PCR genome walking approach. Determination of the sequence surrounding smeDEF used a flanking primer PCR method and specific primers anchored in smeD or smeF together with random primers. smeDEF is chromosomal and located in the same position in the chromosome in all three subgroups of isolates. Flanking smeD is a gene, smeT, encoding a putative transcriptional repressor for smeDEF. Variation at these loci among the isolates is considerably lower (up to 10%) than at intrinsic beta-lactamase loci (up to 30%) in the same isolates, implying greater functional constraint. The smeD-smeT intergenic region contains a highly conserved section, which maps with previously predicted promoter/operator regions, and a hypervariable untranslated region, which can be used to subgroup clinical isolates. These data provide further evidence that it is possible to group clinical isolates of the inherently variable species, S. maltophilia, based on genotypic properties. Isolate D457, in which most work concerning smeDEF expression has been performed, does not fall into S. maltophilia subgroup A, which is the most typical.
Nodeomics: Pathogen Detection in Vertebrate Lymph Nodes Using Meta-Transcriptomics
Wittekindt, Nicola E.; Padhi, Abinash; Schuster, Stephan C.; Qi, Ji; Zhao, Fangqing; Tomsho, Lynn P.; Kasson, Lindsay R.; Packard, Michael; Cross, Paul C.; Poss, Mary
2010-01-01
The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus) by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA) sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.
Symonová, Radka; Ocalewicz, Konrad; Kirtiklis, Lech; Delmastro, Giovanni Battista; Pelikánová, Šárka; Garcia, Sonia; Kovařík, Aleš
2017-05-18
Pikes represent an important genus (Esox) harbouring a pre-duplication karyotype (2n = 2x = 50) of economically important salmonid pseudopolyploids. Here, we have characterized the 5S ribosomal RNA genes (rDNA) in Esox lucius and its closely related E. cisalpinus using cytogenetic, molecular and genomic approaches. Intragenomic homogeneity and copy number estimation was carried out using Illumina reads. The higher-order structure of rDNA arrays was investigated by the analysis of long PacBio reads. Position of loci on chromosomes was determined by FISH. DNA methylation was analysed by methylation-sensitive restriction enzymes. The 5S rDNA loci occupy exclusively (peri)centromeric regions on 30-38 acrocentric chromosomes in both E. lucius and E. cisalpinus. The large number of loci is accompanied by extreme amplification of genes (>20,000 copies), which is to the best of our knowledge one of the highest copy number of rRNA genes in animals ever reported. Conserved secondary structures of predicted 5S rRNAs indicate that most of the amplified genes are potentially functional. Only few SNPs were found in genic regions indicating their high homogeneity while intergenic spacers were more heterogeneous and several families were identified. Analysis of 10-30 kb-long molecules sequenced by the PacBio technology (containing about 40% of total 5S rDNA) revealed that the vast majority (96%) of genes are organised in large several kilobase-long blocks. Dispersed genes or short tandems were less common (4%). The adjacent 5S blocks were directly linked, separated by intervening DNA and even inverted. The 5S units differing in the intergenic spacers formed both homogeneous and heterogeneous (mixed) blocks indicating variable degree of homogenisation between the loci. Both E. lucius and E. cisalpinus 5S rDNA was heavily methylated at CG dinucleotides. Extreme amplification of 5S rRNA genes in the Esox genome occurred in the absence of significant pseudogenisation suggesting its recent origin and/or intensive homogenisation processes. The dense methylation of units indicates that powerful epigenetic mechanisms have evolved in this group of fish to silence amplified genes. We discuss how the higher-order repeat structures impact on homogenisation of 5S rDNA in the genome.
Population Genomics of Paramecium Species.
Johri, Parul; Krenek, Sascha; Marinov, Georgi K; Doak, Thomas G; Berendonk, Thomas U; Lynch, Michael
2017-05-01
Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Labbe, Jessy L; Murat, Claude; Morin, Emmanuelle
It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in themore » L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.« less
Mind the gap; seven reasons to close fragmented genome assemblies.
Thomma, Bart P H J; Seidl, Michael F; Shi-Kunne, Xiaoqian; Cook, David E; Bolton, Melvin D; van Kan, Jan A L; Faino, Luigi
2016-05-01
Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publically available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions. Copyright © 2015 Elsevier Inc. All rights reserved.
The Fragmented Mitochondrial Ribosomal RNAs of Plasmodium falciparum
Feagin, Jean E.; Harrell, Maria Isabel; Lee, Jung C.; Coe, Kevin J.; Sands, Bryan H.; Cannone, Jamie J.; Tami, Germaine; Schnare, Murray N.; Gutell, Robin R.
2012-01-01
Background The mitochondrial genome in the human malaria parasite Plasmodium falciparum is most unusual. Over half the genome is composed of the genes for three classic mitochondrial proteins: cytochrome oxidase subunits I and III and apocytochrome b. The remainder encodes numerous small RNAs, ranging in size from 23 to 190 nt. Previous analysis revealed that some of these transcripts have significant sequence identity with highly conserved regions of large and small subunit rRNAs, and can form the expected secondary structures. However, these rRNA fragments are not encoded in linear order; instead, they are intermixed with one another and the protein coding genes, and are coded on both strands of the genome. This unorthodox arrangement hindered the identification of transcripts corresponding to other regions of rRNA that are highly conserved and/or are known to participate directly in protein synthesis. Principal Findings The identification of 14 additional small mitochondrial transcripts from P. falcipaurm and the assignment of 27 small RNAs (12 SSU RNAs totaling 804 nt, 15 LSU RNAs totaling 1233 nt) to specific regions of rRNA are supported by multiple lines of evidence. The regions now represented are highly similar to those of the small but contiguous mitochondrial rRNAs of Caenorhabditis elegans. The P. falciparum rRNA fragments cluster on the interfaces of the two ribosomal subunits in the three-dimensional structure of the ribosome. Significance All of the rRNA fragments are now presumed to have been identified with experimental methods, and nearly all of these have been mapped onto the SSU and LSU rRNAs. Conversely, all regions of the rRNAs that are known to be directly associated with protein synthesis have been identified in the P. falciparum mitochondrial genome and RNA transcripts. The fragmentation of the rRNA in the P. falciparum mitochondrion is the most extreme example of any rRNA fragmentation discovered. PMID:22761677
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bao, Guanhui; University of Chinese Academy of Sciences, Beijing; Dong, Hongjun
Highlights: • Genomes of a butanol tolerant strain and its parent strain were deciphered. • Comparative genomic and proteomic was applied to understand butanol tolerance. • None differentially expressed proteins have mutations in its corresponding genes. • Mutations in ribosome might be responsible for the global difference of proteomics. - Abstract: Clostridium acetobutylicum strain Rh8 is a butanol-tolerant mutant which can tolerate up to 19 g/L butanol, 46% higher than that of its parent strain DSM 1731. We previously performed comparative cytoplasm- and membrane-proteomic analyses to understand the mechanism underlying the improved butanol tolerance of strain Rh8. In this work,more » we further extended this comparison to the genomic level. Compared with the genome of the parent strain DSM 1731, two insertion sites, four deletion sites, and 67 single nucleotide variations (SNVs) are distributed throughout the genome of strain Rh8. Among the 67 SNVs, 16 SNVs are located in the predicted promoters and intergenic regions; while 29 SNVs are located in the coding sequence, affecting a total of 21 proteins involved in transport, cell structure, DNA replication, and protein translation. The remaining 22 SNVs are located in the ribosomal genes, affecting a total of 12 rRNA genes in different operons. Analysis of previous comparative proteomic data indicated that none of the differentially expressed proteins have mutations in its corresponding genes. Rchange Algorithms analysis indicated that the mutations occurred in the ribosomal genes might change the ribosome RNA thermodynamic characteristics, thus affect the translation strength of these proteins. Take together, the improved butanol tolerance of C. acetobutylicum strain Rh8 might be acquired through regulating the translational process to achieve different expression strength of genes involved in butanol tolerance.« less
Complete mitogenome sequencing and phylogenetic analysis of PaLi yak (Bos grunniens).
Bao, Pengjia; Guo, Xian; Pei, Jie; Liang, Chunnian; Ding, Xuezhi; Min, Chu; Wang, Hongbo; Wu, Xiaoyun; Yan, Ping
2016-11-01
PaLi yak is a very important local breed in China; as a year-round grazing animal, it plays a very important role for the economic and native herdsmen. The PaLi yak complete mitochondrial DNA is sequenced in this study, the total length is 16,324 bp, containing 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and a non-coding control region (D-loop region). The order and composition are similar to most of the other vertebrates. The base contents are: 33.72% A, 25.80% C, 13.21% G and 27.27% T; A + T (60.99%) was higher than G + C (39.01%). The phylogenetic relationships were analyzed using the complete mitogenome sequence, results showed that the genetic relationship between yak and cattle is distinct. These information provides useful data for further study on protection of genetic resources and the taxonomy of Bovinae.
Characterization of the complete mitochondrial genome sequence of wild yak (Bos mutus).
Chunnian, Liang; Wu, Xiaoyun; Ding, Xuezhi; Wang, Hongbo; Guo, Xian; Chu, Min; Bao, Pengjia; Yan, Ping
2016-11-01
Wild yak is a special breed in China and it is regarded as an important genetic resource for sustainably developing the animal husbandry in Tibetan area and enriching region's biodiversity. The complete mitochondrial genome of wild yak (16,322 bp in length) displayed 37 typical animal mitochondrial genes and A + T-rich (61.01%), with an overall G + C content of only 38.99%. It contained a non-coding control region (D-loop), 13 protein-coding genes, two rRNA genes, and 22 tRNA genes. Most of the genes have ATG initiation codons, whereas ND2, ND3, and ND5 genes start with ATA and were encoded on H-strand. The gene order of wild yak mitogenome is identical to that observed in most other vertebrates. The complete mitochondrial genome sequence of wild yak reported here could provide valuable information for developing genetic markers and phylogenetic analysis in yak.
Moyo, Lindani; Ramesh, Shunmugiah V; Kappagantu, Madhu; Mitter, Neena; Sathuvalli, Vidyasagar; Pappu, Hanu R
2017-07-17
Potato virus Y (PVY) is one of the most economically important pathogen of potato that is present as biologically distinct strains. The virus-derived small interfering RNAs (vsiRNAs) from potato cv. Russet Burbank individually infected with PVY-N, PVY-NTN and PVY-O strains were recently characterized. Plant defense RNA-silencing mechanisms deployed against viruses produce vsiRNAs to degrade homologous viral transcripts. Based on sequence complementarity, the vsiRNAs can potentially degrade host RNA transcripts raising the prospect of vsiRNAs as pathogenicity determinants in virus-host interactions. This study investigated the global effects of PVY vsiRNAs on the host potato transcriptome. The strain-specific vsiRNAs of PVY, expressed in high copy number, were analyzed in silico for their proclivity to target potato coding and non-coding RNAs using psRobot and psRNATarget algorithms. Functional annotation of target coding transcripts was carried out to predict physiological effects of the vsiRNAs on the potato cv. Russet Burbank. The downregulation of selected target coding transcripts was further validated using qRT-PCR. The vsiRNAs derived from biologically distinct strains of PVY displayed diversity in terms of absolute number, copy number and hotspots for siRNAs on their respective genomes. The vsiRNAs populations were derived with a high frequency from 6 K1, P1 and Hc-Pro for PVY-N, P1, Hc-Pro and P3 for PVY-NTN, and P1, 3' UTR and NIa for PVY-O genomic regions. The number of vsiRNAs that displayed interaction with potato coding transcripts and number of putative coding target transcripts were comparable between PVY-N and PVY-O, and were relatively higher for PVY-NTN. The most abundant target non-coding RNA transcripts for the strain specific PVY-derived vsiRNAs were found to be MIR821, 28S rRNA,18S rRNA, snoR71, tRNA-Met and U5. Functional annotation and qRT-PCR validation suggested that the vsiRNAs target genes involved in plant hormone signaling, genetic information processing, plant-pathogen interactions, plant defense and stress response processes in potato. The findings suggested that the PVY-derived vsiRNAs could act as a pathogenicity determinant and as a counter-defense strategy to host RNA silencing in PVY-potato interactions. The broad range of host genes targeted by PVY vsiRNAs in infected potato suggests a diverse role for vsiRNAs that includes suppression of host stress responses and developmental processes. The interactome scenario is the first report on the interaction between one of the most important Potyvirus genome-derived siRNAs and the potato transcripts.
Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled
Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran
2015-01-01
The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281
Walworth, Nathan; Pfreundt, Ulrike; Nelson, William C.; ...
2015-03-23
Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance because of its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20–25% less than the average for other cyanobacteria and nonpathogenic, free-living bacteria. In this paper, we use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus, both in culture and in situ. Transcriptome analysis indicates that 86% ofmore » the noncoding space is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and twofold higher than that found in the gene-dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium noncoding RNA secondary structures were predicted between most culture and metagenomic sequences, lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Finally, our data suggest that transposition of selfish DNA, low effective population size, and high-fidelity replication allowed the unusual “inflation” of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
Non contiguous-finished genome sequence and description of Enorma timonensis sp. nov.
Ramasamy, Dhamodaran; Dubourg, Gregory; Robert, Catherine; Caputo, Aurelia; Papazian, Laurent; Raoult, Didier; Fournier, Pierre-Edouard
2014-01-01
Enorma timonensis strain GD5T sp. nov., is the type strain of E. timonensis sp. nov., a new member of the genus Enorma within the family Coriobacteriaceae. This strain, whose genome is described here, was isolated from the fecal flora of a 53-year-old woman hospitalized for 3 months in an intensive care unit. E. timonensis is an obligate anaerobic rod. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,365,123 bp long genome (1 chromosome but no plasmid) contains 2,060 protein-coding and 52 RNA genes, including 4 rRNA genes. PMID:25197477
Non contiguous-finished genome sequence and description of Peptoniphilus obesi sp. nov.
Mishra, Ajay Kumar; Hugon, Perrine; Lagier, Jean-Christophe; Nguyen, Thi-Thien; Robert, Catherine; Couderc, Carine; Raoult, Didier
2013-01-01
Peptoniphilus obesi strain ph1T sp. nov., is the type strain of P. obesi sp. nov., a new species within the genus Peptoniphilus. This strain, whose genome is described here, was isolated from the fecal flora of a 26-year-old woman suffering from morbid obesity. P. obesi strain ph1T is a Gram-positive, obligate anaerobic coccus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 1,774,150 bp long genome (1 chromosome but no plasmid) contains 1,689 protein-coding and 29 RNA genes, including 5 rRNA genes. PMID:24019985
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helfenbein, Kevin G.; Brown, Wesley M.; Boore, Jeffrey L.
We have sequenced the complete mitochondrial DNA (mtDNA) of the articulate brachiopod Terebratalia transversa. The circular genome is 14,291 bp in size, relatively small compared to other published metazoan mtDNAs. The 37 genes commonly found in animal mtDNA are present; the size decrease is due to the truncation of several tRNA, rRNA, and protein genes, to some nucleotide overlaps, and to a paucity of non-coding nucleotides. Although the gene arrangement differs radically from those reported for other metazoans, some gene junctions are shared with two other articulate brachiopods, Laqueus rubellus and Terebratulina retusa. All genes in the T. transversa mtDNA,more » unlike those in most metazoan mtDNAs reported, are encoded by the same strand. The A+T content (59.1 percent) is low for a metazoan mtDNA, and there is a high propensity for homopolymer runs and a strong base-compositional strand bias. The coding strand is quite G+T-rich, a skew that is shared by the confamilial (laqueid) specie s L. rubellus, but opposite to that found in T. retusa, a cancellothyridid. These compositional skews are strongly reflected in the codon usage patterns and the amino acid compositions of the mitochondrial proteins, with markedly different usage observed between T. retusa and the two laqueids. This observation, plus the similarity of the laqueid non-coding regions to the reverse complement of the non-coding region of the cancellothyridid, suggest that an inversion that resulted in a reversal in the direction of first-strand replication has occurred in one of the two lineages. In addition to the presence of one non-coding region in T. transversa that is comparable to those in the other brachiopod mtDNAs, there are two others with the potential to form secondary structures; one or both of these may be involved in the process of transcript cleavage.« less
Triatominae-Trypanosoma cruzi/T. rangeli: Vector-parasite interactions.
Vallejo, G A; Guhl, F; Schaub, G A
2009-01-01
Of the currently known 140 species in the family Reduviidae, subfamily Triatominae, those which are most important as vectors of the aetiologic agent of Chagas disease, Trypanosoma cruzi, belong to the tribes Triatomini and Rhodniini. The latter not only transmit T. cruzi but also Trypanosoma rangeli, which is considered apathogenic for the mammalian host but can be pathogenic for the vectors. Using different molecular methods, two main lineages of T. cruzi have been classified, T. cruzi I and T. cruzi II. Within T. cruzi II, five subdivisions are recognized, T. cruzi IIa-IIe, according to the variability of the ribosomal subunits 24Salpha rRNA and 18S rRNA. In T. rangeli, differences in the organization of the kinetoplast DNA separate two forms denoted T. rangeli KP1+ and KP1-, although differences in the intergenic mini-exon gene and of the small subunit rRNA (SSU rRNA) suggest four subpopulations denoted T. rangeli A, B, C and D. The interactions of these subpopulations of the trypanosomes with different species and populations of Triatominae determine the epidemiology of the human-infecting trypanosomes in Latin America. Often, specific subpopulations of the trypanosomes are transmitted by specific vectors in a particular geographic area. Studies centered on trypanosome-triatomine interaction may allow identification of co-evolutionary processes, which, in turn, could consolidate hypotheses of the evolution and the distribution of T. cruzi/T. rangeli-vectors in America, and they may help to identify the mechanisms that either facilitate or impede the transmission of the parasites in different vector species. Such mechanisms seem to involve intestinal bacteria, especially the symbionts which are needed by the triatomines to complete nymphal development and to produce eggs. Development of the symbionts is regulated by the vector. T. cruzi and T. rangeli interfere with this system and induce the production of antibacterial substances. Whereas T. cruzi is only subpathogenic for the insect host, T. rangeli strongly affects species of the genus Rhodnius and this pathogenicity seems based on a reduction of the number of symbionts.
Complete mitochondrial genome of a wild Siberian tiger.
Sun, Yujiao; Lu, Taofeng; Sun, Zhaohui; Guan, Weijun; Liu, Zhensheng; Teng, Liwei; Wang, Shuo; Ma, Yuehui
2015-01-01
In this study, the complete mitochondrial genome of Siberian tiger (Panthera tigris altaica) was sequenced, using muscle tissue obtained from a male wild tiger. The total length of the mitochondrial genome is 16,996 bp. The genome structure of this tiger is in accordance with other Siberian tigers and it contains 12S rRNA gene, 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes, and 1 control region.
VlincRNAs controlled by retroviral elements are a hallmark of pluripotency and cancer.
St Laurent, Georges; Shtokalo, Dmitry; Dong, Biao; Tackett, Michael R; Fan, Xiaoxuan; Lazorthes, Sandra; Nicolas, Estelle; Sang, Nianli; Triche, Timothy J; McCaffrey, Timothy A; Xiao, Weidong; Kapranov, Philipp
2013-07-22
The function of the non-coding portion of the human genome remains one of the most important questions of our time. Its vast complexity is exemplified by the recent identification of an unusual and notable component of the transcriptome - very long intergenic non-coding RNAs, termed vlincRNAs. Here we identify 2,147 vlincRNAs covering 10 percent of our genome. We show they are present not only in cancerous cells, but also in primary cells and normal human tissues, and are controlled by canonical promoters. Furthermore, vlincRNA promoters frequently originate from within endogenous retroviral sequences. Strikingly, the number of vlincRNAs expressed from endogenous retroviral promoters strongly correlates with pluripotency or the degree of malignant transformation. These results suggest a previously unknown connection between the pluripotent state and cancer via retroviral repeat-driven expression of vlincRNAs. Finally, we show that vlincRNAs can be syntenically conserved in humans and mouse and their depletion using RNAi can cause apoptosis in cancerous cells. These intriguing observations suggest that vlincRNAs could create a framework that combines many existing short ESTs and lincRNAs into a landscape of very long transcripts functioning in the regulation of gene expression in the nucleus. Certain types of vlincRNAs participate at specific stages of normal development and, based on analysis of a limited set of cancerous and primary cell lines, they appear to be co-opted by cancer-associated transcriptional programs. This provides additional understanding of transcriptome regulation during the malignant state, and could lead to additional targets and options for its reversal.
Rennick, Linda J; Duprex, W Paul; Rima, Bert K
2007-10-01
Transcription from morbillivirus genomes commences at a single promoter in the 3' non-coding terminus, with the six genes being transcribed sequentially. The 3' and 5' untranslated regions (UTRs) of the genes (mRNA sense), together with the intergenic trinucleotide spacer, comprise the non-coding sequences (NCS) of the virus and contain the conserved gene end and gene start signals, respectively. Bicistronic minigenomes containing transcription units (TUs) encoding autofluorescent reporter proteins separated by measles virus (MV) NCS were used to give a direct estimation of gene expression in single, living cells by assessing the relative amounts of each fluorescent protein in each cell. Initially, five minigenomes containing each of the MV NCS were generated. Assays were developed to determine the amount of each fluorescent protein in cells at both cell population and single-cell levels. This revealed significant variations in gene expression between cells expressing the same NCS-containing minigenome. The minigenome containing the M/F NCS produced significantly lower amounts of fluorescent protein from the second TU (TU2), compared with the other minigenomes. A minigenome with a truncated F 5' UTR had increased expression from TU2. This UTR is 524 nt longer than the other MV 5' UTRs. Insertions into the 5' UTR of the enhanced green fluorescent protein gene in the minigenome containing the N/P NCS showed that specific sequences, rather than just the additional length of F 5' UTR, govern this decreased expression from TU2.
Navarro-Ródenas, Alfonso; Carra, Andrea; Morte, Asunción
2018-01-01
Despite of the integrity of their RNA, some desert truffles present a non-canonical profile of rRNA where 3.3 kb is absent, 1.8 kb is clear and a band of 1.6 kb is observed. A similar rRNA profile was identified in organisms belonging to different life kingdoms, with the exception of the Kingdom Fungi, as a result of a split LSU rRNA called hidden gap . rRNA profiles of desert truffles were analyzed to verify the presence of the non-canonical profile. The RNA of desert truffles and yeast were blotted and hybridized with probes complementary to LSU extremes. RACE of LSU rRNA was carried out to determine the LSU rRNA breakage point. LSU rRNA of desert truffles presents a post-transcriptional cleavage of five nucleotides that generates a hidden gap located in domain D7. LSU splits into two molecules of 1.6 and 1.8 kb. Similar to other organisms, a UAAU tract, downstream of the breakage point, was identified. Phylogenetic comparison suggests that during fungi evolution mutations were introduced in the hypervariable D7 domain, resulting in a sequence that is specifically post-transcriptionally cleaved in some desert truffles.
Anosova, Irina; Melnik, Svitlana; Tripsianes, Konstantinos; Kateb, Fatiha; Grummt, Ingrid; Sattler, Michael
2015-05-26
The chromatin remodeling complex NoRC, comprising the subunits SNF2h and TIP5/BAZ2A, mediates heterochromatin formation at major clusters of repetitive elements, including rRNA genes, centromeres and telomeres. Association with chromatin requires the interaction of the TAM (TIP5/ARBP/MBD) domain of TIP5 with noncoding RNA, which targets NoRC to specific genomic loci. Here, we show that the NMR structure of the TAM domain of TIP5 resembles the fold of the MBD domain, found in methyl-CpG binding proteins. However, the TAM domain exhibits an extended MBD fold with unique C-terminal extensions that constitute a novel surface for RNA binding. Mutation of critical amino acids within this surface abolishes RNA binding in vitro and in vivo. Our results explain the distinct binding specificities of TAM and MBD domains to RNA and methylated DNA, respectively, and reveal structural features for the interaction of NoRC with non-coding RNA. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Fujisawa, Takatomo; Narikawa, Rei; Okamoto, Shinobu; Ehira, Shigeki; Yoshimura, Hidehisa; Suzuki, Iwane; Masuda, Tatsuru; Mochimaru, Mari; Takaichi, Shinichi; Awai, Koichiro; Sekine, Mitsuo; Horikawa, Hiroshi; Yashiro, Isao; Omata, Seiha; Takarada, Hiromi; Katano, Yoko; Kosugi, Hiroki; Tanikawa, Satoshi; Ohmori, Kazuko; Sato, Naoki; Ikeuchi, Masahiko; Fujita, Nobuyuki; Ohmori, Masayuki
2010-01-01
A filamentous non-N2-fixing cyanobacterium, Arthrospira (Spirulina) platensis, is an important organism for industrial applications and as a food supply. Almost the complete genome of A. platensis NIES-39 was determined in this study. The genome structure of A. platensis is estimated to be a single, circular chromosome of 6.8 Mb, based on optical mapping. Annotation of this 6.7 Mb sequence yielded 6630 protein-coding genes as well as two sets of rRNA genes and 40 tRNA genes. Of the protein-coding genes, 78% are similar to those of other organisms; the remaining 22% are currently unknown. A total 612 kb of the genome comprise group II introns, insertion sequences and some repetitive elements. Group I introns are located in a protein-coding region. Abundant restriction-modification systems were determined. Unique features in the gene composition were noted, particularly in a large number of genes for adenylate cyclase and haemolysin-like Ca2+-binding proteins and in chemotaxis proteins. Filament-specific genes were highlighted by comparative genomic analysis. PMID:20203057
Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C
2015-04-01
The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
Nabavi, Reza; Conneely, Brendan; McCarthy, Elaine; Good, Barbara; Shayan, Parviz; DE Waal, Theo
2014-09-01
Accurate identification of sheep nematodes is a critical point in epidemiological studies and monitoring of drug resistance in flocks. However, due to a close morphological similarity between the eggs and larval stages of many of these nematodes, such identification is not a trivial task. There are a number of studies showing that molecular targets in ribosomal DNA (Internal transcribed spacer 1, 2 and Intergenic spacer) are suitable for accurate identification of sheep bursate nematodes. The objective of present study was to compare the ITS1, ITS2 and IGS regions of Iranian common bursate nematodes in order to choose best target for specific identification methods. The first and second internal transcribed spacers (ITS1and ITS2) and intergenic spacer (IGS) of the ribosomal DNA (rDNA) of 5 common Iranian bursate nematodes of sheep were sequenced. The sequences of some non-Iranian isolates were used for comparison in order to evaluate the variation in sequence homology between geographically different nematode populations. Comparison of the ITS1 and ITS2 sequences of Iranian nematodes showed greatest similarity among Teladorsagia circumcincta and Marshallagia marshalli of 94% and 88%, respectively. While Trichostrongylus colubriformis and M. marshalli showed the highest homology (99%) in the IGS sequences. Comparison of the spacer sequences of Iranian with non-Iranian isolates showed significantly higher variation in Haemonchus contortus compared to the other species. Both the ITS1 and ITS2 sequences are convenient targets to have species-specific identification of Iranian bursate nematodes. On the other hand the IGS region may be a less suitable molecular target.
Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T
2017-02-01
To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
Mineralogical Control on Microbial Diversity in a Weathered Granite?
NASA Astrophysics Data System (ADS)
Gleeson, D.; Clipson, N.; McDermott, F.
2003-12-01
Mineral transformation reactions and the behaviour of metals in rock and soils are affected not only by physicochemical parameters but also by biological factors, particularly by microbial activity. Microbes inhabit a wide range of niches in surface and subsurface environments, with mineral-microbe interactions being generally poorly understood. The focus of this study is to elucidate the role of microbial activity in the weathering of common silicate minerals in granitic rocks. A site in the Wicklow Mountains (Ireland) has been identified that consists of an outcrop surface of Caledonian (ca. 400 million years old) pegmatitic granite from which large intact crystals of variably weathered muscovite, plagioclase, K-feldspar and quartz were sampled, together with whole-rock granite. Culture-based microbial approaches have been widely used to profile microbial communities, particularly from copiotrophic environments, but it is now well established that for oligotrophic environments such as those that would be expected on weathering faces, perhaps less than 1% of microbial diversity can be profiled by cultural means. A number of culture-independent molecular based approaches have been developed to profile microbial diversity and community structure. These rely on successfully isolating environmental DNA from a given environment, followed by the use of the polymerase chain reaction (PCR) to amplify the typically small quantities of extracted DNA. Amplified DNA can then be analysed using cloning based approaches as well as community fingerprinting systems such as denaturing gradient gel electrophoresis (DGGE), terminal restriction fragment length polymorphism (TRFLP) and ribosomal intergenic spacer analysis (RISA). Community DNA was extracted and the intergenic spacer region (ITS) between small (16S) and large (23S) bacterial subunit rRNA genes was amplified. RISA fragments were then electrophoresed on a non-denaturing polyacrylamide gel. Banding patterns suggest that the bacterial population in whole rock, which contained approximately 30 separated bands (indicative of the number of bacterial ribotypes), is greater than muscovite (20), K-feldspar (15), and plagioclase feldspar (12) with quartz exhibiting the lowest number (6). These bands were excised from the gel for sequencing, allowing identification of the major populations. An automated approach was also used to assess similarity of bacterial communities present on each sample type, and this allowed for a statistical evaluation of bacterial diversity. Petrographic studies were carried out to assess mineral alteration effects. Scanning electron microscopy (SEM) was used to visualise in-situ bacterial cells.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gopalan, Vinod; Smith, Robert A.; Lam, Alfred K.-Y., E-mail: a.lam@griffith.edu.au
miR-498 is a non-coding RNA located intergenically in 19q13.41. Due to its predicted targeting of several genes involved in control of cellular growth, we examined the expression of miR-498 in colon cancer cell lines and a large cohort of patients with colorectal adenocarcinoma. Two colon cancer cancer cell lines (SW480 and SW48) and one normal colonic epithelial cell line (FHC) were recruited. The expression of miR-498 was tested in these cell lines by using quantitative real-time polymerase chain reaction (qRT-PCR). Tissues from 80 patients with surgical resection of colorectum (60 adenocarcinomas and 20 non-neoplastic tissues) were tested for miR-498 expressionmore » by qRT-PCR. In addition, an exogenous miR-498 (mimic) was used to detect the miRNA's effects on cell proliferation and cell cycle events in SW480 using MTT calorimetric assay and flow cytometry respectively. The colon cancer cell lines showed reduced expression of miR-498 compared to a normal colonic epithelial cell line. Mimic driven over expression of miR-498 in the SW480 cell line resulted in reduced cell proliferation and increased proportions of G2-M phase cells. In tissues, miR-498 expression was too low to be detected in all colorectal adenocarcinoma compared to non-neoplastic tissues. This suggests that the down regulation of miR-498 in colorectal cancer tissues and the direct suppressive cellular effect noted in cancer cell lines implies that miR-498 has some direct or indirect role in the pathogenesis of colorectal adenocarcinomas. - Highlights: • miR-498 is a non-coding RNA located in 19q13.41. • Colon cancer cell lines showed reduced expression of miR-498. • Mimic driven over expression of miR-498 in colon cancer cells resulted in lower cell proliferation. • miR-498 expression was down regulated in all colorectal adenocarcinoma tissues.« less
Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae.
Skelly, P J; Clark-Walker, G D
1990-04-01
Saccharomyces cerevisiae mitochondrial DNA deletion mutants have been used to examine whether base-biased intergenic regions of the genome influence mitochondrial biogenesis. One strain (delta 5.0) lacks a 5-kilobase (kb) segment extending from the proline tRNA gene to the small rRNA gene that includes ori1, while a second strain (delta 3.7) is missing a 3.7-kb region between the genes for ATPase subunit 6 and glutamic acid tRNA that encompasses ori7 plus ori2. Growth of these strains on both fermentable and nonfermentable substrates does not differ from growth of the wild-type strain, indicating that the deletable regions of the genome do not play a direct role in the expression of mitochondrial genes. Examination of whether the 5- or 3.7-kb regions influence mitochondrial DNA transmission was undertaken by crossing strains and examining mitochondrial genotypes in zygotic colonies. In a cross between strain delta 5.0, harboring three active ori elements (ori2, ori3, and ori5), and strain delta 3.7, containing only two active ori elements (ori3 and ori5), there is a preferential recovery of the genome containing two active ori elements (37% of progeny) over that containing three active elements (20%). This unexpected result, suggesting that active ori elements do not influence transmission of respiratory-competent genomes, is interpreted to reflect a preferential conversion of the delta 5.0 genome to the wild type (41% of progeny). Supporting evidence for conversion over biased transmission is shown by preferential recovery of a nonparental genome in the progeny of a heterozygous cross in which both parental molecules can be identified by size polymorphisms.
Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae.
Skelly, P J; Clark-Walker, G D
1990-01-01
Saccharomyces cerevisiae mitochondrial DNA deletion mutants have been used to examine whether base-biased intergenic regions of the genome influence mitochondrial biogenesis. One strain (delta 5.0) lacks a 5-kilobase (kb) segment extending from the proline tRNA gene to the small rRNA gene that includes ori1, while a second strain (delta 3.7) is missing a 3.7-kb region between the genes for ATPase subunit 6 and glutamic acid tRNA that encompasses ori7 plus ori2. Growth of these strains on both fermentable and nonfermentable substrates does not differ from growth of the wild-type strain, indicating that the deletable regions of the genome do not play a direct role in the expression of mitochondrial genes. Examination of whether the 5- or 3.7-kb regions influence mitochondrial DNA transmission was undertaken by crossing strains and examining mitochondrial genotypes in zygotic colonies. In a cross between strain delta 5.0, harboring three active ori elements (ori2, ori3, and ori5), and strain delta 3.7, containing only two active ori elements (ori3 and ori5), there is a preferential recovery of the genome containing two active ori elements (37% of progeny) over that containing three active elements (20%). This unexpected result, suggesting that active ori elements do not influence transmission of respiratory-competent genomes, is interpreted to reflect a preferential conversion of the delta 5.0 genome to the wild type (41% of progeny). Supporting evidence for conversion over biased transmission is shown by preferential recovery of a nonparental genome in the progeny of a heterozygous cross in which both parental molecules can be identified by size polymorphisms. Images PMID:2181277
Dourado, Ana Catarina; Alves, Paula I L; Tenreiro, Tania; Ferreira, Eugénio M; Tenreiro, Rogério; Fareleira, Paula; Crespo, M Teresa Barreto
2009-12-01
A collection of nodule isolates from Medicago polymorpha obtained from southern and central Portugal was evaluated by M13-PCR fingerprinting and hierarchical cluster analysis. Several genomic clusters were obtained which, by 16S rRNA gene sequencing of selected representatives, were shown to be associated with particular taxonomic groups of rhizobia and other soil bacteria. The method provided a clear separation between rhizobia and co-isolated non-symbiotic soil contaminants. Ten M13-PCR groups were assigned to Sinorhizobium (Ensifer) medicae and included all isolates responsible for the formation of nitrogen-fixing nodules upon re-inoculation of M. polymorpha test-plants. In addition, enterobacterial repetitive intergenic consensus (ERIC)-PCR fingerprinting indicated a high genomic heterogeneity within the major M13- PCR clusters of S. medicae isolates. Based on nucleotide sequence data of an M13-PCR amplicon of ca. 1500 bp, observed only in S. medicae isolates and spanning locus Smed_3707 to Smed_3709 from the pSMED01 plasmid sequence of S. medicae WSM419 genome's sequence, a pair of PCR primers was designed and used for direct PCR amplification of a 1399-bp sequence within this fragment. Additional in silico and in vitro experiments, as well as phylogenetic analysis, confirmed the specificity of this primer combination and therefore the reliability of this approach in the prompt identification of S. medicae isolates and their distinction from other soil bacteria.
Genotypic and phenotypic diversity of Alicyclobacillus acidocaldarius isolates.
Félix-Valenzuela, L; Guardiola-Avila, I; Burgara-Estrella, A; Ibarra-Zavala, M; Mata-Haro, V
2015-10-01
The fruit juice industry recognizes Alicyclobacillus as a major quality control target micro-organism. In this study, we analysed 19 bacterial isolates to identify Alicyclobacillus species by polymerase chain reaction (PCR) and sequencing analyses. Phenotypic and genomic diversity among isolates were investigated by API 50CHB system and ERIC-PCR (enterobacterial repetitive intergenic consensus-PCR) respectively. All bacterial isolates were identified as Alicyclobacillus acidocaldarius, and almost all showed identical DNA sequences according to their 16S rRNA (rDNA) gene partial sequences. Only few carbohydrates were fermented by A. acidocaldarius isolates, and there was little variability in the biochemical profile. Genotypic fingerprinting of the A. acidocaldarius isolates showed high diversity, and clusters by ERIC-PCR were distinct to those obtained from the 16S rRNA gene phylogenetic tree. There was no correlation between phenotypic and genotypic variability in the A. acidocaldarius isolates analysed in this study. Detection of Alicyclobacillus strains is imperative in fruit concentrates and juices due to the production of guaiacol. Identification of the genera originates rejection of the product by processing industry. However, not all the Alicyclobacillus species are deteriorative and hence the importance to differentiate among them. In this study, partial 16S ribosomal RNA sequence alignment allowed the differentiation of species. In addition, ERIC-PCR was introduced for the genotypic characterization of Alicyclobacillus, as an alternative for differentiation among isolates from the same species. © 2015 The Society for Applied Microbiology.
Yen, Hung-Kai; Lin, Tsair-Fuh; Tseng, I-Cheng
2012-02-01
Two molecular methods, denaturing gradient gel electrophoresis (DGGE) and quantitative real-time polymerase chain reaction (qPCR) with the Universal ProbeLibrary (UPL) probe, were developed and used for the characterization and quantification of several microcystin producers in Moo-Tan Reservoir (MTR), Taiwan and its associated water treatment plant (Shih-Men Water Treatment Plant, SMWTP). Internal transcribed spacer (ITS) sequence, a highly diversified region between the 16S rRNA and 23S rRNA genes, was used to further identify the isolated strains from MTR and also used in DGGE for the detection of the specific DNA fragments and biomarkers for 11 strains observed in MTR. These ITS-DGGE biomarkers were successfully applied to monitor the community changes of potential toxigenic Microcystis sp. over a period of five years. Two highly specific primers were combined with UPL probes to measure microcystins synthesis gene (mcyB) and phycocyanin intergenic spacer region (cpcB) concentrations in water samples. The copy concentrations of UPL-mcyB and UPL-cpcB correlated well with MC-RR concentrations/water temperature and Microcystis sp. cell numbers in the water samples, respectively. For SMWTP, toxin concentrations were low, but the DGGE bands clearly demonstrated the presence of potential microcystin producers in both water treatment plants and finished water samples. It was demonstrated that toxigenic Microcystis sp. may penetrate through the treatment processes and pose a potential risk to human health in the drinking water systems.
Biogeography of sulfur-oxidizing Acidithiobacillus populations in extremely acidic cave biofilms
Jones, Daniel S; Schaperdoth, Irene; Macalady, Jennifer L
2016-01-01
Extremely acidic (pH 0–1.5) Acidithiobacillus-dominated biofilms known as snottites are found in sulfide-rich caves around the world. Given the extreme geochemistry and subsurface location of the biofilms, we hypothesized that snottite Acidithiobacillus populations would be genetically isolated. We therefore investigated biogeographic relationships among snottite Acidithiobacillus spp. separated by geographic distances ranging from meters to 1000s of kilometers. We determined genetic relationships among the populations using techniques with three levels of resolution: (i) 16S rRNA gene sequencing, (ii) 16S–23S intergenic transcribed spacer (ITS) region sequencing and (iii) multi-locus sequencing typing (MLST). We also used metagenomics to compare functional gene characteristics of select populations. Based on 16S rRNA genes, snottites in Italy and Mexico are dominated by different sulfur-oxidizing Acidithiobacillus spp. Based on ITS sequences, Acidithiobacillus thiooxidans strains from different cave systems in Italy are genetically distinct. Based on MLST of isolates from Italy, genetic distance is positively correlated with geographic distance both among and within caves. However, metagenomics revealed that At. thiooxidans populations from different cave systems in Italy have different sulfur oxidation pathways and potentially other significant differences in metabolic capabilities. In light of those genomic differences, we argue that the observed correlation between genetic and geographic distance among snottite Acidithiobacillus populations is partially explained by an evolutionary model in which separate cave systems were stochastically colonized by different ancestral surface populations, which then continued to diverge and adapt in situ. PMID:27187796
Stancheva, I; Lucchini, R; Koller, T; Sogo, J M
1997-01-01
By using formaldehyde cross-linking of histones to DNA and gel retardation assays we show that formaldehyde fixation, similar to previously established psoralen photocross-linking, discriminates between nucleosome- packed (inactive) and nucleosome-free (active) fractions of ribosomal RNA genes. By both cross-linking techniques we were able to purify fragments from agarose gels, corresponding to coding, enhancer and promoter sequences of rRNA genes, which were further investigated with respect to DNA methylation. This approach allows us to analyse independently and in detail methylation patterns of active and inactive rRNA gene copies by the combination of Hpa II and Msp I restriction enzymes. We found CpG methylation mainly present in enhancer and promoter regions of inactive rRNA gene copies. The methylation of one single Hpa II site, located in the promoter region, showed particularly strong correlation with the transcriptional activity. PMID:9108154
You, Qi; Yan, Hengyu; Liu, Yue; Yi, Xin; Zhang, Kang; Xu, Wenying; Su, Zhen
2017-05-01
The 22-nucleotide non-coding microRNAs (miRNAs) are mostly transcribed by RNA polymerase II and are similar to protein-coding genes. Unlike the clear process from stem-loop precursors to mature miRNAs, the primary transcriptional regulation of miRNA, especially in plants, still needs to be further clarified, including the original transcription start site, functional cis-elements and primary transcript structures. Due to several well-characterized transcription signals in the promoter region, we proposed a systemic approach integrating multidimensional "omics" (including genomics, transcriptomics, and epigenomics) data to improve the genome-wide identification of primary miRNA transcripts. Here, we used the model plant Arabidopsis thaliana to improve the ability to identify candidate promoter locations in intergenic miRNAs and to determine rules for identifying primary transcription start sites of miRNAs by integrating high-throughput omics data, such as the DNase I hypersensitive sites, chromatin immunoprecipitation-sequencing of polymerase II and H3K4me3, as well as high throughput transcriptomic data. As a result, 93% of refined primary transcripts could be confirmed by the primer pairs from a previous study. Cis-element and secondary structure analyses also supported the feasibility of our results. This work will contribute to the primary transcriptional regulatory analysis of miRNAs, and the conserved regulatory pattern may be a suitable miRNA characteristic in other plant species.
First detection of Rickettsia conorii ssp. caspia in Rhipicephalus sanguineus in Zambia.
Chitimia-Dobler, Lidia; Dobler, Gerhard; Schaper, Sabine; Küpper, Thomas; Kattner, Simone; Wölfel, Silke
2017-11-01
Ticks are important vectors for Rickettsia spp. of the spotted fever group all around the world. Rickettsia conorii is the etiological agent of boutonneuse fever in the Mediterranean region and Africa. Tick identification was based on morphological features and further characterized using the 16S rRNA gene. The ticks were individually tested using pan-Rickettsia real-time-PCR for screening, and 23S-5S intergenic spacer region, 16S rDNA, gltA, sca4, ompB, and ompA genes were used to analyze the Rickettsia positive samples. Rickettsia conorii ssp. caspia was detected in tick collected in Zambia for the first time, thus demonstrating the possibility of the occurrence of human disease, namely Astrakhan fever, due to this Rickettsia ssp. in this region of Africa. The prevalence of R. conorii ssp. caspia was 0.06% (one positive tick out of 1465 tested ticks) and 0.07% (one positive tick out of 1254 tested Rh. sanguineus).
Bach, H-J; Jessen, I; Schloter, M; Munch, J C
2003-01-01
Real-time TaqMan-PCR assays were developed for detection, differentiation and absolute quantification of the pathogenic subspecies of Clavibacter michiganensis (Cm) in one single PCR run. The designed primer pair, targeting intergenic sequences of the rRNA operon (ITS) common in all subspecies, was suitable for the amplification of the expected 223-nt DNA fragments of all subspecies. Closely related bacteria were completely discriminated, except of Rathayibacter iranicus, from which weak PCR product bands appeared on agarose gel after 35 PCR cycles. Sufficient specificity of PCR detection was reached by introduction of the additional subspecies specific probes used in TaqMan-PCR. Only Cm species were detected and there was clear differentiation among the subspecies C. michiganensis sepedonicus (Cms), C. michiganensis michiganensis (Cmm), C. michiganensis nebraskensis (Cmn), C. michiganensis insidiosus (Cmi) and C. michiganensis tessellarius (Cmt). The TaqMan assays were optimized to enable a simultaneous quantification of each subspecies. Validity is shown by comparison with cell counts.
Zhan, Siyuan; Dong, Yao; Zhao, Wei; Guo, Jiazhong; Zhong, Tao; Wang, Linjie; Li, Li; Zhang, Hongping
2016-08-22
Long non-coding RNAs (lncRNAs) have been studied extensively over the past few years. Large numbers of lncRNAs have been identified in mouse, rat, and human, and some of them have been shown to play important roles in muscle development and myogenesis. However, there are few reports on the characterization of lncRNAs covering all the development stages of skeletal muscle in livestock. RNA libraries constructed from developing longissimus dorsi muscle of fetal (45, 60, and 105 days of gestation) and postnatal (3 days after birth) goat (Capra hircus) were sequenced. A total of 1,034,049,894 clean reads were generated. Among them, 3981 lncRNA transcripts corresponding to 2739 lncRNA genes were identified, including 3515 intergenic lncRNAs and 466 anti-sense lncRNAs. Notably, in pairwise comparisons between the libraries of skeletal muscle at the different development stages, a total of 577 transcripts were differentially expressed (P < 0.05) which were validated by qPCR using randomly selected six lncRNA genes. The identified goat lncRNAs shared some characteristics, such as fewer exons and shorter length, with the lncRNAs in other mammals. We also found 1153 lncRNAs genes were neighbored 1455 protein-coding genes (<10 kb upstream and downstream) and functionally enriched in transcriptional regulation and development-related processes, indicating they may be in cis-regulatory relationships. Additionally, Pearson's correlation coefficients of co-expression levels suggested 1737 lncRNAs and 19,422 mRNAs were possibly in trans-regulatory relationships (r > 0.95 or r < -0.95). These co-expressed mRNAs were enriched in development-related biological processes such as muscle system processes, regulation of cell growth, muscle cell development, regulation of transcription, and embryonic morphogenesis. This study provides a catalog of goat muscle-related lncRNAs, and will contribute to a fuller understanding of the molecular mechanism underpinning muscle development in mammals.
Computational RNomics of Drosophilids
Rose, Dominic; Hackermüller, Jörg; Washietl, Stefan; Reiche, Kristin; Hertel, Jana; Findeiß, Sven; Stadler, Peter F; Prohaska, Sonja J
2007-01-01
Background Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. Results We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. Conclusion The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals. PMID:17996037
The complete mitochondrial genome sequence of the maned wolf (Chrysocyon brachyurus).
Zhao, Chao; Yang, Xiufeng; Zhang, Honghai; Zhang, Jin; Chen, Lei; Sha, Weilai; Liu, Guangshuai
2016-01-01
In this study, the complete mitochondrial genome of the maned wolf (Chrysocyon brachyurus), the unique species in Chrysocyon, was sequenced and reported for the first time using blood samples obtained from a female individual in Shanghai Zoo, China. Sequence analysis showed that the genome structure was in accordance with other Canidae species and it contained 12 S rRNA gene, 16 S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region.
Li, Dandan; Wang, Yanhong; Zhang, Kun; Jiao, Zhujin; Zhu, Xiaopeng; Skogerboe, Geir; Guo, Xiangqian; Chinnusamy, Viswanathan; Bi, Lijun; Huang, Yongping; Dong, Shuanglin; Chen, Runsheng; Kan, Yunchao
2011-01-01
Accumulating evidences show that small non-protein coding RNAs (ncRNAs) play important roles in development, stress response and other cellular processes. The silkworm is an important model for studies on insect genetics and control of lepidopterous pests. Here, we have performed the first systematic identification and analysis of intermediate size ncRNAs (50–500 nt) in the silkworm. We identified 189 novel ncRNAs, including 141 snoRNAs, six snRNAs, three tRNAs, one SRP and 38 unclassified ncRNAs. Forty ncRNAs showed significantly altered expression during silkworm development or across specific stage transitions. Genomic comparisons revealed that 123 of these ncRNAs are potentially silkworm-specific. Analysis of the genomic organization of the ncRNA loci showed that 32.62% of the novel snoRNA loci are intergenic, and that all the intronic snoRNAs follow the pattern of one-snoRNA-per-intron. Target site analysis predicted a total of 95 2′-O-methylation and pseudouridylation modification sites of rRNAs, snRNAs and tRNAs. Together, these findings provide new clues for future functional study of ncRNA during insect development and evolution. PMID:21227919
2013-01-01
Background Bat trypanosomes have been implicated in the evolutionary history of the T. cruzi clade, which comprises species from a wide geographic and host range in South America, Africa and Europe, including bat-restricted species and the generalist agents of human American trypanosomosis T. cruzi and T. rangeli. Methods Trypanosomes from bats (Rhinolophus landeri and Hipposideros caffer) captured in Mozambique, southeast Africa, were isolated by hemoculture. Barcoding was carried out through the V7V8 region of Small Subunit (SSU) rRNA and Fluorescent Fragment Length barcoding (FFLB). Phylogenetic inferences were based on SSU rRNA, glyceraldehyde phosphate dehydrogenase (gGAPDH) and Spliced Leader (SL) genes. Morphological characterization included light, scanning and transmission electron microscopy. Results New trypanosomes from bats clustered together forming a clade basal to a larger assemblage called the T. cruzi clade. Barcoding, phylogenetic analyses and genetic distances based on SSU rRNA and gGAPDH supported these trypanosomes as a new species, which we named Trypanosoma livingstonei n. sp. The large and highly polymorphic SL gene repeats of this species showed a copy of the 5S ribosomal RNA into the intergenic region. Unique morphological (large and broad blood trypomastigotes compatible to species of the subgenus Megatrypanum and cultures showing highly pleomorphic epimastigotes and long and slender trypomastigotes) and ultrastructural (cytostome and reservosomes) features and growth behaviour (when co-cultivated with HeLa cells at 37°C differentiated into trypomastigotes resembling the blood forms and do not invaded the cells) complemented the description of this species. Conclusion Phylogenetic inferences supported the hypothesis that Trypanosoma livingstonei n. sp. diverged from a common ancestral bat trypanosome that evolved exclusively in Chiroptera or switched at independent opportunities to mammals of several orders forming the clade T. cruzi, hence, providing further support for the bat seeding hypothesis to explain the origin of T. cruzi and T. rangeli. PMID:23915781
Wang, He; Xiao, Meng; Kong, Fanrong; Chen, Sharon; Dou, Hong-Tao; Sorrell, Tania; Li, Ruo-Yu; Xu, Ying-Chun
2011-01-01
Eleven reference and 25 clinical isolates of Fusarium were subject to multilocus DNA sequence analysis to determine the species and haplotypes of the fusarial isolates from Beijing and Shandong, China. Seven loci were analyzed: the translation elongation factor 1 alpha gene (EF-1α); the nuclear rRNA internal transcribed spacer (ITS), large subunit (LSU), and intergenic spacer (IGS) regions; the second largest subunit of the RNA polymerase gene (RPB2); the calmodulin gene (CAM); and the mitochondrial small subunit (mtSSU) rRNA gene. We also evaluated an IGS-targeted PCR/reverse line blot (RLB) assay for species/haplotype identification of Fusarium. Twenty Fusarium species and seven species complexes were identified. Of 25 clinical isolates (10 species), the Gibberella (Fusarium) fujikuroi species complex was the commonest (40%) and was followed by the Fusarium solani species complex (FSSC) (36%) and the F. incarnatum-F. equiseti species complex (12%). Six FSSC isolates were identified to the species level as FSSC-3+4, and three as FSSC-5. Twenty-nine IGS, 27 EF-1α, 26 RPB2, 24 CAM, 18 ITS, 19 LSU, and 18 mtSSU haplotypes were identified; 29 were unique, and haplotypes for 24 clinical strains were novel. By parsimony informative character analysis, the IGS locus was the most phylogenetically informative, and the rRNA gene regions were the least. Results by RLB were concordant with multilocus sequence analysis for all isolates. Amphotericin B was the most active drug against all species. Voriconazole MICs were high (>8 μg/ml) for 15 (42%) isolates, including FSSC. Analysis of larger numbers of isolates is required to determine the clinical utility of the seven-locus sequence analysis and RLB assay in species classification of fusaria. PMID:21389150
A deep learning method for lincRNA detection using auto-encoder algorithm.
Yu, Ning; Yu, Zeng; Pan, Yi
2017-12-06
RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
Battistelli, C; Cicchini, C; Santangelo, L; Tramontano, A; Grassi, L; Gonzalez, F J; de Nonno, V; Grassi, G; Amicone, L; Tripodi, M
2017-01-01
The transcription factor Snail is a master regulator of cellular identity and epithelial-to-mesenchymal transition (EMT) directly repressing a broad repertoire of epithelial genes. How chromatin modifiers instrumental to its activity are recruited to Snail-specific binding sites is unclear. Here we report that the long non-coding RNA (lncRNA) HOTAIR (for HOX Transcript Antisense Intergenic RNA) mediates a physical interaction between Snail and enhancer of zeste homolog 2 (EZH2), an enzymatic subunit of the polycomb-repressive complex 2 and the main writer of chromatin-repressive marks. The Snail-repressive activity, here monitored on genes with a pivotal function in epithelial and hepatic morphogenesis, differentiation and cell-type identity, depends on the formation of a tripartite Snail/HOTAIR/EZH2 complex. These results demonstrate an lncRNA-mediated mechanism by which a transcriptional factor conveys a general chromatin modifier to specific genes, thereby allowing the execution of hepatocyte transdifferentiation; moreover, they highlight HOTAIR as a crucial player in the Snail-mediated EMT. PMID:27452518
Yamaguchi, Kosuke; Hada, Masashi; Fukuda, Yuko; Inoue, Erina; Makino, Yoshinori; Katou, Yuki; Shirahige, Katsuhiko; Okada, Yuki
2018-06-26
The question of whether retained histones in the sperm genome localize to gene-coding regions or gene deserts has been debated for years. Previous contradictory observations are likely caused by the non-uniform sensitivity of sperm chromatin to micrococcal nuclease (MNase) digestion. Sperm chromatin has a highly condensed but heterogeneous structure and is composed of 90%∼99% protamines and 1%∼10% histones. In this study, we utilized nucleoplasmin (NPM) to improve the solubility of sperm chromatin by removing protamines in vitro. NPM treatment efficiently solubilized histones while maintaining quality and quantity. Chromatin immunoprecipitation sequencing (ChIP-seq) analyses using NPM-treated sperm demonstrated the predominant localization of H4 to distal intergenic regions, whereas modified histones exhibited a modification-dependent preferential enrichment in specific genomic elements, such as H3K4me3 at CpG-rich promoters and H3K9me3 in satellite repeats, respectively, implying the existence of machinery protecting modified histones from eviction. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Das, Partha Pratim; Hendrix, David A.; Apostolou, Effie; Buchner, Alice H.; Canver, Matthew C.; Beyaz, Semir; Ljuboja, Damir; Kuintzle, Rachael; Kim, Woojin; Karnik, Rahul; Shao, Zhen; Xie, Huafeng; Xu, Jian; De Los Angeles, Alejandro; Zhang, Yingying; Choe, Junho; Jun, Don Leong Jia; Shen, Xiaohua; Gregory, Richard I.; Daley, George Q.; Meissner, Alexander; Kellis, Manolis; Hochedlinger, Konrad; Kim, Jonghwan; Orkin, Stuart H.
2017-01-01
SUMMARY Polycomb Repressive Complex 2 (PRC2) function and DNA methylation (DNAme) are typically correlated with the gene repression. Here, we show that PRC2 is required to maintain expression of maternal microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) from the Gtl2-Rian-Mirg locus, which is essential for full pluripotency of iPSCs. In the absence of PRC2 the entire locus becomes transcriptionally repressed due to gain of DNA methylation at the intergenic differentially methylated regions (IG-DMR). Furthermore, we demonstrate that the IG-DMR serves as an enhancer of the maternal Gtl2-Rian-Mirg locus. Mechanistic study reveals that PRC2 interacts physically with Dnmt3 methyltransferases and prevents their recruitment and subsequent DNAme at the IG-DMR, thereby allowing for proper expression of the maternal Gtl2-Rian-Mirg locus. Our observations provide a novel mechanism by which PRC2 counteracts the action of Dnmt3 methyltransferases at an imprinted locus required for full pluripotency. PMID:26299972
Evidence for regulation of columnar habit in apple by a putative 2OG-Fe(II) oxygenase.
Wolters, Pieter J; Schouten, Henk J; Velasco, Riccardo; Si-Ammour, Azeddine; Baldi, Paolo
2013-12-01
Understanding the genetic mechanisms controlling columnar-type growth in the apple mutant 'Wijcik' will provide insights on how tree architecture and growth are regulated in fruit trees. In apple, columnar-type growth is controlled by a single major gene at the Columnar (Co) locus. By comparing the genomic sequence of the Co region of 'Wijcik' with its wild-type 'McIntosh', a novel non-coding DNA element of 1956 bp specific to Pyreae was found to be inserted in an intergenic region of 'Wijcik'. Expression analysis of selected genes located in the vicinity of the insertion revealed the upregulation of the MdCo31 gene encoding a putative 2OG-Fe(II) oxygenase in axillary buds of 'Wijcik'. Constitutive expression of MdCo31 in Arabidopsis thaliana resulted in compact plants with shortened floral internodes, a phenotype reminiscent of the one observed in columnar apple trees. We conclude that MdCo31 is a strong candidate gene for the control of columnar growth in 'Wijcik'. No claim to original European Union works. New Phytologist © 2013 New Phytologist Trust.
Draft Genome Sequence of the Deinococcus-Thermus Bacterium Meiothermus ruber Strain A
Thiel, Vera; Tomsho, Lynn P.; Burhans, Richard; ...
2015-03-26
The draft genome sequence of the Deinococcus-Thermus group bacterium Meiothermus ruber strain A, isolated from a cyanobacterial enrichment culture obtained from Octopus Spring (Yellowstone National Park, WY), comprises 2,968,099 bp in 170 contigs. It is predicted to contain 2,895 protein-coding genes, 44 tRNA-coding genes, and 2 rRNA operons.
Draft Genome Sequence of Staphylococcus cohnii subsp. urealyticus Isolated from a Healthy Dog
Wigmore, Sarah M.; Wareham, David W.
2017-01-01
ABSTRACT Staphylococcus cohnii subsp. urealyticus strain SW120 was isolated from the ear swab of a healthy dog. The isolate is resistant to methicillin and fusidic acid. The SW120 draft genome is 2,805,064 bp and contains 2,667 coding sequences, including 58 tRNAs and nine complete rRNA coding regions. PMID:28209829
Salvato, Paola; Simonato, Mauro; Battisti, Andrea; Negrisolo, Enrico
2008-01-01
Background Knowledge of animal mitochondrial genomes is very important to understand their molecular evolution as well as for phylogenetic and population genetic studies. The Lepidoptera encompasses more than 160,000 described species and is one of the largest insect orders. To date only nine lepidopteran mitochondrial DNAs have been fully and two others partly sequenced. Furthermore the taxon sampling is very scant. Thus advance of lepidopteran mitogenomics deeply requires new genomes derived from a broad taxon sampling. In present work we describe the mitochondrial genome of the moth Ochrogaster lunifer. Results The mitochondrial genome of O. lunifer is a circular molecule 15593 bp long. It includes the entire set of 37 genes usually present in animal mitochondrial genomes. It contains also 7 intergenic spacers. The gene order of the newly sequenced genome is that typical for Lepidoptera and differs from the insect ancestral type for the placement of trnM. The 77.84% A+T content of its α strand is the lowest among known lepidopteran genomes. The mitochondrial genome of O. lunifer exhibits one of the most marked C-skew among available insect Pterygota genomes. The protein-coding genes have typical mitochondrial start codons except for cox1 that present an unusual CGA. The O. lunifer genome exhibits the less biased synonymous codon usage among lepidopterans. Comparative genomics analysis study identified atp6, cox1, cox2 as cox3, cob, nad1, nad2, nad4, and nad5 as potential markers for population genetics/phylogenetics studies. A peculiar feature of O. lunifer mitochondrial genome it that the intergenic spacers are mostly made by repetitive sequences. Conclusion The mitochondrial genome of O. lunifer is the first representative of superfamily Noctuoidea that account for about 40% of all described Lepidoptera. New genome shares many features with other known lepidopteran genomes. It differs however for its low A+T content and marked C-skew. Compared to other lepidopteran genomes it is less biased in synonymous codon usage. Comparative evolutionary analysis of lepidopteran mitochondrial genomes allowed the identification of previously neglected coding genes as potential phylogenetic markers. Presence of repetitive elements in intergenic spacers of O. lunifer genome supports the role of DNA slippage as possible mechanism to produce spacers during replication. PMID:18627592
Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J
1989-01-01
Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 gene, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as an heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region did hardly confer transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes, could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well. Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of the L46 gene. We conclude that all promoter elements for the S24 gene are located within the intergenic region, where the RPG-boxes are the most likely UAS-elements. However, the intergenic region (including the RPG-boxes) is required but not sufficient to confer transcription activity on the L46 gene. Images PMID:2602141
Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J
1989-12-11
Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 gene, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as an heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region did hardly confer transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes, could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well. Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of the L46 gene. We conclude that all promoter elements for the S24 gene are located within the intergenic region, where the RPG-boxes are the most likely UAS-elements. However, the intergenic region (including the RPG-boxes) is required but not sufficient to confer transcription activity on the L46 gene.
Gibson, Joshua D; Hunt, Greg J
2016-01-01
The complete mitochondrial genome from an Africanized honey bee population (AHB, derived from Apis mellifera scutellata) was assembled and analyzed. The mitogenome is 16,411 bp long and contains the same gene repertoire and gene order as the European honey bee (13 protein coding genes, 22 tRNA genes and 2 rRNA genes). ND4 appears to use an alternate start codon and the long rRNA gene is 48 bp shorter in AHB due to a deletion in a terminal AT dinucleotide repeat. The dihydrouracil arm is missing from tRNA-Ser (AGN) and tRNA-Glu is missing the TV loop. The A + T content is comparable to the European honey bee (84.7%), which increases to 95% for the 3rd position in the protein coding genes.
Zhang, Guoyun; Duan, Aiguo; Zhang, Jianguo; He, Caiyun
2017-01-05
Long non-coding RNAs (lncRNAs), which are >200nt longer transcripts, potentially play important roles in almost all biological processes in plants and mammals. However, the functions and profiles of lncRNAs in fruit is less understood. Therefore, it is urgent and necessary to identify and analyze the functions of lncRNAs in sea buckthorns. Using RNA-sequencing, we synthetically identified lncRNAs in mature fruit from the red and yellow sea buckthorn. We obtained 567,778,938 clean reads from six samples and identified 3428 lncRNAs in mature fruit, including 2498 intergenic lncRNAs, 593 anti-sense lncRNAs, and 337 intronic lncRNAs. We also identified 3819 and 2295 circular RNAs in red and yellow sea buckthorn Fruit. In the aspects of gene architecture and expression, our results showed significant differences among the three lncRNA subtypes. We also investigated the effect of lncRNAs on its cis and trans target genes. Based on target genes analysis, we obtained 61 different expression lncRNAs (DE-lncRNAs) between these two sea buckthorns, including 23 special expression lncRNAs in red fruit and 22 special expression lncRNAs in yellow fruit. Importantly, we found a few DE-lncRNAs play cis and trans roles for genes in the Carotenoid biosynthesis, ascorbate and aldarate metabolism and fatty acid metabolism pathways. Our study provides a resource for lncRNA studies in mature fruit. It probably encourages researchers to deeply study fruit-coloring. It expands our knowledge about lncRNA biology and the annotation of the sea buckthorn genome. Copyright © 2016 Elsevier B.V. All rights reserved.
Wang, Jun; Lei, Zeng-jie; Guo, Yan; Wang, Tao; Qin, Zhong-yi; Xiao, Hua-liang; Fan, Li-lin; Chen, Dong-feng; Bian, Xiu-wu; Liu, Jia; Wang, Bin
2015-11-10
Cancer stem cells (CSCs) are key cellular targets for effective cancer therapy, due to their critical roles in cancer progression and chemo/radio-resistance. Emerging evidence demonstrates that long non-coding RNAs (lncRNAs) are important players in the biology of cancers. However, it remains unknown whether lncRNAs could be exploited to target CSCs. We report that large intergenic non-coding RNA p21 (lincRNA-p21) is a potent suppressor of stem-like traits of CSCs purified from both primary colorectal cancer (CRC) tissues and cell lines. A novel lincRNA-p21-expressing adenoviral vector, which was armed with miRNA responsive element (MRE) of miR-451 (Ad-lnc-p21-MRE), was generated to eliminate CRC CSCs. Integration of miR-451 MREs into the adenovirus efficiently delivered lincRNA-p21 into CSCs that contained low levels of miR-451. Moreover, lincRNA-p21 inhibited the activity of β-catenin signaling, thereby attenuating the viability, self-renewal, and glycolysis of CSCs in vitro. By limiting dilution and serial tumor formation assay, we demonstrated that Ad-lnc-p21-MRE significantly suppressed the self-renewal potential and tumorigenicity of CSCs in nude mice. Importantly, application of miR-451 MREs appeared to protect normal liver cells from off-target expression of lincRNA-p21 in both tumor-bearing and naïve mice. Taken together, these findings suggest that lncRNAs may be promising therapeutic molecules to eradicate CSCs and MREs of tumor-suppressor miRNAs, such as miR-451, may be exploited to ensure the specificity of CSC-targeting strategies.
Liu, Guo; Zhang, Wenhao
2018-06-11
Excessive exposure to ultraviolet (UV) rays can cause damage of the skin and may induce cancer, immunosuppression, photoaging, and inflammation. The long non-coding RNA (lncRNA) HOX antisense intergenic RNA (HOTAIR) is involved in multiple human biological processes. However, its role in UVB-induced keratinocyte injury is unclear. This study was performed to investigate the effects of HOTAIR in UVB-induced apoptosis and inflammatory injury in human keratinocytes (HaCaT cells). Quantitative real-time polymerase chain reaction was performed to analyze the expression levels of HOTAIR, PKR, TNF-α, and IL-6. Cell viability was measured using trypan blue exclusion method and cell apoptosis using flow cytometry and western blot. ELISA was used to measure the concentrations of TNF-α and IL-6. Western blot was used to measure the expression of PKR, apoptosis-related proteins, and PI3K/AKT and NF-κB pathway proteins. UVB induced HaCaT cell injury by inhibiting cell viability and promoting cell apoptosis and expressions of IL-6 and TNF-α. UVB also promoted the expression of HOTAIR. HOTAIR suppression increased cell viability and decreased apoptosis and expression of inflammatory factors in UVB-treated cells. HOTAIR also promoted the expression of PKR. Overexpression of HOTAIR decreased cell viability and increased cell apoptosis and expression of inflammatory factors in UVB-treated cells by upregulating PKR. Overexpression of PKR decreased cell viability and promoted cell apoptosis in UVB-treated cells. Overexpression of PKR activated PI3K/AKT and NF-κB pathways. Our findings identified an essential role of HOTAIR in promoting UVB-induced apoptosis and inflammatory injury by up-regulating PKR in keratinocytes.
Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Chan, Kok-Gan; Chow, Wan-Loo; Eamsobhana, Praphathip
2015-01-01
The whole mitochondrial genome of the pest fruit fly Bactrocera arecae was obtained from next-generation sequencing of genomic DNA. It had a total length of 15,900 bp, consisting of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The control region (952 bp) was flanked by rrnS and trnI genes. The start codons included 6 ATG, 3 ATT and 1 each of ATA, ATC, GTG and TCG. Eight TAA, two TAG, one incomplete TA and two incomplete T stop codons were represented in the protein-coding genes. The cloverleaf structure for trnS1 lacked the D-loop, and that of trnN and trnF lacked the TΨC-loop. Molecular phylogeny based on 13 protein-coding genes was concordant with 37 mitochondrial genes, with B. arecae having closest genetic affinity to B. tryoni. The subgenus Bactrocera of Dacini tribe and the Dacinae subfamily (Dacini and Ceratitidini tribes) were monophyletic. The whole mitogenome of B. arecae will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26472633
Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan
2015-07-01
In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.
Li, Penggao; Yang, Chun; Yue, Rong; Zhen, Yaping; Zhuo, Qin; Piao, Jianhua; Yang, Xiaoguang; Xiao, Rong
2018-01-17
This study investigated the composition and proportions of fecal microbiota in Sprague-Dawley rats after consuming two genetically modified (GM) corn lines in comparison with the isogenic corn and the AIN93G standard feed for 10 weeks using bar-coded 16S rRNA gene sequencing. As a result, GM corn did not significantly alter the overall health and alpha-diversity of fecal microbiota. Fecal microbiota structures could be separated into noncorn and corn but not non-GM and GM corn subgroups. Both non-GM and GM corn caused the increase in bacterial populations related to carbohydrates utilization, such as Lactobacillus, Barnesiella, and Bifidobacterium, and the reduction in potentially pathogenic populations, such as Tannerella and Moraxellaceae. In conclusion, similar effects on the fecal microbiota were observed after consuming a GM- and non-GM-corn-based diet for long periods. Further studies are warranted to elucidate the functional relevance of the changes in the proportions of bacterial populations in these diets.
Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).
Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo
2013-12-01
The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae that has long been studied as a family of model insects in a diverse field. In this study, we describe the complete mitochondrial genome (mitogenome) sequences of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeat, three identical 5-bp-long tandem repeat, and six nearly identical 5-6 bp long repeat sequences.
The complete mitochondrial genome of Gobiobotia filifer (Teleostei, Cypriniformes: Cyprinidae).
Li, Qiang; Liu, Ya; Zhou, Jian; Gong, Quan; Li, Hua; Lai, Jiansheng; Li, Lianman
2016-09-01
The Gobiobotia filifer is a small economic fish which distributes in the upstream of Yangtze River and its distributaries. For the environmental pollution and overfishing, its population declined drastically in recent decades, so it is essential to protect its resource. In this study, the complete mitochondrial genome sequence of G. filifer was determined with PCR technology, which contains 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and a non-coding control region with the total length of 16,613 bp. The order and composition of genes were similar to most of the other teleost fish. Most of the genes were encoded on heavy strand, except for ND6 genes and eight tRNAs. Just like most other vertebrates, the bias of G and C has been found in different genes/regions. The complete mitochondrial genome sequence of G. filifer would contribute to better understand evolution of this lineage, population genetics, and will help administrative department to make rules and laws to protect this lineage.
The complete mitochondrial genome of Liobagrus marginatus (Teleostei, Siluriformes: Amblycipitidae).
Li, Qiang; Du, Jun; Liu, Ya; Zhou, Jian; Ke, Hongyu; Liu, Chao; Liu, Guangxun
2014-04-01
The Liobagrus marginatus is an economic fish which distribute in the upstream of Yangtze river and its distributary. For its taste fresh, environmental pollution and overfishing, its population declined drastically and body miniaturization in recent decades, so it is essential to protect its resource. In this study, the complete mitochondrial genome sequence of Liobagrus marginatus was sequenced, which contains 22 tRNA genes, 13 protein-coding genes, 2 rRNA genes, and a non-coding control region with the total length of 16,497 bp. The gene arrangement and composition are similar to most of other fish. Most of the genes are encoded on heavy-strand, except for eight tRNA and ND6 genes. Just like most other vertebrates, the bias of G and C has been found in statistics results of different genes/regions. The complete mitochondrial genome sequence of Liobagrus marginatus would contribute to better understand population genetics, evolution of this lineage, and will help administrative departments to make rules and laws to protect it.
Kwong, Waldan K; Moran, Nancy A
2016-03-01
Honey bees and bumble bees harbour a small, defined set of gut bacterial associates. Strains matching sequences from 16S rRNA gene surveys of bee gut microbiotas were isolated from two honey bee species from East Asia. These isolates were mesophlic, non-pigmented, catalase-positive and oxidase-negative. The major fatty acids were iso-C15 : 0, iso-C17 : 0 3-OH, C16 : 0 and C16 : 0 3-OH. The DNA G+C content was 29-31 mol%. They had ∼87 % 16S rRNA gene sequence identity to the closest relatives described. Phylogenetic reconstruction using 20 protein-coding genes showed that these bee-derived strains formed a highly supported monophyletic clade, sister to the clade containing species of the genera Chryseobacterium and Elizabethkingia within the family Flavobacteriaceae of the phylum Bacteroidetes. On the basis of phenotypic and genotypic characteristics, we propose placing these strains in a novel genus and species: Apibacter adventoris gen. nov., sp. nov. The type strain of Apibacter adventoris is wkB301T ( = NRRL B-65307T = NCIMB 14986T).
Li, Shan; Dong, Xia; Su, Zhengchang
2013-07-30
Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads.
2013-01-01
Background Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. Results To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. Conclusions As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads. PMID:23899370
Dong, Yan; Sun, Hongying; Guo, Hua; Pan, Da; Qian, Changyuan; Hao, Sijing; Zhou, Kaiya
2012-08-15
Myriapods are among the earliest arthropods and may have evolved to become part of the terrestrial biota more than 400 million years ago. A noticeable lack of mitochondrial genome data from Pauropoda hampers phylogenetic and evolutionary studies within the subphylum Myriapoda. We sequenced the first complete mitochondrial genome of a microscopic pauropod, Pauropus longiramus (Arthropoda: Myriapoda), and conducted comprehensive mitogenomic analyses across the Myriapoda. The pauropod mitochondrial genome is a circular molecule of 14,487 bp long and contains the entire set of thirty-seven genes. Frequent intergenic overlaps occurred between adjacent tRNAs, and between tRNA and protein-coding genes. This is the first example of a mitochondrial genome with multiple intergenic overlaps and reveals a strategy for arthropods to effectively compact the mitochondrial genome by overlapping and truncating tRNA genes with neighbor genes, instead of only truncating tRNAs. Phylogenetic analyses based on protein-coding genes provide strong evidence that the sister group of Pauropoda is Symphyla. Additionally, approximately unbiased (AU) tests strongly support the Progoneata and confirm the basal position of Chilopoda in Myriapoda. This study provides an estimation of myriapod origins around 555 Ma (95% CI: 444-704 Ma) and this date is comparable with that of the Cambrian explosion and candidate myriapod-like fossils. A new time-scale suggests that deep radiations during early myriapod diversification occurred at least three times, not once as previously proposed. A Carboniferous origin of pauropods is congruent with the idea that these taxa are derived, rather than basal, progoneatans. Copyright © 2012 Elsevier B.V. All rights reserved.
Hücker, Sarah M.; Ardern, Zachary; Goldberg, Tatyana; Schafferhans, Andrea; Bernhofer, Michael; Vestergaard, Gisle; Nelson, Chase W.; Schloter, Michael; Rost, Burkhard; Scherer, Siegfried
2017-01-01
In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set. PMID:28902868
The complete sequence of mitochondrial genome of polled yak (Bos grunniens).
Chu, Min; Wu, Xiaoyun; Liang, Chunnian; Pei, Jie; Ding, Xuezhi; Guo, Xian; Bao, Pengjia; Yan, Ping
2016-05-01
Generally speaking, the hornless trait is also known as polled. Although the POLL locus could be assigned to a 1.36-Mb interval in the centromeric region of BTA1 (Georges et al., 1993; Drögemüller et al., 2005)), and (Liu et al., 2014) reported a 147-kb segment that included three protein-coding genes was the most likely location of the POLL mutation in domestic yaks, the underlying genetic basis for the polled trait is still unknown. In this work, the complete mitochondrial genome sequence of polled yak was determined for the first time. The total length of the mitogenome is 16,324 bp long, with the base composition of 33.72% A, 27.25% T, 25.83% C, and 13.20% G. It contained 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and 1 non-coding region (D-loop region). The gene order of polled yak mitogenome is identical to that observed in most other vertebrates. The complete mitogenome sequence information of polled yak will provide useful data for further studies on protection of genetic resources and phylogenetic relationships within Bos grunniens.
The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.
Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin
2013-01-01
Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
NASA Astrophysics Data System (ADS)
Park, J.
2015-12-01
The microbial communities transported by Asian dust events have attracted much attention as bioaerosols because the transported airborne microbes may strongly influence the downwind ecosystems and potentially human health in East Asia. Bioaerosol study has received relatively little attention and their characterization and risk assessments remain poorly developed. We used high throughput 16S rRNA gene targeted pyrosequencing and real-time quantitative PCR (qPCR) to monitor airborne bacterial communities and assess their potential risk. We monitored microbial communities in bioaerosol in Seoul between 2011 and 2013 using high volume air samplers. Six samples were collected during Asian dust (AD) events and the other 34 samples were urban air collected during non-Asian dust (non-AD) events. According to the qPCR result, the gene copy numbers of 16S rRNA genes were significantly higher during the AD events (P < 0.05) and their abundances were positively correlated with PM10 concentrations and bacterial diversities. The most abundant bacterial members (genus level) in the AD samples were Bacillus, Neisseria and E.coli/Shigella. To identify pathogenic populations, multilocus sequence typing (MLST) and virulence tests were applied using culture methods. 16S rRNA gene sequences of several pathogens were detected and their relative abundances appeared to have increased with increased concentrations of PM10. About 1% of Bacillus isolates were identified as known pathogenic B. cereus, confirming their presence in Asian dust samples. The qPCR detection of bceT gene, which codes for an enterotoxin in B. cereus group, was significantly increased in the AD dust samples over the non-AD samples. The following MLST assessment and virulence test of cultivated Bacillus isolates showed that B. cereus, B. licheniformis and B. mycoides were identified as pathogenic bacteria, and these pathogenic bacteria were usually more abundant during AD events. To assess the possible associations of identified pathogens on the hospital stroke admissions of residents in Seoul, we identified sixteen bioaerosol episodes using Poisson regression and calculated relative risk. The findings are useful in building a database for bacterial pathogens in AD events.
Ogola, Edwin; Villinger, Jandouwe; Mabuka, Danspaid; Omondi, David; Orindi, Benedict; Mutunga, James; Owino, Vincent; Masiga, Daniel K
2017-09-08
Small islands serve as potential malaria reservoirs through which new infections might come to the mainland and may be important targets in malaria elimination efforts. This study investigated malaria vector species diversity, blood-meal hosts, Plasmodium infection rates, and long-lasting insecticidal net (LLIN) coverage on Mageta, Magare and Ngodhe Islands of Lake Victoria in western Kenya, a region where extensive vector control is implemented on the mainland. From trapping for six consecutive nights per month (November 2012 to March 2015) using CDC light traps, pyrethrum spray catches and backpack aspiration, 1868 Anopheles mosquitoes were collected. Based on their cytochrome oxidase I (COI) and intergenic spacer region PCR and sequencing, Anopheles gambiae s.l. (68.52%), Anopheles coustani (19.81%) and Anopheles funestus s.l. (11.67%) mosquitoes were differentiated. The mean abundance of Anopheles mosquitoes per building per trap was significantly higher (p < 0.001) in Mageta than in Magare and Ngodhe. Mageta was also the most populated island (n = 6487) with low LLIN coverage of 62.35% compared to Ngodhe (n = 484; 88.31%) and Magare (n = 250; 98.59%). Overall, 416 (22.27%) engorged Anopheles mosquitoes were analysed, of which 41 tested positive for Plasmodium falciparum infection by high-resolution melting (HRM) analysis of 18S rRNA and cytochrome b PCR products. Plasmodium falciparum infection rates were 10.00, 11.76, 0, and 18.75% among blood-fed An. gambiae s.s. (n = 320), Anopheles arabiensis (n = 51), An. funestus s.s. (n = 29), and An. coustani (n = 16), respectively. Based on HRM analysis of vertebrate cytochrome b, 16S rRNA and COI PCR products, humans (72.36%) were the prominent blood-meal hosts of malaria vectors, but 20.91% of blood-meals were from non-human vertebrate hosts. These findings demonstrate high Plasmodium infection rates among the primary malaria vectors An. gambiae s.s. and An. arabiensis, as well as in An. coustani for the first time in the region, and that non-human blood-meal sources play an important role in their ecology. Further, the higher Anopheles mosquito abundances on the only low LLIN coverage island of Mageta suggests that high LLIN coverage has been effective in reducing malaria vector populations on Magare and Ngodhe Islands.
Ben Braïek, Olfa; Ghomrassi, Hamdi; Cremonesi, Paola; Morandi, Stefano; Fleury, Yannick; Le Chevalier, Patrick; Hani, Khaled; Bel Hadj, Omrane; Ghrairi, Taoufik
2017-06-01
Screening for lactic acid bacteria (LAB) from fresh shrimp samples (Penaeus vannamei) collected from retail seafood markets in the Tunisian's coast, resulted in the isolation of an Enterococcus strain termed Q1. This strain was selected for its antagonistic activity against pathogenic bacteria such as Listeria monocytogenes, Pseudomonas aeruginosa, Lactococcus garvieae and against fungi (Aspergillus niger and Fusarium equiseti). The Q1 strain was characterised using standard morphological and biochemical tests, growth assays at different temperatures, pH and salinity. 16S rRNA, rpoA and pheS gene sequencing, as well as the 16S-23S rRNA intergenic spacer analyses, were combined to identify strain Q1 as a strain of Enterococcus lactis. The bacteriocin produced by E. lactis Q1 is thermostable, active in the pH range from 4.0 to 9.0 and has a bactericidal mode of action. The enterocin P structural gene was detected by specific PCR in strain E. lactis Q1, which is in good agreement with SDS-PAGE data of the purified bacteriocin. A lack of significant antibiotic resistance genes and virulence determinants was confirmed by specific PCRs. This work provides the first description of an enterocin P producer E. lactis strain isolated from a fresh shrimp. Based on its safety properties (absence of haemolytic activity, virulence factors and antibiotic resistance genes), this strain has the potential to be used as a natural additive or adjunct protective culture in food biopreservation and/or probiotic culture.
Toenshoff, Elena R; Penz, Thomas; Narzt, Thomas; Collingro, Astrid; Schmitz-Esser, Stephan; Pfeiffer, Stefan; Klepal, Waltraud; Wagner, Michael; Weinmaier, Thomas; Rattei, Thomas; Horn, Matthias
2012-01-01
Adelgids (Insecta: Hemiptera: Adelgidae) are known as severe pests of various conifers in North America, Canada, Europe and Asia. Here, we present the first molecular identification of bacteriocyte-associated symbionts in these plant sap-sucking insects. Three geographically distant populations of members of the Adelges nordmannianae/piceae complex, identified based on coI and ef1alpha gene sequences, were investigated. Electron and light microscopy revealed two morphologically different endosymbionts, coccoid or polymorphic, which are located in distinct bacteriocytes. Phylogenetic analyses of their 16S and 23S rRNA gene sequences assigned both symbionts to novel lineages within the Gammaproteobacteria sharing <92% 16S rRNA sequence similarity with each other and showing no close relationship with known symbionts of insects. Their identity and intracellular location were confirmed by fluorescence in situ hybridization, and the names ‘Candidatus Steffania adelgidicola' and ‘Candidatus Ecksteinia adelgidicola' are proposed for tentative classification. Both symbionts were present in all individuals of all investigated populations and in different adelgid life stages including eggs, suggesting vertical transmission from mother to offspring. An 85 kb genome fragment of ‘Candidatus S. adelgidicola' was reconstructed based on a metagenomic library created from purified symbionts. Genomic features including the frequency of pseudogenes, the average length of intergenic regions and the presence of several genes which are absent in other long-term obligate symbionts, suggested that ‘Candidatus S. adelgidicola' is an evolutionarily young bacteriocyte-associated symbiont, which has been acquired after diversification of adelgids from their aphid sister group. PMID:21833037
Lactobacillus brantae sp. nov., isolated from faeces of Canada geese (Branta canadensis).
Volokhov, Dmitriy V; Amselle, Megan; Beck, Brian J; Popham, David L; Whittaker, Paul; Wang, Hua; Kerrigan, Elizabeth; Chizhikov, Vladimir E
2012-09-01
Three strains of lactic acid bacteria (LAB) were isolated from the faeces of apparently healthy wild Canada geese (Branta canadensis) in 2010 by cultivating faecal LAB on Rogosa SL agar under aerobic conditions. These three isolates were found to share 99.9 % gene sequence similarity of their 16S rRNA, their 16S-23S intergenic transcribed spacer region (ITS), partial 23S rRNA, rpoB, rpoC, rpoA and pheS gene sequences. However, the three strains exhibited lower levels of sequence similarity of these genetic targets to all known LAB, and the phylogenetically closest species to the geese strains were Lactobacillus casei, Lactobacillus paracasei, Lactobacillus rhamnosus and Lactobacillus saniviri. In comparison to L. casei ATCC 393(T), L. paracasei ATCC 25302(T), L. rhamnosus ATCC 7469(T) and L. saniviri DSM 24301(T), the novel isolates reacted uniquely in tests for cellobiose, galactose, mannitol, citric acid, aesculin and dextrin, and gave negative results in tests for l-proline arylamidase and l-pyrrolydonyl-arylamidase, and in the Voges-Proskauer test. Biochemical tests for cellobiose, aesculin, galactose, gentiobiose, mannitol, melezitose, ribose, salicin, sucrose, trehalose, raffinose, turanose, amygdalin and arbutin could be used for differentiation between L. saniviri and the novel strains. On the basis of phenotypic and genotypic characteristics, and phylogenetic data, the three isolates represent a novel species of the genus Lactobacillus, for which the name Lactobacillus brantae sp. nov. is proposed. The type strain is SL1108(T) (= ATCC BAA-2142(T) = LMG 26001(T) = DSM 23927(T)) and two additional strains are SL1170 and SL60106.
Role of miRNAs in CD4 T cell plasticity during inflammation and tolerance
Sethi, Apoorva; Kulkarni, Neeraja; Sonar, Sandip; Lal, Girdhari
2013-01-01
Gene expression is tightly regulated in a tuneable, cell-specific and time-dependent manner. Recent advancement in epigenetics and non-coding RNA (ncRNA) revolutionized the concept of gene regulation. In order to regulate the transcription, ncRNA can promptly response to the extracellular signals as compared to transcription factors present in the cells. microRNAs (miRNAs) are ncRNA (~22 bp) encoded in the genome, and present as intergenic or oriented antisense to neighboring genes. The strategic location of miRNA in coding genes helps in the coupled regulation of its expression with host genes. miRNA together with complex machinery called RNA-induced silencing complex (RISC) interacts with target mRNA and degrade the mRNA or inhibits the translation. CD4 T cells play an important role in the generation and maintenance of inflammation and tolerance. Cytokines and chemokines present in the inflamed microenvironment controls the differentiation and function of various subsets of CD4 T cells [Th1, Th2, Th17, and regulatory CD4 T cells (Tregs)]. Recent studies suggest that miRNAs play an important role in the development and function of all subsets of CD4 T cells. In current review, we focused on how various miRNAs are regulated by cell's extrinsic and intrinsic signaling, and how miRNAs affect the transdifferentiation of subsets of CD4 T cell and controls their plasticity during inflammation and tolerance. PMID:23386861
Zhang, Le-Ping; Cai, Yin-Yin; Yu, Dan-Na; Storey, Kenneth B.
2018-01-01
The family Toxoderidae (Mantodea) contains an ecologically diverse group of praying mantis species that have in common greatly elongated bodies. In this study, we sequenced and compared the complete mitochondrial genomes of two Toxoderidae species, Paratoxodera polyacantha and Toxodera hauseri, and compared their mitochondrial genome characteristics with another member of the Toxoderidae, Stenotoxodera porioni (KY689118). The lengths of the mitogenomes of T. hauseri and P. polyacantha were 15,616 bp and 15,999 bp, respectively, which is similar to that of S. porioni (15,846 bp). The size of each gene as well as the A+T-rich region and the A+T content of the whole genome were also very similar among the three species as were the protein-coding genes, the A+T content and the codon usages. The mitogenome of T. hauseri had the typical 22 tRNAs, whereas that of P. polyacantha had 26 tRNAs including an extra two copies of trnA-trnR. Intergenic regions of 67 bp and 76 bp were found in T. hauseri and P. polyacantha, respectively, between COX2 and trnK; these can be explained as residues of a tandem duplication/random loss of trnK and trnD. This non-coding region may be synapomorphic for Toxoderidae. In BI and ML analyses, the monophyly of Toxoderidae was supported and P. polyacantha was the sister clade to T. hauseri and S. porioni. PMID:29686943
Thuan, Nguyen Huy; Dhakal, Dipesh; Pokhrel, Anaya Raj; Chu, Luan Luong; Van Pham, Thi Thuy; Shrestha, Anil; Sohng, Jae Kyung
2018-05-01
Streptomyces peucetius ATCC 27952 produces two major anthracyclines, doxorubicin (DXR) and daunorubicin (DNR), which are potent chemotherapeutic agents for the treatment of several cancers. In order to gain detailed insight on genetics and biochemistry of the strain, the complete genome was determined and analyzed. The result showed that its complete sequence contains 7187 protein coding genes in a total of 8,023,114 bp, whereas 87% of the genome contributed to the protein coding region. The genomic sequence included 18 rRNA, 66 tRNAs, and 3 non-coding RNAs. In silico studies predicted ~ 68 biosynthetic gene clusters (BCGs) encoding diverse classes of secondary metabolites, including non-ribosomal polyketide synthase (NRPS), polyketide synthase (PKS I, II, and III), terpenes, and others. Detailed analysis of the genome sequence revealed versatile biocatalytic enzymes such as cytochrome P450 (CYP), electron transfer systems (ETS) genes, methyltransferase (MT), glycosyltransferase (GT). In addition, numerous functional genes (transporter gene, SOD, etc.) and regulatory genes (afsR-sp, metK-sp, etc.) involved in the regulation of secondary metabolites were found. This minireview summarizes the genome-based genome mining (GM) of diverse BCGs and genome exploration (GE) of versatile biocatalytic enzymes, and other enzymes involved in maintenance and regulation of metabolism of S. peucetius. The detailed analysis of genome sequence provides critically important knowledge useful in the bioengineering of the strain or harboring catalytically efficient enzymes for biotechnological applications.
High variability of mitochondrial gene order among fungi.
Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni
2014-02-01
From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.
Zhi, Shuai; Li, Qiaozhi; Yasui, Yutaka; Edge, Thomas; Topp, Edward; Neumann, Norman F
2015-11-01
Host specificity in E. coli is widely debated. Herein, we used supervised learning logic-regression-based analysis of intergenic DNA sequence variability in E. coli in an attempt to identify single nucleotide polymorphism (SNP) biomarkers of E. coli that are associated with natural selection and evolution toward host specificity. Seven-hundred and eighty strains of E. coli were isolated from 15 different animal hosts. We utilized logic regression for analyzing DNA sequence data of three intergenic regions (flanked by the genes uspC-flhDC, csgBAC-csgDEFG, and asnS-ompF) to identify genetic biomarkers that could potentially discriminate E. coli based on host sources. Across 15 different animal hosts, logic regression successfully discriminated E. coli based on animal host source with relatively high specificity (i.e., among the samples of the non-target animal host, the proportion that correctly did not have the host-specific marker pattern) and sensitivity (i.e., among the samples from a given animal host, the proportion that correctly had the host-specific marker pattern), even after fivefold cross validation. Permutation tests confirmed that for most animals, host specific intergenic biomarkers identified by logic regression in E. coli were significantly associated with animal host source. The highest level of biomarker sensitivity was observed in deer isolates, with 82% of all deer E. coli isolates displaying a unique SNP pattern that was 98% specific to deer. Fifty-three percent of human isolates displayed a unique biomarker pattern that was 98% specific to humans. Twenty-nine percent of cattle isolates displayed a unique biomarker that was 97% specific to cattle. Interestingly, even within a related host group (i.e., Family: Canidae [domestic dogs and coyotes]), highly specific SNP biomarkers (98% and 99% specificity for dog and coyotes, respectively) were observed, with 21% of dog E. coli isolates displaying a unique dog biomarker and 61% of coyote isolates displaying a unique coyote biomarker. Application of a supervised learning method, such as logic regression, to DNA sequence analysis at certain intergenic regions demonstrates that some E. coli strains may evolve to become host-specific. Copyright © 2015 Elsevier Inc. All rights reserved.
HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers
Stadhouders, Ralph; Aktuna, Suleyman; Thongjuea, Supat; Aghajanirefah, Ali; Pourfarzad, Farzin; van IJcken, Wilfred; Lenhard, Boris; Rooks, Helen; Best, Steve; Menzel, Stephan; Grosveld, Frank; Thein, Swee Lay; Soler, Eric
2014-01-01
Genetic studies have identified common variants within the intergenic region (HBS1L-MYB) between GTP-binding elongation factor HBS1L and myeloblastosis oncogene MYB on chromosome 6q that are associated with elevated fetal hemoglobin (HbF) levels and alterations of other clinically important human erythroid traits. It is unclear how these noncoding sequence variants affect multiple erythrocyte characteristics. Here, we determined that several HBS1L-MYB intergenic variants affect regulatory elements that are occupied by key erythroid transcription factors within this region. These elements interact with MYB, a critical regulator of erythroid development and HbF levels. We found that several HBS1L-MYB intergenic variants reduce transcription factor binding, affecting long-range interactions with MYB and MYB expression levels. These data provide a functional explanation for the genetic association of HBS1L-MYB intergenic polymorphisms with human erythroid traits and HbF levels. Our results further designate MYB as a target for therapeutic induction of HbF to ameliorate sickle cell and β-thalassemia disease severity. PMID:24614105
Characterization of circulating transfer RNA-derived RNA fragments in cattle
Casas, Eduardo; Cai, Guohong; Neill, John D.
2015-01-01
The objective was to characterize naturally occurring circulating transfer RNA-derived RNA fragments (tRFs) in cattle1. Serum from eight clinically normal adult dairy cows was collected, and small non-coding RNAs were extracted immediately after collection and sequenced by Illumina MiSeq. Sequences aligned to transfer RNA (tRNA) genes or their flanking sequences were characterized. Sequences aligned to the beginning of 5′ end of the mature tRNA were classified as tRF5; those aligned to the 3′ end of mature tRNA were classified as tRF3; and those aligned to the beginning of the 3′ end flanking sequences were classified as tRF1. There were 3,190,962 sequences that mapped to transfer RNA and small non-coding RNAs in the bovine genome. Of these, 2,323,520 were identified as tRF5s, 562 were tRF3s, and 81 were tRF1s. There were 866,799 sequences identified as other small non-coding RNAs (microRNA, rRNA, snoRNA, etc.) and were excluded from the study. The tRF5s ranged from 28 to 40 nucleotides; and 98.7% ranged from 30 to 34 nucleotides in length. The tRFs with the greatest number of sequences were derived from tRNA of histidine, glutamic acid, lysine, glycine, and valine. There was no association between number of codons for each amino acid and number of tRFs in the samples. The reason for tRF5s being the most abundant can only be explained if these sequences are associated with function within the animal. PMID:26379699
Transcriptome-wide discovery of circular RNAs in Archaea
Danan, Miri; Schwartz, Schraga; Edelheit, Sarit; Sorek, Rotem
2012-01-01
Circular RNA forms had been described in all domains of life. Such RNAs were shown to have diverse biological functions, including roles in the life cycle of viral and viroid genomes, and in maturation of permuted tRNA genes. Despite their potentially important biological roles, discovery of circular RNAs has so far been mostly serendipitous. We have developed circRNA-seq, a combined experimental/computational approach that enriches for circular RNAs and allows profiling their prevalence in a whole-genome, unbiased manner. Application of this approach to the archaeon Sulfolobus solfataricus P2 revealed multiple circular transcripts, a subset of which was further validated independently. The identified circular RNAs included expected forms, such as excised tRNA introns and rRNA processing intermediates, but were also enriched with non-coding RNAs, including C/D box RNAs and RNase P, as well as circular RNAs of unknown function. Many of the identified circles were conserved in Sulfolobus acidocaldarius, further supporting their functional significance. Our results suggest that circular RNAs, and particularly circular non-coding RNAs, are more prevalent in archaea than previously recognized, and might have yet unidentified biological roles. Our study establishes a specific and sensitive approach for identification of circular RNAs using RNA-seq, and can readily be applied to other organisms. PMID:22140119
Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).
Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su
2014-08-01
We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, were overlapped by 41 bp.
McNamara, Patrick J; Krzmarzick, Mark J
2013-07-01
Triclosan is an antimicrobial agent that is discharged to soils with land-applied wastewater biosolids, is persistent under anaerobic conditions, and yet its impact on anaerobic microbial communities in soils is largely unknown. We hypothesized that triclosan enriches for Dehalococcoides-like Chloroflexi because these bacteria respire organochlorides and are likely less sensitive, relative to other bacteria, to the antimicrobial effects of triclosan. Triplicate anaerobic soil microcosms were seeded with agricultural soil, which was not previously exposed to triclosan, and were amended with 1 mg kg(-1) of triclosan. Triplicate control microcosms did not receive triclosan, and the experiment was run for 618 days. The overall bacterial community (assessed by automated ribosomal intergenic spacer analysis and denaturing gradient gel electrophoresis) was not impacted by triclosan; however, the abundance of Dehalococcoides-like Chloroflexi 16S rRNA genes (determined by qPCR) increased 20-fold with triclosan amendment compared with a fivefold increase without triclosan. This work demonstrates that triclosan impacts anaerobic soil communities at environmentally relevant levels. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Swei, Andrea; Bowie, Verna C; Bowie, Rauri C K
2015-04-01
Vector-borne pathogens are transmitted between vertebrate hosts and arthropod vectors, two immensely different environments for the pathogen. There is further differentiation among vertebrate hosts that often have complex, species-specific immunological responses to the pathogen. All this presents a heterogeneous environmental and immunological landscape with possible consequences on the population genetic structure of the pathogen. We evaluated the differential genetic diversity of the Lyme disease pathogen, Borrelia burgdorferi, in its vector, the western black-legged tick (Ixodes pacificus), and in its mammal host community using the 5S-23S rRNA intergenic spacer region. We found differences in haplotype distribution of B. burgdorferi in tick populations from two counties in California as well as between a sympatric tick and vertebrate host community. In addition, we found that three closely related haplotypes consistently occurred in high frequency in all sample types. Lastly, our study found lower species diversity of the B. burgdorferi species complex, known as B. burgdorferi sensu lato, in small mammal hosts versus the tick populations in a sympatric study area. Copyright © 2015 Elsevier GmbH. All rights reserved.
O'Sullivan, N A; Fallon, R; Carroll, C; Smith, T; Maher, M
2000-02-01
Campylobacter enteritis in humans has been linked to consumption of poultry meat. Surveys show that 30-100% of poultry harbour Campylobacter as normal flora of the digestive tract which indicates a need to identify prevalent organism types in flocks and trace their epidemiology. In this study we describe a Campylobacter genus specific polymerase chain reaction (PCR) assay, amplifying the 16 S-23 S rRNA intergenic spacer region with an internal Campylobacter genus specific DNA probe and species specific probes for Campylobacter jejuni and Campylobacter coli designed for confirmation of the amplified PCR products by Southern blot and colorimetric reverse hybridization assays. The specificity of this assay was established by testing a range of food pathogens. Broiler chicken samples were tested following presumptive positive identification by the Malthus System V analyser (Malthus Instruments, UK). The combined PCR and colorimetric reverse hybridization assay is easy to perform and faster than conventional methods for confirmation and identification of Campylobacter species. Copyright 2000 Academic Press.
Complete Genome Sequence of Bacteroides ovatus V975
Goesmann, Alexander; Carding, Simon R.
2016-01-01
The complete genome sequence of Bacteroides ovatus V975 was determined. The genome consists of a single circular chromosome of 6,475,296 bp containing five rRNA operons, 68 tRNA genes, and 4,959 coding genes. PMID:27908995
Liu, Xia; Li, Yuan; Yang, Hongyuan; Zhou, Boyang
2018-04-09
The complete chloroplast (cp) genome of Talinum paniculatum (Caryophyllale), a source of pharmaceutical efficacy similar to ginseng, and a widely distributed and planted edible vegetable, were sequenced and analyzed. The cp genome size of T. paniculatum is 156,929 bp, with a pair of inverted repeats (IRs) of 25,751 bp separated by a large single copy (LSC) region of 86,898 bp and a small single copy (SSC) region of 18,529 bp. The genome contains 83 protein-coding genes, 37 transfer RNA (tRNA) genes, eight ribosomal RNA (rRNA) genes and four pseudogenes. Fifty one (51) repeat units and ninety two (92) simple sequence repeats (SSRs) were found in the genome. The pseudogene rpl23 (Ribosomal protein L23) was insert AATT than other Caryophyllale species by sequence alignment, which located in IRs region. The gene of trnK-UUU (tRNA-Lys) and rpl16 (Ribosomal protein L16) have larger introns in T. paniculatum , and the existence of matK (maturase K) genes, which usually located in the introns of trnK-UUU , rich sequence divergence in Caryophyllale. Complete cp genome comparison with other eight Caryophyllales species indicated that the differences between T. paniculatum and P. oleracea were very slight, and the most highly divergent regions occurred in intergenic spacers. Comparisons of IR boundaries among nine Caryophyllales species showed that T. paniculatum have larger IRs region and the contraction is relatively slight. The phylogenetic analysis among 35 Caryophyllales species and two outgroup species revealed that T. paniculatum and P. oleracea do not belong to the same family. All these results give good opportunities for future identification, barcoding of Talinum species, understanding the evolutionary mode of Caryophyllale cp genome and molecular breeding of T. paniculatum with high pharmaceutical efficacy.
Ren, Qian; Au, Hilda H.T.; Wang, Qing S.; Lee, Seonghoon; Jan, Eric
2014-01-01
The dicistrovirus intergenic internal ribosome entry site (IGR IRES) directly recruits the ribosome and initiates translation using a non-AUG codon. A subset of IGR IRESs initiates translation in either of two overlapping open reading frames (ORFs), resulting in expression of the 0 frame viral structural polyprotein and an overlapping +1 frame ORFx. A U–G base pair adjacent to the anticodon-like pseudoknot of the IRES directs +1 frame translation. Here, we show that the U-G base pair is not absolutely required for +1 frame translation. Extensive mutagenesis demonstrates that 0 and +1 frame translation can be uncoupled. Ribonucleic acid (RNA) structural probing analyses reveal that the mutant IRESs adopt distinct conformations. Toeprinting analysis suggests that the reading frame is selected at a step downstream of ribosome assembly. We propose a model whereby the IRES adopts conformations to occlude the 0 frame aminoacyl-tRNA thereby allowing delivery of the +1 frame aminoacyl-tRNA to the A site to initiate translation of ORFx. This study provides a new paradigm for programmed recoding mechanisms that increase the coding capacity of a viral genome. PMID:25038250
Zhang, Dawei; Li, Haiyan; Xie, Juping; Jiang, Decan; Cao, Liangqi; Yang, Xuewei; Xue, Ping; Jiang, Xiaofeng
2018-06-01
The aim of the present study was to elucidate whether, and how, long intergenic non-protein coding RNA 1296 (LINC01296) is involved in the modulation of human cholangiocarcinoma (CCA) development and progression. Microarray data analysis and reverse transcription-quantitative polymerase chain reaction analysis demonstrated that LINC01296 was significantly upregulated in human CCA compared with nontumor tissues. Furthermore, the expression of LINC01296 in human CCA was positively associated with tumor severity and clinical stage. Knockdown of LINC01296 dramatically suppressed the viability, migration and invasion of RBE and CCLP1 cells, and promoted cell apoptosis in vitro. Furthermore, LINC01296 knockdown inhibited tumor growth in a xenograft model. Mechanistically, LINC01296 was demonstrated to sponge microRNA-5095 (miR-5095), which targets MYCN proto-oncogene bHLH transcription factor (MYCN) mRNA in human CCA. By inhibition of miR-5095, LINC01296 overexpression upregulated the expression of MYCN and promoted cell viability, migration and invasion in CCA cells. The results reveal that the axis of LINC01296/miR-5095/MYCN may be a mechanism to regulate CCA development and progression.
Long noncoding RNA LINC00858 promotes osteosarcoma through regulating miR-139-CDK14 axis.
Gu, Zenghui; Hou, Zhenhai; Zheng, Longbao; Wang, Xinqiang; Wu, Liangbang; Zhang, Cheng
2018-06-23
Long noncoding RNAs (lncRNAs) have been identified to modulate the tumorigenesis of human cancers. The in-depth of lncRNAs on human osteosarcoma oncogenesis is still ambiguous. In present study, functional and mechanism experiments were conducted to investigate the role of long intergenic non-protein coding RNA 00858 (LINC00858) on human osteosarcoma tumorigenesis. Results demonstrated that LINC00858 expression was significantly upregulated in both osteosarcoma tissues and cell lines. Mechanism assays presented that LINC00858 silencing significantly repressed osteosarcoma cells' proliferation and invasion in vitro, and inhibited the tumor growth in vivo. In further experiments, LINC00858 was identified to sponge miR-139 to form RNA-induced silencing complex (RISC) using luciferase reporter assay and RNA immunoprecipitation (RIP). Besides, CDK14 was validated to be the target protein the miR-139. Rescue experiments confirmed the role of LINC00858/miR-139/CDK14 pathway on osteosarcoma cells' phenotype. In summary, these data prove that LINC00858/miR-139/CDK14 axis promotes the tumorigenesis of osteosarcoma, providing a new mechanism or target for osteosarcoma. Copyright © 2018. Published by Elsevier Inc.
Naum-Onganía, Gabriela; Gago-Zachert, Selma; Peña, Eduardo; Grau, Oscar; Garcia, Maria Laura
2003-10-01
Citrus psorosis virus (CPsV), the type member of genus Ophiovirus, has three genomic RNAs. Complete sequencing of CPsV RNA 1 revealed a size of 8184 nucleotides and Northern blot hybridization with chain specific probes showed that its non-coding strand is preferentially encapsidated. The complementary strand of RNA 1 contains two open reading frames (ORFs) separated by a 109-nt intergenic region, one located near the 5'-end potentially encoding a 24K protein of unknown function, and another of 280K containing the core polymerase motifs characteristic of viral RNA-dependent RNA polymerases (RdRp). Comparison of the core RdRp motifs of negative-stranded RNA viruses, supports grouping CPsV, Ranunculus white mottle virus (RWMV) and Mirafiori lettuce virus (MiLV) within the same genus (Ophiovirus), constituting a monophyletic group separated from all other negative-stranded RNA viruses. Furthermore, RNAs 1 of MiLV, CPsV and RWMV are similar in size and those of MiLV and CPsV also in genomic organization and sequence.
Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Sáez-Vásquez, Julio
2016-08-09
The nucleolus is the site of rRNA gene transcription, rRNA processing, and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli using fluorescence-activated cell sorting (FACS) and identified nucleolus-associated chromatin domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions, and mostly inactive protein-coding genes. However, NADs also include active rRNA genes and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased, and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
The "periodic table" of the genetic code: A new way to look at the code and the decoding process.
Komar, Anton A
2016-01-01
Henri Grosjean and Eric Westhof recently presented an information-rich, alternative view of the genetic code, which takes into account current knowledge of the decoding process, including the complex nature of interactions between mRNA, tRNA and rRNA that take place during protein synthesis on the ribosome, and it also better reflects the evolution of the code. The new asymmetrical circular genetic code has a number of advantages over the traditional codon table and the previous circular diagrams (with a symmetrical/clockwise arrangement of the U, C, A, G bases). Most importantly, all sequence co-variances can be visualized and explained based on the internal logic of the thermodynamics of codon-anticodon interactions.
PCR-based 'serotyping' of Legionella pneumophila.
Thürmer, Alexander; Helbig, Jürgen Herbert; Jacobs, Enno; Lück, Paul Christian
2009-05-01
Currently, several PCR assays based on 16S rRNA and virulence-associated genes are available for detection of Legionella pneumophila. So far, no genotyping method has been published that can discriminate between serogroups and monoclonal subgroups of the most common L. pneumophila serogroup 1. Our first approach was to analyse LPS-associated genes of seven L. pneumophila serogroup 1 strains, and we developed two PCR-based methods specific for serogroup 1. Specific DNA fragments could be amplified from all the serogroup 1 strains (n=43) including the strains from the American Type Culture Collection. In contrast, none of the strains from serogroups 2-15 (n=41) contained these specific gene regions. In a second approach, primers specific for the lag-1 gene, encoding an O-acetyltransferase, which is responsible for the presence of the LPS epitope recognized by mAb 3/1, were designed and tested for their ability to differentiate between mAb 3/1-positive and -negative strains. All mAb 3/1-positive strains (n=30) contained the lag-1 gene, but in turn 4 of 13 tested mAb 3/1-negative strains were also positive in the PCR. Thus, the discrimination between mAb 3/1-positive and mAb 3/1-negative subgroups could not be achieved for all strains. In a third approach, two intergenic regions expected to be specific for monoclonal subgroup Knoxville and closely related subgroups Benidorm/Bellingham were identified and used for selective genotyping. These intergenic regions could not only be amplified in every tested strain belonging to the subgroups Knoxville, Benidorm and Bellingham, but also in some strains of other unrelated subgroups. The two PCR approaches with primers specific for serogroup 1 genes definitely represent a valuable tool in outbreak investigations and for risk assessment. They also might be used for culture-independent diagnosis of legionellosis caused by L. pneumophila serogroup 1.
Ben Said, M; Abbassi, M S; Bianchini, V; Sghaier, S; Cremonesi, P; Romanò, A; Gualdi, V; Hassen, A; Luini, M V
2016-12-01
Staphylococcus aureus is a major agent of bovine mastitis in dairy herds, causing economic losses in dairy industry worldwide. In addition, milk and milk-products contaminated by Staph. aureus can cause harmful human diseases. The aim of this study was to characterize Staph. aureus strains isolated from dairy farms in Tunisia. Bulk tank milk (n = 32) and individual cow milk (n = 130) samples were collected during the period of 2013-2014. Forty-three Staph. aureus isolates were recovered and typed by spa typing, 16S-23S rRNA intergenic spacer (RS-PCR) and multiplex PCRs for 22 virulence genes. Antimicrobial resistance was also investigated with a disc diffusion test. A selected subsample of 22 strains was additionally genotyped by multilocus sequence typing. Seventeen spa types were recovered, and t2421 (n = 10), t521 (n = 6) and t2112 (n = 5) were the most common. Fourteen different RS-PCR genotypes grouped into 11 clusters were detected in our study, with predominance of the R VI genotype (n = 24). Eight sequence types were identified and Clonal Complex 97, corresponding to RS-PCR cluster R, was the most common (n = 10), followed by CC1 (n = 4), CC15 (n = 3) and other four accounting for one or two strains. Different combinations of virulence genes were reported, and enterotoxin genes were present in few strains (seh, n = 4; sea, n = 2; sea and seh, n = 2; sec and sel, n = 2). The majority of strains were resistant only to penicillin; only one strain was found to be multiresistant and no methicillin-resistant Staph. aureus was demonstrated. Our study reported the isolation of CC97 from bovine milk in Tunisia for the first time and confirmed the relevance of this lineage in intramammary infection in cows. This paper describes the characteristics of Staphylococcus aureus isolated from bulk tank and individual cow milk in Tunisia. All strains were genotyped by spa typing and RS-PCR, a method based on the amplification of the 16S-23S rRNA intergenic spacer region, and multiplex PCRs for 22 virulence genes. A selected subsample of strains was also genotyped by multilocus sequence typing. All strains were tested for antimicrobial resistance. Our study evidences a predominance of strains belonging to Clonal Complex 97. Methicillin-resistant strains were not detected, and overall low level of antimicrobial resistance was reported. © 2016 The Society for Applied Microbiology.
Morimoto, Tomomi; Arii, Jun; Akashi, Hiroomi; Kawaguchi, Yasushi
2009-03-01
Information on sites in HSV genomes at which foreign gene(s) can be inserted without disrupting viral genes or affecting properties of the parental virus are important for basic research on HSV and development of HSV-based vectors for human therapy. The intergenic region between HSV-1 UL3 and UL4 genes has been reported to satisfy the requirements for such an insertion site. The UL3 and UL4 genes are oriented toward the intergenic region and, therefore, insertion of a foreign gene(s) into the region between the UL3 and UL4 polyadenylation signals should not disrupt any viral genes or transcriptional units. HSV-1 and HSV-2 each have more than 10 additional regions structurally similar to the intergenic region between UL3 and UL4. In the studies reported here, it has been demonstrated that insertion of a reporter gene expression cassette into several of the HSV-1 and HSV-2 intergenic regions has no effect on viral growth in cell culture or virulence in mice, suggesting that these multiple intergenic regions may be suitable HSV sites for insertion of foreign genes.
The evolutionary landscape of intergenic trans-splicing events in insects
Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan
2015-01-01
To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of ‘breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
SHPRH regulates rRNA transcription by recognizing the histone code in an mTOR-dependent manner.
Lee, Deokjae; An, Jungeun; Park, Young-Un; Liaw, Hungjiun; Woodgate, Roger; Park, Jun Hong; Myung, Kyungjae
2017-04-25
Many DNA repair proteins have additional functions other than their roles in DNA repair. In addition to catalyzing PCNA polyubiquitylation in response to the stalling of DNA replication, SHPRH has the additional function of facilitating rRNA transcription by localizing to the ribosomal DNA (rDNA) promoter in the nucleoli. SHPRH was recruited to the rDNA promoter using its plant homeodomain (PHD), which interacts with histone H3 when the fourth lysine of H3 is not trimethylated. SHPRH enrichment at the rDNA promoter was inhibited by cell starvation, by treatment with actinomycin D or rapamycin, or by depletion of CHD4. SHPRH also physically interacted with the RNA polymerase I complex. Taken together, we provide evidence that SHPRH functions in rRNA transcription through its interaction with histone H3 in a mammalian target of rapamycin (mTOR)-dependent manner.
Babina, Arianne M; Parker, Darren J; Li, Gene-Wei; Meyer, Michelle M
2018-06-20
In many bacteria, ribosomal proteins autogenously repress their own expression by interacting with RNA structures typically located in the 5'-UTRs of their mRNA transcripts. This regulation is necessary to maintain a balance between ribosomal proteins and rRNA to ensure proper ribosome production. Despite advances in non-coding RNA discovery and validation of RNA-protein regulatory interactions, the selective pressures that govern the formation and maintenance of such RNA cis-regulators in the context of an organism remain largely undetermined. To examine the impact disruptions to this regulation have on bacterial fitness, we introduced point mutations that abolish ribosomal protein binding and regulation into the RNA structure that controls expression of ribosomal proteins L20 and L35 within the Bacillus subtilis genome. Our studies indicate that removing this regulation results in reduced log phase growth, improper rRNA maturation, and the accumulation of a kinetically trapped or mis-assembled ribosomal particle at low temperatures, suggesting defects in ribosome synthesis. Such work emphasizes the important role regulatory RNAs play in the stoichiometric production of ribosomal components for proper ribosome composition and overall organism viability and reinforces the potential of targeting ribosomal protein production and ribosome assembly with novel antimicrobials. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Complete Genome Sequence of Bacteroides ovatus V975.
Wegmann, Udo; Goesmann, Alexander; Carding, Simon R
2016-12-01
The complete genome sequence of Bacteroides ovatus V975 was determined. The genome consists of a single circular chromosome of 6,475,296 bp containing five rRNA operons, 68 tRNA genes, and 4,959 coding genes. Copyright © 2016 Wegmann et al.
Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe
2011-10-01
In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.
Phylogenetic Evidence for Lateral Gene Transfer in the Intestine of Marine Iguanas
Nelson, David M.; Cann, Isaac K. O.; Altermann, Eric; Mackie, Roderick I.
2010-01-01
Background Lateral gene transfer (LGT) appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remains poorly understood. Methodology/Principal Findings We sequenced two fosmid inserts obtained from a genomic DNA library derived from an agar-degrading enrichment culture of marine iguana fecal material. The inserts harbored 16S rRNA genes that place the organism from which they originated within Clostridium cluster IV, a well documented group that habitats the mammalian intestinal tract. However, sequence analysis indicates that 52% of the protein-coding genes on the fosmids have top BLASTX hits to bacterial species that are not members of Clostridium cluster IV, and phylogenetic analysis suggests that at least 10 of 44 coding genes on the fosmids may have been transferred from Clostridium cluster XIVa to cluster IV. The fosmids encoded four transposase-encoding genes and an integrase-encoding gene, suggesting their involvement in LGT. In addition, several coding genes likely involved in sugar transport were probably acquired through LGT. Conclusion Our phylogenetic evidence suggests that LGT may be common among phylogenetically distinct members of the phylum Firmicutes inhabiting the intestinal tract of marine iguanas. PMID:20520734
Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi
2018-01-02
Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.
Loutre, Romuald; Heckel, Anne-Marie; Jeandard, Damien; Tarassov, Ivan; Entelis, Nina
2018-01-01
Mutations in mitochondrial DNA are an important source of severe and incurable human diseases. The vast majority of these mutations are heteroplasmic, meaning that mutant and wild-type genomes are present simultaneously in the same cell. Only a very high proportion of mutant mitochondrial DNA (heteroplasmy level) leads to pathological consequences. We previously demonstrated that mitochondrial targeting of small RNAs designed to anneal with mutant mtDNA can decrease the heteroplasmy level by specific inhibition of mutant mtDNA replication, thus representing a potential therapy. We have also shown that 5S ribosomal RNA, partially imported into human mitochondria, can be used as a vector to deliver anti-replicative oligoribonucleotides into human mitochondria. So far, the efficiency of cellular expression of recombinant 5S rRNA molecules bearing therapeutic insertions remained very low. In the present study, we designed new versions of anti-replicative recombinant 5S rRNA targeting a large deletion in mitochondrial DNA which causes the KSS syndrome, analyzed their specific annealing to KSS mitochondrial DNA and demonstrated their import into mitochondria of cultured human cells. To obtain an increased level of the recombinant 5S rRNA stable expression, we created transmitochondrial cybrid cell line bearing a site for Flp-recombinase and used this system for the recombinase-mediated integration of genes coding for the anti-replicative recombinant 5S rRNAs into nuclear genome. We demonstrated that stable expression of anti-replicative 5S rRNA versions in human transmitochondrial cybrid cells can induce a shift in heteroplasmy level of KSS mutation in mtDNA. This shift was directly dependent on the level of the recombinant 5S rRNA expression and the sequence of the anti-replicative insertion. Quantification of mtDNA copy number in transfected cells revealed the absence of a non-specific effect on wild type mtDNA replication, indicating that the decreased proportion between mutant and wild type mtDNA molecules is not a consequence of a random repopulation of depleted pool of mtDNA genomes. The heteroplasmy change could be also modulated by cell growth conditions, namely increased by cells culturing in a carbohydrate-free medium, thus forcing them to use oxidative phosphorylation and providing a selective advantage for cells with improved respiration capacities. We discuss the advantages and limitations of this approach and propose further development of the anti-replicative strategy based on the RNA import into human mitochondria.
Ramírez-Bahena, Martha Helena; Peix, Alvaro; Rivas, Raúl; Camacho, María; Rodríguez-Navarro, Dulce N; Mateos, Pedro F; Martínez-Molina, Eustoquio; Willems, Anne; Velázquez, Encarna
2009-08-01
Several strains isolated from the legume Pachyrhizus erosus were characterized on the basis of diverse genetic, phenotypic and symbiotic approaches. These novel strains formed two groups closely related to Bradyrhizobium elkanii according to their 16S rRNA gene sequences. Strains PAC48T and PAC68T, designated as the type strains of these two groups, presented 99.8 and 99.1% similarity, respectively, in their 16S rRNA gene sequences with respect to B. elkanii USDA 76T. In spite of these high similarity values, the analysis of additional phylogenetic markers such as atpD and glnII genes and the 16S-23S intergenic spacer (ITS) showed that strains PAC48T and PAC68T represented two separate novel species of the genus Bradyrhizobium with B. elkanii as their closest relative. Phenotypic differences among the novel strains isolated from Pachyrhizus and B. elkanii were found regarding the assimilation of carbon sources and antibiotic resistance. All these differences were congruent with DNA-DNA hybridization analysis which revealed 21% genetic relatedness between strains PAC48T and PAC68T and 46% and 25%, respectively, between these strains and B. elkanii LMG 6134T. The nodD and nifH genes of strains PAC48T and PAC68T were phylogenetically divergent from those of bradyrhizobia species that nodulate soybean. Soybean was not nodulated by the novel Pachyrhizus isolates. Based on the genotypic and phenotypic data obtained in this study, the new strains represent two novel species for which the names Bradyrhizobium pachyrhizi sp. nov. (type strain PAC48T=LMG 24246T=CECT 7396T) and Bradyrhizobium jicamae sp. nov. (type strain PAC68T=LMG 24556T=CECT 7395T) are proposed.
Stable Transmission of Borrelia burgdorferi Sensu Stricto on the Outer Banks of North Carolina.
Levine, J F; Apperson, C S; Levin, M; Kelly, T R; Kakumanu, M L; Ponnusamy, L; Sutton, H; Salger, S A; Caldwell, J M; Szempruch, A J
2017-08-01
The spirochaete (Borrelia burgdorferi) associated with Lyme disease was detected in questing ticks and rodents during a period of 18 years, 1991-2009, at five locations on the Outer Banks of North Carolina. The black-legged tick (Ixodes scapularis) was collected at varied intervals between 1991 and 2009 and examined for B. burgdorferi. The white-footed mouse (Peromyscus leucopus), house mouse (Mus musculus) marsh rice rat (Oryzomys palustris), marsh rabbit (Sylvilagus palustris), eastern cottontail (Sylvilagus floridanus) and six-lined racerunner (Cnemidophorus sexlineatus) were live-trapped, and their tissues cultured to isolate spirochaetes. Borrelia burgdorferi isolates were obtained from questing adult I. scapularis and engorged I. scapularis removed from P. leucopus, O. palustris and S. floridanus. The prevalence of B. burgdorferi infection was variable at different times and sites ranging from 7 to 14% of examined questing I. scapularis. Mitochondrial (16S) rRNA gene phylogenetic analysis from 65 adult I. scapularis identified 12 haplotypes in two major clades. Nine haplotypes were associated with northern/Midwestern I. scapularis populations and three with southern I. scapularis populations. Sixteen isolates obtained from tick hosts in 2005 were confirmed to be B. burgdorferi by amplifying and sequencing of 16S rRNA and 5S-23S intergenic spacer fragments. The sequences had 98-99% identity to B. burgdorferi sensu stricto strains B31, JD1 and M11p. Taken together, these studies indicate that B. burgdorferi sensu stricto is endemic in questing I. scapularis and mammalian tick hosts on the Outer Banks of North Carolina. © 2016 Blackwell Verlag GmbH.
Sylvan, J B; Pyenson, B C; Rouxel, O; German, C R; Edwards, K J
2012-03-01
We deployed sediment traps adjacent to two active hydrothermal vents at 9°50'N on the East Pacific Rise (EPR) to assess the variability in bacterial community structure associated with plume particles on the timescale of weeks to months, to determine whether an endemic population of plume microbes exists, and to establish ecological relationships between bacterial populations and vent chemistry. Automated rRNA intergenic spacer analysis (ARISA) indicated that there are separate communities at the two different vents and temporal community variations between each vent. Correlation analysis between chemistry and microbiology indicated that shifts in the coarse particulate (>1 mm) Fe/(Fe+Mn+Al), Cu, V, Ca, Al, (232) Th, and Ti as well as fine-grained particulate (<1 mm) Fe/(Fe+Mn+Al), Fe, Ca, and Co are reflected in shifts in microbial populations. 16S rRNA clone libraries from each trap at three time points revealed a high percentage of Epsilonproteobacteria clones and hyperthermophilic Aquificae. There is a shift toward the end of the experiment to more Gammaproteobacteria and Alphaproteobacteria, many of whom likely participate in Fe and S cycling. The particle-attached plume environment is genetically distinct from the surrounding seawater. While work to date in hydrothermal environments has focused on determining the microbial communities on hydrothermal chimneys and the basaltic lavas that form the surrounding seafloor, little comparable data exist on the plume environment that physically and chemically connects them. By employing sediment traps for a time-series approach to sampling, we show that bacterial community composition on plume particles changes on timescales much shorter than previously known. © 2012 Blackwell Publishing Ltd.
Mohammed, Riazuddin; Brink, Geoffrey E.; Stevenson, David M.; Neumann, Anthony P.; Beauchemin, Karen A.; Suen, Garret; Weimer, Paul J.
2014-01-01
The rich and diverse microbiota of the rumen provides ruminant animals the capacity to utilize highly fibrous feedstuffs as their energy source, but there is surprisingly little information on the composition of the microbiome of ruminants fed all-forage diets, despite the importance of such agricultural production systems worldwide. In three 28-day periods, three ruminally-cannulated Holstein heifers sequentially grazed orchardgrass pasture (OP), then were fed orchardgrass hay (OH), then returned to OP. These heifers displayed greater shifts in ruminal bacterial community composition (determined by automated ribosomal intergenic spacer analysis and by pyrotag sequencing of 16S rRNA genes) than did two other heifers maintained 84 d on the same OP. Phyla Firmicutes and Bacteroidetes dominated all ruminal samples, and quantitative PCR indicated that members of the genus Prevotella averaged 23% of the 16S rRNA gene copies, well below levels previously reported with cows fed total mixed rations. Differences in bacterial community composition and ruminal volatile fatty acid (VFA) profiles were observed between the OP and OH despite similarities in gross chemical composition. Compared to OP, feeding OH increased the molar proportion of ruminal acetate (P = 0.02) and decreased the proportion of ruminal butyrate (P < 0.01), branched-chain VFA (P < 0.01) and the relative population size of the abundant genus Butyrivibrio (P < 0.001), as determined by pyrotag sequencing. Despite the low numbers of animals examined, the observed changes in VFA profile in the rumens of heifers on OP vs. OH are consistent with the shifts in Butyrivibrio abundance and its known physiology as a butyrate producer that ferments both carbohydrates and proteins. PMID:25538699
Species identification and molecular typing of human Brucella isolates from Kuwait.
Mustafa, Abu S; Habibi, Nazima; Osman, Amr; Shaheed, Faraz; Khan, Mohd W
2017-01-01
Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degree of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16 were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90-99% sequence identity with B. melitensis and real-time PCR with genus- and species- specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) were found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region.
Species identification and molecular typing of human Brucella isolates from Kuwait
Osman, Amr; Shaheed, Faraz; Khan, Mohd W.
2017-01-01
Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degree of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16 were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90–99% sequence identity with B. melitensis and real-time PCR with genus- and species- specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) were found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region. PMID:28800594
Holman, Hoi-Ying N.; DeSantis, Todd Z.; Wanner, Gerhard; Andersen, Gary L.; Perras, Alexandra K.; Meck, Sandra; Völkel, Jörg; Bechtel, Hans A.; Wirth, Reinhard; Moissl-Eichinger, Christine
2014-01-01
Earth harbors an enormous portion of subsurface microbial life, whose microbiome flux across geographical locations remains mainly unexplored due to difficult access to samples. Here, we investigated the microbiome relatedness of subsurface biofilms of two sulfidic springs in southeast Germany that have similar physical and chemical parameters and are fed by one deep groundwater current. Due to their unique hydrogeological setting these springs provide accessible windows to subsurface biofilms dominated by the same uncultivated archaeal species, called SM1 Euryarchaeon. Comparative analysis of infrared imaging spectra demonstrated great variations in archaeal membrane composition between biofilms of the two springs, suggesting different SM1 euryarchaeal strains of the same species at both aquifer outlets. This strain variation was supported by ultrastructural and metagenomic analyses of the archaeal biofilms, which included intergenic spacer region sequencing of the rRNA gene operon. At 16S rRNA gene level, PhyloChip G3 DNA microarray detected similar biofilm communities for archaea, but site-specific communities for bacteria. Both biofilms showed an enrichment of different deltaproteobacterial operational taxonomic units, whose families were, however, congruent as were their lipid spectra. Consequently, the function of the major proportion of the bacteriome appeared to be conserved across the geographic locations studied, which was confirmed by dsrB-directed quantitative PCR. Consequently, microbiome differences of these subsurface biofilms exist at subtle nuances for archaea (strain level variation) and at higher taxonomic levels for predominant bacteria without a substantial perturbation in bacteriome function. The results of this communication provide deep insight into the dynamics of subsurface microbial life and warrant its future investigation with regard to metabolic and genomic analyses. PMID:24971452
Muñoz-Quezada, Sergio; Chenoll, Empar; Vieites, José María; Genovés, Salvador; Maldonado, José; Bermúdez-Brito, Miriam; Gomez-Llorente, Carolina; Matencio, Esther; Bernal, María José; Romero, Fernando; Suárez, Antonio; Ramón, Daniel; Gil, Angel
2013-01-01
The aim of the present study was to isolate, identify and characterise novel strains of lactic acid bacteria and bifidobacteria with probiotic properties from the faeces of exclusively breast-fed infants. Of the 4680 isolated colonies, 758 exhibited resistance to low pH and tolerance to high concentrations of bile salts; of these, only forty-two exhibited a strong ability to adhere to enterocytes in vitro. The identities of the isolates were confirmed by 16S ribosomal RNA (rRNA) sequencing, which permitted the grouping of the forty-two bacteria into three different strains that showed more than 99 % sequence identity with Lactobacillus paracasei, Lactobacillus rhamnosus and Bifidobacterium breve, respectively. The strain identification was confirmed by sequencing the 16S-23S rRNA intergenic spacer regions. Strains were assayed for enzymatic activity and carbohydrate utilisation, and they were deposited in the Collection Nationale de Cultures de Microorganismes (CNCM) of the Institute Pasteur and named L. paracasei CNCM I-4034, B. breve CNCM I-4035 and L. rhamnosus CNCM I-4036. The strains were susceptible to antibiotics and did not produce undesirable metabolites, and their safety was assessed by acute ingestion in immunocompetent and immunosuppressed BALB/c mouse models. The three novel strains inhibited in vitro the meningitis aetiological agent Listeria monocytogenes and human rotavirus infections. B. breve CNCM I-4035 led to a higher IgA concentration in faeces and plasma of mice. Overall, these results suggest that L. paracasei CNCM I-4034, B. breve CNCM I-4035 and L. rhamnosus CNCM I-4036 should be considered as probiotic strains, and their human health benefits should be further evaluated.
Moreira, João Luiz S; Mota, Rodrigo M; Horta, Maria F; Teixeira, Santuza MR; Neumann, Elisabeth; Nicoli, Jacques R; Nunes, Álvaro C
2005-01-01
Background The accurate identification of Lactobacillus and other co-isolated bacteria during microbial ecological studies of ecosystems such as the human or animal intestinal tracts and food products is a hard task by phenotypic methods requiring additional tests such as protein and/or lipids profiling. Results Bacteria isolated in different probiotic prospecting studies, using de Man, Rogosa and Sharpe medium (MRS), were typed at species level by PCR amplification of 16S-23S rRNA intergenic spacers using universal primers that anneal within 16S and 23S genes, followed by restriction digestion analyses of PCR products. The set of enzymes chosen differentiates most species of Lactobacillus genus and also co-isolated bacteria such as Enterococcus, Streptococcus, Weissella, Staphylococcus, and Escherichia species. The in silico predictions of restriction patterns generated by the Lactobacillus shorter spacers digested with 11 restriction enzymes with 6 bp specificities allowed us to distinguish almost all isolates at the species level but not at the subspecies one. Simultaneous theoretical digestions of the three spacers (long, medium and short) with the same set of enzymes provided more complex patterns and allowed us to distinguish the species without purifying and cloning of PCR products. Conclusion Lactobacillus isolates and several other strains of bacteria co-isolated on MRS medium from gastrointestinal ecosystem and fermented food products could be identified using DNA fingerprints generated by restriction endonucleases. The methodology based on amplified ribosomal DNA restriction analysis (ARDRA) is easier, faster and more accurate than the current methodologies based on fermentation profiles, used in most laboratories for the purpose of identification of these bacteria in different prospecting studies. PMID:15788104
Operon-mapper: A Web Server for Precise Operon Identification in Bacterial and Archaeal Genomes.
Taboada, Blanca; Estrada, Karel; Ciria, Ricardo; Merino, Enrique
2018-06-19
Operon-mapper is a web server that accurately, easily, and directly predicts the operons of any bacterial or archaeal genome sequence. The operon predictions are based on the intergenic distance of neighboring genes as well as the functional relationships of their protein-coding products. To this end, Operon-mapper finds all the ORFs within a given nucleotide sequence, along with their genomic coordinates, orthology groups, and functional relationships. We believe that Operon-mapper, due to its accuracy, simplicity and speed, as well as the relevant information that it generates, will be a useful tool for annotating and characterizing genomic sequences. http://biocomputo.ibt.unam.mx/operon_mapper/.
The complete mitochondrial genome sequence of the Tibetan red fox (Vulpes vulpes montana).
Zhang, Jin; Zhang, Honghai; Zhao, Chao; Chen, Lei; Sha, Weilai; Liu, Guangshuai
2015-01-01
In this study, the complete mitochondrial genome of the Tibetan red fox (Vulpes Vulpes montana) was sequenced for the first time using blood samples obtained from a wild female red fox captured from Lhasa in Tibet, China. Qinghai--Tibet Plateau is the highest plateau in the world with an average elevation above 3500 m. Sequence analysis showed it contains 12S rRNA gene, 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region (CR). The variable tandem repeats in CR is the main reason of the length variability of mitochondrial genome among canide animals.
Szymanski, Maciej; Karlowski, Wojciech M
2016-01-01
In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.
Complete mitochondrial genome of the moon jellyfish, Aurelia sp. nov. (Cnidaria, Scyphozoa).
Hwang, Dae-Sik; Park, Eunji; Won, Yong-Jin; Lee, Jae-Seong
2014-02-01
We sequenced 16,971 bp of the linear mitochondrial DNA of the moon jellyfish Aurelia sp. nov. and characterized it by comparing with Aurelia aurita. They had 13 protein-coding genes (PCGs), 16S rRNA and 12S rRNA with three tRNAs (tRNA-Leu, tRNA-Ser(TGA), tRNA-Met). Both have another two PCGs, orf969 and orf324 with telomeres at both ends. After comparison of Aurelia sp. nov. with Aurelia aurita, we found low-protein similarity of orf969 (59%) and orf324 (75%), respectively, while the other 13 PCGs showed 80% to 98% protein similarities.
Complete mitochondrial genome of the jellyfish, Chrysaora quinquecirrha (Cnidaria, Scyphozoa).
Hwang, Dae-Sik; Park, Eunji; Won, Yong-Jin; Lee, Woo-Jin; Shin, Kyoungsoon; Lee, Jae-Seong
2014-02-01
We sequenced 16,775 bp of the linear mitochondrial DNA of the jellyfish Chrysaora quinquecirrha and characterized them. C. quinquecirrha has 13 protein-coding genes (PCGs), 16S rRNA and 12S rRNA with 3 tRNAs (tRNA-Leu, tRNA-Ser(TGA), tRNA-Met) as shown in Aurelia sp. nov. Both have another two PCGs such as helicase and orf363 with telomeres at both ends. The PCGs of C. quinquecirrha shows anti-G bias on 2nd and 3rd positions of PCGs as well as anti-C bias on 1st and 3rd positions of PCGs.
Cheng, Hui; Li, Jinfeng; Zhang, Hong; Cai, Binhua; Gao, Zhihong
2017-01-01
Compared with other members of the family Rosaceae, the chloroplast genomes of Fragaria species exhibit low variation, and this situation has limited phylogenetic analyses; thus, complete chloroplast genome sequencing of Fragaria species is needed. In this study, we sequenced the complete chloroplast genome of F. × ananassa ‘Benihoppe’ using the Illumina HiSeq 2500-PE150 platform and then performed a combination of de novo assembly and reference-guided mapping of contigs to generate complete chloroplast genome sequences. The chloroplast genome exhibits a typical quadripartite structure with a pair of inverted repeats (IRs, 25,936 bp) separated by large (LSC, 85,531 bp) and small (SSC, 18,146 bp) single-copy (SC) regions. The length of the F. × ananassa ‘Benihoppe’ chloroplast genome is 155,549 bp, representing the smallest Fragaria chloroplast genome observed to date. The genome encodes 112 unique genes, comprising 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Comparative analysis of the overall nucleotide sequence identity among ten complete chloroplast genomes confirmed that for both coding and non-coding regions in Rosaceae, SC regions exhibit higher sequence variation than IRs. The Ka/Ks ratio of most genes was less than 1, suggesting that most genes are under purifying selection. Moreover, the mVISTA results also showed a high degree of conservation in genome structure, gene order and gene content in Fragaria, particularly among three octoploid strawberries which were F. × ananassa ‘Benihoppe’, F. chiloensis (GP33) and F. virginiana (O477). However, when the sequences of the coding and non-coding regions of F. × ananassa ‘Benihoppe’ were compared in detail with those of F. chiloensis (GP33) and F. virginiana (O477), a number of SNPs and InDels were revealed by MEGA 7. Six non-coding regions (trnK-matK, trnS-trnG, atpF-atpH, trnC-petN, trnT-psbD and trnP-psaJ) with a percentage of variable sites greater than 1% and no less than five parsimony-informative sites were identified and may be useful for phylogenetic analysis of the genus Fragaria. PMID:29038765
Voß, Björn; Bolhuis, Henk; Fewer, David P.; Kopf, Matthias; Möke, Fred; Haas, Fabian; El-Shehawy, Rehab; Hayes, Paul; Bergman, Birgitta; Sivonen, Kaarina; Dittmann, Elke; Scanlan, Dave J.; Hagemann, Martin; Stal, Lucas J.; Hess, Wolfgang R.
2013-01-01
Nodularia spumigena is a filamentous diazotrophic cyanobacterium that dominates the annual late summer cyanobacterial blooms in the Baltic Sea. But N. spumigena also is common in brackish water bodies worldwide, suggesting special adaptation allowing it to thrive at moderate salinities. A draft genome analysis of N. spumigena sp. CCY9414 yielded a single scaffold of 5,462,271 nucleotides in length on which genes for 5,294 proteins were annotated. A subsequent strand-specific transcriptome analysis identified more than 6,000 putative transcriptional start sites (TSS). Orphan TSSs located in intergenic regions led us to predict 764 non-coding RNAs, among them 70 copies of a possible retrotransposon and several potential RNA regulators, some of which are also present in other N2-fixing cyanobacteria. Approximately 4% of the total coding capacity is devoted to the production of secondary metabolites, among them the potent hepatotoxin nodularin, the linear spumigin and the cyclic nodulapeptin. The transcriptional complexity associated with genes involved in nitrogen fixation and heterocyst differentiation is considerably smaller compared to other Nostocales. In contrast, sophisticated systems exist for the uptake and assimilation of iron and phosphorus compounds, for the synthesis of compatible solutes, and for the formation of gas vesicles, required for the active control of buoyancy. Hence, the annotation and interpretation of this sequence provides a vast array of clues into the genomic underpinnings of the physiology of this cyanobacterium and indicates in particular a competitive edge of N. spumigena in nutrient-limited brackish water ecosystems. PMID:23555932
Kim, Min Jee; Hong, Eui Jeong; Kim, Iksoo
2016-01-01
We sequenced the complete mitochondrial (mt) genome of Camponotus atrox (Hymenoptera: Formicidae), which is only distributed in Korea. The genome was 16 540 bp in size and contained typical sets of genes (13 protein-coding genes, 22 tRNAs, and 2 rRNAs). The C. atrox A+T-rich region, at 1402 bp, was the longest of all sequenced ant genomes and was composed of an identical tandem repeat consisting of six 100-bp copies and one 96-bp copy. A total of 315 bp of intergenic spacer sequence was spread over 23 regions. An alignment of the spacer sequences in ants was largely feasible among congeneric species, and there was substantial sequence divergence, indicating their potential use as molecular markers for congeneric species. The A/T contents at the first and second codon positions of protein-coding genes (PCGs) were similar for ant species, including C. atrox (73.9% vs. 72.3%, on average). With increased taxon sampling among hymenopteran superfamilies, differences in the divergence rates (i.e., the non-synonymous substitution rates) between the suborders Symphyta and Apocrita were detected, consistent with previous results. The C. atrox mt genome had a unique gene arrangement, trnI-trnM-trnQ, at the A+T-rich region and ND2 junction (underline indicates inverted gene). This may have originated from a tandem duplication of trnM-trnI, resulting in trnM-trnI-trnM-trnI-trnQ, and the subsequent loss of the first trnM and second trnI, resulting in trnI-trnM-trnQ.
Guo, D; Maiss, E; Adam, G; Casper, R
1995-05-01
The RNA3 of prunus necrotic ringspot ilarvirus (PNRSV) has been cloned and its entire sequence determined. The RNA3 consists of 1943 nucleotides (nt) and possesses two large open reading frames (ORFs) separated by an intergenic region of 74 nt. The 5' proximal ORF is 855 nt in length and codes for a protein of molecular mass 31.4 kDa which has homologies with the putative movement protein of other members of the Bromoviridae. The 3' proximal ORF of 675 nt is the cistron for the coat protein (CP) and has a predicted molecular mass of 24.9 kDa. The sequence of the 3' non-coding region (NCR) of PNRSV RNA3 showed a high degree of similarity with those of tobacco streak virus (TSV), prune dwarf virus (PDV), apple mosaic virus (ApMV) and also alfalfa mosaic virus (AIMV). In addition it contained potential stem-loop structures with interspersed AUGC motifs characteristic for ilar- and alfamoviruses. This conserved primary and secondary structure in all 3' NCRs may be responsible for the interaction with homologous and heterologous CPs and subsequent activation of genome replication. The CP gene of an ApMV isolate (ApMV-G) of 657 nt has also been cloned and sequenced. Although ApMV and PNRSV have a distant serological relationship, the deduced amino acid sequences of their CPs have an identity of only 51.8%. The N termini of PNRSV and ApMV CPs have in common a zinc-finger motif and the potential to form an amphipathic helix.
Shendre, Aditi; Wiener, Howard W.; Irvin, Marguerite R.; Aouizerat, Bradley E.; Overton, Edgar T.; Lazar, Jason; Liu, Chenglong; Hodis, Howard N.; Limdi, Nita A.; Weber, Kathleen M.; Zhi, Degui; Floris-Moore, Michelle A.; Ofotokun, Ighovwerha; Qi, Qibin; Hanna, David B.; Kaplan, Robert C.
2017-01-01
Cardiovascular disease (CVD) is a major comorbidity among HIV-infected individuals. Common carotid artery intima-media thickness (cCIMT) is a valid and reliable subclinical measure of atherosclerosis and is known to predict CVD. We performed genome-wide association (GWA) and admixture analysis among 682 HIV-positive and 288 HIV-negative Black, non-Hispanic women from the Women’s Interagency HIV study (WIHS) cohort using a combined and stratified analysis approach. We found some suggestive associations but none of the SNPs reached genome-wide statistical significance in our GWAS analysis. The top GWAS SNPs were rs2280828 in the region intergenic to mediator complex subunit 30 and exostosin glycosyltransferase 1 (MED30 | EXT1) among all women, rs2907092 in the catenin delta 2 (CTNND2) gene among HIV-positive women, and rs7529733 in the region intergenic to family with sequence similarity 5, member C and regulator of G-protein signaling 18 (FAM5C | RGS18) genes among HIV-negative women. The most significant local European ancestry associations were in the region intergenic to the zinc finger and SCAN domain containing 5D gene and NADH: ubiquinone oxidoreductase complex assembly factor 1 (ZSCAN5D | NDUF1) pseudogene on chromosome 19 among all women, in the region intergenic to vomeronasal 1 receptor 6 pseudogene and zinc finger protein 845 (VN1R6P | ZNF845) gene on chromosome 19 among HIV-positive women, and in the region intergenic to the SEC23-interacting protein and phosphatidic acid phosphatase type 2 domain containing 1A (SEC23IP | PPAPDC1A) genes located on chromosome 10 among HIV-negative women. A number of previously identified SNP associations with cCIMT were also observed and included rs2572204 in the ryanodine receptor 3 (RYR3) and an admixture region in the secretion-regulating guanine nucleotide exchange factor (SERGEF) gene. We report several SNPs and gene regions in the GWAS and admixture analysis, some of which are common across HIV-positive and HIV-negative women as demonstrated using meta-analysis, and also across the two analytic approaches (i.e., GWA and admixture). These findings suggest that local European ancestry plays an important role in genetic associations of cCIMT among black women from WIHS along with other environmental factors that are related to CVD and may also be triggered by HIV. These findings warrant confirmation in independent samples. PMID:29206233
Pan, W J; Blackburn, E H
1995-01-01
The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211
Comparative sequence analysis of the X-inactivation center region in mouse, human, and bovine.
Chureau, Corinne; Prissette, Marine; Bourdet, Agnès; Barbe, Valérie; Cattolico, Laurence; Jones, Louis; Eggen, André; Avner, Philip; Duret, Laurent
2002-06-01
We have sequenced to high levels of accuracy 714-kb and 233-kb regions of the mouse and bovine X-inactivation centers (Xic), respectively, centered on the Xist gene. This has provided the basis for a fully annotated comparative analysis of the mouse Xic with the 2.3-Mb orthologous region in human and has allowed a three-way species comparison of the core central region, including the Xist gene. These comparisons have revealed conserved genes, both coding and noncoding, conserved CpG islands and, more surprisingly, conserved pseudogenes. The distribution of repeated elements, especially LINE repeats, in the mouse Xic region when compared to the rest of the genome does not support the hypothesis of a role for these repeat elements in the spreading of X inactivation. Interestingly, an asymmetric distribution of LINE elements on the two DNA strands was observed in the three species, not only within introns but also in intergenic regions. This feature is suggestive of important transcriptional activity within these intergenic regions. In silico prediction followed by experimental analysis has allowed four new genes, Cnbp2, Ftx, Jpx, and Ppnx, to be identified and novel, widespread, complex, and apparently noncoding transcriptional activity to be characterized in a region 5' of Xist that was recently shown to attract histone modification early after the onset of X inactivation.
Kwenda, Stanford; Birch, Paul R J; Moleleki, Lucy N
2016-08-11
Long noncoding RNAs (lncRNAs) represent a class of RNA molecules that are implicated in regulation of gene expression in both mammals and plants. While much progress has been made in determining the biological functions of lncRNAs in mammals, the functional roles of lncRNAs in plants are still poorly understood. Specifically, the roles of long intergenic nocoding RNAs (lincRNAs) in plant defence responses are yet to be fully explored. In this study, we used strand-specific RNA sequencing to identify 1113 lincRNAs in potato (Solanum tuberosum) from stem tissues. The lincRNAs are expressed from all 12 potato chromosomes and generally smaller in size compared to protein-coding genes. Like in other plants, most potato lincRNAs possess single exons. A time-course RNA-seq analysis between a tolerant and a susceptible potato cultivar showed that 559 lincRNAs are responsive to Pectobacterium carotovorum subsp. brasiliense challenge compared to mock-inoculated controls. Moreover, coexpression analysis revealed that 17 of these lincRNAs are highly associated with 12 potato defence-related genes. Together, these results suggest that lincRNAs have potential functional roles in potato defence responses. Furthermore, this work provides the first library of potato lincRNAs and a set of novel lincRNAs implicated in potato defences against P. carotovorum subsp. brasiliense, a member of the soft rot Enterobacteriaceae phytopathogens.
Ulianov, Sergey V; Galitsyna, Aleksandra A; Flyamer, Ilya M; Golov, Arkadiy K; Khrameeva, Ekaterina E; Imakaev, Maxim V; Abdennur, Nezar A; Gelfand, Mikhail S; Gavrilov, Alexey A; Razin, Sergey V
2017-07-11
In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. We found that, similarly to mammalian genome, the chicken genomes is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Fernández-Méndez, Mar; Turk-Kubo, Kendra A; Buttigieg, Pier L; Rapp, Josephine Z; Krumpen, Thomas; Zehr, Jonathan P; Boetius, Antje
2016-01-01
The Eurasian basin of the Central Arctic Ocean is nitrogen limited, but little is known about the presence and role of nitrogen-fixing bacteria. Recent studies have indicated the occurrence of diazotrophs in Arctic coastal waters potentially of riverine origin. Here, we investigated the presence of diazotrophs in ice and surface waters of the Central Arctic Ocean in the summer of 2012. We identified diverse communities of putative diazotrophs through targeted analysis of the nifH gene, which encodes the iron protein of the nitrogenase enzyme. We amplified 529 nifH sequences from 26 samples of Arctic melt ponds, sea ice and surface waters. These sequences resolved into 43 clusters at 92% amino acid sequence identity, most of which were non-cyanobacterial phylotypes from sea ice and water samples. One cyanobacterial phylotype related to Nodularia sp. was retrieved from sea ice, suggesting that this important functional group is rare in the Central Arctic Ocean. The diazotrophic community in sea-ice environments appear distinct from other cold-adapted diazotrophic communities, such as those present in the coastal Canadian Arctic, the Arctic tundra and glacial Antarctic lakes. Molecular fingerprinting of nifH and the intergenic spacer region of the rRNA operon revealed differences between the communities from river-influenced Laptev Sea waters and those from ice-related environments pointing toward a marine origin for sea-ice diazotrophs. Our results provide the first record of diazotrophs in the Central Arctic and suggest that microbial nitrogen fixation may occur north of 77°N. To assess the significance of nitrogen fixation for the nitrogen budget of the Arctic Ocean and to identify the active nitrogen fixers, further biogeochemical and molecular biological studies are needed.
Fernández-Méndez, Mar; Turk-Kubo, Kendra A.; Buttigieg, Pier L.; Rapp, Josephine Z.; Krumpen, Thomas; Zehr, Jonathan P.; Boetius, Antje
2016-01-01
The Eurasian basin of the Central Arctic Ocean is nitrogen limited, but little is known about the presence and role of nitrogen-fixing bacteria. Recent studies have indicated the occurrence of diazotrophs in Arctic coastal waters potentially of riverine origin. Here, we investigated the presence of diazotrophs in ice and surface waters of the Central Arctic Ocean in the summer of 2012. We identified diverse communities of putative diazotrophs through targeted analysis of the nifH gene, which encodes the iron protein of the nitrogenase enzyme. We amplified 529 nifH sequences from 26 samples of Arctic melt ponds, sea ice and surface waters. These sequences resolved into 43 clusters at 92% amino acid sequence identity, most of which were non-cyanobacterial phylotypes from sea ice and water samples. One cyanobacterial phylotype related to Nodularia sp. was retrieved from sea ice, suggesting that this important functional group is rare in the Central Arctic Ocean. The diazotrophic community in sea-ice environments appear distinct from other cold-adapted diazotrophic communities, such as those present in the coastal Canadian Arctic, the Arctic tundra and glacial Antarctic lakes. Molecular fingerprinting of nifH and the intergenic spacer region of the rRNA operon revealed differences between the communities from river-influenced Laptev Sea waters and those from ice-related environments pointing toward a marine origin for sea-ice diazotrophs. Our results provide the first record of diazotrophs in the Central Arctic and suggest that microbial nitrogen fixation may occur north of 77°N. To assess the significance of nitrogen fixation for the nitrogen budget of the Arctic Ocean and to identify the active nitrogen fixers, further biogeochemical and molecular biological studies are needed. PMID:27933047
Zhang, Yan; Cheng, Xiaoling; Liang, Hua; Jin, Zhenzhen
2018-04-25
Homeobox (HOX) transcript antisense RNA (HOTAIR) is a long intergenic non-coding RNA (lncRNA) that has been reported to be highly upregulated in several types of cancers. However, the role of HOTAIR in human cervical cancer is still unclear. We therefore investigated the expression and probable function of HOTAIR in cervical cancer cells. The expression of HOTAIR was examined in (HeLa, CaSki, ME-180, HT-3) and Human Cervical Epithelial Cells (HCerEpiC) by qRT-PCR. Transfection of si-NC, si-HOTAIR or si-STAT3 was carried out with the help of Lipofectamine 2000. The cell viability was assessed by CCK-8 assay. The cell migration and invasion was examined by wound healing and Boyden chamber assays. Protein expression was determined by western blotting. Our results showed that expression of HOTAIR was significantly upregulated in cervical cancer cells and inhibition of the expression of HOTAIR in HeLa cervical cancer cells resulted in suppression of cell proliferation, migration and invasion. Further, analysis of the promoter of HOTAIR, revealed that STAT3 could potentially regulate the activity of the HOTAIR in cervical cancer cells and inhibition of STAT3 had similar effects on the proliferation, migration and invasion of the cervical cancer cells as that of HOTAIR. Further, the suppression of STAT3 expression was associated with concomitant downregulation of IncRNA HOTAIR as indicated by the qRT-PCR. To unveil if STAT3 and HOTAIR have synergistic effects on the cell migration and invasion, si-STAT3 and si-HOTAIR were co-transformed into cervical HeLa cancer cells and it was observed that STAT3 and HOTAIR could synergistically inhibit the proliferation, migration and invasion of the cervical cancer cells. Taken together we conclude that HOTAIR and STAT3 synergistically regulate the proliferation, migration and invasion of cervical cancer cells. Copyright © 2018. Published by Elsevier B.V.
Toren, Dmitri; Barzilay, Thomer; Tacutu, Robi; Lehmann, Gilad; Muradian, Khachik K; Fraifeld, Vadim E
2016-01-04
Mitochondria are the only organelles in the animal cells that have their own genome. Due to a key role in energy production, generation of damaging factors (ROS, heat), and apoptosis, mitochondria and mtDNA in particular have long been considered one of the major players in the mechanisms of aging, longevity and age-related diseases. The rapidly increasing number of species with fully sequenced mtDNA, together with accumulated data on longevity records, provides a new fascinating basis for comparative analysis of the links between mtDNA features and animal longevity. To facilitate such analyses and to support the scientific community in carrying these out, we developed the MitoAge database containing calculated mtDNA compositional features of the entire mitochondrial genome, mtDNA coding (tRNA, rRNA, protein-coding genes) and non-coding (D-loop) regions, and codon usage/amino acids frequency for each protein-coding gene. MitoAge includes 922 species with fully sequenced mtDNA and maximum lifespan records. The database is available through the MitoAge website (www.mitoage.org or www.mitoage.info), which provides the necessary tools for searching, browsing, comparing and downloading the data sets of interest for selected taxonomic groups across the Kingdom Animalia. The MitoAge website assists in statistical analysis of different features of the mtDNA and their correlative links to longevity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zhang, Honghai; Chen, Lei
2011-03-01
The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
2012-01-01
Background Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic analyses of all 13 mitochondrial protein-coding gene sequences consistently yield trees that place pseudoscorpions as sister to acariform mites. Conclusion The well-supported phylogenetic placement of pseudoscorpions as sister to Acariformes differs from some previous analyses based on morphology. However, these two lineages share multiple molecular evolutionary traits, including substantial mitochondrial genome rearrangements, extensive nucleotide substitution, and loss of helices in their inferred tRNA and rRNA structures. PMID:22409411
Paliwal, Anupam; Temkin, Alexis M; Kerkel, Kristi; Yale, Alexander; Yotova, Iveta; Drost, Natalia; Lax, Simon; Nhan-Chang, Chia-Ling; Powell, Charles; Borczuk, Alain; Aviv, Abraham; Wapner, Ronald; Chen, Xiaowei; Nagy, Peter L; Schork, Nicholas; Do, Catherine; Torkamani, Ali; Tycko, Benjamin
2013-08-01
Allele-specific DNA methylation (ASM) is well studied in imprinted domains, but this type of epigenetic asymmetry is actually found more commonly at non-imprinted loci, where the ASM is dictated not by parent-of-origin but instead by the local haplotype. We identified loci with strong ASM in human tissues from methylation-sensitive SNP array data. Two index regions (bisulfite PCR amplicons), one between the C3orf27 and RPN1 genes in chromosome band 3q21 and the other near the VTRNA2-1 vault RNA in band 5q31, proved to be new examples of imprinted DMRs (maternal alleles methylated) while a third, between STEAP3 and C2orf76 in chromosome band 2q14, showed non-imprinted haplotype-dependent ASM. Using long-read bisulfite sequencing (bis-seq) in 8 human tissues we found that in all 3 domains the ASM is restricted to single differentially methylated regions (DMRs), each less than 2kb. The ASM in the C3orf27-RPN1 intergenic region was placenta-specific and associated with allele-specific expression of a long non-coding RNA. Strikingly, the discrete DMRs in all 3 regions overlap with binding sites for the insulator protein CTCF, which we found selectively bound to the unmethylated allele of the STEAP3-C2orf76 DMR. Methylation mapping in two additional genes with non-imprinted haplotype-dependent ASM, ELK3 and CYP2A7, showed that the CYP2A7 DMR also overlaps a CTCF site. Thus, two features of imprinted domains, highly localized DMRs and allele-specific insulator occupancy by CTCF, can also be found in chromosomal domains with non-imprinted ASM. Arguing for biological importance, our analysis of published whole genome bis-seq data from hES cells revealed multiple genome-wide association study (GWAS) peaks near CTCF binding sites with ASM.
Kerkel, Kristi; Yale, Alexander; Yotova, Iveta; Drost, Natalia; Lax, Simon; Nhan-Chang, Chia-Ling; Powell, Charles; Borczuk, Alain; Aviv, Abraham; Wapner, Ronald; Chen, Xiaowei; Nagy, Peter L.; Schork, Nicholas; Do, Catherine; Torkamani, Ali; Tycko, Benjamin
2013-01-01
Allele-specific DNA methylation (ASM) is well studied in imprinted domains, but this type of epigenetic asymmetry is actually found more commonly at non-imprinted loci, where the ASM is dictated not by parent-of-origin but instead by the local haplotype. We identified loci with strong ASM in human tissues from methylation-sensitive SNP array data. Two index regions (bisulfite PCR amplicons), one between the C3orf27 and RPN1 genes in chromosome band 3q21 and the other near the VTRNA2-1 vault RNA in band 5q31, proved to be new examples of imprinted DMRs (maternal alleles methylated) while a third, between STEAP3 and C2orf76 in chromosome band 2q14, showed non-imprinted haplotype-dependent ASM. Using long-read bisulfite sequencing (bis-seq) in 8 human tissues we found that in all 3 domains the ASM is restricted to single differentially methylated regions (DMRs), each less than 2kb. The ASM in the C3orf27-RPN1 intergenic region was placenta-specific and associated with allele-specific expression of a long non-coding RNA. Strikingly, the discrete DMRs in all 3 regions overlap with binding sites for the insulator protein CTCF, which we found selectively bound to the unmethylated allele of the STEAP3-C2orf76 DMR. Methylation mapping in two additional genes with non-imprinted haplotype-dependent ASM, ELK3 and CYP2A7, showed that the CYP2A7 DMR also overlaps a CTCF site. Thus, two features of imprinted domains, highly localized DMRs and allele-specific insulator occupancy by CTCF, can also be found in chromosomal domains with non-imprinted ASM. Arguing for biological importance, our analysis of published whole genome bis-seq data from hES cells revealed multiple genome-wide association study (GWAS) peaks near CTCF binding sites with ASM. PMID:24009515
Root-Bernstein, Robert; Root-Bernstein, Meredith
2016-05-21
We have proposed that the ribosome may represent a missing link between prebiotic chemistries and the first cells. One of the predictions that follows from this hypothesis, which we test here, is that ribosomal RNA (rRNA) must have encoded the proteins necessary for ribosomal function. In other words, the rRNA also functioned pre-biotically as mRNA. Since these ribosome-binding proteins (rb-proteins) must bind to the rRNA, but the rRNA also functioned as mRNA, it follows that rb-proteins should bind to their own mRNA as well. This hypothesis can be contrasted to a "null" hypothesis in which rb-proteins evolved independently of the rRNA sequences and therefore there should be no necessary similarity between the rRNA to which rb-proteins bind and the mRNA that encodes the rb-protein. Five types of evidence reported here support the plausibility of the hypothesis that the mRNA encoding rb-proteins evolved from rRNA: (1) the ubiquity of rb-protein binding to their own mRNAs and autogenous control of their own translation; (2) the higher-than-expected incidence of Arginine-rich modules associated with RNA binding that occurs in rRNA-encoded proteins; (3) the fact that rRNA-binding regions of rb-proteins are homologous to their mRNA binding regions; (4) the higher than expected incidence of rb-protein sequences encoded in rRNA that are of a high degree of homology to their mRNA as compared with a random selection of other proteins; and (5) rRNA in modern prokaryotes and eukaryotes encodes functional proteins. None of these results can be explained by the null hypothesis that assumes independent evolution of rRNA and the mRNAs encoding ribosomal proteins. Also noteworthy is that very few proteins bind their own mRNAs that are not associated with ribosome function. Further tests of the hypothesis are suggested: (1) experimental testing of whether rRNA-encoded proteins bind to rRNA at their coding sites; (2) whether tRNA synthetases, which are also known to bind to their own mRNAs, are encoded by the tRNA sequences themselves; (3) and the prediction that archaeal and prokaryotic (DNA-based) genomes were built around rRNA "genes" so that rRNA-related sequences will be found to make up an unexpectedly high proportion of these genomes. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
The complete mitochondrial genome sequence of Eimeria innocua (Eimeriidae, Coccidia, Apicomplexa).
Hafeez, Mian Abdul; Vrba, Vladimir; Barta, John Robert
2016-07-01
The complete mitochondrial genome of Eimeria innocua KR strain (Eimeriidae, Coccidia, Apicomplexa) was sequenced. This coccidium infects turkeys (Meleagris gallopavo), Bobwhite quails (Colinus virginianus), and Grey partridges (Perdix perdix). Genome organization and gene contents were comparable with other Eimeria spp. infecting galliform birds. The circular-mapping mt genome of E. innocua is 6247 bp in length with three protein-coding genes (cox1, cox3, and cytb), 19 gene fragments encoding large subunit (LSU) rRNA and 14 gene fragments encoding small subunit (SSU) rRNA. Like other Apicomplexa, no tRNA was encoded. The mitochondrial genome of E. innocua confirms its close phylogenetic affinities to Eimeria dispersa.
Rivera, I G; Chowdhury, M A; Huq, A; Jacobs, D; Martins, M T; Colwell, R R
1995-08-01
Enterobacterial repetitive intergenic consensus (ERIC) sequence polymorphism was studied in Vibrio Cholerae strains isolated before and after the cholera epidemic in Brazil (in 1991), along with epidemic strains from Peru, Mexico, and India, by PCR. A total of 17 fingerprint patterns (FPs) were detected in the V. cholerae strains examined; 96.7% of the toxigenic V. cholerae O1 strains and 100% of the O139 serogroup strains were found to belong to the same FP group comprising four fragments (FP1). The nontoxigenic V. cholerae O1 also yielded four fragments but constituted a different FP group (FP2). A total of 15 different patterns were observed among the V. cholerae non-O1 strains. Two patterns were observed most frequently for V. cholerae non-01 strains, 25% of which have FP3, with five fragments, and 16.7% of which have FP4, with two fragments. Three fragments, 1.75, 0.79, and 0.5 kb, were found to be common to both toxigenic and nontoxigenic V. cholerae O1 strains as well as to group FP3, containing V. cholerae non-O1 strains. Two fragments of group FP3, 1.3 and 1.0 kb, were present in FP1 and FP2 respectively. The 0.5-kb fragment was common to all strains and serogroups of V. cholerae analyzed. It is concluded from the results of this study, based on DNA FPs of environmental isolates, that it is possible to detect an emerging virulent strain in a cholera-endemic region. ERIC-PCR constitutes a powerful tool for determination of the virulence potential of V. cholerae O1 strains isolated in surveillance programs and for molecular epidemiological investigations.
Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Saez-Vasquez, Julio
2017-01-01
The nucleolus is the site of ribosomal RNA (rRNA) gene transcription, rRNA processing and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli by Fluorescence Activated Cell Sorting (FACS) and identified Nucleolus-Associated Chromatin Domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein, NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions and mostly inactive protein-coding genes. However, NADs also include active ribosomal RNA genes, and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance. PMID:27477271
Wright, Stan A; Lemenager, Debbie A; Tucker, James R; Armijos, M Veronica; Yamamoto, Sheryl A
2006-03-01
Birds from 45 species were sampled during three spring seasons from an isolated canyon on the Sutter Buttes in California for the presence of subadult stages of Ixodes pacificus Cooley & Kohls, and for infection with Borrelia burgdorferi Johnson, Schmid, Hyde, Steigerwalt & Brenner. These birds were found to have an infestation prevalence of 45%, a density of 1.7 ticks per bird, and an intensity of 3.8 ticks per infested bird. There was a significant difference in the I. pacificus infestations between canopy and ground-dwelling birds. Birds also demonstrated an overall infection with B. burgdorferi of 6.4% with significant difference between bird species. Amplification and subsequent sequencing of the 23s-5s rRNA intergenic spacer region of the Borrelia genome from one bird, a hermit thrush, Catharus guttatus (Nuttall), showed that the infection in this bird was caused by B. burgdorferi sensu stricto; the first such finding in a bird from the far west. Our results suggest that birds play a role in the distribution and maintenance of I. pacificus, and possibly of B. burgdoferi, at the Sutter Buttes, CA.
Extracellular vesicle-mediated export of fungal RNA
Peres da Silva, Roberta; Puccia, Rosana; Rodrigues, Marcio L.; Oliveira, Débora L.; Joffe, Luna S.; César, Gabriele V.; Nimrichter, Leonardo; Goldenberg, Samuel; Alves, Lysangela R.
2015-01-01
Extracellular vesicles (EVs) play an important role in the biology of various organisms, including fungi, in which they are required for the trafficking of molecules across the cell wall. Fungal EVs contain a complex combination of macromolecules, including proteins, lipids and glycans. In this work, we aimed to describe and characterize RNA in EV preparations from the human pathogens Cryptococcus neoformans, Paracoccidiodes brasiliensis and Candida albicans, and from the model yeast Saccharomyces cerevisiae. The EV RNA content consisted mostly of molecules less than 250 nt long and the reads obtained aligned with intergenic and intronic regions or specific positions within the mRNA. We identified 114 ncRNAs, among them, six small nucleolar (snoRNA), two small nuclear (snRNA), two ribosomal (rRNA) and one transfer (tRNA) common to all the species considered, together with 20 sequences with features consistent with miRNAs. We also observed some copurified mRNAs, as suggested by reads covering entire transcripts, including those involved in vesicle-mediated transport and metabolic pathways. We characterized for the first time RNA molecules present in EVs produced by fungi. Our results suggest that RNA-containing vesicles may be determinant for various biological processes, including cell communication and pathogenesis. PMID:25586039
Characterisation of Bergeyella spp. isolated from the nasal cavities of piglets.
Lorenzo de Arriba, M; Lopez-Serrano, S; Galofre-Mila, N; Aragon, V
2018-04-01
The aim of this study was to characterise bacteria in the genus Bergeyella isolated from the nasal passages of healthy piglets. Nasal swabs from 3 to 4 week-old piglets from eight commercial domestic pig farms and one wild boar farm were cultured under aerobic conditions. Twenty-nine Bergeyella spp. isolates were identified by partial 16S rRNA gene sequencing and 11 genotypes were discriminated by enterobacterial repetitive intergenic consensus (ERIC)-PCR. Bergeyella zoohelcum and Bergeyella porcorum were identified within the 11 genotypes. Bergeyella spp. isolates exhibited resistance to serum complement and phagocytosis, poor capacity to form biofilms and were able to adhere to epithelial cells. Maneval staining was consistent with the presence of a capsule. Multiple drug resistance (resistance to three or more classes of antimicrobial agents) was present in 9/11 genotypes, including one genotype isolated from wild boar with no history of antimicrobial use. In conclusion, Bergeyella spp. isolates from the nasal cavities of piglets showed some in vitro features indicative of a potential for virulence. Further studies are necessary to identify the role of Bergeyella spp. in disease and within the nasal microbiota of pigs. Copyright © 2018 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Antonov, Valery A; Tkachenko, Galina A; Altukhova, Viktoriya V; Savchenko, Sergey S; Zinchenko, Olga V; Viktorov, Dmitry V; Zamaraev, Valery S; Ilyukhin, Vladimir I; Alekseev, Vladimir V
2008-12-01
Burkholderia mallei and B. pseudomallei are highly pathogenic microorganisms for both humans and animals. Moreover, they are regarded as potential agents of bioterrorism. Thus, rapid and unequivocal detection and identification of these dangerous pathogens is critical. In the present study, we describe the use of an optimized protocol for the early diagnosis of experimental glanders and melioidosis and for the rapid differentiation and typing of Burkholderia strains. This experience with PCR-based identification methods indicates that single PCR targets (23S and 16S rRNA genes, 16S-23S intergenic region, fliC and type III secretion gene cluster) should be used with caution for identification of B. mallei and B. pseudomallei, and need to be used alongside molecular methods such as gene sequencing. Several molecular typing procedures have been used to identify genetically related B. pseudomallei and B. mallei isolates, including ribotyping, pulsed-field gel electrophoresis and multilocus sequence typing. However, these methods are time consuming and technically challenging for many laboratories. RAPD, variable amplicon typing scheme, Rep-PCR, BOX-PCR and multiple-locus variable-number tandem repeat analysis have been recommended by us for the rapid differentiation of B. mallei and B. pseudomallei strains.
Rizzardi, Kristina; Winiecka-Krusnell, Jadwiga; Ramliden, Miriam; Alm, Erik; Andersson, Sabina; Byfors, Sara
2015-02-01
Fourteen isolates of an unknown species identified as belonging to the genus Legionella by selective growth on BCYE agar were isolated from the biopurification systems of three different wood processing plants. The mip gene sequence of all 14 isolates was identical and a close match alignment revealed 86 % sequence similarity with Legionella pneumophila serogroup 8. The whole genome of isolate LEGN(T) was sequenced, and a phylogenetic tree based on the alignment of 16S rRNA, mip, rpoB, rnpB and the 23S-5S intergenic region clustered LEGN(T) with L. pneumophila ATCC 33152(T). Analysis of virulence factors showed that strain LEGN(T) carries the majority of known L. pneumophila virulence factors. An amoeba infection assay performed to assess the pathogenicity of strain LEGN(T) towards Acanthamoeba castellanii showed that it can establish a replication vacuole in A. castellanii but does not significantly affect replication of amoebae. Taken together, the results confirm that strain LEGN(T) represents a novel species of the genus Legionella, for which the name Legionella norrlandica sp. nov. is proposed. The type strain is LEGN(T) ( = ATCC BAA-2678(T) = CCUG 65936(T)). © 2015 IUMS.
Terminator Detection by Support Vector Machine Utilizing aStochastic Context-Free Grammar
DOE Office of Scientific and Technical Information (OSTI.GOV)
Francis-Lyon, Patricia; Cristianini, Nello; Holbrook, Stephen
2006-12-30
A 2-stage detector was designed to find rho-independent transcription terminators in the Escherichia coli genome. The detector includes a Stochastic Context Free Grammar (SCFG) component and a Support Vector Machine (SVM) component. To find terminators, the SCFG searches the intergenic regions of nucleotide sequence for local matches to a terminator grammar that was designed and trained utilizing examples of known terminators. The grammar selects sequences that are the best candidates for terminators and assigns them a prefix, stem-loop, suffix structure using the Cocke-Younger-Kasaami (CYK) algorithm, modified to incorporate energy affects of base pairing. The parameters from this inferred structure aremore » passed to the SVM classifier, which distinguishes terminators from non-terminators that score high according to the terminator grammar. The SVM was trained with negative examples drawn from intergenic sequences that include both featureless and RNA gene regions (which were assigned prefix, stem-loop, suffix structure by the SCFG), so that it successfully distinguishes terminators from either of these. The classifier was found to be 96.4% successful during testing.« less
Quach, Tommy; Brooks, Daniel M; Miranda, Hector C
2016-01-01
The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
The complete mitochondrial DNA of endemic Eastern Pacific coral (Porites panamensis).
Del Río-Portilla, Miguel A; Vargas-Peralta, Carmen E; Paz-García, David A; Lafarga De La Cruz, Fabiola; Balart, Eduardo F; García-de-León, Francisco J
2016-01-01
The mitogenome of the endemic coral Porites panamensis (Genbank accession number KJ546638) has a total length of 18,628 bp, and the arrangement consist of 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes and 2 transfer RNA (tRNA) genes. Gene order was equal to other scleractinian coral mitogenomes.
Liu, Huawei; Li, Zhiyong; Wang, Chao; Feng, Lin; Huang, Haitao; Liu, Changkui; Li, Fengxia
2016-01-01
As a long noncoding RNA, HOX transcript antisense intergenic RNA (HOTAIR) is highly expressed in many types of tumors. However, its expression and function in oral squamous cell carcinoma (OSCC) cells and tissues remains largely unknown. We herein studied the biological functions of HOTAIR in OSCC Tca8113 cells. Real-time quantitative PCR showed that HOTAIR, p21 and p53 mRNA expressions in doxorubicin (DOX)-treated or γ-ray-irradiated Tca8113 cells were up-regulated. Knockdown of p53 expression inhibited DOX-induced HOTAIR up-regulation, suggesting that DNA damage-induced HOTAIR expression may be associated with p53. Transfection and CCK-8 assays showed that compared with the control group, overexpression of HOTAIR promoted the proliferation of Tca8113 cells, while interfering with its expression played an opposite role. Flow cytometry exhibited that HOTAIR overexpression decreased the rate of DOX-induced apoptosis. When HOTAIR expression was inhibited by siRNA, the proportions of cells in G2/M and S phases increased and decreased respectively. Meanwhile, the rate of DOX-induced apoptosis rose. DNA damage-induced HOTAIR expression facilitated the proliferation of Tca8113 cells and decreased their apoptosis. However, whether the up-regulation depends on p53 still needs in-depth studies. PMID:27904675
Asaf, Sajjad; Khan, Abdul Latif; Khan, Muhammad Aaqil; Waqas, Muhammad; Kang, Sang-Mo; Yun, Byung-Wook; Lee, In-Jung
2017-08-08
We investigated the complete chloroplast (cp) genomes of non-model Arabidopsis halleri ssp. gemmifera and Arabidopsis lyrata ssp. petraea using Illumina paired-end sequencing to understand their genetic organization and structure. Detailed bioinformatics analysis revealed genome sizes of both subspecies ranging between 154.4~154.5 kbp, with a large single-copy region (84,197~84,158 bp), a small single-copy region (17,738~17,813 bp) and pair of inverted repeats (IRa/IRb; 26,264~26,259 bp). Both cp genomes encode 130 genes, including 85 protein-coding genes, eight ribosomal RNA genes and 37 transfer RNA genes. Whole cp genome comparison of A. halleri ssp. gemmifera and A. lyrata ssp. petraea, along with ten other Arabidopsis species, showed an overall high degree of sequence similarity, with divergence among some intergenic spacers. The location and distribution of repeat sequences were determined, and sequence divergences of shared genes were calculated among related species. Comparative phylogenetic analysis of the entire genomic data set and 70 shared genes between both cp genomes confirmed the previous phylogeny and generated phylogenetic trees with the same topologies. The sister species of A. halleri ssp. gemmifera is A. umezawana, whereas the closest relative of A. lyrata spp. petraea is A. arenicola.
Falk, K.; Batts, W.N.; Kvellestad, A.; Kurath, G.; Wiik-Nielsen, J.; Winton, J.R.
2008-01-01
Atlantic salmon paramyxovirus (ASPV) was isolated in 1995 from gills of farmed Atlantic salmon suffering from proliferative gill inflammation. The complete genome sequence of ASPV was determined, revealing a genome 16,968 nucleotides in length consisting of six non-overlapping genes coding for the nucleo- (N), phospho- (P), matrix- (M), fusion- (F), haemagglutinin-neuraminidase- (HN) and large polymerase (L) proteins in the order 3???-N-P-M-F-HN-L-5???. The various conserved features related to virus replication found in most paramyxoviruses were also found in ASPV. These include: conserved and complementary leader and trailer sequences, tri-nucleotide intergenic regions and highly conserved transcription start and stop signal sequences. The P gene expression strategy of ASPV was like that of the respiro-, morbilli- and henipaviruses, which express the P and C proteins from the primary transcript and edit a portion of the mRNA to encode V and W proteins. Sequence similarities among various features related to virus replication, pairwise comparisons of all deduced ASPV protein sequences with homologous regions from other members of the family Paramyxoviridae, and phylogenetic analyses of these amino acid sequences suggested that ASPV was a novel member of the sub-family Paramyxovirinae, most closely related to the respiroviruses. ?? 2008 Elsevier B.V. All rights reserved.
Liu, Mingjian; Fan, Xinpeng; Gao, Feng; Gao, Shan; Yu, Yuhe; Warren, Alan; Huang, Jie
2016-11-01
A cryptic species of the Tetrahymena pyriformis complex, Tetrahymena australis, has been known for a long time but never properly diagnosed based on taxonomic methods. The species name is thus invalid according to the International Code of Zoological Nomenclature. Recently, a population isolated from a freshwater lake in Wuhan, China was investigated using live observations, silver staining methods and gene sequence data. This organism can be separated from other described species of the T. pyriformis complex by its relatively small body size, the number of somatic kineties and differences in sequences of two genes, namely the small subunit ribosomal RNA (SSU rRNA) and the mitochondrial cytochrome c oxidase subunit I (cox1). We compared the SSU rRNA gene sequences of all available Tetrahymena species to reveal the nucleotide differences within this genus. The sequence of the Wuhan population is identical to two sequences of a previously isolated strain of T. australis (ATCC #30831). Phylogenetic analyses indicate that these three sequences (X56167, M98015, KT334373) cluster with Tetrahymena shanghaiensis (EF070256) in a polytomy. However, sequence divergence of the cox1 gene between the Wuhan population and another strain of T. australis (ATCC #30271) is 1.4%, suggesting that these may represent different subspecies. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.
Mycobacterium ahvazicum sp. nov., the nineteenth species of the Mycobacterium simiae complex.
Bouam, Amar; Heidarieh, Parvin; Shahraki, Abodolrazagh Hashemi; Pourahmad, Fazel; Mirsaeidi, Mehdi; Hashemzadeh, Mohamad; Baptiste, Emeline; Armstrong, Nicholas; Levasseur, Anthony; Robert, Catherine; Drancourt, Michel
2018-03-07
Four slowly growing mycobacteria isolates were isolated from the respiratory tract and soft tissue biopsies collected in four unrelated patients in Iran. Conventional phenotypic tests indicated that these four isolates were identical to Mycobacterium lentiflavum while 16S rRNA gene sequencing yielded a unique sequence separated from that of M. lentiflavum. One representative strain AFP-003 T was characterized as comprising a 6,121,237-bp chromosome (66.24% guanosine-cytosine content) encoding for 5,758 protein-coding genes, 50 tRNA and one complete rRNA operon. A total of 2,876 proteins were found to be associated with the mobilome, including 195 phage proteins. A total of 1,235 proteins were found to be associated with virulence and 96 with toxin/antitoxin systems. The genome of AFP-003 T has the genetic potential to produce secondary metabolites, with 39 genes found to be associated with polyketide synthases and non-ribosomal peptide syntases and 11 genes encoding for bacteriocins. Two regions encoding putative prophages and three OriC regions separated by the dnaA gene were predicted. Strain AFP-003 T genome exhibits 86% average nucleotide identity with Mycobacterium genavense genome. Genetic and genomic data indicate that strain AFP-003 T is representative of a novel Mycobacterium species that we named Mycobacterium ahvazicum, the nineteenth species of the expanding Mycobacterium simiae complex.
Darbani, Behrooz; Noeparvar, Shahin; Borg, Søren
2016-01-01
RNA circularization made by head-to-tail back-splicing events is involved in the regulation of gene expression from transcriptional to post-translational levels. By exploiting RNA-Seq data and down-stream analysis, we shed light on the importance of circular RNAs in plants. The results introduce circular RNAs as novel interactors in the regulation of gene expression in plants and imply the comprehensiveness of this regulatory pathway by identifying circular RNAs for a diverse set of genes. These genes are involved in several aspects of cellular metabolism as hormonal signaling, intracellular protein sorting, carbohydrate metabolism and cell-wall biogenesis, respiration, amino acid biosynthesis, transcription and translation, and protein ubiquitination. Additionally, these parental loci of circular RNAs, from both nuclear and mitochondrial genomes, encode for different transcript classes including protein coding transcripts, microRNA, rRNA, and long non-coding/microprotein coding RNAs. The results shed light on the mitochondrial exonic circular RNAs and imply the importance of circular RNAs for regulation of mitochondrial genes. Importantly, we introduce circular RNAs in barley and elucidate their cellular-level alterations across tissues and in response to micronutrients iron and zinc. In further support of circular RNAs' functional roles in plants, we report several cases where fluctuations of circRNAs do not correlate with the levels of their parental-loci encoded linear transcripts. PMID:27375638
Formighieri, Eduardo F; Tiburcio, Ricardo A; Armas, Eduardo D; Medrano, Francisco J; Shimo, Hugo; Carels, Nicolas; Góes-Neto, Aristóteles; Cotomacci, Carolina; Carazzolle, Marcelo F; Sardinha-Pinto, Naiara; Thomazella, Daniela P T; Rincones, Johana; Digiampietri, Luciano; Carraro, Dirce M; Azeredo-Espin, Ana M; Reis, Sérgio F; Deckmann, Ana C; Gramacho, Karina; Gonçalves, Marilda S; Moura Neto, José P; Barbosa, Luciana V; Meinhardt, Lyndel W; Cascardo, Júlio C M; Pereira, Gonçalo A G
2008-10-01
We present here the sequence of the mitochondrial genome of the basidiomycete phytopathogenic hemibiotrophic fungus Moniliophthora perniciosa, causal agent of the Witches' Broom Disease in Theobroma cacao. The DNA is a circular molecule of 109,103 base pairs, with 31.9% GC, and is the largest sequenced so far. This size is due essentially to the presence of numerous non-conserved hypothetical ORFs. It contains the 14 genes coding for proteins involved in the oxidative phosphorylation, the two rRNA genes, one ORF coding for a ribosomal protein (rps3), and a set of 26 tRNA genes that recognize codons for all amino acids. Seven homing endonucleases are located inside introns. Except atp8, all conserved known genes are in the same orientation. Phylogenetic analysis based on the cox genes agrees with the commonly accepted fungal taxonomy. An uncommon feature of this mitochondrial genome is the presence of a region that contains a set of four, relatively small, nested, inverted repeats enclosing two genes coding for polymerases with an invertron-type structure and three conserved hypothetical genes interpreted as the stable integration of a mitochondrial linear plasmid. The integration of this plasmid seems to be a recent evolutionary event that could have implications in fungal biology. This sequence is available under GenBank accession number AY376688.
RNA therapeutics: RNAi and antisense mechanisms and clinical applications.
Chery, Jessica
2016-07-01
RNA therapeutics refers to the use of oligonucleotides to target primarily ribonucleic acids (RNA) for therapeutic efforts or in research studies to elucidate functions of genes. Oligonucleotides are distinct from other pharmacological modalities, such as small molecules and antibodies that target mainly proteins, due to their mechanisms of action and chemical properties. Nucleic acids come in two forms: deoxyribonucleic acids (DNA) and ribonucleic acids (RNA). Although DNA is more stable, RNA offers more structural variety ranging from messenger RNA (mRNA) that codes for protein to non-coding RNAs, microRNA (miRNA), transfer RNA (tRNA), short interfering RNAs (siRNAs), ribosomal RNA (rRNA), and long-noncoding RNAs (lncRNAs). As our understanding of the wide variety of RNAs deepens, researchers have sought to target RNA since >80% of the genome is estimated to be transcribed. These transcripts include non-coding RNAs such as miRNAs and siRNAs that function in gene regulation by playing key roles in the transfer of genetic information from DNA to protein, the final product of the central dogma in biology 1 . Currently there are two main approaches used to target RNA: double stranded RNA-mediated interference (RNAi) and antisense oligonucleotides (ASO). Both approaches are currently in clinical trials for targeting of RNAs involved in various diseases, such as cancer and neurodegeneration. In fact, ASOs targeting spinal muscular atrophy and amyotrophic lateral sclerosis have shown positive results in clinical trials 2 . Advantages of ASOs include higher affinity due to the development of chemical modifications that increase affinity, selectivity while decreasing toxicity due to off-target effects. This review will highlight the major therapeutic approaches of RNA medicine currently being applied with a focus on RNAi and ASOs.
Xu, Qin; Xiong, Guanjun; Li, Pengbo; He, Fei; Huang, Yi; Wang, Kunbo; Li, Zhaohu; Hua, Jinping
2012-01-01
Background Cotton (Gossypium spp.) is a model system for the analysis of polyploidization. Although ascertaining the donor species of allotetraploid cotton has been intensively studied, sequence comparison of Gossypium chloroplast genomes is still of interest to understand the mechanisms underlining the evolution of Gossypium allotetraploids, while it is generally accepted that the parents were A- and D-genome containing species. Here we performed a comparative analysis of 13 Gossypium chloroplast genomes, twelve of which are presented here for the first time. Methodology/Principal Findings The size of 12 chloroplast genomes under study varied from 159,959 bp to 160,433 bp. The chromosomes were highly similar having >98% sequence identity. They encoded the same set of 112 unique genes which occurred in a uniform order with only slightly different boundary junctions. Divergence due to indels as well as substitutions was examined separately for genome, coding and noncoding sequences. The genome divergence was estimated as 0.374% to 0.583% between allotetraploid species and A-genome, and 0.159% to 0.454% within allotetraploids. Forty protein-coding genes were completely identical at the protein level, and 20 intergenic sequences were completely conserved. The 9 allotetraploids shared 5 insertions and 9 deletions in whole genome, and 7-bp substitutions in protein-coding genes. The phylogenetic tree confirmed a close relationship between allotetraploids and the ancestor of A-genome, and the allotetraploids were divided into four separate groups. Progenitor allotetraploid cotton originated 0.43–0.68 million years ago (MYA). Conclusion Despite high degree of conservation between the Gossypium chloroplast genomes, sequence variations among species could still be detected. Gossypium chloroplast genomes preferred for 5-bp indels and 1–3-bp indels are mainly attributed to the SSR polymorphisms. This study supports that the common ancestor of diploid A-genome species in Gossypium is the maternal source of extant allotetraploid species and allotetraploids have a monophyletic origin. G. hirsutum AD1 lineages have experienced more sequence variations than other allotetraploids in intergenic regions. The available complete nucleotide sequences of 12 Gossypium chloroplast genomes should facilitate studies to uncover the molecular mechanisms of compartmental co-evolution and speciation of Gossypium allotetraploids. PMID:22876273
USDA-ARS?s Scientific Manuscript database
We tested a method of estimating the activity of detectable individual bacterial and archaeal OTUs within a community by calculating ratios of absolute 16S rRNA to rDNA copy numbers. We investigated phylogenetically coherent patterns of activity among soil prokaryotes in non-growing soil communitie...
USDA-ARS?s Scientific Manuscript database
Three new non-ascosporic, ascomycetous yeast genera are proposed based on their isolation from currently described species and genera. Phylogenetic placement of the genera was determined from analysis of nuclear gene sequences for D1/D2 large subunit rRNA, small subunit rRNA, translation elongation...
Chen, Nian; Lai, Xiao-Ping
2010-07-01
We obtained the complete mitochondrial genome of King Cobra(GenBank accession number: EU_921899) by Ex Taq-PCR, TA-cloning and primer-walking methods. This genome is very similar to other vertebrate, which is 17 267 bp in length and encodes 38 genes (including 13 protein-coding, 2 ribosomal RNA and 23 transfer RNA genes) and two long non-coding regions. The duplication of tRNA-Ile gene forms a new mitochondrial gene rearrangement model. Eight tRNA genes and one protein genes were transcribed from L strand, and the other genes were transcribed genes from H strand. Genes on the H strand show a fairly similar content of Adenosine and Thymine respectively, whereas those on the L strand have higher proportion of A than T. Combined rDNA sequence data (12S+16S rRNA) were used to reconstruct the phylogeny of 21 snake species for which complete mitochondrial genome sequences were available in the public databases. This large data set and an appropriate range of outgroup taxa demonstrated that Elapidae is more closely related to colubridae than viperidae, which supports the traditional viewpoints.
Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.
Olson, Link E; Yoder, Anne D
2002-01-01
The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.
Bardet, Lucie; Cimmino, Teresa; Buffet, Clémence; Michelle, Caroline; Rathored, Jaishriram; Tandina, Fatalmoudou; Lagier, Jean-Christophe; Khelaifia, Saber; Abrahão, Jônatas; Raoult, Didier; Rolain, Jean-Marc
2018-02-01
Culturomics is a new postgenomics field that explores the microbial diversity of the human gut coupled with taxono-genomic strategy. Culturomics, and the microbiome science more generally, are anticipated to transform global health diagnostics and inform the ways in which gut microbial diversity contributes to human health and disease, and by extension, to personalized medicine. Using culturomics, we report in this study the description of strain CB1 T ( = CSUR P1334 = DSM 29075), a new species isolated from a stool specimen from a 37-year-old Brazilian woman. This description includes phenotypic characteristics and complete genome sequence and annotation. Strain CB1 T is a gram-negative aerobic and motile bacillus, exhibits neither catalase nor oxidase activities, and presents a 98.3% 16S rRNA sequence similarity with Pseudomonas putida. The 4,723,534 bp long genome contains 4239 protein-coding genes and 74 RNA genes, including 15 rRNA genes (5 16S rRNA, 4 23S rRNA, and 6 5S rRNA) and 59 tRNA genes. Strain CB1 T was named Pseudomonas massiliensis sp. nov. and classified into the family Pseudomonadaceae. This study demonstrates the usefulness of microbial culturomics in exploration of human microbiota in diverse geographies and offers new promise for incorporating new omics technologies for innovation in diagnostic medicine and global health.
Analysis of ribosomal RNA stability in dead cells of wine yeast by quantitative PCR.
Sunyer-Figueres, Merce; Wang, Chunxiao; Mas, Albert
2018-04-02
During wine production, some yeasts enter a Viable But Not Culturable (VBNC) state, which may influence the quality and stability of the final wine through remnant metabolic activity or by resuscitation. Culture-independent techniques are used for obtaining an accurate estimation of the number of live cells, and quantitative PCR could be the most accurate technique. As a marker of cell viability, rRNA was evaluated by analyzing its stability in dead cells. The species-specific stability of rRNA was tested in Saccharomyces cerevisiae, as well as in three species of non-Saccharomyces yeast (Hanseniaspora uvarum, Torulaspora delbrueckii and Starmerella bacillaris). High temperature and antimicrobial dimethyl dicarbonate (DMDC) treatments were efficient in lysing the yeast cells. rRNA gene and rRNA (as cDNA) were analyzed over 48 h after cell lysis by quantitative PCR. The results confirmed the stability of rRNA for 48 h after the cell lysis treatments. To sum up, rRNA may not be a good marker of cell viability in the wine yeasts that were tested. Copyright © 2018 Elsevier B.V. All rights reserved.
Lynch, Ryan C.; Darcy, John L.; Kane, Nolan C.; Nemergut, Diana R.; Schmidt, Steve K.
2014-01-01
Previous surveys of very dry Atacama Desert mineral soils have consistently revealed sparse communities of non-photosynthetic microbes. The functional nature of these microorganisms remains debatable given the harshness of the environment and low levels of biomass and diversity. The aim of this study was to gain an understanding of the phylogenetic community structure and metabolic potential of a low-diversity mineral soil metagenome that was collected from a high-elevation Atacama Desert volcano debris field. We pooled DNA extractions from over 15 g of volcanic material, and using whole genome shotgun sequencing, observed only 75–78 total 16S rRNA gene OTUs3%. The phylogenetic structure of this community is significantly under dispersed, with actinobacterial lineages making up 97.9–98.6% of the 16S rRNA genes, suggesting a high degree of environmental selection. Due to this low diversity and uneven community composition, we assembled and analyzed the metabolic pathways of the most abundant genome, a Pseudonocardia sp. (56–72% of total 16S genes). Our assembly and binning efforts yielded almost 4.9 Mb of Pseudonocardia sp. contigs, which accounts for an estimated 99.3% of its non-repetitive genomic content. This genome contains a limited array of carbohydrate catabolic pathways, but encodes for CO2 fixation via the Calvin cycle. The genome also encodes complete pathways for the catabolism of various trace gases (H2, CO and several organic C1 compounds) and the assimilation of ammonia and nitrate. We compared genomic content among related Pseudonocardia spp. and estimated rates of non-synonymous and synonymous nucleic acid substitutions between protein coding homologs. Collectively, these comparative analyses suggest that the community structure and various functional genes have undergone strong selection in the nutrient poor desert mineral soils and high-elevation atmospheric conditions. PMID:25566214
USDA-ARS?s Scientific Manuscript database
Long noncoding RNAs (lncRNAs) have been recognized in recent years as key regulators of diverse cellular processes. Genome-wide large-scale projects have uncovered thousands of lncRNAs in many model organisms. Large intergenic noncoding RNAs (lincRNAs) are lncRNAs that are transcribed from intergeni...
[Identification of medicinal plant Dendrobium based on the chloroplast psbK-psbI intergenic spacer].
Yao, Hui; Yang, Pei; Zhou, Hong; Ma, Shuang-jiao; Song, Jing-yuan; Chen, Shi-lin
2015-06-01
In this paper, the chloroplast psbK-psbI intergenic spacers of 18 species of Dendrobium and their adulterants were amplified and sequenced, and then the sequence characteristics were analyzed. The sequence lengths of chloroplast psbK-psbI regions of Dendrobium ranged from 474 to 513 bp and the GC contents were 25.4%-27.6%. The variable sites were 71 while the informative sites were 46. The inter-specific genetic distances calculated by Kimura 2-parameter (K2P) of Dendrobium were 0.006 1-0.058 1, with an average of 0.028 4. The K2P genetic distances between Dendrobium species and Bulbophyllum odoratissimum were 0.093 2-0.120 4. The NJ tree showed that the Dendrobium species can be easily differentiated from each other and 6 samples of the inspected Dendrobium species were identified successfully through sequencing the psbK-psbI intergenic spacer. Therefore, the chloroplast psbK-psbI intergenic spacer can be used as a candidate marker to identify Dendrobium species and its adulterants.
Machida, I; Saeki, T; Nakai, S
1986-03-01
The effects of far (254 nm) and near (290-350 nm) ultraviolet (UV) light on mutations, intragenic and intergenic recombinations were compared in diploid strains of Saccharomyces cerevisiae. At equivalent survival levels there was not much difference in the induction of nonsense and missense mutations between far- and near-UV radiations. However, frameshift mutations were induced more frequently by near-UV than by far-UV radiation. Near-UV radiation induced intragenic recombination (gene conversion) as efficiently as far-UV radiation and the induced levels were similar in both radiations at equitoxic doses. A strikingly higher frequency was observed for the intergenic recombination induced by near-UV radiation than by far-UV radiation when compared at equivalent survival levels. Photoreactivation reduced the frequency only slightly in far-UV induced intergenic recombination and not at all in near-UV induction. These results indicate that near-UV damage involves strand breakage in addition to pyrimidine dimers and other lesions induced, whereas far-UV damage consists largely of photoreactivable lesions, pyrimidine dimers, and near-UV induced damage is more efficient for the induction of crossing-over.
HATAKEYAMA, YOSHINORI; SHIBUYA, NORIHIRO; NISHIYAMA, TAKASHI; NAKASHIMA, NOBUHIKO
2004-01-01
The intergenic region (IGR) located upstream of the capsid protein gene in dicistroviruses contains an internal ribosome entry site (IRES). Translation initiation mediated by the IRES does not require initiator methionine tRNA. Comparison of the IGRs among dicistroviruses suggested that Taura syndrome virus (TSV) and acute bee paralysis virus have an extra side stem loop in the predicted IRES. We examined whether the side stem is responsible for translation activity mediated by the IGR using constructs with compensatory mutations. In vitro translation analysis showed that TSV has an IGR-IRES that is structurally distinct from those previously described. Because IGR-IRES elements determine the translation initiation site by virtue of their own tertiary structure formation, the discovery of this initiation mechanism suggests the possibility that eukaryotic mRNAs might have more extensive coding regions than previously predicted. To test this hypothesis, we searched full-length cDNA databases and whole genome sequences of eukaryotes using the pattern matching program, Scan For Matches, with parameters that can extract sequences containing secondary structure elements resembling those of IGR-IRES. Our search yielded several sequences, but their predicted secondary structures were suggested to be unstable in comparison to those of dicistroviruses. These results suggest that RNAs structurally similar to dicistroviruses are not common. If some eukaryotic mRNAs are translated independently of an initiator methionine tRNA, their structures are likely to be significantly distinct from those of dicistroviruses. PMID:15100433
Kang, Sang-Ho; Lee, Jeong-Hoon; Lee, Hyun Oh; Ahn, Byoung Ohg; Won, So Youn; Sohn, Seong-Han; Kim, Jung Sun
2017-10-06
Glycyrrhiza uralensis and G. glabra, members of the Fabaceae, are medicinally important species that are native to Asia and Europe. Extracts from these plants are widely used as natural sweeteners because of their much greater sweetness than sucrose. In this study, the three complete chloroplast genomes and five 45S nuclear ribosomal (nr)DNA sequences of these two licorice species and an interspecific hybrid are presented. The chloroplast genomes of G. glabra, G. uralensis and G. glabra × G. uralensis were 127,895 bp, 127,716 bp and 127,939 bp, respectively. The three chloroplast genomes harbored 110 annotated genes, including 76 protein-coding genes, 30 tRNA genes and 4 rRNA genes. The 45S nrDNA sequences were either 5,947 or 5,948 bp in length. Glycyrrhiza glabra and G. glabra × G. uralensis showed two types of nrDNA, while G. uralensis contained a single type. The complete 45S nrDNA sequence unit contains 18S rRNA, ITS1, 5.8S rRNA, ITS2 and 26S rRNA. We identified simple sequence repeat and tandem repeat sequences. We also developed four reliable markers for analysis of Glycyrrhiza diversity authentication.
Zhao, Shanrong; Zhang, Ying; Gamini, Ramya; Zhang, Baohong; von Schack, David
2018-03-19
To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.
Lee, Tzuu-fen; Gurazada, Sai Guna Ranjan; Zhai, Jixian; Li, Shengben; Simon, Stacey A; Matzke, Marjori A; Chen, Xuemei; Meyers, Blake C
2012-07-01
In plants, heterochromatin is maintained by a small RNA-based gene silencing mechanism known as RNA-directed DNA methylation (RdDM). RdDM requires the non-redundant functions of two plant-specific DNA-dependent RNA polymerases (RNAP), RNAP IV and RNAP V. RNAP IV plays a major role in siRNA biogenesis, while RNAP V may recruit DNA methylation machinery to target endogenous loci for silencing. Although small RNA-generating regions that are dependent on both RNAP IV and RNAP V have been identified previously, the genomic loci targeted by RNAP V for siRNA accumulation and silencing have not been described extensively. To characterize the RNAP V-dependent, heterochromatic siRNA-generating regions in the Arabidopsis genome, we deeply sequenced the small RNA populations of wild-type and RNAP V null mutant (nrpe1) plants. Our results showed that RNAP V-dependent siRNA-generating loci are associated predominately with short repetitive sequences in intergenic regions. Suppression of small RNA production from short repetitive sequences was also prominent in RdDM mutants including dms4, drd1, dms3 and rdm1, reflecting the known association of these RdDM effectors with RNAP V. The genomic regions targeted by RNAP V were small, with an estimated average length of 238 bp. Our results suggest that RNAP V affects siRNA production from genomic loci with features dissimilar to known RNAP IV-dependent loci. RNAP V, along with RNAP IV and DRM1/2, may target and silence a set of small, intergenic transposable elements located in dispersed genomic regions for silencing. Silencing at these loci may be actively reinforced by RdDM.
Yang, Yaodong; Mason, Annaliese S.; Lei, Xintao; Ma, Zilong
2013-01-01
MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events. PMID:23951162
Xiao, Yong; Xia, Wei; Yang, Yaodong; Mason, Annaliese S; Lei, Xintao; Ma, Zilong
2013-01-01
MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events.
Freire-Picos, M A; Landeira-Ameijeiras, V; Mayán, María D
2013-07-01
The correct distribution of nuclear domains is critical for the maintenance of normal cellular processes such as transcription and replication, which are regulated depending on their location and surroundings. The most well-characterized nuclear domain, the nucleolus, is essential for cell survival and metabolism. Alterations in nucleolar structure affect nuclear dynamics; however, how the nucleolus and the rest of the nuclear domains are interconnected is largely unknown. In this report, we demonstrate that RNAP-II is vital for the maintenance of the typical crescent-shaped structure of the nucleolar rDNA repeats and rRNA transcription. When stalled RNAP-II molecules are not bound to the chromatin, the nucleolus loses its typical crescent-shaped structure. However, the RNAP-II interaction with Seh1p, or cryptic transcription by RNAP-II, is not critical for morphological changes. Copyright © 2013 John Wiley & Sons, Ltd.
Lee, Hae-Won; Kim, Dae-Won; Lee, Mi-Hwa; Kim, Byung-Yong; Cho, Yong-Joon; Yim, Kyung June; Song, Hye Seon; Rhee, Jin-Kyu; Seo, Myung-Ji; Choi, Hak-Jong; Choi, Jong-Soon; Lee, Dong-Gi; Yoon, Changmann; Nam, Young-Do; Roh, Seong Woon
2015-01-01
An extremely halophilic archaeon, Haladaptatus cibarius D43(T), was isolated from traditional Korean salt-rich fermented seafood. Strain D43(T) shows the highest 16S rRNA gene sequence similarity (98.7 %) with Haladaptatus litoreus RO1-28(T), is Gram-negative staining, motile, and extremely halophilic. Despite potential industrial applications of extremely halophilic archaea, their genome characteristics remain obscure. Here, we describe the whole genome sequence and annotated features of strain D43(T). The 3,926,724 bp genome includes 4,092 protein-coding and 57 RNA genes (including 6 rRNA and 49 tRNA genes) with an average G + C content of 57.76 %.
Camarena-Rosales, Faustino; Del Río-Portilla, Miguel A; Ruiz-Campos, Gorgonio; García-De-León, Francisco J
2016-11-01
The complete mitochondrial genome sequence of the Desert Pupfish, Cyprinodon macularius (Gene accession number KM985373) has a length of 16,940 bp, and the arrangement consisted of 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes and 22 transfer RNA, which are similar to other known mitogenomes for the family Cyprinodontidae.
Duquesne, Véronique; Delcont, Aurélie; Huleux, Anthéa; Beven, Véronique; Touzain, Fabrice; Ribière-Chabert, Magali
2017-11-02
We report here the full mitochondrial genome sequence of Aethina tumida , a Nitidulidae species beetle, that is a pest of bee hives. The obtained sequence is 16,576 bp in length and contains 13 protein-coding genes, 2 rRNA genes, and 22 tRNAs. Copyright © 2017 Duquesne et al.
USDA-ARS?s Scientific Manuscript database
The Kauffman White (KW) serotyping method requires more than 250 antisera to characterize more than 2,500 Salmonella serovars. The complexity of serotyping could be overcome using molecular methods. In this study, a dkgB-linked intergenic sequence ribotyping (ISR) method that generates sequence occu...
Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.
2013-01-01
Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312
Spontaneous Mutation Rate in the Smallest Photosynthetic Eukaryotes
Krasovec, Marc; Eyre-Walker, Adam; Sanchez-Ferandin, Sophie
2017-01-01
Abstract Mutation is the ultimate source of genetic variation, and knowledge of mutation rates is fundamental for our understanding of all evolutionary processes. High throughput sequencing of mutation accumulation lines has provided genome wide spontaneous mutation rates in a dozen model species, but estimates from nonmodel organisms from much of the diversity of life are very limited. Here, we report mutation rates in four haploid marine bacterial-sized photosynthetic eukaryotic algae; Bathycoccus prasinos, Ostreococcus tauri, Ostreococcus mediterraneus, and Micromonas pusilla. The spontaneous mutation rate between species varies from μ = 4.4 × 10−10 to 9.8 × 10−10 mutations per nucleotide per generation. Within genomes, there is a two-fold increase of the mutation rate in intergenic regions, consistent with an optimization of mismatch and transcription-coupled DNA repair in coding sequences. Additionally, we show that deviation from the equilibrium GC content increases the mutation rate by ∼2% to ∼12% because of a GC bias in coding sequences. More generally, the difference between the observed and equilibrium GC content of genomes explains some of the inter-specific variation in mutation rates. PMID:28379581
NASA Astrophysics Data System (ADS)
Gao, Fengtao; Wei, Min; Zhu, Ying; Guo, Hua; Chen, Songlin; Yang, Guanpin
2017-06-01
This study presents the complete mitochondrial genome of the hybrid Epinephelus moara♀× Epinephelus lanceolatus♂. The genome is 16886 bp in length, and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, a light-strand replication origin and a control region. Additionally, phylogenetic analysis based on the nucleotide sequences of 13 conserved protein-coding genes using the maximum likelihood method indicated that the mitochondrial genome is maternally inherited. This study presents genomic data for studying phylogenetic relationships and breeding of hybrid Epinephelinae.
Wang, Mingling; Qiu, Jian-Wen
2016-05-01
We report the complete mitochondrial genome (mitogenome) of the giant ramshorn snail Marisa cornuarietis, a biocontrol agent of freshwater weeds and snail vectors of schistosomes. The mitogenome is 15,923 bp in length, encoding 13 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs. The mitogenome is A+T biased (70.0%), with 28.9% A, 41.1% T, 16.7% G, and 13.3% C. A comparison with Pomacea canaliculata, the other member in the same family (Ampullariidae) with a sequenced mitogenome, shows that the two species have an identical gene order, but their intergenic regions vary substantially in sequence length. The mitogenome data can be used to understand the population genetics of M. cornuarietis, and resolve the phylogenetic relationship of various genera in Ampullariidae.
A draft sequence of the rice genome (Oryza sativa L. ssp. indica).
Yu, Jun; Hu, Songnian; Wang, Jun; Wong, Gane Ka-Shu; Li, Songgang; Liu, Bin; Deng, Yajun; Dai, Li; Zhou, Yan; Zhang, Xiuqing; Cao, Mengliang; Liu, Jing; Sun, Jiandong; Tang, Jiabin; Chen, Yanjiong; Huang, Xiaobing; Lin, Wei; Ye, Chen; Tong, Wei; Cong, Lijuan; Geng, Jianing; Han, Yujun; Li, Lin; Li, Wei; Hu, Guangqiang; Huang, Xiangang; Li, Wenjie; Li, Jian; Liu, Zhanwei; Li, Long; Liu, Jianping; Qi, Qiuhui; Liu, Jinsong; Li, Li; Li, Tao; Wang, Xuegang; Lu, Hong; Wu, Tingting; Zhu, Miao; Ni, Peixiang; Han, Hua; Dong, Wei; Ren, Xiaoyu; Feng, Xiaoli; Cui, Peng; Li, Xianran; Wang, Hao; Xu, Xin; Zhai, Wenxue; Xu, Zhao; Zhang, Jinsong; He, Sijie; Zhang, Jianguo; Xu, Jichen; Zhang, Kunlin; Zheng, Xianwu; Dong, Jianhai; Zeng, Wanyong; Tao, Lin; Ye, Jia; Tan, Jun; Ren, Xide; Chen, Xuewei; He, Jun; Liu, Daofeng; Tian, Wei; Tian, Chaoguang; Xia, Hongai; Bao, Qiyu; Li, Gang; Gao, Hui; Cao, Ting; Wang, Juan; Zhao, Wenming; Li, Ping; Chen, Wei; Wang, Xudong; Zhang, Yong; Hu, Jianfei; Wang, Jing; Liu, Song; Yang, Jian; Zhang, Guangyu; Xiong, Yuqing; Li, Zhijie; Mao, Long; Zhou, Chengshu; Zhu, Zhen; Chen, Runsheng; Hao, Bailin; Zheng, Weimou; Chen, Shouyi; Guo, Wei; Li, Guojie; Liu, Siqi; Tao, Ming; Wang, Jian; Zhu, Lihuang; Yuan, Longping; Yang, Huanming
2002-04-05
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.
Luz, Bruna Louise Pereira; Capel, Kátia Cristina Cruz; Stampar, Sérgio Nascimento; Kitahara, Marcelo Visentini
2016-07-01
Dendrophylliidae is one of the few monophyletic families within the Scleractinia that embraces zooxanthellate and azooxanthellate species represented by both solitary and colonial forms. Among the exclusively azooxanthellate genera, Dendrophyllia is reported worldwide from 1 to 1200 m deep. To date, although three complete mitochondrial (mt) genomes from representatives of the family are available, only that from Turbinaria peltata has been formally published. Here we describe the complete nucleotide sequence of the mt genome from Dendrophyllia arbuscula that is 19 069 bp in length and comprises two rDNAs, two tRNAs, and 13 protein-coding genes arranged in the canonical scleractinian mt gene order. No genes overlap, resulting in the presence of 18 intergenic spacers and one of the longest scleractinian mt genome sequenced to date.
2013-01-01
Background Small non-coding RNAs (ncRNAs) are important regulators of gene expression in eukaryotes. Previously, only microRNAs (miRNAs) and piRNAs have been identified in the silkworm, Bombyx mori. Furthermore, only ncRNAs (50-500nt) of intermediate size have been systematically identified in the silkworm. Results Here, we performed a systematic identification and analysis of small RNAs (18-50nt) associated with the Bombyx mori argonaute2 (BmAgo2) protein. Using RIP-seq, we identified various types of small ncRNAs associated with BmAGO2. These ncRNAs showed a multimodal length distribution, with three peaks at ~20nt, ~27nt and ~33nt, which included tRNA-, transposable element (TE)-, rRNA-, snoRNA- and snRNA-derived small RNAs as well as miRNAs and piRNAs. The tRNA-derived fragments (tRFs) were found at an extremely high abundance and accounted for 69.90% of the BmAgo2-associated small RNAs. Northern blotting confirmed that many tRFs were expressed or up-regulated only in the BmNPV-infected cells, implying that the tRFs play a prominent role by binding to BmAgo2 during BmNPV infection. Additional evidence suggested that there are potential cleavage sites on the D, anti-codon and TψC loops of the tRNAs. TE-derived small RNAs and piRNAs also accounted for a significant proportion of the BmAgo2-associated small RNAs, suggesting that BmAgo2 could be involved in the maintenance of genome stability by suppressing the activities of transposons guided by these small RNAs. Finally, Northern blotting was also used to confirm the Bombyx 5.8 s rRNA-derived small RNAs, demonstrating that various novel small RNAs exist in the silkworm. Conclusions Using an RIP-seq method in combination with Northern blotting, we identified various types of small RNAs associated with the BmAgo2 protein, including tRNA-, TE-, rRNA-, snoRNA- and snRNA-derived small RNAs as well as miRNAs and piRNAs. Our findings provide new clues for future functional studies of the role of small RNAs in insect development and evolution. PMID:24074203
Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.
Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W
2011-09-01
Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.
Dennis, P P
1977-01-01
The fraction of the total ribonucleic acid (RNA) synthesis rate that is messenger RNA (mRNA) for ribosomal protein (r-protein) and ribosomal RNA (rRNA) has been estimated in valS(Ts) rel+ stringent and valS(Ts) relA1 relaxed strains of Escherichia coli during a partial inhibition of valyl-transfer RNA aminoacylation. The partial inhibition was accomplished by shifting the strains from the permissive growth temperature of 29.5 degrees C to the semipermissive temperature of 35.5 degrees C. The RNA synthesized at the elevated temperature was pulse labeled with [3H]uracil. The fraction of the total incorpoarted 3H radioactivity in r-protein mRNA or in rRNA was estimated by specific hybridization to the transducing phages gammaspc1, which carries about 15 r-protein genes and lambdailv5, which carries an rRNA transcription unit. The results clearly demonstrate that the rel gene influences the fraction of the total RNA synthesis rate that is r protein mRNA and rRNA; in the rel+ strain they are significantly increased relative to control cultures. This indicates that the expression of the genes coding for the RNA and protein component of the ribosome are most likely regulated at the level of transcription. Furthermore, it appears that the distribution of functioning RNA polymerase between rRNA genes, r-protein genes, and other types of genes is influenced by the rel gene control system; presumably this influence is mediated through the unusual nucleotide guanosine tetraphosphate. PMID:320185
Zhu, Bo; Zhang, Wenli; Jiang, Jiming
2015-01-01
Enhancers are important regulators of gene expression in eukaryotes. Enhancers function independently of their distance and orientation to the promoters of target genes. Thus, enhancers have been difficult to identify. Only a few enhancers, especially distant intergenic enhancers, have been identified in plants. We developed an enhancer prediction system based exclusively on the DNase I hypersensitive sites (DHSs) in the Arabidopsis thaliana genome. A set of 10,044 DHSs located in intergenic regions, which are away from any gene promoters, were predicted to be putative enhancers. We examined the functions of 14 predicted enhancers using the β-glucuronidase gene reporter. Ten of the 14 (71%) candidates were validated by the reporter assay. We also designed 10 constructs using intergenic sequences that are not associated with DHSs, and none of these constructs showed enhancer activities in reporter assays. In addition, the tissue specificity of the putative enhancers can be precisely predicted based on DNase I hypersensitivity data sets developed from different plant tissues. These results suggest that the open chromatin signature-based enhancer prediction system developed in Arabidopsis may serve as a universal system for enhancer identification in plants. PMID:26373455
Shendre, Aditi; Wiener, Howard W.; Zhi, Degui; Vazquez, Ana I; Portman, Michael A.; Shrestha, Sadeep
2014-01-01
Kawasaki disease (KD) is a diffuse and acute small-vessel vasculitis observed in children and has genetic and autoimmune components. We genotyped 112 case-parent trios of European decent (confirmed by AIMS) using the ImmunoChip array and performed association analyses with susceptibility to KD and IVIG non-response. KD susceptibility was assessed using the transmission disequilibrium test whereas IVIG non-response was evaluated using multivariable logistic regression analysis. We replicated SNPs in three gene regions (FCGR, CD40/CDH22, and HLA-DQB2/HLA-DOB) that have been previously associated with KD and provide support to other findings of several novel SNPs in genes with potential pathway in KD pathogenesis. SNP rs838143 in the 3′ UTR of FUT1 gene (2.7×10-5) and rs9847915 in the intergenic region of LOC730109 ∣ BRD7P2 (6.81×10-7) were the top hits for KD susceptibility in additive and dominant models, respectively. The top hits for IVIG responsiveness were rs1200332 in the intergenic region of BAZ1A ∣ C14orf19 (1.4×10-4) and rs4889606 in the intron of the STX1B gene (6.95×10-5) in additive and dominant models, respectively. Our study suggests that genes and biological pathways involved in autoimmune diseases play an important role in the pathogenesis of KD and IVIG response mechanism. PMID:25101798
USDA-ARS?s Scientific Manuscript database
The complete circular mitochondrial genome of D. reticulatum is 14,048 bp in length, consisting of 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, and 2 ribosomal RNA (rRNA) genes (GenBank accession number: KY765589). The overall base composition was 31.0 % A, 12.2 % C, 17.7 % G and 39...
2010-01-01
Background Snake mitochondrial genomes are of great interest in understanding mitogenomic evolution because of gene duplications and rearrangements and the fast evolutionary rate of their genes compared to other vertebrates. Mitochondrial gene sequences have also played an important role in attempts to resolve the contentious phylogenetic relationships of especially the early divergences among alethinophidian snakes. Two recent innovative studies found dramatic gene- and branch-specific relative acceleration in snake protein-coding gene evolution, particularly along internal branches leading to Serpentes and Alethinophidia. It has been hypothesized that some of these rate shifts are temporally (and possibly causally) associated with control region duplication and/or major changes in ecology and anatomy. Results The near-complete mitochondrial (mt) genomes of three henophidian snakes were sequenced: Anilius scytale, Rhinophis philippinus, and Charina trivirgata. All three genomes share a duplicated control region and translocated tRNALEU, derived features found in all alethinophidian snakes studied to date. The new sequence data were aligned with mt genome data for 21 other species of snakes and used in phylogenetic analyses. Phylogenetic results agreed with many other studies in recovering several robust clades, including Colubroidea, Caenophidia, and Cylindrophiidae+Uropeltidae. Nodes within Henophidia that have been difficult to resolve robustly in previous analyses remained uncompellingly resolved here. Comparisons of relative rates of evolution of rRNA vs. protein-coding genes were conducted by estimating branch lengths across the tree. Our expanded sampling revealed dramatic acceleration along the branch leading to Typhlopidae, particularly long rRNA terminal branches within Scolecophidia, and that most of the dramatic acceleration in protein-coding gene rate along Serpentes and Alethinophidia branches occurred before Anilius diverged from other alethinophidians. Conclusions Mitochondrial gene sequence data alone may not be able to robustly resolve basal divergences among alethinophidian snakes. Taxon sampling plays an important role in identifying mitogenomic evolutionary events within snakes, and in testing hypotheses explaining their origin. Dramatic rate shifts in mitogenomic evolution occur within Scolecophidia as well as Alethinophidia, thus falsifying the hypothesis that these shifts in snakes are associated exclusively with evolution of a non-burrowing lifestyle, macrostomatan feeding ecology and/or duplication of the control region, both restricted to alethinophidians among living snakes. PMID:20055998
Toxicity phenotype does not correlate with phylogeny of Cylindrospermopsis raciborskii strains.
Stucken, Karina; Murillo, Alejandro A; Soto-Liebe, Katia; Fuentes-Valdés, Juan J; Méndez, Marco A; Vásquez, Mónica
2009-02-01
Cylindrospermopsis raciborskii is a species of freshwater, bloom-forming cyanobacterium. C. raciborskii produces toxins, including cylindrospermopsin (hepatotoxin) and saxitoxin (neurotoxin), although non toxin-producing strains are also observed. In spite of differences in toxicity, C. raciborskii strains comprise a monophyletic group, based upon 16S rRNA gene sequence identities (greater than 99%). We performed phylogenetic analyses; 16S rRNA gene and 16S-23S rRNA gene internally transcribed spacer (ITS-1) sequence comparisons, and genomic DNA restriction fragment length polymorphism (RFLP), resolved by pulsed-field gel electrophoresis (PFGE), of strains of C. raciborskii, obtained mainly from the Australian phylogeographic cluster. Our results showed no correlation between toxic phenotype and phylogenetic association in the Australian strains. Analyses of the 16S rRNA gene and the respective ITS-1 sequences (long L, and short S) showed an independent evolution of each ribosomal operon. The genes putatively involved in the cylindrospermopsin biosynthetic pathway were present in one locus and only in the hepatotoxic strains, demonstrating a common genomic organization for these genes and the absence of mutated or inactivated biosynthetic genes in the non toxic strains. In summary, our results support the hypothesis that the genes involved in toxicity may have been transferred as an island by processes of gene lateral transfer, rather than convergent evolution.
The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.
Kim, K S; Lee, S E; Jeong, H W; Ha, J H
1998-10-01
The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.
Maruyama, Takuro; Kawahara, Nobuo; Yokoyama, Kazumasa; Makino, Yukiko; Fukiharu, Toshimitsu; Goda, Yukihiro
2006-11-10
"Magic mushroom (MM)" is the name most commonly given to psychoactive fungi containing the hallucinogenic components: psilocin (1) and psilocybin (2). We investigated the rRNA gene (internal transcribed spacer (ITS) and large subunit (LSU)) of two Panaeolus species and four Psilocybe species fungi (of these, two are non-psilocybin species). On the basis of sequence alignment, we improved the identification system developed in our previous study. In this paper, we describe the new system capable of distinguishing MMs from non-psilocybin Psilocybe species, its application data and the phylogeny of MM species.
Casal, G; Matos, E; Teles-Grilo, M L; Azevedo, C
2008-08-01
A fish-infecting Microsporidia Potaspora morhaphis n. gen., n. sp. found adherent to the wall of the coelomic cavity of the freshwater fish, Potamorhaphis guianensis, from lower Amazon River is described, based on light microscope and ultrastructural characteristics. This microsporidian forms whitish xenomas distinguished by the numerous filiform and anastomosed microvilli. The xenoma was completely filled by several developmental stages. In all of these stages, the nuclei are monokaryotic and develop in direct contact with host cell cytoplasm. The merogonial plasmodium divides by binary fission and the disporoblastic pyriform spores of sporont origin measure 2.8+/-0.3 x 1.5+/-0.2 microm. In mature spores the polar filament was arranged into 9-10 coils in 2 layers. The polaroplast had 2 distinct regions around the manubrium and an electron-dense globule was observed. The small subunit, intergenic space and partial large subunit rRNA gene were sequenced and maximum parsimony analysis placed the microsporidian described here in the clade that includes the genera Kabatana, Microgemma, Spraguea and Tetramicra. The ultrastructural morphology of the xenoma, and the developmental stages including the spores of this microsporidian parasite, as well as the phylogenetic analysis, suggest the erection of a new genus and species.
Microbial community dynamics in the rhizosphere of a cadmium hyper-accumulator
NASA Astrophysics Data System (ADS)
Wood, J. L.; Zhang, C.; Mathews, E. R.; Tang, C.; Franks, A. E.
2016-11-01
Phytoextraction is influenced by the indigenous soil microbial communities during the remediation of heavy metal contaminated soils. Soil microbial communities can affect plant growth, metal availability and the performance of phytoextraction-assisting inocula. Understanding the basic ecology of indigenous soil communities associated with the phytoextraction process, including the interplay between selective pressures upon the communities, is an important step towards phytoextraction optimization. This study investigated the impact of cadmium (Cd), and the presence of a Cd-accumulating plant, Carpobrotus rossii (Haw.) Schwantes, on the structure of soil-bacterial and fungal communities using automated ribosomal intergenic spacer analysis (ARISA) and quantitative PCR (qPCR). Whilst Cd had no detectable influence upon fungal communities, bacterial communities underwent significant structural changes with no reduction in 16S rRNA copy number. The presence of C. rossii influenced the structure of all communities and increased ITS copy number. Suites of operational taxonomic units (OTUs) changed in abundance in response to either Cd or C. rossii, however we found little evidence to suggest that the two selective pressures were acting synergistically. The Cd-induced turnover in bacterial OTUs suggests that Cd alters competition dynamics within the community. Further work to understand how competition is altered could provide a deeper understanding of the microbiome-plant-environment and aid phytoextraction optimization.
Gonçalves, Daniela Dib; Carreira, Teresa; Nunes, Mónica; Benitez, Aline; Lopes-Mori, Fabiana Maria Ruiz; Vidotto, Odilon; de Freitas, Julio Cesar; Vieira, Maria Luísa
2013-01-01
The aim of this study was to investigate the presence of DNA of Borrelia burgdorferi sensu lato (s.l.) in ticks that feed on horses used for animal traction in rural Jataizinho, Parana, Brazil. Between February and June 2008, a total of 224 ticks was collected of which 75% were identified as Dermacentor nitens and 25% as Amblyomma cajenense. To amplify B. burgdorferi s.l. DNA, the intergenic space region (ISR) between the 5S (rrf) 23S (rrl) rRNA genes was used as targets for nested-PCR. Two ticks of the D. nitens species were positive for B. burgdorferi s.l. Both species showed a fragment of 184 bp, but the sequencing revealed 99.9% homology with the B. burgdorferi sensu stricto (s.s.) strain B31. These results showed, for the first time, the presence of spirochete DNA infecting ticks that parasitize horses used for animal traction, in the rural municipality mentioned. In conclusion, this study opens up promising prospects for determining the infection rate of B. burgdorferi s.s. genospecies or other species in the equine population, as well as the impact of the infection rate on Lyme disease in the state of Parana. PMID:24516456
Microbial community dynamics in the rhizosphere of a cadmium hyper-accumulator
Wood, J. L.; Zhang, C.; Mathews, E. R.; Tang, C.; Franks, A. E.
2016-01-01
Phytoextraction is influenced by the indigenous soil microbial communities during the remediation of heavy metal contaminated soils. Soil microbial communities can affect plant growth, metal availability and the performance of phytoextraction-assisting inocula. Understanding the basic ecology of indigenous soil communities associated with the phytoextraction process, including the interplay between selective pressures upon the communities, is an important step towards phytoextraction optimization. This study investigated the impact of cadmium (Cd), and the presence of a Cd-accumulating plant, Carpobrotus rossii (Haw.) Schwantes, on the structure of soil-bacterial and fungal communities using automated ribosomal intergenic spacer analysis (ARISA) and quantitative PCR (qPCR). Whilst Cd had no detectable influence upon fungal communities, bacterial communities underwent significant structural changes with no reduction in 16S rRNA copy number. The presence of C. rossii influenced the structure of all communities and increased ITS copy number. Suites of operational taxonomic units (OTUs) changed in abundance in response to either Cd or C. rossii, however we found little evidence to suggest that the two selective pressures were acting synergistically. The Cd-induced turnover in bacterial OTUs suggests that Cd alters competition dynamics within the community. Further work to understand how competition is altered could provide a deeper understanding of the microbiome-plant-environment and aid phytoextraction optimization. PMID:27805014
Vigneron, Adrien; Cruaud, Perrine; Roussel, Erwan G.; Pignet, Patricia; Caprais, Jean-Claude; Callac, Nolwenn; Ciobanu, Maria-Cristina; Godfroy, Anne; Cragg, Barry A.; Parkes, John R.; Van Nostrand, Joy D.; He, Zhili; Zhou, Jizhong; Toffin, Laurent
2014-01-01
Subsurface sediments of the Sonora Margin (Guaymas Basin), located in proximity of active cold seep sites were explored. The taxonomic and functional diversity of bacterial and archaeal communities were investigated from 1 to 10 meters below the seafloor. Microbial community structure and abundance and distribution of dominant populations were assessed using complementary molecular approaches (Ribosomal Intergenic Spacer Analysis, 16S rRNA libraries and quantitative PCR with an extensive primers set) and correlated to comprehensive geochemical data. Moreover the metabolic potentials and functional traits of the microbial community were also identified using the GeoChip functional gene microarray and metabolic rates. The active microbial community structure in the Sonora Margin sediments was related to deep subsurface ecosystems (Marine Benthic Groups B and D, Miscellaneous Crenarchaeotal Group, Chloroflexi and Candidate divisions) and remained relatively similar throughout the sediment section, despite defined biogeochemical gradients. However, relative abundances of bacterial and archaeal dominant lineages were significantly correlated with organic carbon quantity and origin. Consistently, metabolic pathways for the degradation and assimilation of this organic carbon as well as genetic potentials for the transformation of detrital organic matters, hydrocarbons and recalcitrant substrates were detected, suggesting that chemoorganotrophic microorganisms may dominate the microbial community of the Sonora Margin subsurface sediments. PMID:25099369
Safe-Site Effects on Rhizosphere Bacterial Communities in a High-Altitude Alpine Environment
Zerbe, Stefan
2014-01-01
The rhizosphere effect on bacterial communities associated with three floristic communities (RW, FI, and M sites) which differed for the developmental stages was studied in a high-altitude alpine ecosystem. RW site was an early developmental stage, FI was an intermediate stage, M was a later more matured stage. The N and C contents in the soils confirmed a different developmental stage with a kind of gradient from the unvegetated bare soil (BS) site through RW, FI up to M site. The floristic communities were composed of 21 pioneer plants belonging to 14 species. Automated ribosomal intergenic spacer analysis showed different bacterial genetic structures per each floristic consortium which differed also from the BS site. When plants of the same species occurred within the same site, almost all their bacterial communities clustered together exhibiting a plant species effect. Unifrac significance value (P < 0.05) on 16S rRNA gene diversity revealed significant differences (P < 0.05) between BS site and the vegetated sites with a weak similarity to the RW site. The intermediate plant colonization stage FI did not differ significantly from the RW and the M vegetated sites. These results pointed out the effect of different floristic communities rhizospheres on their soil bacterial communities. PMID:24995302
Rapid Quantitative Detection of Lactobacillus sakei in Meat and Fermented Sausages by Real-Time PCR
Martín, Belén; Jofré, Anna; Garriga, Margarita; Pla, Maria; Aymerich, Teresa
2006-01-01
A quick and simple method for quantitative detection of Lactobacillus sakei in fermented sausages was successfully developed. It is based on Chelex-100-based DNA purification and real-time PCR enumeration using a TaqMan fluorescence probe. Primers and probes were designed in the L. sakei 16S-23S rRNA intergenic transcribed spacer region, and the assay was evaluated using L. sakei genomic DNA and an artificially inoculated sausage model. The detection limit of this technique was approximately 3 cells per reaction mixture using both purified DNA and the inoculated sausage model. The quantification limit was established at 30 cells per reaction mixture in both models. The assay was then applied to enumerate L. sakei in real samples, and the results were compared to the MRS agar count method followed by confirmation of the percentage of L. sakei colonies. The results obtained by real-time PCR were not statistically significantly different than those obtained by plate count on MRS agar (P > 0.05), showing a satisfactory agreement between both methods. Therefore, the real-time PCR assay developed can be considered a promising rapid alternative method for the quantification of L. sakei and evaluation of the implantation of starter strains of L. sakei in fermented sausages. PMID:16957227
Rapid quantitative detection of Lactobacillus sakei in meat and fermented sausages by real-time PCR.
Martín, Belén; Jofré, Anna; Garriga, Margarita; Pla, Maria; Aymerich, Teresa
2006-09-01
A quick and simple method for quantitative detection of Lactobacillus sakei in fermented sausages was successfully developed. It is based on Chelex-100-based DNA purification and real-time PCR enumeration using a TaqMan fluorescence probe. Primers and probes were designed in the L. sakei 16S-23S rRNA intergenic transcribed spacer region, and the assay was evaluated using L. sakei genomic DNA and an artificially inoculated sausage model. The detection limit of this technique was approximately 3 cells per reaction mixture using both purified DNA and the inoculated sausage model. The quantification limit was established at 30 cells per reaction mixture in both models. The assay was then applied to enumerate L. sakei in real samples, and the results were compared to the MRS agar count method followed by confirmation of the percentage of L. sakei colonies. The results obtained by real-time PCR were not statistically significantly different than those obtained by plate count on MRS agar (P > 0.05), showing a satisfactory agreement between both methods. Therefore, the real-time PCR assay developed can be considered a promising rapid alternative method for the quantification of L. sakei and evaluation of the implantation of starter strains of L. sakei in fermented sausages.
Bastardo, A; Bohle, H; Ravelo, C; Toranzo, A E; Romalde, J L
2011-02-22
We investigated 11 strains of Yersinia ruckeri, the causative agent of enteric redmouth disease (ERM), that had been isolated from Atlantic salmon Salmo salar L. farmed in Chile and previously vaccinated against ERM. Phylogenetic analysis of the 16S rRNA gene sequences confirmed the identification of the salmon isolates as Y. ruckeri. A comparative analysis of the biochemical characteristics was made by means of traditional and commercial miniaturised methods. All studied isolates were motile and Tween 80 positive, and were identified as biotype 1. In addition, drug susceptibility tests determined high sensitivity to sulphamethoxazole/trimethroprim, oxytetracycline, ampicillin and enrofloxacin in all isolates. Serological assays showed the presence of O1a, O1b and O2b serotypes, with a predominance of the O1b serotype in 9 strains. Analysis of the lipopolysaccharide profiles and the correspondent immunoblot confirmed these results. Sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) of the outer membrane proteins revealed that all Chilean strains had profiles with a molecular weight range between 34 and 55 kDa, with 3 distinct groups based on differences in the major bands. Genotyping analyses by enterobacterial repetitive intergenic consensus (ERIC-) and repetitive extragenic palindromic (REP-)PCR techniques clearly indicated intraspecific genetic diversity among Chilean Y. ruckeri strains.
Ma, Xin-Ye; Xie, Cai-Xiang; Liu, Chang; Song, Jing-Yuan; Yao, Hui; Luo, Kun; Zhu, Ying-Jie; Gao, Ting; Pang, Xiao-Hui; Qian, Jun; Chen, Shi-Lin
2010-01-01
Medicinal pteridophytes are an important group used in traditional Chinese medicine; however, there is no simple and universal way to differentiate various species of this group by morphological traits. A novel technology termed "DNA barcoding" could discriminate species by a standard DNA sequence with universal primers and sufficient variation. To determine whether DNA barcoding would be effective for differentiating pteridophyte species, we first analyzed five DNA sequence markers (psbA-trnH intergenic region, rbcL, rpoB, rpoC1, and matK) using six chloroplast genomic sequences from GeneBank and found psbA-trnH intergenic region the best candidate for availability of universal primers. Next, we amplified the psbA-trnH region from 79 samples of medicinal pteridophyte plants. These samples represented 51 species from 24 families, including all the authentic pteridophyte species listed in the Chinese pharmacopoeia (2005 version) and some commonly used adulterants. We found that the sequence of the psbA-trnH intergenic region can be determined with both high polymerase chain reaction (PCR) amplification efficiency (94.1%) and high direct sequencing success rate (81.3%). Combined with GeneBank data (54 species cross 12 pteridophyte families), species discriminative power analysis showed that 90.2% of species could be separated/identified successfully by the TaxonGap method in conjunction with the Basic Local Alignment Search Tool 1 (BLAST1) method. The TaxonGap method results further showed that, for 37 out of 39 separable species with at least two samples each, between-species variation was higher than the relevant within-species variation. Thus, the psbA-trnH intergenic region is a suitable DNA marker for species identification in medicinal pteridophytes.
Vogel, Jörg; Bartels, Verena; Tang, Thean Hock; Churakov, Gennady; Slagter-Jäger, Jacoba G.; Hüttenhofer, Alexander; Wagner, E. Gerhart H.
2003-01-01
Recent bioinformatics-aided searches have identified many new small RNAs (sRNAs) in the intergenic regions of the bacterium Escherichia coli. Here, a shot-gun cloning approach (RNomics) was used to generate cDNA libraries of small sized RNAs. Besides many of the known sRNAs, we found new species that were not predicted previously. The present work brings the number of sRNAs in E.coli to 62. Experimental transcription start site mapping showed that some sRNAs were encoded from independent genes, while others were processed from mRNA leaders or trailers, indicative of a parallel transcriptional output generating sRNAs co-expressed with mRNAs. Two of these RNAs (SroA and SroG) consist of known (THI and RFN) riboswitch elements. We also show that two recently identified sRNAs (RyeB and SraC/RyeA) interact, resulting in RNase III-dependent cleavage. To the best of our knowledge, this represents the first case of two non-coding RNAs interacting by a putative antisense mechanism. In addition, intracellular metabolic stabilities of sRNAs were determined, including ones from previous screens. The wide range of half-lives (<2 to >32 min) indicates that sRNAs cannot generally be assumed to be metabolically stable. The experimental characterization of sRNAs analyzed here suggests that the definition of an sRNA is more complex than previously assumed. PMID:14602901
Pichon, Christophe; du Merle, Laurence; Caliot, Marie Elise; Trieu-Cuot, Patrick; Le Bouguénec, Chantal
2012-04-01
Characterization of small non-coding ribonucleic acids (sRNA) among the large volume of data generated by high-throughput RNA-seq or tiling microarray analyses remains a challenge. Thus, there is still a need for accurate in silico prediction methods to identify sRNAs within a given bacterial species. After years of effort, dedicated software were developed based on comparative genomic analyses or mathematical/statistical models. Although these genomic analyses enabled sRNAs in intergenic regions to be efficiently identified, they all failed to predict antisense sRNA genes (asRNA), i.e. RNA genes located on the DNA strand complementary to that which encodes the protein. The statistical models enabled any genomic region to be analyzed theorically but not efficiently. We present a new model for in silico identification of sRNA and asRNA candidates within an entire bacterial genome. This model was successfully used to analyze the Gram-negative Escherichia coli and Gram-positive Streptococcus agalactiae. In both bacteria, numerous asRNAs are transcribed from the complementary strand of genes located in pathogenicity islands, strongly suggesting that these asRNAs are regulators of the virulence expression. In particular, we characterized an asRNA that acted as an enhancer-like regulator of the type 1 fimbriae production involved in the virulence of extra-intestinal pathogenic E. coli.
Pichon, Christophe; du Merle, Laurence; Caliot, Marie Elise; Trieu-Cuot, Patrick; Le Bouguénec, Chantal
2012-01-01
Characterization of small non-coding ribonucleic acids (sRNA) among the large volume of data generated by high-throughput RNA-seq or tiling microarray analyses remains a challenge. Thus, there is still a need for accurate in silico prediction methods to identify sRNAs within a given bacterial species. After years of effort, dedicated software were developed based on comparative genomic analyses or mathematical/statistical models. Although these genomic analyses enabled sRNAs in intergenic regions to be efficiently identified, they all failed to predict antisense sRNA genes (asRNA), i.e. RNA genes located on the DNA strand complementary to that which encodes the protein. The statistical models enabled any genomic region to be analyzed theorically but not efficiently. We present a new model for in silico identification of sRNA and asRNA candidates within an entire bacterial genome. This model was successfully used to analyze the Gram-negative Escherichia coli and Gram-positive Streptococcus agalactiae. In both bacteria, numerous asRNAs are transcribed from the complementary strand of genes located in pathogenicity islands, strongly suggesting that these asRNAs are regulators of the virulence expression. In particular, we characterized an asRNA that acted as an enhancer-like regulator of the type 1 fimbriae production involved in the virulence of extra-intestinal pathogenic E. coli. PMID:22139924
Cao, Shuang-Shuang; Du, Yu-Zhou
2014-09-15
The mitogenome of Chilo auricilius (Lepidoptera: Pyraloidea: Crambidae) was a circular molecule made up of 15,367 bp. Sesamia inferens, Chilo suppressalis, Tryporyza incertulas, and C. auricilius, are closely related, well known rice stem borers that are widely distributed in the main rice-growing regions of China. The gene order and orientation of all four stem borers were similar to that of other insect mitogenomes. Among the four stem borers, all AT contents were below 83%, while all AT contents of tRNA genes were above 80%. The genomes were compact, with only 121-257 bp of non-coding intergenic spacer. There are 56 or 62-bp overlapping nucleotides in Crambidae moths, but were only 25-bp overlapping nucleotides in the noctuid moth S. inferens. There was a conserved motif 'ATACTAAA' between trnS2 (UCN) and nad1 in Crambidae moths, but this same region was 'ATCATA' in the noctuid S. inferens. And there was a 6-bp motif 'ATGATAA' of overlapping nucleotides, which was conserved in Lepidoptera, and a 14-bp motif 'TAAGCTATTTAAAT' conserved in the three Crambidae moths (C. suppressalis, C. auricilius and T. incertulas), but not in the noctuid. Finally, there were no stem-and-loop structures in the two Chilo moths. Copyright © 2014 Elsevier B.V. All rights reserved.
Tsuji, K; Tsien, H C; Hanson, R S; DePalma, S R; Scholtz, R; LaRoche, S
1990-01-01
16S ribosomal RNAs (rRNA) of 12 methylotrophic bacteria have been almost completely sequenced to establish their phylogenetic relationships. Methylotrophs that are physiologically related are phylogenetically diverse and are scattered among the purple eubacteria (class Proteobacteria). Group I methylotrophs can be classified in the beta- and the gamma-subdivisions and group II methylotrophs in the alpha-subdivision of the purple eubacteria, respectively. Pink-pigmented facultative and non-pigmented obligate group II methylotrophs form two distinctly separate branches within the alpha-subdivision. The secondary structures of the 16S rRNA sequences of 'Methylocystis parvus' strain OBBP, 'Methylosinus trichosporium' strain OB3b, 'Methylosporovibrio methanica' strain 81Z and Hyphomicrobium sp. strain DM2 are similar, and these non-pigmented obligate group II methylotrophs form one tight cluster in the alpha-subdivision. The pink-pigmented facultative methylotrophs, Methylobacterium extorquens strain AM1, Methylobacterium sp. strain DM4 and Methylobacterium organophilum strain XX form another cluster within the alpha-subdivision. Although similar in phenotypic characteristics, Methylobacterium organophilum strain XX and Methylobacterium extorquens strain AM1 are clearly distinguishable by their 16S rRNA sequences. The group I methylotrophs, Methylophilus methylotrophus strain AS1 and methylotrophic species DM11, which do not utilize methane, are similar in 16S rRNA sequence to bacteria in the beta-subdivision. The methane-utilizing, obligate group I methanotrophs, Methylococcus capsulatus strain BATH and Methylomonas methanica, are placed in the gamma-subdivision. The results demonstrate that it is possible to distinguish and classify the methylotrophic bacteria using 16S rRNA sequence analysis.
Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.
2003-01-01
We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452
Zhang, Yue; Feng, Shiqian; Zeng, Yiying; Ning, Hong; Liu, Lijun; Zhao, Zihua; Jiang, Fan; Li, Zhihong
2018-06-23
Bactrocera tsuneonis (Miyake), generally known as the Japanese orange fly, is considered to be a major pest of commercial citrus crops. It has a limited distribution in China, Japan and Vietnam, but it has the potential to invade areas outside of Asia. More genetic information of B. tsuneonis should be obtained in order to develop effective methodologies for rapid and accurate molecular identification due to the difficulty of distinguishing it from Bactrocera minax based on morphological features. We report here the whole mitochondrial genome of B. tsuneonis sequenced by next-generation sequencing. This mitogenome sequence had a total length of 15,865 bp, a typical circular molecule comprising 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The structure and organization of the molecule were typical and similar compared with the published homologous sequences of other fruit flies in Tephritidae. The phylogenetic analyses based on the mitochondrial genome data presented a close genetic relationship between B. tsuneonis and B. minax. This is the first report of the complete mitochondrial genome of B. tsuneonis, and it can be used in further studies of species diagnosis, evolutionary biology, prevention and control. Copyright © 2018. Published by Elsevier B.V.
Genomics of Clostridium taeniosporum, an organism which forms endospores with ribbon-like appendages
Cambridge, Joshua M.; Blinkova, Alexandra L.; Salvador Rocha, Erick I.; Bode Hernández, Addys; Moreno, Maday; Ginés-Candelaria, Edwin; Goetz, Benjamin M.; Hunicke-Smith, Scott; Satterwhite, Ed; Tucker, Haley O.
2018-01-01
Clostridium taeniosporum, a non-pathogenic anaerobe closely related to the C. botulinum Group II members, was isolated from Crimean lake silt about 60 years ago. Its endospores are surrounded by an encasement layer which forms a trunk at one spore pole to which about 12–14 large, ribbon-like appendages are attached. The genome consists of one 3,264,813 bp, circular chromosome (with 26.6% GC) and three plasmids. The chromosome contains 2,892 potential protein coding sequences: 2,124 have specific functions, 147 have general functions, 228 are conserved but without known function and 393 are hypothetical based on the fact that no statistically significant orthologs were found. The chromosome also contains 101 genes for stable RNAs, including 7 rRNA clusters. Over 84% of the protein coding sequences and 96% of the stable RNA coding regions are oriented in the same direction as replication. The three known appendage genes are located within a single cluster with five other genes, the protein products of which are closely related, in terms of sequence, to the known appendage proteins. The relatedness of the deduced protein products suggests that all or some of the closely related genes might code for minor appendage proteins or assembly factors. The appendage genes might be unique among the known clostridia; no statistically significant orthologs were found within other clostridial genomes for which sequence data are available. The C. taeniosporum chromosome contains two functional prophages, one Siphoviridae and one Myoviridae, and one defective prophage. Three plasmids of 5.9, 69.7 and 163.1 Kbp are present. These data are expected to contribute to future studies of developmental, structural and evolutionary biology and to potential industrial applications of this organism. PMID:29293521
Cambridge, Joshua M; Blinkova, Alexandra L; Salvador Rocha, Erick I; Bode Hernández, Addys; Moreno, Maday; Ginés-Candelaria, Edwin; Goetz, Benjamin M; Hunicke-Smith, Scott; Satterwhite, Ed; Tucker, Haley O; Walker, James R
2018-01-01
Clostridium taeniosporum, a non-pathogenic anaerobe closely related to the C. botulinum Group II members, was isolated from Crimean lake silt about 60 years ago. Its endospores are surrounded by an encasement layer which forms a trunk at one spore pole to which about 12-14 large, ribbon-like appendages are attached. The genome consists of one 3,264,813 bp, circular chromosome (with 26.6% GC) and three plasmids. The chromosome contains 2,892 potential protein coding sequences: 2,124 have specific functions, 147 have general functions, 228 are conserved but without known function and 393 are hypothetical based on the fact that no statistically significant orthologs were found. The chromosome also contains 101 genes for stable RNAs, including 7 rRNA clusters. Over 84% of the protein coding sequences and 96% of the stable RNA coding regions are oriented in the same direction as replication. The three known appendage genes are located within a single cluster with five other genes, the protein products of which are closely related, in terms of sequence, to the known appendage proteins. The relatedness of the deduced protein products suggests that all or some of the closely related genes might code for minor appendage proteins or assembly factors. The appendage genes might be unique among the known clostridia; no statistically significant orthologs were found within other clostridial genomes for which sequence data are available. The C. taeniosporum chromosome contains two functional prophages, one Siphoviridae and one Myoviridae, and one defective prophage. Three plasmids of 5.9, 69.7 and 163.1 Kbp are present. These data are expected to contribute to future studies of developmental, structural and evolutionary biology and to potential industrial applications of this organism.
Pre-45s rRNA promotes colon cancer and is associated with poor survival of CRC patients.
Tsoi, H; Lam, K C; Dong, Y; Zhang, X; Lee, C K; Zhang, J; Ng, S C; Ng, S S M; Zheng, S; Chen, Y; Fang, J; Yu, J
2017-11-02
One characteristic of cancer cells is the abnormally high rate of cell metabolism to sustain their enhanced proliferation. However, the behind mechanism of this phenomenon is still elusive. Here we find that enhanced precursor 45s ribosomal RNA (pre-45s rRNA) is one of the core mechanisms in promoting the pathogenesis of colorectal cancer (CRC). Pre-45s rRNA expression is significantly higher in primary CRC tumor tissues samples and cancer cell lines compared with the non-tumorous colon tissues, and is associated with tumor sizes. Knockdown of pre-45s rRNA inhibits G1/S cell-cycle transition by stabilizing p53 through inducing murine double minute 2 (MDM2) and ribosomal protein L11 (RpL11) interaction. In addition, we revealed that high rate of cancer cell metabolism triggers the passive release of calcium ion from endoplasmic reticulum to the cytoplasm. The elevated calcium ion in the cytoplasm activates the signaling cascade of calcium/calmodulin-dependent protein kinase II, ribosomal S6 kinase (S6K) and ribosomal S6K (CaMKII-S6K-UBF). The activated UBF promotes the transcription of rDNA, which therefore increases pre-45s rRNA. Disruption of CaMKII-S6K-UBF axis by either RNAi or pharmaceutical approaches leads to reduction of pre-45s rRNA expression, which subsequently suppresses cell proliferation in colon cancer cells by causing cell-cycle arrest. Knockdown of APC activates CaMKII-S6K-UBF cascade and thus enhances pre-45s rRNA expression. Moreover, the high expression level of pre-45s rRNA is associated with poor survival of CRC patients in two independent cohorts. Our study identifies a novel mechanism in CRC pathogenesis mediated by pre-45s rRNA and a prognostic factor of pre-45s rRNA in CRC patients.
Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash
2015-01-01
We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. PMID:25838485
The complete mitochondrial genome of the endangered spotback skate, Atlantoraja castelnaui.
Duckett, Drew J L; Naylor, Gavin J P
2016-05-01
Chondrichthyes are a highly threatened class of organisms, largely due to overfishing and other human activities. The present study describes the complete mitochondrial genome (16,750 bp) of the endangered spotback skate, Atlantoraja castelnaui. The mitogenome is arranged in a typical vertebrate fashion, containing 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and 1 control region.
Gene copy number variation and its significance in cyanobacterial phylogeny
2012-01-01
Background In eukaryotes, variation in gene copy numbers is often associated with deleterious effects, but may also have positive effects. For prokaryotes, studies on gene copy number variation are rare. Previous studies have suggested that high numbers of rRNA gene copies can be advantageous in environments with changing resource availability, but further association of gene copies and phenotypic traits are not documented. We used one of the morphologically most diverse prokaryotic phyla to test whether numbers of gene copies are associated with levels of cell differentiation. Results We implemented a search algorithm that identified 44 genes with highly conserved copies across 22 fully sequenced cyanobacterial taxa. For two very basal cyanobacterial species, Gloeobacter violaceus and a thermophilic Synechococcus species, distinct phylogenetic positions previously found were supported by identical protein coding gene copy numbers. Furthermore, we found that increased ribosomal gene copy numbers showed a strong correlation to cyanobacteria capable of terminal cell differentiation. Additionally, we detected extremely low variation of 16S rRNA sequence copies within the cyanobacteria. We compared our results for 16S rRNA to three other eubacterial phyla (Chroroflexi, Spirochaetes and Bacteroidetes). Based on Bayesian phylogenetic inference and the comparisons of genetic distances, we could confirm that cyanobacterial 16S rRNA paralogs and orthologs show significantly stronger conservation than found in other eubacterial phyla. Conclusions A higher number of ribosomal operons could potentially provide an advantage to terminally differentiated cyanobacteria. Furthermore, we suggest that 16S rRNA gene copies in cyanobacteria are homogenized by both concerted evolution and purifying selection. In addition, the small ribosomal subunit in cyanobacteria appears to evolve at extraordinary slow evolutionary rates, an observation that has been made previously for morphological characteristics of cyanobacteria. PMID:22894826