intergenic consensus sequence: Topics by Science.gov

Sample records for intergenic consensus sequence

Enterobacterial Repetitive Intergenic Consensus Sequences as Molecular Targets for Typing of Mycobacterium tuberculosis Strains

PubMed Central

Sechi, Leonardo A.; Zanetti, Stefania; Dupré, Ilaria; Delogu, Giovanni; Fadda, Giovanni

1998-01-01

The presence of enterobacterial repetitive intergenic consensus (ERIC) sequences was demonstrated for the first time in the genome of Mycobacterium tuberculosis; these sequences have been found in transcribed regions of the chromosomes of gram-negative bacteria. In this study genetic diversity among clinical isolates of M. tuberculosis was determined by PCR with ERIC primers (ERIC-PCR). The study isolates comprised 71 clinical isolates collected from Sardinia, Italy. ERIC-PCR was able to identify 59 distinct profiles. The results obtained were compared with IS6110 and PCR-GTG fingerprinting. We found that the level of differentiation obtained by ERIC-PCR is greater than that obtained by IS6110 fingerprinting and comparable to that obtained by PCR-GTG. This method of fingerprinting is rapid and sensitive and can be applied to the study of the epidemiology of M. tuberculosis infections, especially when IS6110 fingerprinting is not of any help. PMID:9431935
Molecular Diagnostic Analysis of Outbreak Scenarios

ERIC Educational Resources Information Center

Morsink, M. C.; Dekter, H. E.; Dirks-Mulder, A.; van Leeuwen, W. B.

2012-01-01

In the current laboratory assignment, technical aspects of the polymerase chain reaction (PCR) are integrated in the context of six different bacterial outbreak scenarios. The "Enterobacterial Repetitive Intergenic Consensus Sequence" (ERIC) PCR was used to analyze different outbreak scenarios. First, groups of 2-4 students determined optimal…
Identification of Dendrobium species by a candidate DNA barcode sequence: the chloroplast psbA-trnH intergenic region.

PubMed

Yao, Hui; Song, Jing-Yuan; Ma, Xin-Ye; Liu, Chang; Li, Ying; Xu, Hong-Xi; Han, Jian-Ping; Duan, Li-Sheng; Chen, Shi-Lin

2009-05-01

DNA barcoding is a novel technology that uses a standard DNA sequence to facilitate species identification. Although a consensus has not been reached regarding which DNA sequences can be used as the best plant barcodes, the psbA-trnH spacer region has been tested extensively in recent years. In this study, we hypothesize that the psbA-trnH spacer regions are also effective barcodes for Dendrobium species. We have sequenced the chloroplast psbA-trnH intergenic spacers of 17 Dendrobium species to test this hypothesis. The sequences were found to be significantly different from those of other species, with percentages of variation ranging from 0.3 % to 2.3 % and an average of 1.2 %. In contrast, the intraspecific variation among the Dendrobium species studied ranged from 0 % to 0.1 %. The sequence difference between the psbA-trnH sequences of 17 Dendrobium species and one Bulbophyllum odoratissimum ranged from 2.0 % to 3.1 %, with an average of 2.5 %. Our results support the notion that the psbA-trnH intergenic spacer region could be used as a barcode to distinguish various Dendrobium species and to differentiate Dendrobium species from other adulterating species. Copyright Georg Thieme Verlag KG Stuttgart. New York.
Genomic Variability of Haemophilus influenzae Isolated from Mexican Children Determined by Using Enterobacterial Repetitive Intergenic Consensus Sequences and PCR

PubMed Central

Gomez-De-Leon, Patricia; Santos, Jose I.; Caballero, Javier; Gomez, Demostenes; Espinosa, Luz E.; Moreno, Isabel; Piñero, Daniel; Cravioto, Alejandro

2000-01-01

Genomic fingerprints from 92 capsulated and noncapsulated strains of Haemophilus influenzae from Mexican children with different diseases and healthy carriers were generated by PCR using the enterobacterial repetitive intergenic consensus (ERIC) sequences. A cluster analysis by the unweighted pair-group method with arithmetic averages based on the overall similarity as estimated from the characteristics of the genomic fingerprints, was conducted to group the strains. A total of 69 fingerprint patterns were detected in the H. influenzae strains. Isolates from patients with different diseases were represented by a variety of patterns, which clustered into two major groups. Of the 37 strains isolated from cases of meningitis, 24 shared patterns and were clustered into five groups within a similarity level of 1.0. One fragment of 1.25 kb was common to all meningitis strains. H. influenzae strains from healthy carriers presented fingerprint patterns different from those found in strains from sick children. Isolates from healthy individuals were more variable and were distributed differently from those from patients. The results show that ERIC-PCR provides a powerful tool for the determination of the distinctive pathogenicity potentials of H. influenzae strains and encourage its use for molecular epidemiology investigations. PMID:10878033
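The clustering step in the abstract above (unweighted pair-group method with arithmetic averages, UPGMA) can be sketched roughly as follows. This is only an illustration: the abstract does not say which similarity coefficient was used, so the Dice coefficient here is an assumption, and the strain names and band sizes are hypothetical.

```python
from itertools import combinations


def dice_similarity(bands_a, bands_b):
    """Dice coefficient between two sets of band sizes (an assumed choice)."""
    if not bands_a and not bands_b:
        return 1.0
    return 2 * len(bands_a & bands_b) / (len(bands_a) + len(bands_b))


def upgma(profiles):
    """Agglomerate fingerprints by average pairwise similarity (UPGMA).

    profiles maps a strain name to a set of band sizes.
    Returns the merges in order as (cluster_a, cluster_b, similarity).
    """
    sim = {
        frozenset((x, y)): dice_similarity(profiles[x], profiles[y])
        for x, y in combinations(profiles, 2)
    }

    def average(c1, c2):
        # Unweighted arithmetic mean over every cross-cluster strain pair.
        total = sum(sim[frozenset((x, y))] for x in c1 for y in c2)
        return total / (len(c1) * len(c2))

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        # Merge the two clusters with the highest average similarity.
        c1, c2 = max(combinations(clusters, 2), key=lambda pair: average(*pair))
        merges.append((c1, c2, average(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


# Hypothetical band sizes in bp, invented for illustration.
example = {
    "strain_A": {1250, 800, 500},
    "strain_B": {1250, 800, 500},
    "strain_C": {1250, 300},
}
for left, right, similarity in upgma(example):
    print(left, right, round(similarity, 2))
```

In practice, bands from a gel are matched within a size tolerance rather than by exact equality, so a real analysis would bin fragment sizes before building the sets.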
Comprehensive Survey of Genetic Diversity in Chloroplast Genomes and 45S nrDNAs within Panax ginseng Species

PubMed Central

Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Lee, Hyun Oh; Joh, Ho Jun; Kim, Nam-Hoon; Park, Hyun-Seung; Yang, Tae-Jin

2015-01-01

We report complete sequences of chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for cytoplasm and nuclear genome, respectively, based on low coverage NGS sequence of each cultivar. The cp genome sizes ranged from 156,241 to 156,425 bp and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences as well as those previously reported for three Chinese accessions identified very rare but unique polymorphism in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication. PMID:26061692
Structure of genes and an insertion element in the methane producing archaebacterium Methanobrevibacter smithii.

PubMed

Hamilton, P T; Reeve, J N

1985-01-01

DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA:16S rRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.
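As a rough illustration of how a degenerate consensus such as 5'GAANTTTCA from the abstract above might be located in a sequence, and how A/T content might be computed, here is a minimal Python sketch. The function names and the test sequence are hypothetical, and only a subset of the IUPAC ambiguity codes is included.

```python
import re

# Subset of IUPAC nucleotide codes, expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "N": "[ACGT]",
}


def find_motif(sequence, motif):
    """Return 0-based start positions of a degenerate motif, overlaps included."""
    body = "".join(IUPAC[base] for base in motif.upper())
    # A lookahead keeps overlapping occurrences that a plain search would skip.
    pattern = re.compile("(?=" + body + ")")
    return [match.start() for match in pattern.finditer(sequence.upper())]


def at_fraction(sequence):
    """Fraction of positions that are A or T."""
    seq = sequence.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)


# Hypothetical sequence containing one match to the candidate promoter consensus.
upstream = "CCGAACTTTCACC"
print(find_motif(upstream, "GAANTTTCA"))
print(round(at_fraction(upstream), 2))
```

This scans one strand only; a fuller search would also scan the reverse complement.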
Genetic relationships among strains of Xanthomonas fragariae based on random amplified polymorphic DNA PCR, repetitive extragenic palindromic PCR, and enterobacterial repetitive intergenic consensus PCR data and generation of multiplexed PCR primers useful for the identification of this phytopathogen.

PubMed Central

Pooler, M R; Ritchie, D F; Hartung, J S

1996-01-01

Genetic relationships among 25 isolates of Xanthomonas fragariae from diverse geographic regions were determined by three PCR methods that rely on different amplification priming strategies: random amplified polymorphic DNA (RAPD) PCR, repetitive extragenic palindromic (REP) PCR, and enterobacterial repetitive intergenic consensus (ERIC) PCR. The results of these assays are mutually consistent and indicate that pathogenic strains are very closely related to each other. RAPD, ERIC, and REP PCR assays identified nine, four, and two genotypes, respectively, within X. fragariae isolates. A single nonpathogenic isolate of X. fragariae was not distinguishable by these methods. The results of the PCR assays were also fully confirmed by physiological tests. There was no correlation between DNA amplification product patterns and geographic sites of isolation, suggesting that this bacterium has spread largely through exchange of infected plant germ plasm. Sequences identified through the RAPD assays were used to develop three primer pairs for standard PCR assays to identify X. fragariae. In addition, we developed a stringent multiplexed PCR assay to identify X. fragariae by simultaneously using the three independently derived sets of primers specific for pathogenic strains of the bacteria. PMID:8795198
Comparison of pulsed-field gel electrophoresis and enterobacterial repetitive intergenic consensus PCR and biochemical tests to characterize Lactococcus garvieae.

PubMed

Ture, M; Altinok, I; Capkin, E

2015-01-01

Biochemical tests, pulsed-field gel electrophoresis (PFGE) and enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR) were used to compare 42 strains of Lactococcus garvieae isolated from different regions of Turkey, Italy, France and Spain. Twenty biotypes of L. garvieae were formed based on 54 biochemical tests. ERIC-PCR of genomic DNA from different L. garvieae strains resulted in amplification of multiple fragments of DNA in sizes ranging between 200 and 5000 bp with various band intensities. After cutting DNA with ApaI restriction enzyme and running on the PFGE, 11–22 resolvable bands ranging from 2 to 194 kb were observed. Turkish isolates were grouped into two clusters, and only the A58 (Italy) strain was connected with Turkish isolates. Similarities between Turkish, Spanish, Italian and French isolates were <50% except for the 216-6 Rize strain. In Turkey, lactococcosis first occurred in Mugla and then spread all over the country. Based on ERIC-PCR, Spanish and Italian strains of L. garvieae were related to Mugla strains. Therefore, after comparing PFGE profiles, ERIC-PCR profiles and phenotypic characteristics of 42 strains of L. garvieae, there were no relationships found between these three typing methods. The PFGE method was more discriminative than the other methods. © 2014 John Wiley & Sons Ltd.
Sequence analysis and expression of the M1 and M2 matrix protein genes of hirame rhabdovirus (HIRRV)

USGS Publications Warehouse

Nishizawa, T.; Kurath, G.; Winton, J.R.

1997-01-01

We have cloned and sequenced a 2318 nucleotide region of the genomic RNA of hirame rhabdovirus (HIRRV), an important viral pathogen of Japanese flounder Paralichthys olivaceus. This region comprises approximately two-thirds of the 3' end of the nucleocapsid protein (N) gene and the complete matrix protein (M1 and M2) genes with the associated intergenic regions. The partial N gene sequence was 812 nucleotides in length with an open reading frame (ORF) that encoded the carboxyl-terminal 250 amino acids of the N protein. The M1 and M2 genes were 771 and 700 nucleotides in length, respectively, with ORFs encoding proteins of 227 and 193 amino acids. The M1 gene sequence contained an additional small ORF that could encode a highly basic, arginine-rich protein of 25 amino acids. Comparisons of the N, M1, and M2 gene sequences of HIRRV with the corresponding sequences of the fish rhabdoviruses, infectious hematopoietic necrosis virus (IHNV) or viral hemorrhagic septicemia virus (VHSV) indicated that HIRRV was more closely related to IHNV than to VHSV, but was clearly distinct from either. The putative consensus gene termination sequence for IHNV and VHSV, AGAYAG(A)(7), was present in the N-M1, M1-M2, and M2-G intergenic regions of HIRRV as were the putative transcription initiation sequences YGGCAC and AACA. An Escherichia coli expression system was used to produce recombinant proteins from the M1 and M2 genes of HIRRV. These were the same size as the authentic M1 and M2 proteins and reacted with anti-HIRRV rabbit serum in western blots. These reagents can be used for further study of the fish immune response and to test novel control methods.
Variation of 45S rDNA intergenic spacers in Arabidopsis thaliana.

PubMed

Havlová, Kateřina; Dvořáčková, Martina; Peiro, Ramon; Abia, David; Mozgová, Iva; Vansáčová, Lenka; Gutierrez, Crisanto; Fajkus, Jiří

2016-11-01

Approximately seven hundred 45S rRNA genes (rDNA) in the Arabidopsis thaliana genome are organised in two 4 Mbp-long arrays of tandem repeats arranged in head-to-tail fashion separated by an intergenic spacer (IGS). These arrays make up 5 % of the A. thaliana genome. IGS are rapidly evolving sequences and frequent rearrangements inside the rDNA loci have generated considerable interspecific and even intra-individual variability, which makes it possible to distinguish among otherwise highly conserved rRNA genes. The IGS has not been comprehensively described despite its potential importance in regulation of rDNA transcription and replication. Here we describe the detailed sequence variation in the complete IGS of A. thaliana WT plants and provide the reference/consensus IGS sequence, as well as genomic DNA analysis. We further investigate mutants dysfunctional in chromatin assembly factor-1 (CAF-1) (fas1 and fas2 mutants), which are known to have a reduced number of rDNA copies, and plant lines with restored CAF-1 function (segregated from a fas1xfas2 genetic background) showing major rDNA rearrangements. The systematic rDNA loss in CAF-1 mutants leads to the decreased variability of the IGS and to the occurrence of distinct IGS variants. We present for the first time a comprehensive and representative set of complete IGS sequences, obtained by conventional cloning and by Pacific Biosciences sequencing. Our data expand the knowledge of the A. thaliana IGS sequence arrangement and variability, which has not been available in full and in detail until now. This is also the first study combining IGS sequencing data with RFLP analysis of genomic DNA.
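The idea of a reference/consensus sequence mentioned in the abstract above can be illustrated with a simple column-wise majority rule over already-aligned variants. This is a generic sketch, not the procedure used in the study; the function name, the 50% threshold and the toy sequences are assumptions.

```python
from collections import Counter


def majority_consensus(aligned, min_fraction=0.5):
    """Column-wise majority consensus of equal-length aligned sequences.

    A column gets 'N' when no symbol exceeds min_fraction of the sequences.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same, non-zero count and length")
    consensus = []
    for column in zip(*aligned):
        base, count = Counter(column).most_common(1)[0]
        consensus.append(base if count / len(column) > min_fraction else "N")
    return "".join(consensus)


# Hypothetical aligned variants.
variants = ["ACGT", "ACGA", "ACTT"]
print(majority_consensus(variants))
```

Gap characters are treated like any other symbol here, so a column dominated by gaps yields a gap in the consensus.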
Enterobacterial repetitive intergenic consensus sequences and the PCR to generate fingerprints of genomic DNAs from Vibrio cholerae O1, O139, and non-O1 strains.

PubMed

Rivera, I G; Chowdhury, M A; Huq, A; Jacobs, D; Martins, M T; Colwell, R R

1995-08-01

Enterobacterial repetitive intergenic consensus (ERIC) sequence polymorphism was studied in Vibrio cholerae strains isolated before and after the cholera epidemic in Brazil (in 1991), along with epidemic strains from Peru, Mexico, and India, by PCR. A total of 17 fingerprint patterns (FPs) were detected in the V. cholerae strains examined; 96.7% of the toxigenic V. cholerae O1 strains and 100% of the O139 serogroup strains were found to belong to the same FP group comprising four fragments (FP1). The nontoxigenic V. cholerae O1 also yielded four fragments but constituted a different FP group (FP2). A total of 15 different patterns were observed among the V. cholerae non-O1 strains. Two patterns were observed most frequently for V. cholerae non-O1 strains, 25% of which have FP3, with five fragments, and 16.7% of which have FP4, with two fragments. Three fragments, 1.75, 0.79, and 0.5 kb, were found to be common to both toxigenic and nontoxigenic V. cholerae O1 strains as well as to group FP3, containing V. cholerae non-O1 strains. Two fragments of group FP3, 1.3 and 1.0 kb, were present in FP1 and FP2 respectively. The 0.5-kb fragment was common to all strains and serogroups of V. cholerae analyzed. It is concluded from the results of this study, based on DNA FPs of environmental isolates, that it is possible to detect an emerging virulent strain in a cholera-endemic region. ERIC-PCR constitutes a powerful tool for determination of the virulence potential of V. cholerae O1 strains isolated in surveillance programs and for molecular epidemiological investigations.
The nucleotide sequence of the intergenic region between the 5.8S and 26S rRNA genes of the yeast ribosomal RNA operon. Possible implications for the interaction between 5.8S and 26S rRNA and the processing of the primary transcript.

PubMed Central

Veldman, G M; Klootwijk, J; van Heerikhuizen, H; Planta, R J

1981-01-01

We have determined the nucleotide sequence of part of a cloned yeast ribosomal RNA operon extending from the 5.8S RNA gene downstream into the 5'-terminal region of the 26S RNA gene. We mapped the pertinent processing sites, viz. the 5' end of 26S rRNA and the 3' ends of 5.8S rRNA and its immediate precursor, 7S RNA. At the 3' end of 7S RNA we find the sequence UCGUUU which is very similar to the type I consensus sequence UCAUUA/U present at the 3' ends of 17S, 5.8S and 26S rRNA as well as 18S precursor rRNA in yeast. At the 5' end of the 26S RNA gene we find a sequence of thirteen nucleotides which is homologous to the type II sequence present at the 5' termini of both the 17S and the 5.8S RNA gene. These findings further support the suggestion put forward earlier (G.M. Veldman et al. (1980) Nucl. Acids Res. 8, 2907-2920) that both consensus sequences are involved in the recognition of precursor rRNA by the processing nuclease(s). We discuss a model for the processing of yeast rRNA in which a processing enzyme sequentially recognizes several combinations of a type I and a type II consensus sequence. We also describe the existence of a significant base complementarity between sequences in the 5'-terminal region of 26S rRNA and the 3'-terminal region of 5.8S rRNA. We suggest that base pairing between these sequences contributes to the binding between 5.8S and 26S rRNA. PMID:7312619
Evaluation of a dkgB linked intergenic sequence ribotyping (ISR) method for assigning serotype to Salmonella enterica isolated from poultry environmental samples.

USDA-ARS's Scientific Manuscript database

The Kauffman White (KW) serotyping method requires more than 250 antisera to characterize more than 2,500 Salmonella serovars. The complexity of serotyping could be overcome using molecular methods. In this study, a dkgB-linked intergenic sequence ribotyping (ISR) method that generates sequence occu...
Recombination in feline immunodeficiency virus from feral and companion domestic cats.

PubMed

Hayward, Jessica J; Rodrigo, Allen G

2008-06-17

Recombination is a relatively common phenomenon in retroviruses. We investigated recombination in Feline Immunodeficiency Virus from naturally-infected New Zealand domestic cats (Felis catus) by sequencing regions of the gag, pol and env genes. The occurrence of intragenic recombination was highest in env, with evidence of recombination in 6.4% (n = 156) of all cats. A further recombinant was identified in each of the gag (n = 48) and pol (n = 91) genes. Comparisons of phylogenetic trees across genes identified cases of incongruence, indicating intergenic recombination. Three (7.7%, n = 39) of these incongruencies were found to be significantly different using the Shimodaira-Hasegawa test. Surprisingly, our phylogenies from the gag and pol genes showed that no New Zealand sequences group with reference subtype C sequences within intrasubtype pairwise distances. Indeed, we find one and two distinct unknown subtype groups in gag and pol, respectively. These observations cause us to speculate that these New Zealand FIV strains have undergone several recombination events between subtype A parent strains and undefined unknown subtype strains, similar to the evolutionary history hypothesised for HIV-1 "subtype E". Endpoint dilution sequencing was used to confirm the consensus sequences of the putative recombinants and unknown subtype groups, providing evidence for the authenticity of these sequences. Endpoint dilution sequencing also resulted in the identification of a dual infection event in the env gene. In addition, an intrahost recombination event between variants of the same subtype in the pol gene was established. This is the first known example of naturally-occurring recombination in a cat with infection of the parent strains. Evidence of intragenic recombination in the gag, pol and env regions, and complex intergenic recombination, of FIV from naturally-infected domestic cats in New Zealand was found. Strains of unknown subtype were identified in all three gene regions. These results have implications for the use of the current FIV vaccine in New Zealand.
[Identification of medicinal plant Dendrobium based on the chloroplast psbK-psbI intergenic spacer].

PubMed

Yao, Hui; Yang, Pei; Zhou, Hong; Ma, Shuang-jiao; Song, Jing-yuan; Chen, Shi-lin

2015-06-01

In this paper, the chloroplast psbK-psbI intergenic spacers of 18 species of Dendrobium and their adulterants were amplified and sequenced, and then the sequence characteristics were analyzed. The sequence lengths of chloroplast psbK-psbI regions of Dendrobium ranged from 474 to 513 bp and the GC contents were 25.4%-27.6%. There were 71 variable sites and 46 informative sites. The inter-specific genetic distances calculated by Kimura 2-parameter (K2P) of Dendrobium were 0.0061-0.0581, with an average of 0.0284. The K2P genetic distances between Dendrobium species and Bulbophyllum odoratissimum were 0.0932-0.1204. The NJ tree showed that the Dendrobium species can be easily differentiated from each other and 6 samples of the inspected Dendrobium species were identified successfully through sequencing the psbK-psbI intergenic spacer. Therefore, the chloroplast psbK-psbI intergenic spacer can be used as a candidate marker to identify Dendrobium species and its adulterants.
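For readers unfamiliar with the Kimura 2-parameter (K2P) distance cited in the abstract above, the standard formula is d = -(1/2) ln((1 - 2P - Q) * sqrt(1 - 2Q)), where P and Q are the proportions of sites showing transitions and transversions. A minimal Python sketch follows; the function name and the toy sequences are hypothetical, and skipping gapped or ambiguous sites is one possible convention, not necessarily the one used in the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes (assumed convention)
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1  # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy alignment with one transition (A<->G) in ten sites.
print(round(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"), 4))
```

The logarithm is undefined for highly diverged pairs (when 1 - 2P - Q or 1 - 2Q is not positive), in which case `math.log` or `math.sqrt` raises `ValueError`.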
Genetic characterization of Streptococcus phocae strains isolated from Atlantic salmon, Salmo salar L., in Chile.

PubMed

Valdés, I; Jaureguiberry, B; Romalde, J L; Toranzo, A E; Magariños, B; Avendaño-Herrera, R

2009-04-01

Streptococcus phocae is a beta-haemolytic bacterium frequently involved in disease outbreaks in seals causing pneumonia or respiratory infection. Since 1999, this pathogen has been isolated from diseased Atlantic salmon, Salmo salar, causing serious economic losses in the salmon industry in Chile. In this study, we used different molecular typing methods, such as pulsed-field gel electrophoresis (PFGE), randomly amplified polymorphic DNA (RAPD), enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR), repetitive extragenic palindromic PCR (REP-PCR) and restriction of 16S-23S rDNA intergenic spacer regions to evaluate the genetic diversity in S. phocae. Thirty-four strains isolated in different years were analysed. The S. phocae type strain ATCC 51973(T) was included for comparative purposes. The results demonstrated genetic homogeneity within the S. phocae strains isolated in Chile over several years, suggesting the existence of clonal relationships among S. phocae isolated from Atlantic salmon. The type strain ATCC 51973(T) presented a different genetic pattern with the PFGE, RAPD, ERIC-PCR and REP-PCR methods. However, the fingerprint patterns of two seal isolates were distinct from those of the type strain.
Cladistic biogeography of Juglans (Juglandaceae) based on chloroplast DNA intergenic spacer sequences

USDA-ARS's Scientific Manuscript database

The phylogenetic utility of sequence variation from five chloroplast DNA intergenic spacer (IGS) regions: trnT-trnF, psbA-trnH, atpB-rbcL, trnV-16S rRNA, and trnS-trnfM was examined in the genus Juglans. A total of seventeen taxa representing the four sections within Juglans and an outgroup taxon, ...
Assessing information content and interactive relationships of subgenomic DNA sequences of the MHC using complexity theory approaches based on the non-extensive statistical mechanics

NASA Astrophysics Data System (ADS)

Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.

2018-09-01

This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.
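The non-extensive statistics of Tsallis referred to in the abstract above rest on the entropy S_q = (1 - sum_i p_i^q) / (q - 1), which reduces to the Shannon entropy as q approaches 1. The sketch below only evaluates that entropy on k-mer frequencies of a sequence. It is far simpler than the phase-space analysis the study describes, and the function names, the choice of k-mers and the toy sequence are assumptions made for illustration.

```python
import math
from collections import Counter


def kmer_probabilities(sequence, k):
    """Relative frequencies of overlapping k-mers in a sequence."""
    seq = sequence.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return [count / total for count in counts.values()]


def tsallis_entropy(probabilities, q):
    """Tsallis entropy S_q with the constant k set to 1."""
    if abs(q - 1.0) < 1e-12:
        # Limit q -> 1 recovers the Shannon entropy (natural logarithm).
        return -sum(p * math.log(p) for p in probabilities if p > 0)
    return (1.0 - sum(p ** q for p in probabilities if p > 0)) / (q - 1.0)


# Toy sequence with a uniform base composition.
probs = kmer_probabilities("ACGTACGTACGT", 1)
print(tsallis_entropy(probs, 2.0))
print(round(tsallis_entropy(probs, 1.0), 4))
```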
Conserved intergenic sequences revealed by CTAG-profiling in Salmonella: thermodynamic modeling for function prediction

NASA Astrophysics Data System (ADS)

Tang, Le; Zhu, Songling; Mastriani, Emilio; Fang, Xin; Zhou, Yu-Jie; Li, Yong-Guo; Johnston, Randal N.; Guo, Zheng; Liu, Gui-Rong; Liu, Shu-Lin

2017-03-01

Highly conserved short sequences help identify functional genomic regions and facilitate genomic annotation. We used Salmonella as the model to search the genome for evolutionarily conserved regions and focused on the tetranucleotide sequence CTAG for its potentially important functions. In Salmonella, CTAG is highly conserved across the lineages and large numbers of CTAG-containing short sequences fall in intergenic regions, strongly indicating their biological importance. Computer modeling demonstrated stable stem-loop structures in some of the CTAG-containing intergenic regions, and substitution of a nucleotide of the CTAG sequence would radically rearrange the free energy and disrupt the structure. The postulated degeneration of CTAG takes distinct patterns among Salmonella lineages and provides novel information about genomic divergence and evolution of these bacterial pathogens. Comparison of the vertically and horizontally transmitted genomic segments showed different CTAG distribution landscapes, with the genome amelioration process to remove CTAG taking place inward from both terminals of the horizontally acquired segment.
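To make the idea of counting CTAG in intergenic versus genic regions concrete, here is a minimal Python sketch. The gene coordinates, the toy genome and the function names are hypothetical, and this is not the CTAG-profiling pipeline of the study. Because CTAG is its own reverse complement, scanning one strand is enough.

```python
def intergenic_intervals(genome_length, genes):
    """Return half-open (start, end) gaps not covered by any gene interval."""
    gaps, cursor = [], 0
    for start, end in sorted(genes):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)  # handles overlapping or nested genes
    if cursor < genome_length:
        gaps.append((cursor, genome_length))
    return gaps


def count_motif(sequence, motif="CTAG"):
    """Count occurrences of motif, overlapping ones included."""
    count, index = 0, sequence.find(motif)
    while index != -1:
        count += 1
        index = sequence.find(motif, index + 1)
    return count


# Toy genome with two hypothetical genes at [4, 8) and [12, 16).
genome = "CTAGAAAACTAGGGGGCTAG"
genes = [(4, 8), (12, 16)]
gaps = intergenic_intervals(len(genome), genes)
intergenic_hits = sum(count_motif(genome[s:e]) for s, e in gaps)
genic_hits = sum(count_motif(genome[s:e]) for s, e in genes)
print(gaps, intergenic_hits, genic_hits)
```

Occurrences that straddle a gene boundary are counted in neither class with this slicing, which a real analysis would need to decide how to handle.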
Species identification of medicinal pteridophytes by a DNA barcode marker, the chloroplast psbA-trnH intergenic region.

PubMed

Ma, Xin-Ye; Xie, Cai-Xiang; Liu, Chang; Song, Jing-Yuan; Yao, Hui; Luo, Kun; Zhu, Ying-Jie; Gao, Ting; Pang, Xiao-Hui; Qian, Jun; Chen, Shi-Lin

2010-01-01

Medicinal pteridophytes are an important group used in traditional Chinese medicine; however, there is no simple and universal way to differentiate various species of this group by morphological traits. A novel technology termed "DNA barcoding" could discriminate species by a standard DNA sequence with universal primers and sufficient variation. To determine whether DNA barcoding would be effective for differentiating pteridophyte species, we first analyzed five DNA sequence markers (psbA-trnH intergenic region, rbcL, rpoB, rpoC1, and matK) using six chloroplast genomic sequences from GenBank and found psbA-trnH intergenic region the best candidate for availability of universal primers. Next, we amplified the psbA-trnH region from 79 samples of medicinal pteridophyte plants. These samples represented 51 species from 24 families, including all the authentic pteridophyte species listed in the Chinese pharmacopoeia (2005 version) and some commonly used adulterants. We found that the sequence of the psbA-trnH intergenic region can be determined with both high polymerase chain reaction (PCR) amplification efficiency (94.1%) and high direct sequencing success rate (81.3%). Combined with GenBank data (54 species across 12 pteridophyte families), species discriminative power analysis showed that 90.2% of species could be separated/identified successfully by the TaxonGap method in conjunction with the Basic Local Alignment Search Tool 1 (BLAST1) method. The TaxonGap method results further showed that, for 37 out of 39 separable species with at least two samples each, between-species variation was higher than the relevant within-species variation. Thus, the psbA-trnH intergenic region is a suitable DNA marker for species identification in medicinal pteridophytes.

Acanthamoeba castellanii contains a ribosomal RNA enhancer binding protein which stimulates TIF-IB binding and transcription under stringent conditions.

PubMed

Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R

1995-11-11

The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF.
Acanthamoeba castellanii contains a ribosomal RNA enhancer binding protein which stimulates TIF-IB binding and transcription under stringent conditions.

PubMed Central

Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R

1995-01-01

The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A. castellanii is not closely related to vertebrate UBF. PMID:7501455
Characteristics and significance of intergenic polyadenylated RNA transcription in Arabidopsis.

PubMed

Moghe, Gaurav D; Lehti-Shiu, Melissa D; Seddon, Alex E; Yin, Shan; Chen, Yani; Juntawong, Piyada; Brandizzi, Federica; Bailey-Serres, Julia; Shiu, Shin-Han

2013-01-01

The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
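
As a rough illustration of the quantities reported in this abstract (fraction of noncoding sequence conserved and the length distribution of ungapped conserved blocks), the sketch below scans a pairwise alignment for runs of identical, ungapped columns. The minimum block length, the toy alignment, and the function name are assumptions made for the example; the study's actual conservation criterion may differ.

```python
from statistics import median


def conserved_blocks(seq1: str, seq2: str, min_len: int = 3):
    """Return lengths of maximal runs of identical, ungapped alignment columns."""
    if len(seq1) != len(seq2):
        raise ValueError("aligned sequences must have equal length")
    blocks, run = [], 0
    for a, b in zip(seq1, seq2):
        if a == b and a != "-":
            run += 1  # extend the current identical, ungapped run
        else:
            if run >= min_len:
                blocks.append(run)
            run = 0  # a mismatch or gap ends the run
    if run >= min_len:
        blocks.append(run)
    return blocks


# Hypothetical toy alignment of two orthologous noncoding fragments.
aln1 = "ACGTTGCA--GGCTAAGCTTAC"
aln2 = "ACGTAGCATTGGCTCAGCTTAC"

blocks = conserved_blocks(aln1, aln2)
aligned_sites = sum(1 for a, b in zip(aln1, aln2) if a != "-" and b != "-")
fraction_conserved = sum(blocks) / aligned_sites
median_block_length = median(blocks)
```

With real data the same summary statistics could be computed separately for intergenic and intronic alignments and then compared.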
Saturation of an Intra-Gene Pool Linkage Map: Towards a Unified Consensus Linkage Map for Fine Mapping and Synteny Analysis in Common Bean

PubMed Central

Galeano, Carlos H.; Fernandez, Andrea C.; Franco-Herrera, Natalia; Cichy, Karen A.; McClean, Phillip E.; Vanderleyden, Jos; Blair, Matthew W.

2011-01-01

Map-based cloning and fine mapping to find genes of interest and marker-assisted selection (MAS) require good genetic maps with reproducible markers. In this study, we saturated the linkage map of the intra-gene pool population of common bean DOR364×BAT477 (DB) by evaluating 2,706 molecular markers including SSR, SNP, and gene-based markers. On average the polymorphism rate was 7.7% due to the narrow genetic base between the parents. The DB linkage map consisted of 291 markers with a total map length of 1,788 cM. A consensus map was built using the core mapping populations derived from inter-gene pool crosses: DOR364×G19833 (DG) and BAT93×JALO EEP558 (BJ). The consensus map consisted of a total of 1,010 markers mapped, with a total map length of 2,041 cM across 11 linkage groups. On average, each linkage group on the consensus map contained 91 markers, of which 83% were single-copy markers. Finally, a synteny analysis was carried out using our highly saturated consensus maps compared with the soybean pseudo-chromosome assembly. A total of 772 marker sequences were compared with the soybean genome. A total of 44 syntenic blocks were identified. The linkage group Pv6 presented the most diverse pattern of synteny with seven syntenic blocks, and Pv9 showed the most consistent relations with soybean with just two syntenic blocks. Additionally, a co-linear analysis using common bean transcript map information against soybean coding sequences (CDS) revealed the relationship with 787 soybean genes. The common bean consensus map has allowed us to map a larger number of markers and to obtain a more complete coverage of the common bean genome. Our results, combined with synteny relationships, provide tools to increase marker density in selected genomic regions to identify closely linked polymorphic markers for indirect selection, fine mapping or for positional cloning. PMID:22174773
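
The degree of saturation can be made concrete with a small calculation from the figures given in this abstract. The sketch below uses total map length divided by marker count as a simple approximation of mean marker spacing; the function name is hypothetical and the abstract itself does not report these derived values.

```python
def mean_marker_spacing(map_length_cm: float, n_markers: int) -> float:
    """Approximate mean spacing (cM per marker) as map length / marker count."""
    if n_markers <= 0:
        raise ValueError("marker count must be positive")
    return map_length_cm / n_markers


# Figures taken from the abstract.
db_spacing = mean_marker_spacing(1788, 291)          # DB linkage map
consensus_spacing = mean_marker_spacing(2041, 1010)  # consensus map
markers_per_group = 1010 / 11                        # consensus map, 11 linkage groups

print(f"DB map: {db_spacing:.2f} cM per marker")
print(f"Consensus map: {consensus_spacing:.2f} cM per marker")
print(f"Consensus map: {markers_per_group:.1f} markers per linkage group")
```

Under this approximation the consensus map is roughly three times denser than the DB map, and the per-group average is consistent with the reported 91 markers per linkage group.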
Definition of a consensus DNA-binding site for PecS, a global regulator of virulence gene expression in Erwinia chrysanthemi and identification of new members of the PecS regulon.

PubMed

Rouanet, Carine; Reverchon, Sylvie; Rodionov, Dmitry A; Nasser, William

2004-07-16

In Erwinia chrysanthemi, production of pectic enzymes is modulated by a complex network involving several regulators. One of them, PecS, which belongs to the MarR family, also controls the synthesis of various other virulence factors, such as cellulases and indigoidine. Here, the PecS consensus-binding site is defined by combining a systematic evolution of ligands by an exponential enrichment approach and mutational analyses. The consensus consists of a 23-base pair palindromic-like sequence (C(-11)G(-10)A(-9)N(-8)W(-7)T(-6)C(-5)G(-4)T(-3)A(-2))T(-1)A(0)T(1)(T(2)A(3)C(4)G(5)A(6)N(7)N(8)N(9)C(10)G(11)). Mutational experiments revealed that (i) the palindromic organization is required for the binding of PecS, (ii) the very conserved part of the consensus (-6 to 6) allows for a specific interaction with PecS, but the presence of the relatively degenerated bases located apart significantly increases PecS affinity, (iii) the four bases G, A, T, and C are required for efficient binding of PecS, and (iv) the presence of several binding sites on the same promoter increases the affinity of PecS. This consensus is detected in the regions involved in PecS binding on the previously characterized target genes. This variable consensus is in agreement with the observation that the members of the MarR family are able to bind various DNA targets as dimers by means of a winged helix DNA-binding motif. Binding of PecS on a promoter region containing the defined consensus results in a repression of gene transcription in vitro. Preliminary scanning of the E. chrysanthemi genome sequence with the consensus revealed the presence of strong PecS-binding sites in the intergenic region between fliE and fliFGHIJKLMNOPQR which encode proteins involved in the biogenesis of flagellum. Accordingly, PecS directly represses fliE expression. Thus, PecS seems to control the synthesis of virulence factors required for the key steps of plant infection.
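
The genome scan mentioned at the end of this abstract can be sketched as a pattern search. The example below writes the 23-bp consensus in IUPAC letters (N for any base, W for A or T), converts it to a regular expression, and reports exact matches on the forward strand only. The test sequence and function names are hypothetical, and exact matching is a simplification: the authors' scanning procedure is not described here and may have tolerated mismatches or used a weighted profile.

```python
import re

# Subset of IUPAC nucleotide codes needed for this consensus.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]", "N": "[ACGT]"}

# Consensus from the abstract, positions -11 to +11.
PECS_CONSENSUS = "CGANWTCGTATATTACGANNNCG"


def consensus_to_regex(consensus: str) -> "re.Pattern[str]":
    """Build a regex for the consensus; the lookahead allows overlapping hits."""
    body = "".join(IUPAC[base] for base in consensus)
    return re.compile("(?=(" + body + "))")


def find_sites(sequence: str, consensus: str = PECS_CONSENSUS):
    """Return (0-based start, matched site) for each forward-strand match."""
    pattern = consensus_to_regex(consensus)
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]


# Hypothetical sequence containing one site that fits the consensus.
test_sequence = "TTT" + "CGAGATCGTATATTACGACCTCG" + "AAA"
sites = find_sites(test_sequence)
for start, site in sites:
    print(start, site)
```

Scanning the reverse complement as well would be needed to cover both strands of a real genome sequence.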
Complete sequence of the genome of avian paramyxovirus type 9 and comparison with other paramyxoviruses

PubMed Central

Samuel, Arthur S.; Kumar, Sachin; Madhuri, Subbiah; Collins, Peter L.; Samal, Siba K.

2009-01-01

The complete genome consensus sequence was determined for avian paramyxovirus (APMV) serotype 9 prototype strain PMV-9/domestic Duck/New York/22/78. The genome is 15,438 nucleotides (nt) long and encodes six non-overlapping genes in the order of 3′-N-P/V/W-M-F-HN-L-5′ with intergenic regions of 0–30 nt. The genome length follows the “rule of six” and contains a 55-nt leader sequence at the 3′ end and a 47-nt trailer sequence at the 5′ end. The cleavage site of the F protein is I-R-E-G-R-I↓F, which does not conform to the conventional cleavage site of the ubiquitous cellular protease furin. The virus required exogenous protease for in vitro replication and grew only in a few established cell lines, indicating a restricted host range. Alignment and phylogenetic analysis of the predicted amino acid sequences of APMV-9 proteins with the cognate proteins of viruses of all five genera of family Paramyxoviridae showed that APMV-9 is more closely related to APMV-1 than to other APMVs. The mean death time in embryonated chicken eggs was found to be more than 120 h, indicating APMV-9 to be avirulent for chickens. PMID:19185593
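
The "rule of six" statement in this abstract can be checked directly: the rule holds when the genome length is an exact multiple of six nucleotides. A minimal check using the reported length (the function name is hypothetical):

```python
GENOME_LENGTH_NT = 15_438  # APMV-9 genome length reported in the abstract


def follows_rule_of_six(length_nt: int) -> bool:
    """True when the genome length is an exact multiple of six nucleotides."""
    return length_nt % 6 == 0


hexamer_count = GENOME_LENGTH_NT // 6

print(follows_rule_of_six(GENOME_LENGTH_NT), hexamer_count)
```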
Characterization of Acetic Acid Bacteria in Traditional Acetic Acid Fermentation of Rice Vinegar (Komesu) and Unpolished Rice Vinegar (Kurosu) Produced in Japan

PubMed Central

Nanda, Kumiko; Taniguchi, Mariko; Ujike, Satoshi; Ishihara, Nobuhiro; Mori, Hirotaka; Ono, Hisayo; Murooka, Yoshikatsu

2001-01-01

Bacterial strains were isolated from samples of Japanese rice vinegar (komesu) and unpolished rice vinegar (kurosu) fermented by the traditional static method. Fermentations have never been inoculated with a pure culture since they were started in 1907. A total of 178 isolates were divided into groups A and B on the basis of enterobacterial repetitive intergenic consensus-PCR and random amplified polymorphic DNA fingerprinting analyses. The 16S ribosomal DNA sequences of strains belonging to each group showed similarities of more than 99% with Acetobacter pasteurianus. Group A strains overwhelmingly dominated all stages of fermentation of both types of vinegar. Our results indicate that appropriate strains of acetic acid bacteria have spontaneously established almost pure cultures during nearly a century of komesu and kurosu fermentation. PMID:11157275
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.

PubMed

Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong

2009-03-01

Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
Molecular cloning, sequencing, and expression of the outer membrane protein P2 gene of Haemophilus parasuis.

PubMed

Li, Peng; Bai, Juan; Li, Jun-xing; Zhang, Guo-long; Song, Yan-hua; Li, Yu-feng; Wang, Xian-wei; Jiang, Ping

2012-10-01

Haemophilus parasuis is the etiological agent of Glässer's disease, characterized by fibrinous polyserositis, polyarthritis, and meningitis in young pigs. However, it is difficult to develop universal serological diagnostic tools and effective vaccines against this disease because of the serovar diversity of the isolates. In this study, enterobacterial repetitive intergenic consensus-polymerase chain reaction was performed to investigate the gene profile of 111 isolates of H. parasuis from China. A specific common gene of H. parasuis was cloned and identified as the outer-membrane protein (OMP) P2 gene. Sequencing of the OMP P2 genes of 22 isolates showed that they had high homology and could be divided into 2 genetic types. Moreover, the OMP P2 protein was expressed in an Escherichia coli expression system, and the purified recombinant protein provided partial protection against H. parasuis infection in mice. These results suggest that OMP P2 is an immunogenic protein with great potential to serve as a vaccine and diagnostic antigen. Copyright © 2011 Elsevier Ltd. All rights reserved.
Genotyping of human lice suggests multiple emergencies of body lice from local head louse populations.

PubMed

Li, Wenjun; Ortiz, Gabriel; Fournier, Pierre-Edouard; Gimenez, Gregory; Reed, David L; Pittendrigh, Barry; Raoult, Didier

2010-03-23

Genetic analyses of human lice have shown that the current taxonomic classification of head lice (Pediculus humanus capitis) and body lice (Pediculus humanus humanus) does not reflect their phylogenetic organization. Three phylotypes of head lice A, B and C exist but body lice have been observed only in phylotype A. Head and body lice have different behaviours and only the latter have been involved in outbreaks of infectious diseases including epidemic typhus, trench fever and louse borne recurrent fever. Recent studies suggest that body lice arose several times from head louse populations. By introducing a new genotyping technique, sequencing variable intergenic spacers which were selected from louse genomic sequence, we were able to evaluate the genotypic distribution of 207 human lice. Sequence variation of two intergenic spacers, S2 and S5, discriminated the 207 lice into 148 genotypes and sequence variation of another two intergenic spacers, PM1 and PM2, discriminated 174 lice into 77 genotypes. Concatenation of the four intergenic spacers discriminated a panel of 97 lice into 96 genotypes. These intergenic spacer sequence types were relatively specific geographically, and enabled us to identify two clusters in France, one cluster in Central Africa (where a large body louse outbreak has been observed) and one cluster in Russia. Interestingly, head and body lice were not genetically differentiated. We propose a hypothesis for the emergence of body lice, and suggest that humans with both low hygiene and head louse infestations provide an opportunity for head louse variants, able to ingest a larger blood meal (a required characteristic of body lice), to colonize clothing. If this hypothesis is ultimately supported, it would help to explain why poor human hygiene often coincides with outbreaks of body lice. 
Additionally, if head lice act as a reservoir for body lice, and if any social degradation in human populations may allow the formation of new populations of body lice, then head louse populations are potentially a greater threat to humans than previously assumed.
Genotyping of Human Lice Suggests Multiple Emergences of Body Lice from Local Head Louse Populations

PubMed Central

Li, Wenjun; Ortiz, Gabriel; Fournier, Pierre-Edouard; Gimenez, Gregory; Reed, David L.; Pittendrigh, Barry; Raoult, Didier

2010-01-01

Background Genetic analyses of human lice have shown that the current taxonomic classification of head lice (Pediculus humanus capitis) and body lice (Pediculus humanus humanus) does not reflect their phylogenetic organization. Three phylotypes of head lice A, B and C exist but body lice have been observed only in phylotype A. Head and body lice have different behaviours and only the latter have been involved in outbreaks of infectious diseases including epidemic typhus, trench fever and louse borne recurrent fever. Recent studies suggest that body lice arose several times from head louse populations. Methods and Findings By introducing a new genotyping technique, sequencing variable intergenic spacers which were selected from louse genomic sequence, we were able to evaluate the genotypic distribution of 207 human lice. Sequence variation of two intergenic spacers, S2 and S5, discriminated the 207 lice into 148 genotypes and sequence variation of another two intergenic spacers, PM1 and PM2, discriminated 174 lice into 77 genotypes. Concatenation of the four intergenic spacers discriminated a panel of 97 lice into 96 genotypes. These intergenic spacer sequence types were relatively specific geographically, and enabled us to identify two clusters in France, one cluster in Central Africa (where a large body louse outbreak has been observed) and one cluster in Russia. Interestingly, head and body lice were not genetically differentiated. Conclusions We propose a hypothesis for the emergence of body lice, and suggest that humans with both low hygiene and head louse infestations provide an opportunity for head louse variants, able to ingest a larger blood meal (a required characteristic of body lice), to colonize clothing. If this hypothesis is ultimately supported, it would help to explain why poor human hygiene often coincides with outbreaks of body lice. 
Additionally, if head lice act as a reservoir for body lice, and if any social degradation in human populations may allow the formation of new populations of body lice, then head louse populations are potentially a greater threat to humans than previously assumed. PMID:20351779
Comparison of four molecular methods to type Salmonella Enteritidis strains.

PubMed

Campioni, Fábio; Pitondo-Silva, André; Bergamini, Alzira M M; Falcão, Juliana P

2015-05-01

This study compared the pulsed-field gel electrophoresis (PFGE), enterobacterial repetitive intergenic consensus-PCR (ERIC-PCR), multilocus variable-number tandem-repeat analysis (MLVA), and multilocus sequence typing (MLST) methods for typing 188 Salmonella Enteritidis strains from different sources isolated over a 24-year period in Brazil. PFGE and ERIC-PCR were more efficient than MLVA for subtyping the strains. However, MLVA provided additional epidemiological information for those strains. In addition, MLST showed the Brazilian strains as belonging to the main clonal complex of S. Enteritidis, CC11, and provided the first report of two new STs in the S. enterica database but could not properly subtype the strains. Our results showed that the use of PFGE or ERIC-PCR together with MLVA is suitable to efficiently subtype S. Enteritidis strains and provide important epidemiological information. © 2015 APMIS. Published by John Wiley & Sons Ltd.
Ribosomal DNA intergenic spacer sequence in foxtail millet, Setaria italica (L.) P. Beauv. and its characterization and application to typing of foxtail millet landraces.

PubMed

Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto

2005-02-01

We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.
Comparative isolation and genetic diversity of Arcobacter sp. from fish and the coastal environment.

PubMed

Rathlavath, S; Kumar, S; Nayak, B B

2017-07-01

Arcobacter species are emerging food-borne and water-borne human pathogens associated mostly with food animals and their environment. The present study was aimed to isolate Arcobacter species from fish, shellfish and coastal water samples using two methods and to determine their genetic diversity. Of 201 samples of fish, shellfish and water samples analysed, 66 (32·8%) samples showed the presence of Arcobacter DNA from both Arcobacter enrichment broth and Bolton broth. Arcobacters were isolated from 58 (87·8%) and 38 (57·5%) of Arcobacter DNA-positive samples using Arcobacter blood agar and Preston blood agar, respectively. Arcobacter sp. identified by biochemical tests were further analysed by a genus-specific PCR, followed by a multiplex-PCR and 16S rRNA-RFLP. From both the methods, four different Arcobacter species namely Arcobacter butzleri, Arcobacter skirrowii, Arcobacter mytili and Arcobacter defluvii were isolated, of which A. butzleri was the predominant species. Enterobacterial repetitive intergenic consensus (ERIC)-PCR fingerprint analysis revealed that the arcobacters isolated in this study were genetically very diverse and no specific genotype was found associated with a specific source (seafood or water). Since pathogenic arcobacters are not known to be natural inhabitants of coastal marine environment, identifying the sources of contamination will be crucial for effective management of this problem. Arcobacter sp. are emerging food- and water-borne human pathogens. In this study, comparison of two selective media suggested Arcobacter blood agar to be more efficient in yielding Arcobacter sp. from seafood. Furthermore, the isolation of Arcobacter sp. such as Arcobacter butzleri, A. skirrowii, A. mytili and A. defluvii from seafood suggests diverse sources of contamination of seafood by Arcobacter sp. Analysis of enterobacterial repetitive intergenic consensus sequence-PCR patterns of A. butzleri showed high genetic diversity and lack of clonality among the isolates. Arcobacter contamination of seafood is an emerging issue both from seafood safety and seafood trade point of view. © 2017 The Society for Applied Microbiology.
Molecular Characterization of Vibrio cholerae Isolated From Clinical Samples in Kurdistan Province, Iran.

PubMed

Ramazanzadeh, Rashid; Rouhi, Samaneh; Shakib, Pegah; Shahbazi, Babak; Bidarpour, Farzam; Karimi, Mohammad

2015-05-01

Vibrio cholerae causes diarrhoeal disease that afflicts thousands of people annually. V. cholerae is classified on the basis of somatic antigens into serovars or serogroups, and there are at least 200 known serogroups. Two serogroups, O1 and O139, have been associated with epidemic diseases. Virulence genes of these bacteria are OmpW, ctxA and tcpA. Due to the importance of V. cholerae infection and of developing molecular diagnostics for this organism in medical and microbiology sciences, this study aimed to describe the molecular characterization of V. cholerae isolated from clinical samples using a molecular method. In this study, 48 samples were provided during summer 2013 (late August and early September) by a reference laboratory. Samples were initially assessed using biochemical tests. Primers for the OmpW, ctxA and tcpA genes were used in Polymerase Chain Reaction (PCR) protocols. Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR and Repetitive Extragenic Palindromic (REP)-PCR methods were used to subtype V. cholerae. In this study, from a total of 48 clinical stool samples, 39 (81.2%) were positive for V. cholerae in biochemical tests and bacteria culture tests. The PCR results showed that of 39 positive isolates, 35 (89.7%), 34 (87.1%) and 37 (94.8%) were positive for the ctxA, tcpA and OmpW genes, respectively. Also, in the REP-PCR method with ERIC primer, strains were divided into 10 groups. In the REP-PCR method with REP primer, strains were divided into 13 groups. Polymerase chain reaction has specificity and accuracy for identification of the organism and is able to differentiate biotypes. Enterobacterial repetitive intergenic consensus sequence is one of the informative and discriminative methods for the analysis of V. cholerae diversity. The REP-PCR is a less informative and discriminative method compared to other methods for the analysis of V. cholerae diversity.
Similar Ratios of Introns to Intergenic Sequence across Animal Genomes

PubMed Central

Wörheide, Gert

2017-01-01

One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. PMID:28633296
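
The intron-to-intergenic ratio discussed in this abstract can be illustrated with a simplified calculation from gene and exon coordinates. The sketch below assumes a single contig, non-overlapping genes, and half-open coordinates; the `Gene` class, the function name, and the toy annotation are hypothetical, and the study's own pipeline, which accounts for annotation problems, is not reproduced here.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Gene:
    start: int                     # 0-based, inclusive
    end: int                       # exclusive
    exons: List[Tuple[int, int]]   # (start, end) pairs inside the gene


def intron_intergenic_ratio(genome_length: int, genes: List[Gene]):
    """Return (intronic bases, intergenic bases, intron:intergenic ratio)."""
    genic = sum(g.end - g.start for g in genes)
    exonic = sum(e - s for g in genes for s, e in g.exons)
    intronic = genic - exonic            # gene span not covered by exons
    intergenic = genome_length - genic   # everything outside gene spans
    if intergenic == 0:
        raise ValueError("no intergenic sequence; ratio is undefined")
    return intronic, intergenic, intronic / intergenic


# Hypothetical toy annotation on a 2,500-bp contig.
genes = [
    Gene(100, 600, [(100, 200), (400, 600)]),
    Gene(1000, 1800, [(1000, 1100), (1300, 1400), (1700, 1800)]),
]
intronic, intergenic, ratio = intron_intergenic_ratio(2500, genes)
print(intronic, intergenic, round(ratio, 3))
```

If genes are missing from an annotation, their introns are counted as intergenic sequence in a calculation like this, which lowers the ratio; that is the direction of deviation the abstract attributes to underannotated genomes.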
Comparison of dkgB-linked intergenic sequence ribotyping to DNA microarray hybridization for assigning serotype to Salmonella enterica

PubMed Central

Guard, Jean; Sanchez-Ingunza, Roxana; Morales, Cesar; Stewart, Tod; Liljebjelke, Karen; Kessel, JoAnn; Ingram, Kim; Jones, Deana; Jackson, Charlene; Fedorka-Cray, Paula; Frye, Jonathan; Gast, Richard; Hinton, Arthur

2012-01-01

Two DNA-based methods were compared for the ability to assign serotype to 139 isolates of Salmonella enterica ssp. I. Intergenic sequence ribotyping (ISR) evaluated single nucleotide polymorphisms occurring in a 5S ribosomal gene region and flanking sequences bordering the gene dkgB. A DNA microarray hybridization method that assessed the presence and the absence of sets of genes was the second method. Serotype was assigned for 128 (92.1%) of submissions by the two DNA methods. ISR detected mixtures of serotypes within single colonies and it cost substantially less than Kauffmann–White serotyping and DNA microarray hybridization. Decreasing the cost of serotyping S. enterica while maintaining reliability may encourage routine testing and research. PMID:22998607
Identification of a new genotype H wild-type mumps virus strain and its molecular relatedness to other virulent and attenuated strains.

PubMed

Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin

2003-06-01

A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon the nucleotide sequence of its SH gene, 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and L proteins were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe AM9 A substrains). Copyright 2003 Wiley-Liss, Inc.
Identification of Sinorhizobium (Ensifer) medicae based on a specific genomic sequence unveiled by M13-PCR fingerprinting.

PubMed

Dourado, Ana Catarina; Alves, Paula I L; Tenreiro, Tania; Ferreira, Eugénio M; Tenreiro, Rogério; Fareleira, Paula; Crespo, M Teresa Barreto

2009-12-01

A collection of nodule isolates from Medicago polymorpha obtained from southern and central Portugal was evaluated by M13-PCR fingerprinting and hierarchical cluster analysis. Several genomic clusters were obtained which, by 16S rRNA gene sequencing of selected representatives, were shown to be associated with particular taxonomic groups of rhizobia and other soil bacteria. The method provided a clear separation between rhizobia and co-isolated non-symbiotic soil contaminants. Ten M13-PCR groups were assigned to Sinorhizobium (Ensifer) medicae and included all isolates responsible for the formation of nitrogen-fixing nodules upon re-inoculation of M. polymorpha test-plants. In addition, enterobacterial repetitive intergenic consensus (ERIC)-PCR fingerprinting indicated a high genomic heterogeneity within the major M13-PCR clusters of S. medicae isolates. Based on nucleotide sequence data of an M13-PCR amplicon of ca. 1500 bp, observed only in S. medicae isolates and spanning locus Smed_3707 to Smed_3709 in the pSMED01 plasmid sequence of the S. medicae WSM419 genome, a pair of PCR primers was designed and used for direct PCR amplification of a 1399-bp sequence within this fragment. Additional in silico and in vitro experiments, as well as phylogenetic analysis, confirmed the specificity of this primer combination and therefore the reliability of this approach in the prompt identification of S. medicae isolates and their distinction from other soil bacteria.

Comparison of internal transcribed spacers and intergenic spacer regions of five common Iranian sheep bursate nematodes.

PubMed

Nabavi, Reza; Conneely, Brendan; McCarthy, Elaine; Good, Barbara; Shayan, Parviz; DE Waal, Theo

2014-09-01

Accurate identification of sheep nematodes is a critical point in epidemiological studies and monitoring of drug resistance in flocks. However, due to a close morphological similarity between the eggs and larval stages of many of these nematodes, such identification is not a trivial task. There are a number of studies showing that molecular targets in ribosomal DNA (internal transcribed spacers 1 and 2 and the intergenic spacer) are suitable for accurate identification of sheep bursate nematodes. The objective of the present study was to compare the ITS1, ITS2 and IGS regions of common Iranian bursate nematodes in order to choose the best target for specific identification methods. The first and second internal transcribed spacers (ITS1 and ITS2) and intergenic spacer (IGS) of the ribosomal DNA (rDNA) of 5 common Iranian bursate nematodes of sheep were sequenced. The sequences of some non-Iranian isolates were used for comparison in order to evaluate the variation in sequence homology between geographically different nematode populations. Comparison of the ITS1 and ITS2 sequences of Iranian nematodes showed the greatest similarity between Teladorsagia circumcincta and Marshallagia marshalli, of 94% and 88%, respectively, while Trichostrongylus colubriformis and M. marshalli showed the highest homology (99%) in the IGS sequences. Comparison of the spacer sequences of Iranian with non-Iranian isolates showed significantly higher variation in Haemonchus contortus compared to the other species. Both the ITS1 and ITS2 sequences are convenient targets for species-specific identification of Iranian bursate nematodes. On the other hand, the IGS region may be a less suitable molecular target.
Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.

PubMed

Francis, Warren R; Wörheide, Gert

2017-06-01

One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
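The intron-to-intergenic ratio discussed in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the input layout, and the toy coordinates are assumptions made for illustration, and it assumes a single sequence with non-overlapping genes and half-open coordinates.

```python
def intron_intergenic_ratio(genome_length, genes):
    """Return (intronic_bp, intergenic_bp, ratio) for a toy annotation.

    genes: list of (gene_start, gene_end, exons), where exons is a list of
    (exon_start, exon_end) pairs. Coordinates are half-open and genes are
    assumed not to overlap (a simplifying assumption, not a general truth).
    """
    genic_bp = 0
    exonic_bp = 0
    for gene_start, gene_end, exons in genes:
        genic_bp += gene_end - gene_start
        exonic_bp += sum(exon_end - exon_start for exon_start, exon_end in exons)

    intronic_bp = genic_bp - exonic_bp        # inside genes, outside exons
    intergenic_bp = genome_length - genic_bp  # outside all annotated genes
    if intergenic_bp <= 0:
        raise ValueError("no intergenic sequence in this annotation")
    return intronic_bp, intergenic_bp, intronic_bp / intergenic_bp


# Hypothetical 1,000-bp genome with two annotated genes.
toy_genes = [
    (100, 400, [(100, 150), (300, 400)]),
    (600, 800, [(600, 650), (700, 800)]),
]
print(intron_intergenic_ratio(1000, toy_genes))
```

Under this definition, a missed gene counts its whole span as intergenic sequence, which is one way to see why an under-annotated genome would show a ratio well below 1:1.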
HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers

PubMed Central

Stadhouders, Ralph; Aktuna, Suleyman; Thongjuea, Supat; Aghajanirefah, Ali; Pourfarzad, Farzin; van IJcken, Wilfred; Lenhard, Boris; Rooks, Helen; Best, Steve; Menzel, Stephan; Grosveld, Frank; Thein, Swee Lay; Soler, Eric

2014-01-01

Genetic studies have identified common variants within the intergenic region (HBS1L-MYB) between GTP-binding elongation factor HBS1L and myeloblastosis oncogene MYB on chromosome 6q that are associated with elevated fetal hemoglobin (HbF) levels and alterations of other clinically important human erythroid traits. It is unclear how these noncoding sequence variants affect multiple erythrocyte characteristics. Here, we determined that several HBS1L-MYB intergenic variants affect regulatory elements that are occupied by key erythroid transcription factors within this region. These elements interact with MYB, a critical regulator of erythroid development and HbF levels. We found that several HBS1L-MYB intergenic variants reduce transcription factor binding, affecting long-range interactions with MYB and MYB expression levels. These data provide a functional explanation for the genetic association of HBS1L-MYB intergenic polymorphisms with human erythroid traits and HbF levels. Our results further designate MYB as a target for therapeutic induction of HbF to ameliorate sickle cell and β-thalassemia disease severity. PMID:24614105
Sequences in the intergenic spacer influence RNA Pol I transcription from the human rRNA promoter

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, W.M.; Sylvester, J.E.

1994-09-01

In most eucaryotic species, ribosomal genes are tandemly repeated about 100-5000 times per haploid genome. The 43 Kb human rDNA repeat consists of a 13 Kb coding region for the 18S, 5.8S, 28S ribosomal RNAs (rRNAs) and transcribed spacers separated by a 30 Kb intergenic spacer. For species such as frog, mouse and rat, sequences in the intergenic spacer other than the gene promoter have been shown to modulate transcription of the ribosomal gene. These sequences are spacer promoters, enhancers and the terminator for spacer transcription. We are addressing whether the human ribosomal gene promoter is similarly influenced. In-vitro transcription run-off assays have revealed that the 4.5 kb region (CBE), directly upstream of the gene promoter, has cis-stimulation and trans-competition properties. This suggests that the CBE fragment contains an enhancer(s) for ribosomal gene transcription. Further experiments have shown that a fragment (~1.6 kb) within the CBE fragment also has trans-competition function. Deletion subclones of this region are being tested to delineate the exact sequences responsible for these modulating activities. Previous sequence analysis and functional studies have revealed that CBE contains regions of DNA capable of adopting alternative structures such as bent DNA, Z-DNA, and triple-stranded DNA. Whether these structures are required for modulating transcription remains to be determined as does the specific DNA-protein interaction involved.
Sequencing Bands of Ribosomal Intergenic Spacer Analysis Fingerprints for Characterization and Microscale Distribution of Soil Bacterium Populations Responding to Mercury Spiking

PubMed Central

Ranjard, Lionel; Brothier, Elisabeth; Nazaret, Sylvie

2000-01-01

Two major emerging bands (a 350-bp band and a 650-bp band) within the RISA (ribosomal intergenic spacer analysis) profile of a soil bacterial community spiked with Hg(II) were selected for further identification of the populations involved in the response of the community to the added metal. The bands were cut out from polyacrylamide gels, cloned, characterized by restriction analysis, and sequenced for phylogenetic affiliation of dominant clones. The sequences were the intergenic spacer between the rrs and rrl genes and the first 130 nucleotides of the rrl gene. Comparison of sequences derived from the 350-bp band to the GenBank database permitted us to identify the bacteria as being mostly close relatives of low G+C firmicutes (Clostridium-like genera), while the 650-bp band permitted us to identify the bacteria as being mostly close relatives of β-proteobacteria (Ralstonia-like genera). Oligonucleotide probes specific for the identified dominant bacteria were designed and hybridized with the RISA profiles derived from the control and spiked communities. These studies confirmed the contribution of these populations to the community response to the metal. Hybridization of the RISA profiles from subcommunities (bacterial pools associated with different soil microenvironments) also permitted us to characterize the distribution and the dynamics of these populations at a microscale level following mercury spiking. PMID:11097911
The phosphotransferase system-dependent sucrose utilization regulon in enteropathogenic Escherichia coli strains is located in a variable chromosomal region containing iap sequences.

PubMed

Treviño-Quintanilla, Luis Gerardo; Escalante, Adelfo; Caro, Alma Delia; Martínez, Alfredo; González, Ricardo; Puente, José Luis; Bolívar, Francisco; Gosset, Guillermo

2007-01-01

The capacity to utilize sucrose as a carbon and energy source (Scr(+) phenotype) is a highly variable trait among Escherichia coli strains. In this study, seven enteropathogenic E. coli (EPEC) strains from different sources were studied for their capacity to grow using sucrose. Liquid media cultures showed that all analyzed strains have the Scr(+) phenotype and two distinct groups were defined: one of five and another of two strains displaying doubling times of 67 and 125 min, respectively. The genes conferring the Scr(+) phenotype in one of the fast-growing strains (T19) were cloned and sequenced. Comparative sequence analysis revealed that this strain possesses the scr regulon genes scrKYABR, encoding phosphoenolpyruvate:phosphotransferase system-dependent sucrose transport and utilization activities. Transcript level quantification revealed sucrose-dependent induction of scrK and scrR genes in fast-growing strains, whereas no transcripts were detected in slow-growing strains. Sequence comparison analysis revealed that the scr genes in strain T19 are almost identical to those present in the scr regulon of prototype EPEC E2348/69 and in both strains, the scr genes are inserted in the chromosomal intergenic region of hypothetical genes ygcE and ygcF. Comparison of the ygcE-ygcF intergenic region sequence of strains MG1655, enterohemorrhagic EDL933, uropathogenic ECFT073 and EPEC T19-E2348/69 revealed that the number of extragenic highly repeated iap sequences corresponded to nine, four, two and none, respectively. These results show that the iap sequence-containing chromosomal ygcE-ygcF intergenic region is highly variable in E. coli. Copyright (c) 2007 S. Karger AG, Basel.
Endemic and Epidemic Lineages of Escherichia coli that Cause Urinary Tract Infections

PubMed Central

Tabor, Helen; Tellis, Patricia; Vincent, Caroline; Tellier, Pierre-Paul

2008-01-01

Women with urinary tract infections (UTIs) in California, USA (1999–2001), were infected with closely related or indistinguishable strains of Escherichia coli (clonal groups), which suggests point source dissemination. We compared strains of UTI-causing E. coli in California with strains causing such infections in Montréal, Québec, Canada. Urine specimens from women with community-acquired UTIs in Montréal (2006) were cultured for E. coli. Isolates that caused 256 consecutive episodes of UTI were characterized by antimicrobial drug susceptibility profile, enterobacterial repetitive intergenic consensus 2 PCR, serotyping, XbaI and NotI pulsed-field gel electrophoresis, multilocus sequence typing, and phylogenetic typing. We confirmed the presence of drug-resistant, genetically related, and temporally clustered E. coli clonal groups that caused community-acquired UTIs in unrelated women in 2 locations and 2 different times. Two clonal groups were identified in both locations. Epidemic transmission followed by endemic transmission of UTI-causing clonal groups may explain these clusters of UTI cases. PMID:18826822
Molecular typing of Vibrio parahaemolyticus isolated from seafood harvested along the south-west coast of India.

PubMed

Bhowmick, P P; Khushiramani, R; Raghunath, P; Karunasagar, I; Karunasagar, I

2008-02-01

This study evaluated protein profiling for typing Vibrio parahaemolyticus using 71 strains isolated from different seafood and compared it with other molecular typing techniques, namely random amplified polymorphic DNA analysis (RAPD) and enterobacterial repetitive intergenic consensus sequence (ERIC)-PCR. Three molecular typing methods were used for the typing of 71 V. parahaemolyticus isolates from seafood. RAPD had a discriminatory index (DI) of 0.95, while ERIC-PCR showed a DI of 0.94. Though protein profiling had less discriminatory power, use of this method can be helpful in identifying new proteins which might have a role in establishment in the host or virulence of the organism. The use of protein profiling in combination with other established typing methods such as RAPD and ERIC-PCR generates useful information in the case of V. parahaemolyticus associated with seafood. The study demonstrates the usefulness of nucleic acid and protein-based studies in understanding the relationship between various isolates from seafood.
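A discriminatory index such as the one reported here is commonly computed with the Hunter-Gaston form of Simpson's index of diversity, D = 1 - [1/(N(N-1))] Σ n_j(n_j - 1), where N is the number of isolates and n_j the number assigned to type j. The abstract does not state which formula was used, so the sketch below assumes that formulation; the function name and the type labels are hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of typing results.

    type_labels: one label per isolate (e.g. a banding-pattern identifier).
    Returns the probability that two isolates drawn at random without
    replacement fall into different types.
    """
    n_isolates = len(type_labels)
    if n_isolates < 2:
        raise ValueError("need at least two isolates")
    type_counts = Counter(type_labels).values()
    same_type_pairs = sum(count * (count - 1) for count in type_counts)
    return 1.0 - same_type_pairs / (n_isolates * (n_isolates - 1))


# Hypothetical example: six isolates falling into four banding patterns.
patterns = ["A", "A", "B", "C", "C", "D"]
print(round(discriminatory_index(patterns), 3))
```

A value near 1 means almost every isolate has its own pattern, and a value of 0 means all isolates share one pattern.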
Novel mechanism of conjoined gene formation in the human genome.

PubMed

Kim, Ryong Nam; Kim, Aeri; Choi, Sang-Haeng; Kim, Dae-Soo; Nam, Seong-Hyeuk; Kim, Dae-Won; Kim, Dong-Wook; Kang, Aram; Kim, Min-Young; Park, Kun-Hyang; Yoon, Byoung-Ha; Lee, Kang Seon; Park, Hong-Seog

2012-03-01

Recently, conjoined genes (CGs) have emerged as important genetic factors necessary for understanding the human genome. However, their formation mechanism and precise structures have remained mysterious. Based on a detailed structural analysis of 57 human CG transcript variants (CGTVs, discovered in this study) and all (833) known CGs in the human genome, we discovered that the poly(A) signal site from the upstream parent gene region is completely removed via the skipping or truncation of the final exon; consequently, CG transcription is terminated at the poly(A) signal site of the downstream parent gene. This result led us to propose a novel mechanism of CG formation: the complete removal of the poly(A) signal site from the upstream parent gene is a prerequisite for the CG transcriptional machinery to continue transcribing uninterrupted into the intergenic region and downstream parent gene. The removal of the poly(A) signal sequence from the upstream gene region appears to be caused by a deletion or truncation mutation in the human genome rather than post-transcriptional trans-splicing events. With respect to the characteristics of CG sequence structures, we found that intergenic regions are hot spots for novel exon creation during CGTV formation and that exons farther from the intergenic regions are more highly conserved in the CGTVs. Interestingly, many novel exons newly created within the intergenic and intragenic regions originated from transposable element sequences. Additionally, the CGTVs showed tumor tissue-biased expression. In conclusion, our study provides novel insights into the CG formation mechanism and expands the present concepts of the genetic structural landscape, gene regulation, and gene formation mechanisms in the human genome.
Genotypic analysis of Xylella fastidiosa isolates from different hosts using sequences homologous to the Xanthomonas rpf genes.

PubMed

Meinhardt, Lyndel W; Ribeiro, Milena P M A; Coletta-Filho, Helvécio D; Dumenyo, C Korsi; Tsai, Sui M; De M Bellato, Cláudia

2003-09-01

This is the first report of a genotypic analysis of the phytopathogenic bacterium Xylella fastidiosa (Xf) using differences within intra- and intergenic regions of pathogenic genes. Orthologous sequences from the genome of Xf were identified for genes involved in the regulation of pathogenicity factors (rpf) from Xanthomonas campestris pv. campestris (Xcc). While the rpf genes were conserved, the chromosomal region revealed differences in gene sizes and intergenic spacings and a major translocational event when compared to Xcc. Primers were designed to amplify three regions: the intragenic region of rpfA (2354 bp), the intergenic region between rpfA and rpfB (5772 bp), and the intergenic region between rpfC and rpfF (2314 bp). Amplicons were obtained for all three regions from 32 of the 33 Xf isolates tested from citrus, grape, coffee, plum, hibiscus and periwinkle. Three Xcc isolates from cruciferous plants only generated PCR products for the rpfC-F region. Cleaved amplified polymorphic sequences (CAPS) (TaqαI) revealed differential banding profiles for the rpfA-B and rpfC-F regions. Xylella isolates were separated into seven groups via rpfA-B, of which five contained only citrus, while the other two had citrus, grape and coffee, and citrus, coffee, plum and hibiscus isolates. rpfC-F separated the isolates into three host-related groups. Citrus, coffee and hibiscus isolates formed one group, while the other two groups were comprised solely of grape and plum isolates. Xcc isolates formed an out-group. In silico analysis supports these results, which reveal the potential of the rpf genes for genotypic analysis of Xylella fastidiosa.
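The CAPS step described above can be mimicked in silico by cutting an amplicon at each occurrence of the enzyme's recognition site and comparing fragment lengths. The sketch below is not the authors' analysis: it assumes the standard TaqI recognition site T^CGA (cut after the first T), a linear amplicon, and a made-up sequence.

```python
def digest_fragment_lengths(sequence, site="TCGA", cut_offset=1):
    """Return fragment lengths after cutting a linear sequence at each site.

    site: recognition sequence (TCGA is palindromic, so scanning one strand
    is sufficient). cut_offset: number of bases into the site at which the
    top strand is cut (1 for T^CGA).
    """
    seq = sequence.upper()
    cut_positions = []
    hit = seq.find(site)
    while hit != -1:
        cut_positions.append(hit + cut_offset)
        hit = seq.find(site, hit + 1)

    # Fragment lengths are the gaps between consecutive boundaries.
    boundaries = [0] + cut_positions + [len(seq)]
    return [right - left for left, right in zip(boundaries, boundaries[1:])]


# Hypothetical 17-bp amplicon containing two TCGA sites.
print(digest_fragment_lengths("AAATCGATTTTTCGAGG"))
```

Isolates whose amplicons differ in the number or position of sites give different fragment-length lists, which is what appears on a gel as a differential banding profile.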
Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria.

PubMed

Thorpe, Harry A; Bayliss, Sion C; Sheppard, Samuel K; Feil, Edward J

2018-04-01

The concept of the "pan-genome," which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent ("switched") intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb).
Structural analysis of two length variants of the rDNA intergenic spacer from Eruca sativa.

PubMed

Lakshmikumaran, M; Negi, M S

1994-03-01

Restriction enzyme analysis of the rRNA genes of Eruca sativa indicated the presence of many length variants within a single plant and also between different cultivars which is unusual for most crucifers studied so far. Two length variants of the rDNA intergenic spacer (IGS) from a single individual E. sativa (cv. Itsa) plant were cloned and characterized. The complete nucleotide sequences of both the variants (3 kb and 4 kb) were determined. The intergenic spacer contains three families of tandemly repeated DNA sequences denoted as A, B and C. However, the long (4 kb) variant shows the presence of an additional repeat, denoted as D, which is a duplication of a 224 bp sequence just upstream of the putative transcription initiation site. Repeat units belonging to the three different families (A, B and C) were in the size range of 22 to 30 bp. Such short repeat elements are present in the IGS of most of the crucifers analysed so far. Sequence analysis of the variants (3 kb and 4 kb) revealed that the length heterogeneity of the spacer is located at three different regions and is due to the varying copy numbers of repeat units belonging to families A and B. Length variation of the spacer is also due to the presence of a large duplication (D repeats) in the 4 kb variant which is absent in the 3 kb variant. The putative transcription initiation site was identified by comparisons with the rDNA sequences from other plant species.
Low DNA Sequence Diversity of the Intergenic Spacer 1 Region in the Human Skin Commensal Fungi Malassezia sympodialis and M. dermatis Isolated from Patients with Malassezia-Associated Skin Diseases and Healthy Subjects.

PubMed

Cho, Otomi; Sugita, Takashi

2016-12-01

As DNA sequences of the intergenic spacer (IGS) region in the rRNA gene show remarkable intraspecies diversity compared with the small subunit, large subunit, and internal transcribed spacer region, the IGS region has been used as an epidemiological tool in studies on Malassezia globosa and M. restricta, which are responsible for the exacerbation of atopic dermatitis (AD) and seborrheic dermatitis (SD). However, the IGS regions of M. sympodialis and M. dermatis obtained from the skin of patients with AD and SD, as well as healthy subjects, lacked sequence diversity. Of the 105 M. sympodialis strains and the 40 M. dermatis strains, the sequences of 103 (98.1%) and 39 (97.5%), respectively, were identical. Thus, given the lack of intraspecies diversity in the IGS regions of M. sympodialis and M. dermatis, studies of the diversity of these species should be performed using appropriate genes and not the IGS.
Genotypic and phenotypic diversity of Alicyclobacillus acidocaldarius isolates.

PubMed

Félix-Valenzuela, L; Guardiola-Avila, I; Burgara-Estrella, A; Ibarra-Zavala, M; Mata-Haro, V

2015-10-01

The fruit juice industry recognizes Alicyclobacillus as a major quality control target micro-organism. In this study, we analysed 19 bacterial isolates to identify Alicyclobacillus species by polymerase chain reaction (PCR) and sequencing analyses. Phenotypic and genomic diversity among isolates were investigated by the API 50CHB system and ERIC-PCR (enterobacterial repetitive intergenic consensus-PCR), respectively. All bacterial isolates were identified as Alicyclobacillus acidocaldarius, and almost all showed identical DNA sequences according to their 16S rRNA (rDNA) gene partial sequences. Only a few carbohydrates were fermented by A. acidocaldarius isolates, and there was little variability in the biochemical profile. Genotypic fingerprinting of the A. acidocaldarius isolates showed high diversity, and clusters by ERIC-PCR were distinct from those obtained from the 16S rRNA gene phylogenetic tree. There was no correlation between phenotypic and genotypic variability in the A. acidocaldarius isolates analysed in this study. Detection of Alicyclobacillus strains is imperative in fruit concentrates and juices due to the production of guaiacol. Identification of the genus leads to rejection of the product by the processing industry. However, not all Alicyclobacillus species cause spoilage, hence the importance of differentiating among them. In this study, partial 16S ribosomal RNA sequence alignment allowed the differentiation of species. In addition, ERIC-PCR was introduced for the genotypic characterization of Alicyclobacillus, as an alternative for differentiation among isolates from the same species. © 2015 The Society for Applied Microbiology.
Diversity of Cronobacter spp. isolates from the vegetables in the middle-east coastline of China.

PubMed

Chen, Wanyi; Yang, Jielin; You, Chunping; Liu, Zhenmin

2016-06-01

Cronobacter spp. have caused life-threatening neonatal infections, mainly resulting from consumption of contaminated powdered infant formula. A total of 102 vegetable samples from retail markets were evaluated for the presence of Cronobacter spp. Thirty-five presumptive Cronobacter isolates were isolated and identified using API 20E and 16S rDNA sequencing analyses. All isolates and type strains were characterized using enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR), and cluster analysis of the genetic profiles from this molecular typing test clearly showed that there were differences among isolates from different vegetables. A polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) assay based on the amplification of the gyrB gene (1258 bp) was developed to differentiate among Cronobacter species. This new PCR-RFLP assay, which uses the Alu I and Hinf I endonuclease combination, has been confirmed as an accurate and rapid subtyping method to differentiate Cronobacter species. Sequence analysis of the gyrB gene proved suitable for the phylogenetic analysis of the Cronobacter strains, giving much better SNP-based resolution for species identification than PCR-RFLP and ERIC-PCR. Our study further confirmed that vegetables are one of the most common habitats or sources of Cronobacter spp. contamination in the middle-east coastline of China.
Organization of the origins of replication of the chromosomes of Mycobacterium smegmatis, Mycobacterium leprae and Mycobacterium tuberculosis and isolation of a functional origin from M. smegmatis.

PubMed

Salazar, L; Fsihi, H; de Rossi, E; Riccardi, G; Rios, C; Cole, S T; Takiff, H E

1996-04-01

The genus Mycobacterium is composed of species with widely differing growth rates ranging from approximately three hours in Mycobacterium smegmatis to two weeks in Mycobacterium leprae. As DNA replication is coupled to cell duplication, it may be regulated by common mechanisms. The chromosomal regions surrounding the origins of DNA replication from M. smegmatis, M. tuberculosis, and M. leprae have been sequenced, and show very few differences. The gene order, rnpA-rpmH-dnaA-dnaN-recF-orf-gyrB-gyrA, is the same as in other Gram-positive organisms. Although the general organization in M. smegmatis is very similar to that of Streptomyces spp., a closely related genus, M. tuberculosis and M. leprae differ as they lack an open reading frame, between dnaN and recF, which is similar to the gnd gene of Escherichia coli. Within the three mycobacterial species, there is extensive sequence conservation in the intergenic regions flanking dnaA, but more variation from the consensus DnaA box sequence was seen than in other bacteria. By means of subcloning experiments, the putative chromosomal origin of replication of M. smegmatis, containing the dnaA-dnaN region, was shown to promote autonomous replication in M. smegmatis, unlike the corresponding regions from M. tuberculosis or M. leprae.
Complete Genome Sequence of a Novel Newcastle Disease Virus Strain Isolated from a Chicken in West Africa

PubMed Central

Kim, Shin-Hee; Nayak, Subhashree; Paldurai, Anandan; Nayak, Baibaswata; Samuel, Arthur; Aplogan, Gilbert L.; Awoume, Kodzo A.; Webby, Richard J.; Ducatez, Mariette F.; Collins, Peter L.

2012-01-01

The complete genome sequence of an African Newcastle disease virus (NDV) strain isolated from a chicken in Togo in 2009 was determined. The genome is 15,198 nucleotides (nt) in length and is classified in genotype VII in the class II cluster. Compared to common vaccine strains, the African strain contains a previously described 6-nt insert in the downstream untranslated region of the N gene and a novel 6-nt insert in the HN-L intergenic region. Genome length differences are a marker of the natural history of NDV. This is the first description of a class II NDV strain with a genome of 15,198 nt and a 6-nt insert in the HN-L intergenic region. Sequence divergence relative to vaccine strains was substantial, likely contributes to outbreaks, and illustrates the continued evolution of new NDV strains in West Africa. PMID:22997417
Reviving the Dead: History and Reactivation of an Extinct L1

PubMed Central

Yang, Lei; Brunsfeld, John; Scott, LuAnn; Wichman, Holly

2014-01-01

Although L1 sequences are present in the genomes of all placental mammals and marsupials examined to date, their activity was lost in the megabat family, Pteropodidae, ∼24 million years ago. To examine the characteristics of L1s prior to their extinction, we analyzed the evolutionary history of L1s in the genome of a megabat, Pteropus vampyrus, and found a pattern of periodic L1 expansion and quiescence. In contrast to the well-characterized L1s in human and mouse, megabat genomes have accommodated two or more simultaneously active L1 families throughout their evolutionary history, and major peaks of L1 deposition into the genome always involved multiple families. We compared the consensus sequences of the two major megabat L1 families at the time of their extinction to consensus L1s of a variety of mammalian species. Megabat L1s are comparable to the other mammalian L1s in terms of adenosine content and conserved amino acids in the open reading frames (ORFs). However, the intergenic region (IGR) of the reconstructed element from the more active family is dramatically longer than the IGR of well-characterized human and mouse L1s. We synthesized the reconstructed element from this L1 family and tested the ability of its components to support retrotransposition in a tissue culture assay. Both ORFs are capable of supporting retrotransposition, while the IGR is inhibitory to retrotransposition, especially when combined with either of the reconstructed ORFs. We dissected the inhibitory effect of the IGR by testing truncated and shuffled versions and found that length is a key factor, but not the only one affecting inhibition of retrotransposition. Although the IGR is inhibitory to retrotransposition, this inhibition does not account for the extinction of L1s in megabats. Overall, the evolution of the L1 sequence or the quiescence of L1 is unlikely to be the reason for L1 extinction. PMID:24968166
Reviving the dead: history and reactivation of an extinct l1.

PubMed

Yang, Lei; Brunsfeld, John; Scott, LuAnn; Wichman, Holly

2014-06-01

Although L1 sequences are present in the genomes of all placental mammals and marsupials examined to date, their activity was lost in the megabat family, Pteropodidae, ∼24 million years ago. To examine the characteristics of L1s prior to their extinction, we analyzed the evolutionary history of L1s in the genome of a megabat, Pteropus vampyrus, and found a pattern of periodic L1 expansion and quiescence. In contrast to the well-characterized L1s in human and mouse, megabat genomes have accommodated two or more simultaneously active L1 families throughout their evolutionary history, and major peaks of L1 deposition into the genome always involved multiple families. We compared the consensus sequences of the two major megabat L1 families at the time of their extinction to consensus L1s of a variety of mammalian species. Megabat L1s are comparable to the other mammalian L1s in terms of adenosine content and conserved amino acids in the open reading frames (ORFs). However, the intergenic region (IGR) of the reconstructed element from the more active family is dramatically longer than the IGR of well-characterized human and mouse L1s. We synthesized the reconstructed element from this L1 family and tested the ability of its components to support retrotransposition in a tissue culture assay. Both ORFs are capable of supporting retrotransposition, while the IGR is inhibitory to retrotransposition, especially when combined with either of the reconstructed ORFs. We dissected the inhibitory effect of the IGR by testing truncated and shuffled versions and found that length is a key factor, but not the only one affecting inhibition of retrotransposition. Although the IGR is inhibitory to retrotransposition, this inhibition does not account for the extinction of L1s in megabats. Overall, the evolution of the L1 sequence or the quiescence of L1 is unlikely to be the reason for L1 extinction.
Identification of 88 regulatory small RNAs in the TIGR4 strain of the human pathogen Streptococcus pneumoniae

PubMed Central

Acebo, Paloma; Martin-Galiano, Antonio J.; Navarro, Sara; Zaballos, Ángel; Amblar, Mónica

2012-01-01

Streptococcus pneumoniae is the main etiological agent of community-acquired pneumonia and a major cause of mortality and morbidity among children and the elderly. Genome sequencing of several pneumococcal strains revealed valuable information about the potential proteins and genetic diversity of this prevalent human pathogen. However, little is known about its transcriptional regulation and its small regulatory noncoding RNAs. In this study, we performed deep sequencing of the S. pneumoniae TIGR4 strain RNome to identify small regulatory RNA candidates expressed in this pathogen. We discovered 1047 potential small RNAs including intragenic, 5′- and/or 3′-overlapping RNAs and 88 small RNAs encoded in intergenic regions. With this approach, we recovered many of the previously identified intergenic small RNAs and identified 68 novel candidates, most of which are conserved in both sequence and genomic context in other S. pneumoniae strains. We confirmed the independent expression of 17 intergenic small RNAs and predicted putative mRNA targets for six of them using bioinformatics tools. Preliminary results suggest that one of these six is a key player in the regulation of competence development. This study is the biggest catalog of small noncoding RNAs reported to date in S. pneumoniae and provides a highly complete view of the small RNA network in this pathogen. PMID:22274957

Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

PubMed

VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

2015-11-26

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
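The N50 statistic quoted in this abstract can be illustrated with a minimal Python sketch. It assumes the usual definition (the length of the contig at which the running total of contigs, sorted longest first, reaches half of the assembly size). The function name and the contig lengths are hypothetical and are not taken from the Oropetium assembly.

```python
def n50(lengths):
    """Return the N50 of a list of contig lengths.

    Contigs are sorted from longest to shortest; the N50 is the length of
    the contig at which the cumulative sum first reaches half of the total.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contigs given")


# Hypothetical contig lengths (arbitrary units), for illustration only.
contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(contigs))
```

A larger N50 means that half of the assembled bases sit in longer contigs, which is why the statistic is commonly reported alongside the contig count.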
RNA processing in Neurospora crassa mitochondria: use of transfer RNA sequences as signals.

PubMed Central

Breitenberger, C A; Browning, K S; Alzner-DeWeerd, B; RajBhandary, U L

1985-01-01

We have used RNA gel transfer hybridization, S1 nuclease mapping and primer extension to analyze transcripts derived from several genes in Neurospora crassa mitochondria. The transcripts studied include those for cytochrome oxidase subunit III, 17S rRNA and an unidentified open reading frame. In all three cases, initial transcripts are long, include tRNA sequences, and are subsequently processed to generate the mature RNAs. We find that endpoints of the most abundant transcripts generally coincide with those of tRNA sequences. We therefore conclude that tRNA sequences in long transcripts act as primary signals for RNA processing in N. crassa mitochondria. The situation is somewhat analogous to that observed in mammalian mitochondrial systems. The difference, however, is that in mammalian mitochondria, noncoding spacers between tRNA, rRNA and protein genes are very short and in many cases non-existent, allowing no room for intergenic RNA processing signals whereas, in N. crassa mtDNA, intergenic non-coding sequences are usually several hundred nucleotides long and contain highly conserved GC-rich palindromic sequences. Since these GC-rich palindromic sequences are retained in the processed mature RNAs, we conclude that they do not serve as signals for RNA processing. PMID:2990893
Intergenic Sequence Ribotyping using a region neighboring dkgB links genovar to Kauffman-White serotype of Salmonella enterica

USDA-ARS's Scientific Manuscript database

Previous research identified that the 5S ribosomal (rrn) gene and associated flanking sequences that are closely linked to the dkgB gene of Salmonella enterica were highly variable between serotypes, but not between subpopulations within the same serotype (PMID: 17005008). The degree of variability ...
Characterization of North American Armillaria species: Genetic relationships determined by ribosomal DNA sequences and AFLP markers

Treesearch

M. -S. Kim; N. B. Klopfenstein; J. W. Hanna; G. I. McDonald

2006-01-01

Phylogenetic and genetic relationships among 10 North American Armillaria species were analysed using sequence data from ribosomal DNA (rDNA), including intergenic spacer (IGS-1), internal transcribed spacers with associated 5.8S (ITS + 5.8S), and nuclear large subunit rDNA (nLSU), and amplified fragment length polymorphism (AFLP) markers. Based on rDNA sequence data,...
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

PubMed

Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

2010-07-16

Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. 
A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Analysis of the intergenic region of tomato spotted wilt Tospovirus medium RNA segment.

PubMed

Bhat, A I; Pappu, S S; Pappu, H R; Deom, C M; Culbreath, A K

1999-06-01

The intergenic region (IGR) of the medium (M) RNA of tomato spotted wilt Tospovirus (TSWV) isolates naturally infecting peanut (groundnut), pepper, potato, stokesia, tobacco and watermelon in Georgia (GA) and a peanut isolate from Florida (FL) was cloned and sequenced. The IGR sequences were compared with one another and with respective M RNA IGRs of TSWV isolates from Brazil and Japan and other tospoviruses. The length of M IGR of GA and FL isolates varied from 271 to 277 nucleotides. The M IGRs of TSWV from potato and stokesia, and tobacco and watermelon were identical with each other in their length and sequence. IGR sequences were more conserved (95-100%) among the populations of TSWV from GA and FL, than when compared with those of TSWV isolates from other countries (83-94%). The conserved motif (CAAACTTTGG) present in the IGRs of both M and small (S) RNAs of a Brazilian isolate of TSWV was also conserved in the isolates studied. Cluster analysis of the IGR sequences showed that all GA and FL isolates are closely clustered and are distinct from the TSWV isolates from other countries as well as from other tospoviruses.
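As an illustration of how a conserved motif such as CAAACTTTGG can be located in a cloned IGR sequence, the following Python sketch reports every start position of the motif. The function name and the example sequence are hypothetical; only the motif itself comes from the abstract above.

```python
MOTIF = "CAAACTTTGG"  # conserved motif quoted in the abstract


def motif_positions(sequence, motif=MOTIF):
    """Return the 0-based start positions of every occurrence of motif."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Continue one base further on so overlapping hits are also found.
        start = sequence.find(motif, start + 1)
    return positions


# Hypothetical sequence fragment, for illustration only.
example = "attCAAACTTTGGata"
print(motif_positions(example))
```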
Assessing host-specificity of Escherichia coli using a supervised learning logic-regression-based analysis of single nucleotide polymorphisms in intergenic regions.

PubMed

Zhi, Shuai; Li, Qiaozhi; Yasui, Yutaka; Edge, Thomas; Topp, Edward; Neumann, Norman F

2015-11-01

Host specificity in E. coli is widely debated. Herein, we used supervised learning logic-regression-based analysis of intergenic DNA sequence variability in E. coli in an attempt to identify single nucleotide polymorphism (SNP) biomarkers of E. coli that are associated with natural selection and evolution toward host specificity. Seven-hundred and eighty strains of E. coli were isolated from 15 different animal hosts. We utilized logic regression for analyzing DNA sequence data of three intergenic regions (flanked by the genes uspC-flhDC, csgBAC-csgDEFG, and asnS-ompF) to identify genetic biomarkers that could potentially discriminate E. coli based on host sources. Across 15 different animal hosts, logic regression successfully discriminated E. coli based on animal host source with relatively high specificity (i.e., among the samples of the non-target animal host, the proportion that correctly did not have the host-specific marker pattern) and sensitivity (i.e., among the samples from a given animal host, the proportion that correctly had the host-specific marker pattern), even after fivefold cross validation. Permutation tests confirmed that for most animals, host specific intergenic biomarkers identified by logic regression in E. coli were significantly associated with animal host source. The highest level of biomarker sensitivity was observed in deer isolates, with 82% of all deer E. coli isolates displaying a unique SNP pattern that was 98% specific to deer. Fifty-three percent of human isolates displayed a unique biomarker pattern that was 98% specific to humans. Twenty-nine percent of cattle isolates displayed a unique biomarker that was 97% specific to cattle. Interestingly, even within a related host group (i.e., Family: Canidae [domestic dogs and coyotes]), highly specific SNP biomarkers (98% and 99% specificity for dog and coyotes, respectively) were observed, with 21% of dog E. coli isolates displaying a unique dog biomarker and 61% of coyote isolates displaying a unique coyote biomarker. Application of a supervised learning method, such as logic regression, to DNA sequence analysis at certain intergenic regions demonstrates that some E. coli strains may evolve to become host-specific.
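The definitions of sensitivity and specificity given in this abstract can be written out as a short Python sketch. The function name and the counts in the example are hypothetical and are not the study's data; the sketch only restates the two proportions as defined above.

```python
def sensitivity_specificity(has_marker, is_target_host):
    """Compute (sensitivity, specificity) for a host-specific marker pattern.

    has_marker[i]     -- True if isolate i carries the marker pattern
    is_target_host[i] -- True if isolate i came from the target animal host
    """
    tp = fn = tn = fp = 0
    for marker, target in zip(has_marker, is_target_host):
        if target and marker:
            tp += 1  # target-host isolate that carries the pattern
        elif target:
            fn += 1  # target-host isolate that lacks the pattern
        elif marker:
            fp += 1  # non-target isolate that carries the pattern
        else:
            tn += 1  # non-target isolate that lacks the pattern
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical counts: 10 target-host isolates (8 with the pattern) and
# 50 non-target isolates (1 with the pattern).
has_marker = [True] * 8 + [False] * 2 + [True] * 1 + [False] * 49
is_target = [True] * 10 + [False] * 50
print(sensitivity_specificity(has_marker, is_target))
```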
Gut Microbial Diversity in Rat Model Induced by Rhubarb

PubMed Central

Peng, Ying; Wu, Chunfu; Yang, Jingyu; Li, Xiaobo

2014-01-01

Rhubarb is often used to establish chronic diarrhea and spleen (Pi)-deficiency syndrome animal models in China. In this study, we utilized the enterobacterial repetitive intergenic consensus-polymerase chain reaction (ERIC-PCR) method to detect changes in bacterial diversity in feces and the bowel mucosa associated with this model. Total microbial genomic DNA from the small bowel (duodenum, jejunum, and ileum), large bowel (proximal colon, distal colon, and rectum), cecum, and feces of normal and rhubarb-exposed rats were used as templates for the ERIC-PCR analysis. We found that the fecal microbial composition did not correspond to the bowel bacteria mix. More bacterial diversity was observed in the ileum of rhubarb-exposed rats (P<0.05). Furthermore, a 380 bp product was found to be increased in rhubarb-exposed rats both in feces and the bowel mucosa. The product was cloned and sequenced and showed high similarity with regions of the Bacteroides genome. As a result of discriminant analysis with the SPSS software, the Canonical Discriminant Function Formulae for model rats was established. PMID:25048267
PCR Methods for Rapid Identification and Characterization of Actinobacillus seminis Strains

PubMed Central

Appuhamy, S.; Coote, J. G.; Low, J. C.; Parton, R.

1998-01-01

Twenty-four isolates of Actinobacillus seminis were typed by PCR ribotyping, repetitive extragenic palindromic element (REP)-based PCR, and enterobacterial repetitive intergenic consensus (ERIC)-based PCR. Five types were distinguished by REP-PCR, and nine types were distinguished by ERIC-PCR. PCR ribotyping produced the simplest pattern and could be useful for identification of A. seminis and for its differentiation from related species. REP- and ERIC-PCR could be used for strain differentiation in epidemiological studies of A. seminis. PMID:9508320
Plasmid integration in a wide range of bacteria mediated by the integrase of Lactobacillus delbrueckii bacteriophage mv4.

PubMed Central

Auvray, F; Coddeville, M; Ritzenthaler, P; Dupont, L

1997-01-01

Bacteriophage mv4 is a temperate phage infecting Lactobacillus delbrueckii subsp. bulgaricus. During lysogenization, the phage integrates its genome into the host chromosome at the 3' end of a tRNA(Ser) gene through a site-specific recombination process (L. Dupont et al., J. Bacteriol., 177:586-595, 1995). A nonreplicative vector (pMC1) based on the mv4 integrative elements (attP site and integrase-coding int gene) is able to integrate into the chromosome of a wide range of bacterial hosts, including Lactobacillus plantarum, Lactobacillus casei (two strains), Lactococcus lactis subsp. cremoris, Enterococcus faecalis, and Streptococcus pneumoniae. Integrative recombination of pMC1 into the chromosomes of all of these species is dependent on the int gene product and occurs specifically at the pMC1 attP site. The isolation and sequencing of pMC1 integration sites from these bacteria showed that in lactobacilli, pMC1 integrated into the conserved tRNA(Ser) gene. In the other bacterial species, where this tRNA gene is less conserved or not conserved, secondary integration sites either in potential protein-coding regions or in intergenic DNA were used. A consensus sequence was deduced from the analysis of the different integration sites. The comparison of these sequences demonstrated the flexibility of the integrase for the bacterial integration site and suggested the importance of the trinucleotide CCT at the 5' end of the core in the strand exchange reaction. PMID:9068626
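One simple way a consensus can be deduced from a set of aligned sites is a column-wise majority vote, sketched below in Python. This is an assumption made for illustration and is not necessarily the procedure the authors used. The function name, the 50% threshold, and the example sequences are hypothetical.

```python
from collections import Counter


def consensus(aligned_sites, threshold=0.5):
    """Return a majority-rule consensus of equal-length aligned sequences.

    A column is written as its most frequent base when that base occurs in
    more than `threshold` of the sequences, otherwise as 'N'.
    """
    if not aligned_sites:
        raise ValueError("no sequences given")
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("sequences must be aligned to the same length")
    result = []
    for column in zip(*aligned_sites):
        base, count = Counter(column).most_common(1)[0]
        result.append(base if count / len(column) > threshold else "N")
    return "".join(result)


# Hypothetical aligned sites, for illustration only.
sites = ["CCTAGT", "CCTGGT", "CCTAGA", "CCTCGT"]
print(consensus(sites))
```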
Burkholderia sp. induces functional nodules on the South African invasive legume Dipogon lignosus (Phaseoleae) in New Zealand soils.

PubMed

Liu, Wendy Y Y; Ridgway, Hayley J; James, Trevor K; James, Euan K; Chen, Wen-Ming; Sprent, Janet I; Young, J Peter W; Andrews, Mitchell

2014-10-01

The South African invasive legume Dipogon lignosus (Phaseoleae) produces nodules with both determinate and indeterminate characteristics in New Zealand (NZ) soils. Ten bacterial isolates produced functional nodules on D. lignosus. The 16S ribosomal RNA (rRNA) gene sequences identified one isolate as Bradyrhizobium sp., one isolate as Rhizobium sp. and eight isolates as Burkholderia sp. The Bradyrhizobium sp. and Rhizobium sp. 16S rRNA sequences were identical to those of strains previously isolated from crop plants and may have originated from inocula used on crops. Both 16S rRNA and DNA recombinase A (recA) gene sequences placed the eight Burkholderia isolates separate from previously described Burkholderia rhizobial species. However, the isolates showed a very close relationship to Burkholderia rhizobial strains isolated from South African plants with respect to their nitrogenase iron protein (nifH), N-acyltransferase nodulation protein A (nodA) and N-acetylglucosaminyl transferase nodulation protein C (nodC) gene sequences. Gene sequences and enterobacterial repetitive intergenic consensus (ERIC) PCR and repetitive element palindromic PCR (rep-PCR) banding patterns indicated that the eight Burkholderia isolates separated into five clones of one strain and three of another. One strain was tested and shown to produce functional nodules on a range of South African plants previously reported to be nodulated by Burkholderia tuberum STM678(T) which was isolated from the Cape Region. Thus, evidence is strong that the Burkholderia strains isolated here originated in South Africa and were somehow transported with the plants from their native habitat to NZ. It is possible that the strains are of a new species capable of nodulating legumes.
Complete genome sequences of avian paramyxovirus type 8 strains goose/Delaware/1053/76 and pintail/Wakuya/20/78

PubMed Central

Paldurai, Anandan; Subbiah, Madhuri; Kumar, Sachin; Collins, Peter L.; Samal, Siba K.

2009-01-01

Complete consensus genome sequences were determined for avian paramyxovirus type 8 (APMV-8) strains goose/Delaware/1053/76 (prototype strain) and pintail/Wakuya/20/78. The genome of each strain is 15,342 nucleotides (nt) long, which follows the “rule of six”. The genome consists of six genes in the order of 3′-N-P/V/W-M-F-HN-L-5′. The genes are flanked on either side by conserved transcription start and stop signals, and have intergenic regions ranging from 1 to 30 nt. The genome contains a 55 nt leader region at the 3′-end and a 171 nt trailer region at the 5′-end. Comparison of sequences of strains Delaware and Wakuya showed nucleotide identity of 96.8% at the genome level and amino acid identities of 99.3%, 96.5%, 98.6%, 99.4%, 98.6% and 99.1% for the predicted N, P, M, F, HN and L proteins, respectively. Both strains grew in embryonated chicken eggs and in primary chicken embryo kidney cells, and 293T cells. Both strains contained only a single basic residue at the cleavage activation site of the F protein and their efficiency of replication in vitro depended on and was augmented by, the presence of exogenous protease in most cell lines. Sequence alignment and phylogenic analysis of the predicted amino acid sequence of APMV-8 strain Delaware proteins with the cognate proteins of other available APMV serotypes showed that APMV-8 is more closely related to APMV-2 and -6 than to APMV-1, -3 and -4. PMID:19341613
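The "rule of six" mentioned in this abstract means that the genome length is an exact multiple of six nucleotides. A minimal Python check, using the genome length reported above, is sketched here; the function name is hypothetical.

```python
def follows_rule_of_six(genome_length_nt):
    """Return True if the genome length is an exact multiple of six."""
    return genome_length_nt % 6 == 0


# Genome length reported for both APMV-8 strains in the abstract.
genome_length = 15342
print(follows_rule_of_six(genome_length), genome_length // 6)
```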
A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

PubMed Central

2018-01-01

FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22s), in or close to the 22q11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE Office of Scientific and Technical Information (OSTI.GOV)

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE PAGES

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

2015-11-11

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Epidemiology of vampire bat-transmitted rabies virus in Goiás, central Brazil: re-evaluation based on G-L intergenic region

PubMed Central

2010-01-01

Background Vampire bat-related rabies harms both the livestock industry and the public health sector in central Brazil. The geographical distributions of vampire bat-transmitted rabies virus variants are delimited by mountain chains. These findings were elucidated by analyzing a highly conserved nucleoprotein gene. This study aims to elucidate the detailed epidemiological characteristics of vampire bat-transmitted rabies virus by phylogenetic methods based on a 619-nt sequence including the nonconserved G-L intergenic region. Findings The vampire bat-transmitted rabies virus isolates divided into 8 phylogenetic lineages in the previous nucleoprotein gene analysis were divided into 10 phylogenetic lineages with significant bootstrap values. The distributions of most variants were reconfirmed to be delimited by mountain chains. Furthermore, variants in undulating areas have narrow distributions and are apparently separated by mountain ridges. Conclusions This study demonstrates that the 619-nt sequence including the G-L intergenic region is more useful for a state-level phylogenetic analysis of rabies virus than the partial nucleoprotein gene, and that the distribution of vampire bat-transmitted RABV variants tends to be separated not only by mountain chains but also by mountain ridges, suggesting that the diversity of vampire bat-transmitted RABV variants was delimited by geographical undulations. PMID:21059233
Genome-Wide Prediction and Validation of Intergenic Enhancers in Arabidopsis Using Open Chromatin Signatures

PubMed Central

Zhu, Bo; Zhang, Wenli; Jiang, Jiming

2015-01-01

Enhancers are important regulators of gene expression in eukaryotes. Enhancers function independently of their distance and orientation to the promoters of target genes. Thus, enhancers have been difficult to identify. Only a few enhancers, especially distant intergenic enhancers, have been identified in plants. We developed an enhancer prediction system based exclusively on the DNase I hypersensitive sites (DHSs) in the Arabidopsis thaliana genome. A set of 10,044 DHSs located in intergenic regions, which are away from any gene promoters, were predicted to be putative enhancers. We examined the functions of 14 predicted enhancers using the β-glucuronidase gene reporter. Ten of the 14 (71%) candidates were validated by the reporter assay. We also designed 10 constructs using intergenic sequences that are not associated with DHSs, and none of these constructs showed enhancer activities in reporter assays. In addition, the tissue specificity of the putative enhancers can be precisely predicted based on DNase I hypersensitivity data sets developed from different plant tissues. These results suggest that the open chromatin signature-based enhancer prediction system developed in Arabidopsis may serve as a universal system for enhancer identification in plants. PMID:26373455
Variation in the genomic locations and sequence conservation of STAR elements among staphylococcal species provides insight into DNA repeat evolution

PubMed Central

2012-01-01

Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678
A Functional Element Necessary for Fetal Hemoglobin Silencing

PubMed Central

Sankaran, Vijay G.; Xu, Jian; Byron, Rachel; Greisman, Harvey A.; Fisher, Chris; Weatherall, David J.; Sabath, Daniel E.; Groudine, Mark; Orkin, Stuart H.; Premawardhena, Anuja; Bender, M.A.

2011-01-01

BACKGROUND An improved understanding of the regulation of the fetal hemoglobin genes holds promise for the development of targeted therapeutic approaches for fetal hemoglobin induction in the β-hemoglobinopathies. Although recent studies have uncovered trans-acting factors necessary for this regulation, limited insight has been gained into the cis-regulatory elements involved. METHODS We identified three families with unusual patterns of hemoglobin expression, suggestive of deletions in the locus of the β-globin gene (β-globin locus). We performed array comparative genomic hybridization to map these deletions and confirmed breakpoints by means of polymerase-chain-reaction assays and DNA sequencing. We compared these deletions, along with previously mapped deletions, and studied the trans-acting factors binding to these sites in the β-globin locus by using chromatin immunoprecipitation. RESULTS We found a new (δβ)0-thalassemia deletion and a rare hereditary persistence of fetal hemoglobin deletion with identical downstream breakpoints. Comparison of the two deletions resulted in the identification of a small intergenic region required for γ-globin (fetal hemoglobin) gene silencing. We mapped a Kurdish β0-thalassemia deletion, which retains the required intergenic region, deletes other surrounding sequences, and maintains fetal hemoglobin silencing. By comparing these deletions and other previously mapped deletions, we elucidated a 3.5-kb intergenic region near the 5′ end of the δ-globin gene that is necessary for γ-globin silencing. We found that a critical fetal hemoglobin silencing factor, BCL11A, and its partners bind within this region in the chromatin of adult erythroid cells. CONCLUSIONS By studying three families with unusual deletions in the β-globin locus, we identified an intergenic region near the δ-globin gene that is necessary for fetal hemoglobin silencing. (Funded by the National Institutes of Health and others.) PMID:21879898
Theria-Specific Homeodomain and cis-Regulatory Element Evolution of the Dlx3–4 Bigene Cluster in 12 Different Mammalian Species

PubMed Central

SUMIYAMA, KENTA; MIYAKE, TSUTOMU; GRIMWOOD, JANE; STUART, ANDREW; DICKSON, MARK; SCHMUTZ, JEREMY; RUDDLE, FRANK H.; MYERS, RICHARD M.; AMEMIYA, CHRIS T.

2013-01-01

The mammalian Dlx3 and Dlx4 genes are configured as a bigene cluster, and their respective expression patterns are controlled temporally and spatially by cis-elements that largely reside within the intergenic region of the cluster. Previous work revealed that there are conspicuously conserved elements within the intergenic region of the Dlx3–4 bigene clusters of mouse and human. In this paper we have extended these analyses to include 12 additional mammalian taxa (including a marsupial and a monotreme) in order to better define the nature and molecular evolutionary trends of the coding and non-coding functional elements among morphologically divergent mammals. Dlx3–4 regions were fully sequenced from 12 divergent taxa of interest. We identified three theria-specific amino acid replacements in homeodomain of Dlx4 gene that functions in placenta. Sequence analyses of constrained nucleotide sites in the intergenic non-coding region showed that many of the intergenic conserved elements are highly conserved and have evolved slowly within the mammals. In contrast, a branchial arch/craniofacial enhancer I37-2 exhibited accelerated evolution at the branch between the monotreme and therian common ancestor despite being highly conserved among therian species. Functional analysis of I37-2 in transgenic mice has shown that the equivalent region of the platypus fails to drive transcriptional activity in branchial arches. These observations, taken together with our molecular evolutionary data, suggest that theria-specific episodic changes in the I37-2 element may have contributed to craniofacial innovation at the base of the mammalian lineage. PMID:22951979

Discrimination of probiotic Lactobacillus strains for poultry by repetitive sequenced-based PCR fingerprinting.

PubMed

Lee, Chin Mei; Sieo, Chin Chin; Cheah, Yoke-Kqueen; Abdullah, Norhani; Ho, Yin Wan

2012-02-01

Four repetitive element sequence-based polymerase chain reaction (rep-PCR) methods, namely repetitive extragenic palindromic PCR (REP-PCR), enterobacterial repetitive intergenic consensus PCR (ERIC-PCR), polytrinucleotide (GTG)₅-PCR and BOX-PCR, were evaluated for the molecular differentiation of 12 probiotic Lactobacillus strains previously isolated from the gastrointestinal tract of chickens and used as a multistrain probiotic. This study represents the first analysis of the comparative efficacy of these four rep-PCR methods and their combination (composite rep-PCR) in the molecular typing of Lactobacillus strains based on a discriminatory index (D). Species-specific and strain-specific profiles were observed from rep-PCR. From the numerical analysis of composite rep-PCR, BOX-PCR, (GTG)₅-PCR, REP-PCR and ERIC-PCR, D values of 0.9118, 0.9044, 0.8897, 0.8750 and 0.8529 respectively were obtained. Composite rep-PCR analysis was the most discriminative method, with eight Lactobacillus strains, namely L. brevis ATCC 14869(T), L. reuteri C 10, L. reuteri ATCC 23272(T), L. gallinarum ATCC 33199(T), L. salivarius ATCC 11741(T), L. salivarius I 24, L. panis JCM 11053(T) and L. panis C 17, being differentiated at the strain level. Composite rep-PCR analysis is potentially a useful fingerprinting method to discriminate probiotic Lactobacillus strains isolated from the gastrointestinal tract of chickens. Copyright © 2011 Society of Chemical Industry.
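The abstract above ranks typing methods by a discriminatory index (D) but does not state the formula. A minimal sketch follows, assuming the commonly used Hunter-Gaston form of Simpson's index, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates sharing fingerprint type j and N is the total number of isolates. The function name and the example counts are hypothetical and are not taken from the study.

```python
def discriminatory_index(type_counts):
    """Hunter-Gaston discriminatory index (assumed formula, see lead-in).

    type_counts: number of isolates assigned to each distinct fingerprint type.
    Returns the probability that two isolates drawn without replacement
    belong to different types.
    """
    n = sum(type_counts)
    if n < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(c * (c - 1) for c in type_counts)
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical example: 8 isolates falling into 5 fingerprint types.
example_d = discriminatory_index([3, 2, 1, 1, 1])
```

A value of 1.0 means every isolate has its own profile; lower values mean more isolates share a profile, which is how the methods in the abstract can be compared on one scale.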
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

DOE Office of Scientific and Technical Information (OSTI.GOV)

Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

2003-12-31

Due to their high degree of conservation, functional regions in noncoding DNA can be identified by comparing DNA sequences among evolutionarily distantly related genomes. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

PubMed Central

2010-01-01

Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
Terminator Detection by Support Vector Machine Utilizing a Stochastic Context-Free Grammar

DOE Office of Scientific and Technical Information (OSTI.GOV)

Francis-Lyon, Patricia; Cristianini, Nello; Holbrook, Stephen

2006-12-30

A 2-stage detector was designed to find rho-independent transcription terminators in the Escherichia coli genome. The detector includes a Stochastic Context Free Grammar (SCFG) component and a Support Vector Machine (SVM) component. To find terminators, the SCFG searches the intergenic regions of nucleotide sequence for local matches to a terminator grammar that was designed and trained utilizing examples of known terminators. The grammar selects sequences that are the best candidates for terminators and assigns them a prefix, stem-loop, suffix structure using the Cocke-Younger-Kasami (CYK) algorithm, modified to incorporate energy effects of base pairing. The parameters from this inferred structure are passed to the SVM classifier, which distinguishes terminators from non-terminators that score high according to the terminator grammar. The SVM was trained with negative examples drawn from intergenic sequences that include both featureless and RNA gene regions (which were assigned prefix, stem-loop, suffix structure by the SCFG), so that it successfully distinguishes terminators from either of these. The classifier was found to be 96.4% successful during testing.
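For readers unfamiliar with the CYK algorithm named in the abstract above, the following is a minimal sketch of the plain, non-stochastic recognizer for a grammar in Chomsky normal form. It is not the authors' detector: their version is stochastic and modified for base-pairing energy, and their terminator grammar is not given in the abstract. The toy grammar (`UNARY`, `BINARY`) and the function name `cyk_accepts` are illustrative assumptions.

```python
from itertools import product


def cyk_accepts(word, unary, binary, start="S"):
    """Return True if `word` can be derived from `start` under a CNF grammar.

    unary:  dict mapping a terminal to the set of nonterminals X with X -> terminal
    binary: dict mapping (Y, Z) to the set of nonterminals X with X -> Y Z
    """
    n = len(word)
    if n == 0:
        return False
    # table[i][k] holds the nonterminals that derive word[i : i + k + 1]
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, symbol in enumerate(word):
        table[i][0] = set(unary.get(symbol, ()))
    for span in range(2, n + 1):                 # substring length
        for i in range(n - span + 1):            # substring start
            for split in range(1, span):         # length of the left part
                left = table[i][split - 1]
                right = table[i + split][span - split - 1]
                for y, z in product(left, right):
                    table[i][span - 1] |= binary.get((y, z), set())
    return start in table[0][n - 1]


# Toy textbook grammar (hypothetical, unrelated to terminators):
# S -> AB | BC, A -> BA | a, B -> CC | b, C -> AB | a
UNARY = {"a": {"A", "C"}, "b": {"B"}}
BINARY = {
    ("A", "B"): {"S", "C"},
    ("B", "C"): {"S"},
    ("B", "A"): {"A"},
    ("C", "C"): {"B"},
}
```

A stochastic variant would store the best log-probability per nonterminal in each table cell instead of a set, which is the usual route to recovering the highest-scoring parse (here, the prefix, stem-loop, suffix structure).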
Molecular cloning and analysis of Schizosaccharomyces pombe Reb1p: sequence-specific recognition of two sites in the far upstream rDNA intergenic spacer.

PubMed Central

Zhao, A; Guo, A; Liu, Z; Pape, L

1997-01-01

The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S. pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S. cerevisiae and Kluyveromyces lactis REB1 proteins is uninterrupted and thus S. pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pombe rDNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
Molecular typing of Vibrio parahaemolyticus strains isolated from the Philippines by PCR-based methods.

PubMed

Maluping, R P; Ravelo, C; Lavilla-Pitogo, C R; Krovacek, K; Romalde, J L

2005-01-01

The main aim of the present study was to use three PCR-based techniques for the analysis of genetic variability among Vibrio parahaemolyticus strains isolated from the Philippines. Seventeen strains of V. parahaemolyticus isolated from shrimps (Penaeus monodon) and from the environments where these shrimps are being cultivated were analysed by random amplified polymorphic DNA PCR (RAPD-PCR), enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR) and repetitive extragenic palindromic PCR (REP-PCR). The results of this work have demonstrated genetic variability within the V. parahaemolyticus strains that were isolated from the Philippines. In addition, RAPD, ERIC and REP-PCR are suitable rapid typing methods for V. parahaemolyticus. All three methods have good discriminative ability and can be used as a rapid means of comparing V. parahaemolyticus strains for epidemiological investigation. Based on the results of this study, REP-PCR is inferior to RAPD and ERIC-PCR because it is less reproducible. Moreover, the REP-PCR analysis yielded a relatively small number of products. This may suggest that the REP sequences are not widely distributed in the V. parahaemolyticus genome. Genetic variability within V. parahaemolyticus strains isolated in the Philippines has been demonstrated. The presence of ERIC and REP sequences in the genome of this bacterial species was confirmed. The RAPD, ERIC and REP-PCR techniques are useful methods for molecular typing of V. parahaemolyticus strains. To our knowledge this is the first study of this kind carried out on V. parahaemolyticus strains isolated from the Philippines.
RNA polymerase V-dependent small RNAs in Arabidopsis originate from small, intergenic loci including most SINE repeats.

PubMed

Lee, Tzuu-fen; Gurazada, Sai Guna Ranjan; Zhai, Jixian; Li, Shengben; Simon, Stacey A; Matzke, Marjori A; Chen, Xuemei; Meyers, Blake C

2012-07-01

In plants, heterochromatin is maintained by a small RNA-based gene silencing mechanism known as RNA-directed DNA methylation (RdDM). RdDM requires the non-redundant functions of two plant-specific DNA-dependent RNA polymerases (RNAP), RNAP IV and RNAP V. RNAP IV plays a major role in siRNA biogenesis, while RNAP V may recruit DNA methylation machinery to target endogenous loci for silencing. Although small RNA-generating regions that are dependent on both RNAP IV and RNAP V have been identified previously, the genomic loci targeted by RNAP V for siRNA accumulation and silencing have not been described extensively. To characterize the RNAP V-dependent, heterochromatic siRNA-generating regions in the Arabidopsis genome, we deeply sequenced the small RNA populations of wild-type and RNAP V null mutant (nrpe1) plants. Our results showed that RNAP V-dependent siRNA-generating loci are associated predominately with short repetitive sequences in intergenic regions. Suppression of small RNA production from short repetitive sequences was also prominent in RdDM mutants including dms4, drd1, dms3 and rdm1, reflecting the known association of these RdDM effectors with RNAP V. The genomic regions targeted by RNAP V were small, with an estimated average length of 238 bp. Our results suggest that RNAP V affects siRNA production from genomic loci with features dissimilar to known RNAP IV-dependent loci. RNAP V, along with RNAP IV and DRM1/2, may target and silence a set of small, intergenic transposable elements located in dispersed genomic regions for silencing. Silencing at these loci may be actively reinforced by RdDM.
Characterisation of Bergeyella spp. isolated from the nasal cavities of piglets.

PubMed

Lorenzo de Arriba, M; Lopez-Serrano, S; Galofre-Mila, N; Aragon, V

2018-04-01

The aim of this study was to characterise bacteria in the genus Bergeyella isolated from the nasal passages of healthy piglets. Nasal swabs from 3 to 4 week-old piglets from eight commercial domestic pig farms and one wild boar farm were cultured under aerobic conditions. Twenty-nine Bergeyella spp. isolates were identified by partial 16S rRNA gene sequencing and 11 genotypes were discriminated by enterobacterial repetitive intergenic consensus (ERIC)-PCR. Bergeyella zoohelcum and Bergeyella porcorum were identified within the 11 genotypes. Bergeyella spp. isolates exhibited resistance to serum complement and phagocytosis, poor capacity to form biofilms and were able to adhere to epithelial cells. Maneval staining was consistent with the presence of a capsule. Multiple drug resistance (resistance to three or more classes of antimicrobial agents) was present in 9/11 genotypes, including one genotype isolated from wild boar with no history of antimicrobial use. In conclusion, Bergeyella spp. isolates from the nasal cavities of piglets showed some in vitro features indicative of a potential for virulence. Further studies are necessary to identify the role of Bergeyella spp. in disease and within the nasal microbiota of pigs. Copyright © 2018 The Author(s). Published by Elsevier Ltd. All rights reserved.
Contribution of Avian Salmonella enterica Isolates to Human Salmonellosis Cases in Constantine (Algeria)

PubMed Central

Elgroud, Rachid; Granier, Sophie A.; Marault, Muriel; Kerouanton, Annaëlle; Lezzar, Abdesslem; Bouzitouna-Bentchouala, Chafia; Brisabois, Anne; Millemann, Yves

2015-01-01

An epidemiological investigation was carried out on one hundred Salmonella isolates from broiler farms, slaughterhouses, and human patients in the Constantine region of Algeria, in order to explore the contribution of avian strains to human salmonellosis cases in this region over the same period of time. The isolates were characterized by phenotypic as well as genotypic methods. A large variety of antimicrobial resistance profiles was found among human isolates, while only seven profiles were found among avian isolates. Enterobacterial Repetitive Intergenic Consensus-PCR (ERIC-PCR), Insertion Sequence 200-PCR (IS200-PCR), and Pulsed Field Gel Electrophoresis (PFGE) resulted in the allocation of the isolates to 16, 20, and 34 different profiles, respectively. The 3 genotyping methods led to complementary results by underlining the clonality of some serovars with the diffusion and persistence of a single clone in the Constantine area as well as stressing the polymorphism present in isolates belonging to other serovars, indicating the diversity of potential reservoirs of nontyphoidal Salmonella. Altogether, our results seem to indicate that nontyphoidal avian Salmonella may play an important role in human salmonellosis in the Constantine region. PMID:26543858
Diversity of acetic acid bacteria present in healthy grapes from the Canary Islands.

PubMed

Valera, Maria José; Laich, Federico; González, Sara S; Torija, Maria Jesús; Mateo, Estibaliz; Mas, Albert

2011-11-15

The identification of acetic acid bacteria (AAB) from sound grapes from the Canary Islands is reported in the present study. No direct recovery of bacteria was possible in the most commonly used medium, so microvinifications were performed on grapes from Tenerife, La Palma and Lanzarote islands. Up to 396 AAB were isolated from those microvinifications and identified by 16S rRNA gene sequencing and phylogenetic analysis. With this method, Acetobacter pasteurianus, Acetobacter tropicalis, Gluconobacter japonicus and Gluconacetobacter saccharivorans were identified. However, no discrimination between the closely related species Acetobacter malorum and Acetobacter cerevisiae was possible. As previously described, 16S-23S rRNA gene internal transcribed spacer (ITS) region phylogenetic analysis was required to classify isolates as one of those species. These two species were the most frequently occurring, accounting for more than 60% of the isolates. For typing the AAB isolates, both the Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR and (GTG)5-PCR techniques gave similar resolution. A total of 60 profiles were identified. Thirteen of these profiles were found in more than one vineyard, and only one profile was found on two different islands (Tenerife and La Palma). Copyright © 2011 Elsevier B.V. All rights reserved.
Genic and Intergenic SSR Database Generation, SNPs Determination and Pathway Annotations, in Date Palm (Phoenix dactylifera L.).

PubMed

Mokhtar, Morad M; Adawy, Sami S; El-Assal, Salah El-Din S; Hussein, Ebtissam H A

2016-01-01

The present investigation aimed to use bioinformatics tools to identify and characterize simple sequence repeats (SSRs) within the third version of the date palm genome and to develop a new SSR primer database. In addition, single nucleotide polymorphisms (SNPs) located within the SSR flanking regions were recognized. Moreover, the pathways for the sequences assigned by SSR primers, the biological functions and gene interaction were determined. A total of 172,075 SSR motifs were identified in the date palm genome sequence, with a frequency of 450.97 SSRs per Mb. Of these, 130,014 SSRs (75.6%) were located within the intergenic regions, with a frequency of 499 SSRs per Mb, while only 42,061 SSRs (24.4%) were located within the genic regions, with a frequency of 347.5 SSRs per Mb. A total of 111,403 SSR primer pairs were designed, representing 291.9 SSR primers per Mb. Of the 111,403, only 31,380 SSR primers were in the genic regions, while 80,023 primers were in the intergenic regions. A total of 250,507 SNPs were recognized in 84,172 SSR flanking regions, which represents 75.55% of the total SSR flanking regions. Out of 12,274 genes, only 463 genes comprising 896 SSR primers were mapped onto 111 pathways using the KEGG database. The most abundant enzymes were identified in the pathway related to the biosynthesis of antibiotics. We tested 1031 SSR primers using both publicly available date palm genome sequences as templates in the in silico PCR reactions. Concerning in vitro validation, 31 SSR primers among those used in the in silico PCR were synthesized and tested for their ability to detect polymorphism among six Egyptian date palm cultivars. All tested primers successfully amplified products, but only 18 primers detected polymorphic amplicons among the studied date palm cultivars.
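The abstract above does not name the tool or thresholds used to detect SSRs, so the following is only a minimal sketch of the general idea: scan a sequence for perfect tandem repeats of a short motif with a regular expression, then express a count as a density per Mb, the unit used in the abstract. The function names, the motif length, and the minimum repeat count are assumptions for illustration.

```python
import re


def find_ssrs(seq, motif_len=2, min_repeats=4):
    """Find perfect tandem repeats of a motif of length `motif_len`.

    Returns a list of (start, motif, repeat_count) tuples with 0-based starts.
    Motifs made of a single repeated base (e.g. "AA") are skipped when
    motif_len > 1, since they are really mononucleotide runs.
    """
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if motif_len > 1 and len(set(motif)) == 1:
            continue
        hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


def ssrs_per_mb(n_ssrs, length_bp):
    """Convert an SSR count over `length_bp` base pairs to SSRs per Mb."""
    return n_ssrs / (length_bp / 1_000_000)


# Hypothetical toy sequence containing one (AC)5 repeat.
toy_hits = find_ssrs("TTGACACACACACATT")
```

Dedicated SSR mining tools also handle imperfect and compound repeats, which this sketch deliberately ignores.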
The 28S–18S rDNA intergenic spacer from Crithidia fasciculata: repeated sequences, length heterogeneity, putative processing sites and potential interactions between U3 small nucleolar RNA and the ribosomal RNA precursor

PubMed Central

Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.

2000-01-01

In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C. fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C. fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C. fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Centromeres: long intergenic spaces with adaptive features.

PubMed

Kanizay, Lisa; Dawe, R Kelly

2009-08-01

Centromeres are composed of inner kinetochore proteins, which are largely conserved across species, and repetitive DNA, which shows comparatively little sequence conservation. Due to this fundamental paradox the formation and maintenance of centromeres remains largely a mystery. However, it has become increasingly clear that a long-standing balance between epigenetic and genetic control governs the interactions of centromeric DNA and inner kinetochore proteins. The comparison of classical neocentromeres in plants, which are entirely genetic in their mode of operation, and clinical neocentromeres, which are sequence-independent, illustrates the conflict between genetics and epigenetics in regions that control their own transmission to progeny. Tandem repeat arrays present in centromeres may have an origin in meiotic drive or other selfish patterns of evolution, as is the case for the CENP-B box and CENP-B protein in human. In grasses retrotransposons have invaded centromeres to the point of complete domination, consequently breaking genetic regulation at these centromeres. The accumulation of tandem repeats and transposons causes centromeres to expand in size, effectively pushing genes to the sides and opening the centromere to ever fewer constraints on the DNA sequence. On genetic maps centromeres appear as long intergenic spaces that evolve rapidly and apparently without regard to host fitness.
16S-23S rDNA intergenic spacer region polymorphism of Lactococcus garvieae, Lactococcus raffinolactis and Lactococcus lactis as revealed by PCR and nucleotide sequence analysis.

PubMed

Blaiotta, Giuseppe; Pepe, Olimpia; Mauriello, Gianluigi; Villani, Francesco; Andolfi, Rosamaria; Moschetti, Giancarlo

2002-12-01

The intergenic spacer region (ISR) between the 16S and 23S rRNA genes was tested as a tool for differentiating lactococci commonly isolated in a dairy environment. 17 reference strains, representing 11 different species belonging to the genera Lactococcus, Streptococcus, Lactobacillus, Enterococcus and Leuconostoc, and 127 wild streptococcal strains isolated during the whole fermentation process of "Fior di Latte" cheese were analyzed. After 16S-23S rDNA ISR amplification by PCR, species or genus-specific patterns were obtained for most of the reference strains tested. Moreover, results obtained after nucleotide analysis show that the 16S-23S rDNA ISR sequences vary greatly, in size and sequence, among Lactococcus garvieae, Lactococcus raffinolactis, Lactococcus lactis as well as other streptococci from dairy environments. Because of the high degree of inter-specific polymorphism observed, 16S-23S rDNA ISR can be considered a good potential target for selecting species-specific molecular assays, such as PCR primer or probes, for a rapid and extremely reliable differentiation of dairy lactococcal isolates.
Fungal Genes in Context: Genome Architecture Reflects Regulatory Complexity and Function

PubMed Central

Noble, Luke M.; Andrianopoulos, Alex

2013-01-01

Gene context determines gene expression, with local chromosomal environment most influential. Comparative genomic analysis is often limited in scope to conserved or divergent gene and protein families, and fungi are well suited to this approach with low functional redundancy and relatively streamlined genomes. We show here that one aspect of gene context, the amount of potential upstream regulatory sequence maintained through evolution, is highly predictive of both molecular function and biological process in diverse fungi. Orthologs with large upstream intergenic regions (UIRs) are strongly enriched in information processing functions, such as signal transduction and sequence-specific DNA binding, and, in the genus Aspergillus, include the majority of experimentally studied, high-level developmental and metabolic transcriptional regulators. Many uncharacterized genes are also present in this class and, by implication, may be of similar importance. Large intergenic regions also share two novel sequence characteristics, currently of unknown significance: they are enriched for plus-strand polypyrimidine tracts and an information-rich, putative regulatory motif that was present in the last common ancestor of the Pezizomycotina. Systematic consideration of gene UIR in comparative genomics, particularly for poorly characterized species, could help reveal organisms’ regulatory priorities. PMID:23699226
Consensus generation and variant detection by Celera Assembler.

PubMed

Denisov, Gennady; Walenz, Brian; Halpern, Aaron L; Miller, Jason; Axelrod, Nelson; Levy, Samuel; Sutton, Granger

2008-04-15

We present an algorithm to identify allelic variation given a Whole Genome Shotgun (WGS) assembly of haploid sequences, and to produce a set of haploid consensus sequences rather than a single consensus sequence. Existing WGS assemblers take a column-by-column approach to consensus generation, and produce a single consensus sequence which can be inconsistent with the underlying haploid alleles, and inconsistent with any of the aligned sequence reads. Our new algorithm uses a dynamic windowing approach. It detects alleles by simultaneously processing the portions of aligned reads spanning a region of sequence variation, assigns reads to their respective alleles, phases adjacent variant alleles and generates a consensus sequence corresponding to each confirmed allele. This algorithm was used to produce the first diploid genome sequence of an individual human. It can also be applied to assemblies of multiple diploid individuals and hybrid assemblies of multiple haploid organisms. Applied to the individual human genome assembly, the new algorithm detects exactly two confirmed alleles and reports two consensus sequences in 98.98% of the 2,033,311 detected regions of sequence variation. In 33,269 out of 460,373 detected regions of size >1 bp, it fixes the constructed errors of a mosaic haploid representation of a diploid locus as produced by the original Celera Assembler consensus algorithm. Using an optimized procedure calibrated against 1,506,344 known SNPs, it detects 438,814 new heterozygous SNPs with a false positive rate of 12%. The open source code is available at: http://wgs-assembler.cvs.sourceforge.net/wgs-assembler/
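The problem with column-by-column consensus described in the abstract above can be illustrated with a small sketch. This is not the Celera Assembler code: the function name, the `.` marker for uncovered positions, and the toy reads are assumptions made for illustration. Each column is decided by majority vote independently, so when coverage of the two alleles differs from column to column, the result can be a mosaic that matches neither allele.

```python
from collections import Counter


def column_consensus(reads, gap="."):
    """Majority-vote consensus computed independently for each alignment column.

    reads: equal-length strings; `gap` marks positions a read does not cover.
    Columns with no coverage are reported as "N".
    """
    length = len(reads[0])
    consensus = []
    for col in range(length):
        bases = [read[col] for read in reads if read[col] != gap]
        consensus.append(Counter(bases).most_common(1)[0][0] if bases else "N")
    return "".join(consensus)


# Hypothetical diploid locus: the two alleles differ at columns 1 and 3.
ALLELE_1 = "AACCA"
ALLELE_2 = "AGCTA"
READS = [
    "AAC..",  # allele 1, covers only the first variant
    "AAC..",  # allele 1
    "AGCTA",  # allele 2, spans both variants
    "..CTA",  # allele 2, covers only the second variant
    "..CTA",  # allele 2
]
mosaic = column_consensus(READS)
```

Here column 1 is won by allele 1 and column 3 by allele 2, so the consensus combines variants that never occur together on one read. Processing the reads that span a variant region jointly, as the abstract describes, is what avoids this.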
Evidence That Intergenic Spacer Repeats of Drosophila melanogaster rRNA Genes Function as X-Y Pairing Sites in Male Meiosis, and a General Model for Achiasmatic Pairing

PubMed Central

McKee, B. D.; Habera, L.; Vrana, J. A.

1992-01-01

In Drosophila melanogaster males, X-Y meiotic chromosome pairing is mediated by the nucleolus organizers (NOs) which are located in the X heterochromatin (Xh) and near the Y centromere. Deficiencies for Xh disrupt X-Y meiotic pairing and cause high frequencies of X-Y nondisjunction. Insertion of cloned rRNA genes on an Xh(-) chromosome partially restores normal X-Y pairing and disjunction. To map the sequences within an inserted, X-linked rRNA gene responsible for stimulating X-Y pairing, partial deletions were generated by P element-mediated destabilization of the insert. Complete deletions of the rRNA transcription unit did not interfere with the ability to stimulate X-Y pairing as long as most of the intergenic spacer (IGS) remained. Within groups of deletions that lacked the entire transcription unit and differed only in length of residual IGS material, pairing ability was proportional to the dose of 240-bp intergenic spacer repeats. Deletions of the complete rRNA transcription unit or of the 28S sequences alone blocked nucleolus formation, as determined by binding of an antinucleolar antibody, yet did not interfere with pairing ability, suggesting that X-Y pairing may not be mechanistically related to nucleolus formation. A model for achiasmatic pairing in Drosophila males based upon the combined action of topoisomerase I and a strand transferase is proposed. PMID:1330825
Diversities in Virulence, Antifungal Activity, Pigmentation and DNA Fingerprint among Strains of Burkholderia glumae

PubMed Central

Karki, Hari S.; Shrestha, Bishnu K.; Han, Jae Woo; Groth, Donald E.; Barphagha, Inderjit K.; Rush, Milton C.; Melanson, Rebecca A.; Kim, Beom Seok; Ham, Jong Hyun

2012-01-01

Burkholderia glumae is the primary causal agent of bacterial panicle blight of rice. In this study, 11 naturally avirulent and nine virulent strains of B. glumae native to the southern United States were characterized in terms of virulence in rice and onion, toxoflavin production, antifungal activity, pigmentation and genomic structure. Virulence of B. glumae strains on rice panicles was highly correlated to virulence on onion bulb scales, suggesting that onion bulb can be a convenient alternative host system to efficiently determine the virulence of B. glumae strains. Production of toxoflavin, the phytotoxin that functions as a major virulence factor, was closely associated with the virulence phenotypes of B. glumae strains in rice. Some strains of B. glumae showed various levels of antifungal activity against Rhizoctonia solani, the causal agent of sheath blight, and pigmentation phenotypes on casamino acid-peptone-glucose (CPG) agar plates regardless of their virulence traits. Purple and yellow-green pigments were partially purified from a pigmenting strain of B. glumae, 411gr-6, and the purple pigment fraction showed a strong antifungal activity against Colletotrichum orbiculare. Genetic variations were detected among the B. glumae strains from DNA fingerprinting analyses by repetitive element sequence-based PCR (rep-PCR) for BOX-A1R-based repetitive extragenic palindromic (BOX) or enterobacterial repetitive intergenic consensus (ERIC) sequences of bacteria; and close genetic relatedness among virulent but pigment-deficient strains was revealed by clustering analyses of DNA fingerprints from BOX- and ERIC-PCR. PMID:23028972
A 12-year molecular survey of clinical herpes simplex virus type 2 isolates demonstrates the circulation of clade A and B strains in Germany.

PubMed

Schmidt-Chanasit, Jonas; Bialonski, Alexandra; Heinemann, Patrick; Ulrich, Rainer G; Günther, Stephan; Rabenau, Holger F; Doerr, Hans Wilhelm

2010-07-01

Recently, two different herpes simplex virus type 2 (HSV-2) clades (A and B) were described based on DNA sequence data of the glycoprotein E (gE), G (gG) and I (gI) genes. The objectives of this study were to type the circulating HSV-2 wild-type strains in Germany by a novel approach and to monitor potential changes in the molecular epidemiology between 1997 and 2008. A total of 64 clinical HSV-2 isolates were analyzed by a novel approach using the DNA sequences of the complete open reading frames of glycoprotein B (gB) and gG. Recombination analysis of the gB and gG gene sequences was performed to reveal intragenic recombinants. Based on the phylogenetic analysis of the gB coding DNA sequence, 8 of 64 (12%) isolates were classified as clade A strains and 56 of 64 (88%) isolates were classified as clade B strains. Analysis of the gG coding DNA sequence classified 4 (6%) isolates as clade A strains and 60 (94%) isolates as clade B strains. In comparison, the 8 isolates classified as clade A strains using the gB sequence data were classified as clade B strains when using the gG coding DNA sequence, suggesting intergenic recombination events. Intragenic recombination events were not detected. The first molecular survey of clinical HSV-2 isolates from Germany demonstrated the circulation of clade A and B strains and of intergenic recombinants over a period of 12 years. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Characterization of the intergenic RNA profile at abdominal-A and Abdominal-B in the Drosophila bithorax complex

PubMed Central

Bae, Esther; Calhoun, Vincent C.; Levine, Michael; Lewis, Edward B.; Drewell, Robert A.

2002-01-01

The correct spatial expression of two Drosophila bithorax complex (BX-C) genes, abdominal-A (abdA) and Abdominal-B (AbdB), is dependent on the 100-kb intergenic infraabdominal (iab) region. The iab region is known to contain a number of different domains (iab2 through iab8) that harbor cis-regulatory elements responsible for directing expression of abdA and AbdB in the second through eighth abdominal segments. Here, we use in situ hybridization to perform high-resolution mapping of the transcriptional activity in the iab control regions. We show that transcription of the control regions themselves is abundant and precedes activation of the abdA and AbdB genes. As with the homeotic genes of the BX-C, the transcription patterns of the RNAs from the iab control regions demonstrate colinearity with the sequence of the iab regions along the chromosome and the domains in the embryo under the control of the specific iab regions. These observations suggest that the intergenic RNAs may play a role in initiating cis regulation at the BX-C early in development. PMID:12481037

Different phylogenomic approaches to resolve the evolutionary relationships among model fish species.

PubMed

Negrisolo, Enrico; Kuhl, Heiner; Forcato, Claudio; Vitulo, Nicola; Reinhardt, Richard; Patarnello, Tomaso; Bargelloni, Luca

2010-12-01

Comparative genomics holds the promise to magnify the information obtained from individual genome sequencing projects, revealing common features conserved across genomes and identifying lineage-specific characteristics. To implement such a comparative approach, a robust phylogenetic framework is required to accurately reconstruct evolution at the genome level. Among vertebrate taxa, teleosts represent the second best characterized group, with high-quality draft genome sequences for five model species (Danio rerio, Gasterosteus aculeatus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis), and several others are in the finishing lane. However, the relationships among the acanthomorph teleost model fishes remain an unresolved taxonomic issue. Here, a genomic region spanning over 1.2 million base pairs was sequenced in the teleost fish Dicentrarchus labrax. Together with genomic data available for the above fish models, the new sequence was used to identify unique orthologous genomic regions shared across all target taxa. Different strategies were applied to produce robust multiple gene and genomic alignments spanning from 11,802 to 186,474 amino acid/nucleotide positions. Ten data sets were analyzed according to Bayesian inference, maximum likelihood, maximum parsimony, and neighbor joining methods. Extensive analyses were performed to explore the influence of several factors (e.g., alignment methodology, substitution model, data set partitions, and long-branch attraction) on the tree topology. Although a general consensus was observed for a closer relationship between G. aculeatus (Gasterosteidae) and Di. labrax (Moronidae) with the atherinomorph O. latipes (Beloniformes) sister taxon of this clade, with the tetraodontiform group Ta. rubripes and Te. nigroviridis (Tetraodontiformes) representing a more distantly related taxon among acanthomorph model fish species, conflicting results were obtained between data sets and methods, especially with respect to the choice of alignment methodology applied to noncoding parts of the genomic region under study. This may limit the use of intergenic/noncoding sequences in phylogenomics until more robust alignment algorithms are developed.
Gene expression patterns are correlated with genomic and genic structure in soybean

USDA-ARS's Scientific Manuscript database

Studies have indicated that exon and intron size, and intergenic distance are correlated with gene expression levels and expression breadth. Previous studies on these correlations in plants and animals have been conflicting. In this study next-generation sequence data of the soybean transcriptome wa...
Prevalence, Molecular Characterization, and Antibiotic Susceptibility of Vibrio parahaemolyticus from Ready-to-Eat Foods in China

PubMed Central

Xie, Tengfei; Xu, Xiaoke; Wu, Qingping; Zhang, Jumei; Cheng, Jianheng

2016-01-01

Vibrio parahaemolyticus is the leading cause of foodborne outbreaks, particularly outbreaks associated with consumption of fish and shellfish, and represents a major threat to human health worldwide. This bacterium harbors two main virulence factors: the thermostable direct hemolysin (TDH) and TDH-related hemolysin (TRH). Additionally, various serotypes have been identified. The extensive use of antibiotics is a contributing factor to the increasing incidence of antimicrobial-resistant V. parahaemolyticus. In the current study, we aimed to determine the incidence and features of V. parahaemolyticus in ready-to-eat (RTE) foods in China. We found 39 V. parahaemolyticus strains in Chinese RTE foods through investigation of 511 RTE food samples from 24 cities in China. All isolates were analyzed for the presence of the tdh and trh genes by PCR, serotyping was performed using multiplex PCR, antibiotic susceptibility analysis was carried out using the disk diffusion method, and molecular typing was performed using enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR) typing and multilocus sequence typing (MLST). The results showed that none of the isolates were positive for tdh or trh. The most common serotype was O2 (33.3% of isolates). Antimicrobial susceptibility results indicated that most strains were resistant to streptomycin (89.7%), cefazolin (51.3%), and ampicillin (51.3%). The isolates were grouped into five clusters by ERIC-PCR and four clusters by MLST. We updated 10 novel loci and 33 sequence types (STs) in the MLST database. Thus, our findings demonstrated the presence of V. parahaemolyticus in Chinese RTE foods, provided insights into the dissemination of antibiotic-resistant strains, and improved our knowledge of methods of microbiological risk assessment in RTE foods. PMID:27148231
Species identification and molecular typing of human Brucella isolates from Kuwait.

PubMed

Mustafa, Abu S; Habibi, Nazima; Osman, Amr; Shaheed, Faraz; Khan, Mohd W

2017-01-01

Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degrees of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at the intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16] were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90-99% sequence identity with B. melitensis and real-time PCR with genus- and species-specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) was found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region.
Species identification and molecular typing of human Brucella isolates from Kuwait

PubMed Central

Osman, Amr; Shaheed, Faraz; Khan, Mohd W.

2017-01-01

Brucellosis is a zoonotic disease of major concern in Kuwait and the Middle East. Human brucellosis can be caused by several Brucella species with varying degrees of pathogenesis, and relapses are common after apparently successful therapy. The classical biochemical methods for identification of Brucella are time-consuming, cumbersome, and provide information limited to the species level only. In contrast, molecular methods are rapid and provide differentiation at the intra-species level. In this study, four molecular methods [16S rRNA gene sequencing, real-time PCR, enterobacterial repetitive intergenic consensus (ERIC)-PCR and multilocus variable-number tandem-repeat analysis (MLVA)-8, MLVA-11 and MLVA-16] were evaluated for the identification and typing of 75 strains of Brucella isolated in Kuwait. 16S rRNA gene sequencing of all isolates showed 90–99% sequence identity with B. melitensis and real-time PCR with genus- and species-specific primers identified all isolates as B. melitensis. The results of ERIC-PCR suggested the existence of 75 ERIC genotypes of B. melitensis with a discriminatory index of 0.997. Cluster classification of these genotypes divided them into two clusters, A and B, diverging at ~25%. The maximum number of genotypes (n = 51) was found in cluster B5. MLVA-8 analysis identified all isolates as B. melitensis, and MLVA-8, MLVA-11 and MLVA-16 typing divided the isolates into 10, 32 and 71 MLVA types, respectively. Furthermore, the combined minimum spanning tree analysis demonstrated that, compared to MLVA types discovered all over the world, the Kuwaiti isolates were a distinct group of MLVA-11 and MLVA-16 types in the East Mediterranean Region. PMID:28800594
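
The record above reports a discriminatory index for ERIC-PCR but does not say how it was computed. A common choice for typing methods is Simpson's index of diversity in the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where N is the number of isolates and n_j the number of isolates of type j. The Python sketch below assumes that formula; the function name and the example type labels are hypothetical and are not taken from the study.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston form of Simpson's index of diversity.

    type_labels: one typing result (any hashable label) per isolate.
    Assumption: this is the index meant by the abstract; it does not say so.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    # Count ordered pairs of isolates that share the same type.
    same_type_pairs = sum(c * (c - 1) for c in Counter(type_labels).values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical typing result for six isolates (not data from the study).
example_types = ["A", "A", "B", "C", "C", "C"]
print(round(discriminatory_index(example_types), 3))
```

A value of 1.0 means every isolate has its own type, and a value near 0 means nearly all isolates share one type.
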
Comparison of Two Capillary Gel Electrophoresis Systems for Clostridium difficile Ribotyping, Using a Panel of Ribotype 027 Isolates and Whole-Genome Sequences as a Reference Standard

PubMed Central

Xiao, Meng; Kong, Fanrong; Jin, Ping; Wang, Qinning; Xiao, Kelin; Jeoffreys, Neisha; James, Gregory

2012-01-01

PCR ribotyping is the most commonly used Clostridium difficile genotyping method, but its utility is limited by lack of standardization. In this study, we analyzed four published whole genomes and tested an international collection of 21 well-characterized C. difficile ribotype 027 isolates as the basis for comparison of two capillary gel electrophoresis (CGE)-based ribotyping methods. There were unexpected differences between the 16S-23S rRNA intergenic spacer region (ISR) allelic profiles of the four ribotype 027 genomes, but six bands were identified in all four and a seventh in three genomes. All seven bands and another, not identified in any of the whole genomes, were found in all 21 isolates. We compared sequencer-based CGE (SCGE) with three different primer pairs to the Qiagen QIAxcel CGE (QCGE) platform. Deviations from individual reference/consensus band sizes were smaller for SCGE (0 to 0.2 bp) than for QCGE (4.2 to 9.5 bp). Compared with QCGE, SCGE more readily distinguished bands of similar length (more discriminatory), detected bands of larger size and lower intensity (more sensitive), and assigned band sizes more accurately and reproducibly, making it more suitable for standardization. Specifically, QCGE failed to identify the largest ISR amplicon. Based on several criteria, we recommend the primer set 16S-USA/23S-USA for use in a proposed standard SCGE method. Similar differences between SCGE and QCGE were found on testing of 14 isolates of four other C. difficile ribotypes. Based on our results, ISR profiles based on accurate sequencer-based band lengths would be preferable to agarose gel-based banding patterns for the assignment of ribotypes. PMID:22692737
Origin, evolution, and biogeography of Juglans: a phylogenetic perspective

USDA-ARS's Scientific Manuscript database

Phylogenetic analyses of extant Juglans (Juglandaceae) using five cpDNA intergenic spacer (IGS) sequences (trnT-trnF, psbA-trnH, atpB-rbcL, trnV-16S rRNA, and trnS-trnfM) were performed to elucidate the origin, diversification, historical biogeography, and evolutionary relationships within the genus...
The cyanobiont in an Azolla fern is neither Anabaena nor Nostoc.

PubMed

Baker, Judith A; Entsch, Barrie; McKay, David B

2003-12-05

The cyanobacterial symbionts in the fern Azolla have generally been ascribed to either the Anabaena or Nostoc genera. By using comparisons of the sequences of the phycocyanin intergenic spacer and a fragment of the 16S rRNA, we found that the cyanobiont from an Azolla belongs to neither of these genera.
Identification of risk factors and causes of persistence of Salmonella Gallinarum in laying hens farms from Colombia

USDA-ARS's Scientific Manuscript database

The presence of Salmonella Gallinarum (SG) was recently identified in brown egg layers in Colombia. During 2013, twenty isolates were analyzed using Intergenic Sequence Ribotyping. Eighteen (90%) were SG and two (10%) were Salmonella Enteritidis (SE). SG was isolated mainly from organs of sick birds...
Phylogeographic patterns of Armillaria ostoyae in the western United States

Treesearch

J. W. Hanna; N. B. Klopfenstein; M. -S. Kim; G. I. McDonald; J. A. Moore

2007-01-01

Nuclear ribosomal DNA regions (i.e. large subunit, internal transcribed spacer, 5.8S and intergenic spacer) were sequenced using a direct-polymerase chain reaction method from Armillaria ostoyae genets collected from the western USA. Many of the A. ostoyae genets contained heterogeneity among rDNA repeats, indicating intragenomic variation and likely intraspecific...
Mechanisms generating long range correlation in nucleotide composition of the Borrelia Burgdorferi genome

NASA Astrophysics Data System (ADS)

Mackiewicz, P.; Gierlik, A.; Kowalczuk, M.; Szczepanik, D.; Dudek, M. R.; Cebrat, S.

1999-12-01

We have analysed protein coding and intergenic sequences in the Borrelia burgdorferi (the Lyme disease bacterium) genome using different kinds of DNA walks. Genes occupying the leading strand of DNA have significantly different nucleotide composition from genes occupying the lagging strand. Nucleotide compositional bias of the two DNA strands reflects the aminoacid composition of proteins. 96% of genes coding for ribosomal proteins lie on the leading DNA strand, which suggests that the positions of these as well as other genes are non-random. In the B. burgdorferi genome, the asymmetry in intergenic DNA sequences is lower than the asymmetry in the third positions in codons. All these characters of the B. burgdorferi genome suggest that both replication-associated mutational pressure and recombination mechanisms have established the specific structure of the genome and now any recombination leading to inversion of a gene in respect to the direction of replication is forbidden. This property of the genome allows us to assume that it is in a steady state, which enables us to fix some parameters for simulations of DNA evolution.
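
The record above refers to "different kinds of DNA walks" without giving the step rules. As a minimal illustration of the general idea, the Python sketch below builds a one-dimensional walk that moves up one step for each G and down one step for each C, so a sustained slope indicates a strand bias in G over C. The step rule, the function name, and the example sequence are assumptions for illustration and are not the authors' definitions.

```python
def dna_walk(sequence, up="G", down="C"):
    """Return the cumulative walk position after each base.

    Assumed step rule: +1 for `up`, -1 for `down`, 0 for any other base.
    """
    position = 0
    path = []
    for base in sequence.upper():
        if base == up:
            position += 1
        elif base == down:
            position -= 1
        path.append(position)
    return path


# Hypothetical short sequence, used only to show the mechanics.
print(dna_walk("GGCATG"))  # prints [1, 2, 1, 1, 1, 2]
```

Other walks follow the same pattern with a different pair of bases, for example A against T, or with the rule applied only to third codon positions.
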
Phylogenetic relationships within Chamaecrista sect. Xerocalyx (Leguminosae, Caesalpinioideae) inferred from the cpDNA trnE-trnT intergenic spacer and nrDNA ITS sequences

PubMed Central

Torres, Davi Coe; Lima, João Paulo Matos Santos; Fernandes, Afrânio Gomes; Nunes, Edson Paula; Grangeiro, Thalles Barbosa

2011-01-01

Chamaecrista belongs to subtribe Cassiinae (Caesalpinioideae), and it comprises over 330 species, divided into six sections. The section Xerocalyx has been subjected to a profound taxonomic shuffling over the years. Therefore, we conducted a phylogenetic analysis using a cpDNA trnE-trnT intergenic spacer and nrDNA ITS/5.8S sequences from Cassiinae taxa, in an attempt to elucidate the relationships within this section from Chamaecrista. The tree topology was congruent between the two data sets studied in which the monophyly of the genus Chamaecrista was strongly supported. Our analyses reinforce that new sectional boundaries must be defined in the Chamaecrista genus, especially the inclusion of sections Caliciopsis and Xerocalyx in sect. Chamaecrista, considered here paraphyletic. The section Xerocalyx was strongly supported as monophyletic; however, the current data did not show C. ramosa (microphyllous) and C. desvauxii (macrophyllous) and their respective varieties in distinct clades, suggesting that speciation events are still ongoing in these specimens. PMID:21734825
Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan; Pfreundt, Ulrike; Nelson, William C.; ...

2015-03-23

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance because of its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20–25% less than the average for other cyanobacteria and nonpathogenic, free-living bacteria. In this paper, we use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus, both in culture and in situ. Transcriptome analysis indicates that 86% of the noncoding space is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and twofold higher than that found in the gene-dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium noncoding RNA secondary structures were predicted between most culture and metagenomic sequences, lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Finally, our data suggest that transposition of selfish DNA, low effective population size, and high-fidelity replication allowed the unusual “inflation” of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.
Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan G.; Pfreundt, Ulrike; Nelson, William C.; ...

2015-04-07

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance due to its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20-25% less than the average for other cyanobacteria and non-pathogenic, free-living bacteria. We use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus both in culture and in situ. Transcriptome analysis indicates that 86% of the non-coding space is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and two fold higher than that found in the gene dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium ncRNA secondary structures were predicted between most culture and metagenomic sequences lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Our data suggest that transposition of selfish DNA, low effective population size, and high fidelity replication allowed the unusual ‘inflation’ of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.
Site directed recombination

DOEpatents

Jurka, Jerzy W.

1997-01-01

Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
Coexistence of SHV-4- and TEM-24-Producing Enterobacter aerogenes Strains before a Large Outbreak of TEM-24-Producing Strains in a French Hospital

PubMed Central

Mammeri, H.; Laurans, G.; Eveillard, M.; Castelain, S.; Eb, F.

2001-01-01

In 1996, a monitoring program was initiated at the teaching hospital of Amiens, France, and carried out for 3 years. All extended-spectrum β-lactamase (ESBL)-producing Enterobacter aerogenes isolates recovered from clinical specimens were collected for investigation of their epidemiological relatedness by pulsed-field gel electrophoresis and enterobacterial repetitive intergenic consensus PCR (ERIC-PCR) and determination of the type of ESBL harbored by isoelectric focusing and DNA sequencing. Molecular typing revealed the endemic coexistence, during the first 2 years, of two clones expressing, respectively, SHV-4 and TEM-24 ESBLs, while an outbreak of the TEM-24-producing strain raged in the hospital during the third year, causing the infection or colonization of 165 patients. Furthermore, this strain was identified as the prevalent clone responsible for outbreaks in many French hospitals since 1996. This study shows that TEM-24-producing E. aerogenes is an epidemic clone that is well established in the hospital's ecology and able to spread throughout wards. The management of the outbreak at the teaching hospital of Amiens, which included the reinforcement of infection control measures, failed to obtain complete eradication of the clone, which has become an endemic pathogen. PMID:11376055
Serological and molecular heterogeneity among Yersinia ruckeri strains isolated from farmed Atlantic salmon Salmo salar in Chile.

PubMed

Bastardo, A; Bohle, H; Ravelo, C; Toranzo, A E; Romalde, J L

2011-02-22

We investigated 11 strains of Yersinia ruckeri, the causative agent of enteric redmouth disease (ERM), that had been isolated from Atlantic salmon Salmo salar L. farmed in Chile and previously vaccinated against ERM. Phylogenetic analysis of the 16S rRNA gene sequences confirmed the identification of the salmon isolates as Y. ruckeri. A comparative analysis of the biochemical characteristics was made by means of traditional and commercial miniaturised methods. All studied isolates were motile and Tween 80 positive, and were identified as biotype 1. In addition, drug susceptibility tests determined high sensitivity to sulphamethoxazole/trimethoprim, oxytetracycline, ampicillin and enrofloxacin in all isolates. Serological assays showed the presence of O1a, O1b and O2b serotypes, with a predominance of the O1b serotype in 9 strains. Analysis of the lipopolysaccharide profiles and the corresponding immunoblot confirmed these results. Sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) of the outer membrane proteins revealed that all Chilean strains had profiles with a molecular weight range between 34 and 55 kDa, with 3 distinct groups based on differences in the major bands. Genotyping analyses by enterobacterial repetitive intergenic consensus (ERIC-) and repetitive extragenic palindromic (REP-)PCR techniques clearly indicated intraspecific genetic diversity among Chilean Y. ruckeri strains.
ISSR, ERIC and RAPD techniques to detect genetic diversity in the aphid pathogen Pandora neoaphidis.

PubMed

Tymon, Anna M; Pell, Judith K

2005-03-01

The entomopathogenic fungus Pandora neoaphidis is an important natural enemy of aphids. ISSR, ERIC (Enterobacterial Repetitive Intergenic Consensus) and RAPD PCR-based DNA fingerprint analyses were undertaken to study intra-specific variation amongst 30 isolates of P. neoaphidis from around the world, together with six closely related species of Entomophthorales. All methods yielded scorable binary characters, and distance matrices were constructed from both individual and combined data sets. Neighbour-joining was used to construct consensus phylogenetic trees, which showed that although P. neoaphidis isolates were highly polymorphic, they separated into a monophyletic group compared with the other Entomophthorales tested. Three distinct subclades were found, with UK isolates occupying two of these. No specific correlation with aphid host species was established for any of the isolates apart from those in one cluster, which contained isolates obtained from nettle aphid, Microlophium carnosum. ERIC, ISSR and RAPD analysis allowed the rapid genetic characterisation and differentiation of isolates with the generation of potential isolate- and cluster-specific diagnostic DNA markers.
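
The record above describes scoring fingerprint bands as binary characters and building distance matrices from them, but it does not name the distance measure. The Python sketch below assumes the Jaccard distance (one minus shared bands over bands present in either profile), which is one common option for presence/absence data. The function names and the band profiles are hypothetical.

```python
def jaccard_distance(profile_a, profile_b):
    """Jaccard distance between two 0/1 band profiles of equal length."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must score the same band positions")
    shared = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    # Two profiles with no bands at all are treated as identical.
    return 0.0 if either == 0 else 1.0 - shared / either


def distance_matrix(profiles):
    """Return (sorted isolate names, square matrix of pairwise distances)."""
    names = sorted(profiles)
    matrix = [[jaccard_distance(profiles[p], profiles[q]) for q in names]
              for p in names]
    return names, matrix


# Hypothetical band scores (1 = band present) for three isolates.
bands = {
    "iso1": [1, 1, 0, 1, 0],
    "iso2": [1, 0, 0, 1, 1],
    "iso3": [0, 0, 1, 0, 1],
}
names, matrix = distance_matrix(bands)
for name, row in zip(names, matrix):
    print(name, row)
```

A matrix of this kind is the usual input to a neighbour-joining implementation. Combining data sets, as the record mentions, amounts to concatenating the band profiles from each method before computing distances.
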
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons

PubMed Central

Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.

2017-01-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
To Clone or Not To Clone: Method Analysis for Retrieving Consensus Sequences In Ancient DNA Samples

PubMed Central

Winters, Misa; Barta, Jodi Lynn; Monroe, Cara; Kemp, Brian M.

2011-01-01

The challenges associated with the retrieval and authentication of ancient DNA (aDNA) evidence are principally due to post-mortem damage which makes ancient samples particularly prone to contamination from “modern” DNA sources. The necessity for authentication of results has led many aDNA researchers to adopt methods considered to be “gold standards” in the field, including cloning aDNA amplicons as opposed to directly sequencing them. However, no standardized protocol has emerged regarding the necessary number of clones to sequence, how a consensus sequence is most appropriately derived, or how results should be reported in the literature. In addition, there has been no systematic demonstration of the degree to which direct sequences are affected by damage or whether direct sequencing would provide disparate results from a consensus of clones. To address this issue, a comparative study was designed to examine both cloned and direct sequences amplified from ∼3,500 year-old ancient northern fur seal DNA extracts. Majority rules and the Consensus Confidence Program were used to generate consensus sequences for each individual from the cloned sequences, which exhibited damage at 31 of 139 base pairs across all clones. In no instance did the consensus of clones differ from the direct sequence. This study demonstrates that, when appropriate, cloning need not be the default method, but instead, should be used as a measure of authentication on a case-by-case basis, especially when this practice adds time and cost to studies where it may be superfluous. PMID:21738625
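
The record above mentions deriving a consensus from cloned sequences by "majority rules". The Python sketch below shows one way such a rule can work on pre-aligned clones of equal length: a base is called at a position only if more than half of the clones carry it, and the position is otherwise marked ambiguous. The strict-majority threshold, the ambiguity symbol, the function name, and the clone sequences are assumptions for illustration; the study's exact procedure and the Consensus Confidence Program are not reproduced here.

```python
from collections import Counter


def majority_consensus(clones, ambiguous="N"):
    """Strict-majority consensus of aligned, equal-length clone sequences."""
    if not clones:
        raise ValueError("need at least one clone")
    length = len(clones[0])
    if any(len(clone) != length for clone in clones):
        raise ValueError("clones must be aligned to the same length")
    consensus = []
    for column in zip(*clones):
        base, count = Counter(column).most_common(1)[0]
        # Call the base only when it is present in more than half the clones.
        consensus.append(base if 2 * count > len(clones) else ambiguous)
    return "".join(consensus)


# Hypothetical clones; two carry isolated damage-like substitutions.
clones = ["ACGT", "ACGT", "ATGT", "ACAT"]
print(majority_consensus(clones))  # prints ACGT
```

With this rule, a substitution that appears in only a minority of clones at a given position does not reach the consensus, which is the behaviour the comparison against direct sequencing relies on.
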

Analysis of 16S-23S intergenic spacer regions of the rRNA operons in Edwardsiella ictaluri and Edwardsiella tarda isolates from fish.

PubMed

Panangala, V S; van Santen, V L; Shoemaker, C A; Klesius, P H

2005-01-01

The aim was to analyse interspecies and intraspecies differences based on the 16S-23S rRNA intergenic spacer region (ISR) sequences of the fish pathogens Edwardsiella ictaluri and Edwardsiella tarda. The 16S-23S rRNA spacer regions of 19 Edw. ictaluri and four Edw. tarda isolates from four geographical regions were amplified by PCR with primers complementary to conserved sequences within the flanking 16S-23S rRNA coding sequences. Two products were generated from all isolates, without interspecies or intraspecies size polymorphisms. Sequence analysis of the amplified fragments revealed a smaller ISR of 350 bp, which contained a gene for tRNA(Glu), and a larger ISR of 441 bp, which contained genes for tRNA(Ile) and tRNA(Ala). The sequences of the smaller ISR of different Edw. ictaluri isolates were essentially identical to each other. Partial sequences of the larger ISR from several Edw. ictaluri isolates also revealed no differences from the one complete Edw. ictaluri large ISR sequence obtained. The sequences of the smaller ISR of Edw. tarda were 97% identical to the Edw. ictaluri smaller ISR and the larger ISR were 96-98% identical to the Edw. ictaluri larger ISR sequence. The Edw. tarda isolates displayed limited ISR sequence heterogeneity, with ≥97% sequence identity among isolates for both small and large ISR. There is a high degree of size and sequence similarity of 16S-23S ISR both among isolates within Edw. ictaluri and Edw. tarda species and between the two species. Our results confirm a close genetic relationship between Edw. ictaluri and Edw. tarda and the relative homogeneity of Edw. ictaluri isolates compared with Edw. tarda isolates. Because no differences were found in ISR sequences among Edw. ictaluri isolates, sequence analysis of the ISR will not be useful to distinguish isolates of Edw. ictaluri. However, we identified restriction sites that differ between ISR sequences of Edw. ictaluri and Edw. tarda, which will be useful in distinguishing the two species.
Spread of imipenem-resistant Acinetobacter baumannii co-expressing OXA-23 and GES-11 carbapenemases in Lebanon.

PubMed

Hammoudi, D; Moubareck, C Ayoub; Hakime, N; Houmani, M; Barakat, A; Najjar, Z; Suleiman, M; Fayad, N; Sarraf, R; Sarkis, D Karam

2015-07-01

The acquisition of carbapenemases by Acinetobacter baumannii is reported increasingly worldwide, but data from Lebanon are limited. The aims of this study were to evaluate the prevalence of imipenem-resistant A. baumannii in Lebanon, identify resistance determinants, and detect clonal relatedness. Imipenem-resistant A. baumannii were collected from nine Lebanese hospitals during 2012. Antimicrobial susceptibility, the cloxacillin effect, and ethylenediaminetetraacetic acid (EDTA) synergy were determined. Genes encoding carbapenemases and insertion sequence ISAba1 were screened via PCR sequencing. ISAba1 position relative to genes encoding Acinetobacter-derived cephalosporinases (ADCs) and OXA-23 was studied by PCR mapping. Clonal linkage was examined by enterobacterial repetitive intergenic consensus PCR (ERIC-PCR). Out of 724 A. baumannii isolated in 2012, 638 (88%) were imipenem-resistant. Of these, 142 were analyzed. Clavulanic acid-imipenem synergy suggested carbapenem-hydrolyzing extended-spectrum β-lactamase. A positive cloxacillin test indicated ADCs, while EDTA detection strips were negative. Genotyping indicated that 90% of isolates co-harbored blaOXA-23 and blaGES-11. The remaining strains had blaOXA-23, blaOXA-24, blaGES-11, or blaOXA-24 with blaGES-11. ISAba1 was located upstream of blaADC and blaOXA-23 in 97% and 100% of isolates, respectively. ERIC-PCR fingerprinting revealed 18 pulsotypes spread via horizontal gene transfer and clonal dissemination. This survey established baseline evidence of OXA-23 and GES-11-producing A. baumannii in Lebanon, indicating the need for further surveillance. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
Assignment of serotype to Salmonella enterica isolates obtained from poultry and their environment in Southern Brazil.

USDA-ARS's Scientific Manuscript database

To assess diversity of Salmonella enterica serotypes present in poultry and their environment from Southern Brazil, the Kauffman-White-LeMinor (KWL) scheme was used to serotype a total of 155 isolates. Isolates were then re-examined with nested PCR and sequencing of the dkgB-linked Intergenic Sequ...
Benthic bacterial diversity in submerged sinkhole ecosystems.

PubMed

Nold, Stephen C; Pangborn, Joseph B; Zajack, Heidi A; Kendall, Scott T; Rediske, Richard R; Biddanda, Bopaiah A

2010-01-01

Physicochemical characterization, automated ribosomal intergenic spacer analysis (ARISA) community profiling, and 16S rRNA gene sequencing approaches were used to study bacterial communities inhabiting submerged Lake Huron sinkholes inundated with hypoxic, sulfate-rich groundwater. Photosynthetic cyanobacterial mats on the sediment surface were dominated by Phormidium autumnale, while deeper, organically rich sediments contained diverse and active bacterial communities.
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

PubMed

Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

2016-01-01

Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis

PubMed Central

Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

2016-01-01

Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros ‘Jinzaoshi’ were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. ‘Jinzaoshi’, support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423
Molecular evidence confirms the taxonomic separation of Lutzomyia tihuiliensis from Lutzomyia pia (Diptera: Psychodidae) and the usefulness of pleural pigmentation patterns in species identification.

PubMed

Pérez-Doria, Alveiro; Bejarano, Eduar Elías; Sierra, Diana; Vélez, Iván Darío

2008-07-01

The phlebotomine sand flies Lutzomyia pia (Fairchild & Hertig 1961) and Lutzomyia tihuiliensis Le Pont, Torrez-Espejo & Dujardin 1997 (Diptera: Psychodidae) belong to the pia series of the Lu. verrucarum species group, which includes several species that bite humans in Andean foci of leishmaniasis. The females of these two species exhibit isometry and isomorphism in anatomical structures of the head and terminalia commonly used in taxonomic identification of sand flies. They can only be differentiated based on subtle differences in the pigmentation of the pleura. In Lu. tihuiliensis, this is restricted to the basal portions of the katepimeron and katepisternum, whereas in Lu. pia both structures are totally pigmented. Taking into account the subtle morphological differences between these species, the objective of the current study was to evaluate the specific taxonomic status of Lu. tihuiliensis with respect to Lu. pia. A 475-bp portion of the mitochondrial genome was sequenced, composed of the 3' end of the cytochrome b gene, intergenic spacer 1, the transfer RNA gene for serine, intergenic spacer 2, and the 3' end of the gene NAD dehydrogenase 1. Genetic analysis confirms that Lu. tihuiliensis and Lu. pia constitute two distinct species and this is supported by four strong lines of evidence, i.e., the paired genetic distances, size differences and amino acid composition of the cytochrome b protein, presence and absence of intergenic spacer one and divergence observed in the sequence of the transfer RNA gene for serine. It also confirms the validity of the pleural pigmentation pattern as a species diagnostic character and the importance of performing a detailed examination of this character during morphological determination of phlebotomine sand flies in the series pia.
Identification of Small RNAs in Desulfovibrio vulgaris Hildenborough

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burns, Andrew; Joachimiak, Marcin; Deutschbauer, Adam

2010-05-17

Desulfovibrio vulgaris is an anaerobic sulfate-reducing bacterium capable of facilitating the removal of toxic metals such as uranium from contaminated sites via reduction. As such, it is essential to understand the intricate regulatory cascades involved in how D. vulgaris and its relatives respond to stressors in such sites. One approach is the identification and analysis of small non-coding RNAs (sRNAs): molecules ranging in size from 20-200 nucleotides that predominantly affect gene regulation by binding to complementary mRNA in an anti-sense fashion and therefore provide an immediate regulatory response. To identify sRNAs in D. vulgaris, a bacterium that does not possess an annotated hfq gene, RNA was pooled from stationary and exponential phases, nitrate exposure, and biofilm conditions. The resulting RNA was size fractionated, modified, and converted to cDNA for high throughput transcriptomic deep sequencing. A computational approach to identify sRNAs via the alignment of seven separate Desulfovibrio genomes was also performed. From the deep sequencing analysis, 2,296 reads between 20 and 250 nt were identified with expression above genome background. Analysis of those reads limited the number of candidates to ~87 intergenic, while ~140 appeared to be antisense to annotated open reading frames (ORFs). Further BLAST analysis of the intergenic candidates and other Desulfovibrio genomes indicated that eight candidates were likely portions of ORFs not previously annotated in the D. vulgaris genome. Comparison of the intergenic and antisense data sets to the bioinformatically predicted candidates resulted in ~54 common candidates. Northern analysis and qRT-PCR are being used to verify expression of the candidates and to further develop the role these sRNAs play in D. vulgaris regulation.
Comparison of Ultra-Conserved Elements in Drosophilids and Vertebrates

PubMed Central

Makunin, Igor V.; Shloma, Viktor V.; Stephen, Stuart J.; Pheasant, Michael; Belyakin, Stepan N.

2013-01-01

Metazoan genomes contain many ultra-conserved elements (UCEs), long sequences identical between distant species. In this study we identified UCEs in drosophilid and vertebrate species with a similar level of phylogenetic divergence measured at protein-coding regions, and demonstrated that both the length and number of UCEs are larger in vertebrates. The proportion of non-exonic UCEs declines in distant drosophilids whilst an opposite trend was observed in vertebrates. We generated a set of 2,126 Sophophora UCEs by merging elements identified in several drosophila species and compared these to the eutherian UCEs identified in placental mammals. In contrast to vertebrates, the Sophophora UCEs are depleted around transcription start sites. Analysis of 52,954 P-element, piggyBac and Minos insertions in the D. melanogaster genome revealed depletion of the P-element and piggyBac insertions in and around the Sophophora UCEs. We examined eleven fly strains with transposon insertions into the intergenic UCEs and identified associated phenotypes in five strains. Four insertions behave as recessive lethals, and in one case we observed a suppression of the marker gene within the transgene, presumably by silenced chromatin around the integration site. To confirm the lethality is caused by integration of transposons we performed a phenotype rescue experiment for two stocks and demonstrated that the excision of the transposons from the intergenic UCEs restores viability. Sequencing of DNA after the transposon excision in one fly strain with the restored viability revealed a 47 bp insertion at the original transposon integration site suggesting that the nature of the mutation is important for the appearance of the phenotype. Our results suggest that the UCEs in flies and vertebrates have both common and distinct features, and demonstrate that a significant proportion of intergenic drosophila UCEs are sensitive to disruption. PMID:24349264
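As an illustration of the idea of "long sequences identical between distant species", the following minimal Python sketch scans a set of pre-aligned sequences for ungapped runs in which every species carries the same base. The function name, the toy alignment, and the length cutoff are hypothetical and chosen for illustration only; the abstract does not describe the pipeline the authors actually used.

```python
def identical_runs(aligned, min_len):
    """Return (start, end) half-open intervals where all aligned sequences
    are identical and ungapped for at least min_len columns.

    Assumption: `aligned` holds equal-length, already-aligned strings with
    '-' marking gaps. This is an illustrative scan, not a published method.
    """
    if not aligned or len({len(s) for s in aligned}) != 1:
        raise ValueError("need at least one sequence, all of equal length")
    length = len(aligned[0])
    runs = []
    start = None  # start column of the current identical run, if any
    for i, column in enumerate(zip(*aligned)):
        same = len(set(column)) == 1 and column[0] != "-"
        if same and start is None:
            start = i
        elif not same and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None
    # Close a run that extends to the end of the alignment.
    if start is not None and length - start >= min_len:
        runs.append((start, length))
    return runs


# Toy alignment of two sequences that differ only at column 4.
toy = ["ACGTACGTAA", "ACGTTCGTAA"]
print(identical_runs(toy, min_len=5))
```

In practice the length cutoff would be far larger than in this toy case, and the input would come from a whole-genome multiple alignment rather than short strings.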
A safe and easy method for building consensus HIV sequences from 454 massively parallel sequencing data.

PubMed

Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico

2018-02-01

To show how to generate a consensus sequence from massively parallel sequencing data obtained in routine HIV antiretroviral resistance studies that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap value of 88% (IQR 83.5-95.5). Association increased to 36/62 sequences, with a median bootstrap value of 94% (IQR 85.5-98), using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR 98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at a 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
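The role of the threshold can be made concrete with a small sketch. The Python below builds a consensus from aligned reads by keeping, at each position, every base whose frequency reaches the threshold and writing mixtures as IUPAC ambiguity codes. This is only one plausible reading of a "threshold" consensus: the function name and toy reads are hypothetical, and it is not a reproduction of the Mesquite procedure used in the study.

```python
from collections import Counter

# Standard IUPAC nucleotide ambiguity codes, keyed by the set of bases.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S",
    frozenset("AT"): "W", frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def threshold_consensus(reads, threshold=0.20):
    """Consensus of equal-length aligned reads.

    Assumption: a base is kept at a position when its frequency among the
    A/C/G/T calls at that position is >= threshold; several kept bases are
    written as one IUPAC ambiguity code.
    """
    if not reads or len({len(r) for r in reads}) != 1:
        raise ValueError("need at least one read, all of equal length")
    consensus = []
    for column in zip(*reads):
        counts = Counter(b for b in column.__iter__() if b in "ACGT")
        total = sum(counts.values())
        if total == 0:
            consensus.append("N")  # only gaps or unknown calls here
            continue
        kept = frozenset(b for b, n in counts.items() if n / total >= threshold)
        if not kept:
            # Possible only for thresholds above 0.25: keep the top base.
            kept = frozenset(counts.most_common(1)[0][0])
        consensus.append(IUPAC[kept])
    return "".join(consensus)


# Toy data: a minority variant (T at position 2) present in 10% of reads.
reads = ["ACGT"] * 9 + ["ATGT"]
print(threshold_consensus(reads, threshold=0.20))
print(threshold_consensus(reads, threshold=0.05))
```

With the toy reads, the 10% minority variant is dropped at a 20% threshold and retained as an ambiguity code at a 5% threshold, which is the kind of difference that could change how closely an NGS consensus resembles a Sanger sequence.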
Occurrence and Phenotypic Characterization of Yersinia ruckeri Strains with Biofilm-Forming Capacity in a Rainbow Trout Farm

PubMed Central

Coquet, L.; Cosette, P.; Quillet, L.; Petit, F.; Junter, G.-A.; Jouenne, T.

2002-01-01

The presence of Yersinia ruckeri in a French fish farm was investigated. Y. ruckeri was isolated mainly from algae and sediment samples rather than from water. Twenty-two Y. ruckeri isolates were obtained, and three strains were distinguished by enterobacterial repetitive intergenic consensus PCR amplification. These strains were able to adhere to solid supports. This characteristic was correlated with flagellum-mediated motility. Killing experiments showed that sessile cells were more resistant to oxolinic acid than their planktonic counterparts. Our results demonstrate that surface colonization of fish farm tanks by Y. ruckeri biofilms is a potential source of recurrent infection for extended periods of time. PMID:11823180
Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

DTIC Science & Technology

2015-12-01

proportion greater than 0.25; (iv) read depth greater than 8 in at least one sample. The table below shows variant data from Family 1041 categorized by...patients from a severely affected breast cancer Family 1041. Columns: All; Shared; Rare; Excluding IBD0. Intergenic: 3,345,727; 1,650,045; 35,927; 3,990. ncRNA
Benthic Bacterial Diversity in Submerged Sinkhole Ecosystems

PubMed Central

Nold, Stephen C.; Pangborn, Joseph B.; Zajack, Heidi A.; Kendall, Scott T.; Rediske, Richard R.; Biddanda, Bopaiah A.

2010-01-01

Physicochemical characterization, automated ribosomal intergenic spacer analysis (ARISA) community profiling, and 16S rRNA gene sequencing approaches were used to study bacterial communities inhabiting submerged Lake Huron sinkholes inundated with hypoxic, sulfate-rich groundwater. Photosynthetic cyanobacterial mats on the sediment surface were dominated by Phormidium autumnale, while deeper, organically rich sediments contained diverse and active bacterial communities. PMID:19880643
A predictive biophysical model of translational coupling to coordinate and control protein expression in bacterial operons

PubMed Central

Tian, Tian; Salis, Howard M.

2015-01-01

Natural and engineered genetic systems require the coordinated expression of proteins. In bacteria, translational coupling provides a genetically encoded mechanism to control expression level ratios within multi-cistronic operons. We have developed a sequence-to-function biophysical model of translational coupling to predict expression level ratios in natural operons and to design synthetic operons with desired expression level ratios. To quantitatively measure ribosome re-initiation rates, we designed and characterized 22 bi-cistronic operon variants with systematically modified intergenic distances and upstream translation rates. We then derived a thermodynamic free energy model to calculate de novo initiation rates as a result of ribosome-assisted unfolding of intergenic RNA structures. The complete biophysical model has only five free parameters, but was able to accurately predict downstream translation rates for 120 synthetic bi-cistronic and tri-cistronic operons with rationally designed intergenic regions and systematically increased upstream translation rates. The biophysical model also accurately predicted the translation rates of the nine protein atp operon, compared to ribosome profiling measurements. Altogether, the biophysical model quantitatively predicts how translational coupling controls protein expression levels in synthetic and natural bacterial operons, providing a deeper understanding of an important post-transcriptional regulatory mechanism and offering the ability to rationally engineer operons with desired behaviors. PMID:26117546
A draft sequence of the rice genome (Oryza sativa L. ssp. indica).

PubMed

Yu, Jun; Hu, Songnian; Wang, Jun; Wong, Gane Ka-Shu; Li, Songgang; Liu, Bin; Deng, Yajun; Dai, Li; Zhou, Yan; Zhang, Xiuqing; Cao, Mengliang; Liu, Jing; Sun, Jiandong; Tang, Jiabin; Chen, Yanjiong; Huang, Xiaobing; Lin, Wei; Ye, Chen; Tong, Wei; Cong, Lijuan; Geng, Jianing; Han, Yujun; Li, Lin; Li, Wei; Hu, Guangqiang; Huang, Xiangang; Li, Wenjie; Li, Jian; Liu, Zhanwei; Li, Long; Liu, Jianping; Qi, Qiuhui; Liu, Jinsong; Li, Li; Li, Tao; Wang, Xuegang; Lu, Hong; Wu, Tingting; Zhu, Miao; Ni, Peixiang; Han, Hua; Dong, Wei; Ren, Xiaoyu; Feng, Xiaoli; Cui, Peng; Li, Xianran; Wang, Hao; Xu, Xin; Zhai, Wenxue; Xu, Zhao; Zhang, Jinsong; He, Sijie; Zhang, Jianguo; Xu, Jichen; Zhang, Kunlin; Zheng, Xianwu; Dong, Jianhai; Zeng, Wanyong; Tao, Lin; Ye, Jia; Tan, Jun; Ren, Xide; Chen, Xuewei; He, Jun; Liu, Daofeng; Tian, Wei; Tian, Chaoguang; Xia, Hongai; Bao, Qiyu; Li, Gang; Gao, Hui; Cao, Ting; Wang, Juan; Zhao, Wenming; Li, Ping; Chen, Wei; Wang, Xudong; Zhang, Yong; Hu, Jianfei; Wang, Jing; Liu, Song; Yang, Jian; Zhang, Guangyu; Xiong, Yuqing; Li, Zhijie; Mao, Long; Zhou, Chengshu; Zhu, Zhen; Chen, Runsheng; Hao, Bailin; Zheng, Weimou; Chen, Shouyi; Guo, Wei; Li, Guojie; Liu, Siqi; Tao, Ming; Wang, Jian; Zhu, Lihuang; Yuan, Longping; Yang, Huanming

2002-04-05

We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.
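A statistic such as "42.2% of the genome was in exact 20-nucleotide oligomer repeats" can be approximated by counting k-mers. The Python sketch below reports the fraction of positions covered by at least one k-mer that occurs more than once on the given strand. The function name and the exact definition are assumptions made for illustration (for instance, reverse-complement matches are ignored), and they may differ from the definition used by the authors.

```python
from collections import Counter


def repeat_fraction(seq, k=20):
    """Fraction of positions in seq covered by a k-mer seen more than once.

    Assumption: only exact forward-strand matches count as repeats.
    """
    n = len(seq)
    if k <= 0:
        raise ValueError("k must be positive")
    if n < k:
        return 0.0
    # Count every k-mer in the sequence.
    counts = Counter(seq[i:i + k] for i in range(n - k + 1))
    covered = [False] * n
    for i in range(n - k + 1):
        if counts[seq[i:i + k]] > 1:
            # Mark all positions spanned by this repeated k-mer.
            for j in range(i, i + k):
                covered[j] = True
    return sum(covered) / n


# Toy example with k=4: "TTTT" occurs twice, covering positions 3 to 7.
print(repeat_fraction("ACGTTTTT", k=4))
```

For a genome-scale input, the same counting step would normally be done with a dedicated k-mer counter rather than an in-memory Counter; the sketch is only meant to show what the percentage measures.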
Analysis of 16S-23S rRNA intergenic spacer regions of Vibrio cholerae and Vibrio mimicus.

PubMed

Chun, J; Huq, A; Colwell, R R

1999-05-01

Vibrio cholerae identification based on molecular sequence data has been hampered by a lack of sequence variation from the closely related Vibrio mimicus. The two species share many genes coding for proteins, such as ctxAB, and show almost identical 16S DNA coding for rRNA (rDNA) sequences. Primers targeting conserved sequences flanking the 3' end of the 16S and the 5' end of the 23S rDNAs were used to amplify the 16S-23S rRNA intergenic spacer regions of V. cholerae and V. mimicus. Two major (ca. 580 and 500 bp) and one minor (ca. 750 bp) amplicons were consistently generated for both species, and their sequences were determined. The largest fragment contains three tRNA genes (tDNAs) coding for tRNA(Glu), tRNA(Lys), and tRNA(Val), a combination that has not previously been found in bacteria examined to date. The 580-bp amplicon contained tDNA(Ile) and tDNA(Ala), whereas the 500-bp fragment had a single tDNA coding for either tRNA(Glu) or tRNA(Ala). Little variation, i.e., 0 to 0.4%, was found among V. cholerae O1 classical, O1 El Tor, and O139 epidemic strains. Slightly more variation was found against the non-O1/non-O139 serotypes (ca. 1% difference) and V. mimicus (2 to 3% difference). A pair of oligonucleotide primers was designed, based on the region differentiating all V. cholerae strains from V. mimicus. The PCR system developed was subsequently evaluated by using representatives of V. cholerae from environmental and clinical sources, and of other taxa, including V. mimicus. This study provides the first molecular tool for identifying the species V. cholerae.
Population Genomics of Paramecium Species.

PubMed

Johri, Parul; Krenek, Sascha; Marinov, Georgi K; Doak, Thomas G; Berendonk, Thomas U; Lynch, Michael

2017-05-01

Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

PubMed

Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

2017-04-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Ginkgo and Welwitschia Mitogenomes Reveal Extreme Contrasts in Gymnosperm Mitochondrial Evolution.

PubMed

Guo, Wenhu; Grewe, Felix; Fan, Weishu; Young, Gregory J; Knoop, Volker; Palmer, Jeffrey D; Mower, Jeffrey P

2016-06-01

Mitochondrial genomes (mitogenomes) of flowering plants are well known for their extreme diversity in size, structure, gene content, and rates of sequence evolution and recombination. In contrast, little is known about mitogenomic diversity and evolution within gymnosperms. Only a single complete genome sequence is available, from the cycad Cycas taitungensis, while limited information is available for the one draft sequence, from Norway spruce (Picea abies). To examine mitogenomic evolution in gymnosperms, we generated complete genome sequences for the ginkgo tree (Ginkgo biloba) and a gnetophyte (Welwitschia mirabilis). There is great disparity in size, sequence conservation, levels of shared DNA, and functional content among gymnosperm mitogenomes. The Cycas and Ginkgo mitogenomes are relatively small, have low substitution rates, and possess numerous genes, introns, and edit sites; we infer that these properties were present in the ancestral seed plant. By contrast, the Welwitschia mitogenome has an expanded size coupled with accelerated substitution rates and extensive loss of these functional features. The Picea genome has expanded further, to more than 4 Mb. With regard to structural evolution, the Cycas and Ginkgo mitogenomes share a remarkable amount of intergenic DNA, which may be related to the limited recombinational activity detected at repeats in Ginkgo. Conversely, the Welwitschia mitogenome shares almost no intergenic DNA with any other seed plant. By conducting the first measurements of rates of DNA turnover in seed plant mitogenomes, we discovered that turnover rates vary by orders of magnitude among species. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
A one-step reaction for the rapid identification of Lactobacillus mindensis, Lactobacillus panis, Lactobacillus paralimentarius, Lactobacillus pontis and Lactobacillus frumenti using oligonucleotide primers designed from the 16S-23S rRNA intergenic sequences.

PubMed

Ferchichi, M; Valcheva, R; Prévost, H; Onno, B; Dousset, X

2008-06-01

Species-specific primers targeting the 16S-23S ribosomal DNA (rDNA) intergenic spacer region (ISR) were designed to rapidly discriminate between Lactobacillus mindensis, Lactobacillus panis, Lactobacillus paralimentarius, Lactobacillus pontis and Lactobacillus frumenti species recently isolated from French sourdough. The 16S-23S ISRs were amplified using primers 16S/p2 and 23S/p7, which anneal to positions 1388-1406 of the 16S rRNA gene and to positions 207-189 of the 23S rRNA gene respectively, Escherichia coli numbering (GenBank accession number V00331). Clone libraries of the resulting amplicons were constructed using a pCR2.1 TA cloning kit and sequenced. Species-specific primers were designed based on the sequences obtained and were used to amplify the 16S-23S ISR in the Lactobacillus species considered. For all of them, two PCR amplicons, designated as small ISR (S-ISR) and large ISR (L-ISR), were obtained. The L-ISR is composed of the corresponding S-ISR, interrupted by a sequence containing tRNA(Ile) and tRNA(Ala) genes. Based on these sequences, species-specific primers were designed and proved to identify accurately the species considered among 30 reference Lactobacillus species tested. Designed species-specific primers enable a rapid and accurate identification of L. mindensis, L. paralimentarius, L. panis, L. pontis and L. frumenti species among other lactobacilli. The proposed method provides a powerful and convenient means of rapidly identifying some sourdough lactobacilli, which could be of help in large starter culture surveys.

DNA barcode and identification of the varieties and provenances of Taiwan's domestic and imported made teas using ribosomal internal transcribed spacer 2 sequences.

PubMed

Lee, Shih-Chieh; Wang, Chia-Hsiang; Yen, Cheng-En; Chang, Chieh

2017-04-01

The major aim of made tea identification is to identify the variety and provenance of the tea plant. The present experiment used 113 tea plants [Camellia sinensis (L.) O. Kuntze] housed at the Tea Research and Extension Substation, from which 113 internal transcribed spacer 2 (ITS2) fragments, 104 trnL intron, and 98 trnL-trnF intergenic sequence region DNA sequences were successfully sequenced. The similarity of the ITS2 nucleotide sequences between tea plants housed at the Tea Research and Extension Substation was 0.379-0.994. In this polymerase chain reaction-amplified noncoding region, no varieties possessed identical sequences. Compared with the trnL intron and trnL-trnF intergenic sequence fragments of chloroplast cpDNA, the proportion of ITS2 nucleotide sequence variation was large and is more suitable for establishing a DNA barcode database to identify tea plant varieties. After establishing the database, 30 imported teas and 35 domestic made teas were used in this model system to explore the feasibility of using ITS2 sequences to identify the varieties and provenances of made teas. A phylogenetic tree was constructed using ITS2 sequences with the unweighted pair group method with arithmetic mean, which indicated that the same variety of tea plant is likely to be successfully categorized into one cluster, but contamination from other tea plants was also detected. This result provides molecular evidence that the similarity between important tea varieties in Taiwan remains high. We suggest a direct, wide collection of made tea and original samples of tea plants to establish an ITS2 sequence molecular barcode identification database to identify the varieties and provenances of tea plants. The DNA barcode comparison method can satisfy the need for a rapid, low-cost, frontline differentiation of the large amount of made teas from Taiwan and abroad, and can provide molecular evidence of their varieties and provenances. Copyright © 2016. Published by Elsevier B.V.
Embedding strategies for effective use of information from multiple sequence alignments.

PubMed Central

Henikoff, S.; Henikoff, J. G.

1997-01-01

We describe a new strategy for utilizing multiple sequence alignment information to detect distant relationships in searches of sequence databases. A single sequence representing a protein family is enriched by replacing conserved regions with position-specific scoring matrices (PSSMs) or consensus residues derived from multiple alignments of family members. In comprehensive tests of these and other family representations, PSSM-embedded queries produced the best results overall when used with a special version of the Smith-Waterman searching algorithm. Moreover, embedding consensus residues instead of PSSMs improved performance with readily available single sequence query searching programs, such as BLAST and FASTA. Embedding PSSMs or consensus residues into a representative sequence improves searching performance by extracting multiple alignment information from motif regions while retaining single sequence information where alignment is uncertain. PMID:9070452
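The consensus-embedding idea in this abstract can be illustrated with a minimal sketch. It assumes ungapped alignment blocks and a simple majority rule for the consensus; the function names, the block layout, and the toy sequences are hypothetical and are not taken from the authors' software.

```python
from collections import Counter


def column_consensus(block):
    """Return the most common residue in each column of an ungapped block."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*block))


def embed_consensus(representative, blocks):
    """Overwrite conserved regions of a representative sequence.

    `blocks` maps a 0-based start offset in `representative` to the list of
    aligned family segments covering that region (all of equal length).
    Residues outside the blocks are left as single-sequence information.
    """
    seq = list(representative)
    for start, segments in blocks.items():
        consensus = column_consensus(segments)
        seq[start:start + len(consensus)] = consensus
    return "".join(seq)


# Toy example (made-up sequences): one conserved block starting at offset 2.
query = embed_consensus("MKTAYIAKQRQISFVK", {2: ["TAYI", "TGYI", "SGYI"]})
print(query)
```

The resulting string could then be passed to a single-sequence search program; embedding PSSMs instead of consensus residues would need a search tool that accepts position-specific scores.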
Molecular and Biochemical Analysis of the Galactose Phenotype of Dairy Streptococcus thermophilus Strains Reveals Four Different Fermentation Profiles

PubMed Central

de Vin, Filip; Rådström, Peter; Herman, Lieve; De Vuyst, Luc

2005-01-01

Lactose-limited fermentations of 49 dairy Streptococcus thermophilus strains revealed four distinct fermentation profiles with respect to galactose consumption after lactose depletion. All the strains excreted galactose into the medium during growth on lactose, except for strain IMDOST40, which also displayed extremely high galactokinase (GalK) activity. Among this strain collection eight galactose-positive phenotypes sensu stricto were found and their fermentation characteristics and Leloir enzyme activities were measured. As the gal promoter seems to play an important role in the galactose phenotype, the galR-galK intergenic region was sequenced for all strains yielding eight different nucleotide sequences (NS1 to NS8). The gal promoter played an important role in the Gal-positive phenotype but did not determine it exclusively. Although GalT and GalE activities were detected for all Gal-positive strains, GalK activity could only be detected for two out of eight Gal-positive strains. This finding suggests that the other six S. thermophilus strains metabolize galactose via an alternative route. For each type of fermentation profile obtained, a representative strain was chosen and four complete Leloir gene clusters were sequenced. It turned out that Gal-positive strains contained more amino acid differences within their gal genes than Gal-negative strains. Finally, the biodiversity regarding lactose-galactose utilization among the different S. thermophilus strains used in this study was shown by RAPD-PCR. Five Gal-positive strains that contain nucleotide sequence NS2 in their galR-galK intergenic region were closely related. PMID:16000774
The High Degree of Sequence Plasticity of the Arenavirus Noncoding Intergenic Region (IGR) Enables the Use of a Nonviral Universal Synthetic IGR To Attenuate Arenaviruses

PubMed Central

Iwasaki, Masaharu; Cubitt, Beatrice; Sullivan, Brian M.

2016-01-01

ABSTRACT Hemorrhagic fever arenaviruses (HFAs) pose important public health problems in regions where they are endemic. Concerns about human-pathogenic arenaviruses are exacerbated because of the lack of FDA-licensed arenavirus vaccines and because current antiarenaviral therapy is limited to an off-label use of ribavirin that is only partially effective. We have recently shown that the noncoding intergenic region (IGR) present in each arenavirus genome segment, the S and L segments (S-IGR and L-IGR, respectively), plays important roles in the control of virus protein expression and that this knowledge could be harnessed for the development of live-attenuated vaccine strains to combat HFAs. In this study, we further investigated the sequence plasticity of the arenavirus IGR. We demonstrate that recombinants of the prototypic arenavirus lymphocytic choriomeningitis virus (rLCMVs), whose S-IGRs were replaced by the S-IGR of Lassa virus (LASV) or an entirely nonviral S-IGR-like sequence (Ssyn), are viable, indicating that the function of S-IGR tolerates a high degree of sequence plasticity. In addition, rLCMVs whose L-IGRs were replaced by Ssyn or S-IGRs of the very distantly related reptarenavirus Golden Gate virus (GGV) were viable and severely attenuated in vivo but able to elicit protective immunity against a lethal challenge with wild-type LCMV. Our findings indicate that replacement of L-IGR by a nonviral Ssyn could serve as a universal molecular determinant of arenavirus attenuation. IMPORTANCE Hemorrhagic fever arenaviruses (HFAs) cause high rates of morbidity and mortality and pose important public health problems in regions where they are endemic. Implementation of live-attenuated vaccines (LAVs) will represent a major step to combat HFAs. Here we document that the arenavirus noncoding intergenic region (IGR) has a high degree of plasticity compatible with virus viability. 
This observation led us to generate recombinant LCMVs containing nonviral synthetic IGRs. These rLCMVs were severely attenuated in vivo but able to elicit protective immunity against a lethal challenge with wild-type LCMV. These nonviral synthetic IGRs can be used as universal molecular determinants of arenavirus attenuation for the rapid development of safe and effective, as well as stable, LAVs to combat HFA. PMID:26739049
Functional characterization of SNPs in CHRNA3/B4 intergenic region associated with drug behaviors

PubMed Central

Flora, Amber V; Zambrano, Cristian A; Gallego, Xavier; Miyamoto, Jill H; Johnson, Krista A; Cowan, Katelyn A; Stitzel, Jerry A; Ehringer, Marissa A

2013-01-01

The cluster of human neuronal nicotinic receptor genes (CHRNA5/A3/B4) (15q25.1) has been associated with a variety of smoking and drug-related behaviors, as well as risk for lung cancer. CHRNA3/B4 intergenic single nucleotide polymorphisms (SNPs) rs1948 and rs8023462 have been associated with early initiation of alcohol and tobacco use, and rs6495309 has been associated with nicotine dependence and risk for lung cancer. An in vitro luciferase expression assay was used to determine whether these SNPs and surrounding sequences contribute to differences in gene expression using cell lines either expressing proteins characteristic of neuronal tissue or derived from lung cancers. Electrophoretic mobility shift assays (EMSAs) were performed to investigate whether nuclear proteins from these cell lines bind SNP alleles differentially. Results from expression assays were dependent on cell culture type and haplotype. EMSAs indicated that rs8023462 and rs6495309 bind nuclear proteins in an allele-specific way. Additionally, GATA transcription factors appeared to bind rs8023462 only when the minor/risk allele was present. Much work has been done to describe the rat Chrnb4/a3 intergenic region, but few studies have examined the human intergenic region effects on expression; therefore, these studies greatly aid human genetic research as it relates to observed nicotine phenotypes, lung cancer risk and potential underlying genetic mechanisms. Data from these experiments support the hypothesis that SNPs associated with human addiction-related phenotypes and lung cancer risk can affect gene expression, and are potential therapeutic targets. Additionally, this is the first evidence that rs8023462 interacts with GATA transcription factors to influence gene expression. PMID:23872218
Genotypic characterization of Escherichia coli O157:H7 isolates from different sources in the North-West Province, South Africa, using enterobacterial repetitive intergenic consensus PCR analysis.

PubMed

Ateba, Collins Njie; Mbewe, Moses

2014-05-30

In many developing countries, proper hygiene is not strictly implemented when animals are slaughtered and meat products become contaminated. Contaminated meat may contain Escherichia coli (E. coli) O157:H7 that could cause diseases in humans if these food products are consumed undercooked. In the present study, a total of 94 confirmed E. coli O157:H7 isolates were subjected to the enterobacterial repetitive intergenic consensus (ERIC) polymerase chain reaction (PCR) typing to generate genetic fingerprints. The ERIC fragments were resolved by electrophoresis on 2% (w/v) agarose gels. The presence, absence and intensity of band data were obtained, exported to Microsoft Excel (Microsoft Office 2003) and used to generate a data matrix. The unweighted pair group method with arithmetic mean (UPGMA) and complete linkage algorithms were used to analyze the percentage of similarity and matrix data. Relationships between the various profiles and/or lanes were expressed as dendrograms. Data from groups of related lanes were compiled and reported on cluster tables. ERIC fragments ranged from one to 15 per isolate, and their sizes varied from 0.25 to 0.771 kb. A large proportion of the isolates produced an ERIC banding pattern with three duplets ranging in sizes from 0.408 to 0.628 kb. Eight major clusters (I-VIII) were identified. Overall, the remarkable similarities (72% to 91%) between the ERIC profiles for the isolate from animal species and their corresponding food products indicated some form of contamination, which may not exclude those at the level of the abattoirs. These results reveal that ERIC PCR analysis can be reliable in comparing the genetic profiles of E. coli O157:H7 from different sources in the North-West Province of South Africa.
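The band-matrix and UPGMA steps described in this abstract can be sketched as follows. This is a minimal illustration, assuming binary presence/absence profiles and a Jaccard similarity; the abstract does not state which similarity coefficient was used, and the isolate names and band patterns below are invented toy data.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two binary band profiles."""
    shared = sum(1 for x, y in zip(a, b) if x and y)
    union = sum(1 for x, y in zip(a, b) if x or y)
    return shared / union if union else 1.0


def upgma(profiles):
    """Cluster profiles by UPGMA; return [(cluster_a, cluster_b, distance)]."""
    dist = {frozenset((a, b)): 1.0 - jaccard(profiles[a], profiles[b])
            for a, b in combinations(profiles, 2)}

    def average(c1, c2):
        # UPGMA: mean of all pairwise distances between members.
        total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda p: average(*p))
        merges.append((c1, c2, average(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


# Hypothetical isolates: 1 = band present, 0 = band absent.
toy_profiles = {
    "cattle_1": [1, 1, 0, 1, 0],
    "beef_1": [1, 1, 0, 1, 0],
    "pig_1": [0, 0, 1, 0, 1],
}
for left, right, d in upgma(toy_profiles):
    print(left, right, round(d, 3))
```

The merge history is what a dendrogram would be drawn from; band intensity, which the study also recorded, is ignored in this sketch.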
Multiple splicing defects in an intronic false exon.

PubMed

Sun, H; Chasin, L A

2000-09-01

Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.
Whole Genome Sequencing of High-Risk Families to Identify New Mutational Mechanisms of Breast Cancer Predisposition

DTIC Science & Technology

2015-12-01

Read depth greater than 8 in at least one sample The Table below shows variant data from Family 1041 categorized by functional effect. Table 1...breast cancer Family 1041 . All Shared Rare Excluding IBD0 Intergenic 3,345,727 1,650,045 35,927 3,990 ncRNA 266,300 130,836 3,104 329 Up
Molecular Characterization of Indolent Prostate Cancer

DTIC Science & Technology

2016-12-01

prostate-specific antigen (PSA) test, and treated aggressively following diagnosis, leading to the contemporary problem of prostate cancer over-diagnosis... specific purpose of comparing low-risk and high-risk prostate cancer. Figure 3 shows the mapping rates for exon, intron, and intergenic sequences. The...FFPE specimens, for the specific comparison of low-risk and high-risk prostate cancer. 5. Identified sufficient number of biopsy cases and sections
Operon-mapper: A Web Server for Precise Operon Identification in Bacterial and Archaeal Genomes.

PubMed

Taboada, Blanca; Estrada, Karel; Ciria, Ricardo; Merino, Enrique

2018-06-19

Operon-mapper is a web server that accurately, easily, and directly predicts the operons of any bacterial or archaeal genome sequence. The operon predictions are based on the intergenic distance of neighboring genes as well as the functional relationships of their protein-coding products. To this end, Operon-mapper finds all the ORFs within a given nucleotide sequence, along with their genomic coordinates, orthology groups, and functional relationships. We believe that Operon-mapper, due to its accuracy, simplicity and speed, as well as the relevant information that it generates, will be a useful tool for annotating and characterizing genomic sequences. http://biocomputo.ibt.unam.mx/operon_mapper/.
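The intergenic-distance criterion mentioned in this abstract can be illustrated with a toy grouping rule. This is only a sketch under stated assumptions: the 50 bp cutoff, the `Gene` record, and the function name are hypothetical, and the functional-relationship scoring that Operon-mapper also uses is not modeled here.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    strand: str  # "+" or "-"


def group_by_intergenic_distance(genes, max_gap=50):
    """Group adjacent same-strand genes whose gap is at most `max_gap` bp."""
    groups = []
    for gene in sorted(genes, key=lambda g: g.start):
        prev = groups[-1][-1] if groups else None
        close = (prev is not None
                 and gene.strand == prev.strand
                 and gene.start - prev.end - 1 <= max_gap)
        if close:
            groups[-1].append(gene)
        else:
            groups.append([gene])
    return groups


# Made-up coordinates for illustration only.
toy_genes = [
    Gene("a", 1, 900, "+"),
    Gene("b", 910, 1800, "+"),
    Gene("c", 2500, 3300, "+"),
    Gene("d", 3310, 4000, "-"),
]
for group in group_by_intergenic_distance(toy_genes):
    print([g.name for g in group])
```

A distance-only rule like this is a crude first pass; combining it with evidence of functional relatedness is what the abstract credits for the server's accuracy.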
Diversity, community structure, and bioremediation potential of mercury-resistant marine bacteria of estuarine and coastal environments of Odisha, India.

PubMed

Dash, Hirak R; Das, Surajit

2016-04-01

Both point and non-point sources increase the pollution status of mercury and increase the population of mercury-resistant marine bacteria (MRMB). They can be targeted as the indicator organism to assess marine mercury pollution, besides utilization in bioremediation. Thus, sediment and water samples were collected for 2 years (2010-2012) along Odisha coast of Bay of Bengal, India. Mercury content of the study sites varied from 0.47 to 0.99 ppb irrespective of the seasons of sampling. A strong positive correlation was observed between mercury content and MRMB population (P < 0.05) suggesting the utilization of these bacteria to assess the level of mercury pollution in the marine environment. Seventy-eight percent of the MRMB isolates were under the phylum Firmicutes, and 36 and 31% of them could resist mercury by mer operon-mediated volatilization and mercury biosorption, respectively. In addition, most of the isolates could resist a number of antibiotics and toxic metals. All the MRMB isolates possess the potential of growth and survival at cardinal pH (4-8), temperature (25-37 °C), and salinity (5-35 psu). Enterobacteria repetitive intergenic consensus (ERIC) and repetitive element palindromic PCR (REP-PCR) produced fingerprints corroborating the results of 16S rRNA gene sequencing. Fourier transform infrared (FTIR) spectral analysis also revealed strain-level speciation and phylogenetic relationships.
Detection of antibiotic-resistant bacteria endowed with antimicrobial activity from a freshwater lake and their phylogenetic affiliation

PubMed Central

Zothanpuia; Passari, Ajit K.; Gupta, Vijai K.

2016-01-01

Antimicrobial resistance poses a serious challenge to global public health. In this study, fifty bacterial strains were isolated from the sediments of a freshwater lake and were screened for antibiotic resistance. Out of fifty isolates, thirty-three isolates showed resistance against at least two of the selected antibiotics. Analysis of 16S rDNA sequencing revealed that the isolates belonged to ten different genera, namely Staphylococcus (n = 8), Bacillus (n = 7), Lysinibacillus (n = 4), Achromobacter (n = 3), bacterium (n = 3), Methylobacterium (n = 2), Bosea (n = 2), Aneurinibacillus (n = 2), Azospirillum (n = 1), Novosphingobium (n = 1). Enterobacterial repetitive intergenic consensus (ERIC) and BOX-PCR markers were used to study the genetic relatedness among the antibiotic resistant isolates. Further, the isolates were screened for their antimicrobial activity against bacterial pathogens viz., Staphylococcus aureus (MTCC-96), Pseudomonas aeruginosa (MTCC-2453) and Escherichia coli (MTCC-739), and pathogenic fungi viz., Fusarium proliferatum (MTCC-286), Fusarium oxysporum (CABI-293942) and Fusarium oxy. ciceri (MTCC-2791). In addition, biosynthetic genes (polyketide synthase II (PKS-II) and non-ribosomal peptide synthetase (NRPS)) were detected in six and seven isolates, respectively. This is the first report for the multifunctional analysis of the bacterial isolates from a wetland with biosynthetic potential, which could serve as potential source of useful biologically active metabolites. PMID:27330861
Identification of carbapenem-resistant Pseudomonas aeruginosa in selected hospitals of the Gulf Cooperation Council States: dominance of high-risk clones in the region.

PubMed

Zowawi, Hosam M; Syrmis, Melanie W; Kidd, Timothy J; Balkhy, Hanan H; Walsh, Timothy R; Al Johani, Sameera M; Al Jindan, Reem Y; Alfaresi, Mubarak; Ibrahim, Emad; Al-Jardani, Amina; Al Salman, Jameela; Dashti, Ali A; Sidjabat, Hanna E; Baz, Omar; Trembizki, Ella; Whiley, David M; Paterson, David L

2018-04-17

The molecular epidemiology and resistance mechanisms of carbapenem-resistant Pseudomonas aeruginosa (CRPA) were determined in hospitals in the countries of the Gulf Cooperation Council (GCC), namely, Saudi Arabia, the United Arab Emirates, Oman, Qatar, Bahrain and Kuwait. Isolates were screened for common carbapenem-resistance genes by PCR. Relatedness between isolates was assessed using previously described genotyping methods: an informative-single nucleotide polymorphism MassARRAY iPLEX assay (iPLEX20SNP) and the enterobacterial repetitive intergenic consensus (ERIC)-PCR assay, with selected isolates being subjected to multilocus sequence typing (MLST). Ninety-five non-repetitive isolates that were found to be resistant to carbapenems were subjected to further investigation. Results/Key findings: The most prevalent carbapenemase-encoding gene, blaVIM-type, was found in 37/95 (39%) isolates, while only 1 isolate (from UAE) was found to have blaIMP-type. None of the CRPA were found to have blaNDM-type or blaKPC-type. We found a total of 14 sequence type (ST) clusters, with 4 of these clusters being observed in more than 1 country. Several clusters belonged to the previously recognized internationally disseminated high-risk clones ST357, ST235, ST111, ST233 and ST654. We also found the less predominant ST316, ST308 and ST823 clones, and novel MLST types (ST2010, ST2011, ST2012 and ST2013), in our collection. Overall our data show that 'high-risk' CRPA clones are now detected in the region and highlight the need for strategies to limit further spread of such organisms, including enhanced surveillance, infection control precautions and further promotion of antibiotic stewardship programmes.
A novel functional class 2 integron in clinical Proteus mirabilis isolates.

PubMed

Wei, Quhao; Hu, Qingfeng; Li, Shanshan; Lu, Huoyang; Chen, Guoqiang; Shen, Beiqiong; Zhang, Ping; Zhou, Yonglie

2014-04-01

To describe a novel functional class 2 integron that was found in clinical Proteus mirabilis isolates. Class 1 and 2 integrons were screened by PCR in 153 clinical Proteus isolates. The variable regions of class 1 and 2 integrons were determined by restriction analysis and sequencing. The mutations of internal stop codons in class 2 integrons and their common promoters were also determined by sequencing. Enterobacterial repetitive intergenic consensus (ERIC)-PCR was used to analyse the phylogenetic relations of class 2 integron-positive P. mirabilis isolates. Class 1 integrons were detected in 96 (63%) of 153 Proteus isolates: eight different gene cassette arrays were detected, including dfrA32-ereA1-aadA2, which was detected for the first time in P. mirabilis. Class 2 integrons were detected in 101 (66%) of 153 Proteus isolates: four different gene cassette arrays were detected, including dfrA1-catB2-sat2-aadA1, which was detected for the first time in a class 2 integron. A novel functional class 2 integron was detected in 38 P. mirabilis isolates with a common promoter (-35 TTTAAT|16 bp|-10 TAAAGT). The variable region of this functional class 2 integron contained dfrA14 and three novel open reading frames with unknown functions. Very similar ERIC-PCR fingerprinting patterns were detected in these 38 P. mirabilis isolates and were different from other class 2 integron-positive isolates. A novel functional class 2 integron was found for the first time in P. mirabilis. These functional class 2 integron-harbouring P. mirabilis isolates were likely to be clonally spread in our hospital.
Variability and molecular typing of the woody-tree infecting prunus necrotic ringspot ilarvirus.

PubMed

Vasková, D; Petrzik, K; Karesová, R

2000-01-01

The 3'-part of the movement protein gene, the intergenic region and the complete coat protein gene of sixteen isolates of Prunus necrotic ringspot virus (PNRSV) from five different host species from the Czech Republic were sequenced in order to search for the bases of extensive variability of viroses caused by this pathogen. According to phylogenetic analyses all the 46 isolates sequenced to date split into three main groups, which correlated to a certain extent with their geographic origin. Modelled serological properties showed that all the new isolates belong to one serotype.
Comparative Mitogenomic Analyses of Praying Mantises (Dictyoptera, Mantodea): Origin and Evolution of Unusual Intergenic Gaps

PubMed Central

Zhang, Hong-Li; Ye, Fei

2017-01-01

Praying mantises are a diverse group of predatory insects. Although some Mantodea mitogenomes have been reported, a comprehensive comparative and evolutionary genomic study is lacking for this group. In the present study, four new mitogenomes were sequenced, annotated, and compared to the previously published mitogenomes of other Mantodea species. Most Mantodea mitogenomes share a typical set of mitochondrial genes and a putative control region (CR). Additionally, and most intriguingly, another large non-coding region (LNC) was detected between trnM and ND2 in all six Paramantini mitogenomes examined. The main section in this common region of Paramantini may have initially originated from the corresponding control region for each species, whereas sequence differences between the LNCs and CRs and phylogenetic analyses indicate that LNC and CR are largely independently evolving. Namely, the LNC (the duplicated CR) may have subsequently degenerated during evolution. Furthermore, evidence suggests that special intergenic gaps have been introduced in some species through gene rearrangement and duplication. These gaps are actually the original abutting sequences of migrated or duplicated genes. Some gaps (G5 and G6) are homologous to the 5' and 3' surrounding regions of the duplicated gene in the original gene order, and another specific gap (G7) has tandem repeats. We analysed the phylogenetic relationships of fifteen Mantodea species using 37 concatenated mitochondrial genes and detected several synapomorphies unique to species in some clades. PMID:28367101
Structural variant of the intergenic internal ribosome entry site elements in dicistroviruses and computational search for their counterparts

PubMed Central

HATAKEYAMA, YOSHINORI; SHIBUYA, NORIHIRO; NISHIYAMA, TAKASHI; NAKASHIMA, NOBUHIKO

2004-01-01

The intergenic region (IGR) located upstream of the capsid protein gene in dicistroviruses contains an internal ribosome entry site (IRES). Translation initiation mediated by the IRES does not require initiator methionine tRNA. Comparison of the IGRs among dicistroviruses suggested that Taura syndrome virus (TSV) and acute bee paralysis virus have an extra side stem loop in the predicted IRES. We examined whether the side stem is responsible for translation activity mediated by the IGR using constructs with compensatory mutations. In vitro translation analysis showed that TSV has an IGR-IRES that is structurally distinct from those previously described. Because IGR-IRES elements determine the translation initiation site by virtue of their own tertiary structure formation, the discovery of this initiation mechanism suggests the possibility that eukaryotic mRNAs might have more extensive coding regions than previously predicted. To test this hypothesis, we searched full-length cDNA databases and whole genome sequences of eukaryotes using the pattern matching program, Scan For Matches, with parameters that can extract sequences containing secondary structure elements resembling those of IGR-IRES. Our search yielded several sequences, but their predicted secondary structures were suggested to be unstable in comparison to those of dicistroviruses. These results suggest that RNAs structurally similar to dicistroviruses are not common. If some eukaryotic mRNAs are translated independently of an initiator methionine tRNA, their structures are likely to be significantly distinct from those of dicistroviruses. PMID:15100433
Generation and Characterization of HIV-1 Transmitted and Founder Virus Consensus Sequence from Intravenous Drug Users in Xinjiang, China.

PubMed

Li, Fan; Ma, Liying; Feng, Yi; Hu, Jing; Ni, Na; Ruan, Yuhua; Shao, Yiming

2017-06-01

HIV-1 transmission in intravenous drug users (IDUs) has been characterized by high genetic multiplicity and suggests a greater challenge for HIV-1 infection blocking. We investigated a total of 749 sequences of full-length gp160 gene obtained by single genome sequencing (SGS) from 22 HIV-1 early infected IDUs in Xinjiang province, northwest China, and generated a transmitted and founder virus (T/F virus) consensus sequence (IDU.CON). The T/F virus was classified as subtype CRF07_BC and predicted to be CCR5-tropic virus. The variable region (V1, V2, and V4 loop) of IDU.CON showed length variation compared with the heterosexual T/F virus consensus sequence (HSX.CON) and homosexual T/F virus consensus sequence (MSM.CON). A total of 26 N-linked glycosylation sites were discovered in the IDU.CON sequence, which is fewer than in MSM.CON and HSX.CON. Characterization of T/F virus from IDUs highlights the genetic make-up and complexity of virus near the moment of transmission or in early infection preceding systemic dissemination and is important toward the development of effective HIV-1 preventive methods, including vaccines.
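Counting potential N-linked glycosylation sites, as reported in this abstract, is commonly done by scanning for the sequon N-X-S/T, where X is any residue except proline. The sketch below assumes that motif definition; the abstract does not say which tool or rule the authors used, and the peptide in the example is made up.

```python
import re

# Lookahead so that overlapping sequons (e.g. "NNTS") are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return 0-based positions of potential N-linked glycosylation sequons."""
    return [m.start() for m in SEQUON.finditer(protein.upper())]


# Toy peptide, not a real gp160 fragment.
sites = find_sequons("MNGTANPSNNTS")
print(len(sites), sites)
```

Applied to two consensus sequences, the difference between the lengths of the returned lists gives the kind of site-count comparison the abstract describes.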
Candida guilliermondii and Other Species of Candida Misidentified as Candida famata: Assessment by Vitek 2, DNA Sequencing Analysis, and Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry in Two Global Antifungal Surveillance Programs

PubMed Central

Woosley, Leah N.; Diekema, Daniel J.; Jones, Ronald N.; Pfaller, Michael A.

2013-01-01

Candida famata (teleomorph Debaryomyces hansenii) has been described as a medically relevant yeast, and this species has been included in many commercial identification systems that are currently used in clinical laboratories. Among 53 strains collected during the SENTRY and ARTEMIS surveillance programs and previously identified as C. famata (includes all submitted strains with this identification) by a variety of commercial methods (Vitek, MicroScan, API, and AuxaColor), DNA sequencing methods demonstrated that 19 strains were C. guilliermondii, 14 were C. parapsilosis, 5 were C. lusitaniae, 4 were C. albicans, and 3 were C. tropicalis, and five isolates belonged to other Candida species (two C. fermentati and one each C. intermedia, C. pelliculosa, and Pichia fabianni). Additionally, three misidentified C. famata strains were correctly identified as Kodomaea ohmeri, Debaryomyces nepalensis, and Debaryomyces fabryi using intergenic transcribed spacer (ITS) and/or intergenic spacer (IGS) sequencing. The Vitek 2 system identified three isolates with high confidence to be C. famata and another 15 with low confidence between C. famata and C. guilliermondii or C. parapsilosis, displaying only 56.6% agreement with DNA sequencing results. Matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) results displayed 81.1% agreement with DNA sequencing. One strain each of C. metapsilosis, C. fermentati, and C. intermedia demonstrated a low score for identification (<2.0) in the MALDI Biotyper. K. ohmeri, D. nepalensis, and D. fabryi identified by DNA sequencing in this study were not in the current database for the MALDI Biotyper. These results suggest that the occurrence of C. famata in fungal infections is much lower than previously appreciated and that commercial systems do not produce accurate identifications except for the newly introduced MALDI-TOF instruments. PMID:23100350
Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome.

PubMed

Nicholson, Matthew J; Theodorou, Michael K; Brookman, Jayne L

2005-01-01

The anaerobic gut fungi occupy a unique niche in the intestinal tract of large herbivorous animals and are thought to act as primary colonizers of plant material during digestion. They are the only known obligately anaerobic fungi but molecular analysis of this group has been hampered by difficulties in their culture and manipulation, and by their extremely high A+T nucleotide content. This study begins to answer some of the fundamental questions about the structure and organization of the anaerobic gut fungal genome. Directed plasmid libraries using genomic DNA digested with highly or moderately rich AT-specific restriction enzymes (VspI and EcoRI) were prepared from a polycentric Orpinomyces isolate. Clones were sequenced from these libraries and the breadth of genomic inserts, both genic and intergenic, was characterized. Genes encoding numerous functions not previously characterized for these fungi were identified, including cytoskeletal, secretory pathway and transporter genes. A peptidase gene with no introns and having sequence similarity to a gene encoding a bacterial peptidase was also identified, extending the range of metabolic enzymes resulting from apparent trans-kingdom transfer from bacteria to fungi, as previously characterized largely for genes encoding plant-degrading enzymes. This paper presents the first thorough analysis of the genic, intergenic and rDNA regions of a variety of genomic segments from an anaerobic gut fungus and provides observations on rules governing intron boundaries, the codon biases observed with different types of genes, and the sequence of only the second anaerobic gut fungal promoter reported. Large numbers of retrotransposon sequences of different types were found and the authors speculate on the possible consequences of any such transposon activity in the genome. 
The coding sequences identified included several orphan gene sequences, including one with regions strongly suggestive of structural proteins such as collagens and lampirin. This gene was present as a single copy in Orpinomyces, was expressed during vegetative growth and was also detected in genomes from another gut fungal genus, Neocallimastix.

[Phylogenetic relationships of the species of Oxytropis DC. subg. Oxytropis and Phacoxytropis (Fabaceae) from Asian Russia inferred from the nucleotide sequence analysis of the intergenic spacers of the chloroplast genome].

PubMed

Kholina, A B; Kozyrenko, M M; Artyukova, E V; Sandanov, D V; Andrianova, E A

2016-08-01

The nucleotide sequence analysis of trnH–psbA, trnL–trnF, and trnS–trnG intergenic spacer regions of chloroplast DNA performed in the representatives of the genus Oxytropis from Asian Russia provided clarification of the phylogenetic relationships of some species and sections in the subgenera Oxytropis and Phacoxytropis and in the genus Oxytropis as a whole. Only the section Mesogaea corresponds to the subgenus Phacoxytropis, while the section Janthina of the same subgenus groups together with the sections of the subgenus Oxytropis. The sections Chrysantha and Ortholoma of the subgenus Oxytropis are not only closely related to each other, but together with the section Mesogaea, they are grouped into the subgenus Phacoxytropis. It seems likely that the sections Chrysantha and Ortholoma should be assigned to the subgenus Phacoxytropis, and the section Janthina should be assigned to the subgenus Oxytropis. The molecular differences were identified between O. coerulea and O. mandshurica from the section Janthina that were indicative of considerable divergence of their chloroplast genomes and the species independence of the taxa. The species independence of O. czukotica belonging to the section Arctobia was also confirmed.
Differentiation of Populus species using chloroplast single nucleotide polymorphism (SNP) markers--essential for comprehensible and reliable poplar breeding.

PubMed

Schroeder, H; Hoeltken, A M; Fladung, M

2012-03-01

Within the genus Populus several species belonging to different sections are cross-compatible. Hence, high numbers of interspecies hybrids occur naturally and, additionally, have been artificially produced in huge breeding programmes during the last 100 years. Therefore, determination of a single poplar species, used for the production of 'multi-species hybrids', is often difficult, and represents a great challenge for the use of molecular markers in species identification. Within this study, over 20 chloroplast regions, both intergenic spacers and coding regions, have been tested for their ability to differentiate different poplar species using 23 already published barcoding primer combinations and 17 newly designed primer combinations. About half of the published barcoding primers yielded amplification products, whereas the new primers designed on the basis of the total sequenced cpDNA genome of Populus trichocarpa Torr. & Gray yielded much higher amplification success. Intergenic spacers were found to be more variable than coding regions within the genus Populus. The highest discrimination power of Populus species was found in the combination of two intergenic spacers (trnG-psbK, psbK-psbI) and the coding region rpoC. In barcoding projects, the coding regions matK and rbcL are often recommended, but within the genus Populus they only show moderate variability and are not efficient in species discrimination. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
Comprehensive Identification of Meningococcal Genes and Small Noncoding RNAs Required for Host Cell Colonization

PubMed Central

Capel, Elena; Zomer, Aldert L.; Nussbaumer, Thomas; Bole, Christine; Izac, Brigitte; Frapy, Eric; Meyer, Julie; Bouzinba-Ségard, Haniaa; Bille, Emmanuelle; Jamet, Anne; Cavau, Anne; Letourneur, Franck; Bourdoulous, Sandrine; Rattei, Thomas; Coureuil, Mathieu

2016-01-01

ABSTRACT Neisseria meningitidis is a leading cause of bacterial meningitis and septicemia, affecting infants and adults worldwide. N. meningitidis is also a common inhabitant of the human nasopharynx and, as such, is highly adapted to its niche. During bacteremia, N. meningitidis gains access to the blood compartment, where it adheres to endothelial cells of blood vessels and causes dramatic vascular damage. Colonization of the nasopharyngeal niche and communication with the different human cell types is a major issue of the N. meningitidis life cycle that is poorly understood. Here, highly saturated random transposon insertion libraries of N. meningitidis were engineered, and the fitness of mutations during routine growth and that of colonization of endothelial and epithelial cells in a flow device were assessed in a transposon insertion site sequencing (Tn-seq) analysis. This allowed the identification of genes essential for bacterial growth and genes specifically required for host cell colonization. In addition, after having identified the small noncoding RNAs (sRNAs) located in intergenic regions, the phenotypes associated with mutations in those sRNAs were defined. A total of 383 genes and 8 intergenic regions containing sRNA candidates were identified to be essential for growth, while 288 genes and 33 intergenic regions containing sRNA candidates were found to be specifically required for host cell colonization. PMID:27486197
Construction of an ultra-high density consensus genetic map, and enhancement of the physical map from genome sequencing in Lupinus angustifolius.

PubMed

Zhou, Gaofeng; Jian, Jianbo; Wang, Penghao; Li, Chengdao; Tao, Ye; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark; Yang, Huaan

2018-01-01

An ultra-high density genetic map containing 34,574 sequence-defined markers was developed in Lupinus angustifolius. Markers closely linked to nine genes of agronomic traits were identified. A physical map was improved to cover 560.5 Mb genome sequence. Lupin (Lupinus angustifolius L.) is a recently domesticated legume grain crop. In this study, we applied the restriction-site associated DNA sequencing (RADseq) method to genotype an F9 recombinant inbred line population derived from a wild type × domesticated cultivar (W × D) cross. A high density linkage map was developed based on the W × D population. By integrating sequence-defined DNA markers reported in previous mapping studies, we established an ultra-high density consensus genetic map, which contains 34,574 markers consisting of 3508 loci covering 2399 cM on 20 linkage groups. The largest gap in the entire consensus map was 4.73 cM. The high density W × D map and the consensus map were used to develop an improved physical map, which covered 560.5 Mb of genome sequence data. The ultra-high density consensus linkage map, the improved physical map and the markers linked to genes of breeding interest reported in this study provide a common tool for genome sequence assembly, structural genomics, comparative genomics, functional genomics, QTL mapping, and molecular plant breeding in lupin.
RNAP-II transcribes two small RNAs at the promoter and terminator regions of the RNAP-I gene in Saccharomyces cerevisiae.

PubMed

Mayán, Maria D

2013-01-01

Three RNA polymerases coexist in the ribosomal DNA of Saccharomyces cerevisiae. RNAP-I transcribes the 35S rRNA, RNAP-III transcribes the 5S rRNA and RNAP-II is found in both intergenic non-coding regions. Previously, we demonstrated that RNAP-II molecules bound to the intergenic non-coding regions (IGS) of the ribosomal locus are mainly found in a stalled conformation, and the stalled polymerase mediates chromatin interactions, which isolate RNAP-I from the RNAP-III transcriptional domain. Besides, RNAP-II transcribes both IGS regions at low levels, using different cryptic promoters. This report demonstrates that RNAP-II also transcribes two sequences located in the 5'- and 3'-ends of the 35S rRNA gene that overlap with the sequences of the 35S rRNA precursor transcribed by RNAP-I. The sequence located at the promoter region of RNAP-I, called the p-RNA transcript, binds to the transcription termination-related protein, Reb1p, while the T-RNA sequence, located in the termination sites of RNAP-I gene, contains the stem-loop recognized by Rnt1p, which is necessary for proper termination of RNAP-I. Because of their location, these small RNAs may play a key role in the initiation and termination of RNAP-I transcription. To correctly synthesize proteins, eukaryotic cells may retain a mechanism that connects the three main polymerases. This report suggests that cryptic transcription by RNAP-II may be required for normal transcription by RNAP-I in the ribosomal locus of S. cerevisiae. Copyright © 2012 John Wiley & Sons, Ltd.
Overexpression of the Lactobacillus plantarum peptidoglycan biosynthesis murA2 gene increases the tolerance of Escherichia coli to alcohols and enhances ethanol production.

PubMed

Yuan, Yongbo; Bi, Changhao; Nicolaou, Sergios A; Zingaro, Kyle A; Ralston, Matthew; Papoutsakis, Eleftherios T

2014-10-01

A major challenge in producing chemicals and biofuels is to increase the tolerance of the host organism to toxic products or byproducts. An Escherichia coli strain with superior ethanol and more generally alcohol tolerance was identified by screening a library constructed by randomly integrating Lactobacillus plantarum genomic DNA fragments into the E. coli chromosome via Cre-lox recombination. Sequencing identified the inserted DNA fragment as the murA2 gene and its upstream intergenic 973-bp sequence, both coded on the negative genomic DNA strand. Overexpression of this murA2 gene and its upstream 973-bp sequence significantly enhanced ethanol tolerance in both E. coli EC100 and wild type E. coli MG1655 strains by 4.1-fold and 2.0-fold compared to control strains, respectively. Tolerance to n-butanol and i-butanol in E. coli MG1655 was increased by 1.85-fold and 1.91-fold, respectively. We show that the intergenic 973-bp sequence contains a native promoter for the murA2 gene along with a long 5' UTR (286 nt) on the negative strand, while a noncoding, small RNA, named MurA2S, is expressed off the positive strand. MurA2S is expressed in E. coli and may interact with murA2, but it does not affect murA2's ability to enhance alcohol tolerance in E. coli. Overexpression of murA2 with its upstream region in the ethanologenic E. coli KO11 strain significantly improved ethanol production in cultures that simulate the industrial Melle-Boinot fermentation process.
Survey and analysis of simple sequence repeats in the Laccaria bicolor genome, with development of microsatellite markers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Labbe, Jessy L; Murat, Claude; Morin, Emmanuelle

It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
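As an illustration of what an in silico SSR survey involves, the following is a minimal Python sketch that scans a sequence for perfect mono- to trinucleotide tandem repeats. It is not the MISA program used in the study; the function name `find_ssrs` and the minimum repeat thresholds are hypothetical choices made only for this example.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats.

    min_repeats maps motif length -> minimum number of repeat units.
    The default thresholds are illustrative assumptions, not the study's settings.
    """
    if min_repeats is None:
        min_repeats = {1: 10, 2: 6, 3: 5}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # Outer group = whole repeat tract, inner group = single repeat unit.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(2)
            # Skip motifs that are just a shorter unit repeated (e.g. "AA", "CCC").
            redundant = any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            )
            if redundant:
                continue
            hits.append((m.start(), motif, len(m.group(1)) // unit_len))
    return sorted(hits)

# Toy sequence: a (AT)7 dinucleotide repeat and an A12 mononucleotide run.
example = "GGC" + "AT" * 7 + "CCG" + "A" * 12 + "TTG"
ssrs = find_ssrs(example)
print(ssrs)
```

Dividing the number of hits by the sequence length in megabases would give an SSR density comparable in form to the SSRs/Mb figures quoted in the abstract, although real surveys also handle compound and imperfect repeats.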
Diagnosis of clinical samples spotted on FTA cards using PCR-based methods.

PubMed

Jamjoom, Manal; Sultan, Amal H

2009-04-01

The broad clinical presentation of leishmaniasis makes the diagnosis of current and past cases of this disease rather difficult. Differential diagnosis is important because diseases with other aetiologies and a clinical spectrum similar to that of leishmaniasis (e.g. leprosy, skin cancers and tuberculosis for CL; malaria and schistosomiasis for VL) are often present in areas of endemicity. Presently, a variety of methods have been developed and tested to aid the identification and diagnosis of Leishmania. The advent of the PCR technology has opened new channels for the diagnosis of leishmaniasis in a variety of clinical materials. PCR is a simple, rapid procedure that has been adapted for diagnosis of leishmaniasis. A range of tools is currently available for the diagnosis and identification of leishmaniasis and Leishmania species, respectively. However, none of these diagnostic tools has been examined and tested using samples spotted on FTA cards. Three different PCR-based approaches were examined: kDNA minicircle, Leishmania 18S rRNA gene and PCR-RFLP of the intergenic region of ribosomal protein. PCR primers were designed that sit within the coding sequences of genes (relatively well conserved) but which amplify across the intervening intergenic sequence (relatively variable). These were used in PCR-RFLP on reference isolates of 10 of the most important Leishmania species: L. donovani, L. infantum, L. major & L. tropica. Digestion of PCR products with restriction enzymes produced species-specific restriction patterns that allowed discrimination of reference isolates. The kDNA minicircle primers are highly sensitive in diagnosis of both bone marrow and skin smears from FTA cards. The Leishmania 18S rRNA gene conserved region is sensitive in identification of bone marrow smears but less sensitive in diagnosing skin smears. 
The intergenic nested PCR-RFLP using the newly designed primers P5 & P6 as well as P1 & P2 showed a high level of reproducibility and sensitivity. Although it was less sensitive than the kDNA minicircle primers, it easily discriminated between Leishmania species.
Chloroplast Genome Differences between Asian and American Equisetum arvense (Equisetaceae) and the Origin of the Hypervariable trnY-trnE Intergenic Spacer

PubMed Central

Kim, Hyoung Tae; Kim, Ki-Joong

2014-01-01

Comparative analyses of complete chloroplast (cp) DNA sequences within a species may provide clues to understand the population dynamics and colonization histories of plant species. Equisetum arvense (Equisetaceae) is a widely distributed fern species in northeastern Asia, Europe, and North America. The complete cp DNA sequences from Asian and American E. arvense individuals were compared in this study. The Asian E. arvense cp genome was 583 bp shorter than that of the American E. arvense. In total, 159 indels were observed between the two individuals, most of which were concentrated in the hypervariable trnY-trnE intergenic spacer (IGS) in the large single-copy (LSC) region of the cp genome. This IGS region held a series of 19 bp repeating units. Differences in the number of 19 bp repeat units were responsible for 78% of the total length difference between the two cp genomes. Furthermore, only closely related species of Equisetum also show the hypervariable nature of the trnY-trnE IGS. By contrast, only a single indel was observed in the gene coding regions: the ycf1 gene showed 24 bp differences between the two continental individuals due to a single tandem-repeat indel. A total of 165 single-nucleotide polymorphisms (SNPs) were recorded between the two cp genomes. Of these, 52 SNPs (31.5%) were distributed in coding regions, 13 SNPs (7.9%) were in introns, and 100 SNPs (60.6%) were in intergenic spacers (IGS). The overall difference between the Asian and American E. arvense cp genomes was 0.12%. Despite the relatively high genetic diversity between Asian and American E. arvense, the two populations are recognized as a single species based on their high morphological similarity. This indicated that the two regional populations have been in morphological stasis. PMID:25157804
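The SNP shares quoted in the abstract above can be re-derived from the reported counts. The short Python sketch below does only that arithmetic; the dictionary keys are labels chosen for this example.

```python
# SNP counts by region as reported in the abstract (52 + 13 + 100 = 165).
snp_counts = {"coding": 52, "intron": 13, "intergenic_spacer": 100}

total = sum(snp_counts.values())

# Percentage of all SNPs falling in each region, rounded to one decimal place.
shares = {region: round(100 * n / total, 1) for region, n in snp_counts.items()}

print(total, shares)
```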
Reclassification of Borrelia spp. isolated in South Korea using Multilocus Sequence Typing.

PubMed

Park, Kyung-Hee; Choi, Yeon-Joo; Kim, Jeoungyeon; Park, Hye-Jin; Song, Dayoung; Jang, Won-Jong

2018-05-31

Using Borrelia isolates from South Korea, we applied MLST and typing of three loci (16S rRNA, ospA, and the 5S-23S IGS) to analyze the relationship between host and vector and the molecular background. Using the MLST analysis, we identified B. afzelii, B. yangtzensis, B. garinii, and B. bavariensis. This study is the first report of the identification of B. yangtzensis by MLST in South Korea.
[Structural organization of 5S ribosomal DNA of Rosa rugosa].

PubMed

Tynkevych, Iu O; Volkov, R A

2014-01-01

In order to clarify molecular organization of the genomic region encoding 5S rRNA in diploid species Rosa rugosa several 5S rDNA repeated units were cloned and sequenced. Analysis of the obtained sequences revealed that only one length variant of 5S rDNA repeated units, which contains intact promoter elements in the intergenic spacer region (IGS) and appears to be transcriptionally active is present in the genome. Additionally, a limited number of 5S rDNA pseudogenes lacking a portion of coding sequence and the complete IGS was detected. A high level of sequence similarity (from 93.7 to 97.5%) between the IGS of major 5S rDNA variants of East Asian R. rugosa and North American R. nitida was found indicating comparatively recent divergence of these species.
Multispacer Typing of Rickettsia prowazekii Enabling Epidemiological Studies of Epidemic Typhus

PubMed Central

Zhu, Yong; Fournier, Pierre-Edouard; Ogata, Hiroyuki; Raoult, Didier

2005-01-01

Currently, there is no tool for typing Rickettsia prowazekii, the causative agent of epidemic typhus, currently considered a potential bioterrorism agent, at the strain level. To test if the multispacer typing (MST) method could differentiate strains of R. prowazekii, we amplified and sequenced the 25 most variable intergenic spacers between the R. prowazekii and R. conorii genomes in five strains and 10 body louse amplicons of R. prowazekii from various geographic origins. Two intergenic spacers, i.e., rpmE/tRNAfMet and serS/virB4, were variable among tested R. prowazekii isolates and allowed identification of three and two genotypes, respectively. When the genotypes obtained from the two spacers were combined, we identified four different genotypes. MST demonstrated that several R. prowazekii strains circulated in human body lice during an outbreak of epidemic typhus in Burundi. This may help to discriminate between natural and intentional outbreaks. Our study supports the usefulness of MST as a versatile method for rickettsial strain genotyping. PMID:16145131
Multispacer typing of Rickettsia prowazekii enabling epidemiological studies of epidemic typhus.

PubMed

Zhu, Yong; Fournier, Pierre-Edouard; Ogata, Hiroyuki; Raoult, Didier

2005-09-01

Currently, there is no tool for typing Rickettsia prowazekii, the causative agent of epidemic typhus, currently considered a potential bioterrorism agent, at the strain level. To test if the multispacer typing (MST) method could differentiate strains of R. prowazekii, we amplified and sequenced the 25 most variable intergenic spacers between the R. prowazekii and R. conorii genomes in five strains and 10 body louse amplicons of R. prowazekii from various geographic origins. Two intergenic spacers, i.e., rpmE/tRNA(fMet) and serS/virB4, were variable among tested R. prowazekii isolates and allowed identification of three and two genotypes, respectively. When the genotypes obtained from the two spacers were combined, we identified four different genotypes. MST demonstrated that several R. prowazekii strains circulated in human body lice during an outbreak of epidemic typhus in Burundi. This may help to discriminate between natural and intentional outbreaks. Our study supports the usefulness of MST as a versatile method for rickettsial strain genotyping.
Bat white-nose syndrome: a real-time TaqMan polymerase chain reaction test targeting the intergenic spacer region of Geomyces destructans.

USGS Publications Warehouse

Muller, Laura K.; Lorch, Jeffrey M.; Lindner, Daniel L.; O'Connor, Michael; Gargas, Andrea; Blehert, David S.

2013-01-01

The fungus Geomyces destructans is the causative agent of white-nose syndrome (WNS), a disease that has killed millions of North American hibernating bats. We describe a real-time TaqMan PCR test that detects DNA from G. destructans by targeting a portion of the multicopy intergenic spacer region of the rRNA gene complex. The test is highly sensitive, consistently detecting as little as 3.3 fg of genomic DNA from G. destructans. The real-time PCR test specifically amplified genomic DNA from G. destructans but did not amplify target sequence from 54 closely related fungal isolates (including 43 Geomyces spp. isolates) associated with bats. The test was further qualified by analyzing DNA extracted from 91 bat wing skin samples, and PCR results matched histopathology findings. These data indicate the real-time TaqMan PCR method described herein is a sensitive, specific, and rapid test to detect DNA from G. destructans and provides a valuable tool for WNS diagnostics and research.
Biophysical Constraints Arising from Compositional Context in Synthetic Gene Networks.

PubMed

Yeung, Enoch; Dy, Aaron J; Martin, Kyle B; Ng, Andrew H; Del Vecchio, Domitilla; Beck, James L; Collins, James J; Murray, Richard M

2017-07-26

Synthetic gene expression is highly sensitive to intragenic compositional context (promoter structure, spacing regions between promoter and coding sequences, and ribosome binding sites). However, much less is known about the effects of intergenic compositional context (spatial arrangement and orientation of entire genes on DNA) on expression levels in synthetic gene networks. We compare expression of induced genes arranged in convergent, divergent, or tandem orientations. Induction of convergent genes yielded up to 400% higher expression, greater ultrasensitivity, and dynamic range than divergent- or tandem-oriented genes. Orientation affects gene expression whether one or both genes are induced. We postulate that transcriptional interference in divergent and tandem genes, mediated by supercoiling, can explain differences in expression and validate this hypothesis through modeling and in vitro supercoiling relaxation experiments. Treatment with gyrase abrogated intergenic context effects, bringing expression levels within 30% of each other. We rebuilt the toggle switch with convergent genes, taking advantage of supercoiling effects to improve threshold detection and switch stability. Copyright © 2017 Elsevier Inc. All rights reserved.
The mini-exon genes of three Phytomonas isolates that differ in plant tissue tropism.

PubMed

Sturm, N R; Fernandes, O; Campbell, D A

1995-08-01

The tandem mini-exon gene repeat is an ideal diagnostic target for trypanosomatids because it includes sequences that are conserved absolutely coupled with regions of extreme variability. We have exploited these features and the polymerase chain reaction to differentiate Phytomonas strains isolated from phloem, fruit or latex of various host plants. While the transcribed regions are nearly identical, the intergenic sequences are variable in size and content (130-332 base pairs). The mini-exon genes of these phytomonads can therefore be distinguished from each other and from the corresponding genes in insect trypanosomes, with which they are often confused.
PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches

PubMed Central

Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David

2001-01-01

Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681
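For orientation, the "naïve consensus pattern search" that the abstract uses as its baseline can be sketched in a few lines of Python. This is an illustrative sketch only: it does not implement the PEA metric or PROSPECT itself, it scans a single strand, and the gene names, sequences, and motif below are invented for the example.

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def consensus_to_regex(consensus):
    """Compile an IUPAC consensus into a regex that also finds overlapping hits."""
    body = "".join(IUPAC[c] for c in consensus.upper())
    # The lookahead lets finditer report matches that overlap one another.
    return re.compile("(?=(" + body + "))")

def search_upstream(upstream, consensus):
    """Map each gene name to the 0-based start positions of consensus matches."""
    pattern = consensus_to_regex(consensus)
    return {
        gene: [m.start() for m in pattern.finditer(seq.upper())]
        for gene, seq in upstream.items()
    }

# Hypothetical upstream regions and a hypothetical degenerate motif.
upstream_regions = {"geneA": "TTTGACGTCATT", "geneB": "AAAAAAAA"}
hits = search_upstream(upstream_regions, "TGACGTMA")
print(hits)
```

A search like this reports every sequence match regardless of biological context, which is the source of the false positives the abstract describes; the approach in the abstract addresses this by additionally weighing whether genes sharing a site also share expression profiles.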
The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

PubMed Central

Robinett, C C; O'Connor, A; Dunaway, M

1997-01-01

We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359
Intergenic Sequence Comparison of Escherichia coli Isolates Reveals Lifestyle Adaptations but Not Host Specificity

PubMed Central

White, A. P.; Sibley, K. A.; Sibley, C. D.; Wasmuth, J. D.; Schaefer, R.; Surette, M. G.; Edge, T. A.; Neumann, N. F.

2011-01-01

Establishing the risk of human infection is one of the goals of public health. For bacterial pathogens, the virulence and zoonotic potential can often be related to their host source. Escherichia coli bacteria are common contaminants of water associated with human recreation and consumption, and many strains are pathogenic. In this study, we analyzed three promoter-containing intergenic regions from 284 diverse E. coli isolates in an attempt to identify molecular signatures associated with specific host types. Promoter sequences controlling production of curli fimbriae, flagella, and nutrient import yielded a phylogenetic tree with isolates clustered by established phylogenetic grouping (A, B1, B2, and D) but not by host source. Virulence genes were more prevalent in groups B2 and D isolates and in human isolates. Group B1 isolates, primarily from nonhuman sources, were the most genetically similar, indicating that they lacked molecular adaptations to specific host environments and were likely host generalists. Conversely, B2 isolates, primarily from human sources, displayed greater genetic distances and were more likely to be host adapted. In agreement with these hypotheses, prevalence of σS activity and the rdar morphotype, phenotypes associated with environmental survival, were significantly higher in B1 isolates than in B2 isolates. Based on our findings, we speculate that E. coli host specificity is not defined by genome-wide sequence changes but, rather, by the presence or absence of specific genes and associated promoter elements. Furthermore, the requirements for colonization of the human gastrointestinal tract may lead to E. coli lifestyle changes along with selection for increased virulence. PMID:21908635
Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

PubMed

Kawaguchi, Risa; Kiryu, Hisanori

2016-05-06

RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR show a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. 
We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

Variability among the Most Rapidly Evolving Plastid Genomic Regions is Lineage-Specific: Implications of Pairwise Genome Comparisons in Pyrus (Rosaceae) and Other Angiosperms for Marker Choice

PubMed Central

Ter-Voskanyan, Hasmik; Allgaier, Martin; Borsch, Thomas

2014-01-01

Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae)—a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC–trnV, trnR–atpA, ndhF–rpl32, psbM–trnD, and trnQ–rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters). Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid), Olea (asterids) and Cymbidium (monocots) showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF–rpl32 and trnK–rps16) were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations. 
Sequencing whole plastid genomes to find markers for evolutionary analyses is therefore particularly useful when overall genetic distances are low. PMID:25405773
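The p-distance quoted in the abstract above is the proportion of aligned sites at which two sequences differ. A minimal sketch of that calculation follows; the function name `p_distance` and the choice to skip gap and `N` columns are assumptions of this example, not details taken from the study.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap ('-') or an ambiguous base ('N')
    are skipped, so indels do not contribute (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # not a comparable site
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites in the alignment")
    return differences / compared


if __name__ == "__main__":
    # Toy alignment: one substitution over eight comparable sites.
    print(p_distance("ACGTACGT", "ACGTACGA"))
```

On this scale, a value of 0.00145 corresponds to roughly 1.45 substitutions per 1,000 compared sites.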
Comparative analysis of seven viral nuclear export signals (NESs) reveals the crucial role of nuclear export mediated by the third NES consensus sequence of nucleoprotein (NP) in influenza A virus replication.

PubMed

Chutiwitoonchai, Nopporn; Kakisaka, Michinori; Yamada, Kazunori; Aida, Yoko

2014-01-01

The assembly of influenza virus progeny virions requires machinery that exports viral genomic ribonucleoproteins from the cell nucleus. Currently, seven nuclear export signal (NES) consensus sequences have been identified in different viral proteins, including NS1, NS2, M1, and NP. The present study examined the roles of viral NES consensus sequences and their significance in terms of viral replication and nuclear export. Mutation of the NP-NES3 consensus sequence resulted in a failure to rescue viruses using a reverse genetics approach, whereas mutation of the NS2-NES1 and NS2-NES2 sequences led to a strong reduction in viral replication kinetics compared with the wild-type sequence. While the viral replication kinetics for other NES mutant viruses were also lower than those of the wild-type, the difference was not so marked. Immunofluorescence analysis after transient expression of NP-NES3, NS2-NES1, or NS2-NES2 proteins in host cells showed that they accumulated in the cell nucleus. These results suggest that the NP-NES3 consensus sequence is mostly required for viral replication. Therefore, each of the hydrophobic (Φ) residues within this NES consensus sequence (Φ1, Φ2, Φ3, or Φ4) was mutated, and its viral replication and nuclear export function were analyzed. No viruses harboring NP-NES3 Φ2 or Φ3 mutants could be rescued. Consistent with this, the NP-NES3 Φ2 and Φ3 mutants showed reduced binding affinity with CRM1 in a pull-down assay, and both accumulated in the cell nucleus. Indeed, a nuclear export assay revealed that these mutant proteins showed lower nuclear export activity than the wild-type protein. Moreover, the Φ2 and Φ3 residues (along with other Φ residues) within the NP-NES3 consensus were highly conserved among different influenza A viruses, including human, avian, and swine. Taken together, these results suggest that the Φ2 and Φ3 residues within the NP-NES3 protein are important for its nuclear export function during viral replication.
The complete mitochondrial genome of the bag-shelter moth Ochrogaster lunifer (Lepidoptera, Notodontidae)

PubMed Central

Salvato, Paola; Simonato, Mauro; Battisti, Andrea; Negrisolo, Enrico

2008-01-01

Background Knowledge of animal mitochondrial genomes is very important for understanding their molecular evolution as well as for phylogenetic and population genetic studies. The Lepidoptera encompasses more than 160,000 described species and is one of the largest insect orders. To date only nine lepidopteran mitochondrial DNAs have been fully sequenced and two others partly sequenced. Furthermore, the taxon sampling is very scant. Thus the advance of lepidopteran mitogenomics requires new genomes derived from a broad taxon sampling. In the present work we describe the mitochondrial genome of the moth Ochrogaster lunifer. Results The mitochondrial genome of O. lunifer is a circular molecule 15593 bp long. It includes the entire set of 37 genes usually present in animal mitochondrial genomes. It also contains 7 intergenic spacers. The gene order of the newly sequenced genome is typical for Lepidoptera and differs from the insect ancestral type in the placement of trnM. The 77.84% A+T content of its α strand is the lowest among known lepidopteran genomes. The mitochondrial genome of O. lunifer exhibits one of the most marked C-skews among available insect Pterygota genomes. The protein-coding genes have typical mitochondrial start codons except for cox1, which has an unusual CGA. The O. lunifer genome exhibits the least biased synonymous codon usage among lepidopterans. Comparative genomic analysis identified atp6, cox1, cox2, cox3, cob, nad1, nad2, nad4, and nad5 as potential markers for population genetics/phylogenetics studies. A peculiar feature of the O. lunifer mitochondrial genome is that the intergenic spacers are mostly made up of repetitive sequences. Conclusion The mitochondrial genome of O. lunifer is the first representative of the superfamily Noctuoidea, which accounts for about 40% of all described Lepidoptera. The new genome shares many features with other known lepidopteran genomes. It differs, however, in its low A+T content and marked C-skew.
Compared to other lepidopteran genomes it is less biased in synonymous codon usage. Comparative evolutionary analysis of lepidopteran mitochondrial genomes allowed the identification of previously neglected coding genes as potential phylogenetic markers. Presence of repetitive elements in intergenic spacers of O. lunifer genome supports the role of DNA slippage as possible mechanism to produce spacers during replication. PMID:18627592
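The A+T content and strand skew reported in the abstract above can be computed from simple base counts. The sketch below assumes the commonly used definitions AT skew = (A - T)/(A + T) and GC skew = (G - C)/(G + C), under which a negative GC skew indicates an excess of C (a C-skew). The function name `composition_stats` is illustrative, and the abstract does not state which formula the authors applied.

```python
from collections import Counter


def composition_stats(seq: str) -> dict:
    """Return A+T content, AT skew and GC skew for one DNA strand.

    Only unambiguous bases (A, C, G, T) are counted.
    """
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    total = a + c + g + t
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {
        "at_content": (a + t) / total,
        "at_skew": (a - t) / (a + t) if (a + t) else 0.0,
        "gc_skew": (g - c) / (g + c) if (g + c) else 0.0,
    }


if __name__ == "__main__":
    # Toy strand: 3 A, 2 T, 3 C, 2 G.
    print(composition_stats("AAATTCCCGG"))
```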
Insertion of a self-splicing intron into the mtDNA of a triploblastic animal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valles, Y.; Halanych, K.; Boore, J.L.

2006-04-14

Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features such as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size, allowing for faster replication (booremoritz19981995Lynch2005). Yet, in 1996 two group I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking a long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundred triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kb in length. This finding also represents the first instance of a group II intron (anthozoans harbor group I introns) in any metazoan lineage. Opposite trends are observed within plant, fungal and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both plant and fungal mtDNAs are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, like metazoans they have a conserved gene content. Protists, on the other hand, have a striking variation of gene content and introns that accounts for the genome size variation.
In contrast to this mtDNA structure and organization diversity, current genome level studies point to a monophyletic origin of the mitochondria (REFS), raising questions such as: what are the pressures at work shaping the evolution of the mitochondrial genome at 'higher' levels? What drives the absence of introns and other non-coding spacers in metazoan mtDNA? What characteristics must an intron have to be maintained in an environment where 'extra chromosomes' are usually selected against?
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
Bacterial 16S rRNA gene analysis revealed that bacteria related to Arcobacter spp. constitute an abundant and common component of the oyster microbiota (Tiostrea chilensis).

PubMed

Romero, J; García-Varela, M; Laclette, J P; Espejo, R T

2002-11-01

To explore the bacterial microbiota in Chilean oyster (Tiostrea chilensis), a molecular approach that permits detection of different bacteria, independently of their capacity to grow in culture media, was used. Bacterial diversity was assessed by analysis of both the 16S rDNA and the 16S-23S intergenic region, obtained by PCR amplifications of DNA extracted from depurated oysters. RFLP of the PCR amplified 16S rDNA showed a prevailing pattern in most of the individuals analyzed, indicating that a few bacterial species were relatively abundant and common in oysters. Cloning and sequencing of the 16S rDNA with the prevailing RFLP pattern indicated that this rRNA was most closely related to Arcobacter spp. However, analysis by the size of the amplified 16S-23S rRNA intergenic regions revealed not Arcobacter spp. but Staphylococcus spp. related bacteria as a major and common component in oyster. These different results may be caused by the absence of target for one of the primers employed for amplification of the intergenic region. Neither of the two bacteria species found in large abundance was recovered after culturing under aerobic, anaerobic, or microaerophilic conditions. This result, however, is expected because the number of bacteria recovered after cultivation was less than 0.01% of the total. All together, these observations suggest that Arcobacter-related strains are probably abundant and common in the Chilean oyster bacterial microbiota.
The Functional Human C-Terminome

PubMed Central

Hedden, Michael; Lyon, Kenneth F.; Brooks, Steven B.; David, Roxanne P.; Limtong, Justin; Newsome, Jacklyn M.; Novakovic, Nemanja; Rajasekaran, Sanguthevar; Thapar, Vishal; Williams, Sean R.; Schiller, Martin R.

2016-01-01

All translated proteins end with a carboxylic acid commonly called the C-terminus. Many short functional sequences (minimotifs) are located on or immediately proximal to the C-terminus. However, information about the function of protein C-termini has not been consolidated into a single source. Here, we built a new “C-terminome” database and web system focused on human proteins. Approximately 3,600 C-termini in the human proteome have a minimotif with an established molecular function. To help evaluate the function of the remaining C-termini in the human proteome, we inferred minimotifs identified by experimentation in rodent cells, predicted minimotifs based upon consensus sequence matches, and predicted novel highly repetitive sequences in C-termini. Predictions can be ranked by enrichment scores or Gene Evolutionary Rate Profiling (GERP) scores, a measurement of evolutionary constraint. By searching for new anchored sequences on the last 10 amino acids of proteins in the human proteome with lengths between 3–10 residues and up to 5 degenerate positions in the consensus sequences, we have identified new consensus sequences that predict instances in the majority of human genes. All of this information is consolidated into a database that can be accessed through a C-terminome web system with search and browse functions for minimotifs and human proteins. A known consensus sequence-based predicted function is assigned to nearly half the proteins in the human proteome. Weblink: http://cterminome.bio-toolkit.com. PMID:27050421
Two distinct Photobacterium populations thrive in ancient Mediterranean sapropels.

PubMed

Süss, Jacqueline; Herrmann, Kerstin; Seidel, Michael; Cypionka, Heribert; Engelen, Bert; Sass, Henrik

2008-04-01

Eastern Mediterranean sediments are characterized by the periodic occurrence of conspicuous, organic matter-rich sapropel layers. Phylogenetic analysis of a large culture collection isolated from these sediments revealed that about one third of the isolates belonged to the genus Photobacterium. In the present study, 22 of these strains were examined with respect to their phylogenetic and metabolic diversity. The strains belonged to two distinct Photobacterium populations (Mediterranean cluster I and II). Strains of cluster I were isolated almost exclusively from organic-rich sapropel layers and were closely affiliated with P. aplysiae (based on their 16S rRNA gene sequences). They possessed almost identical Enterobacterial Repetitive Intergenic Consensus (ERIC) and substrate utilization patterns, even among strains from different sampling sites or from layers differing up to 100,000 years in age. Strains of cluster II originated from sapropels and from the surface and carbon-lean intermediate layers. They were related to Photobacterium frigidiphilum but differed significantly in their fingerprint patterns and substrate spectra, even when these strains were obtained from the same sampling site and layer. Temperature range for growth (4 to 33 degrees C), salinity tolerance (5 to 100 per thousand), pH requirements (5.5-9.3), and the composition of polar membrane lipids were similar for both clusters. All strains grew by fermentation (glucose, organic acids) and all but five by anaerobic respiration (nitrate, dimethyl sulfoxide, anthraquinone disulfonate, or humic acids). These results indicate that the genus Photobacterium forms subsurface populations well adapted to life in the deep biosphere.
Open reading frame 5 (ORF5), encoding a ferredoxinlike protein, and nifQ are cotranscribed with nifE, nifN, nifX, and ORF4 in Rhodobacter capsulatus.

PubMed Central

Moreno-Vivian, C; Hennecke, S; Pühler, A; Klipp, W

1989-01-01

DNA sequence analysis of a 1,600-base-pair fragment located downstream of nifENX in nif region A of Rhodobacter capsulatus revealed two additional open reading frames (ORFs): ORF5, encoding a ferredoxinlike protein, and nifQ. The ferredoxinlike gene product contained two cysteine motifs, typical of ferredoxins coordinating two 4Fe-4S clusters, but the distance between these two motifs was unusual for low-molecular-weight ferredoxins. The R. capsulatus nifQ gene product shared a high degree of homology with Klebsiella pneumoniae and Azotobacter vinelandii NifQ, including a typical cysteine motif located in the C-terminal part. nifQ insertion mutants and also an ORF5-nifQ double deletion mutant showed normal diazotrophic growth only in the presence of high concentrations of molybdate. This demonstrated that the gene encoding the ferredoxinlike protein is not essential for nitrogen fixation. No NifA-activated consensus promoter could be found in the intergenic region between nifENX-ORF4 and ORF5-nifQ. Analyses of a nifQ-lacZYA fusion revealed that transcription of nifQ was initiated at a promoter in front of nifE. In contrast to other nitrogen-fixing organisms, R. capsulatus nifE, nifN, nifX, ORF4, ORF5, and nifQ were organized in one transcriptional unit. PMID:2708314
Prevalence and Characteristics of Salmonella Isolated from Free-Range Chickens in Shandong Province, China

PubMed Central

Gao, Yanxia; Ye, Chaoqun; Yang, Lingling; Wang, Tao

2016-01-01

Compared with chickens raised in intensively managed breeding farms, free-range chickens in China are quite popular due to lower breeding density and less antibiotics usage. However, investigations about Salmonella enterica from free-range chickens are quite rare. The aim of the present study was to investigate prevalence and characteristics of Salmonella in free-range chickens in Shandong province, China. During the period of August and November 2015, 300 fresh fecal swabs from different broilers in three free-range chicken farms (100 samples per farm) were collected to isolate Salmonella, and then these isolates were subjected to serotyping, antibiotic sensitivity testing, enterobacterial repetitive intergenic consensus-polymerase chain reaction (ERIC-PCR), and multilocus sequence typing (ST). A total of 38 Salmonella isolates (38/300, 12.7%) were recovered. The most common serotype was Enteritidis (81.6%), followed by Indiana (13.2%) and Typhimurium (5.3%). Twenty-two out of 38 isolates (57.9%) were resistant to ampicillin, the highest resistance rate, but resistance rates to cefazolin, cefotaxime, and ceftazidime were only 7.9%. The multidrug resistance (MDR) rate was 26.3%. Additionally, the Salmonella isolates could be classified into 25 genotypes by ERIC-PCR and were divided into three ST types (ST11, ST17, and ST19), with ST11 the highest isolation rate (81.6%). In summary, as with other poultry, free-ranging chickens may also serve as potential reservoir for antibiotic resistant Salmonella, thereby posing a threat to public health. PMID:27800493
Prevalence and Characteristics of Salmonella Isolated from Free-Range Chickens in Shandong Province, China.

PubMed

Zhao, Xiaonan; Gao, Yanxia; Ye, Chaoqun; Yang, Lingling; Wang, Tao; Chang, Weishan

2016-01-01

Compared with chickens raised in intensively managed breeding farms, free-range chickens in China are quite popular due to lower breeding density and less antibiotics usage. However, investigations about Salmonella enterica from free-range chickens are quite rare. The aim of the present study was to investigate prevalence and characteristics of Salmonella in free-range chickens in Shandong province, China. During the period of August and November 2015, 300 fresh fecal swabs from different broilers in three free-range chicken farms (100 samples per farm) were collected to isolate Salmonella, and then these isolates were subjected to serotyping, antibiotic sensitivity testing, enterobacterial repetitive intergenic consensus-polymerase chain reaction (ERIC-PCR), and multilocus sequence typing (ST). A total of 38 Salmonella isolates (38/300, 12.7%) were recovered. The most common serotype was Enteritidis (81.6%), followed by Indiana (13.2%) and Typhimurium (5.3%). Twenty-two out of 38 isolates (57.9%) were resistant to ampicillin, the highest resistance rate, but resistance rates to cefazolin, cefotaxime, and ceftazidime were only 7.9%. The multidrug resistance (MDR) rate was 26.3%. Additionally, the Salmonella isolates could be classified into 25 genotypes by ERIC-PCR and were divided into three ST types (ST11, ST17, and ST19), with ST11 the highest isolation rate (81.6%). In summary, as with other poultry, free-ranging chickens may also serve as potential reservoir for antibiotic resistant Salmonella, thereby posing a threat to public health.
A NodD-like protein activates transcription of genes involved with naringenin degradation in a flavonoid-dependent manner in Herbaspirillum seropedicae.

PubMed

Wassem, R; Marin, A M; Daddaoua, A; Monteiro, R A; Chubatsu, L S; Ramos, J L; Deakin, W J; Broughton, W J; Pedrosa, F O; Souza, E M

2017-03-01

Herbaspirillum seropedicae is an associative, endophytic non-nodulating diazotrophic bacterium that colonises several grasses. An ORF encoding a LysR-type transcriptional regulator, very similar to NodD proteins of rhizobia, was identified in its genome. This nodD-like gene, named fdeR, is divergently transcribed from an operon encoding enzymes involved in flavonoid degradation (fde operon). Apigenin, chrysin, luteolin and naringenin strongly induce transcription of the fde operon, but not that of the fdeR, in an FdeR-dependent manner. The intergenic region between fdeR and fdeA contains several generic LysR consensus sequences (T-N11-A) and we propose a binding site for FdeR, which is conserved in other bacteria. DNase I foot-printing revealed that the interaction with the FdeR binding site is modified by the four flavonoids that stimulate transcription of the fde operon. Moreover, FdeR binds naringenin and chrysin as shown by isothermal titration calorimetry. Interestingly, FdeR also binds in vitro to the nod-box from the nodABC operon of Rhizobium sp. NGR234 and is able to activate its transcription in vivo. These results show that FdeR exhibits two features of rhizobial NodD proteins: nod-box recognition and flavonoid-dependent transcription activation, but its role in H. seropedicae and related organisms seems to have evolved to control flavonoid metabolism. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
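The generic LysR consensus mentioned in the abstract above (a T, any 11 nucleotides, then an A) can be located with a simple pattern scan. This is only an illustrative sketch: the function name `find_t_n11_a` is hypothetical, only one strand is scanned, and such a degenerate motif is expected to match often by chance, so hits are candidates rather than demonstrated binding sites.

```python
import re

# T, then any 11 unambiguous bases, then A. The lookahead lets
# overlapping occurrences be reported as separate hits.
T_N11_A = re.compile(r"(?=(T[ACGT]{11}A))")


def find_t_n11_a(seq: str) -> list:
    """Return (0-based start, matched 13-mer) for every T-N11-A occurrence."""
    return [(m.start(), m.group(1)) for m in T_N11_A.finditer(seq.upper())]


if __name__ == "__main__":
    # Toy sequence with a single occurrence starting at position 2.
    print(find_t_n11_a("GG" + "T" + "C" * 11 + "A" + "GG"))
```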
Comparative sequence analysis of the X-inactivation center region in mouse, human, and bovine.

PubMed

Chureau, Corinne; Prissette, Marine; Bourdet, Agnès; Barbe, Valérie; Cattolico, Laurence; Jones, Louis; Eggen, André; Avner, Philip; Duret, Laurent

2002-06-01

We have sequenced to high levels of accuracy 714-kb and 233-kb regions of the mouse and bovine X-inactivation centers (Xic), respectively, centered on the Xist gene. This has provided the basis for a fully annotated comparative analysis of the mouse Xic with the 2.3-Mb orthologous region in human and has allowed a three-way species comparison of the core central region, including the Xist gene. These comparisons have revealed conserved genes, both coding and noncoding, conserved CpG islands and, more surprisingly, conserved pseudogenes. The distribution of repeated elements, especially LINE repeats, in the mouse Xic region when compared to the rest of the genome does not support the hypothesis of a role for these repeat elements in the spreading of X inactivation. Interestingly, an asymmetric distribution of LINE elements on the two DNA strands was observed in the three species, not only within introns but also in intergenic regions. This feature is suggestive of important transcriptional activity within these intergenic regions. In silico prediction followed by experimental analysis has allowed four new genes, Cnbp2, Ftx, Jpx, and Ppnx, to be identified and novel, widespread, complex, and apparently noncoding transcriptional activity to be characterized in a region 5' of Xist that was recently shown to attract histone modification early after the onset of X inactivation.
Genetic Diversity and Evolution of Bradyrhizobium Populations Nodulating Erythrophleum fordii, an Evergreen Tree Indigenous to the Southern Subtropical Region of China

PubMed Central

Yao, Yao; Wang, Rui; Lu, Jun Kun; Wang, En Tao; Chen, Wen Xin

2014-01-01

The nodulation of Erythrophleum fordii has been recorded recently, but its microsymbionts have never been studied. To investigate the diversity and biogeography of rhizobia associated with this leguminous evergreen tree, root nodules were collected from the southern subtropical region of China. A total of 166 bacterial isolates were obtained from the nodules and characterized. In a PCR-based restriction fragment length polymorphism (RFLP) analysis of ribosomal intergenic sequences, the isolates were classified into 22 types within the genus Bradyrhizobium. Sequence analysis of 16S rRNA, ribosomal intergenic spacer (IGS), and the housekeeping genes recA and glnII classified the isolates into four groups: the Bradyrhizobium elkanii and Bradyrhizobium pachyrhizi groups, comprising the dominant symbionts, Bradyrhizobium yuanmingense, and an unclassified group comprising the minor symbionts. The nodC and nifH phylogenetic trees defined five or six lineages among the isolates, which was largely consistent with the definition of genomic species. The phylogenetic results and evolutionary analysis demonstrated that mutation and vertical transmission of genes were the principal processes for the divergent evolution of Bradyrhizobium species associated with E. fordii, while lateral transfer and recombination of housekeeping and symbiotic genes were rare. The distribution of the dominant rhizobial populations was affected by soil pH and effective phosphorus. This is the first report to characterize E. fordii rhizobia. PMID:25085491
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

PubMed Central

Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

1993-01-01

The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated, were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. PMID:7909943
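The seven-position consensus given in the abstract above (A, A/T, T, A/T, A, T, A/G) translates directly into a degenerate pattern that can be scanned on both strands of a sequence. The sketch below is one way to do this; the names `find_cdxa_sites` and `reverse_complement` are illustrative, and a pattern match only flags a candidate site and says nothing about binding affinity.

```python
import re

# A, A/T, T, A/T, A, T, A/G written as a regular expression; the
# lookahead allows overlapping candidate sites to be reported.
CDXA_CONSENSUS = re.compile(r"(?=(A[AT]T[AT]AT[AG]))")
SITE_LENGTH = 7
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(_COMPLEMENT)[::-1]


def find_cdxa_sites(seq: str) -> list:
    """Return sorted (start, strand, site) tuples for consensus matches.

    `start` is the 0-based leftmost coordinate on the input sequence;
    `site` is read 5'->3' on the strand where the match was found.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in CDXA_CONSENSUS.finditer(seq)]
    rc = reverse_complement(seq)
    for m in CDXA_CONSENSUS.finditer(rc):
        # Convert the reverse-strand coordinate back to the input sequence.
        start = len(seq) - (m.start() + SITE_LENGTH)
        hits.append((start, "-", m.group(1)))
    return sorted(hits)


if __name__ == "__main__":
    print(find_cdxa_sites("GGATTTATGCC"))
```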
A ribosomal orphon sequence from Xenopus laevis flanked by novel low copy number repetitive elements.

PubMed

Guimond, A; Moss, T

1999-02-01

We have used a differential cloning approach to isolate ribosomal/non-ribosomal frontier sequences from Xenopus laevis. A ribosomal intergenic spacer sequence (IGS) was cloned and shown not to be physically linked with the ribosomal locus. This ribosomal orphon contained the IGS sequences found immediately downstream of the 28S gene and included an array of enhancer repetitions and a non-functional spacer promoter. The orphon sequence was flanked by a member of the novel 'Frt' low copy repetitive element family. Three individual Frt repeats were sequenced and all members of this family were shown to lie clustered at two chromosomal sites, one of which contained the ribosomal orphon. One of the Frt elements contained an insertion of 297 bp that showed extensive homology to sequences within at least three other Xenopus genes. Each homology region was flanked by members of the T2 family of short interspersed repetitive elements, (SINEs), and by its target insertion sequence, suggesting multiple translocation events. The data are discussed in terms of the evolution of the ribosomal gene locus.
Reemergence of rabies in the southern Han river region, Korea.

PubMed

Oem, Jae-Ku; Kim, Seong-Hee; Kim, Yeon-Hee; Lee, Myoung-Heon; Lee, Kyoung-Ki

2014-07-01

Recently, 11 cases of animal rabies were reported in the southern region (Suwon and Hwaseong cities) of Gyeonggi Province, South Korea. The cases were temporally separated into two cases in dogs (Canis lupus familiaris) in spring 2012 and nine cases in domestic animals and wildlife in winter 2012-13. All carcasses were submitted for histopathologic examination and viral antigen identification. Sequences of the glycoprotein, nucleoprotein, and glycoprotein-large polymerase protein intergenic noncoding loci of the 11 strains were determined and compared with published reference sequences. All rabies strains were closely related to the Gangwon strains isolated in 2008-09, suggesting that the rabies virus strains isolated in Gyeonggi were introduced from Gangwon Province.
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as Applied Biosystems instruments, generate chromatogram files. Generally, one region of a sequence is sequenced with both forward and reverse primers, which yields two sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. At full capability, ASAP can read "*.ab1" chromatogram files through a command line interface, convert them to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as a Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
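Two of the steps listed in the abstract above, reverse-complementing the reverse read and building a consensus, can be illustrated with a short sketch. This is not ASAP's implementation: it assumes the two reads are already trimmed to the same region and length, ignores base qualities, and simply writes N wherever the reads disagree. The function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A, C, G, T, N only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def naive_consensus(forward: str, reverse: str) -> str:
    """Merge a forward read with the reverse complement of a reverse read.

    Assumes both reads cover exactly the same region, so no alignment is
    performed. Agreeing bases are kept, an N in one read is filled from
    the other, and conflicting calls become N.
    """
    rev = reverse_complement(reverse)
    if len(forward) != len(rev):
        raise ValueError("reads must cover the same region at equal length")
    consensus = []
    for f, r in zip(forward.upper(), rev):
        if f == r:
            consensus.append(f)
        elif f == "N":
            consensus.append(r)
        elif r == "N":
            consensus.append(f)
        else:
            consensus.append("N")  # unresolved disagreement
    return "".join(consensus)


if __name__ == "__main__":
    # The reverse read "ACCTT" reverse-complements to "AAGGT",
    # which conflicts with the forward read at the third base.
    print(naive_consensus("AACGT", "ACCTT"))
```

A real pipeline would align the reads first and weigh conflicting calls by their quality scores rather than masking them.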
Analysis of Complete Nucleotide Sequences of 12 Gossypium Chloroplast Genomes: Origin and Evolution of Allotetraploids

PubMed Central

Xu, Qin; Xiong, Guanjun; Li, Pengbo; He, Fei; Huang, Yi; Wang, Kunbo; Li, Zhaohu; Hua, Jinping

2012-01-01

Background Cotton (Gossypium spp.) is a model system for the analysis of polyploidization. Although ascertaining the donor species of allotetraploid cotton has been intensively studied, sequence comparison of Gossypium chloroplast genomes is still of interest to understand the mechanisms underlining the evolution of Gossypium allotetraploids, while it is generally accepted that the parents were A- and D-genome containing species. Here we performed a comparative analysis of 13 Gossypium chloroplast genomes, twelve of which are presented here for the first time. Methodology/Principal Findings The size of 12 chloroplast genomes under study varied from 159,959 bp to 160,433 bp. The chromosomes were highly similar having >98% sequence identity. They encoded the same set of 112 unique genes which occurred in a uniform order with only slightly different boundary junctions. Divergence due to indels as well as substitutions was examined separately for genome, coding and noncoding sequences. The genome divergence was estimated as 0.374% to 0.583% between allotetraploid species and A-genome, and 0.159% to 0.454% within allotetraploids. Forty protein-coding genes were completely identical at the protein level, and 20 intergenic sequences were completely conserved. The 9 allotetraploids shared 5 insertions and 9 deletions in whole genome, and 7-bp substitutions in protein-coding genes. The phylogenetic tree confirmed a close relationship between allotetraploids and the ancestor of A-genome, and the allotetraploids were divided into four separate groups. Progenitor allotetraploid cotton originated 0.43–0.68 million years ago (MYA). Conclusion Despite high degree of conservation between the Gossypium chloroplast genomes, sequence variations among species could still be detected. Gossypium chloroplast genomes preferred for 5-bp indels and 1–3-bp indels are mainly attributed to the SSR polymorphisms. 
This study supports that the common ancestor of diploid A-genome species in Gossypium is the maternal source of extant allotetraploid species and allotetraploids have a monophyletic origin. G. hirsutum AD1 lineages have experienced more sequence variations than other allotetraploids in intergenic regions. The available complete nucleotide sequences of 12 Gossypium chloroplast genomes should facilitate studies to uncover the molecular mechanisms of compartmental co-evolution and speciation of Gossypium allotetraploids. PMID:22876273
Structural analysis of the rDNA intergenic spacer of Brassica nigra: evolutionary divergence of the spacers of the three diploid Brassica species.

PubMed

Bhatia, S; Singh Negi, M; Lakshmikumaran, M

1996-11-01

EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families, RF 'A' and RF 'B'. The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21 bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over. A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species (namely, B. nigra, B. campestris, and B. oleracea) was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. 
Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.

First full-length genome sequence of the polerovirus luffa aphid-borne yellows virus (LABYV) reveals the presence of at least two consensus sequences in an isolate from Thailand.

PubMed

Knierim, Dennis; Maiss, Edgar; Kenyon, Lawrence; Winter, Stephan; Menzel, Wulf

2015-10-01

Luffa aphid-borne yellows virus (LABYV) was proposed as the name for a previously undescribed polerovirus based on partial genome sequences obtained from samples of cucurbit plants collected in Thailand between 2008 and 2013. In this study, we determined the first full-length genome sequence of LABYV. Based on phylogenetic analysis and genome properties, it is clear that this virus represents a distinct species in the genus Polerovirus. Analysis of sequences from sample TH24, which was collected in 2010 from a luffa plant in Thailand, reveals the presence of two different full-length genome consensus sequences.
Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies.

PubMed

Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L

2018-01-01

Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.
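The clustering step described above (pairwise alignments grouped into families by single linkage) can be sketched as follows. This is a minimal illustration, not CARP's actual implementation: the input format (a list of ID pairs, one per pairwise hit), the function name `single_linkage_families`, and the use of a union-find structure are all assumptions made for the example; the abstract does not describe krishna's output format.

```python
from collections import defaultdict

def single_linkage_families(pairs):
    """Group sequence IDs into families: two IDs share a family
    whenever a chain of pairwise hits connects them (single linkage)."""
    parent = {}

    def find(x):
        # Return the representative of x's family (with path halving).
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two families

    families = defaultdict(set)
    for x in list(parent):
        families[find(x)].add(x)
    return sorted(sorted(members) for members in families.values())

# Hypothetical pairwise hits between repeat intervals r1..r5.
hits = [("r1", "r2"), ("r2", "r3"), ("r4", "r5")]
print(single_linkage_families(hits))
# [['r1', 'r2', 'r3'], ['r4', 'r5']]
```

Note that r1 and r3 end up in the same family even though they were never aligned directly; that chaining behaviour is what distinguishes single linkage from stricter clustering criteria.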
Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies

PubMed Central

Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.

2018-01-01

Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441
Prevalence and genetic characterization of Vibrio vulnificus in raw seafood and seawater in Malaysia.

PubMed

Paydar, Mohammadjavad; Thong, Kwai Lin

2013-10-01

Vibrio vulnificus is a highly invasive human pathogen that exists naturally in estuarine environment and coastal waters. In this study, we used different PCR assays to detect V. vulnificus in 260 seafood and 80 seawater samples. V. vulnificus was present in about 34 (13%) of the 260 seafood samples and 18 (23%) of the 80 seawater samples. Repetitive extragenic palindromic PCR (REP-PCR) and enterobacterial repetitive intergenic consensus PCR (ERIC-PCR) were applied to subtype the V. vulnificus isolates. Twenty-five REP profiles and 45 ERIC profiles were observed, and the isolates were categorized into 9 and 10 distinct clusters at the similarity of 80%, by REP-PCR and ERIC-PCR, respectively. ERIC-PCR is more discriminative than REP-PCR in subtyping V. vulnificus, demonstrating high genetic diversity among the isolates.
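The prevalence figures quoted above follow directly from the sample counts. A small check, using only the numbers given in the abstract (the helper name `prevalence_percent` is made up for this example):

```python
def prevalence_percent(positives, total):
    """Percentage of samples that tested positive."""
    return 100.0 * positives / total

seafood = prevalence_percent(34, 260)
seawater = prevalence_percent(18, 80)
print(f"seafood: {seafood:.1f}%, seawater: {seawater:.1f}%")
# seafood: 13.1%, seawater: 22.5%
```

The seawater value of 22.5% is reported as 23% in the abstract, which corresponds to rounding half up.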
Comparative Sequence Analysis of the X-Inactivation Center Region in Mouse, Human, and Bovine

PubMed Central

Chureau, Corinne; Prissette, Marine; Bourdet, Agnès; Barbe, Valérie; Cattolico, Laurence; Jones, Louis; Eggen, André; Avner, Philip; Duret, Laurent

2002-01-01

We have sequenced to high levels of accuracy 714-kb and 233-kb regions of the mouse and bovine X-inactivation centers (Xic), respectively, centered on the Xist gene. This has provided the basis for a fully annotated comparative analysis of the mouse Xic with the 2.3-Mb orthologous region in human and has allowed a three-way species comparison of the core central region, including the Xist gene. These comparisons have revealed conserved genes, both coding and noncoding, conserved CpG islands and, more surprisingly, conserved pseudogenes. The distribution of repeated elements, especially LINE repeats, in the mouse Xic region when compared to the rest of the genome does not support the hypothesis of a role for these repeat elements in the spreading of X inactivation. Interestingly, an asymmetric distribution of LINE elements on the two DNA strands was observed in the three species, not only within introns but also in intergenic regions. This feature is suggestive of important transcriptional activity within these intergenic regions. In silico prediction followed by experimental analysis has allowed four new genes, Cnbp2, Ftx, Jpx, and Ppnx, to be identified and novel, widespread, complex, and apparently noncoding transcriptional activity to be characterized in a region 5′ of Xist that was recently shown to attract histone modification early after the onset of X inactivation. [The sequence data described in this paper have been submitted to the EMBL data library under accession nos. AJ421478, AJ421479, AJ421480, and AJ421481. Online supplemental data are available at http://pbil.univ-lyon1.fr/datasets/Xic2002/data.html and www.genome.org.] PMID:12045143
The Evolution of Dark Matter in the Mitogenome of Seed Beetles

PubMed Central

Sayadi, Ahmed; Immonen, Elina; Tellgren-Roth, Christian

2017-01-01

Animal mitogenomes are generally thought of as being economic and optimized for rapid replication and transcription. We use long-read sequencing technology to assemble the remarkable mitogenomes of four species of seed beetles. These are the largest circular mitogenomes ever assembled in insects, ranging from 24,496 to 26,613 bp in total length, and are exceptional in that some 40% consists of non-coding DNA. The size expansion is due to two very long intergenic spacers (LIGSs), rich in tandem repeats. The two LIGSs are present in all species but vary greatly in length (114–10,408 bp), show very low sequence similarity, divergent tandem repeat motifs, a very high AT content and concerted length evolution. The LIGSs have been retained for at least some 45 my but must have undergone repeated reductions and expansions, despite strong purifying selection on protein coding mtDNA genes. The LIGSs are located in two intergenic sites where a few recent studies of insects have also reported shorter LIGSs (>200 bp). These sites may represent spaces that tolerate neutral repeat array expansions or, alternatively, the LIGSs may function to allow a more economic translational machinery. Mitochondrial respiration in adult seed beetles is based almost exclusively on fatty acids, which reduces the need for building complex I of the oxidative phosphorylation pathway (NADH dehydrogenase). One possibility is thus that the LIGSs may allow depressed transcription of NAD genes. RNA sequencing showed that LIGSs are partly transcribed and transcriptional profiling suggested that all seven mtDNA NAD genes indeed show low levels of transcription and co-regulation of transcription across sexes and tissues. PMID:29048527
Rapid differentiation of Staphylococcus aureus isolates harbouring egc loci with pseudogenes psient1 and psient2 and the selu or seluv gene using PCR-RFLP.

PubMed

Collery, Mark M; Smyth, Cyril J

2007-02-01

The egc locus of Staphylococcus aureus harbours two enterotoxin genes (seg and sei) and three enterotoxin-like genes (selm, seln and selo). Between the sei and seln genes are located two pseudogenes, psient1 and psient2, or the selu or seluv gene. While these two alternative sei-seln intergenic regions can be distinguished by PCR, to date, DNA sequencing has been the only confirmatory option because of the very high degree of sequence similarity between egc loci bearing the pseudogenes and the selu or seluv gene. In silico restriction enzyme digestion of genomic regions encompassing the egc locus from the 3' end of the sei gene through the 5' first quarter of the seln gene allowed pseudogene- and selu- or seluv-bearing egc loci to be distinguished by PCR-RFLP. Experimental application of these findings demonstrated that endonuclease HindIII cleaved PCR amplimers bearing pseudogenes but not those with a selu or seluv gene, while selu- or seluv-bearing amplimers were susceptible to cleavage by endonuclease HphI, but not by endonuclease HindIII. The restriction enzyme BccI cleaved selu- or seluv-harbouring amplimers at a unique restriction site created by their signature 15 bp insertion compared with pseudogene-bearing amplimers, thereby allowing distinction of these egc loci. PCR-RFLP analysis using these restriction enzymes provides a rapid, easy to interpret alternative to DNA sequencing for verification of PCR findings on the nature of an egc locus type, and can also be used for the primary identification of the intergenic sei-seln egc locus type.
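The in silico digestion idea above can be illustrated with a short sketch that cuts a linear amplimer at every HindIII site (recognition sequence AAGCTT, cut after the first A) and reports the fragment lengths. The two test sequences are invented placeholders, not real egc amplimers, and the function name `digest` is hypothetical; only the HindIII recognition site is taken as known.

```python
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence (A^AGCTT)
HINDIII_CUT_OFFSET = 1   # cut falls after the first base of the site

def digest(seq, site=HINDIII_SITE, cut_offset=HINDIII_CUT_OFFSET):
    """Return fragment lengths after cutting a linear sequence at every
    occurrence of `site`. The HindIII site is palindromic, so searching
    one strand is sufficient."""
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]

# Invented placeholder amplimers: one with a HindIII site, one without.
with_site = "CCCCCCCCCC" + "AAGCTT" + "GGGGGGGG"
without_site = "CCCCCCCCCC" + "GGGGGGGG"
print(digest(with_site))     # [11, 13]
print(digest(without_site))  # [18]
```

An amplimer that yields a single fragment is uncut by the enzyme, which is the pattern the abstract reports for selu- or seluv-bearing amplimers with HindIII.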
High-throughput sequencing of retrotransposon integration provides a saturated profile of target activity in Schizosaccharomyces pombe.

PubMed

Guo, Yabin; Levin, Henry L

2010-02-01

The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress.
High-throughput sequencing of retrotransposon integration provides a saturated profile of target activity in Schizosaccharomyces pombe

PubMed Central

Guo, Yabin; Levin, Henry L.

2010-01-01

The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress. PMID:20040583
Identification of Lactobacillus alimentarius and Lactobacillus farciminis with 16S-23S rDNA intergenic spacer region polymorphism and PCR amplification using species-specific oligonucleotide.

PubMed

Rachman, C N; Kabadjova, P; Prévost, H; Dousset, X

2003-01-01

The restriction fragment length polymorphism (RFLP) method was used to differentiate Lactobacillus species having closely related identities in the 16S-23S rDNA intergenic spacer region (ISR). Species-specific primers for Lact. farciminis and Lact. alimentarius were designed and allowed rapid identification of these species. The 16S-23S rDNA spacer region was amplified by primers tAla and 23S/p10, then digested by HinfI and TaqI enzymes and analysed by electrophoresis. Digestion by HinfI was not sufficient to differentiate Lact. sakei, Lact. curvatus, Lact. farciminis, Lact. alimentarius, Lact. plantarum and Lact. paraplantarum. In contrast, digestion carried out by TaqI revealed five different patterns allowing these species to be distinguished, except for Lact. plantarum from Lact. paraplantarum. The 16S-23S rDNA spacer regions of Lact. farciminis and Lact. alimentarius were amplified and then cloned into vector pCR(R)2.1 and sequenced. The DNA sequences obtained were analysed and species-specific primers were designed from these sequences. The specificity of these primers was positively demonstrated as no response was obtained for 14 other species tested. The species-specific primers for Lact. farciminis and Lact. alimentarius were shown to be useful for identifying these species among other lactobacilli. The RFLP profile obtained upon digestion with HinfI and TaqI enzymes can be used to discriminate Lact. farciminis, Lact. alimentarius, Lact. sakei, Lact. curvatus and Lact. plantarum. In this paper, we have established the first species-specific primers for PCR identification of Lact. farciminis and Lact. alimentarius. Both the species-specific primers and RFLP could be used as tools for rapid identification of lactobacilli up to species level.
Population genetic structure and phylogeographical pattern of a relict tree fern, Alsophila spinulosa (Cyatheaceae), inferred from cpDNA atpB-rbcL intergenic spacers.

PubMed

Su, Yingjuan; Wang, Ting; Zheng, Bo; Jiang, Yu; Chen, Guopei; Gu, Hongya

2004-11-01

Sequences of chloroplast DNA (cpDNA) atpB-rbcL intergenic spacers of individuals of a tree fern species, Alsophila spinulosa, collected from ten relict populations distributed in the Hainan and Guangdong provinces, and the Guangxi Zhuang region in southern China, were determined. Sequence length varied from 724 bp to 731 bp, showing length polymorphism, and base composition showed a high A+T content, between 63.17% and 63.95%. Sequences were neutral in terms of evolution (Tajima's criterion D=-1.01899, P>0.10 and Fu and Li's test D*=-1.39008, P>0.10; F*=-1.49775, P>0.10). A total of 19 haplotypes were identified based on nucleotide variation. High levels of haplotype diversity (h=0.744) and nucleotide diversity (Dij=0.01130) were detected in A. spinulosa, probably associated with its long evolutionary history, which has allowed the accumulation of genetic variation within lineages. Both the minimum spanning network and neighbor-joining trees generated for haplotypes demonstrated that current populations of A. spinulosa existing in Hainan, Guangdong, and Guangxi were subdivided into two geographical groups. An analysis of molecular variance indicated that most of the genetic variation (93.49%, P<0.001) was partitioned among regions. Wright's isolation by distance model was not supported across extant populations. Reduced gene flow by the Qiongzhou Strait and inbreeding may result in the geographical subdivision between the Hainan and Guangdong + Guangxi populations (FST=0.95, Nm=0.03). Within each region, the star-like pattern of phylogeography of haplotypes implied a population expansion process during evolutionary history. Gene genealogies together with coalescent theory provided significant information for uncovering phylogeography of A. spinulosa.
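Two of the summary statistics above are simple to compute. The sketch below shows A+T content for a sequence and the conversion from FST to Nm. The abstract does not state which formula was used for Nm; the haploid (organelle) form of Wright's island-model relation, Nm = (1/FST - 1)/2, is assumed here because it reproduces the reported value, whereas the diploid nuclear form divides by 4. The function names and the toy sequence are made up for the example.

```python
def at_content(seq):
    """A+T content of a nucleotide sequence, as a percentage."""
    seq = seq.upper()
    return 100.0 * (seq.count("A") + seq.count("T")) / len(seq)

def nm_from_fst(fst, divisor=2):
    """Estimate Nm from FST under Wright's island model.
    divisor=2 is the haploid/organelle form (assumed here);
    divisor=4 is the usual form for diploid nuclear loci."""
    return (1.0 / fst - 1.0) / divisor

print(f"{at_content('AATTGC'):.2f}%")  # toy sequence: 66.67%
print(f"{nm_from_fst(0.95):.2f}")      # 0.03
```

With FST=0.95 the haploid form gives about 0.026, which rounds to the Nm=0.03 quoted in the abstract.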
RISSC: a novel database for ribosomal 16S–23S RNA genes spacer regions

PubMed Central

García-Martínez, Jesús; Bescós, Ignacio; Rodríguez-Sala, Jesús Javier; Rodríguez-Valera, Francisco

2001-01-01

A novel database, under the acronym RISSC (Ribosomal Intergenic Spacer Sequence Collection), has been created. It compiles more than 1600 entries of edited DNA sequence data from the 16S–23S ribosomal spacers present in most prokaryotes and organelles (e.g. mitochondria and chloroplasts) and is accessible through the Internet (http://ulises.umh.es/RISSC), where systematic searches for specific words can be conducted, as well as BLAST-type sequence searches. Additionally, a characteristic feature of this region, the presence/absence and nature of tRNA genes within the spacer, is included in all the entries, even when not previously indicated in the original database. All these combined features could provide a useful documentation tool for studies on evolution, identification, typing and strain characterization, among others. PMID:11125084
Simplifying complex sequence information: a PCP-consensus protein binds antibodies against all four Dengue serotypes.

PubMed

Bowen, David M; Lewis, Jessica A; Lu, Wenzhe; Schein, Catherine H

2012-09-14

Designing proteins that reflect the natural variability of a pathogen is essential for developing novel vaccines and drugs. Flaviviruses, including Dengue (DENV) and West Nile (WNV), evolve rapidly and can "escape" neutralizing monoclonal antibodies by mutation. Designing antigens that represent many distinct strains is important for DENV, where infection with a strain from one of the four serotypes may lead to severe hemorrhagic disease on subsequent infection with a strain from another serotype. Here, a DENV physicochemical property (PCP)-consensus sequence was derived from 671 unique sequences from the Flavitrack database. PCP-consensus proteins for domain 3 of the envelope protein (EdomIII) were expressed from synthetic genes in Escherichia coli. The ability of the purified consensus proteins to bind polyclonal antibodies generated in response to infection with strains from each of the four DENV serotypes was determined. The initial consensus protein bound antibodies from DENV-1-3 in ELISA and Western blot assays. This sequence was altered in 3 steps to incorporate regions of maximum variability, identified as significant changes in the PCPs, characteristic of DENV-4 strains. The final protein was recognized by antibodies against all four serotypes. Two amino acids essential for efficient binding to all DENV antibodies are part of a discontinuous epitope previously defined for a neutralizing monoclonal antibody. The PCP-consensus method can significantly reduce the number of experiments required to define a multivalent antigen, which is particularly important when dealing with pathogens that must be tested at higher biosafety levels. Copyright © 2012 Elsevier Ltd. All rights reserved.
Fine-tuning structural RNA alignments in the twilight zone.

PubMed

Bremges, Andreas; Schirmer, Stefanie; Giegerich, Robert

2010-04-30

A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually, but consistent with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, to be re-submitted to alignment folding, and so on. This cycle may be iterated as long as structural conservation improves, but normally, one step suffices. Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index.
The Complete Mitochondrial Genome of the Land Snail Cornu aspersum (Helicidae: Mollusca): Intra-Specific Divergence of Protein-Coding Genes and Phylogenetic Considerations within Euthyneura

PubMed Central

Gaitán-Espitia, Juan Diego; Nespolo, Roberto F.; Opazo, Juan C.

2013-01-01

The complete sequences of three mitochondrial genomes from the land snail Cornu aspersum were determined. The mitogenome has a length of 14050 bp, and it encodes 13 protein-coding genes, 22 transfer RNA genes and two ribosomal RNA genes. It also includes nine small intergenic spacers, and a large AT-rich intergenic spacer. The intra-specific divergence analysis revealed that COX1 has the lowest genetic differentiation, while the most divergent genes were NADH1, NADH3 and NADH4. With the exception of Euhadra herklotsi, the structural comparisons showed the same gene order within the family Helicidae, and nearly identical gene organization to that found in order Pulmonata. Phylogenetic reconstruction recovered Basommatophora as a polyphyletic group, and Eupulmonata and Pulmonata as paraphyletic groups. Bayesian and Maximum Likelihood analyses showed that C. aspersum is a close relative of Cepaea nemoralis and that, together with the other Helicidae species, it forms a sister group of Albinaria caerulea, supporting the monophyly of the Stylommatophora clade. PMID:23826260
Consistency of gene starts among Burkholderia genomes

PubMed Central

2011-01-01

Background Evolutionary divergence in the position of the translational start site among orthologous genes can have significant functional impacts. Divergence can alter the translation rate, degradation rate, subcellular location, and function of the encoded proteins. Results Existing Genbank gene maps for Burkholderia genomes suggest that extensive divergence has occurred--53% of ortholog sets based on Genbank gene maps had inconsistent gene start sites. However, most of these inconsistencies appear to be gene-calling errors. Evolutionary divergence was the most plausible explanation for only 17% of the ortholog sets. Correcting probable errors in the Genbank gene maps decreased the percentage of ortholog sets with inconsistent starts by 68%, increased the percentage of ortholog sets with extractable upstream intergenic regions by 32%, increased the sequence similarity of intergenic regions and predicted proteins, and increased the number of proteins with identifiable signal peptides. Conclusions Our findings highlight an emerging problem in comparative genomics: single-digit percent errors in gene predictions can lead to double-digit percentages of inconsistent ortholog sets. The work demonstrates a simple approach to evaluate and improve the quality of gene maps. PMID:21342528
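The percentages reported above can be related by a short calculation. It assumes that the 68% decrease is a relative reduction applied to the initial 53% of inconsistent ortholog sets; the abstract does not say this explicitly, so treat it as one possible reading. The helper name is made up for the example.

```python
def apply_relative_decrease(percent, relative_decrease):
    """Value left after reducing `percent` by a relative fraction."""
    return percent * (1.0 - relative_decrease)

remaining = apply_relative_decrease(53.0, 0.68)
print(f"{remaining:.1f}%")  # 17.0%
```

Under that reading, about 17% of ortholog sets remain inconsistent after correction, which matches the share for which evolutionary divergence was judged the most plausible explanation.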
Transcriptome map of plant mitochondria reveals islands of unexpected transcribed regions.

PubMed

Fujii, Sota; Toda, Takushi; Kikuchi, Shunsuke; Suzuki, Ryutaro; Yokoyama, Koji; Tsuchida, Hiroko; Yano, Kentaro; Toriyama, Kinya

2011-06-01

Plant mitochondria contain a relatively large amount of genetic information, suggesting that their functional regulation may not be as straightforward as that of metazoans. We used a genomic tiling array to draw a transcriptomic atlas of Oryza sativa japonica (rice) mitochondria, which was predicted to be approximately 490-kb long. Whereas statistical analysis verified the transcription of all previously known functional genes such as the ones related to oxidative phosphorylation, a similar extent of RNA expression was frequently observed in the inter-genic regions where none of the previously annotated genes are located. The newly identified open reading frames (ORFs) predicted in these transcribed inter-genic regions were generally not conserved among flowering plant species, suggesting that these ORFs did not play a role in mitochondrial principal functions. We also identified two partial fragments of retrotransposon sequences as being transcribed in rice mitochondria. The present study indicated the previously unexpected complexity of plant mitochondrial RNA metabolism. Our transcriptomic data (Oryza sativa Mitochondrial rna Expression Server: OsMES) is publicly accessible at [http://bioinf.mind.meiji.ac.jp/cgi-bin/gbrowse/OsMes/#search].
Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled

PubMed Central

Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran

2015-01-01

The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281
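The di-/trinucleotide composition used above as a predictor is, at its simplest, a table of overlapping k-mer frequencies. The sketch below counts them on a single strand for a toy sequence. The study's exact feature definition (for example, whether both strands are combined or how frequencies are normalised) is not given in the abstract, so this is only an illustration with a hypothetical function name.

```python
from collections import Counter

def kmer_frequencies(seq, k):
    """Relative frequencies of overlapping k-mers in `seq`,
    skipping any window that contains a non-ACGT character."""
    seq = seq.upper()
    windows = (seq[i:i + k] for i in range(len(seq) - k + 1))
    counts = Counter(w for w in windows if set(w) <= set("ACGT"))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}

# Toy intergenic sequence; dinucleotides are AC, CG, GT, TA, AC.
freqs = kmer_frequencies("ACGTAC", 2)
print(freqs["AC"])  # 0.4
```

Calling the function with k=3 gives the trinucleotide composition in the same way.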
Intercalation of XR5944 with the estrogen response element is modulated by the tri-nucleotide spacer sequence between half-sites

PubMed Central

Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou

2011-01-01

DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
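The consensus ERE given in this abstract, AGGTCAnnnTGACCT, can be located in a sequence with a simple pattern match that also reports the tri-nucleotide spacer. The following is a minimal sketch; the function name `find_eres` and the toy sequence are hypothetical. Because the two half-sites are reverse complements of each other, searching one strand is enough for perfect consensus sites.

```python
import re

# Two half-sites separated by any three unambiguous bases (the spacer).
ERE_PATTERN = re.compile(r"AGGTCA([ACGT]{3})TGACCT")


def find_eres(seq):
    """Return (0-based start, spacer) for each perfect consensus ERE in seq."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in ERE_PATTERN.finditer(seq)]


# Toy sequence with one CGG-spaced and one TTT-spaced ERE (invented example).
toy = "ccAGGTCAcggTGACCTttAGGTCATTTTGACCTaa"
hits = find_eres(toy)
```

Natural, non-consensus EREs of the kind also tested in the study would need a mismatch-tolerant search, which this sketch does not attempt.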
Lactobacillus species isolated from vaginal secretions of healthy and bacterial vaginosis-intermediate Mexican women: a prospective study

PubMed Central

2013-01-01

Background Lactobacillus jensenii, L. iners, L. crispatus and L. gasseri are the most frequently occurring lactobacilli in the vagina. However, the native species vary widely according to the studied population. The present study was performed to genetically determine the identity of Lactobacillus strains present in the vaginal discharge of healthy and bacterial vaginosis (BV) intermediate Mexican women. Methods In a prospective study, 31 strains preliminarily identified as Lactobacillus species were isolated from 21 samples collected from 105 non-pregnant Mexican women. The samples were classified into groups according to the Nugent score criteria proposed for detection of BV: normal (N), intermediate (I) and bacterial vaginosis (BV). We examined the isolates using culture-based methods as well as molecular analysis of the V1–V3 regions of the 16S rRNA gene. Enterobacterial repetitive intergenic consensus (ERIC) sequence analysis was performed to reject clones. Results Clinical isolates (25/31) were classified into four groups based on sequencing and analysis of the 16S rRNA gene: L. acidophilus (14/25), L. reuteri (6/25), L. casei (4/25) and L. buchneri (1/25). The remaining six isolates were presumptively identified as Enterococcus species. Within the L. acidophilus group, L. gasseri was the most frequently isolated species, followed by L. jensenii and L. crispatus. L. fermentum, L. rhamnosus and L. brevis were also isolated, and were placed in the L. reuteri, L. casei and L. buchneri groups, respectively. ERIC profile analysis showed intraspecific variability amongst the L. gasseri and L. fermentum species. Conclusions These findings agree with previous studies showing that L. crispatus, L. gasseri and L. jensenii are consistently present in the healthy vaginal ecosystem. 
Additional species or phylotypes were detected in the vaginal microbiota of the non-pregnant Mexican (Hispanic-mestizo) population, and thus, these results further our understanding of vaginal lactobacilli colonisation and richness in this particular population. PMID:23617246

The divergently transcribed genes encoding yeast ribosomal proteins L46 and S24 are activated by shared RPG-boxes.

PubMed Central

Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J

1989-01-01

Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 genes, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as a heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region conferred hardly any transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well.
Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of the L46 gene. We conclude that all promoter elements for the S24 gene are located within the intergenic region, where the RPG-boxes are the most likely UAS-elements. However, the intergenic region (including the RPG-boxes) is required but not sufficient to confer transcription activity on the L46 gene. Images PMID:2602141
The divergently transcribed genes encoding yeast ribosomal proteins L46 and S24 are activated by shared RPG-boxes.

PubMed

Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J

1989-12-11

Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 genes, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as a heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region conferred hardly any transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well.
Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of the L46 gene. We conclude that all promoter elements for the S24 gene are located within the intergenic region, where the RPG-boxes are the most likely UAS-elements. However, the intergenic region (including the RPG-boxes) is required but not sufficient to confer transcription activity on the L46 gene.
Cryptic tRNAs in chaetognath mitochondrial genomes.

PubMed

Barthélémy, Roxane-Marie; Seligmann, Hervé

2016-06-01

The chaetognaths constitute a small and enigmatic phylum of small marine invertebrates. Both their nuclear and mitochondrial genomes have numerous unusual features, some of them phylum-specific. Until recently, their mitogenomes seemed to contain only one tRNA gene (trnMet), but a recent study found two and four tRNA genes in two chaetognath mitogenomes. Moreover, two conspecific mitogenomes apparently have different tRNA gene numbers (one and two). Reanalyses of the five available complete chaetognath mitogenomes with the tRNAscan-SE and ARWEN software suggest numerous additional tRNA genes of different types. Their total number never reaches the 22 found in most other invertebrates using that genetic code. Predicted error compensation between codon-anticodon mismatch and tRNA misacylation suggests translational activity by tRNAs predicted solely according to secondary structure for tRNAs predicted by tRNAscan-SE, not ARWEN. Numbers of predicted stop-suppressor (antitermination) tRNAs coevolve with predicted overlapping, frameshifted protein coding genes including stop codons. Sequence alignments in secondary structure prediction with non-chaetognath tRNAs suggest that the most likely functional tRNAs are in intergenic regions, as regular mt-tRNAs. Because intergenic regions are usually short, tRNA sequences generally overlap partially with flanking genes. Some tRNA pairs seem templated by sense-antisense strands. Moreover, 16S rRNA genes, but not 12S rRNAs, appear as tRNA nurseries, as previously suggested for multifunctional ribosomal-like protogenomes. Copyright © 2016 Elsevier Ltd. All rights reserved.
A Molecular Portrait of De Novo Genes in Yeasts.

PubMed

Vakirlis, Nikolaos; Hebert, Alex S; Opulente, Dana A; Achaz, Guillaume; Hittinger, Chris Todd; Fischer, Gilles; Coon, Joshua J; Lafontaine, Ingrid

2018-03-01

New genes, with novel protein functions, can evolve "from scratch" out of intergenic sequences. These de novo genes can integrate the cell's genetic network and drive important phenotypic innovations. Therefore, identifying de novo genes and understanding how the transition from noncoding to coding occurs are key problems in evolutionary biology. However, identifying de novo genes is a difficult task, hampered by the presence of remote homologs, fast evolving sequences and erroneously annotated protein coding genes. To overcome these limitations, we developed a procedure that handles the usual pitfalls in de novo gene identification and predicted the emergence of 703 de novo gene candidates in 15 yeast species from 2 genera whose phylogeny spans at least 100 million years of evolution. We validated 85 candidates by proteomic data, providing new translation evidence for 25 of them through mass spectrometry experiments. We also unambiguously identified the mutations that enabled the transition from noncoding to coding for 30 Saccharomyces de novo genes. We established that de novo gene origination is a widespread phenomenon in yeasts, only a few being ultimately maintained by selection. We also found that de novo genes preferentially emerge next to divergent promoters in GC-rich intergenic regions where the probability of finding a fortuitous and transcribed ORF is the highest. Finally, we found a more than 3-fold enrichment of de novo genes at recombination hot spots, which are GC-rich and nucleosome-free regions, suggesting that meiotic recombination contributes to de novo gene emergence in yeasts.
Molecular phylogeny of 21 tropical bamboo species reconstructed by integrating non-coding internal transcribed spacer (ITS1 and 2) sequences and their consensus secondary structure.

PubMed

Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita

2017-06-01

The unavailability of reproductive structures and the unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21 tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive an individual sequence-structure matrix and gain more resolution in the phylogenetic reconstruction. The results showed that ITS sequence-structure is a reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny, compared with the sequence-only one, underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the sequence-only analysis for phylogenetic relationship assessment of bamboo.
Genome-wide admixture and association study of subclinical atherosclerosis in the Women’s Interagency HIV Study (WIHS)

PubMed Central

Shendre, Aditi; Wiener, Howard W.; Irvin, Marguerite R.; Aouizerat, Bradley E.; Overton, Edgar T.; Lazar, Jason; Liu, Chenglong; Hodis, Howard N.; Limdi, Nita A.; Weber, Kathleen M.; Zhi, Degui; Floris-Moore, Michelle A.; Ofotokun, Ighovwerha; Qi, Qibin; Hanna, David B.; Kaplan, Robert C.

2017-01-01

Cardiovascular disease (CVD) is a major comorbidity among HIV-infected individuals. Common carotid artery intima-media thickness (cCIMT) is a valid and reliable subclinical measure of atherosclerosis and is known to predict CVD. We performed genome-wide association (GWA) and admixture analysis among 682 HIV-positive and 288 HIV-negative Black, non-Hispanic women from the Women’s Interagency HIV study (WIHS) cohort using a combined and stratified analysis approach. We found some suggestive associations but none of the SNPs reached genome-wide statistical significance in our GWAS analysis. The top GWAS SNPs were rs2280828 in the region intergenic to mediator complex subunit 30 and exostosin glycosyltransferase 1 (MED30 | EXT1) among all women, rs2907092 in the catenin delta 2 (CTNND2) gene among HIV-positive women, and rs7529733 in the region intergenic to family with sequence similarity 5, member C and regulator of G-protein signaling 18 (FAM5C | RGS18) genes among HIV-negative women. The most significant local European ancestry associations were in the region intergenic to the zinc finger and SCAN domain containing 5D gene and NADH: ubiquinone oxidoreductase complex assembly factor 1 (ZSCAN5D | NDUF1) pseudogene on chromosome 19 among all women, in the region intergenic to vomeronasal 1 receptor 6 pseudogene and zinc finger protein 845 (VN1R6P | ZNF845) gene on chromosome 19 among HIV-positive women, and in the region intergenic to the SEC23-interacting protein and phosphatidic acid phosphatase type 2 domain containing 1A (SEC23IP | PPAPDC1A) genes located on chromosome 10 among HIV-negative women. A number of previously identified SNP associations with cCIMT were also observed and included rs2572204 in the ryanodine receptor 3 (RYR3) and an admixture region in the secretion-regulating guanine nucleotide exchange factor (SERGEF) gene. 
We report several SNPs and gene regions in the GWAS and admixture analysis, some of which are common across HIV-positive and HIV-negative women as demonstrated using meta-analysis, and also across the two analytic approaches (i.e., GWA and admixture). These findings suggest that local European ancestry plays an important role in genetic associations of cCIMT among black women from WIHS along with other environmental factors that are related to CVD and may also be triggered by HIV. These findings warrant confirmation in independent samples. PMID:29206233
Sequence polymorphism in an insect RNA virus field population: A snapshot from a single point in space and time reveals stochastic differences among and within individual hosts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stenger, Drake C.

Population structure of Homalodisca coagulata Virus-1 (HoCV-1) among and within field-collected insects sampled from a single point in space and time was examined. Polymorphism in complete consensus sequences among single-insect isolates was dominated by synonymous substitutions. The mutant spectrum of the C2 helicase region within each single-insect isolate was unique and dominated by nonsynonymous singletons. Bootstrapping was used to correct the within-isolate nonsynonymous:synonymous arithmetic ratio (N:S) for RT-PCR error, yielding an N:S value ~one log-unit greater than that of consensus sequences. Probability of all possible single-base substitutions for the C2 region predicted N:S values within 95% confidence limits of the corrected within-isolate N:S when the only constraint imposed was viral polymerase error bias for transitions over transversions. These results indicate that bottlenecks coupled with strong negative/purifying selection drive consensus sequences toward neutral sequence space, and that most polymorphism within single-insect isolates is composed of newly-minted mutations sampled prior to selection. Highlights: • Sampling protocol minimized differential selection/history among isolates. • Polymorphism among consensus sequences dominated by negative/purifying selection. • Within-isolate N:S ratio corrected for RT-PCR error by bootstrapping. • Within-isolate mutant spectrum dominated by new mutations yet to undergo selection.
The hypervariable region 1 protein of hepatitis C virus broadly reactive with sera of patients with chronic hepatitis C has a similar amino acid sequence with the consensus sequence.

PubMed

Watanabe, K; Yoshioka, K; Ito, H; Ishigami, M; Takagi, K; Utsunomiya, S; Kobayashi, M; Kishimoto, H; Yano, M; Kakumu, S

1999-11-10

Hypervariable region 1 (HVR1) proteins of hepatitis C virus (HCV) have been reported to react broadly with sera of patients with HCV infection. However, the variability of the broad reactivity of individual HVR1 proteins has not been elucidated. We assessed the reactivity of 25 different HVR1 proteins (genotype 1b) with sera of 81 patients with HCV infection (genotype 1b) by Western blot. HVR1 proteins reacted with 2-60 sera. The number of sera reactive with each HVR1 protein significantly correlated with the number of amino acid residues identical to the consensus sequence defined by Puntoriero et al. (G. Puntoriero, A. Lahm, S. Zucchelli, B. B. Ercole, R. Tafi, M. Penzzanera, M. U. Mondelli, R. Cortese, A. Tramontano, G. Galfre', and A. Nicosia. 1998. EMBO J. 17, 3521-3533. ) (r = 0.561, P < 0.005). The most widely reactive HVR1 protein, 12-22, had a sequence similar to the consensus sequence. The peptide with C-terminal 13-amino-acids sequence of HVR1 protein 12-22 (NH2-CSFTSLFTPGPSQK) was injected into rabbits as an immunogen. The rabbit immune sera reacted with 9 of 25 HVR1 proteins of genotype 1b including HVR1 protein 12-22 and with 3 of 12 proteins of genotype 2a. These results indicate that the HVR1 protein broadly reactive with patients' sera has a sequence similar to the consensus sequence, can induce broadly reactive sera, and could be one of the candidate immunogens in a prophylactic vaccine against HCV. Copyright 1999 Academic Press.
Comparison of simple sequence repeats in 19 Archaea.

PubMed

Trivedi, S

2006-12-05

All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
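A search for di- to penta-nucleotide repeats of the kind described above can be sketched with regular-expression backreferences. This is not the SPUTNIK algorithm used in the study, which scores imperfect repeats; the sketch below finds only perfect tandem repeats, and the function name `find_ssrs` and the `min_repeats` threshold are assumptions made for illustration.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=5, min_repeats=3):
    """Return sorted (0-based start, unit, repeat count) for perfect tandem repeats."""
    seq = seq.upper()
    hits = []
    for k in range(min_unit, max_unit + 1):
        # A k-base unit followed by at least (min_repeats - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are themselves repeats of a shorter motif
            # (e.g. "AA" or "ATAT"), so each repeat is reported once.
            if any(unit == unit[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // k))
    return sorted(hits)


# Toy sequences, invented for illustration.
di_hits = find_ssrs("GGCACACACATT")
tri_hits = find_ssrs("TTATGATGATGCC")
```

The reported unit depends on where the leftmost match starts, so the same repeat may appear as "CA" or "AC" depending on its flanks. Separating hits into coding and non-coding regions, as the study does, would additionally require gene coordinates from an annotation.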
Complete sequence of the genome of avian paramyxovirus type 2 (strain Yucaipa) and comparison with other paramyxoviruses

PubMed Central

Subbiah, Madhuri; Xiao, Sa; Collins, Peter L.; Samal, Siba K

2009-01-01

The complete RNA genome sequence of avian paramyxovirus (APMV) serotype 2, strain Yucaipa isolated from chicken has been determined. With genome size of 14,904 nucleotides (nt), strain Yucaipa is consistent with the “rule of six” and is the smallest virus reported to date among the members of subfamily Paramyxovirinae. The genome contains six non-overlapping genes in the order 3′-N-P/V-M-F-HN-L-5′. The genes are flanked on either side by highly-conserved transcription start and stop signals and have intergenic sequences varying in length from 3 to 23 nt. The genome contains a 55 nt leader sequence at 3′ end and a 154 nt trailer sequence at 5′ end. Alignment and phylogenetic analysis of the predicted amino acid sequences of strain Yucaipa proteins with the cognate proteins of viruses of all of the five genera of family Paramyxoviridae showed that APMV-2 strain Yucaipa is more closely related to APMV-6 than APMV-1. PMID:18603323
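The "rule of six" mentioned above means that the genome length is an exact multiple of six nucleotides. A minimal check using the figures reported in this abstract is sketched below; the function and constant names are hypothetical.

```python
def obeys_rule_of_six(genome_length_nt):
    """True if the genome length is an exact multiple of six nucleotides."""
    return genome_length_nt % 6 == 0


# Values taken from the abstract (strain Yucaipa).
GENOME_LENGTH_NT = 14904
LEADER_NT = 55
TRAILER_NT = 154

hexamers = GENOME_LENGTH_NT // 6  # number of complete six-nucleotide units
# Nucleotides left for the six genes and their intergenic sequences.
internal_nt = GENOME_LENGTH_NT - LEADER_NT - TRAILER_NT
```

With these figures the genome corresponds to 2,484 six-nucleotide units, and 14,695 nt remain between the leader and the trailer.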
Transcriptional Regulation of Aggregatibacter actinomycetemcomitans lsrACDBFG and lsrRK Operons and Their Role in Biofilm Formation

PubMed Central

Torres-Escobar, Ascención; Juárez-Rodríguez, María Dolores; Lamont, Richard J.

2013-01-01

Autoinducer-2 (AI-2) is required for biofilm formation and virulence of the oral pathogen Aggregatibacter actinomycetemcomitans, and we previously showed that lsrB codes for a receptor for AI-2. The lsrB gene is expressed as part of the lsrACDBFG operon, which is divergently transcribed from an adjacent lsrRK operon. In Escherichia coli, lsrRK encodes a repressor and AI-2 kinase that function to regulate lsrACDBFG. To determine if lsrRK controls lsrACDBFG expression and influences biofilm growth of A. actinomycetemcomitans, we first defined the promoters for each operon. Transcriptional reporter plasmids containing the 255-bp lsrACDBFG-lsrRK intergenic region (IGR) fused to lacZ showed that essential elements of lsrR promoter reside 89 to 255 bp upstream from the lsrR start codon. Two inverted repeat sequences that represent potential binding sites for LsrR and two sequences resembling the consensus cyclic AMP receptor protein (CRP) binding site were identified in this region. Using electrophoretic mobility shift assay (EMSA), purified LsrR and CRP proteins were shown to bind probes containing these sequences. Surprisingly, the 255-bp IGR did not contain the lsrA promoter. Instead, a fragment encompassing nucleotides +1 to +159 of lsrA together with the 255-bp IGR was required to promote lsrA transcription. This suggests that a region within the lsrA coding sequence influences transcription, or alternatively that the start codon of A. actinomycetemcomitans lsrA has been incorrectly annotated. Transformation of ΔlsrR, ΔlsrK, ΔlsrRK, and Δcrp deletion mutants with lacZ reporters containing the lsrA or lsrR promoter showed that LsrR negatively regulates and CRP positively regulates both lsrACDBFG and lsrRK. However, in contrast to what occurs in E. coli, deletion of lsrK had no effect on the transcriptional activity of the lsrA or lsrR promoters, suggesting that another kinase may be capable of phosphorylating AI-2 in A. actinomycetemcomitans. 
Finally, biofilm formation of the ΔlsrR, ΔlsrRK, and Δcrp mutants was significantly reduced relative to that of the wild type, indicating that proper regulation of the lsr locus is required for optimal biofilm growth by A. actinomycetemcomitans. PMID:23104800
Importance of databases of nucleic acids for bioinformatic analysis focused to genomics

NASA Astrophysics Data System (ADS)

Jimenez-Gutierrez, L. R.; Barrios-Hernández, C. J.; Pedraza-Ferreira, G. R.; Vera-Cala, L.; Martinez-Perez, F.

2016-08-01

Recently, bioinformatics has become a new field of science, indispensable in the analysis of millions of nucleic acid sequences, which are currently deposited in international databases (public or private); these databases contain information on genes, RNA, ORFs, proteins, and intergenic regions, including entire genomes of some species. The analysis of this information requires computer programs, which have been updated with new mathematical methods and the introduction of artificial intelligence, in addition to the constant creation of supercomputing units designed to withstand the heavy workload of sequence analysis. However, innovation is still needed in platforms that allow genomic analyses to be performed faster and more effectively, with a technological understanding of all biological processes.
Chloroplast genes as genetic markers for inferring patterns of change, maternal ancestry and phylogenetic relationships among Eleusine species

PubMed Central

Agrawal, Renuka; Agrawal, Nitin; Tandon, Rajesh; Raina, Soom Nath

2013-01-01

Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction–restriction fragment length polymorphism (PCR–RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH–trnK, trnL–trnF, 16S and trnS–psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR–RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified. PMID:24790119
Chloroplast genes as genetic markers for inferring patterns of change, maternal ancestry and phylogenetic relationships among Eleusine species.

PubMed

Agrawal, Renuka; Agrawal, Nitin; Tandon, Rajesh; Raina, Soom Nath

2014-01-01

Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH-trnK, trnL-trnF, 16S and trnS-psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR-RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified.
Molecular evolution of the HoxA cluster in the three major gnathostome lineages

PubMed Central

Chiu, Chi-hua; Amemiya, Chris; Dewar, Ken; Kim, Chang-Bae; Ruddle, Frank H.; Wagner, Günter P.

2002-01-01

The duplication of Hox clusters and their maintenance in a lineage has a prominent but little understood role in chordate evolution. Here we examined how Hox cluster duplication may influence changes in cluster architecture and patterns of noncoding sequence evolution. We sequenced the entire duplicated HoxAa and HoxAb clusters of zebrafish (Danio rerio) and extended the 5′ (posterior) part of the HoxM (HoxA-like) cluster of horn shark (Heterodontus francisci) containing the hoxa11 and hoxa13 orthologs as well as intergenic and flanking noncoding sequences. The duplicated HoxA clusters in zebrafish each house considerably fewer genes and are dramatically shorter than the single HoxA clusters of human and horn shark. We compared the intergenic sequences of the HoxA clusters of human, horn shark, zebrafish (Aa, Ab), and striped bass and found extensive conservation of noncoding sequence motifs, i.e., phylogenetic footprints, between the human and horn shark, representing two of the three gnathostome lineages. These are putative cis-regulatory elements that may play a role in the regulation of the ancestral HoxA cluster. In contrast, homologous regions of the duplicated HoxAa and HoxAb clusters of zebrafish and the HoxA cluster of striped bass revealed a striking loss of conservation of these putative cis-regulatory sequences in the 3′ (anterior) segment of the cluster, where zebrafish only retains single representatives of group 1, 3, 4, and 5 (HoxAa) and group 2 (HoxAb) genes and in the 5′ part of the clusters, where zebrafish retains two copies of the group 13, 11, and 9 genes, i.e., AbdB-like genes. In analyzing patterns of cis-sequence evolution in the 5′ part of the clusters, we explicitly looked for evidence of complementary loss of conserved noncoding sequences, as predicted by the duplication-degeneration-complementation model in which genetic redundancy after gene duplication is resolved because of the fixation of complementary degenerative mutations. 
Our data did not yield evidence supporting this prediction. We conclude that changes in the pattern of cis-sequence conservation after Hox cluster duplication are more consistent with being the outcome of adaptive modification rather than passive mechanisms that erode redundancy created by the duplication event. These results support the view that genome duplications may provide a mechanism whereby master control genes undergo radical modifications conducive to major alterations in body plan. Such genomic revolutions may contribute significantly to the evolutionary process. PMID:11943847
Fine-tuning structural RNA alignments in the twilight zone

PubMed Central

2010-01-01

Background: A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Results: Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually but consistently with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, which is then re-submitted to alignment folding, and so on. This cycle may be iterated as long as structural conservation improves, but normally, one step suffices. Conclusions: Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index. PMID:20433706
A survey of sRNA families in α-proteobacteria

PubMed Central

del Val, Coral; Romero-Zaliz, Rocío; Torres-Quesada, Omar; Peregrina, Alexandra; Toro, Nicolás; Jiménez-Zurdo, Jose I

2012-01-01

We have performed a computational comparative analysis of six small non-coding RNA (sRNA) families in α-proteobacteria. Members of these families were first identified in the intergenic regions of the nitrogen-fixing endosymbiont S. meliloti by a combined bioinformatics screen followed by experimental verification. Consensus secondary structures inferred from covariance models for each sRNA family evidenced in some cases conserved motifs putatively relevant to the function of trans-encoded base-pairing sRNAs, i.e., Hfq-binding signatures and exposed anti Shine-Dalgarno sequences. Two particular family models, namely αr15 and αr35, shared sub-structural modules with the Rfam model suhB (RF00519) and the uncharacterized sRNA family αr35b, respectively. A third sRNA family, termed αr45, has homology to the cis-acting regulatory element speF (RF00518). However, new experimental data further confirmed that the S. meliloti αr45 representative is an Hfq-binding sRNA processed from or expressed independently of speF, thus refining the Rfam speF model annotation. All six families have members in phylogenetically related plant-interacting bacteria and animal pathogens of the order Rhizobiales, some occurring with high levels of paralogy in individual genomes. In silico and experimental evidence predicts differential regulation of paralogous sRNAs in S. meliloti 1021. The distribution patterns of these sRNA families suggest major contributions of vertical inheritance and extensive ancestral duplication events to the evolution of sRNAs in plant-interacting bacteria. PMID:22418845
Rates of gastrointestinal tract colonization of carbapenem-resistant Enterobacteriaceae and Pseudomonas aeruginosa in hospitals in Saudi Arabia

PubMed Central

Abdalhamid, B.; Elhadi, N.; Alabdulqader, N.; Alsamman, K.; Aljindan, R.

2016-01-01

Carbapenem-resistant Enterobacteriaceae (CRE) and carbapenem-resistant Pseudomonas aeruginosa (CRPAE) are globally a major medical issue, especially in intensive care units. The digestive tract is the main reservoir for these isolates; therefore, rectal swab surveillance is highly recommended. The purpose of this study was to detect the prevalence of gastrointestinal tract colonization of CRE and CRPAE in patients admitted to intensive care units in Saudi Arabia. This project also aimed to characterize carbapenem-hydrolyzing enzyme production in these isolates. From February to May 2015, 200 rectal swab specimens were screened by CHROMagar KPC. Organism identification and susceptibility testing were performed using the Vitek 2 system. One CRE and 13 CRPAE strains were identified, for a prevalence of 0.5% (1/200) and 6.5% (13/200) respectively. Strains showed high genetic diversity using enterobacterial repetitive intergenic consensus sequence-based PCR. NDM type and VIM type were detected by PCR in four and one CRPAE isolates respectively. ampC overexpression was detected in eight CRPAE isolates using Mueller-Hinton agar containing 1000 μg/mL cloxacillin. CTX-M-15 type was detected in 1 CRE by PCR. The prevalence of CRE strain colonization was lower than that of CRPAE isolates. The detection of NDM and VIM in the colonizing CRPAE strains is a major infection control concern. To our knowledge, this is the first study in Saudi Arabia and the gulf region focusing on digestive tract colonization of CRE and CRPAE organisms and characterizing the mechanisms of carbapenem resistance. PMID:26933499
A Gibbs sampler for motif detection in phylogenetically close sequences

NASA Astrophysics Data System (ADS)

Siddharthan, Rahul; van Nimwegen, Erik; Siggia, Eric

2004-03-01

Genes are regulated by transcription factors that bind to DNA upstream of genes and recognize short conserved "motifs" in a random intergenic "background". Motif-finders such as the Gibbs sampler compare the probability of these short sequences being represented by "weight matrices" to the probability of their arising from the background "null model", and explore this space (analogous to a free-energy landscape). But closely related species may show conservation not because of functional sites but simply because they have not had sufficient time to diverge, so conventional methods will fail. We introduce a new Gibbs sampler algorithm that accounts for common ancestry when searching for motifs, while requiring minimal "prior" assumptions on the number and types of motifs, assessing the significance of detected motifs by "tracking" clusters that stay together. We apply this scheme to motif detection in sporulation-cycle genes in the yeast S. cerevisiae, using recent sequences of other closely related Saccharomyces species.
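The comparison of a weight-matrix model against a background null model, as described in the record above, can be illustrated with a log-odds score. This is only a minimal sketch of the scoring step used by conventional motif finders; it does not implement the phylogeny-aware sampler of the abstract. The weight matrix, the uniform background, and the function names are hypothetical values chosen for illustration.

```python
import math

# Hypothetical 3-column weight matrix; each column is a probability
# distribution over the four bases (illustrative values only).
WEIGHT_MATRIX = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
]
# Assumed uniform background ("null model").
BACKGROUND = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}


def log_odds(site, weight_matrix, background):
    """Log of P(site | weight matrix) / P(site | background)."""
    return sum(
        math.log(weight_matrix[pos][base] / background[base])
        for pos, base in enumerate(site)
    )


def best_site(sequence, weight_matrix, background):
    """Return (score, start) of the highest-scoring window in `sequence`."""
    width = len(weight_matrix)
    return max(
        (log_odds(sequence[i:i + width], weight_matrix, background), i)
        for i in range(len(sequence) - width + 1)
    )
```

A positive score means the window is better explained by the motif model than by the background; a full Gibbs sampler would resample site positions and re-estimate the matrix rather than scan with a fixed one.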
Ebolavirus comparative genomics

DOE PAGES

Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...

2015-07-14

The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.

The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.

2011-04-29

In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect (centromeres, intergenic regions, transposable elements, gene family number) is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.
Molecular epidemiology and evolution of avian infectious bronchitis virus in Spain over a fourteen-year period.

PubMed

Dolz, Roser; Pujols, Joan; Ordóñez, German; Porta, Ramon; Majó, Natàlia

2008-04-25

An in-depth molecular study of infectious bronchitis viruses (IBV), with particular interest in the evolutionary aspects of IBV in Spain, was carried out based on the S1 gene molecular characterization of twenty-six Spanish strains isolated over a fourteen-year period. Four genotypes were identified based on S1 gene sequence analyses and phylogenetic studies. A drastic virus population shift was demonstrated over time, and the novel Italy 02 serotype was shown to have displaced the previously predominant serotype 4/91 in the field. Detailed analyses of the synonymous to non-synonymous ratio of the S1 gene sequences of this new serotype Italy 02 suggested that positive selection pressures might have contributed to the successful establishment of the Italy 02 serotype in our country. In addition, differences in the fitness of newly emergent genotypes were indicated. Furthermore, intergenic sequence (IG)-like motifs within S1 gene sequences of IBV isolates were suggested to enhance the recombination abilities of certain serotypes.
Trichosporon inkin: an uncommon agent of scalp white piedra. Report of four cases in Brazilian children.

PubMed

Fischman, Olga; Bezerra, Fabiane Castilho; Francisco, Elaine Cristina; da Silva, Flávia Cristina; Nishikaku, Angela Satie; Cavalcanti, Sarah Desirée Barbosa; de Azevedo Melo, Analy Salles; Bentubo, Henri Donnarumma Levy; Petri, Valéria

2014-08-01

We report four cases of scalp white piedra (SWP) in Brazilian female children. Morphological and physiological approaches gave inconsistent results for identifying Trichosporon to species level, while the sequencing of the intergenic spacer 1 region of ribosomal DNA accurately identified the agent of SWP as T. inkin. These cases emphasize the occurrence of this species causing this type of infection. The molecular identification of the suspected agent is needed for appropriate epidemiological surveillance of superficial mycoses caused by Trichosporon species.
Novel dicistrovirus from bat guano.

PubMed

Reuter, Gábor; Pankovics, Péter; Gyöngyi, Zoltán; Delwart, Eric; Boros, Akos

2014-12-01

A novel dicistrovirus (strain NB-1/2011/HUN, KJ802403) genome was detected from guano collected from an insectivorous bat (species Pipistrellus pipistrellus) in Hungary, using viral metagenomics. The complete genome of NB-1 is 9136 nt in length, excluding the poly(A) tail. NB-1 has a genome organization typical of a dicistrovirus with multiple 3B(VPg) and a cripavirus-like intergenic region (IGR)-IRES. NB-1 shares only 41% average amino acid sequence identity with capsid proteins of Himetobi P virus, indicating a potential novel species in the genus Cripavirus, family Dicistroviridae.
Genetic diversity of Streptococcus equi subsp. zooepidemicus and doxycycline resistance in kennelled dogs.

PubMed

Chalker, Victoria J; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe; Erles, Kerstin

2012-06-01

The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010.
Genetic Diversity of Streptococcus equi subsp. zooepidemicus and Doxycycline Resistance in Kennelled Dogs

PubMed Central

Chalker, Victoria J.; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe

2012-01-01

The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010. PMID:22495558
Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites.

PubMed

Rogan, P K; Schneider, T D

1995-01-01

Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
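The information measure behind a sequence logo, as used in the record above, can be sketched as follows. This is a minimal illustration that assumes equal-length DNA sites and omits the small-sample correction applied in practice; the function name and the toy sites in the usage note are hypothetical and are not splice acceptor data from the study.

```python
import math
from collections import Counter


def information_content(aligned_sites):
    """Per-position information in bits for aligned DNA sites.

    Each position contributes 2 - H, where H is the Shannon entropy of the
    observed base frequencies and 2 bits is the maximum for four bases.
    """
    length = len(aligned_sites[0])
    bits = []
    for pos in range(length):
        counts = Counter(site[pos] for site in aligned_sites)
        total = sum(counts.values())
        entropy = -sum(
            (n / total) * math.log2(n / total) for n in counts.values()
        )
        bits.append(2.0 - entropy)
    return bits
```

For the toy input `["AG", "AT", "AC", "AA"]`, the first position is invariant and carries the full 2 bits, while the second is uniform over the four bases and carries none. A consensus sequence would report only "A" followed by an arbitrary base, which is the loss of quantitative detail the abstract refers to.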
Efficient and Accurate Algorithm for Cleaved Fragments Prediction (CFPA) in Protein Sequences Dataset Based on Consensus and Its Variants: A Novel Degradomics Prediction Application.

PubMed

El-Assaad, Atlal; Dawy, Zaher; Nemer, Georges; Hajj, Hazem; Kobeissy, Firas H

2017-01-01

Degradomics is a novel discipline that involves determination of the proteases/substrate fragmentation profile, called the substrate degradome, and has been recently applied in different disciplines. A major application of degradomics is its utility in the field of biomarkers where the breakdown products (BDPs) of different proteases have been investigated. Among the major proteases assessed, calpain and caspase proteases have been associated with the execution phases of the pro-apoptotic and pro-necrotic cell death, generating caspase/calpain-specific cleaved fragments. The distinction between calpain and caspase protein fragments has been applied to distinguish injury mechanisms. Advanced proteomics technology has been used to identify these BDPs experimentally. However, it has been a challenge to identify these BDPs with high precision and efficiency, especially if we are targeting a number of proteins at one time. In this chapter, we present a novel bioinformatic detection method that identifies BDPs accurately and efficiently with validation against experimental data. This method aims at predicting the consensus sequence occurrences and their variants in a large set of experimentally detected protein sequences based on state-of-the-art sequence matching and alignment algorithms. After detection, the method generates all the potential cleaved fragments by a specific protease. This space and time-efficient algorithm is flexible to handle the different orientations that the consensus sequence and the protein sequence can take before cleaving. It is O(mn) in space complexity and O(Nmn) in time complexity, with N number of protein sequences, m length of the consensus sequence, and n length of each protein sequence. Ultimately, this knowledge will subsequently feed into the development of a novel tool for researchers to detect diverse types of selected BDPs as putative disease markers, contributing to the diagnosis and treatment of related disorders.
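The general idea of locating consensus occurrences and then generating cleaved fragments can be sketched with a naive scan. This is not the CFPA algorithm of the record above, whose details are not given in the abstract; it is a simplified stand-in that takes O(mn) time per protein for a consensus of length m and a protein of length n. The function names, the wildcard convention ('X' matches any residue), and the `cut_offset` parameter are assumptions made for illustration.

```python
def matches(window, consensus):
    """True if `window` fits `consensus`; 'X' matches any residue."""
    return all(c == "X" or c == w for c, w in zip(consensus, window))


def cleave(protein, consensus, cut_offset):
    """Split `protein` at every occurrence of `consensus`.

    `cut_offset` is the number of consensus residues that lie to the left
    of the cleaved bond (an assumed convention for this sketch).
    """
    m = len(consensus)
    cut_sites = [
        i + cut_offset
        for i in range(len(protein) - m + 1)
        if matches(protein[i:i + m], consensus)
    ]
    fragments, start = [], 0
    for cut in cut_sites:
        fragments.append(protein[start:cut])
        start = cut
    fragments.append(protein[start:])
    return fragments


def cleave_all(proteins, consensus, cut_offset):
    """Apply `cleave` to N proteins, giving O(N*m*n) time overall."""
    return {name: cleave(seq, consensus, cut_offset)
            for name, seq in proteins.items()}
```

With a made-up protein `"MKDEVDGSAADQVDLK"`, the pattern `"DXVD"` and a cut after the fourth residue, the scan yields the fragments `MKDEVD`, `GSAADQVD` and `LK`.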
Suitability of partial 16S ribosomal RNA gene sequence analysis for the identification of dangerous bacterial pathogens.

PubMed

Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F

2007-03-01

In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
Regions flanking ori sequences affect the replication efficiency of the mitochondrial genome of ori+ petite mutants from yeast.

PubMed

Rayko, E; Goursot, R; Cherif-Zahar, B; Melis, R; Bernardi, G

1988-03-31

The mitochondrial genomes of progenies from 26 crosses between 17 cytoplasmic, spontaneous, suppressive, ori+ petite mutants of Saccharomyces cerevisiae have been studied by electrophoresis of restriction fragments. Only parental genomes (or occasionally, genomes derived from them by secondary excisions) were found in the progenies of the almost 500 diploids investigated; no evidence for illegitimate, site-specific mitochondrial recombination was detected. One of the parental genomes was always found to predominate over the other one, although to different extents in different crosses. This predominance appears to be due to a higher replication efficiency, which is correlated with a greater density of ori sequences on the mitochondrial genome (and with a shorter repeat unit size of the latter). Exceptions to the 'repeat-unit-size rule' were found, however, even when the parental mitochondrial genomes carried the same ori sequence. This indicates that noncoding, intergenic sequences outside ori sequences also play a role in modulating replication efficiency. Since in different petites such sequences differ in primary structure, size, and position relative to ori sequences, this modulation is likely to take place through an indirect effect on DNA and nucleoid structure.
The transcriptome of the novel dinoflagellate Oxyrrhis marina (Alveolata: Dinophyceae): response to salinity examined by 454 sequencing

PubMed Central

2011-01-01

Background: The heterotrophic dinoflagellate Oxyrrhis marina is increasingly studied in experimental, ecological and evolutionary contexts. Its basal phylogenetic position within the dinoflagellates makes O. marina useful for understanding the origin of numerous unusual features of the dinoflagellate lineage; its broad distribution has lent O. marina to the study of protist biogeography; and nutritive flexibility and eurytopy have made it a common lab rat for the investigation of physiological responses of marine heterotrophic flagellates. Nevertheless, genome-scale resources for O. marina are scarce. Here we present a 454-based transcriptome survey for this organism. In addition, we assess sequence read abundance, as a proxy for gene expression, in response to salinity, an environmental factor potentially important in determining O. marina spatial distributions. Results: Sequencing generated ~57 Mbp of data which assembled into 7,398 contigs. Approximately 24% of contigs were nominally identified by BLAST. A further clustering of contigs (at ≥ 90% identity) revealed 164 transcript variant clusters, the largest of which (Phosphoribosylaminoimidazole-succinocarboxamide synthase) was composed of 28 variants displaying predominantly synonymous variation. In a genomic context, a sample of 5 different genes were demonstrated to occur as tandem repeats, separated by short (~200-340 bp) intergenic regions. For HSP90 several intergenic variants were detected, suggesting a potentially complex genomic arrangement. In response to salinity, analysis of 454 read abundance highlighted 9 and 20 genes over- or under-expressed at 50 PSU, respectively. However, 454 read abundance and subsequent qPCR validation did not correlate well, suggesting that measures of gene expression via ad hoc analysis of sequence read abundance require careful interpretation. Conclusion: Here we indicate that tandem gene arrangements and the occurrence of multiple transcribed gene variants are common and indicate potentially complex genomic arrangements in O. marina. Comparison of the reported data set with existing O. marina and other dinoflagellate ESTs indicates little sequence overlap, likely as a result of the relatively limited extent of genome-scale sequence data currently available for the dinoflagellates. This is one of the first 454-based transcriptome surveys of an ancestral dinoflagellate taxon and will undoubtedly prove useful for future comparative studies aimed at reconstructing the origin of novel features of the dinoflagellates. PMID:22014029
Gene-for-genes interactions between cotton R genes and Xanthomonas campestris pv. malvacearum avr genes.

PubMed

De Feyter, R; Yang, Y; Gabriel, D W

1993-01-01

Six plasmid-borne avirulence (avr) genes were previously cloned from strain XcmH of the cotton pathogen, Xanthomonas campestris pv. malvacearum. We have now localized all six avr genes on the cloned fragments by subcloning and Tn5-gusA insertional mutagenesis. None of these avr genes appeared to exhibit exclusively gene-for-gene patterns of interactions with cotton R genes, and avrB4 was demonstrated to confer avr gene-for-R genes (plural) avirulence to X. c. pv. malvacearum on congenic cotton lines carrying either of two different resistance loci, B1 or B4. Furthermore, the B1 locus appeared to confer R gene-for-avr genes resistance to cotton against isogenic X. c. pv. malvacearum strains carrying any one of three avr genes: avrB4, avrb6, or avrB102. Restriction enzyme, Southern blot hybridization, and DNA sequence analyses showed that the XcmH avr genes are all highly similar to each other, to avrBs3 and avrBsP from the pepper pathogen X. c. pv. vesicatoria, and to the host-specific virulence gene pthA from the citrus pathogen X. citri. The XcmH avr genes differed primarily in the multiplicity of a tandemly repeated 102-base pair motif within the central portions of the genes, repeated from 14 to 23 times in members of this gene family. The complete nucleotide sequence of avrb6 revealed that it is 97% identical in DNA sequence to avrB4, avrBs3, avrBsP, and pthA and that 62-bp inverted terminal repeats mark the boundaries of homology between avrb6 and all members of this Xanthomonas virulence/avirulence gene family sequenced to date. The terminal 38 bp of both inverted repeats are highly similar to the 38-bp consensus terminal sequence of the Tn3 family of transposons. Up to 11 members of the avr gene family appear to be present in North American strains of X. c. pv. malvacearum, including XcmH. 
The high level of homology observed among these avr genes and their presence in multiple copies may explain the gene-for-genes interactions and also the observed high frequencies (10^-3 to 10^-4 per locus) of X. c. pv. malvacearum race change mutations. Five spontaneous race change mutants of XcmH suffered avr locus deletions, strongly indicating intergenic recombination as the primary mechanism for generating new races in X. c. pv. malvacearum.
Biological assay using T cell response for Cry-consensus peptide designed for the peptide-based immunotherapy of Japanese cedar pollinosis.

PubMed

Kozutsumi, Daisuke; Tsunematsu, Masako; Yamaji, Taketo; Kino, Kohsuke

2007-01-01

Cry-consensus peptide is a linearly linked peptide of T-cell epitopes for the management of Japanese cedar (JC) pollinosis and is expected to become a new drug for immunotherapy. However, the mechanism of T-cell epitopes in allergic diseases is not well understood, and thus, a simple in vitro procedure for evaluation of its biological activity is desired. Peripheral blood mononuclear cells (PBMC) were isolated from 27 JC pollinosis patients and 10 healthy subjects, and cultured in vitro for 4 days in the presence of Cry-consensus peptide and [3H]thymidine. The relationship between growth stimulation (stimulation index; SI) and antigen-specific IgE levels in serum was also investigated in JC pollinosis patients. Moreover, to confirm the importance of the primary sequence in Cry-consensus peptide, PBMCs were also cultured with heat-treated Cry-consensus peptide or with a mixture of the amino acids of which Cry-consensus peptide is composed, and their [3H]thymidine uptake was compared with that obtained with Cry-consensus peptide. Finally, whether Cry-consensus peptide stimulates PBMCs from healthy subjects was investigated. The mean SI of JC patients showed a good correlation with Cry-consensus peptide concentration in the culture medium; however, the SI was independent of the anti-Cry j 1 IgE level. Heat-denatured Cry-consensus peptide retained a PBMC proliferation stimulatory effect comparable to the original Cry-consensus peptide, while the mixture of amino acids constituting Cry-consensus peptide did not stimulate PBMC proliferation. PBMCs from healthy subjects did not respond to Cry-consensus peptide at all. These data indicate that the PBMC response of patients suffering from JC pollinosis to Cry-consensus peptide is specific for the sequence of T cell epitopes thereof and may be useful for the evaluation of the efficacy of Cry-consensus peptide in vivo.
Nucleotide sequence of a cluster of early and late genes in a conserved segment of the vaccinia virus genome.

PubMed Central

Plucienniczak, A; Schroeder, E; Zettlmeissl, G; Streeck, R E

1985-01-01

The nucleotide sequence of a 7.6 kb vaccinia DNA segment from a genomic region conserved among different orthopoxviruses has been determined. This segment contains a tight cluster of 12 partly overlapping open reading frames, most of which can be correlated with previously identified early and late proteins and mRNAs. Regulatory signals used by vaccinia virus have been studied. Presumptive promoter regions are rich in A, T and carry the consensus sequences TATA and AATAA spaced at 20-24 base pairs. Tandem repeats of a CTATTC consensus sequence are proposed to be involved in the termination of early transcription. PMID:2987815
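A search for the two promoter consensus sequences at the reported spacing can be sketched with a regular expression. The abstract does not say how the 20-24 base pair spacing is measured or which motif comes first, so this sketch assumes TATA lies upstream and that 20 to 24 bases separate the end of TATA from the start of AATAA. The pattern and function name are hypothetical.

```python
import re

# Assumed layout: TATA, then a 20-24 base gap, then AATAA.
# The lookahead lets overlapping candidates be reported.
PROMOTER_LIKE = re.compile(r"(?=(TATA[ACGT]{20,24}AATAA))")


def find_candidates(dna):
    """Return (start, matched_sequence) for each promoter-like window."""
    return [(m.start(), m.group(1))
            for m in PROMOTER_LIKE.finditer(dna.upper())]
```

Matches found this way are only candidates; in an A+T-rich region such short motifs are expected to occur by chance as well.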
R2R--software to speed the depiction of aesthetic consensus RNA secondary structures.

PubMed

Weinberg, Zasha; Breaker, Ronald R

2011-01-04

With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.
Complete mitochondrial genome of the giant ramshorn snail Marisa cornuarietis (Gastropoda: Ampullariidae).

PubMed

Wang, Mingling; Qiu, Jian-Wen

2016-05-01

We report the complete mitochondrial genome (mitogenome) of the giant ramshorn snail Marisa cornuarietis, a biocontrol agent of freshwater weeds and snail vectors of schistosomes. The mitogenome is 15,923 bp in length, encoding 13 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs. The mitogenome is A+T biased (70.0%), with 28.9% A, 41.1% T, 16.7% G, and 13.3% C. A comparison with Pomacea canaliculata, the other member in the same family (Ampullariidae) with a sequenced mitogenome, shows that the two species have an identical gene order, but their intergenic regions vary substantially in sequence length. The mitogenome data can be used to understand the population genetics of M. cornuarietis, and resolve the phylogenetic relationship of various genera in Ampullariidae.
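The A+T bias reported above is a plain base-composition statistic: 28.9% A plus 41.1% T gives the stated 70.0%. A minimal sketch of that calculation follows, using hypothetical function names and a toy sequence rather than the actual mitogenome.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T among unambiguous bases."""
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T characters")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(seq):
    """Return the combined A+T percentage."""
    comp = base_composition(seq)
    return comp["A"] + comp["T"]


# Toy sequence for illustration only (3 A, 4 T, 2 G, 1 C).
toy = "AAATTTTGGC"
toy_comp = base_composition(toy)
toy_at = at_content(toy)
```

Ambiguity codes such as N are ignored here; that is a design choice of this sketch, not something stated in the abstract.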
Detection and Identification of Gastrointestinal Lactobacillus Species by Using Denaturing Gradient Gel Electrophoresis and Species-Specific PCR Primers

PubMed Central

Walter, J.; Tannock, G. W.; Tilsala-Timisjarvi, A.; Rodtong, S.; Loach, D. M.; Munro, K.; Alatossava, T.

2000-01-01

Denaturing gradient gel electrophoresis (DGGE) of DNA fragments obtained by PCR amplification of the V2-V3 region of the 16S rRNA gene was used to detect the presence of Lactobacillus species in the stomach contents of mice. Lactobacillus isolates cultured from human and porcine gastrointestinal samples were identified to the species level by using a combination of DGGE and species-specific PCR primers that targeted 16S-23S rRNA intergenic spacer region or 16S rRNA gene sequences. The identifications obtained by this approach were confirmed by sequencing the V2-V3 region of the 16S rRNA gene and by a BLAST search of the GenBank database. PMID:10618239
Detection of Marteilia refringens using nested PCR and in situ hybridisation in Chamelea gallina from the Balearic Islands (Spain).

PubMed

López-Flores, Inmaculada; Robles, Francisca; Valencia, José M; Grau, Amalia; Villalba, Antonio; de la Herrán, Roberto; Garrido-Ramos, Manuel A; Ruiz-Rejón, Carmelo; Ruiz-Rejón, Manuel; Navas, José I

2008-10-16

In the course of a histopathological survey performed to discover the cause of mass mortality of the striped clam Chamelea gallina in the Balearic Islands (Spain, Mediterranean Sea), we detected a Marteilia-like parasite in 3 clams. Molecular methods were applied to identify the parasite. DNA extracted from a paraffin block was used to carry out a PCR assay for Marteilia refringens detection based on a rDNA sequence of the parasite (the intergenic spacer of ribosomal genes, IGS). The nucleotide sequence of the IGS amplified fragment and the positive signal obtained by in situ hybridisation analysis with a M. refringens-specific probe allowed us to confirm the presence of this parasite in the digestive gland tissue of C. gallina.
Description of the mitochondrial genome of the tree coral Dendrophyllia arbuscula (Anthozoa, Scleractinia).

PubMed

Luz, Bruna Louise Pereira; Capel, Kátia Cristina Cruz; Stampar, Sérgio Nascimento; Kitahara, Marcelo Visentini

2016-07-01

Dendrophylliidae is one of the few monophyletic families within the Scleractinia that embraces zooxanthellate and azooxanthellate species represented by both solitary and colonial forms. Among the exclusively azooxanthellate genera, Dendrophyllia is reported worldwide from 1 to 1200 m deep. To date, although three complete mitochondrial (mt) genomes from representatives of the family are available, only that from Turbinaria peltata has been formally published. Here we describe the complete nucleotide sequence of the mt genome from Dendrophyllia arbuscula that is 19 069 bp in length and comprises two rDNAs, two tRNAs, and 13 protein-coding genes arranged in the canonical scleractinian mt gene order. No genes overlap, resulting in the presence of 18 intergenic spacers and one of the longest scleractinian mt genomes sequenced to date.
Species identification of mutans streptococci by groESL gene sequence.

PubMed

Hung, Wei-Chung; Tsai, Jui-Chang; Hsueh, Po-Ren; Chia, Jean-San; Teng, Lee-Jene

2005-09-01

The near full-length sequences of the groESL genes were determined and analysed among eight reference strains (serotypes a to h) representing five species of mutans group streptococci. The groES sequences from these reference strains revealed that there are two lengths (285 and 288 bp) in the five species. The intergenic spacer between groES and groEL appears to be a unique marker for species, with a variable size (ranging from 111 to 310 bp) and sequence. Phylogenetic analysis of groES and groEL separated the eight serotypes into two major clusters. Strains of serotypes b, c, e and f were highly related and had groES gene sequences of the same length, 288 bp, while strains of serotypes a, d, g and h were also closely related and their groES gene sequence lengths were 285 bp. The groESL sequences in clinical isolates of three serotypes of S. mutans were analysed for intraspecies polymorphism. The results showed that the groESL sequences could provide information for differentiation among species, but were unable to distinguish serotypes of the same species. Based on the determined sequences, a PCR assay was developed that could differentiate members of the mutans streptococci by amplicon size and provide an alternative way for distinguishing mutans streptococci from other viridans streptococci.

High-resolution definition of the Vibrio cholerae essential gene set with hidden Markov model–based analyses of transposon-insertion sequencing data

PubMed Central

Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.

2013-01-01

The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011
Determination of locust bean gum and guar gum by polymerase chain reaction and restriction fragment length polymorphism analysis.

PubMed

Meyer, K; Rosa, C; Hischenhuber, C; Meyer, R

2001-01-01

A polymerase chain reaction (PCR) was developed to differentiate the thickening agents locust bean gum (LBG) and the cheaper guar gum in finished food products. Universal primers for amplification of the intergenic spacer region between trnL 3' (UAA) exon and trnF (GAA) gene in the chloroplast (cp) genome and subsequent restriction analysis were applied to differentiate guar gum and LBG. The presence of <5% (w/w) guar gum powder added to LBG powder was detectable. Based on data obtained from sequencing this intergenic spacer region, a second PCR method for the specific detection of guar gum DNA was also developed. This assay detected guar gum powder in LBG in amounts as low as 1% (w/w). Both methods successfully detected guar gum and/or LBG in ice cream stabilizers and in foodstuffs, such as dairy products, ice cream, dry seasoning mixes, a finished roasting sauce, and a fruit jelly product, but not in products with highly degraded DNA, such as tomato ketchup and sterilized chocolate cream. Both methods detected guar gum and LBG in ice cream and fresh cheese at levels <0.1%.
Explaining the disease phenotype of intergenic SNP through predicted long range regulation

PubMed Central

Chen, Jingqi; Tian, Weidong

2016-01-01

Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or close to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it is likely that IGR daSNPs contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978
Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions

PubMed Central

Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.

2017-01-01

The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401
Using a color-coded ambigraphic nucleic acid notation to visualize conserved palindromic motifs within and across genomes

PubMed Central

2014-01-01

Background Ambiscript is a graphically-designed nucleic acid notation that uses symbol symmetries to support sequence complementation, highlight biologically-relevant palindromes, and facilitate the analysis of consensus sequences. Although the original Ambiscript notation was designed to easily represent consensus sequences for multiple sequence alignments, the notation’s black-on-white ambiguity characters are unable to reflect the statistical distribution of nucleotides found at each position. We now propose a color-augmented ambigraphic notation to encode the frequency of positional polymorphisms in these consensus sequences. Results We have implemented this color-coding approach by creating an Adobe Flash® application (http://www.ambiscript.org) that shades and colors modified Ambiscript characters according to the prevalence of the encoded nucleotide at each position in the alignment. The resulting graphic helps viewers perceive biologically-relevant patterns in multiple sequence alignments by uniquely combining color, shading, and character symmetries to highlight palindromes and inverted repeats in conserved DNA motifs. Conclusion Juxtaposing an intuitive color scheme over the deliberate character symmetries of an ambigraphic nucleic acid notation yields a highly-functional nucleic acid notation that maximizes information content and successfully embodies key principles of graphic excellence put forth by the statistician and graphic design theorist, Edward Tufte. PMID:24447494
Defining a Conformational Consensus Motif in Cotransin-Sensitive Signal Sequences: A Proteomic and Site-Directed Mutagenesis Study

PubMed Central

Klein, Wolfgang; Westendorf, Carolin; Schmidt, Antje; Conill-Cortés, Mercè; Rutz, Claudia; Blohs, Marcus; Beyermann, Michael; Protze, Jonas; Krause, Gerd; Krause, Eberhard; Schülein, Ralf

2015-01-01

The cyclodepsipeptide cotransin was described to inhibit the biosynthesis of a small subset of proteins by a signal sequence-discriminatory mechanism at the Sec61 protein-conducting channel. However, it was not clear how selective cotransin is, i.e. how many proteins are sensitive. Moreover, a consensus motif in signal sequences mediating cotransin sensitivity has not yet been described. To address these questions, we performed a proteomic study using cotransin-treated human hepatocellular carcinoma cells and the stable isotope labelling by amino acids in cell culture technique in combination with quantitative mass spectrometry. We used a saturating concentration of cotransin (30 micromolar) to also identify less-sensitive proteins and to discriminate the latter from completely resistant proteins. We found that the biosynthesis of almost all secreted proteins was cotransin-sensitive under these conditions. In contrast, biosynthesis of the majority of the integral membrane proteins was cotransin-resistant. Cotransin sensitivity of signal sequences was neither related to their length nor to their hydrophobicity. Instead, in the case of signal anchor sequences, we identified for the first time a conformational consensus motif mediating cotransin sensitivity. PMID:25806945
Acute Chagas outbreaks: molecular and biological features of Trypanosoma cruzi isolates, and clinical aspects of acute cases in Santander, Colombia.

PubMed

Díaz, Martha Lucía; Leal, Sandra; Mantilla, Julio César; Molina-Berríos, Alfredo; López-Muñoz, Rodrigo; Solari, Aldo; Escobar, Patricia; González Rugeles, Clara Isabel

2015-11-26

Outbreaks of acute Chagas disease associated with oral transmission are easily detected nowadays with trained health personnel in areas of low endemicity, or in which the vector transmission has been interrupted. Given the biological and genetic diversity of Trypanosoma cruzi, the high morbidity, mortality, and the observed therapeutic failure, new characteristics of these outbreaks need to be addressed at different levels, both in Trypanosoma cruzi and in patient response. The aim of this work was to evaluate the patient's features involved in six outbreaks of acute Chagas disease which occurred in Santander, Colombia, and the characteristics of Trypanosoma cruzi clones isolated from these patients, to establish the potential relationship between the etiologic agent features and host behavior. The clinical, pathological and epidemiological aspects of outbreaks were analyzed. In addition, Trypanosoma cruzi clones were biologically characterized both in vitro and in vivo, and the susceptibility to the classical trypanocidal drugs nifurtimox and benznidazole was evaluated. Trypanosoma cruzi clones were genotyped by means of mini-exon intergenic spacer and cytochrome b genes sequencing. All clones were DTU I, and based on the mini-exon intergenic spacer, belong to two genotypes: G2 related with sub-urban, and G11 with rural outbreaks. Girón outbreak clones with higher susceptibility to drugs presented G2 genotype and C/T transition in Cyt b. The outbreaks affected mainly young population (±25.9 years), and the mortality rate was 10 %. The cardiac tissue showed intense inflammatory infiltrate, myocardial necrosis and abundant amastigote nests. However, although the gastrointestinal tissue was congestive, no inflammation or parasites were observed.
Although all clones belong to DTU I, two intra-DTU genotypes were found with the sequencing of the mini-exon intergenic spacer; however, there is no strict correlation between genetic groups, the cycles of the parasite or the clinical forms of the disease. Trypanosoma cruzi clones from Girón with higher sensitivity to nifurtimox presented a particular G2 genotype and C/T transition in Cyt b. When the diagnosis was early, the patients responded well to antichagasic treatment, which highlights the importance of early diagnosis and treatment to prevent fatal outcomes associated with these acute episodes.
Comparison of four polymerase chain reaction assays for the detection of Brucella spp. in clinical samples from dogs

PubMed Central

Boeri, Eduardo J.; Wanke, María M.; Madariaga, María J.; Teijeiro, María L.; Elena, Sebastian A.; Trangoni, Marcos D.

2018-01-01

Aim: This study aimed to compare the sensitivity (S), specificity (Sp), and positive likelihood ratios (LR+) of four polymerase chain reaction (PCR) assays for the detection of Brucella spp. in canine clinical samples. Materials and Methods: A total of 595 samples of whole blood, urine, and genital fluids were evaluated between October 2014 and November 2016. To compare PCR assays, the gold standard was defined using a combination of different serological and microbiological tests. Bacterial isolation from urine and blood cultures was carried out. Serological methods such as rapid slide agglutination test, indirect enzyme-linked immunosorbent assay, agar gel immunodiffusion test, and buffered plate antigen test were performed. Four genes were evaluated: (i) The gene coding for the BCSP31 protein, (ii) the ribosomal gene coding for the 16S-23S intergenic spacer region, (iii) the gene coding for porins omp2a/omp2b, and (iv) the gene coding for the insertion sequence IS711. Results: The results obtained were as follows: (1) For the primers that amplify the gene coding for the BCSP31 protein: S: 45.64% (confidence interval [CI] 39.81-51.46), Sp: 95.62% (CI 93.13-98.12), and LR+: 10.43 (CI 6.04-18); (2) for the primers that amplify the ribosomal gene of the 16S-23S rDNA intergenic spacer region: S: 69.80% (CI 64.42-75.18), Sp: 95.62 % (CI 93.13-98.12), and LR+: 11.52 (CI 7.31-18.13); (3) for the primers that amplify the omp2a and omp2b genes: S: 39.26% (CI 33.55-44.97), Sp: 97.31% (CI 95.30-99.32), and LR+ 14.58 (CI 7.25-29.29); and (4) for the primers that amplify the insertion sequence IS711: S: 22.82% (CI 17.89 - 27.75), Sp: 99.66% (CI 98.84-100), and LR+ 67.77 (CI 9.47-484.89). Conclusion: We concluded that the gene coding for the 16S-23S rDNA intergenic spacer region was the one that best detected Brucella spp. in canine clinical samples. PMID:29657404
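The three statistics compared above are linked by the standard definition LR+ = S / (1 - Sp). The sketch below applies that definition to the rounded percentages quoted for the BCSP31 and omp2a/omp2b assays; the function name is hypothetical, and small differences from the reported LR+ values are to be expected if the authors worked from unrounded counts, which is an assumption here.

```python
def positive_likelihood_ratio(sensitivity, specificity):
    """LR+ = sensitivity / (1 - specificity), with both given as fractions."""
    if not 0.0 <= sensitivity <= 1.0:
        raise ValueError("sensitivity must lie in [0, 1]")
    if not 0.0 <= specificity < 1.0:
        raise ValueError("specificity must lie in [0, 1) for a finite LR+")
    return sensitivity / (1.0 - specificity)


# Rounded values quoted in the abstract, converted from percent to fractions.
lr_bcsp31 = positive_likelihood_ratio(0.4564, 0.9562)  # reported LR+: 10.43
lr_omp2 = positive_likelihood_ratio(0.3926, 0.9731)    # reported LR+: 14.58
```

Because the denominator is 1 - Sp, a small change in specificity near 100% moves LR+ a great deal, which is consistent with the very wide interval quoted for the IS711 assay.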
Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

PubMed Central

Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

2013-01-01

Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
Computational RNomics of Drosophilids

PubMed Central

Rose, Dominic; Hackermüller, Jörg; Washietl, Stefan; Reiche, Kristin; Hertel, Jana; Findeiß, Sven; Stadler, Peter F; Prohaska, Sonja J

2007-01-01

Background Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set for comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. Results We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection acting on RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. Conclusion The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions.
The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have a significantly smaller complement of independent structured ncRNAs compared to mammals. PMID:17996037
Analysis of capsid portal protein and terminase functional domains: interaction sites required for DNA packaging in bacteriophage T4.

PubMed

Lin, H; Rao, V B; Black, L W

1999-06-04

Bacteriophage DNA packaging results from an ATP-driven translocation of concatemeric DNA into the prohead by the phage terminase complexed with the portal vertex dodecamer of the prohead. Functional domains of the bacteriophage T4 terminase and portal gene 20 product (gp20) were determined by mutant analysis and sequence localization within the structural genes. Interaction regions of the portal vertex and large terminase subunit (gp17) were determined by genetic (terminase-portal intergenic suppressor mutations), biochemical (column retention of gp17 and inhibition of in vitro DNA packaging by gp20 peptides), and immunological (co-immunoprecipitation of polymerized gp20 peptide and gp17) studies. The specificity of the interaction was tested by means of a phage T4 HOC (highly antigenic outer capsid protein) display system in which wild-type, cs20, and scrambled portal peptide sequences were displayed on the HOC protein of phage T4. Binding affinities of these recombinant phages as determined by the retention of these phages by a His-tag immobilized gp17 column, and by co-immunoprecipitation with purified terminase supported the specific nature of the portal protein and terminase interaction sites. In further support of specificity, a gp20 peptide corresponding to a portion of the identified site inhibited packaging whereas the scrambled sequence peptide did not block DNA packaging in vitro. The portal interaction site is localized to 28 residues in the central portion of the linear sequence of gp20 (524 residues). As judged by two pairs of intergenic portal-terminase suppressor mutations, two separate regions of the terminase large subunit gp17 (central and COOH-terminal) interact through hydrophobic contacts at the portal site.
Although the terminase apparently interacts with this gp20 portal peptide, polyclonal antibody against the portal peptide appears unable to access it in the native structure, suggesting intimate association of gp20 and gp17 possibly internalizes terminase regions within the portal in the packasome complex. Both similarities and differences are seen in comparison to analogous sites which have been identified in phages T3 and lambda. Copyright 1999 Academic Press.
Recombination Analysis of Herpes Simplex Virus 1 Reveals a Bias toward GC Content and the Inverted Repeat Regions

PubMed Central

Lee, Kyubin; Kolb, Aaron W.; Sverchkov, Yuriy; Cuellar, Jacqueline A.; Craven, Mark

2015-01-01

ABSTRACT Herpes simplex virus 1 (HSV-1) causes recurrent mucocutaneous ulcers and is the leading cause of infectious blindness and sporadic encephalitis in the United States. HSV-1 has been shown to be highly recombinogenic; however, to date, there has been no genome-wide analysis of recombination. To address this, we generated 40 HSV-1 recombinants derived from two parental strains, OD4 and CJ994. The 40 OD4-CJ994 HSV-1 recombinants were sequenced using the Illumina sequencing system, and recombination breakpoints were determined for each of the recombinants using the Bootscan program. Breakpoints occurring in the terminal inverted repeats were excluded from analysis to prevent double counting, resulting in a total of 272 breakpoints in the data set. By placing windows around the 272 breakpoints followed by Monte Carlo analysis comparing actual data to simulated data, we identified a recombination bias toward both high GC content and intergenic regions. A Monte Carlo analysis also suggested that recombination did not appear to be responsible for the generation of the spontaneous nucleotide mutations detected following sequencing. Additionally, kernel density estimation analysis across the genome found that the large, inverted repeats comprise a recombination hot spot. IMPORTANCE Herpes simplex virus 1 (HSV-1) is the leading cause of sporadic encephalitis and blinding keratitis in developed countries. HSV-1 has been shown to be highly recombinogenic, and recombination itself appears to be a significant component of genome replication. To date, there has been no genome-wide analysis of recombination. Here we present the findings of the first genome-wide study of recombination performed by generating and sequencing 40 HSV-1 recombinants derived from the OD4 and CJ994 parental strains, followed by bioinformatics analysis. Recombination breakpoints were determined, yielding 272 breakpoints in the full data set.
Kernel density analysis determined that the large inverted repeats constitute a recombination hot spot. Additionally, Monte Carlo analyses found biases toward high GC content and intergenic and repetitive regions. PMID:25926637
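The window-plus-Monte-Carlo comparison described above can be sketched generically: measure GC content in windows around observed breakpoints, then ask how often randomly placed windows do as well. Everything below is an illustrative assumption (function names, window size, number of simulations, uniform random placement, toy genome); the abstract does not give the authors' actual procedure or parameters.

```python
import random


def window_gc(genome, pos, half_width):
    """GC fraction of the window [pos - half_width, pos + half_width), clipped to the genome."""
    start = max(0, pos - half_width)
    end = min(len(genome), pos + half_width)
    window = genome[start:end]
    return sum(base in "GC" for base in window) / len(window)


def monte_carlo_gc_bias(genome, breakpoints, half_width=50, n_sim=1000, seed=0):
    """Return (observed mean window GC, one-sided empirical p-value).

    Simulated breakpoints are drawn uniformly along the genome, which is
    an assumption of this sketch rather than the published null model.
    """
    rng = random.Random(seed)
    observed = sum(window_gc(genome, p, half_width) for p in breakpoints) / len(breakpoints)
    at_least_as_high = 0
    for _ in range(n_sim):
        sim = [rng.randrange(len(genome)) for _ in breakpoints]
        sim_mean = sum(window_gc(genome, p, half_width) for p in sim) / len(sim)
        if sim_mean >= observed:
            at_least_as_high += 1
    # Add-one correction keeps the empirical p-value strictly positive.
    return observed, (at_least_as_high + 1) / (n_sim + 1)


# Toy genome: an AT-only background with one GC-only block (not HSV-1 data).
toy_genome = "AT" * 500 + "GC" * 100 + "AT" * 500
toy_breakpoints = [1080, 1100, 1120]  # all placed inside the GC block
observed_gc, p_value = monte_carlo_gc_bias(toy_genome, toy_breakpoints)
```

The same skeleton would work for other features, such as the fraction of a window that is intergenic, by swapping out `window_gc`.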
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.

PubMed Central

Davis, C A; Wyatt, G R

1989-01-01

The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. PMID:2762148
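A calculated consensus and an average divergence from it, as reported above for the satellite monomers, can be derived from an alignment in a few lines. This is a minimal sketch under stated assumptions: the monomers are already aligned to equal length, the consensus is a simple per-column majority, and the function names and toy sequences are hypothetical rather than taken from the paper.

```python
from collections import Counter


def majority_consensus(aligned):
    """Per-column majority consensus of equal-length aligned sequences."""
    if not aligned or len({len(s) for s in aligned}) != 1:
        raise ValueError("need a non-empty set of equal-length sequences")
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*aligned))


def mean_divergence(aligned):
    """Average fraction of positions at which each sequence differs from the consensus."""
    consensus = majority_consensus(aligned)
    per_sequence = [
        sum(a != b for a, b in zip(seq, consensus)) / len(consensus)
        for seq in aligned
    ]
    return sum(per_sequence) / len(per_sequence)


# Toy 10-nt "monomers" for illustration only (the real repeat unit is 142 nt).
toy_monomers = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACGAACGTAC",  # one variant position
    "ACGTACGTCC",  # one variant position
]
toy_consensus = majority_consensus(toy_monomers)
toy_divergence = mean_divergence(toy_monomers)
```

Columns with tied counts are resolved arbitrarily by this sketch; a real analysis would more likely report an ambiguity code at such positions.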
Screening a yeast promoter library leads to the isolation of the RP29/L32 and SNR17B/RPL37A divergent promoters and the discovery of a gene encoding ribosomal protein L37.

PubMed

Santangelo, G M; Tornow, J; McLaughlin, C S; Moldave, K

1991-08-30

Two promoters (A7 and A23), isolated at random from the Saccharomyces cerevisiae genome by virtue of their capacity to activate transcription, are identical to known intergenic bidirectional promoters. Sequence analysis of the genomic DNA adjacent to the A7 promoter identified a split gene encoding ribosomal (r) protein L37, which is homologous to the tRNA-binding r-proteins, L35a (from human and rat) and L32 (from frogs).
Identification of multiple sites suitable for insertion of foreign genes in herpes simplex virus genomes.

PubMed

Morimoto, Tomomi; Arii, Jun; Akashi, Hiroomi; Kawaguchi, Yasushi

2009-03-01

Information on sites in HSV genomes at which foreign gene(s) can be inserted without disrupting viral genes or affecting properties of the parental virus are important for basic research on HSV and development of HSV-based vectors for human therapy. The intergenic region between HSV-1 UL3 and UL4 genes has been reported to satisfy the requirements for such an insertion site. The UL3 and UL4 genes are oriented toward the intergenic region and, therefore, insertion of a foreign gene(s) into the region between the UL3 and UL4 polyadenylation signals should not disrupt any viral genes or transcriptional units. HSV-1 and HSV-2 each have more than 10 additional regions structurally similar to the intergenic region between UL3 and UL4. In the studies reported here, it has been demonstrated that insertion of a reporter gene expression cassette into several of the HSV-1 and HSV-2 intergenic regions has no effect on viral growth in cell culture or virulence in mice, suggesting that these multiple intergenic regions may be suitable HSV sites for insertion of foreign genes.
Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars

PubMed Central

Shahin, Arwa; Smulders, Marinus J. M.; van Tuyl, Jaap M.; Arens, Paul; Bakker, Freek T.

2014-01-01

Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628
Conserved sequence-specific lincRNA-steroid receptor interactions drive transcriptional repression and direct cell fate

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hudson, William H.; Pickard, Mark R.; de Vera, Ian Mitchelle S.

2014-12-23

The majority of the eukaryotic genome is transcribed, generating a significant number of long intergenic noncoding RNAs (lincRNAs). Although lincRNAs represent the most poorly understood product of transcription, recent work has shown lincRNAs fulfill important cellular functions. In addition to low sequence conservation, poor understanding of structural mechanisms driving lincRNA biology hinders systematic prediction of their function. Here we report the molecular requirements for the recognition of steroid receptors (SRs) by the lincRNA growth arrest-specific 5 (Gas5), which regulates steroid-mediated transcriptional regulation, growth arrest and apoptosis. We identify the functional Gas5-SR interface and generate point mutations that ablate the SR-Gas5 lincRNA interaction, altering Gas5-driven apoptosis in cancer cell lines. Further, we find that the Gas5 SR-recognition sequence is conserved among haplorhines, with its evolutionary origin as a splice acceptor site. This study demonstrates that lincRNAs can recognize protein targets in a conserved, sequence-specific manner in order to affect critical cell functions.
The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nylund, Stian; Karlsen, Marius; Nylund, Are

2008-03-30

The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae.
An Evaluation of the Genetic Diversity of Xylella fastidiosa Isolated from Diseased Citrus and Coffee in São Paulo, Brazil.

PubMed

Qin, X; Miranda, V S; Machado, M A; Lemos, E G; Hartung, J S

2001-06-01

Strains of Xylella fastidiosa, isolated from sweet orange trees (Citrus sinensis) and coffee trees (Coffea arabica) with symptoms of citrus variegated chlorosis and Requeima do Café, respectively, were indistinguishable based on repetitive extragenic palindromic polymerase chain reaction (PCR) and enterobacterial repetitive intergenic consensus PCR assays. These strains were also indistinguishable with a previously described PCR assay that distinguished the citrus strains from all other strains of Xylella fastidiosa. Because we were not able to document any genomic diversity in our collection of Xylella fastidiosa strains isolated from diseased citrus, the observed gradient of increasing disease severity from southern to northern regions of São Paulo State is unlikely to be due to the presence of significantly different strains of the pathogen in the different regions. When comparisons were made to reference strains of Xylella fastidiosa isolated from other hosts using these methods, four groups were consistently identified that corresponded to the hosts and regions from which the strains originated: citrus and coffee, grapevine and almond, mulberry, and elm, plum, and oak. Independent results from random amplified polymorphic DNA (RAPD) PCR assays were also consistent with these results; however, two of the primers tested in RAPD-PCR were able to distinguish the coffee and citrus strains. Sequence comparisons of a PCR product amplified from all strains of Xylella fastidiosa confirmed the presence of a CfoI polymorphism that can be used to distinguish the citrus strains from all others. The ability to distinguish Xylella fastidiosa strains from citrus and coffee with a PCR-based assay will be useful in epidemiological and etiological studies of this pathogen.
Differences in the Fecal Concentrations and Genetic Diversities of Campylobacter jejuni Populations among Individual Cows in Two Dairy Herds

PubMed Central

Ross, Colleen M.; Pleydell, Eve J.; Muirhead, Richard W.

2012-01-01

Dairy cows have been identified as common carriers of Campylobacter jejuni, which causes many of the human gastroenteritis cases reported worldwide. To design on-farm management practices that control the human infection sourced from dairy cows, the first step is to acquire an understanding of the excretion patterns of the cow reservoir. We monitored the same 35 cows from two dairy farms for C. jejuni excretion fortnightly for up to 12 months. The objective was to examine the concentration of C. jejuni and assess the genetic relationship of the C. jejuni populations excreted by individual cows. Significant differences (P < 0.01) in C. jejuni fecal concentration were observed among the 35 cows, with median concentrations that varied by up to 3.6 log10 · g−1 feces. A total of 36 different genotypes were identified from the 514 positive samples by using enterobacterial repetitive intergenic consensus (ERIC)-PCR. Although 22 of these genotypes were excreted by more than one cow, the analysis of frequencies and distribution of the genotypes by model-based statistics revealed a high degree of individuality in the C. jejuni population in each cow. The observed variation in the frequency of excretion of a genotype among cows and the analysis by multilocus sequence typing (MLST) of these genotypes suggest that excretion of C. jejuni in high numbers is due to a successful adaptation of a particular genotype to a particular cow's gut environment, but that animal-related factors render some individual cows resistant to colonization by particular genotypes. The reasons for differences in C. jejuni colonization of animals warrant further investigation. PMID:22904055

Risk of Transmission of Antimicrobial Resistant Escherichia coli from Commercial Broiler and Free-Range Retail Chicken in India

PubMed Central

Hussain, Arif; Shaik, Sabiha; Ranjan, Amit; Nandanwar, Nishant; Tiwari, Sumeet K.; Majid, Mohammad; Baddam, Ramani; Qureshi, Insaf A.; Semmler, Torsten; Wieler, Lothar H.; Islam, Mohammad A.; Chakravortty, Dipshikha; Ahmed, Niyaz

2017-01-01

Multidrug-resistant Escherichia coli infections are a growing public health concern. This study analyzed the possibility of contamination of commercial poultry meat (broiler and free-range) with pathogenic and/or multi-resistant E. coli in retail chain poultry meat markets in India. We analyzed 168 E. coli isolates from broiler and free-range retail poultry (meat/ceca) sampled over a wide geographical area, for their antimicrobial sensitivity, phylogenetic groupings, virulence determinants, extended-spectrum-β-lactamase (ESBL) genotypes, fingerprinting by Enterobacterial Repetitive Intergenic Consensus (ERIC) PCR and genetic relatedness to human pathogenic E. coli using whole genome sequencing (WGS). The prevalence rates of ESBL-producing E. coli among broiler chicken were: meat 46%; ceca 40%, whereas those for free-range chicken were: meat 15%; ceca 30%. E. coli from broiler and free-range chicken exhibited varied prevalence rates for multi-drug resistance (meat 68%; ceca 64% and meat 8%; ceca 26%, respectively) and extraintestinal pathogenic E. coli (ExPEC) contamination (5 and 0%, respectively). WGS analysis confirmed two globally emergent human pathogenic lineages of E. coli, namely the ST131 (H30-Rx subclone) and ST117 among our poultry E. coli isolates. These results suggest that commercial poultry meat is not only an indirect public health risk by being a possible carrier of non-pathogenic multi-drug resistant (MDR)-E. coli, but could as well be the carrier of human E. coli pathotypes. Further, the free-range chicken appears to carry low risk of contamination with antimicrobial resistant and extraintestinal pathogenic E. coli (ExPEC). Overall, these observations reinforce the understanding that poultry meat in the retail chain could possibly be contaminated by MDR and/or pathogenic E. coli. PMID:29180984
Genome-wide identification and functional prediction of novel and fungi-responsive lincRNAs in Triticum aestivum.

PubMed

Zhang, Hong; Hu, Weiguo; Hao, Jilei; Lv, Shikai; Wang, Changyou; Tong, Wei; Wang, Yajuan; Wang, Yanzhen; Liu, Xinlun; Ji, Wanquan

2016-03-15

Stripe rust (Puccinia striiformis f. sp. tritici; Pst) and powdery mildew (Blumeria graminis f. sp. tritici; Bgt) are important diseases of wheat (Triticum aestivum) worldwide. Increasing evidence suggests that long intergenic ncRNAs (lincRNAs) are developmentally regulated and play important roles in development and stress responses of plants. However, identification of lincRNAs in wheat is still limited compared with functional gene expression. The transcriptome of the hexaploid wheat line N9134 inoculated with the Chinese Pst race CYR31 and Bgt race E09 at 1, 2, and 3 days post-inoculation was recapitulated to detect the lincRNAs. Here, 283 differentially expressed lincRNAs (DE-lincRNAs) were identified from 58218 putative lincRNAs, which account for 31.2% of the transcriptome. Of these, 254 DE-lincRNAs responded to Bgt stress and 52 to Pst stress. Among them, 1328 SnRNP motifs (sm sites) were detected and showed the RRU4-11RR sm site element and consensus RRU1-9VU1-7RR SnRNP motifs, where the total number of uridines was more than 3 but less than 11. Additionally, 101 DE-lincRNAs were predicted as targets of miRNA by psRNATarget, while 5 target mimics were identified using target mimicry search in TAPIR. Taken together, our findings indicate that the lincRNAs of wheat responded to Bgt and Pst stress and played important roles in the spliceosome and in inter-regulation with miRNA. The sm site of wheat showed a more complex construction than that in mammals and model plants. The mass sequence data generated in this study provide a cue for future functional and molecular research on wheat-fungus interactions.
Genomic Heterogeneity and O-Antigenic Diversity of Campylobacter upsaliensis and Campylobacter helveticus Strains Isolated from Dogs and Cats in Germany

PubMed Central

Moser, I.; Rieksneuwöhner, B.; Lentzsch, P.; Schwerk, P.; Wieler, L. H.

2001-01-01

A serotyping scheme based on heat-stable surface antigens was established for 101 Campylobacter upsaliensis and 10 Campylobacter helveticus strains isolated from 261 dogs and 46 cats of different ages originating from two geographically distinct regions in Germany. The prevalence of C. upsaliensis varied between 27.8% in juvenile dogs (<12 months of age) and 55.4% in adult dogs (P < 0.05). Of the cats, 19.6% harbored C. upsaliensis, whereas 21.7% carried C. helveticus. Of the C. upsaliensis isolates from both host species, 93.1% belonged to five different serogroups, two of them being prevalent at rates of 47.5 and 27.7%, with different frequencies in both regions. Six (54.6%) of the C. helveticus isolates also belonged to serotypes found among C. upsaliensis strains, whereas five (45.4%) possessed an O antigen unique for C. helveticus. In contrast, a considerable degree of genomic diversity of the isolates was assessed by macrorestriction analyses with the endonucleases SmaI and XhoI, using pulsed-field gel electrophoresis as well as enterobacterial repetitive intergenic consensus sequence PCR (ERIC PCR). Restriction with SmaI pointed towards the existence of clonal groups associated to some extent with serotypes, while restriction with XhoI disintegrated these groups to smaller noncoherent subgroups. Analysis of ERIC PCR profiles did not exhibit any associations with serotypes. In conclusion these data demonstrate the genomic heterogeneity among C. upsaliensis strains and indicate that the combination of SmaI restriction with serotyping is a useful tool to investigate the expansion of clonal groups of C. upsaliensis. PMID:11427567
Molecular studies on larvae of Pseudoterranova parasite of Trichiurus lepturus Linnaeus, 1758 and Pomatomus saltatrix (Linnaeus, 1766) off Brazilian waters.

PubMed

Borges, Juliana N; Cunha, Luiz F G; Miranda, Daniele F; Monteiro-Neto, Cassiano; Santos, Cláudia P

2015-12-01

Pseudoterranova larvae parasitizing cutlassfish Trichiurus lepturus and bluefish Pomatomus saltatrix from the Southwest Atlantic coast of Brazil were studied in this work by morphological, ultrastructural and molecular approaches. Genetic analyses were performed for the ITS2 intergenic region specific for Pseudoterranova decipiens, the partial 28S (LSU) of ribosomal DNA and the mtDNA cox-1 region. We obtained results for the 28S region and mtDNA cox-1, which were amplified using the polymerase chain reaction and sequenced to evaluate the phylogenetic relationships between sequences of this study and sequences from GenBank. The morphological profile indicated that all nine specimens collected from both fish were L3 larvae of Pseudoterranova sp. The genetic profile confirmed the generic level, but due to the absence of similar sequences for adult parasites in GenBank for the regions amplified, it was not possible to identify them to the species level. The sequences obtained showed 89% similarity with Pseudoterranova decipiens (28S sequences) and Contracaecum osculatum B (mtDNA cox-1). The low similarity, together with the fact that amplification with the primer specific for P. decipiens did not occur, leads us to conclude that our sequences do not belong to the P. decipiens complex.
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TransGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TransGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply that, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture of the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
Mitochondrial sequencing reveals five separate origins of 'black' Apis mellifera (Hymenoptera: Apidae) in eastern Australian commercial colonies.

PubMed

Oxley, P R; Oldroyd, B P

2009-04-01

Establishment of a closed population honey bee, Apis mellifera L. (Hymenoptera: Apidae), breeding program based on 'black' strains has been proposed for eastern Australia. Long-term success of such a program requires a high level of genetic variance. To determine the likely extent of genetic variation available, 50 colonies from 11 different commercial apiaries were sequenced in the mitochondrial cytochrome oxidase I and II intergenic region. Five distinct and novel mitotypes were identified. No colonies were found with the A. mellifera mellifera mitotype, which is often associated with undesirable feral strains. One group of mitotypes was consistent with a caucasica origin, two with carnica, and two with ligustica. The results suggest that there is sufficient genetic diversity to support a breeding program provided all these five sources were pooled.
Genomic maps of lincRNA occupancy reveal principles of RNA-chromatin interactions

PubMed Central

Chu, Ci; Qu, Kun; Zhong, Franklin; Artandi, Steven E.; Chang, Howard Y.

2011-01-01

Long intergenic noncoding RNAs (lincRNAs) are key regulators of chromatin state, yet the nature and sites of RNA-chromatin interaction are mostly unknown. Here we introduce Chromatin Isolation by RNA Purification (ChIRP), where tiling oligonucleotides retrieve specific lincRNAs with bound protein and DNA sequences, which are enumerated by deep sequencing. ChIRP-seq of three lincRNAs reveals that RNA occupancy sites in the genome are focal, sequence-specific, and numerous. Drosophila roX2 RNA occupies male X-linked gene bodies with increasing tendency toward the 3’ end, peaking at CES sites. Human telomerase RNA TERC occupies telomeres and Wnt pathway genes. HOTAIR lincRNA preferentially occupies a GA-rich DNA motif to nucleate broad domains of Polycomb occupancy and histone H3 lysine 27 trimethylation. HOTAIR occupancy occurs independently of EZH2, suggesting the order of RNA guidance of Polycomb occupancy. ChIRP-seq is generally applicable to illuminate the intersection of RNA and chromatin with newfound precision genome-wide. PMID:21963238
Ub-ISAP: a streamlined UNIX pipeline for mining unique viral vector integration sites from next generation sequencing data.

PubMed

Kamboj, Atul; Hallwirth, Claus V; Alexander, Ian E; McCowage, Geoffrey B; Kramer, Belinda

2017-06-17

The analysis of viral vector genomic integration sites is an important component in assessing the safety and efficiency of patient treatment using gene therapy. Alongside this clinical application, integration site identification is a key step in the genetic mapping of viral elements in mutagenesis screens that aim to elucidate gene function. We have developed a UNIX-based vector integration site analysis pipeline (Ub-ISAP) that automates integration site identification and annotation for both single and paired-end sequencing reads. Reads that contain viral sequences of interest are selected and aligned to the host genome, and unique integration sites are then classified as transcription start site-proximal, intragenic or intergenic. Ub-ISAP provides a reliable and efficient pipeline to generate large datasets for assessing the safety and efficiency of integrating vectors in clinical settings, with broader applications in cancer research. Ub-ISAP is available as an open source software package at https://sourceforge.net/projects/ub-isap/.
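The final classification step described in this abstract can be illustrated with a minimal Python sketch. It is not Ub-ISAP code: the `Gene` record, the `classify_site` function, the 1,000 bp window around the transcription start site, and the rule that TSS-proximal takes precedence over intragenic are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene annotation record (0-based, end-exclusive)."""
    chrom: str
    start: int
    end: int
    strand: str  # '+' or '-'

    @property
    def tss(self) -> int:
        # The transcription start site sits at the 5' end of the gene.
        return self.start if self.strand == "+" else self.end - 1


def classify_site(chrom, pos, genes, tss_window=1000):
    """Label one integration site; the window size is an assumed default."""
    on_chrom = [g for g in genes if g.chrom == chrom]
    # Assumed precedence: proximity to any TSS is checked first.
    if any(abs(pos - g.tss) <= tss_window for g in on_chrom):
        return "TSS-proximal"
    if any(g.start <= pos < g.end for g in on_chrom):
        return "intragenic"
    return "intergenic"


genes = [Gene("chr1", 10000, 20000, "+"), Gene("chr1", 50000, 60000, "-")]
for pos in (10500, 15000, 30000):
    print(pos, classify_site("chr1", pos, genes))
```

A real pipeline would read annotations from a GTF or BED file and use an interval index rather than a linear scan; the list comprehension here only keeps the logic visible.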
Chromosomal arrangement of leghemoglobin genes in soybean.

PubMed Central

Lee, J S; Brown, G G; Verma, D P

1983-01-01

A cluster of four different leghemoglobin (Lb) genes was isolated from AluI-HaeIII and EcoRI genomic libraries of soybean in a set of overlapping clones which together include 45 kilobases (kb) of contiguous DNA. These four genes, including a pseudogene, are present in the same orientation and are arranged in the order: 5'-Lba-Lbc1-Lb psi-Lbc3-3'. The intergenic regions average 2.5 kb. In addition to this main Lb locus, there are other Lb genes which do not appear to be contiguous to this locus. A sequence probably common to the 3' region of Lb loci was found flanking the Lbc3 gene. The 3' flanking region of the main Lb locus also contains a sequence that appears to be expressed more abundantly in root tissue. Another sequence which is primarily expressed in root and leaf is found 5' to two Lb loci. Overall, the main leghemoglobin locus is similar in structure to the mammalian globin gene loci. PMID:6310504
The recX gene product is involved in the SOS response in Herbaspirillum seropedicae.

PubMed

Galvão, Carolina W; Pedrosa, Fábio O; Souza, Emanuel M; Yates, M Geoffrey; Chubatsu, Leda S; Steffens, Maria Berenice R

2003-02-01

The recA and the recX genes of Herbaspirillum seropedicae were sequenced. The recX is located 359 bp downstream from recA. Sequence analysis indicated the presence of a putative operator site overlapping a probable sigma70-dependent promoter upstream of recA and a transcription terminator downstream from recX, with no apparent promoter sequence in the intergenic region. Transcriptional analysis using lacZ promoter fusions indicated that recA expression increased three- to fourfold in the presence of methyl methanesulfonate (MMS). The roles of recA and recX genes in the SOS response were determined from studies of chromosomal mutants. The recA mutant showed the highest sensitivity to MMS and UV, and the recX mutant had an intermediate sensitivity, compared with the wild type (SMR1), confirming the essential role of the RecA protein in cell viability in the presence of mutagenic agents and also indicating a role for RecX in the SOS response.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

PubMed

Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

2016-12-01

Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging from 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina. This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.

PubMed

Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel

2013-09-01

RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Rice MEL2, the RNA recognition motif (RRM) protein, binds in vitro to meiosis-expressed genes containing U-rich RNA consensus sequences in the 3'-UTR.

PubMed

Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi

2015-10-01

Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is an RRM protein that functions in the properly timed transition to meiosis. The MEL2 RRM preferentially associated with the U-rich RNA consensus, UUAGUU[U/A][U/G][A/U/G]U, in a sequence-dependent manner and in proportion to the amount of MEL2 protein in vitro. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency for the MEL2-binding consensus to appear in the 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes were spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes with a presumed function in meiosis, such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis.
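The consensus quoted in this abstract, UUAGUU[U/A][U/G][A/U/G]U, maps directly onto a regular expression. The sketch below shows one way a 3'-UTR sequence might be scanned for it. The function name `find_mel2_sites` and the example sequences are hypothetical, and this is not the survey method used by the authors.

```python
import re

# Bracketed alternatives in the published consensus become character classes.
# The lookahead wrapper lets overlapping occurrences be reported as well.
MEL2_CONSENSUS = re.compile(r"(?=(UUAGUU[UA][UG][AUG]U))")


def find_mel2_sites(seq):
    """Return (0-based start, matched 10-mer) pairs for an RNA or DNA string."""
    rna = seq.upper().replace("T", "U")  # accept cDNA-style input too
    return [(m.start(), m.group(1)) for m in MEL2_CONSENSUS.finditer(rna)]


print(find_mel2_sites("GGUUAGUUUUAUCC"))
print(find_mel2_sites("ttagttaggt"))
```

Matching a string pattern says nothing about the looped RNA structure that the abstract associates with binding, so hits from such a scan are candidates only.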
R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

PubMed Central

2011-01-01

Background With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file. PMID:21205310
Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential

PubMed Central

Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael

2013-01-01

Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic (often coding) DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328
Sequence of rat alpha- and gamma-casein mRNAs: evolutionary comparison of the calcium-dependent rat casein multigene family.

PubMed Central

Hobbs, A A; Rosen, J M

1982-01-01

The complete sequences of rat alpha- and gamma-casein mRNAs have been determined. The 1402-nucleotide alpha- and 864-nucleotide gamma-casein mRNAs both encode 15 amino acid signal peptides and mature proteins of 269 and 164 residues, respectively. Considerable homology between the 5' non-coding regions, and the regions encoding the signal peptides and the phosphorylation sites, in these mRNAs as compared to several other rodent casein mRNAs, was observed. Significant homology was also detected between rat alpha- and bovine alpha s1-casein. Comparison of the rodent and bovine sequences suggests that the caseins evolved at about the time of the appearance of the primitive mammals. This may have occurred by intragenic duplication of a nucleotide sequence encoding a primitive phosphorylation site, -(Ser)n-Glu-Glu-, and intergenic duplication resulting in the small casein multigene family. A unique feature of the rat alpha-casein sequence is an insertion in the coding region containing 10 repeated elements of 18 nucleotides each. This insertion appears to have occurred 7-12 million years ago, just prior to the divergence of rat and mouse. Images PMID:6298707
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

PubMed

Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas

2013-06-01

We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
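The accuracy figure quoted above can be put on the Phred quality scale that sequencing quality values conventionally use. The following is a minimal sketch, not part of HGAP itself; the function names and the 5 Mb genome length are illustrative assumptions.

```python
import math


def accuracy_to_phred(accuracy: float) -> float:
    """Convert a per-base accuracy (0 < accuracy < 1) to a Phred-scaled quality value."""
    error_rate = 1.0 - accuracy
    return -10.0 * math.log10(error_rate)


def expected_errors(genome_length: int, accuracy: float) -> float:
    """Expected number of erroneous bases at a given per-base accuracy."""
    return genome_length * (1.0 - accuracy)


# 99.999% accuracy corresponds to roughly one error per 100,000 bases.
print(round(accuracy_to_phred(0.99999), 1))
# Hypothetical 5 Mb microbial genome, chosen only for illustration.
print(round(expected_errors(5_000_000, 0.99999)))
```

Under these assumptions, a consensus exceeding 99.999% accuracy would leave on the order of tens of errors in a genome of a few megabases.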
Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Priming Glycosyltransferase Genes of the Exopolysaccharide Locus in Strains of the Lactobacillus casei Group

PubMed Central

Provencher, Cathy; LaPointe, Gisèle; Sirois, Stéphane; Van Calsteren, Marie-Rose; Roy, Denis

2003-01-01

A primer design strategy named CODEHOP (consensus-degenerate hybrid oligonucleotide primer) for amplification of distantly related sequences was used to detect the priming glycosyltransferase (GT) gene in strains of the Lactobacillus casei group. Each hybrid primer consisted of a short 3′ degenerate core based on four highly conserved amino acids and a longer 5′ consensus clamp region based on six sequences of the priming GT gene products from exopolysaccharide (EPS)-producing bacteria. The hybrid primers were used to detect the priming GT gene of 44 commercial isolates and reference strains of Lactobacillus rhamnosus, L. casei, Lactobacillus zeae, and Streptococcus thermophilus. The priming GT gene was detected in the genome of both non-EPS-producing (EPS−) and EPS-producing (EPS+) strains of L. rhamnosus. The sequences of the cloned PCR products were similar to those of the priming GT gene of various gram-negative and gram-positive EPS+ bacteria. Specific primers designed from the L. rhamnosus RW-9595M GT gene were used to sequence the end of the priming GT gene in selected EPS+ strains of L. rhamnosus. Phylogenetic analysis revealed that Lactobacillus spp. form a distinctive group apart from other lactic acid bacteria for which GT genes have been characterized to date. Moreover, the sequences show a divergence existing among strains of L. rhamnosus with respect to the terminal region of the priming GT gene. Thus, the PCR approach with consensus-degenerate hybrid primers designed with CODEHOP is a practical approach for the detection of similar genes containing conserved motifs in different bacterial genomes. PMID:12788729
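The trade-off behind the short degenerate core can be illustrated by counting how many distinct codon combinations encode a four-residue motif. This is a simplified sketch and not the CODEHOP program's own scoring: it counts every codon of the standard genetic code, and the example motif block `WMDYGK` is hypothetical.

```python
# Number of codons encoding each amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def core_degeneracy(motif: str) -> int:
    """Upper bound on the number of distinct DNA sequences encoding a peptide motif."""
    degeneracy = 1
    for residue in motif.upper():
        degeneracy *= CODON_COUNTS[residue]
    return degeneracy


def least_degenerate_window(block: str, width: int = 4):
    """Return (window, degeneracy) for the least degenerate window in a conserved block."""
    windows = [block[i:i + width] for i in range(len(block) - width + 1)]
    best = min(windows, key=core_degeneracy)
    return best, core_degeneracy(best)


# Hypothetical conserved block, used only to show the calculation.
print(least_degenerate_window("WMDYGK"))
```

Windows rich in Trp and Met (one codon each) keep the core's degeneracy low, which is why such residues are attractive anchors for a degenerate 3′ end.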
Transposable elements (TEs) contribute to stress-related long intergenic noncoding RNAs in plants.

PubMed

Wang, Dong; Qu, Zhipeng; Yang, Lan; Zhang, Qingzhu; Liu, Zhi-Hong; Do, Trung; Adelson, David L; Wang, Zhen-Yu; Searle, Iain; Zhu, Jian-Kang

2017-04-01

Noncoding RNAs have been extensively described in plant and animal transcriptomes by using high-throughput sequencing technology. Of these noncoding RNAs, a growing number of long intergenic noncoding RNAs (lincRNAs) have been described in multicellular organisms; however, the origins and functions of many lincRNAs remain to be explored. In many eukaryotic genomes, transposable elements (TEs) are widely distributed and often account for large fractions of plant and animal genomes, yet the contribution of TEs to lincRNAs is largely unknown. By using strand-specific RNA-sequencing, we profiled the expression patterns of lincRNAs in Arabidopsis, rice and maize, and identified 47, 611 and 398 TE-associated lincRNAs (TE-lincRNAs), respectively. TE-lincRNAs were more often derived from retrotransposons than DNA transposons, and as retrotransposon copy number increased in both rice and maize genomes, so did the number of TE-lincRNAs. We validated the expression of these TE-lincRNAs by strand-specific RT-PCR and also demonstrated tissue-specific transcription and stress-induced TE-lincRNAs after salt, abscisic acid (ABA) or cold treatments. For Arabidopsis TE-lincRNA11195, mutants had reduced sensitivity to ABA as demonstrated by longer roots and higher shoot biomass when compared to wild-type. Finally, by altering the chromatin state in the Arabidopsis chromatin remodelling mutant ddm1, unique lincRNAs including TE-lincRNAs were generated from the preceding untranscribed regions and interestingly inherited in a wild-type background in subsequent generations. Our findings not only demonstrate that TE-associated lincRNAs play important roles in plant abiotic stress responses but also suggest that lincRNAs and TE-lincRNAs might act as an adaptive reservoir in eukaryotes. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

Cross-species amplification of mitochondrial DNA sequence-tagged-site markers in conifers: the nature of polymorphism and variation within and among species in Picea.

PubMed

Jaramillo-Correa, J P; Bousquet, J; Beaulieu, J; Isabel, N; Perron, M; Bouillé, M

2003-05-01

Primers previously developed to amplify specific non-coding regions of the mitochondrial genome in Angiosperms, and new primers for additional non-coding mtDNA regions, were tested for their ability to direct DNA amplification in 12 conifer taxa and to detect sequence-tagged-site (STS) polymorphisms within and among eight species in Picea. Out of 12 primer pairs, nine were successful at amplifying mtDNA in most of the taxa surveyed. In conifers, indels and substitutions were observed for several loci, allowing them to distinguish between families, genera and, in some cases, between species within genera. In Picea, interspecific polymorphism was detected for four loci, while intraspecific variation was observed for three of the mtDNA regions studied. One of these (SSU rRNA V1 region) exhibited indel polymorphisms, and the two others (nad1 intron b/c and nad5 intron1) revealed restriction differences after digestion with Sau3AI (PCR-RFLP). A fourth locus, the nad4L-orf25 intergenic region, showed a multibanding pattern for most of the spruce species, suggesting a possible gene duplication. Maternal inheritance, expected for mtDNA in conifers, was observed for all polymorphic markers except the intergenic region nad4L-orf25. Pooling of the variation observed with the remaining three markers resulted in two to six different mtDNA haplotypes within the different species of Picea. Evidence for intra-genomic recombination was observed in at least two taxa. Thus, these mitotypes are likely to be more informative than single-locus haplotypes. They should be particularly useful for the study of biogeography and the dynamics of hybrid zones.
DNA Polymorphisms and Biocontrol of Bacillus Antagonistic to Citrus Bacterial Canker with Indication of the Interference of Phyllosphere Biofilms

PubMed Central

Huang, Tzu-Pi; Tzeng, Dean Der-Syh; Wong, Amy C. L.; Chen, Chun-Han; Lu, Kuan-Min; Lee, Ya-Huei; Huang, Wen-Di; Hwang, Bing-Fang; Tzeng, Kuo-Ching

2012-01-01

Citrus bacterial canker caused by Xanthomonas axonopodis pv. citri is a devastating disease resulting in significant crop losses in various citrus cultivars worldwide. A biocontrol agent has not been recommended for this disease. To explore the potential of bacilli native to Taiwan to control this disease, Bacillus species with a broad spectrum of antagonistic activity against various phytopathogens were isolated from plant potting mixes, organic compost and the rhizosphere soil. Seven strains TKS1-1, OF3-16, SP4-17, HSP1, WG6-14, TLB7-7, and WP8-12 showing superior antagonistic activity were chosen for biopesticide development. The genetic identity based on 16S rDNA sequences indicated that all seven native strains were close relatives of the B. subtilis group and appeared to be discrete from the B. cereus group. DNA polymorphisms in strains WG6-14, SP4-17, TKS1-1, and WP8-12, as revealed by repetitive sequence-based PCR with the BOXA1R primers were similar to each other, but different from those of the respective Bacillus type strains. However, molecular typing of the strains using either tDNA-intergenic spacer regions or 16S–23S intergenic transcribed spacer regions was unable to differentiate the strains at the species level. Strains TKS1-1 and WG6-14 attenuated symptom development of citrus bacterial canker, which was found to be correlated with a reduction in colonization and biofilm formation by X. axonopodis pv. citri on leaf surfaces. The application of a Bacillus strain TKS1-1 endospore formulation to the leaf surfaces of citrus reduced the incidence of citrus bacterial canker and could prevent development of the disease. PMID:22848728
Isolation and characterization of Treponema phagedenis-like spirochetes from digital dermatitis lesions in Swedish dairy cattle.

PubMed

Pringle, Märit; Bergsten, Christer; Fernström, Lise-Lotte; Höök, Helena; Johansson, Karl-Erik

2008-10-20

Digital dermatitis in cattle is an emerging infectious disease. Ulcerative lesions are typically located on the plantar skin between the heel bulbs and adjacent to the coronet. Spirochetes of the genus Treponema are found in high numbers in the lesions and are likely to be involved in the pathogenesis. The aim of this study was to obtain pure cultures of spirochetes from cattle with digital dermatitis and to describe them further. Tissue samples and swabs from active digital dermatitis lesions were used for culturing. Pure isolates were subjected to molecular typing through 16S rRNA gene sequencing, pulsed-field gel electrophoresis (PFGE), random amplified polymorphic DNA (RAPD) and an intergenic spacer PCR developed for Treponema spp., as well as API-ZYM and antimicrobial susceptibility tests. The antimicrobial agents used were tiamulin, valnemulin, tylosin, aivlosin, lincomycin and doxycycline. Seven spirochete isolates from five herds were obtained. Both 16S rRNA gene sequences, which were identical except for three polymorphic nucleotide positions, and the intergenic spacer PCR indicated that all isolates were of one as yet unnamed species, most closely related to Treponema phagedenis. The enzymatic profile and antimicrobial susceptibility pattern were also similar for all isolates. However, it was possible to separate the isolates through their PFGE and RAPD banding patterns. This is the first report on isolation of a Treponema sp. from cattle with digital dermatitis in Scandinavia. The phylotype isolated has previously been cultured from samples from cattle in the USA and the UK and is closely related to T. phagedenis. While very similar, the isolates in this study could be differentiated through PFGE and RAPD, indicating that these methods are suitable for subtyping of this phylotype. No antimicrobial resistance could be detected among the tested isolates.
Evolution of the rpoB-psbZ region in fern plastid genomes: notable structural rearrangements and highly variable intergenic spacers

PubMed Central

2011-01-01

Background: The rpoB-psbZ (BZ) region of some fern plastid genomes (plastomes) has been noted to go through considerable genomic changes. Unraveling its evolutionary dynamics across all fern lineages will help clarify the fundamental process shaping fern plastome structure and organization. Results: A total of 24 fern BZ sequences were investigated with taxon sampling covering all the extant fern orders. We found that: (i) a tree fern Plagiogyria japonica contained a novel gene order that can be generated from either the ancestral Angiopteris type or the derived Adiantum type via a single inversion; (ii) the trnY-trnE intergenic spacer (IGS) of the filmy fern Vandenboschia radicans was expanded 3-fold due to tandem 27-bp repeats which showed strong sequence similarity with the anticodon domain of trnY; (iii) the trnY-trnE IGSs of two horsetail ferns Equisetum ramosissimum and E. arvense underwent an unprecedented 5-kb long expansion, more than a quarter of which consisted of a single type of direct repeat also relevant to the trnY anticodon domain; and (iv) ycf66 has been independently lost at least four times in ferns. Conclusions: Our results provided fresh insights into the evolutionary process of fern BZ regions. The intermediate BZ gene order was not detected, supporting the view that the Adiantum type was generated by two inversions occurring in pairs. The occurrence of Vandenboschia 27-bp repeats represents the first evidence of partial tRNA gene duplication in fern plastomes. Repeats potentially forming a stem-loop structure play major roles in the expansion of the trnY-trnE IGS. PMID:21486489
Evolution of the rpoB-psbZ region in fern plastid genomes: notable structural rearrangements and highly variable intergenic spacers.

PubMed

Gao, Lei; Zhou, Yuan; Wang, Zhi-Wei; Su, Ying-Juan; Wang, Ting

2011-04-13

The rpoB-psbZ (BZ) region of some fern plastid genomes (plastomes) has been noted to go through considerable genomic changes. Unraveling its evolutionary dynamics across all fern lineages will help clarify the fundamental process shaping fern plastome structure and organization. A total of 24 fern BZ sequences were investigated with taxon sampling covering all the extant fern orders. We found that: (i) a tree fern Plagiogyria japonica contained a novel gene order that can be generated from either the ancestral Angiopteris type or the derived Adiantum type via a single inversion; (ii) the trnY-trnE intergenic spacer (IGS) of the filmy fern Vandenboschia radicans was expanded 3-fold due to tandem 27-bp repeats which showed strong sequence similarity with the anticodon domain of trnY; (iii) the trnY-trnE IGSs of two horsetail ferns Equisetum ramosissimum and E. arvense underwent an unprecedented 5-kb long expansion, more than a quarter of which consisted of a single type of direct repeat also relevant to the trnY anticodon domain; and (iv) ycf66 has been independently lost at least four times in ferns. Our results provided fresh insights into the evolutionary process of fern BZ regions. The intermediate BZ gene order was not detected, supporting the view that the Adiantum type was generated by two inversions occurring in pairs. The occurrence of Vandenboschia 27-bp repeats represents the first evidence of partial tRNA gene duplication in fern plastomes. Repeats potentially forming a stem-loop structure play major roles in the expansion of the trnY-trnE IGS.
Detection and Analysis of Six Lizard Adenoviruses by Consensus Primer PCR Provides Further Evidence of a Reptilian Origin for the Atadenoviruses

PubMed Central

Wellehan, James F. X.; Johnson, April J.; Harrach, Balázs; Benkö, Mária; Pessier, Allan P.; Johnson, Calvin M.; Garner, Michael M.; Childress, April; Jacobson, Elliott R.

2004-01-01

A consensus nested-PCR method was designed for investigation of the DNA polymerase gene of adenoviruses. Gene fragments were amplified and sequenced from six novel adenoviruses from seven lizard species, including four species from which adenoviruses had not previously been reported. Host species included Gila monster, leopard gecko, fat-tail gecko, blue-tongued skink, Tokay gecko, bearded dragon, and mountain chameleon. This is the first sequence information from lizard adenoviruses. Phylogenetic analysis indicated that these viruses belong to the genus Atadenovirus, supporting the reptilian origin of atadenoviruses. This PCR method may be useful for obtaining templates for initial sequencing of novel adenoviruses. PMID:15542689
Detection and analysis of six lizard adenoviruses by consensus primer PCR provides further evidence of a reptilian origin for the atadenoviruses.

PubMed

Wellehan, James F X; Johnson, April J; Harrach, Balázs; Benkö, Mária; Pessier, Allan P; Johnson, Calvin M; Garner, Michael M; Childress, April; Jacobson, Elliott R

2004-12-01

A consensus nested-PCR method was designed for investigation of the DNA polymerase gene of adenoviruses. Gene fragments were amplified and sequenced from six novel adenoviruses from seven lizard species, including four species from which adenoviruses had not previously been reported. Host species included Gila monster, leopard gecko, fat-tail gecko, blue-tongued skink, Tokay gecko, bearded dragon, and mountain chameleon. This is the first sequence information from lizard adenoviruses. Phylogenetic analysis indicated that these viruses belong to the genus Atadenovirus, supporting the reptilian origin of atadenoviruses. This PCR method may be useful for obtaining templates for initial sequencing of novel adenoviruses.
Identification and application of self-binding zipper-like sequences in SARS-CoV spike protein.

PubMed

Zhang, Si Min; Liao, Ying; Neo, Tuan Ling; Lu, Yanning; Liu, Ding Xiang; Vahlne, Anders; Tam, James P

2018-05-22

Self-binding peptides containing zipper-like sequences, such as the Leu/Ile zipper sequence within the coiled coil regions of proteins and the cross-β spine steric zippers within the amyloid-like fibrils, could bind to the protein-of-origin through homophilic sequence-specific zipper motifs. These self-binding sequences represent opportunities for the development of biochemical tools and/or therapeutics. Here, we report on the identification of a putative self-binding β-zipper-forming peptide within the severe acute respiratory syndrome-associated coronavirus spike (S) protein and its application in viral detection. Peptide array scanning of overlapping peptides covering the entire length of S protein identified 34 putative self-binding peptides in six clusters, five of which contained octapeptide core consensus sequences. The Cluster I consensus octapeptide sequence GINITNFR was predicted by Eisenberg's 3D profile method to have high amyloid-like fibrillation potential through steric β-zipper formation. Peptide C6 containing the Cluster I consensus sequence was shown to oligomerize and form amyloid-like fibrils. Taking advantage of this, C6 was further applied to detect S protein expression in vitro by fluorescence staining. Meanwhile, the coiled-coil-forming Leu/Ile heptad repeat sequences within the S protein were under-represented during peptide array scanning, consistent with the requirement for long peptide lengths to attain high helix-mediated interaction avidity. The data suggest that short β-zipper-like self-binding peptides within the S protein could be identified by combining peptide scanning with predictive methods, and could be exploited as biochemical detection reagents for viral infection. Copyright © 2018. Published by Elsevier Ltd.
Nucleotide sequence of the gene encoding the nitrogenase iron protein of Thiobacillus ferrooxidans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pretorius, I.M.; Rawlings, D.E.; O'Neill, E.G.

1987-01-01

The DNA sequence was determined for the cloned Thiobacillus ferrooxidans nifH and part of the nifD genes. The DNA chains were radiolabeled with (α-³²P)dCTP (3000 Ci/mmol) or (α-³⁵S)dCTP (400 Ci/mmol). A putative T. ferrooxidans nifH promoter was identified whose sequences showed perfect consensus with those of the Klebsiella pneumoniae nif promoter. Two putative consensus upstream activator sequences were also identified. The amino acid sequence was deduced from the DNA sequence. In a comparison of nifH DNA sequences from T. ferrooxidans and eight other nitrogen-fixing microbes, a Rhizobium sp. isolated from Parasponia andersonii showed the greatest homology (74%) and Clostridium pasteurianum (nifH1) showed the least homology (54%). In the comparison of the amino acid sequences of the Fe proteins, the Rhizobium sp. and Rhizobium japonicum showed the greatest homology (both 86%) and C. pasteurianum (nifH1 gene product) demonstrated the least homology (56%) to the T. ferrooxidans Fe protein.
Explaining the disease phenotype of intergenic SNP through predicted long range regulation.

PubMed

Chen, Jingqi; Tian, Weidong

2016-10-14

Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or close to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it is likely that IGR daSNPs contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Placental Hypomethylation Is More Pronounced in Genomic Loci Devoid of Retroelements

PubMed Central

Chatterjee, Aniruddha; Macaulay, Erin C.; Rodger, Euan J.; Stockwell, Peter A.; Parry, Matthew F.; Roberts, Hester E.; Slatter, Tania L.; Hung, Noelyn A.; Devenish, Celia J.; Morison, Ian M.

2016-01-01

The human placenta is hypomethylated compared to somatic tissues. However, the degree and specificity of placental hypomethylation across the genome is unclear. We assessed genome-wide methylation of the human placenta and compared it to that of the neutrophil, a representative homogeneous somatic cell. We observed global hypomethylation in placenta (relative reduction of 22%) compared to neutrophils. Placental hypomethylation was pronounced in intergenic regions and gene bodies, while the unmethylated state of the promoter remained conserved in both tissues. For every class of repeat elements, the placenta showed lower methylation but the degree of hypomethylation differed substantially between these classes. However, some retroelements, especially the evolutionarily younger Alu elements, retained high levels of placental methylation. Surprisingly, nonretrotransposon-containing sequences showed a greater degree of placental hypomethylation than retrotransposons in every genomic element (intergenic, introns, and exons) except promoters. The differentially methylated fragments (DMFs) in placenta and neutrophils were enriched in gene-poor and CpG-poor regions. The placentally hypomethylated DMFs were enriched in genomic regions that are usually inactive, whereas hypermethylated DMFs were enriched in active regions. Hypomethylation of the human placenta is not specific to retroelements, indicating that the evolutionary advantages of placental hypomethylation go beyond those provided by expression of retrotransposons and retrogenes. PMID:27172225
Extensive length variation in the ribosomal DNA intergenic spacer of yellow perch (Perca flavescens).

PubMed

Kakou, Bidénam; Angers, Bernard; Glémet, Hélène

2016-03-01

The intergenic spacer (IGS) is located between ribosomal RNA (rRNA) gene copies. Within the IGS, regulatory elements for rRNA gene transcription are found, as well as a varying number of other repetitive elements that are at the root of IGS length heterogeneity. This heterogeneity has been shown to have a functional significance through its effect on growth rate. Here, we present the structural organization of yellow perch (Perca flavescens) IGS based on its entire sequence, as well as the IGS length variation within a natural population. Yellow perch IGS structure has four discrete regions containing tandem repeat elements. For three of these regions, no specific length class was detected as allele size was seemingly normally distributed. However, for one repeat region, PCR amplification uncovered the presence of two distinctive IGS variants representing a length difference of 1116 bp. This repeat region was also devoid of any CpG sites despite a high GC content. Balanced selection may be holding the alleles in the population and would account for the high diversity of length variants observed for adjacent regions. Our study is an important precursor for further work aiming to assess the role of IGS length variation in influencing growth rate in fish.
A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

PubMed

Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

2017-01-01

Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved across 50 species, from plants to animals. We also noticed that the set of introns within a gene usually shares the same splicing codes, thus arguing that one sub-type of spliceosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Molecular characterisation of Atlantic salmon paramyxovirus (ASPV): A novel paramyxovirus associated with proliferative gill inflammation

USGS Publications Warehouse

Falk, K.; Batts, W.N.; Kvellestad, A.; Kurath, G.; Wiik-Nielsen, J.; Winton, J.R.

2008-01-01

Atlantic salmon paramyxovirus (ASPV) was isolated in 1995 from gills of farmed Atlantic salmon suffering from proliferative gill inflammation. The complete genome sequence of ASPV was determined, revealing a genome 16,968 nucleotides in length consisting of six non-overlapping genes coding for the nucleo- (N), phospho- (P), matrix- (M), fusion- (F), haemagglutinin-neuraminidase- (HN) and large polymerase (L) proteins in the order 3′-N-P-M-F-HN-L-5′. The various conserved features related to virus replication found in most paramyxoviruses were also found in ASPV. These include: conserved and complementary leader and trailer sequences, tri-nucleotide intergenic regions and highly conserved transcription start and stop signal sequences. The P gene expression strategy of ASPV was like that of the respiro-, morbilli- and henipaviruses, which express the P and C proteins from the primary transcript and edit a portion of the mRNA to encode V and W proteins. Sequence similarities among various features related to virus replication, pairwise comparisons of all deduced ASPV protein sequences with homologous regions from other members of the family Paramyxoviridae, and phylogenetic analyses of these amino acid sequences suggested that ASPV was a novel member of the sub-family Paramyxovirinae, most closely related to the respiroviruses. © 2008 Elsevier B.V. All rights reserved.
Scanning the Effects of Ethyl Methanesulfonate on the Whole Genome of Lotus japonicus Using Second-Generation Sequencing Analysis

PubMed Central

Mohd-Yusoff, Nur Fatihah; Ruperao, Pradeep; Tomoyoshi, Nurain Emylia; Edwards, David; Gresshoff, Peter M.; Biswas, Bandana; Batley, Jacqueline

2015-01-01

Genetic structure can be altered by chemical mutagenesis, which is a common method applied in molecular biology and genetics. Second-generation sequencing provides a platform to reveal base alterations occurring in the whole genome due to mutagenesis. A model legume, Lotus japonicus ecotype Miyakojima, was chemically mutated with alkylating ethyl methanesulfonate (EMS) for the scanning of DNA lesions throughout the genome. Using second-generation sequencing, two individually mutated third-generation progeny (M3, named AM and AS) were sequenced and analyzed to identify single nucleotide polymorphisms and reveal the effects of EMS on nucleotide sequences in these mutant genomes. Single-nucleotide polymorphisms were found once every 208 kb (AS) and 202 kb (AM), with a bias toward G/C-to-A/T changes at a low percentage. Most mutations were intergenic. The mutation spectrum of the genomes was comparable across their individual chromosomes; however, each mutated genome has unique alterations, which are useful for identifying the causal mutations behind their phenotypic changes. The data obtained demonstrate that whole-genome sequencing is applicable as a high-throughput tool to investigate genomic changes due to mutagenesis. The identification of these single-point mutations will facilitate the identification of phenotypically causative mutations in EMS-mutated germplasm. PMID:25660167
The first whole transcriptomic exploration of pre-oviposited early chicken embryos using single and bulked embryonic RNA-sequencing.

PubMed

Hwang, Young Sun; Seo, Minseok; Choi, Hee Jung; Kim, Sang Kyung; Kim, Heebal; Han, Jae Yong

2018-04-01

The chicken is a valuable model organism, especially in evolutionary and embryology research because its embryonic development occurs in the egg. However, despite its scientific importance, no transcriptome data have been generated for deciphering the early developmental stages of the chicken because of practical and technical constraints in accessing pre-oviposited embryos. Here, for the first time, we determine the entire transcriptome of pre-oviposited avian embryos, including oocyte, zygote, and intrauterine embryos from Eyal-Giladi and Kochav stage I (EGK.I) to EGK.X, collected using a noninvasive approach. We also compare RNA-sequencing data obtained using bulked embryo sequencing and single embryo/cell sequencing techniques. The raw sequencing data were preprocessed with two genome builds, Galgal4 and Galgal5, and the expression of 17,108 and 26,102 genes was quantified in the respective builds. There were some differences between the two techniques, as well as between the two genome builds, and these were affected by the emergence of long intergenic noncoding RNA annotations. The first transcriptome datasets of pre-oviposited early chicken embryos based on bulked and single embryo sequencing techniques will serve as a valuable resource for investigating early avian embryogenesis, for comparative studies among vertebrates, and for novel gene annotation in the chicken genome.
Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

PubMed

Reid-Bayliss, Kate S; Loeb, Lawrence A

2017-08-29

Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
Cloning, sequencing and characterization of lipase genes from a polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans

USDA-ARS's Scientific Manuscript database

Lipase (lip) and lipase-specific foldase (lif) genes of a biodegradable polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans NRRL B-2649 were cloned using primers based on consensus sequences, followed by PCR-based genome walking. Sequence analyses showed a putative Lip gene-product (...
Cofactor specificity switch in Shikimate dehydrogenase by rational design and consensus engineering.

PubMed

García-Guevara, Fernando; Bravo, Iris; Martínez-Anaya, Claudia; Segovia, Lorenzo

2017-08-01

Consensus engineering has been used to design more stable variants using the most frequent amino acid at each site of a multiple sequence alignment; sometimes consensus engineering modifies function, but efforts have mainly been focused on studying stability. Here we constructed a consensus Rossmann domain for the Shikimate dehydrogenase enzyme; separately we decided to switch the cofactor specificity through rational design in the Escherichia coli Shikimate dehydrogenase enzyme and then analyzed the effect of consensus mutations on top of our design. We found that consensus mutations closest to the 2' adenine moiety increased the activity in our design. Consensus engineering has been shown to result in more stable proteins and our findings suggest it could also be used as a complementary tool for increasing or modifying enzyme activity during design. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
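The core idea stated in this abstract, taking the most frequent amino acid at each site of a multiple sequence alignment, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `consensus_sequence`, the gap character, and the toy alignment are assumptions made for illustration only, and ties are broken arbitrarily in favor of the residue seen first in the column.

```python
from collections import Counter


def consensus_sequence(alignment, gap="-"):
    """Return the most frequent non-gap residue at each alignment column.

    `alignment` is a list of equal-length aligned sequences (hypothetical input).
    Columns that contain only gaps are kept as a gap in the consensus.
    """
    if not alignment:
        return ""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    consensus = []
    for column in zip(*alignment):
        counts = Counter(res for res in column if res != gap)
        # most_common(1) returns the first-seen residue when counts are tied.
        consensus.append(counts.most_common(1)[0][0] if counts else gap)
    return "".join(consensus)


# Toy alignment invented for this example; it is not data from the study.
toy_alignment = [
    "MKVLG",
    "MKILG",
    "MRVLG",
    "MKV-A",
]
print(consensus_sequence(toy_alignment))
```

A real consensus design would normally start from a curated alignment of many homologs and might weight sequences or apply a frequency threshold; none of that is modeled here.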
Molecular Evidence of Bartonella Infection in Domestic Dogs from Algeria, North Africa, by Polymerase Chain Reaction (PCR)

PubMed Central

Kernif, Tahar; Aissi, Meriem; Doumandji, Salah-Eddine; Chomel, Bruno B.; Raoult, Didier; Bitam, Idir

2010-01-01

Bartonella species are being recognized as important bacterial human and canine pathogens, and are associated with multiple arthropod vectors. Bartonella DNA extracted from blood samples was obtained from domestic dogs in Algiers, Algeria. Polymerase chain reaction (PCR) and DNA sequence analyses of the ftsZ gene and the 16S-23S intergenic spacer region (ITS) were performed. Three Bartonella species, Bartonella vinsonii subsp. berkhoffii, Bartonella clarridgeiae, and Bartonella elizabethae, were detected infecting Algerian dogs. To our knowledge, this study is the first report of detection by PCR amplification of Bartonella in dogs in North Africa. PMID:20682871

Whole genome sequencing: an efficient approach to ensuring food safety

NASA Astrophysics Data System (ADS)

Lakicevic, B.; Nastasijevic, I.; Dimitrijevic, M.

2017-09-01

Whole genome sequencing is an effective, powerful tool that can be applied to a wide range of public health and food safety applications. A major difference between WGS and the traditional typing techniques is that WGS allows all genes to be included in the analysis, instead of a well-defined subset of genes or variable intergenic regions. Also, the use of WGS can facilitate the understanding of contamination/colonization routes of foodborne pathogens within the food production environment, and can also afford efficient tracking of pathogens’ entry routes and distribution from farm-to-consumer. Tracking foodborne pathogens in the food processing-distribution-retail-consumer continuum is of the utmost importance for facilitation of outbreak investigations and rapid action in controlling/preventing foodborne outbreaks. Therefore, WGS likely will replace most of the numerous workflows used in public health laboratories to characterize foodborne pathogens into one consolidated, efficient workflow.
Bacteremia due to Moraxella atlantae in a cancer patient.

PubMed

De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario

2002-07-01

A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the Moraxellaceae. After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed.
Bacteremia Due to Moraxella atlantae in a Cancer Patient

PubMed Central

De Baere, Thierry; Muylaert, An; Everaert, Els; Wauters, Georges; Claeys, Geert; Verschraegen, Gerda; Vaneechoutte, Mario

2002-01-01

A gram-negative alkaline phosphatase- and pyrrolidone peptidase-positive rod-shaped bacterium (CCUG 45702) was isolated from two aerobic blood cultures from a female cancer patient. No identification could be reached using phenotypic techniques. Amplification of the tRNA intergenic spacers revealed fragments with lengths of 116, 133, and 270 bp, but no such pattern was present in our reference library. Sequencing of the 16S rRNA gene revealed its identity as Moraxella atlantae, a species isolated only rarely and published only once as causing infection. In retrospect, the phenotypic characteristics fit the identification as M. atlantae (formerly known as CDC group M-3). Comparative 16S rRNA sequence analysis indicates that M. atlantae, M. lincolnii, and M. osloensis might constitute three separate genera within the Moraxellaceae. After treatment with amoxicillin-clavulanic acid for 2 days, fever subsided and the patient was dismissed. PMID:12089312
Authentication of Botanical Origin in Herbal Teas by Plastid Noncoding DNA Length Polymorphisms.

PubMed

Uncu, Ali Tevfik; Uncu, Ayse Ozgur; Frary, Anne; Doganlar, Sami

2015-07-01

The aim of this study was to develop a DNA barcode assay to authenticate the botanical origin of herbal teas. To reach this aim, we tested the efficiency of a PCR-capillary electrophoresis (PCR-CE) approach on commercial herbal tea samples using two noncoding plastid barcodes, the trnL intron and the intergenic spacer between trnL and trnF. Barcode DNA length polymorphisms proved successful in authenticating the species origin of herbal teas. We verified the validity of our approach by sequencing species-specific barcode amplicons from herbal tea samples. Moreover, we displayed the utility of PCR-CE assays coupled with sequencing to identify the origin of undeclared plant material in herbal tea samples. The PCR-CE assays proposed in this work can be applied as routine tests for the verification of botanical origin in herbal teas and can be extended to authenticate all types of herbal foodstuffs.
Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

PubMed

Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

2018-01-10

Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from the COSMIC database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and the methyltransferase enzyme DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies suggested that when DNMT1 binds to and methylates this consensus DNA motif (C residues of CpG islands), mutation is likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing cancer cells. Copyright © 2017 Elsevier B.V. All rights reserved.
A first report and complete genome sequence of alfalfa enamovirus from Sudan

USDA-ARS's Scientific Manuscript database

A full genome sequence of a viral pathogen, provisionally named alfalfa enamovirus 2 (AEV-2), was reconstructed from short reads obtained by Illumina RNA sequencing of alfalfa sample originating from Sudan. Ambiguous nucleotides in the resultant consensus assembly and identity of the predicted virus...
A consensus linkage map of lentil based on DArT markers from three RIL mapping populations.

PubMed

Ates, Duygu; Aldemir, Secil; Alsaleh, Ahmad; Erdogmus, Semih; Nemli, Seda; Kahriman, Abdullah; Ozkan, Hakan; Vandenberg, Albert; Tanyolac, Bahattin

2018-01-01

Lentil (Lens culinaris ssp. culinaris Medikus) is a diploid (2n = 2x = 14), self-pollinating grain legume with a haploid genome size of about 4 Gbp and is grown throughout the world with current annual production of 4.9 million tonnes. A consensus map of lentil (Lens culinaris ssp. culinaris Medikus) was constructed using three different lentil recombinant inbred line (RIL) populations, including "CDC Redberry" x "ILL7502" (LR8), "ILL8006" x "CDC Milestone" (LR11) and "PI320937" x "Eston" (LR39). The lentil consensus map was composed of 9,793 DArT markers, covered a total of 977.47 cM with an average distance of 0.10 cM between adjacent markers, and comprised 7 linkage groups representing the 7 chromosomes of the lentil genome. The consensus map had no gap larger than 12.67 cM and only 5 gaps were found to be between 12.67 cM and 6.0 cM (on LG3 and LG4). The localization of the SNP markers on the lentil consensus map was in general consistent with their localization on the three individual genetic linkage maps, and the lentil consensus map has longer map length, higher marker density and shorter average distance between adjacent markers compared to the component linkage maps. This high-density consensus map could provide insight into the lentil genome. The consensus map could also help to construct a physical map using a Bacterial Artificial Chromosome library and support map-based cloning studies. Sequence information of DArT may help localization of orientation scaffolds from Next Generation Sequencing data.
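As a quick arithmetic check on the reported marker density, the short Python sketch below recomputes the average spacing from the two figures given in the abstract (977.47 cM and 9,793 markers). The variable names are illustrative, and the two conventions shown (dividing by the marker count, or by the number of intervals within 7 linkage groups) are assumptions about how such an average might be defined; the abstract does not say which was used.

```python
# Figures taken from the abstract above.
total_map_length_cm = 977.47
marker_count = 9793
linkage_groups = 7

# Convention 1 (assumed): total length divided by the number of markers.
spacing_per_marker = total_map_length_cm / marker_count

# Convention 2 (assumed): total length divided by the number of intervals,
# where each linkage group with n markers has n - 1 intervals.
interval_count = marker_count - linkage_groups
spacing_per_interval = total_map_length_cm / interval_count

print(f"{spacing_per_marker:.4f} cM per marker")
print(f"{spacing_per_interval:.4f} cM per interval")
```

Both conventions round to the 0.10 cM reported in the record.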
A consensus linkage map of lentil based on DArT markers from three RIL mapping populations

PubMed Central

Ates, Duygu; Aldemir, Secil; Alsaleh, Ahmad; Erdogmus, Semih; Nemli, Seda; Kahriman, Abdullah; Ozkan, Hakan; Vandenberg, Albert

2018-01-01

Background: Lentil (Lens culinaris ssp. culinaris Medikus) is a diploid (2n = 2x = 14), self-pollinating grain legume with a haploid genome size of about 4 Gbp and is grown throughout the world with current annual production of 4.9 million tonnes. Materials and methods: A consensus map of lentil (Lens culinaris ssp. culinaris Medikus) was constructed using three different lentil recombinant inbred line (RIL) populations, including “CDC Redberry” x “ILL7502” (LR8), “ILL8006” x “CDC Milestone” (LR11) and “PI320937” x “Eston” (LR39). Results: The lentil consensus map was composed of 9,793 DArT markers, covered a total of 977.47 cM with an average distance of 0.10 cM between adjacent markers, and comprised 7 linkage groups representing the 7 chromosomes of the lentil genome. The consensus map had no gap larger than 12.67 cM and only 5 gaps were found to be between 12.67 cM and 6.0 cM (on LG3 and LG4). The localization of the SNP markers on the lentil consensus map was in general consistent with their localization on the three individual genetic linkage maps, and the lentil consensus map has longer map length, higher marker density and shorter average distance between adjacent markers compared to the component linkage maps. Conclusion: This high-density consensus map could provide insight into the lentil genome. The consensus map could also help to construct a physical map using a Bacterial Artificial Chromosome library and support map-based cloning studies. Sequence information of DArT may help localization of orientation scaffolds from Next Generation Sequencing data. PMID:29351563
Consensus statement: Virus taxonomy in the age of metagenomics.

PubMed

Simmonds, Peter; Adams, Mike J; Benkő, Mária; Breitbart, Mya; Brister, J Rodney; Carstens, Eric B; Davison, Andrew J; Delwart, Eric; Gorbalenya, Alexander E; Harrach, Balázs; Hull, Roger; King, Andrew M Q; Koonin, Eugene V; Krupovic, Mart; Kuhn, Jens H; Lefkowitz, Elliot J; Nibert, Max L; Orton, Richard; Roossinck, Marilyn J; Sabanadzovic, Sead; Sullivan, Matthew B; Suttle, Curtis A; Tesh, Robert B; van der Vlugt, René A; Varsani, Arvind; Zerbini, F Murilo

2017-03-01

The number and diversity of viral sequences that are identified in metagenomic data far exceeds that of experimentally characterized virus isolates. In a recent workshop, a panel of experts discussed the proposal that, with appropriate quality control, viruses that are known only from metagenomic data can, and should be, incorporated into the official classification scheme of the International Committee on Taxonomy of Viruses (ICTV). Although a taxonomy that is based on metagenomic sequence data alone represents a substantial departure from the traditional reliance on phenotypic properties, the development of a robust framework for sequence-based virus taxonomy is indispensable for the comprehensive characterization of the global virome. In this Consensus Statement article, we consider the rationale for why metagenomic sequence data should, and how it can, be incorporated into the ICTV taxonomy, and present proposals that have been endorsed by the Executive Committee of the ICTV.
Conservative secondary structure motifs already present in early-stage folding (in silico) as found in serpines family.

PubMed

Brylinski, Michal; Konieczny, Leszek; Kononowicz, Andrzej; Roterman, Irena

2008-03-21

The well-known sequence-comparison procedure implemented in ClustalW was applied to structure comparison. A consensus sequence as well as a consensus structure was defined for proteins belonging to the serpin family. The structure of the early-stage intermediate was the object of the similarity search. High values of W(sequence) agreed with high values of W(structure), making it possible to compare structures using common criteria for sequence and structure comparison. Since the early-stage structural form is created within a limited conformational sub-space that does not include the beta-structure (this structure is mediated by the C7eq structural form), it is particularly important to see that the C7eq structural form may be treated as the seed for the beta-structure present in the final native structure of the protein. The applicability of the ClustalW procedure to structure comparison unifies these two comparisons.
Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples

PubMed Central

Quick, Josh; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J

2018-01-01

Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples without isolation remains challenging for viruses such as Zika, where metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence complete genomes comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimised library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved starting with clinical samples in 1-2 days following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. PMID:28538739
Fast and accurate de novo genome assembly from long uncorrected reads

PubMed Central

Vaser, Robert; Sović, Ivan; Nagarajan, Niranjan

2017-01-01

The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment–based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster. PMID:28100585
Phylogeny and strain typing of Escherichia coli, inferred from variation at mononucleotide repeat loci.

PubMed

Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel

2004-04-01

Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria.
Phylogeny and Strain Typing of Escherichia coli, Inferred from Variation at Mononucleotide Repeat Loci

PubMed Central

Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M.; Kashi, Yechezkel

2004-01-01

Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845
Mind the gap; seven reasons to close fragmented genome assemblies.

PubMed

Thomma, Bart P H J; Seidl, Michael F; Shi-Kunne, Xiaoqian; Cook, David E; Bolton, Melvin D; van Kan, Jan A L; Faino, Luigi

2016-05-01

Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publicly available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions. Copyright © 2015 Elsevier Inc. All rights reserved.
Sequence specificity of the human mRNA N6-adenosine methylase in vitro.

PubMed Central

Harper, J E; Miceli, S M; Roberts, R J; Manley, J L

1990-01-01

N6-adenosine methylation is a frequent modification of mRNAs and their precursors, but little is known about the mechanism of the reaction or the function of the modification. To explore these questions, we developed conditions to examine N6-adenosine methylase activity in HeLa cell nuclear extracts. Transfer of the methyl group from S-[3H methyl]-adenosylmethionine to unlabeled random copolymer RNA substrates of varying ribonucleotide composition revealed a substrate specificity consistent with a previously deduced consensus sequence, Pu[G>A]AC[A/C/U]. 32P-labeled RNA substrates of defined sequence were used to examine the minimum sequence requirements for methylation. Each RNA was 20 nucleotides long, and contained either the core consensus sequence GGACU, or some variation of this sequence. RNAs containing GGACU, either in single or multiple copies, were good substrates for methylation, whereas RNAs containing single base substitutions within the GGACU sequence gave dramatically reduced methylation. These results demonstrate that the N6-adenosine methylase has a strict sequence specificity, and that there is no requirement for extended sequences or secondary structures for methylation. Recognition of this sequence does not require an RNA component, as micrococcal nuclease pretreatment of nuclear extracts actually increased methylation efficiency. PMID:2216767
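The consensus quoted in this abstract (a purine, then G or A with G preferred, then A, C, and one of A/C/U) maps directly onto a regular expression. The Python sketch below scans an RNA string for such 5-mers. The function name and the test sequence are invented for illustration, and the pattern only encodes which bases are allowed: it does not model the G-over-A preference or predict methylation efficiency.

```python
import re

# Purine, then G or A, then A, C, and one of A/C/U.
# The lookahead lets overlapping candidate sites be reported as well.
M6A_CONSENSUS = re.compile(r"(?=([AG][GA]AC[ACU]))")


def find_consensus_sites(rna):
    """Return (0-based start, matched 5-mer) for each consensus match in `rna`."""
    return [(m.start(), m.group(1)) for m in M6A_CONSENSUS.finditer(rna.upper())]


# Hypothetical RNA used only to exercise the pattern.
print(find_consensus_sites("AUGGACUCCGAACAUU"))
```

On this toy input the scan reports the core sequence GGACU at position 2 and a second permitted variant, GAACA, at position 9; a single substitution inside the core, such as GGAUU, yields no match.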
Randomization and In Vivo Selection Reveal a GGRG Motif Essential for Packaging Human Immunodeficiency Virus Type 2 RNA

PubMed Central

Baig, Tayyba T.; Lanchy, Jean-Marc; Lodmell, J. Stephen

2009-01-01

The packaging signal (ψ) of human immunodeficiency virus type 2 (HIV-2) is present in the 5′ noncoding region of RNA and contains a 10-nucleotide palindrome (pal; 5′-392-GGAGUGCUCC) located upstream of the dimerization signal stem-loop 1 (SL1). pal has been shown to be functionally important in vitro and in vivo. We previously showed that the 3′ side of pal (GCUCC-3′) is involved in base-pairing interactions with a sequence downstream of SL1 to make an extended SL1, which is important for replication in vivo and the regulation of dimerization in vitro. However, the role of the 5′ side of pal (5′-GGAGU) was less clear. Here, we characterized this role using an in vivo SELEX approach. We produced a population of HIV-2 DNA genomes with random sequences within the 5′ side of pal and transfected these into COS-7 cells. Viruses from COS-7 cells were used to infect C8166 permissive cells. After several weeks of serial passage in C8166 cells, surviving viruses were sequenced. On the 5′ side of pal there was a striking convergence toward a GGRGN consensus sequence. Individual clones with consensus and nonconsensus sequences were tested in infectivity and packaging assays. Analysis of individuals that diverged from the consensus sequence showed normal viral RNA and protein synthesis but had replication defects and impaired RNA packaging. These findings clearly indicate that the GGRG motif is essential for viral replication and genomic RNA packaging. PMID:18971263
CapZyme-Seq Comprehensively Defines Promoter-Sequence Determinants for RNA 5' Capping with NAD.

PubMed

Vvedenskaya, Irina O; Bird, Jeremy G; Zhang, Yuanchao; Zhang, Yu; Jiao, Xinfu; Barvík, Ivan; Krásný, Libor; Kiledjian, Megerditch; Taylor, Deanne M; Ebright, Richard H; Nickels, Bryce E

2018-05-03

Nucleoside-containing metabolites such as NAD+ can be incorporated as 5' caps on RNA by serving as non-canonical initiating nucleotides (NCINs) for transcription initiation by RNA polymerase (RNAP). Here, we report CapZyme-seq, a high-throughput-sequencing method that employs NCIN-decapping enzymes NudC and Rai1 to detect and quantify NCIN-capped RNA. By combining CapZyme-seq with multiplexed transcriptomics, we determine efficiencies of NAD+ capping by Escherichia coli RNAP for ∼16,000 promoter sequences. The results define preferred transcription start site (TSS) positions for NAD+ capping and define a consensus promoter sequence for NAD+ capping: HRRASWW (TSS underlined). By applying CapZyme-seq to E. coli total cellular RNA, we establish that sequence determinants for NCIN capping in vivo match the NAD+-capping consensus defined in vitro, and we identify and quantify NCIN-capped small RNAs (sRNAs). Our findings define the promoter-sequence determinants for NCIN capping with NAD+ and provide a general method for analysis of NCIN capping in vitro and in vivo. Copyright © 2018 Elsevier Inc. All rights reserved.
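The consensus HRRASWW is written in IUPAC nucleotide ambiguity codes (H = A/C/T, R = A/G, S = C/G, W = A/T). A small Python sketch of how such a motif can be expanded into a regular expression and tested against a candidate sequence is given below. The helper names and the example 7-mers are assumptions for illustration; the sketch does not mark the TSS position, which was indicated by underlining in the source and is not recoverable from this plain-text record.

```python
import re

# Standard IUPAC nucleotide ambiguity codes (DNA alphabet).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def iupac_to_regex(motif):
    """Expand an IUPAC motif into a regex, one character class per ambiguity code."""
    parts = []
    for code in motif.upper():
        bases = IUPAC[code]
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def matches_motif(sequence, motif):
    """True when `sequence` matches the whole IUPAC `motif`."""
    return re.fullmatch(iupac_to_regex(motif), sequence.upper()) is not None


print(iupac_to_regex("HRRASWW"))
print(matches_motif("TGAACTA", "HRRASWW"))  # hypothetical 7-mer that fits the motif
print(matches_motif("GGAACTA", "HRRASWW"))  # G is not allowed at the H position
```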
The Solanum commersonii Genome Sequence Provides Insights into Adaptation to Stress Conditions and Genome Evolution of Wild Potato Relatives

PubMed Central

Aversano, Riccardo; Contaldi, Felice; Ercolano, Maria Raffaella; Grosso, Valentina; Iorizzo, Massimo; Tatino, Filippo; Xumerle, Luciano; Dal Molin, Alessandra; Avanzato, Carla; Ferrarini, Alberto; Delledonne, Massimo; Sanseverino, Walter; Cigliano, Riccardo Aiese; Capella-Gutierrez, Salvador; Gabaldón, Toni; Frusciante, Luigi; Bradeen, James M.; Carputo, Domenico

2015-01-01

Here, we report the draft genome sequence of Solanum commersonii, which consists of ∼830 megabases with an N50 of 44,303 bp anchored to 12 chromosomes, using the potato (Solanum tuberosum) genome sequence as a reference. Compared with potato, S. commersonii shows a striking reduction in heterozygosity (1.5% versus 53 to 59%), and differences in genome sizes were mainly due to variations in intergenic sequence length. Gene annotation by ab initio prediction supported by RNA-seq data produced a catalog of 1703 predicted microRNAs, 18,882 long noncoding RNAs of which 20% are shown to target cold-responsive genes, and 39,290 protein-coding genes with a significant repertoire of nonredundant nucleotide binding site-encoding genes and 126 cold-related genes that are lacking in S. tuberosum. Phylogenetic analyses indicate that domesticated potato and S. commersonii lineages diverged ∼2.3 million years ago. Three duplication periods corresponding to genome enrichment for particular gene families related to response to salt stress, water transport, growth, and defense response were discovered. The draft genome sequence of S. commersonii substantially increases our understanding of the domesticated germplasm, facilitating translation of acquired knowledge into advances in crop stability in light of global climate and environmental changes. PMID:25873387
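The N50 statistic quoted in this abstract is the length of the sequence at which the cumulative length of the assembly's sequences, sorted from longest to shortest, first reaches half of the total assembly length. A minimal Python sketch of that standard definition follows; the function name and the toy lengths are illustrative and are not taken from the S. commersonii assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Sequences are sorted from longest to shortest; N50 is the length of the
    sequence at which the running total first reaches half of the assembly size.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy lengths (arbitrary units) made up for this example.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))
```

With these toy values the total is 290, and the running sum reaches half of that (145) at the second-longest sequence, so the N50 is 70.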
Completion of full length genome sequence of novel avian paramyxovirus strain APMV/Shimane67 isolated from migratory wild geese in Japan.

PubMed

Yamamoto, Eiji; Ito, Toshihiro; Ito, Hiroshi

2016-11-01

The nucleotide sequences of nucleocapsid protein (N); phosphoprotein (P); matrix protein (M); hemagglutinin-neuraminidase (HN); and large polymerase protein (L) genes, 3'-end leader, 5'-end trailer and intergenic regions of the avian paramyxovirus (APMV) strain goose/Shimane/67/2000 (APMV/Shimane67) were determined. Together with previously reported data on fusion protein (F) gene sequence [46], the determination of the genome sequence of APMV/Shimane67 has been completed in this study. The genome of APMV/Shimane67 comprised 16,146 nucleotides in length and contains six genes in the order of 3'-N-P-M-F-HN-L-5'. The features of the APMV/Shimane67 genome (e.g., nucleotide length of whole genome and each of the six genes, and predicted amino acid length of each of the six genes) were distinct from those of other APMV serotypes. Phylogenetic analysis indicated that although APMV/Shimane67 was grouped with APMV-1, -9 and -12, the evolutionary distance between APMV/Shimane67 and these viruses was longer than that observed between intra-serotype viruses. These results show that the genome sequence of APMV/Shimane67 contains specific characteristics and is distinguishable from other types of APMV.

Genetic structure and evolution of natural populations of viruses causing the tomato yellow leaf curl disease in Spain.

PubMed

Font, María Isabel; Rubio, Luis; Martínez-Culebras, Pedro Vicente; Jordá, Concepción

2007-09-01

The population structure and genetic variation of two begomoviruses: tomato yellow leaf curl Sardinia virus (TYLCSV) and tomato yellow leaf curl virus (TYLCV) in tomato crops of Spain were studied from 1997 until 2001. Restriction digestion of a genomic region comprised of the CP coat protein gene (CPR) of 358 TYLC virus isolates enabled us to classify them into 14 haplotypes. Nucleotide sequences of two genomic regions: CPR, and the surrounding intergenic region (SIR) were determined for at least two isolates per haplotype. SIR was more variable than CPR and showed multiple recombination events whereas no recombination was detected within CPR. In all geographic regions except Murcia, the population was, or evolved to be composed of one predominant haplotype with a low genetic diversity (<0.0180). In Murcia, two successive changes of the predominant haplotype were observed in the best studied population. Phylogenetic analysis showed that the TYLCSV sequences determined clustered with sequences obtained from the GenBank of other TYLCSV Spanish isolates which were clearly separated from TYLCSV Italian isolates. Most of our TYLCV sequences were similar to those of isolates from Japan and Portugal, and the sequences obtained from TYLCV isolates from the Canary island of Lanzarote were similar to those of Caribbean TYLCV isolates.
Asparagine-linked oligosaccharides present on a non-consensus amino acid sequence in the CH1 domain of human antibodies.

PubMed

Valliere-Douglass, John F; Kodama, Paul; Mujacic, Mirna; Brady, Lowell J; Wang, Wes; Wallace, Alison; Yan, Boxu; Reddy, Pranhitha; Treuheit, Michael J; Balland, Alain

2009-11-20

We report that N-linked oligosaccharide structures can be present on an asparagine residue not adhering to the consensus site motif NX(S/T), where X is not proline, described in the literature. We have observed oligosaccharides on a non-consensus asparaginyl residue in the C(H)1 constant domain of IgG1 and IgG2 antibodies. The initial findings were obtained from characterization of charge variant populations evident in a recombinant human antibody of the IgG2 subclass. HPLC-MS results indicated that cation-exchange chromatography acidic variant populations were enriched in antibody with a second glycosylation site, in addition to the well documented canonical glycosylation site located in the C(H)2 domain. Subsequent tryptic and chymotryptic peptide map data indicated that the second glycosylation site was associated with the amino acid sequence TVSWN(162)SGAL in the C(H)1 domain of the antibody. This highly atypical modification is present at levels of 0.5-2.0% on most of the recombinant antibodies that have been tested and has also been observed in IgG1 antibodies derived from human donors. Site-directed mutagenesis of the C(H)1 domain sequence in a recombinant-human IgG1 antibody resulted in an increase in non-consensus glycosylation to 3.15%, a greater than 4-fold increase over the level observed in the wild type, by changing the -1 and +1 amino acids relative to the asparagine residue at position 162. We believe that further understanding of the phenomenon of non-consensus glycosylation can be used to gain fundamental insights into the fidelity of the cellular glycosylation machinery.
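The consensus motif NX(S/T), with X not proline, that the record above refers to can be expressed as a short pattern. The sketch below is illustrative only; the function name and the second test sequence are hypothetical. It scans a protein sequence for canonical sequons and shows that the CH1 peptide TVSWNSGAL from the abstract contains none, since its asparagine is followed by S and then G rather than S/T at the +2 position.

```python
import re

# Canonical N-glycosylation sequon: N, then any residue except P, then S or T.
# The lookahead allows overlapping sequons to be reported.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def sequon_positions(protein):
    """Return 0-based positions of asparagines that start a canonical sequon."""
    return [m.start() for m in SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Peptide quoted in the abstract: the N is followed by S-G, so no sequon.
    print(sequon_positions("TVSWNSGAL"))
    # Hypothetical sequence with two canonical sequons (NGT, NAS) and one
    # proline-blocked site (NPS).
    print(sequon_positions("AANGTNPSNAS"))
```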
Magnetic resonance imaging for the detection, localisation, and characterisation of prostate cancer: recommendations from a European consensus meeting.

PubMed

Dickinson, Louise; Ahmed, Hashim U; Allen, Clare; Barentsz, Jelle O; Carey, Brendan; Futterer, Jurgen J; Heijmink, Stijn W; Hoskin, Peter J; Kirkham, Alex; Padhani, Anwar R; Persad, Raj; Puech, Philippe; Punwani, Shonit; Sohaib, Aslam S; Tombal, Bertrand; Villers, Arnauld; van der Meulen, Jan; Emberton, Mark

2011-04-01

Multiparametric magnetic resonance imaging (mpMRI) may have a role in detecting clinically significant prostate cancer in men with raised serum prostate-specific antigen levels. Variations in technique and the interpretation of images have contributed to inconsistency in its reported performance characteristics. Our aim was to make recommendations on a standardised method for the conduct, interpretation, and reporting of prostate mpMRI for prostate cancer detection and localisation. A consensus meeting of 16 European prostate cancer experts was held that followed the UCLA-RAND Appropriateness Method and was facilitated by an independent chair. Before the meeting, 520 items were scored for "appropriateness" by panel members, discussed face to face, and rescored. Agreement was reached in 67% of 260 items related to imaging sequence parameters. T2-weighted, dynamic contrast-enhanced, and diffusion-weighted MRI were the key sequences incorporated into the minimum requirements. Consensus was also reached on 54% of 260 items related to image interpretation and reporting, including features of malignancy on individual sequences. A 5-point scale was agreed on for communicating the probability of malignancy, with a minimum of 16 prostatic regions of interest, to include a pictorial representation of suspicious foci. Limitations relate to consensus methodology. Dominant personalities are known to affect the opinions of the group and were countered by a neutral chairperson. Consensus was reached on a number of areas related to the conduct, interpretation, and reporting of mpMRI for the detection, localisation, and characterisation of prostate cancer. Before optimal dissemination of this technology, these outcomes will require formal validation in prospective trials. Copyright © 2010 European Association of Urology. Published by Elsevier B.V. All rights reserved.
A filtering method to generate high quality short reads using illumina paired-end technology.

PubMed

Eren, A Murat; Vineis, Joseph H; Morrison, Hilary G; Sogin, Mitchell L

2013-01-01

Consensus between independent reads improves the accuracy of genome and transcriptome analyses; however, lack of consensus between very similar sequences in metagenomic studies can and often does represent natural variation of biological significance. The common use of machine-assigned quality scores on next generation platforms does not necessarily correlate with accuracy. Here, we describe using the overlap of paired-end, short sequence reads to identify error-prone reads in marker gene analyses and their contribution to spurious OTUs following clustering analysis using QIIME. Our approach can also reduce error in shotgun sequencing data generated from libraries with small, tightly constrained insert sizes. The open-source implementation of this algorithm in the Python programming language with user instructions can be obtained from https://github.com/meren/illumina-utils.
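The general idea of using paired-end overlap as an error check can be sketched as follows. This is not the illumina-utils implementation, whose details the abstract does not give; it is a simplified illustration that assumes the overlap length is already known and that reads contain only A, C, G, T, and N. All function names, the mismatch threshold, and the example reads are hypothetical.

```python
# Translation table for complementing DNA bases (N stays N).
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def overlap_mismatches(read1, read2, overlap):
    """Count disagreements in the region covered by both reads of a pair.

    The last `overlap` bases of read1 are compared with the first `overlap`
    bases of the reverse-complemented read2.
    """
    if overlap <= 0 or overlap > min(len(read1), len(read2)):
        raise ValueError("overlap must be between 1 and the shorter read length")
    tail = read1.upper()[-overlap:]
    head = reverse_complement(read2)[:overlap]
    return sum(a != b for a, b in zip(tail, head))


def keep_pair(read1, read2, overlap, max_mismatches=0):
    """Keep a pair only if its overlapping region agrees closely enough."""
    return overlap_mismatches(read1, read2, overlap) <= max_mismatches


if __name__ == "__main__":
    # Hypothetical 12-bp fragment ACGTTGCAAGGC sequenced with 8-bp reads,
    # giving a 4-bp overlap.
    print(keep_pair("ACGTTGCA", "GCCTTGCA", overlap=4))  # reads agree
    print(keep_pair("ACGTTGCA", "GCCTTGCT", overlap=4))  # one disagreement
```

A production tool would also have to find the overlap by alignment and decide how to use base quality scores where the two reads disagree.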
Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.

PubMed

Naghdi, Mohammad Reza; Smail, Katia; Wang, Joy X; Wade, Fallou; Breaker, Ronald R; Perreault, Jonathan

2017-03-15

The discovery of noncoding RNAs (ncRNAs) and their importance for gene regulation led us to develop bioinformatics tools to pursue the discovery of novel ncRNAs. Finding ncRNAs de novo is challenging, first due to the difficulty of retrieving large numbers of sequences for given gene activities, and second due to exponential demands on calculation needed for comparative genomics on a large scale. Recently, several tools for the prediction of conserved RNA secondary structure were developed, but many of them are not designed to uncover new ncRNAs, or are too slow for conducting analyses on a large scale. Here we present various approaches using the database RiboGap as a primary tool for finding known ncRNAs and for uncovering simple sequence motifs with regulatory roles. This database also can be used to easily extract intergenic sequences of eubacteria and archaea to find conserved RNA structures upstream of given genes. We also show how to extend analysis further to choose the best candidate ncRNAs for experimental validation. Copyright © 2017 Elsevier Inc. All rights reserved.
Trichomonas vaginalis ribosomal RNA: identification and characterisation of the transcription promoter and terminator sequences.

PubMed

Franco, Bernardo; Hernández, Roberto; López-Villaseñor, Imelda

2012-09-01

Trichomonas vaginalis is a parasitic protozoan of both medical and biological relevance. Transcriptional studies in this organism have focused mainly on type II pol promoters, whereas the elements necessary for transcription by polI or polIII have not been investigated. Here, with the aid of a transient transcription system, we characterised the rDNA intergenic region, defining both the promoter and the terminator sequences required for transcription. We defined the promoter as a compact region of approximately 180 bp. We also identified a potential upstream control element (UCE) that was located 80 bp upstream of the transcription start point (TSP). A transcription termination element was identified within a 34 bp region that was located immediately downstream of the 28S coding sequence. The function of this element depends upon polarity and the presence of both a stretch of uridine residues (U's) and a hairpin structure in the transcript. Our observations provide a strong basis for the study of DNA recognition by the polI transcriptional machinery in this early divergent organism. Copyright © 2012 Elsevier B.V. All rights reserved.
A novel strategy for the determination of a rhabdovirus genome and its application to sequencing of Eggplant mottled dwarf virus.

PubMed

Pappi, Polyxeni G; Dovas, Chrysostomos I; Efthimiou, Konstantinos E; Maliogka, Varvara I; Katis, Nikolaos I

2013-08-01

A novel strategy employing the rhabdovirus untranslated conserved intergenic regions was developed and applied successfully for the determination of the complete nucleotide sequence of Eggplant mottled dwarf virus (EMDV). The EMDV genome contains seven open reading frames with the same organization as Potato yellow dwarf virus (PYDV), the type species of the genus Nucleorhabdovirus. These two species encode five core genes [nucleocapsid (N), phosphoprotein (P), matrix (M), glycoprotein (G), and the polymerase (L)] like other viruses of the genus and an additional one (X), located between N and P, giving rise to a protein with currently unknown function. Furthermore, both EMDV and PYDV contain a gene (Y), inserted between P and M, which probably encodes the virus movement protein, in concordance with the rest of the plant-infecting rhabdoviruses. Phylogenetic analysis of the polymerase gene confirmed the classification of EMDV within the genus Nucleorhabdovirus and showed a close evolutionary relationship to PYDV. The novel sequencing strategy developed is a useful tool for the genome determination of yet uncharacterized rhabdoviruses.
Phytoplasma-specific PCR primers based on sequences of the 16S-23S rRNA spacer region.

PubMed Central

Smart, C D; Schneider, B; Blomquist, C L; Guerra, L J; Harrison, N A; Ahrens, U; Lorenz, K H; Seemüller, E; Kirkpatrick, B C

1996-01-01

In order to develop a diagnostic tool to identify phytoplasmas and classify them according to their phylogenetic group, we took advantage of the sequence diversity of the 16S-23S intergenic spacer regions (SRs) of phytoplasmas. Ten PCR primers were developed from the SR sequences and were shown to amplify in a group-specific fashion. For some groups of phytoplasmas, such as elm yellows, ash yellows, and pear decline, the SR primer was paired with a specific primer from within the 16S rRNA gene. Each of these primer pairs was specific for a specific phytoplasma group, and they did not produce PCR products of the correct size from any other phytoplasma group. One primer was designed to anneal within the conserved tRNA(Ile) and, when paired with a universal primer, amplified all phytoplasmas tested. None of the primers produced PCR amplification products of the correct size from healthy plant DNA. These primers can serve as effective tools for identifying particular phytoplasmas in field samples. PMID:8702291
Spontaneous Mutation Rate in the Smallest Photosynthetic Eukaryotes

PubMed Central

Krasovec, Marc; Eyre-Walker, Adam; Sanchez-Ferandin, Sophie

2017-01-01

Mutation is the ultimate source of genetic variation, and knowledge of mutation rates is fundamental for our understanding of all evolutionary processes. High throughput sequencing of mutation accumulation lines has provided genome wide spontaneous mutation rates in a dozen model species, but estimates from nonmodel organisms from much of the diversity of life are very limited. Here, we report mutation rates in four haploid marine bacterial-sized photosynthetic eukaryotic algae: Bathycoccus prasinos, Ostreococcus tauri, Ostreococcus mediterraneus, and Micromonas pusilla. The spontaneous mutation rate between species varies from μ = 4.4 × 10^-10 to 9.8 × 10^-10 mutations per nucleotide per generation. Within genomes, there is a two-fold increase of the mutation rate in intergenic regions, consistent with an optimization of mismatch and transcription-coupled DNA repair in coding sequences. Additionally, we show that deviation from the equilibrium GC content increases the mutation rate by ∼2% to ∼12% because of a GC bias in coding sequences. More generally, the difference between the observed and equilibrium GC content of genomes explains some of the inter-specific variation in mutation rates. PMID:28379581
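A per-nucleotide, per-generation rate such as the one reported above is commonly estimated from mutation accumulation lines by dividing the number of observed mutations by the product of callable sites and generations. The sketch below shows that arithmetic under this assumption; the abstract does not spell out the authors' exact estimator, and the function name and input numbers are hypothetical rather than data from the study.

```python
def mutation_rate(n_mutations, callable_sites, total_generations):
    """Estimate mutations per nucleotide per generation.

    n_mutations       -- mutations observed across all lines
    callable_sites    -- nucleotide positions at which mutations could be called
    total_generations -- generations summed over all lines
    """
    if callable_sites <= 0 or total_generations <= 0:
        raise ValueError("callable_sites and total_generations must be positive")
    return n_mutations / (callable_sites * total_generations)


if __name__ == "__main__":
    # Hypothetical inputs chosen for easy arithmetic, not values from the paper.
    print(mutation_rate(10, 1_000_000, 1_000))
```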
DNA Sequence-Mediated, Evolutionarily Rapid Redistribution of Meiotic Recombination Hotspots

PubMed Central

Wahls, Wayne P.; Davidson, Mari K.

2011-01-01

Hotspots regulate the position and frequency of Spo11 (Rec12)-initiated meiotic recombination, but paradoxically they are suicidal and are somehow resurrected elsewhere in the genome. After the DNA sequence-dependent activation of hotspots was discovered in fission yeast, nearly two decades elapsed before the key realizations that (A) DNA site-dependent regulation is broadly conserved and (B) individual eukaryotes have multiple different DNA sequence motifs that activate hotspots. From our perspective, such findings provide a conceptually straightforward solution to the hotspot paradox and can explain other, seemingly complex features of meiotic recombination. We describe how a small number of single-base-pair substitutions can generate hotspots de novo and dramatically alter their distribution in the genome. This model also shows how equilibrium rate kinetics could maintain the presence of hotspots over evolutionary timescales, without strong selective pressures invoked previously, and explains why hotspots localize preferentially to intergenic regions and introns. The model is robust enough to account for all hotspots of humans and chimpanzees repositioned since their divergence from the latest common ancestor. PMID:22084420
Improvement in Protein Domain Identification Is Reached by Breaking Consensus, with the Agreement of Many Profiles and Domain Co-occurrence

PubMed Central

Bernardes, Juliana; Zaverucha, Gerson; Vaquero, Catherine; Carbone, Alessandra

2016-01-01

Traditional protein annotation methods describe known domains with probabilistic models representing consensus among homologous domain sequences. However, when relevant signals become too weak to be identified by a global consensus, attempts for annotation fail. Here we address the fundamental question of domain identification for highly divergent proteins. By using high performance computing, we demonstrate that the limits of state-of-the-art annotation methods can be bypassed. We design a new strategy based on the observation that many structural and functional protein constraints are not globally conserved through all species but might be locally conserved in separate clades. We propose a novel exploitation of the large amount of data available: 1. for each known protein domain, several probabilistic clade-centered models are constructed from a large and differentiated panel of homologous sequences, 2. a decision-making protocol combines outcomes obtained from multiple models, 3. a multi-criteria optimization algorithm finds the most likely protein architecture. The method is evaluated for domain and architecture prediction over several datasets and statistical testing hypotheses. Its performance is compared against HMMScan and HHblits, two widely used search methods based on sequence-profile and profile-profile comparison. Due to their closeness to actual protein sequences, clade-centered models are shown to be more specific and functionally predictive than the broadly used consensus models. Based on them, we improved annotation of Plasmodium falciparum protein sequences on a scale not previously possible. We successfully predict at least one domain for 72% of P. falciparum proteins against 63% achieved previously, corresponding to 30% of improvement over the total number of Pfam domain predictions on the whole genome. 
The method is applicable to any genome and opens new avenues to tackle evolutionary questions such as the reconstruction of ancient domain duplications, the reconstruction of the history of protein architectures, and the estimation of protein domain age. Website and software: http://www.lcqb.upmc.fr/CLADE. PMID:27472895
AMS 4.0: consensus prediction of post-translational modifications in protein sequences.

PubMed

Plewczynski, Dariusz; Basu, Subhadip; Saha, Indrajit

2012-08-01

We present here the 2011 update of the AutoMotif Service (AMS 4.0) that predicts the wide selection of 88 different types of the single amino acid post-translational modifications (PTM) in protein sequences. The selection of experimentally confirmed modifications is acquired from the latest UniProt and Phospho.ELM databases for training. The sequence vicinity of each modified residue is represented using amino acids physico-chemical features encoded using high quality indices (HQI) obtaining by automatic clustering of known indices extracted from AAindex database. For each type of the numerical representation, the method builds the ensemble of Multi-Layer Perceptron (MLP) pattern classifiers, each optimising different objectives during the training (for example the recall, precision or area under the ROC curve (AUC)). The consensus is built using brainstorming technology, which combines multi-objective instances of machine learning algorithm, and the data fusion of different training objects representations, in order to boost the overall prediction accuracy of conserved short sequence motifs. The performance of AMS 4.0 is compared with the accuracy of previous versions, which were constructed using single machine learning methods (artificial neural networks, support vector machine). Our software improves the average AUC score of the earlier version by close to 7 % as calculated on the test datasets of all 88 PTM types. Moreover, for the selected most-difficult sequence motifs types it is able to improve the prediction performance by almost 32 %, when compared with previously used single machine learning methods. Summarising, the brainstorming consensus meta-learning methodology on the average boosts the AUC score up to around 89 %, averaged over all 88 PTM types. Detailed results for single machine learning methods and the consensus methodology are also provided, together with the comparison to previously published methods and state-of-the-art software tools. 
The source code and precompiled binaries of brainstorming tool are available at http://code.google.com/p/automotifserver/ under Apache 2.0 licensing.
Identification and Functional Prediction of Large Intergenic Noncoding RNAs (lincRNAs) in Rainbow Trout (Oncorhynchus mykiss)

USDA-ARS's Scientific Manuscript database

Long noncoding RNAs (lncRNAs) have been recognized in recent years as key regulators of diverse cellular processes. Genome-wide large-scale projects have uncovered thousands of lncRNAs in many model organisms. Large intergenic noncoding RNAs (lincRNAs) are lncRNAs that are transcribed from intergeni...
Effects of near-ultraviolet light on mutations, intragenic and intergenic recombinations in Saccharomyces cerevisiae.

PubMed

Machida, I; Saeki, T; Nakai, S

1986-03-01

The effects of far (254 nm) and near (290-350 nm) ultraviolet (UV) light on mutations, intragenic and intergenic recombinations were compared in diploid strains of Saccharomyces cerevisiae. At equivalent survival levels there was not much difference in the induction of nonsense and missense mutations between far- and near-UV radiations. However, frameshift mutations were induced more frequently by near-UV than by far-UV radiation. Near-UV radiation induced intragenic recombination (gene conversion) as efficiently as far-UV radiation and the induced levels were similar in both radiations at equitoxic doses. A strikingly higher frequency was observed for the intergenic recombination induced by near-UV radiation than by far-UV radiation when compared at equivalent survival levels. Photoreactivation reduced the frequency only slightly in far-UV induced intergenic recombination and not at all in near-UV induction. These results indicate that near-UV damage involves strand breakage in addition to pyrimidine dimers and other lesions induced, whereas far-UV damage consists largely of photoreactivable lesions, pyrimidine dimers, and near-UV induced damage is more efficient for the induction of crossing-over.
Transcriptome characterization and polymorphism detection between subspecies of big sagebrush (Artemisia tridentata)

PubMed Central

2011-01-01

Background: Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results: cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion: We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. SNP mapping among the three subspecies suggests the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398
Molecular characterization of a novel bat-associated circovirus with a poly-T tract in the 3' intergenic region.

PubMed

Zhu, Aiwei; Jiang, Tinglei; Hu, Tingsong; Mi, Shijiang; Zhao, Zihan; Zhang, Fuqiang; Feng, Jiang; Fan, Quanshui; He, Biao; Tu, Changchun

2018-05-02

The family Circoviridae comprises a large group of small circular single-stranded DNA viruses with several members causing severe pig and poultry diseases. In recent years the number of new viruses within the family has had an explosive increase showing a high level of genetic diversity and a broad host range. In this report we describe two more circoviruses identified from bats in Yunnan and Heilongjiang provinces in China. Full genome sequencing has revealed that these bat associated circoviruses (bat ACV) should be classified as new species within the genus Circovirus based on the demarcation criteria of the International Committee on the Taxonomy of Viruses (ICTV). The most striking result is the novel finding of a 21-28 nt polythymidine (poly-T) tract in the 3' terminal intergenic region of bat ACV isolates from Heilongjiang province. To understand its role in viral replication, a wild type bat ACV and a mutated version with the entire poly-T deleted were rescued through construction of infectious clones. Replication comparison in vitro showed that the poly-T is not essential for viral replication. Identification of additional bat ACV isolates and study of their biological characteristics will be the main task in future to understand the potential roles of bats in transmission of circoviruses to terrestrial mammals and humans. Copyright © 2018 Elsevier B.V. All rights reserved.
Genome-wide identification of potato long intergenic noncoding RNAs responsive to Pectobacterium carotovorum subspecies brasiliense infection.

PubMed

Kwenda, Stanford; Birch, Paul R J; Moleleki, Lucy N

2016-08-11

Long noncoding RNAs (lncRNAs) represent a class of RNA molecules that are implicated in regulation of gene expression in both mammals and plants. While much progress has been made in determining the biological functions of lncRNAs in mammals, the functional roles of lncRNAs in plants are still poorly understood. Specifically, the roles of long intergenic noncoding RNAs (lincRNAs) in plant defence responses are yet to be fully explored. In this study, we used strand-specific RNA sequencing to identify 1113 lincRNAs in potato (Solanum tuberosum) from stem tissues. The lincRNAs are expressed from all 12 potato chromosomes and are generally smaller in size compared to protein-coding genes. As in other plants, most potato lincRNAs possess single exons. A time-course RNA-seq analysis between a tolerant and a susceptible potato cultivar showed that 559 lincRNAs are responsive to Pectobacterium carotovorum subsp. brasiliense challenge compared to mock-inoculated controls. Moreover, coexpression analysis revealed that 17 of these lincRNAs are highly associated with 12 potato defence-related genes. Together, these results suggest that lincRNAs have potential functional roles in potato defence responses. Furthermore, this work provides the first library of potato lincRNAs and a set of novel lincRNAs implicated in potato defences against P. carotovorum subsp. brasiliense, a member of the soft rot Enterobacteriaceae phytopathogens.
Rapid identification of 11 human intestinal Lactobacillus species by multiplex PCR assays using group- and species-specific primers derived from the 16S-23S rRNA intergenic spacer region and its flanking 23S rRNA.

PubMed

Song, Y; Kato, N; Liu, C; Matsumiya, Y; Kato, H; Watanabe, K

2000-06-15

Rapid and reliable two-step multiplex polymerase chain reaction (PCR) assays were established to identify human intestinal lactobacilli; a multiplex PCR was used for grouping of lactobacilli with a mixture of group-specific primers followed by four multiplex PCR assays with four sorts of species-specific primer mixtures for identification at the species level. Primers used were designed from nucleotide sequences of the 16S-23S rRNA intergenic spacer region and its flanking 23S rRNA gene of members of the genus Lactobacillus which are commonly isolated from human stool specimens: Lactobacillus acidophilus, Lactobacillus crispatus, Lactobacillus delbrueckii (ssp. bulgaricus and ssp. lactis), Lactobacillus fermentum, Lactobacillus gasseri, Lactobacillus jensenii, Lactobacillus paracasei (ssp. paracasei and ssp. tolerans), Lactobacillus plantarum, Lactobacillus reuteri, Lactobacillus rhamnosus and Lactobacillus salivarius (ssp. salicinius and ssp. salivarius). The established two-step multiplex PCR assays were applied to the identification of 84 Lactobacillus strains isolated from human stool specimens and the PCR results were consistent with the results from the DNA-DNA hybridization assay. These results suggest that the multiplex PCR system established in this study is a simple, rapid and reliable method for the identification of common Lactobacillus isolates from human stool samples.
The glnAntrBC operon of Herbaspirillum seropedicae is transcribed by two oppositely regulated promoters upstream of glnA.

PubMed

Schwab, Stefan; Souza, Emanuel M; Yates, Marshall G; Persuhn, Darlene C; Steffens, M Berenice R; Chubatsu, Leda S; Pedrosa, Fábio O; Rigo, Liu U

2007-01-01

Herbaspirillum seropedicae is an endophytic bacterium that fixes nitrogen under microaerophilic conditions. The putative promoter sequences glnAp1 (sigma70-dependent) and glnAp2 (sigma54), and two NtrC-binding sites were identified upstream from the glnA, ntrB and ntrC genes of this microorganism. To study their transcriptional regulation, we used lacZ fusions to the H. seropedicae glnA gene, and the glnA-ntrB and ntrB-ntrC intergenic regions. Expression of glnA was up-regulated under low ammonium, but no transcription activity was detected from the intergenic regions under any condition tested, suggesting that glnA, ntrB and ntrC are co-transcribed from the promoters upstream of glnA. Ammonium regulation was lost in the ntrC mutant strain. A point mutation was introduced in the conserved -25/-24 dinucleotide (GG-->TT) of the putative sigma54-dependent promoter (glnAp2). Contrary to the wild-type promoter, glnA expression with the mutant glnAp2 promoter was repressed in the wild-type strain under low ammonium levels, but this repression was abolished in an ntrC background. Together our results indicate that the H. seropedicae glnAntrBC operon is regulated from two functional promoters upstream from glnA, which are oppositely regulated by the NtrC protein.
Myc-induced anchorage of the rDNA IGS region to nucleolar matrix modulates growth-stimulated changes in higher-order rDNA architecture

PubMed Central

Shiue, Chiou-Nan; Nematollahi-Mahani, Amir; Wright, Anthony P.H.

2014-01-01

Chromatin domain organization and the compartmentalized distribution of chromosomal regions are essential for packaging of deoxyribonucleic acid (DNA) in the eukaryotic nucleus as well as regulated gene expression. Nucleoli are the most prominent morphological structures of cell nuclei and nucleolar organization is coupled to cell growth. It has been shown that nuclear scaffold/matrix attachment regions often define the base of looped chromosomal domains in vivo and that they are thereby critical for correct chromosome architecture and gene expression. Here, we show regulated organization of mammalian ribosomal ribonucleic acid genes into distinct chromatin loops by tethering to nucleolar matrix via the non-transcribed inter-genic spacer region of the ribosomal DNA (rDNA). The rDNA gene loop structures are induced specifically upon growth stimulation and are dependent on the activity of the c-Myc protein. Matrix-attached rDNA genes are hypomethylated at the promoter and are thus available for transcriptional activation. rDNA genes silenced by methylation are not recruited to the matrix. c-Myc, which has been shown to induce rDNA transcription directly, is physically associated with rDNA gene looping structures and the intergenic spacer sequence in growing cells. Such a role of Myc proteins in gene activation has not been reported previously. PMID:24609384

Myc-induced anchorage of the rDNA IGS region to nucleolar matrix modulates growth-stimulated changes in higher-order rDNA architecture.

PubMed

Shiue, Chiou-Nan; Nematollahi-Mahani, Amir; Wright, Anthony P H

2014-05-01

Chromatin domain organization and the compartmentalized distribution of chromosomal regions are essential for packaging of deoxyribonucleic acid (DNA) in the eukaryotic nucleus as well as regulated gene expression. Nucleoli are the most prominent morphological structures of cell nuclei and nucleolar organization is coupled to cell growth. It has been shown that nuclear scaffold/matrix attachment regions often define the base of looped chromosomal domains in vivo and that they are thereby critical for correct chromosome architecture and gene expression. Here, we show regulated organization of mammalian ribosomal ribonucleic acid genes into distinct chromatin loops by tethering to nucleolar matrix via the non-transcribed inter-genic spacer region of the ribosomal DNA (rDNA). The rDNA gene loop structures are induced specifically upon growth stimulation and are dependent on the activity of the c-Myc protein. Matrix-attached rDNA genes are hypomethylated at the promoter and are thus available for transcriptional activation. rDNA genes silenced by methylation are not recruited to the matrix. c-Myc, which has been shown to induce rDNA transcription directly, is physically associated with rDNA gene looping structures and the intergenic spacer sequence in growing cells. Such a role of Myc proteins in gene activation has not been reported previously. © 2014 The Author(s). Published by Oxford University Press [on behalf of Nucleic Acids Research].
Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum)

PubMed Central

Cho, Kwang-Soo; Yun, Bong-Kyoung; Yoon, Young-Ho; Hong, Su-Young; Mekapogu, Manjulatha; Kim, Kyung-Hee; Yang, Tae-Jin

2015-01-01

We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compare it with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. The copy number of the 21-bp tandem repeat differed between F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Coding sequences are highly conserved at the nucleotide and amino acid levels, with about 98% homology, and four genes (rpoC2, ycf3, accD, and clpP) have high synonymous (Ks) values. PCR-based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum. PMID:25966355
Identification of an estrogen response element in the 3'-flanking region of the murine c-fos protooncogene.

PubMed

Hyder, S M; Stancel, G M; Nawaz, Z; McDonnell, D P; Loose-Mitchell, D S

1992-09-05

We have used transient transfection assays with reporter plasmids expressing chloramphenicol acetyltransferase, linked to regions of mouse c-fos, to identify a specific estrogen response element (ERE) in this protooncogene. This element is located in the untranslated 3'-flanking region of the c-fos gene, 5 kilobases (kb) downstream from the c-fos promoter and 1.5 kb downstream of the poly(A) signal. This element confers estrogen responsiveness to chloramphenicol acetyltransferase reporters linked to both the herpes simplex virus thymidine kinase promoter and the homologous c-fos promoter. Deletion analysis localized the response element to a 200-base pair fragment which contains the element GGTCACCACAGCC that resembles the consensus ERE sequence GGTCACAGTGACC originally identified in Xenopus vitellogenin A2 gene. A synthetic 36-base pair oligodeoxynucleotide containing this c-fos sequence conferred estrogen inducibility to the thymidine kinase promoter. The corresponding sequence also induced reporter activity when present in the c-fos gene fragment 3 kb from the thymidine kinase promoter. Gel-shift experiments demonstrated that synthetic oligonucleotides containing either the consensus ERE or the c-fos element bind human estrogen receptor obtained from a yeast expression system. However, the mobility of the shifted band is faster for the fos-ERE-complex than the consensus ERE complex suggesting that the three-dimensional structure of the protein-DNA complexes is different or that other factors are differentially involved in the two reactions. When the 5'-GGTCA sequence present in the c-fos ERE is mutated to 5'-TTTCA, transcriptional activation and receptor binding activities are both lost. Mutation of the CAGCC-3' element corresponding to the second half-site of the c-fos sequence also led to the loss of receptor binding activity, suggesting that both half-sites of this element are involved in this function. 
The estrogen induction mediated by either the c-fos or the consensus ERE was blunted by the antiestrogen tamoxifen. Based on these studies, we believe the 3'-fos ERE sequence we have identified may be a major cis-acting element involved in the physiological regulation of the gene by estrogens in vivo.
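The comparison between the c-fos element and the consensus ERE can be checked with a few lines of code. The sketch below is illustrative only: the two 13-mers are taken from the abstract, while the function names and the choice of 5-nucleotide half-sites at each end are assumptions made for this example.

```python
# Illustrative sketch: compare the c-fos element with the consensus ERE.
# Sequences come from the abstract; helper names are hypothetical.

CONSENSUS_ERE = "GGTCACAGTGACC"  # consensus ERE (Xenopus vitellogenin A2)
CFOS_ELEMENT = "GGTCACCACAGCC"   # element found 3' of murine c-fos

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def hamming(a: str, b: str) -> int:
    """Number of mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written in upper-case ACGT."""
    return seq.translate(_COMPLEMENT)[::-1]


def half_sites(element: str, n: int = 5) -> tuple[str, str]:
    """Return the first and last n nucleotides (assumed half-site length)."""
    return element[:n], element[-n:]


if __name__ == "__main__":
    left, right = half_sites(CONSENSUS_ERE)
    # The consensus half-sites form an inverted repeat.
    print(left, right, reverse_complement(right) == left)
    # The c-fos element keeps the 5' half-site but diverges in the 3' one.
    c_left, c_right = half_sites(CFOS_ELEMENT)
    print(hamming(c_left, left), hamming(c_right, right))
    print(hamming(CFOS_ELEMENT, CONSENSUS_ERE))
```

Under these assumptions, the 5' half-site GGTCA is identical in both elements, and the divergence is concentrated in the spacer and the 3' half-site (CAGCC versus TGACC).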
Sequences of Zika Virus Genomes from a Pediatric Cohort in Nicaragua.

PubMed

Oldfield, Lauren M; Fedorova, Nadia; Puri, Vinita; Shrivastava, Susmita; Amedeo, Paolo; Durbin, Alan; Rocchi, Iara; Williams, Torrey; Shabman, Reed S; Tan, Gene S; Balmaseda, Angel; Kuan, Guillermina; Saborio, Saira; Gordon, Aubree; Harris, Eva; Pickett, Brett E

2018-06-14

We report here the whole-genome sequence of 11 Zika virus (ZIKV) samples from six pediatric patients in Nicaragua. Serum samples were collected, and ZIKV was isolated in tissue culture. Both serum and virus isolates were sequenced. The consensus ZIKV genomes are greater than 99% identical to each other. Copyright © 2018 Oldfield et al.
Regulation of the alpha-glucuronidase-encoding gene (aguA) from Aspergillus niger.

PubMed

de Vries, R P; van de Vondervoort, P J I; Hendriks, L; van de Belt, M; Visser, J

2002-09-01

The alpha-glucuronidase gene aguA from Aspergillus niger was cloned and characterised. Analysis of the promoter region of aguA revealed the presence of four putative binding sites for the major carbon catabolite repressor protein CREA and one putative binding site for the transcriptional activator XLNR. In addition, a sequence motif was detected which differed only in the last nucleotide from the XLNR consensus site. A construct in which part of the aguA coding region was deleted still resulted in production of a stable mRNA upon transformation of A. niger. The putative XLNR binding sites and two of the putative CREA binding sites were mutated individually in this construct and the effects on expression were examined in A. niger transformants. Northern analysis of the transformants revealed that the consensus XLNR site is not actually functional in the aguA promoter, whereas the sequence that diverges from the consensus at a single position is functional. This indicates that XLNR is also able to bind to the sequence GGCTAG, and the XLNR binding site consensus should therefore be changed to GGCTAR. Both CREA sites are functional, indicating that CREA has a strong influence on aguA expression. A detailed expression analysis of aguA in four genetic backgrounds revealed a second regulatory system involved in activation of aguA gene expression. This system responds to the presence of glucuronic and galacturonic acids, and is not dependent on XLNR.
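The revised consensus GGCTAR uses the IUPAC code R (A or G), so it matches both GGCTAG and GGCTAA, the latter presumably being the earlier consensus given that the two differ only in the last nucleotide. A minimal sketch of scanning a sequence for such a degenerate site follows; the function names and the test sequence are hypothetical, and only the forward strand is searched.

```python
# Illustrative sketch: scan a sequence for a degenerate (IUPAC) consensus site.
# The test sequence below is invented; only the forward strand is scanned.
import re

IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def iupac_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate an IUPAC pattern into a regex; lookahead allows overlaps."""
    body = "".join(IUPAC[ch] for ch in pattern.upper())
    return re.compile(f"(?=({body}))")


def find_sites(pattern: str, sequence: str) -> list[int]:
    """Return 0-based start positions of every match on the forward strand."""
    return [m.start() for m in iupac_to_regex(pattern).finditer(sequence.upper())]


if __name__ == "__main__":
    promoter = "TTGGCTAATTGGCTAGCC"  # hypothetical promoter fragment
    print(find_sites("GGCTAA", promoter))  # stricter pattern
    print(find_sites("GGCTAR", promoter))  # degenerate pattern
```

A real promoter scan would also search the reverse complement, since binding sites can occur in either orientation.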
Suppression Analysis Reveals a Functional Difference between the Serines in Positions Two and Five in the Consensus Sequence of the C-Terminal Domain of Yeast RNA Polymerase II

PubMed Central

Yuryev, A.; Corden, J. L.

1996-01-01

The largest subunit of RNA polymerase II contains a repetitive C-terminal domain (CTD) consisting of tandem repeats of the consensus sequence Tyr(1)Ser(2)Pro(3)Thr(4)Ser(5)Pro(6)Ser(7). Substitution of nonphosphorylatable amino acids at positions two or five of the Saccharomyces cerevisiae CTD is lethal. We developed a selection system for isolating suppressors of this lethal phenotype and cloned a gene, SCA1 (suppressor of CTD alanine), which complements recessive suppressors of lethal multiple-substitution mutations. A partial deletion of SCA1 (sca1Δ::hisG) suppresses alanine or glutamate substitutions at position two of the consensus CTD sequence, and a lethal CTD truncation mutation, but SCA1 deletion does not suppress alanine or glutamate substitutions at position five. SCA1 is identical to SRB9, a suppressor of a cold-sensitive CTD truncation mutation. Strains carrying dominant SRB mutations have the same suppression properties as a sca1Δ::hisG strain. These results reveal a functional difference between positions two and five of the consensus CTD heptapeptide repeat. The ability of SCA1 and SRB mutant alleles to suppress CTD truncation mutations suggests that substitutions at position two, but not at position five, cause a defect in RNA polymerase II function similar to that introduced by CTD truncation. PMID:8725217
PERMANENT GENETIC RESOURCES: Consensus primers of cyp73 genes discriminate willow species and hybrids (Salix, Salicaceae).

PubMed

Trung, Le Quang; Van Puyvelde, Karolien; Triest, Ludwig

2008-03-01

Consensus primers, based on exon sequences of the cyp73 gene family coding for cinnamate 4-hydroxylase (C4H) of the lignin biosynthesis pathway, were designed for the tetraploid willow species Salix alba and Salix fragilis. Diagnostic alleles at species level were observed among introns of three cyp73 genes and allowed unambiguous detection of the first generation and introgressed hybrids in populations. Progeny analysis of a female S. alba with a male introgressed hybrid confirmed the codominant inheritance of each intron. Sequences of the diagnostic alleles of both species were similar to those found in the hybrids. © 2007 The Authors.
An Unusual Case of White Piedra Due to Trichosporon inkin Mimicking Trichobacteriosis.

PubMed

Zhuang, Kaiwen; Ran, Xin; Dai, Yaling; Tang, Jiaoqing; Yang, Qin; Pradhan, Sushmita; Ran, Yuping

2016-12-01

White piedra is a superficial mycosis characterized by soft, white-to-tan, irregular nodules attached to the hair shafts. A 36-year-old man presented with small lumps in his pubic hair, without any other symptoms. The clinical features were suggestive of trichobacteriosis. Pathology analysis of the infected hair revealed that the concretions surrounding the hair shaft were full of fungal elements, parts of which had invaded into the cuticle. Culture on Sabouraud dextrose agar grew creamy, yellow-white colonies identified as Trichosporon inkin by the sequence of the nuclear ribosomal intergenic spacer region. The condition was treated by shaving the pubic hair and administering antifungal therapy (oral itraconazole and topical ketoconazole).
Low-copy nuclear primers and ycf1 primers in Cactaceae.

PubMed

Franck, Alan R; Cochrane, Bruce J; Garey, James R

2012-10-01

To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). The isi1, nhx1, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.
[DNA mutations associated to rifampicin or isoniazid resistance in M. tuberculosis clinical isolates from Sonora, Mexico].

PubMed

Bolado-Martínez, Enrique; Pérez-Mendoza, Ansix; Alegría-Morquecho, Francisco Monserrat; Candia-Plata, María del Carmen; Aguayo-Verdugo, María del Rosario; Alvarez-Hernández, Gerardo

2012-01-01

The aim was to analyze specific regions of the major genes associated with resistance to isoniazid or rifampin. Twenty-two M. tuberculosis strains isolated from human samples obtained in Sonora, Mexico, were studied. Specific primers for hotspots of the rpoB, katG, and inhA genes and the ahpC-oxyR intergenic region were used. The purified PCR products were sequenced. Mutations were identified in the promoter of inhA, the ahpC-oxyR region, codon 315 of katG, and codons 451 or 456 of rpoB. Detection of mutations not previously reported requires further genotypic analysis of Mycobacterium tuberculosis isolates in Sonora.
Highly divergent cyclo-like virus in a great roundleaf bat (Hipposideros armiger) in Vietnam.

PubMed

Kemenesi, Gábor; Kurucz, Kornélia; Zana, Brigitta; Tu, Vuong Tan; Görföl, Tamás; Estók, Péter; Földes, Fanni; Sztancsik, Katalin; Urbán, Péter; Fehér, Enikő; Jakab, Ferenc

2017-08-01

Members of the viral family Circoviridae are increasingly recognized worldwide. Bats seem to be natural reservoirs or dietary-related dispensers of these viruses. Here, we report a distantly related member of the genus Cyclovirus detected in the faeces of a great roundleaf bat (Hipposideros armiger). Interestingly, the novel virus lacks a Circoviridae-specific stem-loop structure, although a Geminiviridae-like nonamer sequence was detected in the large intergenic region. Based on these differences and its phylogenetic position, we propose that our new virus represents a distant and highly divergent member of the genus Cyclovirus. However it is lacking several characteristics of members of the genus, which raises a challenge in its taxonomic classification.
A 1,681-locus consensus genetic map of cultivated cucumber including 67 NB-LRR resistance gene homolog and ten gene loci

PubMed Central

2013-01-01

Background: Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results: From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions: Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. 
All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
Genetic dissection of the consensus sequence for the class 2 and class 3 flagellar promoters

PubMed Central

Wozniak, Christopher E.; Hughes, Kelly T.

2008-01-01

Summary Computational searches for DNA binding sites often utilize consensus sequences. These search models make assumptions that the frequency of a base pair in an alignment relates to the base pair’s importance in binding and presume that base pairs contribute independently to the overall interaction with the DNA binding protein. These two assumptions have generally been found to be accurate for DNA binding sites. However, these assumptions are often not satisfied for promoters, which are involved in additional steps in transcription initiation after RNA polymerase has bound to the DNA. To test these assumptions for the flagellar regulatory hierarchy, class 2 and class 3 flagellar promoters were randomly mutagenized in Salmonella. Important positions were then saturated for mutagenesis and compared to scores calculated from the consensus sequence. Double mutants were constructed to determine how mutations combined for each promoter type. Mutations in the binding site for FlhD4C2, the activator of class 2 promoters, better satisfied the assumptions for the binding model than did mutations in the class 3 promoter, which is recognized by the σ28 transcription factor. These in vivo results indicate that the activator sites within flagellar promoters can be modeled using simple assumptions but that the DNA sequences recognized by the flagellar sigma factor require more complex models. PMID:18486950
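The two assumptions described above (base frequency reflects importance, and positions contribute independently) are the ones built into a standard position weight matrix. The following is a minimal sketch of such an additive scoring model, not the scoring scheme used in the study; the aligned sites, the pseudocount, and the uniform background are all invented for illustration.

```python
# Illustrative sketch of an additive consensus scoring model (position
# weight matrix). Sites, pseudocount, and background are hypothetical.
import math
from collections import Counter


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Per-position log2-odds scores from equal-length aligned sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + 4 * pseudocount
    pwm = []
    for i in range(length):
        counts = Counter(site[i] for site in sites)
        pwm.append({
            base: math.log2(((counts[base] + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def score(pwm, seq):
    """Sum of independent per-position scores (the additivity assumption)."""
    if len(seq) != len(pwm):
        raise ValueError("sequence length must match the matrix")
    return sum(column[base] for column, base in zip(pwm, seq))


if __name__ == "__main__":
    sites = ["TAAAGT", "TAAAGT", "TAAAGA", "TCAAGT", "TAAGGT"]  # toy alignment
    pwm = build_pwm(sites)
    wild_type, single_1, single_2, double = "TAAAGT", "GAAAGT", "TAAAGC", "GAAAGC"
    loss_1 = score(pwm, wild_type) - score(pwm, single_1)
    loss_2 = score(pwm, wild_type) - score(pwm, single_2)
    loss_double = score(pwm, wild_type) - score(pwm, double)
    # Under this model the double-mutant loss is exactly the sum of the
    # single-mutant losses; real promoters may deviate from this.
    print(round(loss_1 + loss_2, 6) == round(loss_double, 6))
```

Comparing measured double-mutant effects against the sum predicted by such a model is one way to see where the independence assumption breaks down.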
Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

PubMed Central

Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

1996-01-01

Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327
Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

PubMed

Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

1996-12-01

Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.
A universal protocol to generate consensus level genome sequences for foot-and-mouth disease virus and other positive-sense polyadenylated RNA viruses using the Illumina MiSeq.

PubMed

Logan, Grace; Freimanis, Graham L; King, David J; Valdazo-González, Begoña; Bachanek-Bankowska, Katarzyna; Sanderson, Nicholas D; Knowles, Nick J; King, Donald P; Cottam, Eleanor M

2014-09-30

Next-Generation Sequencing (NGS) is revolutionizing molecular epidemiology by providing new approaches to undertake whole genome sequencing (WGS) in diagnostic settings for a variety of human and veterinary pathogens. Previous sequencing protocols have been subject to biases such as those encountered during PCR amplification and cell culture, or are restricted by the need for large quantities of starting material. We describe here a simple and robust methodology for the generation of whole genome sequences on the Illumina MiSeq. This protocol is specific for foot-and-mouth disease virus (FMDV) or other polyadenylated RNA viruses and circumvents both the use of PCR and the requirement for large amounts of initial template. The protocol was successfully validated using five FMDV positive clinical samples from the 2001 epidemic in the United Kingdom, as well as a panel of representative viruses from all seven serotypes. In addition, this protocol was successfully used to recover 94% of an FMDV genome that had previously been identified as cell culture negative. Genome sequences from three other non-FMDV polyadenylated RNA viruses (EMCV, ERAV, VESV) were also obtained with minor protocol amendments. We calculated that a minimum coverage depth of 22 reads was required to produce an accurate consensus sequence for FMDV O. This was achieved in 5 FMDV/O/UKG isolates and the type O FMDV from the serotype panel with the exception of the 5' genomic termini and area immediately flanking the poly(C) region. We have developed a universal WGS method for FMDV and other polyadenylated RNA viruses. This method works successfully from a limited quantity of starting material and eliminates the requirement for genome-specific PCR amplification. This protocol has the potential to generate consensus-level sequences within a routine high-throughput diagnostic environment.
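The role of a minimum coverage depth in consensus calling can be illustrated with a short sketch. The depth threshold of 22 comes from the abstract (where it is reported for FMDV O); the majority-rule base call, the masking of low-coverage positions with N, and the function name are assumptions for this example and do not describe the authors' pipeline.

```python
# Illustrative sketch: majority-rule consensus with a minimum depth filter.
# The threshold of 22 is from the abstract; everything else is assumed.
from collections import Counter


def call_consensus(columns, min_depth=22):
    """Build a consensus string from per-position read bases.

    `columns` is an iterable of strings, one per genome position, each
    holding the bases observed in the reads covering that position.
    Positions with fewer than `min_depth` A/C/G/T observations become 'N'.
    """
    consensus = []
    for column in columns:
        bases = [b for b in column.upper() if b in "ACGT"]
        if len(bases) < min_depth:
            consensus.append("N")  # too little coverage to call reliably
        else:
            consensus.append(Counter(bases).most_common(1)[0][0])
    return "".join(consensus)


if __name__ == "__main__":
    pileup = [
        "A" * 30,             # deep, unanimous
        "A" * 20 + "G" * 5,   # 25 reads, A in the majority
        "C" * 10,             # only 10 reads
        "T" * 21,             # one read short of the threshold
        "T" * 22,             # exactly at the threshold
    ]
    print(call_consensus(pileup))
```

In practice the per-position bases would come from an alignment pileup, and regions such as the genomic termini would tend to fall below the threshold.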
Prototype foamy virus envelope glycoprotein leader peptide processing is mediated by a furin-like cellular protease, but cleavage is not essential for viral infectivity.

PubMed

Duda, Anja; Stange, Annett; Lüftenegger, Daniel; Stanke, Nicole; Westphal, Dana; Pietschmann, Thomas; Eastman, Scott W; Linial, Maxine L; Rethwilm, Axel; Lindemann, Dirk

2004-12-01

Analogous to cellular glycoproteins, viral envelope proteins contain N-terminal signal sequences responsible for targeting them to the secretory pathway. The prototype foamy virus (PFV) envelope (Env) shows a highly unusual biosynthesis. Its precursor protein has a type III membrane topology with both the N and C terminus located in the cytoplasm. Coexpression of FV glycoprotein and interaction of its leader peptide (LP) with the viral capsid is essential for viral particle budding and egress. Processing of PFV Env into the particle-associated LP, surface (SU), and transmembrane (TM) subunits occurs posttranslationally during transport to the cell surface by yet-unidentified cellular proteases. Here we provide strong evidence that furin itself or a furin-like protease and not the signal peptidase complex is responsible for both processing events. N-terminal protein sequencing of the SU and TM subunits of purified PFV Env-immunoglobulin G immunoadhesin identified furin consensus sequences upstream of both cleavage sites. Mutagenesis analysis of two overlapping furin consensus sequences at the PFV LP/SU cleavage site in the wild-type protein confirmed the sequencing data and demonstrated utilization of only the first site. Fully processed SU was almost completely absent in viral particles of mutants having conserved arginine residues replaced by alanines in the first furin consensus sequence, but normal processing was observed upon mutation of the second motif. Although these mutants displayed a significant loss in infectivity as a result of reduced particle release, no correlation to processing inhibition was observed, since another mutant having normal LP/SU processing had a similar defect.
cWINNOWER algorithm for finding fuzzy DNA motifs

NASA Technical Reports Server (NTRS)

Liang, S.; Samanta, M. P.; Biegel, B. A.

2004-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a sufficiently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4). Copyright Imperial College Press.
cWINNOWER Algorithm for Finding Fuzzy DNA Motifs

NASA Technical Reports Server (NTRS)

Liang, Shoudan

2003-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if multiple mutated copies of the motif (i.e., the signals) are present in the DNA sequence in sufficient abundance. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum number of detectable motifs qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4).
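The definition of a signal (an l-mer within d mutations of a motif) can be made concrete with a short sketch. This is not the cWINNOWER algorithm itself, which has to discover an unknown motif; the code below assumes the motif is already known and simply lists its signals. It also shows the pairwise test that follows from the triangle inequality: two signals of the same motif can differ from each other in at most 2d positions. Function names and test sequences are hypothetical.

```python
# Illustrative sketch of the (l, d) signal definition. This is NOT the
# cWINNOWER algorithm: the motif is assumed known here.


def hamming(a: str, b: str) -> int:
    """Number of mismatched positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def find_signals(sequence: str, motif: str, d: int) -> list[int]:
    """Start positions of every l-mer within d mismatches of the motif."""
    l = len(motif)
    return [
        i for i in range(len(sequence) - l + 1)
        if hamming(sequence[i:i + l], motif) <= d
    ]


def could_share_motif(a: str, b: str, d: int) -> bool:
    """Two signals of one motif differ in at most 2*d positions."""
    return hamming(a, b) <= 2 * d


if __name__ == "__main__":
    seq = "ACGTTTAGGTCCCC"  # toy sequence
    print(find_signals(seq, "ACGT", d=1))
    print(could_share_motif("ACGT", "AGGT", d=1))
    print(could_share_motif("ACGT", "TTTA", d=1))
```

Motif-discovery methods of this family work in the other direction: they look for groups of l-mers that are pairwise compatible and then test whether a common motif exists for the group.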
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design

PubMed Central

Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven

2003-01-01

We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
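The size of the primer pool in the 3′ degenerate core follows directly from the genetic code: it is the product of the number of codons for each conserved amino acid. The sketch below computes that product under the standard genetic code. It is not the CODEHOP program and does not design the 5′ consensus clamp; the function name and example peptides are hypothetical.

```python
# Illustrative sketch: degeneracy of a primer core that encodes a short
# conserved peptide, assuming the standard genetic code. Not CODEHOP itself.
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def core_degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences encoding the peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


if __name__ == "__main__":
    # Low-degeneracy residues (W, M, D, K) keep the pool small, while
    # six-codon residues (L, S, R) inflate it quickly.
    print(core_degeneracy("WDMK"))
    print(core_degeneracy("LSR"))
```

Keeping the degenerate core to 3–4 amino acids, ideally ones with few codons, is what keeps the pool small enough for each individual primer to be present at a useful concentration.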

Mapping and Sequencing the Human Genome

DOE R&D Accomplishments Database

1988-01-01

Numerous meetings have been held and a debate has developed in the biological community over the merits of mapping and sequencing the human genome. In response, a committee to examine the desirability and feasibility of mapping and sequencing the human genome was formed to suggest options for implementing the project. The committee asked many questions. Should the analysis of the human genome be left entirely to the traditionally uncoordinated, but highly successful, support systems that fund the vast majority of biomedical research? Or should a more focused and coordinated additional support system be developed that is limited to encouraging and facilitating the mapping and eventual sequencing of the human genome? If so, how can this be done without distorting the broader goals of biological research that are crucial for any understanding of the data generated in such a human genome project? As the committee became better informed on the many relevant issues, the opinions of its members coalesced, producing a shared consensus of what should be done. This report reflects that consensus.
Identification and Structural Characterization of the ALIX-Binding Late Domains of Simian Immunodeficiency Virus SIV mac239 and SIV agmTan-1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Q Zhai; M Landesman; H Robinson

2011-12-31

Retroviral Gag proteins contain short late-domain motifs that recruit cellular ESCRT pathway proteins to facilitate virus budding. ALIX-binding late domains often contain the core consensus sequence YPX{sub n}L (where X{sub n} can vary in sequence and length). However, some simian immunodeficiency virus (SIV) Gag proteins lack this consensus sequence, yet still bind ALIX. We mapped divergent, ALIX-binding late domains within the p6{sup Gag} proteins of SIV{sub MAC239} ({sub 40}SREK{und P}YKE{und VT}ED{und L}LHLNSLF{sub 59}) and SIV{sub agmTan-1} ({sub 24}AAG{und A}YDP{und AR}KL{und L}EQYAKK{sub 41}). Crystal structures revealed that anchoring tyrosines (in lightface) and nearby hydrophobic residues (underlined) contact the ALIX V domain, revealing how lentiviruses employ a diverse family of late-domain sequences to bind ALIX and promote virus budding.
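A consensus such as YPXnL can be screened for with a regular expression. The sketch below is only illustrative: the abstract does not give the allowed range of n, so the bounds of 1 to 4 residues are an assumption, and `find_ypxnl` is a hypothetical helper. The two SIV segments are taken from the abstract with the markup removed; the positive control is a made-up string.

```python
import re

# Assumption: X_n spans 1 to 4 residues; the abstract only says it varies.
YPXNL = re.compile(r"YP[A-Z]{1,4}?L")


def find_ypxnl(protein: str):
    """Return the first YPXnL-like match in `protein`, or None."""
    m = YPXNL.search(protein)
    return m.group(0) if m else None


siv_mac239_segment = "SREKPYKEVTEDLLHLNSLF"  # residues 40-59 of p6 Gag
siv_agmtan1_segment = "AAGAYDPARKLLEQYAKK"   # residues 24-41 of p6 Gag
toy_positive = "AAYPDLAA"                    # hypothetical control string

hits = [find_ypxnl(s) for s in (siv_mac239_segment, siv_agmtan1_segment, toy_positive)]
```

Neither SIV segment matches, consistent with the statement that these late domains lack the canonical consensus yet still bind ALIX.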
Identification and Structural Characterization of the ALIX-Binding Late Domains of Simian Immunodeficiency Virus SIVmac239 and SIVagmTan-1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhai, Q.; Robinson, H.; Landesman, M. B.

2011-01-01

Retroviral Gag proteins contain short late-domain motifs that recruit cellular ESCRT pathway proteins to facilitate virus budding. ALIX-binding late domains often contain the core consensus sequence YPX{sub n}L (where X{sub n} can vary in sequence and length). However, some simian immunodeficiency virus (SIV) Gag proteins lack this consensus sequence, yet still bind ALIX. We mapped divergent, ALIX-binding late domains within the p6{sup Gag} proteins of SIV{sub mac239} ({sub 40}SREK{und P}YKE{und VT}ED{und L}LHLNSLF{sub 59}) and SIV{sub agmTan-1} ({sub 24}AAG{und A}YDP{und AR}KL{und L}EQYAKK{sub 41}). Crystal structures revealed that anchoring tyrosines (in lightface) and nearby hydrophobic residues (underlined) contact the ALIX V domain, revealing how lentiviruses employ a diverse family of late-domain sequences to bind ALIX and promote virus budding.
Human adenovirus serotype 12 virion precursors pMu and pVI are cleaved at amino-terminal and carboxy-terminal sites that conform to the adenovirus 2 endoproteinase cleavage consensus sequence.

PubMed

Freimuth, P; Anderson, C W

1993-03-01

The sequence of a 1158-base pair fragment of the human adenovirus serotype 12 (Ad12) genome was determined. This segment encodes the precursors for virion components Mu and VI. Both Ad12 precursors contain two sequences that conform to a consensus sequence motif for cleavage by the endoproteinase of adenovirus 2 (Ad2). Analysis of the amino terminus of VI and of the peptide fragments found in Ad12 virions demonstrated that these sites are cleaved during Ad12 maturation. This observation suggests that the recognition motif for adenovirus endoproteinases is highly conserved among human serotypes. The adenovirus 2 endoproteinase polypeptide requires additional co-factors for activity (C. W. Anderson, Protein Expression Purif., 1993, 4, 8-15). Synthetic Ad12 or Ad2 pVI carboxy-terminal peptides each permitted efficient cleavage of an artificial endoproteinase substrate by recombinant Ad2 endoproteinase polypeptide.
High-Throughput SNP Discovery through Deep Resequencing of a Reduced Representation Library to Anchor and Orient Scaffolds in the Soybean Whole Genome Sequence

USDA-ARS's Scientific Manuscript database

The soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy but only properly oriented 66% of the sequence scaffolds. To find additional single nucleotide polymorphism (SNP) markers for additiona...
Mini-midi-mito: adapting the amplification and sequencing strategy of mtDNA to the degradation state of crime scene samples.

PubMed

Berger, Cordula; Parson, Walther

2009-06-01

The degradation state of some biological traces recovered from the crime scene requires the amplification of very short fragments to attain a useful mitochondrial (mt)DNA sequence. We have previously introduced two mini-multiplex assays that amplify 10 overlapping control region (CR) fragments in two separate multiplex PCRs, which brought successful CR consensus sequences from even highly degraded DNA extracts. This procedure requires a total of 20 sequencing reactions per sample, which is laborious and cost intensive. For only moderately degraded samples that we encounter more frequently with typical mtDNA casework material, we developed two new multiplex assays that use a subset of the mini-amplicon primers but embrace larger fragments (midis) and require only 10 sequencing reactions to build a double-stranded CR consensus sequence. We used a preceding mtDNA quantitation step by real-time PCR with two different target fragments (143 and 283 bp) that roughly correspond to the average fragment sizes of the different multiplex approaches to estimate size-dependent mtDNA quantities and to aid the choice of the appropriate PCR multiplexes with respect to quality of the results and required costs.
Strain typing of acetic acid bacteria responsible for vinegar production by the submerged elaboration method.

PubMed

Fernández-Pérez, Rocío; Torres, Carmen; Sanz, Susana; Ruiz-Larrea, Fernanda

2010-12-01

Strain typing of 103 acetic acid bacteria isolates from vinegars elaborated by the submerged method from ciders, wines and spirit ethanol was carried out in this study. Two different molecular methods were utilised: pulsed field gel electrophoresis (PFGE) of total DNA digests with a number of restriction enzymes, and enterobacterial repetitive intergenic consensus (ERIC)-PCR analysis. The comparative study of both methods showed that restriction fragment PFGE of SpeI digests of total DNA was a suitable method for strain typing and for determining which strains were present in vinegar fermentations. Results showed that strains of the species Gluconacetobacter europaeus were the most frequent leader strains of fermentations by the submerged method in the studied vinegars, and among them strain R1 was the predominant one. Results also showed that mixed populations (at least two different strains) occurred in vinegars from cider and wine, whereas unique strains were found in spirit vinegars, which offered the most stressful conditions for bacterial growth. Copyright © 2010 Elsevier Ltd. All rights reserved.
Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.

PubMed

Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W

2011-09-01

Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.
Transcriptional Profiling of the Oral Pathogen Streptococcus mutans in Response to Competence Signaling Peptide XIP.

PubMed

Wenderska, Iwona B; Latos, Andrew; Pruitt, Benjamin; Palmer, Sara; Spatafora, Grace; Senadheera, Dilani B; Cvitkovitch, Dennis G

2017-01-01

In the cariogenic Streptococcus mutans, competence development is regulated by the ComRS signaling system comprised of the ComR regulator and the ComS prepeptide to the competence signaling peptide XIP (ComX-inducing peptide). Aside from competence development, XIP signaling has been demonstrated to regulate cell lysis, and recently, the expression of bacteriocins, small antimicrobial peptides used by bacteria to inhibit closely related species. Our study further explores the effect of XIP signaling on the S. mutans transcriptome. RNA sequencing revealed that XIP induction resulted in a global change in gene expression that was consistent with a stress response. An increase in several membrane-bound regulators, including HdrRM and BrsRM, involved in bacteriocin production, and the VicRKX system, involved in acid tolerance and biofilm formation, was observed. Furthermore, global changes in gene expression corresponded to changes observed during the stringent response to amino acid starvation. Effects were also observed on genes involved in sugar transport and carbon catabolite repression and included the levQRST and levDEFG operons. Finally, our work identified a novel heat shock-responsive intergenic region, encoding a small RNA, with a potential role in competence shutoff. IMPORTANCE Genetic competence provides bacteria with an opportunity to increase genetic diversity or acquire novel traits conferring a survival advantage. In the cariogenic pathogen Streptococcus mutans, DNA transformation is regulated by the competence stimulating peptide XIP (ComX-inducing peptide). The present study utilizes high-throughput RNA sequencing (RNAseq) to provide a greater understanding of how global gene expression patterns change in response to XIP. Overall, our work demonstrates that in S. mutans, XIP signaling induces a response that resembles the stringent response to amino acid starvation. We further identify a novel heat shock-responsive intergenic region with a potential role in competence shutoff. Together, our results provide further evidence that multiple stress response mechanisms are linked through the genetic competence signaling pathway in S. mutans.
RNA Sequencing of the Exercise Transcriptome in Equine Athletes

PubMed Central

Verini-Supplizi, Andrea; Barcaccia, Gianni; Albiero, Alessandro; D'Angelo, Michela; Campagna, Davide; Valle, Giorgio; Felicetti, Michela; Silvestrelli, Maurizio; Cappelli, Katia

2013-01-01

The horse is an optimal model organism for studying the genomic response to exercise-induced stress, due to its natural aptitude for athletic performance and the relative homogeneity of its genetic and environmental backgrounds. Here, we applied RNA-sequencing analysis through the use of SOLiD technology in an experimental framework centered on exercise-induced stress during endurance races in equine athletes. We monitored the transcriptional landscape by comparing gene expression levels between animals at rest and after competition. Overall, we observed a shift from coding to non-coding regions, suggesting that the stress response involves the differential expression of unannotated regions. Notably, we observed significant post-race increases of reads that correspond to repeats, especially the intergenic and intronic L1 and L2 transposable elements. We also observed increased expression of the antisense strands compared to the sense strands in intronic and regulatory regions (1 kb up- and downstream) of the genes, suggesting that antisense transcription could be one of the main mechanisms for transposon regulation in the horse under stress conditions. We identified a large number of transcripts corresponding to intergenic and intronic regions putatively associated with new transcriptional elements. Gene expression and pathway analysis allowed us to identify several biological processes and molecular functions that may be involved with exercise-induced stress. Ontology clustering reflected mechanisms that are already known to be stress activated (e.g., chemokine-type cytokines, Toll-like receptors, and kinases), as well as “nucleic acid binding” and “signal transduction activity” functions. There was also a general and transient decrease in the global rates of protein synthesis, which would be expected after strenuous global stress. In sum, our network analysis points toward the involvement of specific gene clusters in equine exercise-induced stress, including those involved in inflammation, cell signaling, and immune interactions. PMID:24391776
Isolation and characterization of Treponema phagedenis-like spirochetes from digital dermatitis lesions in Swedish dairy cattle

PubMed Central

Pringle, Märit; Bergsten, Christer; Fernström, Lise-Lotte; Höök, Helena; Johansson, Karl-Erik

2008-01-01

Background: Digital dermatitis in cattle is an emerging infectious disease. Ulcerative lesions are typically located on the plantar skin between the heel bulbs and adjacent to the coronet. Spirochetes of the genus Treponema are found in high numbers in the lesions and are likely to be involved in the pathogenesis. The aim of this study was to obtain pure cultures of spirochetes from cattle with digital dermatitis and to describe them further. Methods: Tissue samples and swabs from active digital dermatitis lesions were used for culturing. Pure isolates were subjected to molecular typing through 16S rRNA gene sequencing, pulsed-field gel electrophoresis (PFGE), random amplified polymorphic DNA (RAPD) and an intergenic spacer PCR developed for Treponema spp., as well as API-ZYM and antimicrobial susceptibility tests. The antimicrobial agents used were tiamulin, valnemulin, tylosin, aivlosin, lincomycin and doxycycline. Results: Seven spirochete isolates from five herds were obtained. Both 16S rRNA gene sequences, which were identical except for three polymorphic nucleotide positions, and the intergenic spacer PCR indicated that all isolates were of one yet unnamed species, most closely related to Treponema phagedenis. The enzymatic profile and antimicrobial susceptibility pattern were also similar for all isolates. However, it was possible to separate the isolates through their PFGE and RAPD banding patterns. Conclusion: This is the first report on isolation of a Treponema sp. from cattle with digital dermatitis in Scandinavia. The phylotype isolated has previously been cultured from samples from cattle in the USA and the UK and is closely related to T. phagedenis. While very similar, the isolates in this study could be differentiated through PFGE and RAPD, indicating that these methods are suitable for subtyping of this phylotype. No antimicrobial resistance could be detected among the tested isolates. PMID:18937826
Nanopore DNA Sequencing and Genome Assembly on the International Space Station.

PubMed

Castro-Wallace, Sarah L; Chiu, Charles Y; John, Kristen K; Stahl, Sarah E; Rubins, Kathleen H; McIntyre, Alexa B R; Dworkin, Jason P; Lupisella, Mark L; Smith, David J; Botkin, Douglas J; Stephenson, Timothy A; Juul, Sissel; Turner, Daniel J; Izquierdo, Fernando; Federman, Scot; Stryke, Doug; Somasekar, Sneha; Alexander, Noah; Yu, Guixia; Mason, Christopher E; Burton, Aaron S

2017-12-21

We evaluated the performance of the MinION DNA sequencer in-flight on the International Space Station (ISS), and benchmarked its performance off-Earth against the MinION, Illumina MiSeq, and PacBio RS II sequencing platforms in terrestrial laboratories. Samples contained equimolar mixtures of genomic DNA from lambda bacteriophage, Escherichia coli (strain K12, MG1655) and Mus musculus (female BALB/c mouse). Nine sequencing runs were performed aboard the ISS over a 6-month period, yielding a total of 276,882 reads with no apparent decrease in performance over time. From sequence data collected aboard the ISS, we constructed directed assemblies of the ~4.6 Mb E. coli genome, ~48.5 kb lambda genome, and a representative M. musculus sequence (the ~16.3 kb mitochondrial genome), at 100%, 100%, and 96.7% consensus pairwise identity, respectively; de novo assembly of the E. coli genome from raw reads yielded a single contig comprising 99.9% of the genome at 98.6% consensus pairwise identity. Simulated real-time analyses of in-flight sequence data using an automated bioinformatic pipeline and laptop-based genomic assembly demonstrated the feasibility of sequencing analysis and microbial identification aboard the ISS. These findings illustrate the potential for sequencing applications including disease diagnosis, environmental monitoring, and elucidating the molecular basis for how organisms respond to spaceflight.
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

PubMed

Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

2017-02-01

Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The sampled-data consensus of multi-agent systems with probabilistic time-varying delays and packet losses

NASA Astrophysics Data System (ADS)

Sui, Xin; Yang, Yongqing; Xu, Xianyun; Zhang, Shuai; Zhang, Lingzhong

2018-02-01

This paper investigates the consensus of multi-agent systems with probabilistic time-varying delays and packet losses via sampled-data control. On the one hand, a Bernoulli-distributed white sequence is employed to model random packet losses among agents. On the other hand, a switched system is used to describe packet dropouts in a deterministic way. Based on the special property of the Laplacian matrix, the consensus problem can be converted into a stabilization problem of a switched system with lower dimensions. Some mean square consensus criteria are derived in terms of constructing an appropriate Lyapunov function and using linear matrix inequalities (LMIs). Finally, two numerical examples are given to show the effectiveness of the proposed method.
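The Bernoulli packet-loss model mentioned above can be illustrated with a small simulation. This is a deliberately simplified discrete-time sketch, not the paper's sampled-data controller: it ignores delays and the switched-system formulation, and the ring topology, step size `eps`, loss probability, and the name `simulate_consensus` are all assumptions chosen for illustration.

```python
import random


def simulate_consensus(x0, neighbors, eps=0.3, loss_prob=0.3, steps=200, seed=0):
    """Iterate x_i <- x_i + eps * sum_j gamma_ij * (x_j - x_i).

    gamma_ij is a Bernoulli variable: 1 if the packet from agent j
    reaches agent i in this step, 0 if it is lost.
    """
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(steps):
        nxt = []
        for i, xi in enumerate(x):
            update = 0.0
            for j in neighbors[i]:
                if rng.random() >= loss_prob:  # packet from j arrives
                    update += x[j] - xi
            nxt.append(xi + eps * update)
        x = nxt
    return x


# Four agents on a ring; eps times the maximum degree is below 1, so each
# update is a convex combination of the current states.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
initial = [0.0, 3.0, 6.0, 9.0]
final = simulate_consensus(initial, ring)
spread = max(final) - min(final)
```

Because every update is a convex combination, the states never leave the initial range, and random losses only slow the agreement rather than prevent it in this toy setting.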
Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire

USDA-ARS's Scientific Manuscript database

Background: Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surve...
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE PAGES

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; ...

2016-05-02

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, another with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, another with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.
Evolution of sfbI Encoding Streptococcal Fibronectin-Binding Protein I: Horizontal Genetic Transfer and Gene Mosaic Structure

PubMed Central

Towers, Rebecca J.; Fagan, Peter K.; Talay, Susanne R.; Currie, Bart J.; Sriprakash, Kadaba S.; Walker, Mark J.; Chhatwal, Gursharan S.

2003-01-01

Streptococcal fibronectin-binding protein is an important virulence factor involved in colonization and invasion of epithelial cells and tissues by Streptococcus pyogenes. In order to investigate the mechanisms involved in the evolution of sfbI, the sfbI genes from 54 strains were sequenced. Thirty-four distinct alleles were identified. Three principal mechanisms appear to have been involved in the evolution of sfbI. The amino-terminal aromatic amino acid-rich domain is the most variable region and is apparently generated by intergenic recombination of horizontally acquired DNA cassettes, resulting in a genetic mosaic in this region. Two distinct and divergent sequence types that shared only 61 to 70% identity were identified in the central proline-rich region, while variation at the 3′ end of the gene is due to deletion or duplication of defined repeat units. Potential antigenic and functional variabilities in SfbI imply significant selective pressure in vivo with direct implications for the microbial pathogenesis of S. pyogenes. PMID:14662917
Rapid identification of probiotic Lactobacillus species by multiplex PCR using species-specific primers based on the region extending from 16S rRNA through 23S rRNA.

PubMed

Kwon, Hyuk-Sang; Yang, Eun-Hee; Yeon, Seung-Woo; Kang, Byoung-Hwa; Kim, Tae-Yong

2004-10-15

This study aimed to develop a novel multiplex polymerase chain reaction (PCR) primer set for the identification of seven probiotic Lactobacillus species: Lactobacillus acidophilus, Lactobacillus delbrueckii, Lactobacillus casei, Lactobacillus gasseri, Lactobacillus plantarum, Lactobacillus reuteri and Lactobacillus rhamnosus. The primer set, comprising seven specific and two conserved primers, was derived from the integrated sequences of 16S and 23S rRNA genes and their rRNA intergenic spacer region of each species. It was able to identify the seven target species with 93.6% accuracy, which exceeds that of the general biochemical methods. The phylogenetic analyses, using 16S rDNA sequences of the probiotic isolates, also provided further support that the results from the multiplex PCR assay were trustworthy. Taken together, we suggest that the multiplex primer set is an efficient tool for simple, rapid and reliable identification of seven Lactobacillus species.
Molecular epidemiological tracing of a cattle rabies outbreak lasting less than a month in Rio Grande do Sul in southern Brazil.

PubMed

Itou, Takuya; Fukayama, Toshiharu; Mochizuki, Nobuyuki; Kobayashi, Yuki; Deberaldini, Eduardo R; Carvalho, Adolorata A B; Ito, Fumio H; Sakai, Takeo

2016-02-12

Vampire bat-transmitted cattle rabies cases are typically encountered in areas where the disease is endemic. However, over the period of a month in 2009, an outbreak of cattle rabies occurred and then ended spontaneously in a small area of the Rio Grande do Sul State in southern Brazil. To investigate the epidemiological characteristics of this rabies outbreak in Rio Grande do Sul, 26 nucleotide sequences of rabies virus (RABV) genomes that were collected in this area were analyzed phylogenetically. Nucleotide sequence identities of the nucleoprotein gene and G-L intergenic region of the 26 RABVs were greater than 99.6 %. Phylogenetic analysis showed that all RABVs clustered with the vampire bat-related cattle RABV strains and that the RABVs were mainly distributed in southern Brazil. The findings of the present study suggested that a small population of rabid vampire bats carrying a single RABV strain produced a spatiotemporally restricted outbreak of cattle rabies in southern Brazil.
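A statement such as "identities were greater than 99.6%" rests on pairwise comparison of aligned sequences. The sketch below shows one common way to compute it, assuming the sequences are already aligned to equal length and that gap columns are skipped; the function names and toy sequences are hypothetical, and the study's own software and gap handling are not described in the abstract.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def min_pairwise_identity(sequences) -> float:
    """Lowest identity found among all pairs in an aligned set."""
    return min(percent_identity(a, b) for a, b in combinations(sequences, 2))


# Toy aligned sequences (made up for illustration).
aligned = ["ACGT", "ACGA", "ACGT"]
lowest = min_pairwise_identity(aligned)
```

Reporting the minimum over all pairs gives the kind of lower bound quoted for a set of closely related isolates.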

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

PubMed

Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory

2017-12-01

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete: many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.
Landscape of somatic mutations in 560 breast cancer whole genome sequences

PubMed Central

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; Ramakrishna, Manasa; Glodzik, Dominik; Zou, Xueqing; Martincorena, Inigo; Alexandrov, Ludmil B.; Martin, Sancha; Wedge, David C.; Van Loo, Peter; Ju, Young Seok; Smid, Marcel; Brinkman, Arie B; Morganella, Sandro; Aure, Miriam R.; Lingjærde, Ole Christian; Langerød, Anita; Ringnér, Markus; Ahn, Sung-Min; Boyault, Sandrine; Brock, Jane E.; Broeks, Annegien; Butler, Adam; Desmedt, Christine; Dirix, Luc; Dronov, Serge; Fatima, Aquila; Foekens, John A.; Gerstung, Moritz; Hooijer, Gerrit KJ; Jang, Se Jin; Jones, David R.; Kim, Hyung-Yong; King, Tari A.; Krishnamurthy, Savitri; Lee, Hee Jin; Lee, Jeong-Yeon; Li, Yilong; McLaren, Stuart; Menzies, Andrew; Mustonen, Ville; O’Meara, Sarah; Pauporté, Iris; Pivot, Xavier; Purdie, Colin A.; Raine, Keiran; Ramakrishnan, Kamna; Rodríguez-González, F. Germán; Romieu, Gilles; Sieuwerts, Anieta M.; Simpson, Peter T; Shepherd, Rebecca; Stebbings, Lucy; Stefansson, Olafur A; Teague, Jon; Tommasi, Stefania; Treilleux, Isabelle; Van den Eynden, Gert G.; Vermeulen, Peter; Vincent-Salomon, Anne; Yates, Lucy; Caldas, Carlos; van’t Veer, Laura; Tutt, Andrew; Knappskog, Stian; Tan, Benita Kiat Tee; Jonkers, Jos; Borg, Åke; Ueno, Naoto T; Sotiriou, Christos; Viari, Alain; Futreal, P. Andrew; Campbell, Peter J; Span, Paul N.; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E.; Thompson, Alastair M.; Birney, Ewan; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W.M.; Børresen-Dale, Anne-Lise; Richardson, Andrea L.; Kong, Gu; Thomas, Gilles; Stratton, Michael R.

2016-01-01

We analysed whole genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. 93 protein-coding cancer genes carried likely driver mutations. Some non-coding regions exhibited high mutation frequencies but most have distinctive structural features probably causing elevated mutation rates and do not harbour driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed 12 base substitution and six rearrangement signatures. Three rearrangement signatures, characterised by tandem duplications or deletions, appear associated with defective homologous recombination based DNA repair: one with deficient BRCA1 function; another with deficient BRCA1 or BRCA2 function; the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operative, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer. PMID:27135926
Genetic Control of Amadori Product Degradation in Bacillus subtilis via Regulation of frlBONMD Expression by FrlR

PubMed Central

Deppe, Veronika Maria; Klatte, Stephanie; Bongaerts, Johannes; Maurer, Karl-Heinz; O'Connell, Timothy; Meinhardt, Friedhelm

2011-01-01

Bacillus subtilis is capable of degrading fructosamines. The phosphorylation and the cleavage of the resulting fructosamine 6-phosphates are catalyzed by the frlD and frlB gene products, respectively. This study addresses the physiological importance of the frlBONMD genes (formerly yurPONML), revealing the necessity of their expression for growth on fructosamines and focusing on the complex regulation of the corresponding transcription unit. In addition to the known regulation by the global transcriptional regulator CodY, the frl genes are repressed by the convergently transcribed FrlR (formerly YurK). The latter causes repression during growth on substrates other than fructosamines. Additionally, we identified in the first intergenic region of the operon an FrlR binding site which is centrally located within a 38-bp perfect palindromic sequence. There is genetic evidence that this sequence, in combination with FrlR, contributes to the remarkable decrease in the transcription downstream of the first gene of the frl operon. PMID:21398478
Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.

PubMed

Liang, Yunyun; Liu, Sanyang; Zhang, Shengli

2015-01-01

Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves favorable and competitive performance. This will offer an important complement to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.
Molecular Characterization of the Kamese Virus, an Unassigned Rhabdovirus, Isolated from Culex pruina in the Central African Republic.

PubMed

Simo Tchetgna, Huguette Dorine; Nakoune, Emmanuel; Selekon, Benjamin; Gessain, Antoine; Manuguerra, Jean-Claude; Kazanji, Mirdad; Berthet, Nicolas

2017-06-01

Rhabdoviridae is one of the most diversified families of RNA viruses whose members infect a wide range of plants, animals, and arthropods. The members of this family are classified into 13 genera and >150 unassigned viruses. Here, we sequenced the complete genome of a rhabdovirus belonging to the Hart Park serogroup, the Kamese virus (KAMV), isolated in 1977 from Culex pruina in the Central African Republic. The genomic sequence showed an organization typical of rhabdoviruses with additional genes in the P-M and G-L intergenic regions, as already reported for the Hart Park serogroup. Our Kamese strain (ArB9074) had 98% and 78.8% nucleotide sequence similarity with the prototypes of the KAMV and Mossuril virus isolated in Uganda and Mozambique in two different Culex species, respectively. Moreover, the protein sequences had 98-100% amino acid similarity with the prototype of the KAMV, except for an additional gene (U3) that showed a divergence of 6%. These molecular data show that our strain of the KAMV is genetically close to the Culex annuliorus strain that was circulating in Uganda in 1967. However, this study suggests the need to improve our knowledge of the KAMV to better understand its behavior, its life cycle, and its potential reservoirs.
Chloroplast genomes of Arabidopsis halleri ssp. gemmifera and Arabidopsis lyrata ssp. petraea: Structures and comparative analysis.

PubMed

Asaf, Sajjad; Khan, Abdul Latif; Khan, Muhammad Aaqil; Waqas, Muhammad; Kang, Sang-Mo; Yun, Byung-Wook; Lee, In-Jung

2017-08-08

We investigated the complete chloroplast (cp) genomes of non-model Arabidopsis halleri ssp. gemmifera and Arabidopsis lyrata ssp. petraea using Illumina paired-end sequencing to understand their genetic organization and structure. Detailed bioinformatics analysis revealed genome sizes of both subspecies ranging between 154.4~154.5 kbp, with a large single-copy region (84,197~84,158 bp), a small single-copy region (17,738~17,813 bp) and a pair of inverted repeats (IRa/IRb; 26,264~26,259 bp). Both cp genomes encode 130 genes, including 85 protein-coding genes, eight ribosomal RNA genes and 37 transfer RNA genes. Whole cp genome comparison of A. halleri ssp. gemmifera and A. lyrata ssp. petraea, along with ten other Arabidopsis species, showed an overall high degree of sequence similarity, with divergence among some intergenic spacers. The location and distribution of repeat sequences were determined, and sequence divergences of shared genes were calculated among related species. Comparative phylogenetic analysis of the entire genomic data set and 70 shared genes between both cp genomes confirmed the previous phylogeny and generated phylogenetic trees with the same topologies. The sister species of A. halleri ssp. gemmifera is A. umezawana, whereas the closest relative of A. lyrata ssp. petraea is A. arenicola.
Molecular Characterization of the Pericentric Inversion That Causes Differences Between Chimpanzee Chromosome 19 and Human Chromosome 17

PubMed Central

Kehrer-Sawatzki, Hildegard; Schreiner, Bettina; Tänzer, Simone; Platzer, Matthias; Müller, Stefan; Hameister, Horst

2002-01-01

A comparison of the human genome with that of the chimpanzee is an attractive approach to attempts to understand the specificity of a certain phenotype's development. The two karyotypes differ by one chromosome fusion, nine pericentric inversions, and various additions of heterochromatin to chromosomal telomeres. Only the fusion, which gave rise to human chromosome 2, has been characterized at the sequence level. During the present study, we investigated the pericentric inversion by which chimpanzee chromosome 19 differs from human chromosome 17. Fluorescence in situ hybridization was used to identify breakpoint-spanning bacterial artificial chromosomes (BACs) and plasmid artificial chromosomes (PACs). By sequencing the junction fragments, we localized breakpoints in intergenic regions rich in repetitive elements. Our findings suggest that repeat-mediated nonhomologous recombination has facilitated inversion formation. No addition or deletion of any sequence element was detected at the breakpoints or in the surrounding sequences. Next to the break, at a distance of 10.2–39.1 kb, the following genes were found: NGFR and NXPH3 (on human chromosome 17q21.3) and GUC2D and ALOX15B (on human chromosome 17p13). The inversion affects neither the genomic structure nor the gene-activity state with regard to replication timing of these genes. PMID:12094327
Characterization of cis-acting elements required for autorepression of the equine herpesvirus 1 IE gene

PubMed Central

Kim, Seongman; Dai, Gan; O’Callaghan, Dennis J.; Kim, Seong Kee

2012-01-01

The immediate-early protein (IEP), the major regulatory protein encoded by the IE gene of equine herpesvirus 1 (EHV-1), plays a crucial role as both transcription activator and repressor during a productive lytic infection. To investigate the mechanism by which the EHV-1 IEP inhibits its own promoter, IE promoter-luciferase reporter plasmids containing wild-type and mutant IEP-binding site (IEBS) were constructed and used for luciferase reporter assays. The IEP inhibited transcription from its own promoter in the presence of a consensus IEBS (5’-ATCGT-3’) located near the transcription initiation site but did not inhibit when the consensus sequence was deleted. To determine whether the distance between the TATA box and the IEBS affects transcriptional repression, the IEBS was displaced from the original site by the insertion of synthetic DNA sequences. Luciferase reporter assays revealed that the IEP is able to repress its own promoter when the IEBS is located within 26-bp from the TATA box. We also found that the proper orientation and position of the IEBS were required for the repression by the IEP. Interestingly, the level of repression was significantly reduced when a consensus TATA sequence was deleted from the promoter region, indicating that the IEP efficiently inhibits its own promoter in a TATA box-dependent manner. Taken together, these results suggest that the EHV-1 IEP delicately modulates autoregulation of its gene through the consensus IEBS that is near the transcription initiation site and the TATA box. PMID:22265772
Characterization of cis-acting elements required for autorepression of the equine herpesvirus 1 IE gene.

PubMed

Kim, Seongman; Dai, Gan; O'Callaghan, Dennis J; Kim, Seong Kee

2012-04-01

The immediate-early protein (IEP), the major regulatory protein encoded by the IE gene of equine herpesvirus 1 (EHV-1), plays a crucial role as both transcription activator and repressor during a productive lytic infection. To investigate the mechanism by which the EHV-1 IEP inhibits its own promoter, IE promoter-luciferase reporter plasmids containing wild-type and mutant IEP-binding site (IEBS) were constructed and used for luciferase reporter assays. The IEP inhibited transcription from its own promoter in the presence of a consensus IEBS (5'-ATCGT-3') located near the transcription initiation site but did not inhibit when the consensus sequence was deleted. To determine whether the distance between the TATA box and the IEBS affects transcriptional repression, the IEBS was displaced from the original site by the insertion of synthetic DNA sequences. Luciferase reporter assays revealed that the IEP is able to repress its own promoter when the IEBS is located within 26-bp from the TATA box. We also found that the proper orientation and position of the IEBS were required for the repression by the IEP. Interestingly, the level of repression was significantly reduced when a consensus TATA sequence was deleted from the promoter region, indicating that the IEP efficiently inhibits its own promoter in a TATA box-dependent manner. Taken together, these results suggest that the EHV-1 IEP delicately modulates autoregulation of its gene through the consensus IEBS that is near the transcription initiation site and the TATA box.
Virus genome dynamics under different propagation pressures: reconstruction of whole genome haplotypes of West Nile viruses from NGS data.

PubMed

Kortenhoeven, Cornell; Joubert, Fourie; Bastos, Armanda D S; Abolnik, Celia

2015-02-22

Extensive focus is placed on the comparative analyses of consensus genotypes in the study of West Nile virus (WNV) emergence. Few studies account for genetic change in the underlying WNV quasispecies population variants. These variants are not discernable in the consensus genome at the time of emergence, and the maintenance of mutation-selection equilibria of population variants is greatly underestimated. The emergence of lineage 1 WNV strains has been studied extensively, but recent epidemics caused by lineage 2 WNV strains in Hungary, Austria, Greece and Italy emphasize the increasing importance of this lineage to public health. In this study we used Next Generation Sequencing (NGS) data of a historic lineage 2 WNV strain to explore the quasispecies dynamics of minority variants that contribute to cell-tropism and host determination, i.e. the ability to infect different cell types or cells from different species. Minority variants contributing to host cell membrane association persist in the viral population without contributing to the genetic change in the consensus genome. Minority variants are shown to maintain a stable mutation-selection equilibrium under positive selection, particularly in the capsid gene region. This study is the first to infer positive selection and the persistence of WNV haplotype variants that contribute to viral fitness without accompanying genetic change in the consensus genotype, documented solely from NGS sequence data. The approach used in this study streamlines the experimental design seeking viral minority variants accurately from NGS data whilst minimizing the influence of associated sequence error.
The complete mitochondrial genome of Percocypris pingi (Teleostei, Cypriniformes).

PubMed

Li, Yanping; Wang, Jinjin; Peng, Zuogang

2013-02-01

Percocypris pingi is an endemic and economic fish species only found in the upper Yangtze River basin in China. It has become endangered in recent years due to overfishing and/or dam construction. However, the available genetic data are still scarce for this species. Here, we sequenced the complete mitochondrial genome sequence of P. pingi using long polymerase chain reactions. The complete mitogenome sequence has 16,586 bp and contains the usual 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA (tRNA) genes, and 1 control region, the gene composition and order of which are similar to most of other vertebrates. Most mitochondrial genes except ND6 and eight tRNAs are encoded on the heavy strand. The overall base composition of the heavy strand is 30.9% A, 25.7% T, 26.6% C, and 16.8% G with a slight AT bias of 56.6%. There are seven regions of gene overlaps totaling 23 bp and 11 intergenic spacer regions totaling 35 bp. Combined with the COI barcoding region sequences of other 25 cyprinids, the phylogenetic position of P. pingi was estimated using neighbor-joining method. The results showed that P. pingi had a close phylogenetic relationship with the species from genus Schizothorax. This mitogenome sequence data of P. pingi would provide the fundamental genetic data for further conservation genetic studies for this endangered fish species.
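The base-composition figures quoted above (A + T = 30.9% + 25.7% = 56.6%) can be reproduced for any sequence with a few lines of code. The sketch below is illustrative only: the function names and the toy sequence are assumptions, and the actual P. pingi mitogenome is not included here.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Percentage of A, C, G and T among unambiguous bases in seq."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(seq: str) -> float:
    """Combined A + T percentage, the quantity reported as 'AT bias'."""
    composition = base_composition(seq)
    return composition["A"] + composition["T"]


# Toy sequence (hypothetical, 10 bases): 3 A, 3 T, 2 C, 2 G.
toy = "AATTCCGGAT"
print(base_composition(toy))
print(at_content(toy))

# Consistency check on the percentages stated in the abstract.
print(round(30.9 + 25.7, 1))
```

An AT content above 50% indicates an AT bias; ambiguous bases such as N are ignored in this sketch rather than counted toward the total.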
Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

PubMed

Yu, Zhongtang; Yu, Marie; Morrison, Mark

2006-04-01

Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on ≥95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes

PubMed Central

Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu

2009-01-01

Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
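As a rough illustration of how simple sequence repeats (SSRs) such as those counted above can be located, the sketch below scans a sequence with regular expressions. The minimum repeat thresholds (10 for mono-, 5 for di-, 4 for trinucleotide motifs) and all names are assumptions chosen for the example; the abstract does not state the criteria or software the authors used.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem copies.
DEFAULT_MIN_REPEATS = {1: 10, 2: 5, 3: 4}


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, copies) tuples for tandem repeats."""
    thresholds = DEFAULT_MIN_REPEATS if min_repeats is None else min_repeats
    seq = seq.upper()
    hits = []
    for k, n in sorted(thresholds.items()):
        # A k-base motif followed by at least n - 1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if k > 1 and len(set(motif)) == 1:
                continue  # homopolymer run; already covered by the k = 1 scan
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


# Toy sequence (hypothetical): an A10 run and an (AT)5 repeat.
toy = "GGC" + "A" * 10 + "CG" + "AT" * 5 + "CC"
print(find_ssrs(toy))
```

Start positions are 0-based. The homopolymer filter is sufficient only for motif lengths up to three; longer motifs would need a check for any shorter repeating unit.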
Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

PubMed

Oggioni, M R; Claverys, J P

1999-10-01

A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Application of whole genome sequence data in analyzing the molecular epidemiology of Shiga toxin-producing Escherichia coli O157:H7/H.

PubMed

Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi

2018-01-02

Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). 
When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology.
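The Hunter-Gaston discriminatory index mentioned above is the probability that two strains drawn at random without replacement fall into different genotypes: D = 1 - [1 / (N(N - 1))] * sum over genotypes of n_j(n_j - 1), where N is the number of strains and n_j the number of strains of genotype j. A minimal sketch follows; the function name and the toy genotype labels are hypothetical and do not come from the study.

```python
from collections import Counter


def hunter_gaston_index(genotypes):
    """Discriminatory index D for a list of per-strain genotype labels."""
    n_strains = len(genotypes)
    if n_strains < 2:
        raise ValueError("at least two strains are required")
    counts = Counter(genotypes)
    # Number of ordered pairs of strains sharing the same genotype.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_strains * (n_strains - 1))


# Toy data (hypothetical): four strains, two of which share a genotype.
labels = ["g1", "g1", "g2", "g3"]
print(hunter_gaston_index(labels))
```

A value of 1.0 means every strain has a distinct genotype, and 0.0 means all strains are indistinguishable, so a higher index for one SNP set indicates finer subdivision of the same strain collection.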
Characterization of Hepatitis C Virus (HCV) Envelope Diversification from Acute to Chronic Infection within a Sexually Transmitted HCV Cluster by Using Single-Molecule, Real-Time Sequencing

PubMed Central

Ho, Cynthia K. Y.; Raghwani, Jayna; Koekkoek, Sylvie; Liang, Richard H.; Van der Meer, Jan T. M.; Van Der Valk, Marc; De Jong, Menno; Pybus, Oliver G.

2016-01-01

ABSTRACT In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms. IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. 
Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. PMID:28077634
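The idea behind circular consensus sequencing, where repeated passes over the same molecule are combined to suppress random errors, can be sketched as a per-position majority vote. This is a deliberate simplification and not the PacBio algorithm: it assumes the passes are already aligned and of equal length, ignores insertions, deletions and quality scores, and uses hypothetical names. The default of seven passes mirrors the threshold described in the abstract.

```python
from collections import Counter


def circular_consensus(passes, min_passes=7):
    """Majority-vote consensus over pre-aligned passes of one molecule.

    Returns None when fewer than min_passes passes are available.
    Ties are resolved in favour of the base seen first in that column.
    """
    if len(passes) < min_passes:
        return None
    length = len(passes[0])
    if any(len(p) != length for p in passes):
        raise ValueError("passes must be pre-aligned to equal length")
    consensus = []
    for column in zip(*passes):
        base, _count = Counter(column).most_common(1)[0]
        consensus.append(base)
    return "".join(consensus)


# Toy data (hypothetical): seven passes, two of them carrying one error each.
reads = ["ACGT"] * 5 + ["ACCT", "TCGT"]
print(circular_consensus(reads))
```

Because independent errors rarely hit the same position in most passes, the vote recovers the underlying sequence; systematic errors shared across passes would not be corrected this way.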
Cloning, sequencing and characterization of lipase from a polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans

USDA-ARS's Scientific Manuscript database

Lipase gene (lip) of a biodegradable polyhydroxyalkanoate- (PHA-) synthesizing bacterium P. resinovorans NRRL B-2649 was cloned, sequenced and characterized by using consensus primers and PCR-based genome walking method. The ORF of the putative Lip (314 amino acids) and its active site (Ser111, Asp...
A FRET Biosensor for ROCK Based on a Consensus Substrate Sequence Identified by KISS Technology.

PubMed

Li, Chunjie; Imanishi, Ayako; Komatsu, Naoki; Terai, Kenta; Amano, Mutsuki; Kaibuchi, Kozo; Matsuda, Michiyuki

2017-01-11

Genetically-encoded biosensors based on Förster/fluorescence resonance energy transfer (FRET) are versatile tools for studying the spatio-temporal regulation of signaling molecules within not only the cells but also tissues. Perhaps the hardest task in the development of a FRET biosensor for protein kinases is to identify the kinase-specific substrate peptide to be used in the FRET biosensor. To solve this problem, we took advantage of kinase-interacting substrate screening (KISS) technology, which deduces a consensus substrate sequence for the protein kinase of interest. Here, we show that a consensus substrate sequence for ROCK identified by KISS yielded a FRET biosensor for ROCK, named Eevee-ROCK, with high sensitivity and specificity. By treating HeLa cells with inhibitors or siRNAs against ROCK, we show that a substantial part of the basal FRET signal of Eevee-ROCK was derived from the activities of ROCK1 and ROCK2. Eevee-ROCK readily detected ROCK activation by epidermal growth factor, lysophosphatidic acid, and serum. When cells stably-expressing Eevee-ROCK were time-lapse imaged for three days, ROCK activity was found to increase after the completion of cytokinesis, concomitant with the spreading of cells. Eevee-ROCK also revealed a gradual increase in ROCK activity during apoptosis. Thus, Eevee-ROCK, which was developed from a substrate sequence predicted by the KISS technology, will pave the way to a better understanding of the function of ROCK in a physiological context.
ChIP-seq analysis of the σE regulon of Salmonella enterica serovar Typhimurium reveals new genes implicated in heat shock and oxidative stress response

DOE PAGES

Li, Jie; Overall, Christopher C.; Johnson, Rudd C.; ...

2015-09-21

The alternative sigma factor σE functions to maintain bacterial homeostasis and membrane integrity in response to extracytoplasmic stress by regulating thousands of genes both directly and indirectly. The transcriptional regulatory network governed by σE in Salmonella and E. coli has been examined using microarray; however, a genome-wide analysis of σE-binding sites in Salmonella has not yet been reported. We infected macrophages with Salmonella Typhimurium over a select time course. Using chromatin immunoprecipitation followed by high-throughput DNA sequencing (ChIP-seq), 31 σE-binding sites were identified. Seventeen sites were new, which included outer membrane proteins, a quorum-sensing protein, a cell division factor, and a signal transduction modulator. The consensus sequence identified for σE in vivo binding was similar to the one previously reported, except for a conserved G and A between the -35 and -10 regions. One third of the σE-binding sites did not contain the consensus sequence, suggesting there may be alternative mechanisms by which σE modulates transcription. By dissecting direct and indirect modes of σE-mediated regulation, we found that σE activates gene expression through recognition of both canonical and reversed consensus sequence. Lastly, new σE regulated genes (greA, luxS, ompA and ompX) are shown to be involved in heat shock and oxidative stress responses.
ChIP-seq analysis of the σE regulon of Salmonella enterica serovar Typhimurium reveals new genes implicated in heat shock and oxidative stress response

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Jie; Overall, Christopher C.; Johnson, Rudd C.

The alternative sigma factor σE functions to maintain bacterial homeostasis and membrane integrity in response to extracytoplasmic stress by regulating thousands of genes both directly and indirectly. The transcriptional regulatory network governed by σE in Salmonella and E. coli has been examined using microarray; however, a genome-wide analysis of σE-binding sites in Salmonella has not yet been reported. We infected macrophages with Salmonella Typhimurium over a select time course. Using chromatin immunoprecipitation followed by high-throughput DNA sequencing (ChIP-seq), 31 σE-binding sites were identified. Seventeen sites were new, which included outer membrane proteins, a quorum-sensing protein, a cell division factor, and a signal transduction modulator. The consensus sequence identified for σE in vivo binding was similar to the one previously reported, except for a conserved G and A between the -35 and -10 regions. One third of the σE-binding sites did not contain the consensus sequence, suggesting there may be alternative mechanisms by which σE modulates transcription. By dissecting direct and indirect modes of σE-mediated regulation, we found that σE activates gene expression through recognition of both canonical and reversed consensus sequence. Lastly, new σE regulated genes (greA, luxS, ompA and ompX) are shown to be involved in heat shock and oxidative stress responses.

Development of the first consensus genetic map of intermediate wheatgrass (Thinopyrum intermedium) using genotyping-by-sequencing.

PubMed

Kantarski, Traci; Larson, Steve; Zhang, Xiaofei; DeHaan, Lee; Borevitz, Justin; Anderson, James; Poland, Jesse

2017-01-01

Development of the first consensus genetic map of intermediate wheatgrass gives insight into the genome and tools for molecular breeding. Intermediate wheatgrass (Thinopyrum intermedium) has been identified as a candidate for domestication and improvement as a perennial grain, forage, and biofuel crop and is actively being improved by several breeding programs. To accelerate this process using genomics-assisted breeding, efficient genotyping methods and genetic marker reference maps are needed. We present here the first consensus genetic map for intermediate wheatgrass (IWG), which confirms the species' allohexaploid nature (2n = 6x = 42) and homology to Triticeae genomes. Genotyping-by-sequencing was used to identify markers that fit expected segregation ratios and construct genetic maps for 13 heterogeneous parents of seven full-sib families. These maps were then integrated using a linear programming method to produce a consensus map with 21 linkage groups containing 10,029 markers, 3601 of which were present in at least two populations. Each of the 21 linkage groups contained between 237 and 683 markers, cumulatively covering 5061 cM (2891 cM--Kosambi) with an average distance of 0.5 cM between each pair of markers. Through mapping the sequence tags to the diploid (2n = 2x = 14) barley reference genome, we observed high colinearity and synteny between these genomes, with three homoeologous IWG chromosomes corresponding to each of the seven barley chromosomes, and mapped translocations that are known in the Triticeae. The consensus map is a valuable tool for wheat breeders to map important disease-resistance genes within intermediate wheatgrass. These genomic tools can help lead to rapid improvement of IWG and development of high-yielding cultivars of this perennial grain that would facilitate the sustainable intensification of agricultural systems.
Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples.

PubMed

Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J

2017-06-01

Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
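The abstract above mentions a bioinformatics pipeline for generating consensus sequences but does not describe it. As an illustration only, and not the authors' pipeline, a consensus can be sketched as a per-column majority vote over reads already aligned to a common coordinate system. The function name `consensus`, the `min_depth` cutoff, and the use of `-` for uncovered positions are all assumptions made for this sketch.

```python
from collections import Counter


def consensus(aligned_reads, min_depth=2, mask="N"):
    """Majority-vote consensus over equal-length aligned reads.

    '-' marks a position a read does not cover (an assumption of this sketch).
    Positions covered by fewer than `min_depth` reads are masked.
    """
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")

    out = []
    for column in zip(*aligned_reads):
        bases = [b for b in column if b != "-"]
        if len(bases) < min_depth:
            out.append(mask)  # too little coverage to call a base
            continue
        base, _count = Counter(bases).most_common(1)[0]
        out.append(base)
    return "".join(out)


# Hypothetical toy reads, not data from the study.
reads = ["ACGT-", "ACGTA", "ACCTA", "-CGT-"]
print(consensus(reads))               # ACGTA
print(consensus(reads, min_depth=3))  # ACGTN
```

Real pipelines work from base-quality-aware alignments rather than plain strings, so this only shows the voting and depth-masking idea.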
The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans

PubMed Central

Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

2015-01-01

Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. PMID:26199191
GEITLERINEMA SPECIES (OSCILLATORIALES, CYANOBACTERIA) REVEALED BY CELLULAR MORPHOLOGY, ULTRASTRUCTURE, AND DNA SEQUENCING.

PubMed

Do Carmo Bittencourt-Oliveira, Maria; Do Nascimento Moura, Ariadne; De Oliveira, Mariana Cabral; Sidnei Massola, Nelson

2009-06-01

Geitlerinema amphibium (C. Agardh ex Gomont) Anagn. and G. unigranulatum (Rama N. Singh) Komárek et M. T. P. Azevedo are morphologically close species with characteristics frequently overlapping. Ten strains of Geitlerinema (six of G. amphibium and four of G. unigranulatum) were analyzed by DNA sequencing and transmission electron and optical microscopy. Among the investigated strains, the two species were not separated with respect to cellular dimensions, and cellular width was the most variable characteristic. The number and localization of granules, as well as other ultrastructural characteristics, did not provide a means to discriminate between the two species. The two species were not separated either by geography or environment. These results were further corroborated by the analysis of the cpcB-cpcA intergenic spacer (PC-IGS) sequences. Given the fact that morphology is very uniform, plus the coexistence of these populations in the same habitat, it would be nearly impossible to distinguish between them in nature. On the other hand, two of the analyzed strains were distinct from all others based on the PC-IGS sequences, in spite of their morphological similarity. PC-IGS sequences indicate that these two strains could be a different species of Geitlerinema. Using morphology, cell ultrastructure, and PC-IGS sequences, it is not possible to distinguish G. amphibium and G. unigranulatum. Therefore, they should be treated as one species, with G. unigranulatum as a synonym of G. amphibium. © 2009 Phycological Society of America.
Genotype diversity of Escherichia coli isolates in natural waters determined by PFGE and ERIC-PCR.

PubMed

Casarez, Elizabeth A; Pillai, Suresh D; Di Giovanni, George D

2007-08-01

Most library-dependent bacterial source tracking studies using Escherichia coli (E. coli) have focused on strain diversity of isolates obtained from known human and animal faecal sources for library development. In contrast, this study evaluated the genotype variation of E. coli isolated from natural surface water using pulsed field gel electrophoresis (PFGE) and enterobacterial repetitive intergenic consensus sequence polymerase chain reaction (ERIC-PCR) to better understand these naturally occurring populations. A total of 650 water samples were collected over a nine month period from eleven sampling stations from Lake Waco and Belton Lake in Central Texas. Of the 650 water samples collected, 412 were positive for E. coli, yielding a total of 631 E. coli isolates (1-12 isolates collected per sample). PFGE and ERIC-PCR patterns were successfully generated for 555 isolates and were compared using the curve-based Pearson's product-moment correlation coefficient. The 555 E. coli isolates represented 461 PFGE genotypes, with 84% (386/461) of the genotypes being represented by individual isolates. The remaining 75 genotypes were represented by 2-5 isolates each. Using ERIC-PCR, the 555 E. coli isolates represented 175 genotypes, with 63% (109/175) of the genotypes being represented by individual isolates. In contrast to the PFGE results, two ERIC-PCR genotypes represented 37% of the E. coli isolates (83 and 124 isolates, respectively) and were found throughout the watersheds both spatially and temporally. Based on the PFGE genotype diversity of water isolates, there is little evidence that a small number of environmentally-adapted E. coli represent dominant populations in the studied waterbodies. However, with ERIC-PCR, a technique with lower discriminatory power, an opposing conclusion might have been drawn. These results emphasize the importance of considering the resolving power of the source tracking technique being used when assessing strain diversity and geographical stability.
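The curve-based comparison mentioned in the abstract above can be illustrated with a small sketch: each fingerprint lane is treated as a densitometric curve (intensity sampled along the gel), and two lanes are compared with Pearson's product-moment correlation. The curves, the function names, and the 0.9 similarity cutoff below are hypothetical and chosen only for illustration; the abstract does not report the cutoff the authors used.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length curves."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("curves must have the same length (at least 2 points)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a flat curve has no defined correlation")
    return sxy / sqrt(sxx * syy)


def same_genotype(x, y, threshold=0.9):
    """Assign two lanes to one genotype if their curves correlate strongly.

    The threshold is an assumed value for this sketch.
    """
    return pearson(x, y) >= threshold


# Hypothetical densitometric curves (band intensity along the lane).
lane_a = [0, 5, 1, 0, 8, 2, 0]
lane_b = [0, 10, 2, 0, 16, 4, 0]   # same band pattern, twice the intensity
lane_c = [8, 0, 0, 5, 0, 0, 2]     # different band positions

print(same_genotype(lane_a, lane_b))  # True
print(same_genotype(lane_a, lane_c))  # False
```

Because the correlation is insensitive to overall intensity scaling, lanes with the same band positions but different loading still match, which is one reason curve-based comparison is used for fingerprint data.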
Epidemic and virulence characteristic of Shigella spp. with extended-spectrum cephalosporin resistance in Xiaoshan District, Hangzhou, China

PubMed Central

2014-01-01

Background Shigellae have become increasingly resistant to extended-spectrum cephalosporins (ESC) worldwide and pose a great challenge to anti-infection treatment options. The purpose of this study was to determine the resistance, cephalosporin resistance mechanisms, virulence characteristics and genotypes of ESC-resistant Shigella. Methods From 2008 to 2012, Shigella isolates collected from diarrhea patients were tested for antibiotic sensitivity by disk diffusion, screened for cephalosporin resistance determinants and virulence genes by polymerase chain reaction (PCR), and genotyped by enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR). Results A total of 356 Shigella isolates were gathered, and 198 (55.6%, 58 S. flexneri and 140 S. sonnei) were resistant to ESC. All ESC-resistant isolates were susceptible to imipenem, and only 0.5% of isolates were resistant to piperacillin/tazobactam. ESC-resistant S. flexneri showed high degrees of resistance to ampicillin (100%), ampicillin/sulbactam (96.6%), piperacillin (100%), trimethoprim/sulfamethoxazole (74.1%), ciprofloxacin (74.1%), levofloxacin (53.4%), ceftazidime (58.6%) and cefepime (58.6%). ESC-resistant S. sonnei exhibited high resistance rates to ampicillin (100%), piperacillin (100%) and trimethoprim/sulfamethoxazole (96.4%). Cephalosporin resistance genes were confirmed in 184 ESC-resistant isolates. blaCTX-M types (91.8%, mainly blaCTX-M-14, blaCTX-M-15 and blaCTX-M-57) were most prevalent, followed by blaOXA-30 (26.3%). Over 99.0% of ESC-resistant isolates harbored virulence genes ial, ipaH, virA and sen. However, set1 was more prevalent in ESC-resistant S. flexneri isolates than in S. sonnei isolates. ERIC-PCR results showed that 2 and 3 main genotypes were detected in ESC-resistant S. flexneri and S. sonnei, respectively. Conclusion Our findings indicated a high prevalence of ESC-resistant Shigella, mediated mainly by blaCTX-M, with stronger resistance and virulence, and the existence of specific clones responsible for these Shigella infections in the region studied. PMID:24886028
Epidemic and virulence characteristic of Shigella spp. with extended-spectrum cephalosporin resistance in Xiaoshan District, Hangzhou, China.

PubMed

Zhang, Chuan-Ling; Liu, Qing-Zhong; Wang, Juan; Chu, Xu; Shen, Li-Meng; Guo, Yuan-Yu

2014-05-15

Shigellae have become increasingly resistant to extended-spectrum cephalosporins (ESC) worldwide and pose a great challenge to anti-infection treatment options. The purpose of this study was to determine the resistance, cephalosporin resistance mechanisms, virulence characteristics and genotypes of ESC-resistant Shigella. From 2008 to 2012, Shigella isolates collected from diarrhea patients were tested for antibiotic sensitivity by disk diffusion, screened for cephalosporin resistance determinants and virulence genes by polymerase chain reaction (PCR), and genotyped by enterobacterial repetitive intergenic consensus sequence PCR (ERIC-PCR). A total of 356 Shigella isolates were gathered, and 198 (55.6%, 58 S. flexneri and 140 S. sonnei) were resistant to ESC. All ESC-resistant isolates were susceptible to imipenem, and only 0.5% of isolates were resistant to piperacillin/tazobactam. ESC-resistant S. flexneri showed high degrees of resistance to ampicillin (100%), ampicillin/sulbactam (96.6%), piperacillin (100%), trimethoprim/sulfamethoxazole (74.1%), ciprofloxacin (74.1%), levofloxacin (53.4%), ceftazidime (58.6%) and cefepime (58.6%). ESC-resistant S. sonnei exhibited high resistance rates to ampicillin (100%), piperacillin (100%) and trimethoprim/sulfamethoxazole (96.4%). Cephalosporin resistance genes were confirmed in 184 ESC-resistant isolates. bla(CTX-M) types (91.8%, mainly bla(CTX-M-14), bla(CTX-M-15) and bla(CTX-M-57)) were most prevalent, followed by bla(OXA-30) (26.3%). Over 99.0% of ESC-resistant isolates harbored virulence genes ial, ipaH, virA and sen. However, set1 was more prevalent in ESC-resistant S. flexneri isolates than in S. sonnei isolates. ERIC-PCR results showed that 2 and 3 main genotypes were detected in ESC-resistant S. flexneri and S. sonnei, respectively. Our findings indicated a high prevalence of ESC-resistant Shigella, mediated mainly by bla(CTX-M), with stronger resistance and virulence, and the existence of specific clones responsible for these Shigella infections in the region studied.
Milky hemolymph syndrome (MHS) in spiny lobsters, penaeid shrimp and crabs.

PubMed

Nunan, Linda M; Poulos, Bonnie T; Navarro, Solangel; Redman, Rita M; Lightner, Donald V

2010-09-02

Black tiger shrimp Penaeus monodon, European shore crab Carcinus maenas and spiny lobster Panulirus spp. can be affected by milky hemolymph syndrome (MHS). Four rickettsia-like bacteria (RLB) isolates of MHS originating from 5 geographical areas have been identified to date. The histopathology of the disease was characterized and a multiplex PCR assay was developed for detection of the 4 bacterial isolates. The 16S rRNA gene and 16-23S rRNA intergenic spacer region (ISR) were used to examine the phylogeny of the MHS isolates. Although the pathology of this disease appears similar in the various different hosts, sequencing and examination of the phylogenetic relationships reveal 4 distinct RLB involved in the infection process.
Fission yeast retrotransposon Tf1 integration is targeted to 5' ends of open reading frames.

PubMed

Behrens, R; Hayles, J; Nurse, P

2000-12-01

Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100-420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed.
Fission yeast retrotransposon Tf1 integration is targeted to 5′ ends of open reading frames

PubMed Central

Behrens, Ralf; Hayles, Jacky; Nurse, Paul

2000-01-01

Target site selection of transposable elements is usually not random but involves some specificity for a DNA sequence or a DNA binding host factor. We have investigated the target site selection of the long terminal repeat-containing retrotransposon Tf1 from the fission yeast Schizosaccharomyces pombe. By monitoring induced transposition events we found that Tf1 integration sites were distributed throughout the genome. Mapping these insertions revealed that Tf1 did not integrate into open reading frames, but occurred preferentially in longer intergenic regions with integration biased towards a region 100–420 bp upstream of the translation start site. Northern blot analysis showed that transcription of genes adjacent to Tf1 insertions was not significantly changed. PMID:11095681
A conserved mechanism for replication origin recognition and binding in archaea.

PubMed

Majerník, Alan I; Chong, James P J

2008-01-15

To date, methanogens are the only group within the archaea where firing DNA replication origins have not been demonstrated in vivo. In the present study we show that a previously identified cluster of ORB (origin recognition box) sequences do indeed function as an origin of replication in vivo in the archaeon Methanothermobacter thermautotrophicus. Although the consensus sequence of ORBs in M. thermautotrophicus is somewhat conserved when compared with ORB sequences in other archaea, the Cdc6-1 protein from M. thermautotrophicus (termed MthCdc6-1) displays sequence-specific binding that is selective for the MthORB sequence and does not recognize ORBs from other archaeal species. Stabilization of in vitro MthORB DNA binding by MthCdc6-1 requires additional conserved sequences 3' to those originally described for M. thermautotrophicus. By testing synthetic sequences bearing mutations in the MthORB consensus sequence, we show that Cdc6/ORB binding is critically dependent on the presence of an invariant guanine found in all archaeal ORB sequences. Mutation of a universally conserved arginine residue in the recognition helix of the winged helix domain of archaeal Cdc6-1 shows that specific origin sequence recognition is dependent on the interaction of this arginine residue with the invariant guanine. Recognition of a mutated origin sequence can be achieved by mutation of the conserved arginine residue to a lysine or glutamine residue. Thus despite a number of differences in protein and DNA sequences between species, the mechanism of origin recognition and binding appears to be conserved throughout the archaea.
The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister-group to protostomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.

2004-05-01

We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha within the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins give a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.
Identification of Bacillus spp. from Bikalga, fermented seeds of Hibiscus sabdariffa: phenotypic and genotypic characterization.

PubMed

Ouoba, L I I; Parkouda, C; Diawara, B; Scotti, C; Varnam, A H

2008-01-01

To identify Bacillus spp. responsible for the fermentation of Hibiscus sabdariffa for production of Bikalga, an alkaline fermented food used as a condiment in Burkina Faso. Seventy bacteria were isolated from Bikalga produced in different regions of Burkina Faso and identified by phenotyping and genotyping using PCR amplification of the 16S-23S rDNA intergenic transcribed spacer (ITS-PCR), repetitive sequence-based PCR (rep-PCR) and DNA sequencing. The isolates were characterized as motile, rod-shaped, endospore forming, catalase positive, Gram-positive bacteria. ITS-PCR allowed typing mainly at species level. Rep-PCR was more discriminative and allowed typing at ssp. level. DNA sequencing combined with the Blast search program and fermentation profiles using the API 50CHB system allowed identification of the bacteria as Bacillus subtilis, B. licheniformis, B. cereus, B. pumilus, B. badius, Brevibacillus borstelensis, B. sphaericus and B. fusiformis. B. subtilis was the predominant bacterium (42), followed by B. licheniformis (16). Various species and ssp. of Bacillus are involved in fermentation of H. sabdariffa for production of Bikalga. The results can inform the selection of starter cultures of Bacillus for controlled production of Bikalga and the selection of probiotic bacteria.
Genetic analysis of a novel Xylella fastidiosa subspecies found in the southwestern United States.

PubMed

Randall, Jennifer J; Goldberg, Natalie P; Kemp, John D; Radionenko, Maxim; French, Jason M; Olsen, Mary W; Hanson, Stephen F

2009-09-01

Xylella fastidiosa, the causal agent of several scorch diseases, is associated with leaf scorch symptoms in Chitalpa tashkentensis, a common ornamental landscape plant used throughout the southwestern United States. For a number of years, many chitalpa trees in southern New Mexico and Arizona exhibited leaf scorch symptoms, and the results from a regional survey show that chitalpa trees from New Mexico, Arizona, and California are frequently infected with X. fastidiosa. Phylogenetic analysis of multiple loci was used to compare the X. fastidiosa strains infecting chitalpa from New Mexico, Arizona, and trees imported into New Mexico nurseries with previously reported X. fastidiosa strains. Loci analyzed included the 16S ribosome, 16S-23S ribosomal intergenic spacer region, gyrase-B, simple sequence repeat sequences, X. fastidiosa-specific sequences, and the virulence-associated protein (VapD). This analysis indicates that the X. fastidiosa isolates associated with infected chitalpa trees in the Southwest are a highly related group that is distinct from the four previously defined taxa X. fastidiosa subsp. fastidiosa (piercei), X. fastidiosa subsp. multiplex, X. fastidiosa subsp. sandyi, and X. fastidiosa subsp. pauca. Therefore, the classification proposed for this new subspecies is X. fastidiosa subsp. tashke.
[Investigation of algae pollution in Xiliu Lake and identification of toxic cyanobacteria by whole-cell PCR].

PubMed

Ban, Hai-qun; Zhuang, Dong-gang; Zhu, Jing-yuan; Ba, Yue

2006-03-01

To investigate the contamination of Xiliu Lake by floating algae (especially toxic cyanobacteria) and to establish a whole-cell PCR method for identifying the toxic cyanobacteria. The surface water of Xiliu Lake was sampled with a plastic sampler from March 2004, and the number of algae was counted using a blood cell counter. The phycocyanin intergenic spacer region (PC-IGS) and microcystin synthetase gene B (mcyB) were identified by whole-cell PCR in water samples, and the amplified product of mcyB was inserted into a T vector and sequenced. Cyanobacteria, Chlorophyta, Bacillariophyta and Euglenophyta were the main algae, and cyanobacteria were the dominant algae in summer and autumn. From July 7 to September 27, 2004, PC-IGS was detected in 11 samples, and from July 29 to September 27, mcyB was detected in 9 samples. Compared with the reported mcyB of Microcystis aeruginosa in GenBank, the homology of the gene sequence was more than 97%, and the homology of the amino acid sequence was more than 94%. In summer and autumn toxic cyanobacteria could be detected in Xiliu Lake. Toxic cyanobacteria could be identified successfully by whole-cell PCR.
Legionella norrlandica sp. nov., isolated from the biopurification systems of wood processing plants.

PubMed

Rizzardi, Kristina; Winiecka-Krusnell, Jadwiga; Ramliden, Miriam; Alm, Erik; Andersson, Sabina; Byfors, Sara

2015-02-01

Fourteen isolates of an unknown species identified as belonging to the genus Legionella by selective growth on BCYE agar were isolated from the biopurification systems of three different wood processing plants. The mip gene sequence of all 14 isolates was identical and a close match alignment revealed 86 % sequence similarity with Legionella pneumophila serogroup 8. The whole genome of isolate LEGN(T) was sequenced, and a phylogenetic tree based on the alignment of 16S rRNA, mip, rpoB, rnpB and the 23S-5S intergenic region clustered LEGN(T) with L. pneumophila ATCC 33152(T). Analysis of virulence factors showed that strain LEGN(T) carries the majority of known L. pneumophila virulence factors. An amoeba infection assay performed to assess the pathogenicity of strain LEGN(T) towards Acanthamoeba castellanii showed that it can establish a replication vacuole in A. castellanii but does not significantly affect replication of amoebae. Taken together, the results confirm that strain LEGN(T) represents a novel species of the genus Legionella, for which the name Legionella norrlandica sp. nov. is proposed. The type strain is LEGN(T) ( = ATCC BAA-2678(T) = CCUG 65936(T)). © 2015 IUMS.
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience

PubMed Central

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-01-01

Objectives The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. Methods 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T1 weighted sequence. The reference standards were sagittal and axial T1 weighted and T2 weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. Results There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6–1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92–1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T1 weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. Conclusion The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited. PMID:22374284
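The κ values reported in the abstract above measure agreement between examiners beyond what chance would produce. The abstract does not say which kappa statistic was computed; assuming Cohen's kappa for two raters, κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The sketch below uses made-up ratings, not data from the study.

```python
from collections import Counter


def cohen_kappa(rater1, rater2):
    """Cohen's kappa for two raters labelling the same cases."""
    if len(rater1) != len(rater2) or not rater1:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater1)

    # Observed agreement: fraction of cases with identical labels.
    observed = sum(a == b for a, b in zip(rater1, rater2)) / n

    # Chance agreement: product of each rater's marginal label frequencies.
    counts1 = Counter(rater1)
    counts2 = Counter(rater2)
    labels = counts1.keys() | counts2.keys()
    expected = sum(counts1[k] * counts2[k] for k in labels) / (n * n)

    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of anatomy depiction for ten examinations.
examiner_a = ["good"] * 8 + ["poor"] * 2
examiner_b = ["good"] * 7 + ["poor"] * 3

print(round(cohen_kappa(examiner_a, examiner_b), 3))  # 0.737
```

Here the raters agree on 9 of 10 cases (p_o = 0.9), while chance alone would give p_e = 0.62, so κ = 0.28 / 0.38 ≈ 0.737.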
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience.

PubMed

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-08-01

The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T(1) weighted sequence. The reference standards were sagittal and axial T(1) weighted and T(2) weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6-1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92-1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T(1) weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N.

Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ~250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism's 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlap with well-conserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA(Leu), which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism's physiology and defines the essential gene set required for the growth of a photosynthetic organism.
The essential gene set of a photosynthetic organism

DOE PAGES

Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N.; ...

2015-10-27

Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ~250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism's 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlap with well-conserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA(Leu), which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism's physiology and defines the essential gene set required for the growth of a photosynthetic organism.

Molecular Epidemiological Study of Aspergillus fumigatus in a Bone Marrow Transplantation Unit by PCR Amplification of Ribosomal Intergenic Spacer Sequences

PubMed Central

Radford, Sarah A.; Johnson, Elizabeth M.; Leeming, John P.; Millar, Michael R.; Cornish, Jacqueline M.; Foot, Annabel B. M.; Warnock, David W.

1998-01-01

We have developed a PCR-based method for the subspecific discrimination of Aspergillus fumigatus types by using two primers designed to amplify the intergenic spacer regions between ribosomal DNA transcription units. The method permitted the reproducible discrimination of 11 distinct DNA types among a total of 119 isolates of A. fumigatus collected from patients and from the environment of a bone marrow transplantation (BMT) unit over a three-year period. Ten DNA types of A. fumigatus were isolated from patients in the BMT unit; eight of these types were also found in the hospital environment, and six of these were present in the unit itself. Thirteen BMT patients developed infection with one of three DNA types some months after these had first been found in the environment of the unit. In other instances, the same DNA types of A. fumigatus were isolated from BMT patients that were later recovered from the environment of the unit. Several DNA types of A. fumigatus were found in the hospital environment over an 18-month period. Molecular typing of multiple isolates of A. fumigatus, obtained from postmortem tissue samples, showed that one patient was infected with a single DNA type, but two others had up to three different DNA types. Our findings suggest that A. fumigatus infection in BMT recipients may be nosocomial in origin and underline the need for careful environmental monitoring of units in which high-risk patients are housed. PMID:9574694
Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

PubMed

Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

2014-10-01

Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions close to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.
Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

PubMed Central

Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

2014-01-01

Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions close to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785
Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

PubMed

Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

2016-05-11

Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.
Unique haplotypes of cacao trees as revealed by trnH-psbA chloroplast DNA

PubMed Central

Gutiérrez-López, Nidia; Ovando-Medina, Isidro; Salvador-Figueroa, Miguel; Molina-Freaner, Francisco; Avendaño-Arrazate, Carlos H.

2016-01-01

Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (FST = 0) and SAMOVA (FST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding. PMID:27076998
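The haplotype diversity values quoted above (0.66 and 0.80) are a standard summary statistic. As a rough illustration only, the sketch below computes Nei's unbiased haplotype diversity, H = n/(n-1) × (1 - Σ p_i²), from a list of haplotype labels. The abstract does not say which estimator or software the authors used, so the choice of formula is an assumption, and the sample labels are hypothetical rather than data from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels.

    H = n / (n - 1) * (1 - sum(p_i ** 2)), where p_i is the frequency of
    haplotype i among the n sampled individuals.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sampled individuals are required")
    freqs = [count / n for count in Counter(haplotypes).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


# Hypothetical sample: one common haplotype plus two singletons.
sample = ["H1", "H1", "H1", "H1", "H2", "H3"]
h = haplotype_diversity(sample)
print(round(h, 2))
```

A sample dominated by one haplotype gives a value near 0, while a sample in which every individual carries a different haplotype gives 1.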
Investigation of DNA sequence recognition by a streptomycete MarR family transcriptional regulator through surface plasmon resonance and X-ray crystallography

PubMed Central

Stevenson, Clare E. M.; Assaad, Aoun; Chandra, Govind; Le, Tung B. K.; Greive, Sandra J.; Bibb, Mervyn J.; Lawson, David M.

2013-01-01

Consistent with their complex lifestyles and rich secondary metabolite profiles, the genomes of streptomycetes encode a plethora of transcription factors, the vast majority of which are uncharacterized. Herein, we use Surface Plasmon Resonance (SPR) to identify and delineate putative operator sites for SCO3205, a MarR family transcriptional regulator from Streptomyces coelicolor that is well represented in sequenced actinomycete genomes. In particular, we use a novel SPR footprinting approach that exploits indirect ligand capture to vastly extend the lifetime of a standard streptavidin SPR chip. We define two operator sites upstream of sco3205 and a pseudopalindromic consensus sequence derived from these enables further potential operator sites to be identified in the S. coelicolor genome. We evaluate each of these through SPR and test the importance of the conserved bases within the consensus sequence. Informed by these results, we determine the crystal structure of a SCO3205-DNA complex at 2.8 Å resolution, enabling molecular level rationalization of the SPR data. Taken together, our observations support a DNA recognition mechanism involving both direct and indirect sequence readout. PMID:23748564
The Complete Mitochondrial Genome of Gossypium hirsutum and Evolutionary Analysis of Higher Plant Mitochondrial Genomes

PubMed Central

Su, Aiguo; Geng, Jianing; Grover, Corrinne E.; Hu, Songnian; Hua, Jinping

2013-01-01

Background: Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland Cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could aid evolutionary research on plant mt genomes. Methodology/Principal Findings: We combined 454 sequencing with screening of a fosmid library of the Gossypium hirsutum mt genome and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. Conclusion: The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species. PMID:23940520
The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

PubMed

Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

2013-01-01

Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland Cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could aid evolutionary research on plant mt genomes. We combined 454 sequencing with screening of a fosmid library of the Gossypium hirsutum mt genome and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.
The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

PubMed

Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

2013-01-01

Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
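The region lengths reported above can be checked against the stated genome size, since a quadripartite chloroplast genome consists of one LSC, one SSC and two IR copies. The short sketch below does that arithmetic; it assumes the 25,539 bp figure is the length of each IR copy (the abstract does not say so explicitly), and the function name is purely illustrative.

```python
# Region lengths in bp, as reported in the abstract above.
LSC = 82_695  # large single-copy region
SSC = 17_555  # small single-copy region
IR = 25_539   # assumed to be the length of ONE inverted repeat copy


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total genome length for one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total)  # 151328
```

Under that assumption the sum matches the reported 151,328 bp, which supports reading the IR figure as a per-copy length.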
Rapid Multi-Locus Sequence Typing Using Microfluidic Biochips

DTIC Science & Technology

2010-05-12

Sequence Types. The evolutionary history of all the B. cereus MLST concatenated Sequence Types (545 taxa, 2,394 nucleotide positions) was inferred using...the Neighbor-Joining method [28]. The bootstrap consensus tree inferred from 100 replicates was taken to represent the evolutionary history of the... Chlamydia (manuscript in preparation) and performed pilot studies on Staphylococcus aureus and Streptococcus pneumoniae (Data S4 and Text S2). Another potential
GeneSilico protein structure prediction meta-server.

PubMed

Kurowski, Michal A; Bujnicki, Janusz M

2003-07-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta.
GeneSilico protein structure prediction meta-server

PubMed Central

Kurowski, Michal A.; Bujnicki, Janusz M.

2003-01-01

Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta. PMID:12824313
The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

PubMed

Khoe, Clairine V; Chung, Long H; Murray, Vincent

2018-06-01

The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
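The consensus tetranucleotides above use the IUPAC code Y for a pyrimidine (C or T). As an illustration of how such a consensus can be scanned for in a sequence, here is a minimal sketch that expands Y into a regular-expression character class and reports overlapping match positions. The function name, the example sequence and the 0-based coordinate convention are assumptions made for this example; the asterisk that marks the damaged nucleotide in the abstract is omitted from the motif strings.

```python
import re

# Minimal IUPAC table: only the symbols needed for the motifs above.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "Y": "[CT]"}


def find_sites(seq: str, motif: str) -> list[int]:
    """Return 0-based start positions of all (overlapping) motif matches."""
    pattern = "".join(IUPAC[base] for base in motif)
    # A zero-width lookahead lets matches overlap one another.
    return [m.start() for m in re.finditer(f"(?=({pattern}))", seq.upper())]


# Hypothetical sequence joining the two highest-intensity hexanucleotides
# named in the abstract (GTTCCC and TTTTCG) with one spacer base.
example = "GTTCCCATTTTCG"
print(find_sites(example, "YTCY"))  # 6-4PP consensus -> [1]
print(find_sites(example, "YTTC"))  # CPD consensus   -> [8]
```

The lookahead matters for pyrimidine-rich runs, where consecutive candidate sites can share bases and a plain `re.finditer` would skip the overlapping ones.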
Small gene family encoding an eggshell (chorion) protein of the human parasite Schistosoma mansoni

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bobek, L.A.; Rekosh, D.M.; Lo Verde, P.T.

1988-08-01

The authors isolated six independent genomic clones encoding schistosome chorion or eggshell proteins from a Schistosoma mansoni genomic library. A linkage map of five of the clones spanning 35 kilobase pairs (kbp) of the S. mansoni genome was constructed. The region contained two eggshell protein genes closely linked, separated by 7.5 kbp of intergenic DNA. The two genes of the cluster were arranged in the same orientation, that is, they were transcribed from the same strand. The sixth clone probably represents a third copy of the eggshell gene that is not contained within the 35-kbp region. The 5' end of the mRNA transcribed from these genes was defined by primer extension directly off the RNA. The ATCAT cap site sequence was homologous to a silkmoth chorion PuTCATT cap site sequence, where Pu indicates any purine. DNA sequence analysis showed that there were no introns in these genes. The DNA sequences of the three genes were very homologous to each other and to a cDNA clone, pSMf61-46, differing only in three or four nucleotides. A multiple TATA box was located at positions -23 to -31, and a CAAAT sequence was located at -52 upstream of the eggshell transcription unit. Comparison of sequences in regions further upstream with silkmoth and Drosophila sequences revealed very short elements that were shared. One such element, TCACGT, recently shown to be an essential cis-regulatory element for silkmoth chorion gene promoter function, was found at a similar position in all three organisms.
Exome capture from the spruce and pine giga-genomes.

PubMed

Suren, H; Hodgins, K A; Yeaman, S; Nurkowski, K A; Smets, P; Rieseberg, L H; Aitken, S N; Holliday, J A

2016-09-01

Sequence capture is a flexible tool for generating reduced representation libraries, particularly in species with massive genomes. We used an exome capture approach to sequence the gene space of two of the dominant species in Canadian boreal and montane forests - interior spruce (Picea glauca x engelmannii) and lodgepole pine (Pinus contorta). Transcriptome data generated with RNA-seq were coupled with draft genome sequences to design baits corresponding to 26 824 genes from pine and 28 649 genes from spruce. A total of 579 samples for spruce and 631 samples for pine were included, as well as two pine congeners and six spruce congeners. More than 50% of targeted regions were sequenced at >10× depth in each species, while ~12% captured near-target regions within 500 bp of a bait position were sequenced to a depth >10×. Much of our read data arose from off-target regions, which was likely due to the fragmented and incomplete nature of the draft genome assemblies. Capture in general was successful for the related species, suggesting that baits designed for a single species are likely to successfully capture sequences from congeners. From these data, we called approximately 10 million SNPs and INDELs in each species from coding regions, introns, untranslated and flanking regions, as well as from the intergenic space. Our study demonstrates the utility of sequence capture for resequencing in complex conifer genomes, suggests guidelines for improving capture efficiency and provides a rich resource of genetic variants for studies of selection and local adaptation in these species. © 2016 John Wiley & Sons Ltd.
Event-triggered consensus tracking of multi-agent systems with Lur'e nonlinear dynamics

NASA Astrophysics Data System (ADS)

Huang, Na; Duan, Zhisheng; Wen, Guanghui; Zhao, Yu

2016-05-01

In this paper, distributed consensus tracking problem for networked Lur'e systems is investigated based on event-triggered information interactions. An event-triggered control algorithm is designed with the advantages of reducing controller update frequency and sensor energy consumption. By using tools of S-procedure and Lyapunov functional method, some sufficient conditions are derived to guarantee that consensus tracking is achieved under a directed communication topology. Meanwhile, it is shown that Zeno behaviour of triggering time sequences is excluded for the proposed event-triggered rule. Finally, some numerical simulations on coupled Chua's circuits are performed to illustrate the effectiveness of the theoretical algorithms.
An Alternative Time for Telling: When Conceptual Instruction Prior to Problem Solving Improves Mathematical Knowledge

ERIC Educational Resources Information Center

Fyfe, Emily R.; DeCaro, Marci S.; Rittle-Johnson, Bethany

2014-01-01

Background: The sequencing of learning materials greatly influences the knowledge that learners construct. Recently, learning theorists have focused on the sequencing of instruction in relation to solving related problems. The general consensus suggests explicit instruction should be provided; however, when to provide instruction remains unclear.…
Sampled-Data Consensus of Linear Multi-agent Systems With Packet Losses.

PubMed

Zhang, Wenbing; Tang, Yang; Huang, Tingwen; Kurths, Jurgen

In this paper, the consensus problem is studied for a class of multi-agent systems with sampled data and packet losses, where random and deterministic packet losses are considered, respectively. For random packet losses, a Bernoulli-distributed white sequence is used to describe packet dropouts among agents in a stochastic way. For deterministic packet losses, a switched system with stable and unstable subsystems is employed to model packet dropouts in a deterministic way. The purpose of this paper is to derive consensus criteria, such that linear multi-agent systems with sampled-data and packet losses can reach consensus. By means of the Lyapunov function approach and the decomposition method, the design problem of a distributed controller is solved in terms of convex optimization. The interplay among the allowable bound of the sampling interval, the probability of random packet losses, and the rate of deterministic packet losses are explicitly derived to characterize consensus conditions. The obtained criteria are closely related to the maximum eigenvalue of the Laplacian matrix versus the second minimum eigenvalue of the Laplacian matrix, which reveals the intrinsic effect of communication topologies on consensus performance. Finally, simulations are given to show the effectiveness of the proposed results.
Examination of the catalytic fitness of the hammerhead ribozyme by in vitro selection.

PubMed Central

Tang, J; Breaker, R R

1997-01-01

We have designed a self-cleaving ribozyme construct that is rendered inactive during preparative in vitro transcription by allosteric interactions with ATP. This allosteric ribozyme was constructed by joining a hammerhead domain to an ATP-binding RNA aptamer, thereby creating a ribozyme whose catalytic rate can be controlled by ATP. Upon purification by PAGE, the engineered ribozyme undergoes rapid self-cleavage when incubated in the absence of ATP. This strategy of "allosteric delay" was used to prepare intact hammerhead ribozymes that would otherwise self-destruct during transcription. Using a similar strategy, we have prepared a combinatorial pool of RNA in order to assess the catalytic fitness of ribozymes that carry the natural consensus sequence for the hammerhead. Using in vitro selection, this comprehensive RNA pool was screened for sequence variants of the hammerhead ribozyme that also display catalytic activity. We find that sequences that comprise the core of naturally occurring hammerhead dominate the population of selected RNAs, indicating that the natural consensus sequence of this ribozyme is optimal for catalytic function. PMID:9257650
Aberrant signature methylome by DNMT1 hot spot mutation in hereditary sensory and autonomic neuropathy 1E.

PubMed

Sun, Zhifu; Wu, Yanhong; Ordog, Tamas; Baheti, Saurabh; Nie, Jinfu; Duan, Xiaohui; Hojo, Kaori; Kocher, Jean-Pierre; Dyck, Peter J; Klein, Christopher J

2014-08-01

DNA methyltransferase 1 (DNMT1) is essential for DNA methylation, gene regulation and chromatin stability. We previously discovered DNMT1 mutations cause hereditary sensory and autonomic neuropathy type 1 with dementia and hearing loss (HSAN1E; OMIM 614116). HSAN1E is the first adult-onset neurodegenerative disorder caused by a defect in a methyltransferase gene. HSAN1E patients appear clinically normal until young adulthood, then begin developing the characteristic symptoms involving central and peripheral nervous systems. Some HSAN1E patients also develop narcolepsy and it has recently been suggested that HSAN1E is allelic to autosomal dominant cerebellar ataxia, deafness, with narcolepsy (ADCA-DN; OMIM 604121), which is also caused by mutations in DNMT1. A hotspot mutation Y495C within the targeting sequence domain of DNMT1 has been identified among HSAN1E patients. The mutant DNMT1 protein shows premature degradation and reduced DNA methyltransferase activity. Herein, we investigate genome-wide DNA methylation at single-base resolution through whole-genome bisulfite sequencing of germline DNA in 3 pairs of HSAN1E patients and their gender- and age-matched siblings. Over 1 billion 75-bp single-end reads were generated for each sample. In the 3 affected siblings, overall methylation loss was consistently found in all chromosomes with X and 18 being most affected. Paired sample analysis identified 564,218 differentially methylated CpG sites (DMCs; P<0.05), of which 300 134 were intergenic and 264 084 genic CpGs. Hypomethylation was predominant in both genic and intergenic regions, including promoters, exons, most CpG islands, L1, L2, Alu, and satellite repeats and simple repeat sequences. In some CpG islands, hypermethylated CpGs outnumbered hypomethylated CpGs. In 201 imprinted genes, there were more DMCs than in non-imprinted genes and most were hypomethylated. 
Differentially methylated region (DMR) analysis identified 5649 hypomethylated and 1872 hypermethylated regions. Importantly, pathway analysis revealed that the 1693 genes associated with the identified DMRs were highly associated with diverse neurological disorders and that NAD+/NADH metabolism pathways are implicated in the pathogenesis. Our results provide novel insights into the epigenetic mechanism of neurodegeneration arising from a hotspot DNMT1 mutation and reveal pathways potentially important in a broad category of neurological and psychological disorders.

Aberrant signature methylome by DNMT1 hot spot mutation in hereditary sensory and autonomic neuropathy 1E

PubMed Central

Sun, Zhifu; Wu, Yanhong; Ordog, Tamas; Baheti, Saurabh; Nie, Jinfu; Duan, Xiaohui; Hojo, Kaori; Kocher, Jean-Pierre; Dyck, Peter J; Klein, Christopher J

2014-01-01

DNA methyltransferase 1 (DNMT1) is essential for DNA methylation, gene regulation and chromatin stability. We previously discovered DNMT1 mutations cause hereditary sensory and autonomic neuropathy type 1 with dementia and hearing loss (HSAN1E; OMIM 614116). HSAN1E is the first adult-onset neurodegenerative disorder caused by a defect in a methyltransferase gene. HSAN1E patients appear clinically normal until young adulthood, then begin developing the characteristic symptoms involving central and peripheral nervous systems. Some HSAN1E patients also develop narcolepsy and it has recently been suggested that HSAN1E is allelic to autosomal dominant cerebellar ataxia, deafness, with narcolepsy (ADCA-DN; OMIM 604121), which is also caused by mutations in DNMT1. A hotspot mutation Y495C within the targeting sequence domain of DNMT1 has been identified among HSAN1E patients. The mutant DNMT1 protein shows premature degradation and reduced DNA methyltransferase activity. Herein, we investigate genome-wide DNA methylation at single-base resolution through whole-genome bisulfite sequencing of germline DNA in 3 pairs of HSAN1E patients and their gender- and age-matched siblings. Over 1 billion 75-bp single-end reads were generated for each sample. In the 3 affected siblings, overall methylation loss was consistently found in all chromosomes with X and 18 being most affected. Paired sample analysis identified 564,218 differentially methylated CpG sites (DMCs; P < 0.05), of which 300 134 were intergenic and 264 084 genic CpGs. Hypomethylation was predominant in both genic and intergenic regions, including promoters, exons, most CpG islands, L1, L2, Alu, and satellite repeats and simple repeat sequences. In some CpG islands, hypermethylated CpGs outnumbered hypomethylated CpGs. In 201 imprinted genes, there were more DMCs than in non-imprinted genes and most were hypomethylated. 
Differentially methylated region (DMR) analysis identified 5649 hypomethylated and 1872 hypermethylated regions. Importantly, pathway analysis revealed that the 1693 genes associated with the identified DMRs were highly associated with diverse neurological disorders and that NAD+/NADH metabolism pathways are implicated in the pathogenesis. Our results provide novel insights into the epigenetic mechanism of neurodegeneration arising from a hotspot DNMT1 mutation and reveal pathways potentially important in a broad category of neurological and psychological disorders. PMID:25033457
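The paired comparison of CpG methylation described in this abstract can be illustrated with a minimal sketch. It assumes that per-CpG methylation is summarised as the fraction of bisulfite reads supporting methylation and that each patient is compared with a matched sibling; the function names and the example values are hypothetical, and the statistical test behind the reported P < 0.05 threshold is not stated in the abstract, so none is implemented here.

```python
def methylation_level(methylated_reads, total_reads):
    """Fraction of reads at one CpG site that support methylation."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return methylated_reads / total_reads


def mean_paired_difference(patient_levels, sibling_levels):
    """Mean of (patient - sibling) methylation differences at one CpG site.

    A negative value indicates lower methylation in patients
    (hypomethylation); a positive value indicates hypermethylation.
    """
    if len(patient_levels) != len(sibling_levels) or not patient_levels:
        raise ValueError("need the same non-zero number of patients and siblings")
    diffs = [p - s for p, s in zip(patient_levels, sibling_levels)]
    return sum(diffs) / len(diffs)


# Hypothetical values for three patient/sibling pairs at a single CpG site.
level = methylation_level(15, 20)
delta = mean_paired_difference([0.5, 0.25, 0.75], [0.75, 0.5, 1.0])
```

With only three pairs, a sign-and-magnitude summary like this is descriptive; calling a site differentially methylated would additionally need a significance test suited to paired data.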
X-Prolyl Dipeptidyl Aminopeptidase Gene (pepX) Is Part of the glnRA Operon in Lactobacillus rhamnosus

PubMed Central

Varmanen, Pekka; Savijoki, Kirsi; Åvall, Silja; Palva, Airi; Tynkkynen, Soile

2000-01-01

A peptidase gene expressing X-prolyl dipeptidyl aminopeptidase (PepX) activity was cloned from Lactobacillus rhamnosus 1/6 by using the chromogenic substrate l-glycyl-l-prolyl-β-naphthylamide for screening of a genomic library in Escherichia coli. The nucleotide sequence of a 3.5-kb HindIII fragment expressing the peptidase activity revealed one complete open reading frame (ORF) of 2,391 nucleotides. The 797-amino-acid protein encoded by this ORF was shown to be 40, 39, and 36% identical with PepXs from Lactobacillus helveticus, Lactobacillus delbrueckii, and Lactococcus lactis, respectively. By Northern analysis with a pepX-specific probe, transcripts of 4.5 and 7.0 kb were detected, indicating that pepX is part of a polycistronic operon in L. rhamnosus. Cloning and sequencing of the upstream region of pepX revealed the presence of two ORFs of 360 and 1,338 bp that were shown to be able to encode proteins with high homology to GlnR and GlnA proteins, respectively. By multiple primer extension analyses, the only functional promoter in the pepX region was located 25 nucleotides upstream of glnR. Northern analysis with glnA- and pepX-specific probes indicated that transcription from glnR promoter results in a 2.0-kb dicistronic glnR-glnA transcript and also in a longer read-through polycistronic transcript of 7.0 kb that was detected with both probes in samples from cells in exponential growth phase. The glnA gene was disrupted by a single-crossover recombinant event using a nonreplicative plasmid carrying an internal part of glnA. In the disruption mutant, glnRA-specific transcription was derepressed 10-fold compared to the wild type, but the 7.0-kb transcript was no longer detectable with either the glnA- or pepX-specific probe, demonstrating that pepX is indeed part of glnRA operon in L. rhamnosus. Reverse transcription-PCR analysis further supported this operon structure. 
An extended stem-loop structure was identified immediately upstream of pepX in the glnA-pepX intergenic region, a sequence that showed homology to a 23S-5S intergenic spacer and to several other L. rhamnosus-related entries in data banks. PMID:10613874
X-prolyl dipeptidyl aminopeptidase gene (pepX) is part of the glnRA operon in Lactobacillus rhamnosus.

PubMed

Varmanen, P; Savijoki, K; Avall, S; Palva, A; Tynkkynen, S

2000-01-01

A peptidase gene expressing X-prolyl dipeptidyl aminopeptidase (PepX) activity was cloned from Lactobacillus rhamnosus 1/6 by using the chromogenic substrate L-glycyl-L-prolyl-beta-naphthylamide for screening of a genomic library in Escherichia coli. The nucleotide sequence of a 3.5-kb HindIII fragment expressing the peptidase activity revealed one complete open reading frame (ORF) of 2,391 nucleotides. The 797-amino-acid protein encoded by this ORF was shown to be 40, 39, and 36% identical with PepXs from Lactobacillus helveticus, Lactobacillus delbrueckii, and Lactococcus lactis, respectively. By Northern analysis with a pepX-specific probe, transcripts of 4.5 and 7.0 kb were detected, indicating that pepX is part of a polycistronic operon in L. rhamnosus. Cloning and sequencing of the upstream region of pepX revealed the presence of two ORFs of 360 and 1,338 bp that were shown to be able to encode proteins with high homology to GlnR and GlnA proteins, respectively. By multiple primer extension analyses, the only functional promoter in the pepX region was located 25 nucleotides upstream of glnR. Northern analysis with glnA- and pepX-specific probes indicated that transcription from glnR promoter results in a 2.0-kb dicistronic glnR-glnA transcript and also in a longer read-through polycistronic transcript of 7.0 kb that was detected with both probes in samples from cells in exponential growth phase. The glnA gene was disrupted by a single-crossover recombinant event using a nonreplicative plasmid carrying an internal part of glnA. In the disruption mutant, glnRA-specific transcription was derepressed 10-fold compared to the wild type, but the 7.0-kb transcript was no longer detectable with either the glnA- or pepX-specific probe, demonstrating that pepX is indeed part of glnRA operon in L. rhamnosus. Reverse transcription-PCR analysis further supported this operon structure. 
An extended stem-loop structure was identified immediately upstream of pepX in the glnA-pepX intergenic region, a sequence that showed homology to a 23S-5S intergenic spacer and to several other L. rhamnosus-related entries in data banks.
The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive changes during the evolution of the Zygnematales

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2005-01-01

Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. 
The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
The compact genome of the plant pathogen Plasmodiophora brassicae is adapted to intracellular interactions with host Brassica spp.

PubMed

Rolfe, Stephen A; Strelkov, Stephen E; Links, Matthew G; Clarke, Wayne E; Robinson, Stephen J; Djavaheri, Mohammad; Malinowski, Robert; Haddadi, Parham; Kagale, Sateesh; Parkin, Isobel A P; Taheri, Ali; Borhan, M Hossein

2016-03-31

The protist Plasmodiophora brassicae is a soil-borne pathogen of cruciferous species and the causal agent of clubroot disease of Brassicas including agriculturally important crops such as canola/rapeseed (Brassica napus). P. brassicae has remained an enigmatic plant pathogen and is a rare example of an obligate biotroph that resides entirely inside the host plant cell. The pathogen is the cause of severe yield losses and can render infested fields unsuitable for Brassica crop growth due to the persistence of resting spores in the soil for up to 20 years. To provide insight into the biology of the pathogen and its interaction with its primary host B. napus, we produced a draft genome of P. brassicae pathotypes 3 and 6 (Pb3 and Pb6) that differ in their host range. Pb3 is highly virulent on B. napus (but also infects other Brassica species) while Pb6 infects only vegetable Brassica crops. Both the Pb3 and Pb6 genomes are highly compact, each with a total size of 24.2 Mb, and contain less than 2% repetitive DNA. Clustering of genome-wide single nucleotide polymorphisms (SNPs) of Pb3, Pb6 and three additional re-sequenced pathotypes (Pb2, Pb5 and Pb8) shows a high degree of correlation of cluster grouping with host range. The Pb3 genome features significant reduction of intergenic space with multiple examples of overlapping untranslated regions (UTRs). Dependency on the host for essential nutrients is evident from the loss of genes for the biosynthesis of thiamine and some amino acids and the presence of a wide range of transport proteins, including some unique to P. brassicae. The annotated genes of Pb3 include those with a potential role in the regulation of the plant growth hormones cytokinin and auxin. The expression profile of Pb3 genes, including putative effectors, during infection and their potential role in manipulation of host defence is discussed. The P. brassicae genome sequence reveals a compact genome, a dependency of the pathogen on its host for some essential nutrients and a potential role in the regulation of host plant cytokinin and auxin. Genome annotation supported by RNA sequencing reveals significant reduction in intergenic space which, in addition to low repeat content, has likely contributed to the P. brassicae compact genome.
Complete sequence and analysis of the mitochondrial genome of Hemiselmis andersenii CCMP644 (Cryptophyceae).

PubMed

Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M

2008-05-12

Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes: a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of an approximately 20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22 and 336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. 
Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol.
Complete Sequence and Analysis of the Mitochondrial Genome of Hemiselmis andersenii CCMP644 (Cryptophyceae)

PubMed Central

Kim, Eunsoo; Lane, Christopher E; Curtis, Bruce A; Kozera, Catherine; Bowman, Sharen; Archibald, John M

2008-01-01

Background Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes–a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. Results The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22–336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages. 
Conclusion Comparison of the H. andersenii and R. salina mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike R. salina, the H. andersenii mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol. PMID:18474103
A core phylogeny of Dictyostelia inferred from genomes representative of the eight major and minor taxonomic divisions of the group.

PubMed

Singh, Reema; Schilde, Christina; Schaap, Pauline

2016-11-17

Dictyostelia are a well-studied group of organisms with colonial multicellularity, which are members of the mostly unicellular Amoebozoa. A phylogeny based on SSU rDNA data subdivided all Dictyostelia into four major groups, but left the position of the root and of six group-intermediate taxa unresolved. Recent phylogenies inferred from 30 or 213 proteins from sequenced genomes, positioned the root between two branches, each containing two major groups, but lacked data to position the group-intermediate taxa. Since the positions of these early diverging taxa are crucial for understanding the evolution of phenotypic complexity in Dictyostelia, we sequenced six representative genomes of early diverging taxa. We retrieved orthologs of 47 housekeeping proteins with an average size of 890 amino acids from six newly sequenced and eight published genomes of Dictyostelia and unicellular Amoebozoa and inferred phylogenies from single and concatenated protein sequence alignments. Concatenated alignments of all 47 proteins, and four out of five subsets of nine concatenated proteins all produced the same consensus phylogeny with 100% statistical support. Trees inferred from just two out of the 47 proteins, individually reproduced the consensus phylogeny, highlighting that single gene phylogenies will rarely reflect correct species relationships. However, sets of two or three concatenated proteins again reproduced the consensus phylogeny, indicating that a small selection of genes suffices for low cost classification of as yet unincorporated or newly discovered dictyostelid and amoebozoan taxa by gene amplification. The multi-locus consensus phylogeny shows that groups 1 and 2 are sister clades in branch I, with the group-intermediate taxon D. polycarpum positioned as outgroup to group 2. 
Branch II consists of groups 3 and 4, with the group-intermediate taxon Polysphondylium violaceum positioned as sister to group 4, and the group-intermediate taxon Dictyostelium polycephalum branching at the base of that whole clade. Given the data, the approximately unbiased test rejects all alternative topologies favoured by SSU rDNA and individual proteins with high statistical support. The test also rejects monophyletic origins for the genera Acytostelium, Polysphondylium and Dictyostelium. The current position of Acytostelium ellipticum in the consensus phylogeny indicates that somatic cells were lost twice in Dictyostelia.
Evaluation of Phage Display Discovered Peptides as Ligands for Prostate-Specific Membrane Antigen (PSMA)

PubMed Central

Edwards, W. Barry

2013-01-01

The aim of this study was to identify potential ligands of PSMA suitable for further development as novel PSMA-targeted peptides using phage display technology. The human PSMA protein was immobilized as a target followed by incubation with a 15-mer phage display random peptide library. After one round of prescreening and two rounds of screening, high-stringency screening at the third round of panning was performed to identify the highest affinity binders. Phages which had a specific binding activity to PSMA in human prostate cancer cells were isolated and the DNA corresponding to the 15-mers was sequenced to provide three consensus sequences: GDHSPFT, SHFSVGS and EVPRLSLLAVFL as well as other sequences that did not display consensus. Two of the peptide sequences deduced from DNA sequencing of binding phages, SHSFSVGSGDHSPFT and GRFLTGGTGRLLRIS, were labeled with 5-carboxyfluorescein and shown to bind and co-internalize with PSMA on human prostate cancer cells by fluorescence microscopy. The high stringency requirements yielded peptides with affinities KD∼1 µM or greater which are suitable starting points for affinity maturation. While these values were less than anticipated, the high stringency did yield peptide sequences that apparently bound to different surfaces on PSMA. These peptide sequences could be the basis for further development of peptides for prostate cancer tumor imaging and therapy. PMID:23935860
Epstein-Barr virus latent gene sequences as geographical markers of viral origin: unique EBNA3 gene signatures identify Japanese viruses as distinct members of the Asian virus family.

PubMed

Sawada, Akihisa; Croom-Carter, Deborah; Kondo, Osamu; Yasui, Masahiro; Koyama-Sato, Maho; Inoue, Masami; Kawa, Keisei; Rickinson, Alan B; Tierney, Rosemary J

2011-05-01

Polymorphisms in Epstein-Barr virus (EBV) latent genes can identify virus strains from different human populations and individual strains within a population. An Asian EBV signature has been defined almost exclusively from Chinese viruses, with little information from other Asian countries. Here we sequenced polymorphic regions of the EBNA1, 2, 3A, 3B, 3C and LMP1 genes of 31 Japanese strains from control donors and EBV-associated T/NK-cell lymphoproliferative disease (T/NK-LPD) patients. Though identical to Chinese strains in their dominant EBNA1 and LMP1 alleles, Japanese viruses were subtly different at other loci. Thus, while Chinese viruses mainly fall into two families with strongly linked 'Wu' or 'Li' alleles at EBNA2 and EBNA3A/B/C, Japanese viruses all have the consensus Wu EBNA2 allele but fall into two families at EBNA3A/B/C. One family has variant Li-like sequences at EBNA3A and 3B and the consensus Li sequence at EBNA3C; the other family has variant Wu-like sequences at EBNA3A, variants of a low frequency Chinese allele 'Sp' at EBNA3B and a consensus Sp sequence at EBNA3C. Thus, EBNA3A/B/C allelotypes clearly distinguish Japanese from Chinese strains. Interestingly, most Japanese viruses also lack those immune-escape mutations in the HLA-A11 epitope-encoding region of EBNA3B that are so characteristic of viruses from the highly A11-positive Chinese population. Control donor-derived and T/NK-LPD-derived strains were similarly distributed across allelotypes and, by using allelic polymorphisms to track virus strains in patients pre- and post-haematopoietic stem-cell transplant, we show that a single strain can induce both T/NK-LPD and B-cell-lymphoproliferative disease in the same patient.
A Consensus Genetic Map for Pinus taeda and Pinus elliottii and Extent of Linkage Disequilibrium in Two Genotype-Phenotype Discovery Populations of Pinus taeda

PubMed Central

Westbrook, Jared W.; Chhatre, Vikram E.; Wu, Le-Shin; Chamala, Srikar; Neves, Leandro Gomide; Muñoz, Patricio; Martínez-García, Pedro J.; Neale, David B.; Kirst, Matias; Mockaitis, Keithanne; Nelson, C. Dana; Peter, Gary F.; Echt, Craig S.

2015-01-01

A consensus genetic map for Pinus taeda (loblolly pine) and Pinus elliottii (slash pine) was constructed by merging three previously published P. taeda maps with a map from a pseudo-backcross between P. elliottii and P. taeda. The consensus map positioned 3856 markers via genotyping of 1251 individuals from four pedigrees. It is the densest linkage map for a conifer to date. Average marker spacing was 0.6 cM and total map length was 2305 cM. Functional predictions of mapped genes were improved by aligning expressed sequence tags used for marker discovery to full-length P. taeda transcripts. Alignments to the P. taeda genome mapped 3305 scaffold sequences onto 12 linkage groups (LGs). The consensus genetic map was used to compare the genome-wide linkage disequilibrium (LD) in a population of distantly related P. taeda individuals (ADEPT2) used for association genetic studies and a multiple-family pedigree used for genomic selection (CCLONES). The prevalence and extent of LD were greater in CCLONES as compared to ADEPT2; however, extended LD within LGs or between LGs was rare in both populations. The average squared correlations, r², between SNP alleles less than 1 cM apart were less than 0.05 in both populations and r² did not decay substantially with genetic distance. The consensus map and analysis of linkage disequilibrium establish a foundation for comparative association mapping and genomic selection in P. taeda and P. elliottii. PMID:26068575
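The squared correlation between SNP alleles used above as the LD measure can be sketched as follows. This is a minimal illustration, assuming genotypes are coded as allele counts (0, 1 or 2 copies of one allele) per individual; the function name and example genotypes are hypothetical, and the estimator actually used in the study may differ (for example, by correcting for relatedness or population structure).

```python
def ld_r2(snp_a, snp_b):
    """Squared Pearson correlation between two SNPs.

    snp_a and snp_b are equal-length lists of allele counts (0, 1 or 2),
    one entry per individual.
    """
    n = len(snp_a)
    if n != len(snp_b) or n < 2:
        raise ValueError("need two equal-length vectors with at least 2 individuals")
    mean_a = sum(snp_a) / n
    mean_b = sum(snp_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(snp_a, snp_b))
    var_a = sum((a - mean_a) ** 2 for a in snp_a)
    var_b = sum((b - mean_b) ** 2 for b in snp_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("r2 is undefined for a monomorphic SNP")
    return (cov * cov) / (var_a * var_b)


# Hypothetical genotypes for four individuals.
complete_ld = ld_r2([0, 1, 2, 1], [0, 1, 2, 1])   # identical SNPs
no_ld = ld_r2([0, 0, 1, 1], [0, 1, 0, 1])         # uncorrelated SNPs
```

Averaging such values over all SNP pairs less than 1 cM apart on the consensus map would give a summary comparable in spirit to the one reported.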
Draft Genome Sequences of Two Novel Salmonella enterica subsp. enterica Strains Isolated from Low-Moisture Foods with Applications in Food Safety Research.

PubMed

Radford, Devon R; Leon-Velarde, Carlos G; Chen, Shu; Hamidi Oskouei, Amir M; Balamurugan, Sampathkumar

2018-03-29

The genomes of two strains of Salmonella enterica subsp. enterica serovar Cubana and serovar Muenchen, isolated from dry hazelnuts and chia seeds, respectively, were sequenced using the Illumina MiSeq platform, assembled de novo using the overlap-layout-consensus method, and aligned to their respective most identical sequence genome scaffolds using MUMMER and BLAST searches. Copyright © 2018 Radford et al.
From cheek swabs to consensus sequences: an A to Z protocol for high-throughput DNA sequencing of complete human mitochondrial genomes

PubMed Central

2014-01-01

Background Next-generation DNA sequencing (NGS) technologies have made huge impacts in many fields of biological research, but especially in evolutionary biology. One area where NGS has shown potential is for high-throughput sequencing of complete mtDNA genomes (of humans and other animals). Despite the increasing use of NGS technologies and a better appreciation of their importance in answering biological questions, there remain significant obstacles to the successful implementation of NGS-based projects, especially for new users. Results Here we present an ‘A to Z’ protocol for obtaining complete human mitochondrial (mtDNA) genomes – from DNA extraction to consensus sequence. Although designed for use on humans, this protocol could also be used to sequence small, organellar genomes from other species, and also nuclear loci. This protocol includes DNA extraction, PCR amplification, fragmentation of PCR products, barcoding of fragments, sequencing using the 454 GS FLX platform, and a complete bioinformatics pipeline (primer removal, reference-based mapping, output of coverage plots and SNP calling). Conclusions All steps in this protocol are designed to be straightforward to implement, especially for researchers who are undertaking next-generation sequencing for the first time. The molecular steps are scalable to large numbers (hundreds) of individuals and all steps post-DNA extraction can be carried out in 96-well plate format. Also, the protocol has been assembled so that individual ‘modules’ can be swapped out to suit available resources. PMID:24460871
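The final bioinformatics steps of such a protocol (reference-based mapping, coverage output and consensus calling) can be illustrated with a minimal sketch. It assumes reads have already been mapped without gaps and are given as (0-based start position, sequence) pairs; the function name, the `min_coverage` parameter and the example reads are hypothetical and are not taken from the published pipeline.

```python
from collections import Counter


def call_consensus(ref_length, mapped_reads, min_coverage=2):
    """Majority-vote consensus and per-position coverage.

    mapped_reads is an iterable of (start, sequence) pairs, where start is
    the 0-based reference position of the first base of an ungapped read.
    Positions covered by fewer than min_coverage reads are reported as 'N'.
    """
    counts = [Counter() for _ in range(ref_length)]
    for start, seq in mapped_reads:
        for offset, base in enumerate(seq):
            pos = start + offset
            if 0 <= pos < ref_length:
                counts[pos][base] += 1

    coverage = [sum(c.values()) for c in counts]
    consensus = []
    for c, depth in zip(counts, coverage):
        if depth < min_coverage:
            consensus.append("N")
        else:
            # most_common(1) returns the base with the highest count.
            consensus.append(c.most_common(1)[0][0])
    return "".join(consensus), coverage


# Hypothetical reads mapped to a 6-bp reference.
reads = [(0, "ACGT"), (0, "ACGA"), (2, "GTTA"), (1, "CGT")]
consensus, coverage = call_consensus(6, reads)
```

The coverage list is the raw material for a coverage plot, and positions where the consensus differs from the reference are candidate SNPs.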
Bioinformatics prediction of siRNAs as potential antiviral agents against dengue viruses

PubMed Central

Villegas-Rosales, Paula M; Méndez-Tenorio, Alfonso; Ortega-Soto, Elizabeth; Barrón, Blanca L

2012-01-01

Dengue virus (DENV 1-4) represents the major emerging arthropod-borne viral infection in the world. Currently, there is neither an available vaccine nor a specific treatment. Hence, there is a need for antiviral drugs against these viral infections; we describe the prediction of short interfering RNAs (siRNAs) as potential therapeutic agents against the four DENV serotypes. Our strategy was to carry out a series of multiple alignments using the ClustalX program to find conserved sequences among the four DENV serotype genomes and obtain a consensus sequence for siRNA design. A highly conserved sequence among the four DENV serotypes, located in the sequence encoding the NS4B and NS5 proteins, was found. A total of 2,893 complete DENV genomes were downloaded from the NCBI, and after a curation step to identify identical sequences, 220 complete DENV genomes were left. They were edited to select the NS4B and NS5 sequences, which were aligned to obtain a consensus sequence. Three different servers were used for siRNA design, and the resulting siRNAs were aligned to identify the most prevalent sequences. Three siRNAs were chosen: one targeted the genome region that codes for the NS4B protein, and the other two targeted the region that codes for the NS5 protein. The predicted secondary structure of DENV genomes was used to demonstrate that the siRNAs were able to target the viral genome, forming the double-stranded structures necessary to activate the RNA silencing machinery. PMID:22829722
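The search for conserved stretches in a multiple alignment, which underlies the siRNA target selection described above, can be sketched as follows. This is an illustration only: it assumes the alignment is already available as equal-length strings with '-' for gaps, it requires perfect identity in every column, and the default window of 21 nt is a typical siRNA length chosen here as an assumption. The study itself used ClustalX and three siRNA design servers, not this code.

```python
def conserved_windows(aligned_seqs, window=21):
    """Return (start, sequence) for every gap-free window of the given
    length in which all aligned sequences are identical."""
    if not aligned_seqs:
        raise ValueError("need at least one aligned sequence")
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("aligned sequences must have equal length")

    first = aligned_seqs[0]
    hits = []
    run = 0  # number of consecutive fully conserved columns ending here
    for i in range(length):
        column = {s[i] for s in aligned_seqs}
        run = run + 1 if (len(column) == 1 and first[i] != "-") else 0
        if run >= window:
            start = i - window + 1
            hits.append((start, first[start:i + 1]))
    return hits


# Hypothetical toy alignment; column 4 differs in the second sequence.
alignment = ["ACGTACGTAA", "ACGTTCGTAA", "ACGTACGTAA"]
targets = conserved_windows(alignment, window=4)
```

Candidate windows found this way would still need the usual siRNA design filters and the secondary-structure check mentioned in the abstract.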
Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa

PubMed Central

Morin, Ryan D.; Aksay, Gozde; Dolgosheina, Elena; Ebhardt, H. Alexander; Magrini, Vincent; Mardis, Elaine R.; Sahinalp, S. Cenk; Unrau, Peter J.

2008-01-01

The diversity of microRNAs and small-interfering RNAs has been extensively explored within angiosperms by focusing on a few key organisms such as Oryza sativa and Arabidopsis thaliana. A deeper division of the plants is defined by the radiation of the angiosperms and gymnosperms, with the latter comprising the commercially important conifers. The conifers are expected to provide important information regarding the evolution of highly conserved small regulatory RNAs. Deep sequencing provides the means to characterize and quantitatively profile small RNAs in understudied organisms such as these. Pyrosequencing of small RNAs from O. sativa revealed, as expected, ∼21- and ∼24-nt RNAs. The former contained known microRNAs, and the latter largely comprised intergenic-derived sequences likely representing heterochromatin siRNAs. In contrast, sequences from Pinus contorta were dominated by 21-nt small RNAs. Using a novel sequence-based clustering algorithm, we identified sequences belonging to 18 highly conserved microRNA families in P. contorta as well as numerous clusters of conserved small RNAs of unknown function. Using multiple methods, including expressed sequence folding and machine learning algorithms, we found a further 53 candidate novel microRNA families, 51 appearing specific to the P. contorta library. In addition, alignment of small RNA sequences to the O. sativa genome revealed six perfectly conserved classes of small RNA that included chloroplast transcripts and specific types of genomic repeats. The conservation of microRNAs and other small RNAs between the conifers and the angiosperms indicates that important RNA silencing processes were highly developed in the earliest spermatophytes. Genomic mapping of all sequences to the O. sativa genome can be viewed at http://microrna.bcgsc.ca/cgi-bin/gbrowse/rice_build_3/. PMID:18323537
Investigating intra-host and intra-herd sequence diversity of foot-and-mouth disease virus.

PubMed

King, David J; Freimanis, Graham L; Orton, Richard J; Waters, Ryan A; Haydon, Daniel T; King, Donald P

2016-10-01

Due to the poor fidelity of the enzymes involved in RNA genome replication, foot-and-mouth disease (FMD) virus samples comprise unique polymorphic populations. In this study, deep sequencing was utilised to characterise the diversity of FMD virus (FMDV) populations in 6 infected cattle present on a single farm during the series of outbreaks in the UK in 2007. A novel RT-PCR method was developed to amplify a 7.6 kb nucleotide fragment encompassing the polyprotein coding region of the FMDV genome. Illumina sequencing of each sample identified the fine polymorphic structures at each nucleotide position, from consensus level changes to variants present at a 0.24% frequency. These data were used to investigate population dynamics of FMDV at both herd and host levels, to evaluate the impact of host on the viral swarm structure and to identify transmission links with viruses recovered from other farms in the same series of outbreaks. In 7 samples, from 6 different animals, a total of 5 consensus level variants were identified, in addition to 104 sub-consensus variants of which 22 were shared between 2 or more animals. Further analysis revealed differences in swarm structures from samples derived from the same animal, suggesting the presence of distinct viral populations evolving independently at different lesion sites within the same infected animal. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Spatial Sequences, but Not Verbal Sequences, Are Vulnerable to General Interference during Retention in Working Memory

ERIC Educational Resources Information Center

Morey, Candice C.; Miron, Monica D.

2016-01-01

Among models of working memory, there is not yet a consensus about how to describe functions specific to storing verbal or visual-spatial memories. We presented aural-verbal and visual-spatial lists simultaneously and sometimes cued one type of information after presentation, comparing accuracy in conditions with and without informative…
Homologues of insulinase, a new superfamily of metalloendopeptidases.

PubMed Central

Rawlings, N D; Barrett, A J

1991-01-01

On the basis of a statistical analysis of an alignment of the amino acid sequences, a new superfamily of metalloendopeptidases is proposed, consisting of human insulinase, Escherichia coli protease III and mitochondrial processing endopeptidases from Saccharomyces and Neurospora. These enzymes do not contain the 'HEXXH' consensus sequence found in all previously recognized zinc metalloendopeptidases. PMID:2025223
Complete genome sequence and integrated protein localization and interaction map for alfalfa dwarf virus, which combines properties of both cytoplasmic and nuclear plant rhabdoviruses

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bejerman, Nicolás, E-mail: n.bejerman@uq.edu.au; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072; Giolitti, Fabián

Summary: We have determined the full-length 14,491-nucleotide genome sequence of a new plant rhabdovirus, alfalfa dwarf virus (ADV). Seven open reading frames (ORFs) were identified in the antigenomic orientation of the negative-sense, single-stranded viral RNA, in the order 3′-N-P-P3-M-G-P6-L-5′. The ORFs are separated by conserved intergenic regions and the genome coding region is flanked by complementary 3′ leader and 5′ trailer sequences. Phylogenetic analysis of the nucleoprotein amino acid sequence indicated that this alfalfa-infecting rhabdovirus is related to viruses in the genus Cytorhabdovirus. When transiently expressed as GFP fusions in Nicotiana benthamiana leaves, most ADV proteins accumulated in the cell periphery, but unexpectedly P protein was localized exclusively in the nucleus. ADV P protein was shown to have homotypic and heterotypic nuclear interactions with N, P3 and M proteins by bimolecular fluorescence complementation. ADV appears unique in that it combines properties of both cytoplasmic and nuclear plant rhabdoviruses. Highlights: • The complete genome of alfalfa dwarf virus is obtained. • An integrated localization and interaction map for ADV is determined. • ADV has genome sequence similarity and evolutionary links with cytorhabdoviruses. • ADV protein localization and interaction data show an association with the nucleus. • ADV combines properties of both cytoplasmic and nuclear plant rhabdoviruses.
Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla.

PubMed

Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C

1999-08-05

The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16613bp, is presented in this report. The genome is not a specific length because of the presence of the variable numbers of tandem repeats, 5'-CGTGCGTACA in the displacement loop (D-loop). Genes responsible for 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. For evaluating the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than that between either of these species and fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.

Dynamic evolution at pericentromeres.

PubMed

Hall, Anne E; Kettler, Gregory C; Preuss, Daphne

2006-03-01

Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution: its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.
Species-specific markers for the differential diagnosis of Trypanosoma cruzi and Trypanosoma rangeli and polymorphisms detection in Trypanosoma rangeli.

PubMed

Ferreira, Keila Adriana Magalhães; Fajardo, Emanuella Francisco; Baptista, Rodrigo P; Macedo, Andrea Mara; Lages-Silva, Eliane; Ramírez, Luis Eduardo; Pedrosa, André Luiz

2014-06-01

Trypanosoma cruzi and Trypanosoma rangeli are kinetoplastid parasites which are able to infect humans in Central and South America. Misdiagnosis between these trypanosomes can be avoided by targeting barcoding sequences or genes of each organism. This work aims to analyze the feasibility of using species-specific markers for identification of intraspecific polymorphisms and as targets for diagnostic methods by PCR. Accordingly, primers which are able to specifically detect T. cruzi or T. rangeli genomic DNA were characterized. The use of intergenic regions, generally divergent in the trypanosomatids, and the serine carboxypeptidase gene were successful. Using T. rangeli genomic sequences for the identification of group-specific polymorphisms and a polymorphic AT(n) dinucleotide repeat permitted the classification of the strains into two groups, which are entirely coincident with T. rangeli main lineages, KP1 (+) and KP1 (-), previously determined by kinetoplast DNA (kDNA) characterization. The sequences analyzed total 622 bp (382 bp represent a hypothetical protein sequence, and 240 bp represent an anonymous sequence), and of these, 581 (93.3%) are conserved sites and 41 bp (6.7%) are polymorphic, with 9 transitions (21.9%), 2 transversions (4.9%), and 30 (73.2%) insertion/deletion events. Taken together, the species-specific markers analyzed may be useful for the development of new strategies for the accurate diagnosis of infections. Furthermore, the identification of T. rangeli polymorphisms has a direct impact on the understanding of the population structure of this parasite.
Bat white-nose syndrome: A real-time TaqMan polymerase chain reaction test targeting the intergenic spacer region of Geomyces destructans

Treesearch

Laura K Muller; Jeffrey M. Lorch; Daniel L. Lindner; Michael O'Connor; Andrea Gargas; David S. Blehert

2013-01-01

The fungus Geomyces destructans is the causative agent of white-nose syndrome (WNS), a disease that has killed millions of North American hibernating bats. We describe a real-time TaqMan PCR test that detects DNA from G. destructans by targeting a portion of the multicopy intergenic spacer region of the rRNA gene complex. The...
Copy Number Variations in Candidate Genes and Intergenic Regions Affect Body Mass Index and Abdominal Obesity in Mexican Children

PubMed Central

Burguete-García, Ana Isabel; Bonnefond, Amélie; Peralta-Romero, Jesús; Froguel, Philippe

2017-01-01

Introduction. Increase in body weight is a gradual process that usually begins in childhood and in adolescence as a result of multiple interactions among environmental and genetic factors. This study aimed to analyze the relationship between copy number variants (CNVs) in five genes and four intergenic regions with obesity in Mexican children. Methods. We studied 1423 children aged 6–12 years. Anthropometric measurements and blood levels of biochemical parameters were obtained. Identification of CNVs was performed by real-time PCR. The effect of CNVs on obesity or body composition was assessed using regression models adjusted for age, gender, and family history of obesity. Results. Gains in copy numbers of LEPR and NEGR1 were associated with decreased body mass index (BMI), waist circumference (WC), and risk of abdominal obesity, whereas gain in ARHGEF4 and CPXCR1 and the intergenic regions 12q15c, 15q21.1a, and 22q11.21d and losses in INS were associated with increased BMI and WC. Conclusion. Our results indicate a possible contribution of CNVs in LEPR, NEGR1, ARHGEF4, and CPXCR1 and the intergenic regions 12q15c, 15q21.1a, and 22q11.21d to the development of obesity, particularly abdominal obesity in Mexican children. PMID:28428959
Common Viral Integration Sites Identified in Avian Leukosis Virus-Induced B-Cell Lymphomas

PubMed Central

Justice, James F.; Morgan, Robin W.

2015-01-01

Avian leukosis virus (ALV) induces B-cell lymphoma and other neoplasms in chickens by integrating within or near cancer genes and perturbing their expression. Four genes (MYC, MYB, Mir-155, and TERT) have previously been identified as common integration sites in these virus-induced lymphomas and are thought to play a causal role in tumorigenesis. In this study, we employ high-throughput sequencing to identify additional genes driving tumorigenesis in ALV-induced B-cell lymphomas. In addition to the four genes implicated previously, we identify other genes as common integration sites, including TNFRSF1A, MEF2C, CTDSPL, TAB2, RUNX1, MLL5, CXorf57, and BACH2. We also analyze the genome-wide ALV integration landscape in vivo and find increased frequency of ALV integration near transcriptional start sites and within transcripts. Previous work has shown ALV prefers a weak consensus sequence for integration in cultured human cells. We confirm this consensus sequence for ALV integration in vivo in the chicken genome. PMID:26670384
Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents.

PubMed

Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J

2017-01-01

High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, e.g., upland cotton (Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F2, a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD)1, which arose from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the At and Dt subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the Dt subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral At-subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum. However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalanced SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid At and Dt subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G. hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G. hirsutum with a core of reproducible Mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Chloroplast DNA analysis of Tunisian cork oak populations (Quercus suber L.): sequence variations and molecular evolution of the trnL (UAA)-trnF (GAA) region.

PubMed

Abdessamad, A; Baraket, G; Sakka, H; Ammari, Y; Ksontini, M; Hannachi, A Salhi

2016-10-24

Sequences of the trnL-trnF spacer and combined trnL-trnF region in chloroplast DNA of cork oak (Quercus suber L.) were analyzed to detect polymorphisms and to elucidate molecular evolution and demographic history. The aligned sequences varied in length and nucleotide composition. Overall transition/transversion (ti/tv) ratios of 0.724 for the intergenic spacer and 0.258 for the pooled sequences were estimated, indicating that transversions are more frequent than transitions. The molecular evolution and demographic history of Q. suber were investigated. Neutrality tests (Tajima's D and Fu and Li) ruled out the null hypothesis of a strictly neutral model, and Fu's Fs and Ramos-Onsins and Rozas' R2 confirmed the recent expansion of cork oak trees, validating its persistence in North Africa since the last glaciation during the Quaternary. The observed uni-modal mismatch distribution and Harpending's raggedness index confirmed the demographic history model for cork oak. A phylogenetic dendrogram showed that the distribution of Q. suber trees occurs independently of geographical origin, the relief of the population site, and the bioclimatic stages. The molecular history and cytoplasmic diversity suggest that in situ and ex situ conservation strategies can be recommended for preserving landscape value and facing predictable future climatic changes.
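As an illustration of the ti/tv statistic reported in the record above, the following minimal Python sketch counts transitions (A<->G, C<->T) and transversions between two aligned sequences. It is a plain pairwise count, not the estimation procedure used by the authors (which the abstract does not describe); the function name `ti_tv_ratio` and the toy sequences are hypothetical.

```python
# Purine<->purine (A/G) and pyrimidine<->pyrimidine (C/T) changes are transitions;
# every other single-base change is a transversion.
TRANSITIONS = {frozenset("AG"), frozenset("CT")}


def ti_tv_ratio(seq_a, seq_b):
    """Return (transitions, transversions, ratio) for two aligned sequences.

    Gaps and ambiguous bases are skipped. The ratio is None when no
    transversions are observed.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    ti = tv = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue  # identical site, gap, or ambiguous base
        if frozenset((a, b)) in TRANSITIONS:
            ti += 1
        else:
            tv += 1
    ratio = ti / tv if tv else None
    return ti, tv, ratio


# Toy alignment: one transition (A->G) and two transversions (A->C, A->T).
print(ti_tv_ratio("AAAA", "GCTA"))
```

A ratio below 1, as in this toy alignment, means transversions outnumber transitions, which is the pattern the abstract reports for the trnL-trnF region.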
Modeling of DNA local parameters predicts encrypted architectural motifs in Xenopus laevis ribosomal gene promoter.

PubMed

Roux-Rouquie, M; Marilley, M

2000-09-15

We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Degenerative minimalism in the genome of a psyllid endosymbiont.

PubMed

Clark, M A; Baumann, L; Thao, M L; Moran, N A; Baumann, P

2001-03-01

Psyllids, like aphids, feed on plant phloem sap and are obligately associated with prokaryotic endosymbionts acquired through vertical transmission from an ancestral infection. We have sequenced 37 kb of DNA of the genome of Carsonella ruddii, the endosymbiont of psyllids, and found that it has a number of unusual properties revealing a more extreme case of degeneration than was previously reported from studies of eubacterial genomes, including that of the aphid endosymbiont Buchnera aphidicola. Among the unusual properties are an exceptionally low guanine-plus-cytosine content (19.9%), almost complete absence of intergenic spaces, operon fusion, and lack of the usual promoter sequences upstream of 16S rDNA. These features suggest the synthesis of long mRNAs and translational coupling. The most extreme instances of base compositional bias occur in the genes encoding proteins that have less highly conserved amino acid sequences; the guanine-plus-cytosine content of some protein-coding sequences is as low as 10%. The shift in base composition has a large effect on proteins: in polypeptides of C. ruddii, half of the residues consist of five amino acids with codons low in guanine plus cytosine. Furthermore, the proteins of C. ruddii are reduced in size, with an average of about 9% fewer amino acids than in homologous proteins of related bacteria. These observations suggest that the C. ruddii genome is not subject to constraints that limit the evolution of other known eubacteria.
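The guanine-plus-cytosine content quoted in the record above is a simple base-composition percentage. A minimal Python sketch of that calculation follows; the function name `gc_content` and the example sequence are hypothetical and are not taken from the Carsonella ruddii data.

```python
def gc_content(seq):
    """Return the G+C percentage of a DNA sequence.

    Only unambiguous bases (A, C, G, T) are counted, so gaps and N
    characters do not affect the result.
    """
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Toy AT-rich sequence: 2 of 10 bases are G or C.
print(gc_content("ATATATATGC"))
```

Applied per gene rather than per genome, the same function would give the kind of per-coding-sequence values the abstract mentions.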
Use of DNA barcodes to identify flowering plants.

PubMed

Kress, W John; Wurdack, Kenneth J; Zimmer, Elizabeth A; Weigt, Lee A; Janzen, Daniel H

2005-06-07

Methods for identifying species by using short orthologous DNA sequences, known as "DNA barcodes," have been proposed and initiated to facilitate biodiversity studies, identify juveniles, associate sexes, and enhance forensic analyses. The cytochrome c oxidase 1 sequence, which has been found to be widely applicable in animal barcoding, is not appropriate for most species of plants because of a much slower rate of cytochrome c oxidase 1 gene evolution in higher plants than in animals. We therefore propose the nuclear internal transcribed spacer region and the plastid trnH-psbA intergenic spacer as potentially usable DNA regions for applying barcoding to flowering plants. The internal transcribed spacer is the most commonly sequenced locus used in plant phylogenetic investigations at the species level and shows high levels of interspecific divergence. The trnH-psbA spacer, although short (approximately 450 bp), is the most variable plastid region in angiosperms and is easily amplified across a broad range of land plants. Comparison of the total plastid genomes of tobacco and deadly nightshade, enhanced with trials on widely divergent angiosperm taxa, including closely related species in seven plant families and a group of species sampled from a local flora encompassing 50 plant families (for a total of 99 species, 80 genera, and 53 families), suggests that the sequences in this pair of loci have the potential to discriminate among the largest number of plant species for barcoding purposes.
The complete nucleotide sequences of the 5 genetically distinct plastid genomes of Oenothera, subsection Oenothera: II. A microevolutionary view using bioinformatics and formal genetic data.

PubMed

Greiner, Stephan; Wang, Xi; Herrmann, Reinhold G; Rauwolf, Uwe; Mayer, Klaus; Haberer, Georg; Meurer, Jörg

2008-09-01

A unique combination of genetic features and a rich stock of information make the flowering plant genus Oenothera an appealing model to explore the molecular basis of speciation processes including nucleus-organelle coevolution. From representative species, we have recently reported complete nucleotide sequences of the 5 basic and genetically distinguishable plastid chromosomes of subsection Oenothera (I-V). In nature, Oenothera plastid genomes are associated with 6 distinct, either homozygous or heterozygous, diploid nuclear genotypes of the 3 basic genomes A, B, or C. Artificially produced plastome-genome combinations that do not occur naturally often display interspecific plastome-genome incompatibility (PGI). In this study, we compare formal genetic data available from all 30 plastome-genome combinations with sequence differences between the plastomes to uncover potential determinants for interspecific PGI. Consistent with an active role in speciation, a remarkable number of genes have high Ka/Ks ratios. Different from the Solanacean cybrid model Atropa/tobacco, RNA editing seems not to be relevant for PGIs in Oenothera. However, predominantly sequence polymorphisms in intergenic segments are proposed as possible sources for PGI. A single locus, the bidirectional promoter region between psbB and clpP, is suggested to contribute to compartmental PGI in the interspecific AB hybrid containing plastome I (AB-I), consistent with its perturbed photosystem II activity.
A novel, highly divergent ssDNA virus identified in Brazil infecting apple, pear and grapevine.

PubMed

Basso, Marcos Fernando; da Silva, José Cleydson Ferreira; Fajardo, Thor Vinícius Martins; Fontes, Elizabeth Pacheco Batista; Zerbini, Francisco Murilo

2015-12-02

Fruit trees of temperate and tropical climates are of great economic importance worldwide and several viruses have been reported affecting their productivity and longevity. Fruit trees of different Brazilian regions displaying virus-like symptoms were evaluated for infection by circular DNA viruses. Seventy-four fruit trees were sampled and a novel, highly divergent, monopartite circular ssDNA virus was cloned from apple, pear and grapevine trees. Forty-five complete viral genomes were sequenced, with a size of approx. 3.4 kb and organized into five ORFs. Deduced amino acid sequences showed identities in the range of 38% with unclassified circular ssDNA viruses, nanoviruses and alphasatellites (putative Replication-associated protein, Rep), and begomo-, curto- and mastreviruses (putative coat protein, CP, and movement protein, MP). A large intergenic region contains a short palindromic sequence capable of forming a hairpin-like structure with the loop sequence TAGTATTAC, identical to the conserved nonanucleotide of circoviruses, nanoviruses and alphasatellites. Recombination events were not detected and phylogenetic analysis showed a relationship with circo-, nano- and geminiviruses. PCR confirmed the presence of this novel ssDNA virus in field plants. Infectivity tests using the cloned viral genome confirmed its ability to infect apple and pear tree seedlings, but not Nicotiana benthamiana. The name "Temperate fruit decay-associated virus" (TFDaV) is proposed for this novel virus. Copyright © 2015 Elsevier B.V. All rights reserved.
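Locating a conserved nonanucleotide such as the TAGTATTAC loop sequence named in the record above is a simple motif search, with the wrinkle that a circular genome can carry the motif across the arbitrary start of its linear representation. The Python sketch below is one way to do this under stated assumptions: the function names and the toy genome are hypothetical, and nothing here reproduces the authors' own analysis.

```python
def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_motif_circular(genome, motif="TAGTATTAC"):
    """Return sorted (position, strand) hits of a motif in a circular genome.

    Positions are 0-based offsets in the linear representation. The first
    len(motif) - 1 bases are appended to the end so that hits spanning the
    origin are found. '-' hits match the motif's reverse complement.
    """
    genome = genome.upper()
    motif = motif.upper()
    if len(genome) < len(motif):
        return []
    extended = genome + genome[: len(motif) - 1]
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = extended.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = extended.find(pattern, start + 1)
    return sorted(hits)


# Toy circular genome in which the motif wraps around the origin.
print(find_motif_circular("TTACGGGGTAGTA"))
```

Checking for the flanking inverted repeats that would form the hairpin stem is a separate step and is not attempted here.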
Analysis of sequence variation among smeDEF multi drug efflux pump genes and flanking DNA from defined 16S rRNA subgroups of clinical Stenotrophomonas maltophilia isolates.

PubMed

Gould, Virginia C; Okazaki, Aki; Howe, Robin A; Avison, Matthew B

2004-08-01

To determine the level of variation in the smeDEF efflux pump and smeT transcriptional regulator genes among three defined 16S rRNA sequence subgroups of clinical Stenotrophomonas maltophilia isolates. smeDEF sequencing used a PCR genome walking approach. Determination of the sequence surrounding smeDEF used a flanking primer PCR method and specific primers anchored in smeD or smeF together with random primers. smeDEF is chromosomal and located in the same position in the chromosome in all three subgroups of isolates. Flanking smeD is a gene, smeT, encoding a putative transcriptional repressor for smeDEF. Variation at these loci among the isolates is considerably lower (up to 10%) than at intrinsic beta-lactamase loci (up to 30%) in the same isolates, implying greater functional constraint. The smeD-smeT intergenic region contains a highly conserved section, which maps with previously predicted promoter/operator regions, and a hypervariable untranslated region, which can be used to subgroup clinical isolates. These data provide further evidence that it is possible to group clinical isolates of the inherently variable species, S. maltophilia, based on genotypic properties. Isolate D457, in which most work concerning smeDEF expression has been performed, does not fall into S. maltophilia subgroup A, which is the most typical.
Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes.

PubMed

Sabir, Jamal; Schwarz, Erika; Ellison, Nicholas; Zhang, Jin; Baeshen, Nabih A; Mutwakil, Muhammed; Jansen, Robert; Ruhlman, Tracey

2014-08-01

Land plant plastid genomes (plastomes) provide a tractable model for evolutionary study in that they are relatively compact and gene dense. Among the groups that display an appropriate level of variation for structural features, the inverted-repeat-lacking clade (IRLC) of papilionoid legumes presents the potential to advance general understanding of the mechanisms of genomic evolution. Here, six complete plastome sequences are presented from economically important species of the IRLC, a lineage previously represented by only five completed plastomes. A number of characters are compared across the IRLC including gene retention and divergence, synteny, repeat structure and functional gene transfer to the nucleus. The loss of clpP intron 2 was identified in one newly sequenced member of IRLC, Glycyrrhiza glabra. Deeply sequenced nuclear transcriptomes from two species helped clarify the nature of the functional transfer of accD to the nucleus in Trifolium, which likely occurred in the lineage leading to subgenus Trifolium. Legumes are second only to cereal crops in agricultural importance based on area harvested and total production. Genetic improvement via plastid transformation of IRLC crop species is an appealing proposition. Comparative analyses of intergenic spacer regions emphasize the need for complete genome sequences for developing transformation vectors for plastid genetic engineering of legume crops. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Transposons play an important role in the evolution and diversification of centromeres among closely related species

PubMed Central

Gao, Dongying; Jiang, Ning; Wing, Rod A.; Jiang, Jiming; Jackson, Scott A.

2015-01-01

Centromeres are important chromosomal regions necessary for eukaryotic cell segregation and replication. Due to high amounts of tandem repeats and transposons, centromeres have been difficult to sequence in most multicellular organisms, thus their sequence structure and evolution are poorly understood. In this study, we analyzed transposons in the centromere 8 (Cen8) from the African cultivated rice (O. glaberrima) and two subspecies of the Asian cultivated rice (O. sativa), indica and japonica. We detected much higher transposon contents (>69%) in centromere regions than in the whole genomes of O. sativa ssp. japonica and O. glaberrima (~35%). We compared the three Cen8s and identified numerous recent insertions of transposons that were frequently organized into multiple-layer nested blocks, similar to nested transposons in maize. Except for the Hopi retrotransposon, all LTR retrotransposons were shared but exhibit different abundances amongst the three Cen8s. Even though a majority of the transposons were located in intergenic regions, some gene-related transposons were found and may be involved in gene diversification. Chromatin immunoprecipitated (ChIP) data analysis revealed that 165 families from both Class I and Class II transposons were found in CENH3-associated chromatin sequences. These results indicate essential roles for transposons in centromeres and that the rapid divergence of the Cen8 sequences between the two cultivated rice species was primarily caused by recent transposon insertions. PMID:25904926
Whole-genome sequencing of Aspergillus tubingensis G131 and overview of its secondary metabolism potential.

PubMed

Choque, Elodie; Klopp, Christophe; Valiere, Sophie; Raynal, José; Mathieu, Florence

2018-03-15

Black Aspergilli represent one of the most important fungal resources of primary and secondary metabolites for the biotechnology industry. Having several sequenced black Aspergilli genomes should allow the production of certain metabolites with bioactive properties to be targeted. In this study, we report the draft genome of a black Aspergillus, A. tubingensis G131, isolated from a French Mediterranean vineyard. This 35 Mb genome includes 10,994 predicted genes. A genome-based discovery approach identifies 80 secondary metabolite biosynthetic gene clusters. Genomic sequences of these clusters were compared by BLAST against 3 chosen black Aspergilli genomes: A. tubingensis CBS 134.48, A. niger CBS 513.88 and A. kawachii IFO 4308. This comparison highlights different levels of cluster conservation between the four strains. It also identifies seven unique clusters in A. tubingensis G131. Moreover, the putative secondary metabolite clusters for asperazine and naphtho-gamma-pyrone production were proposed based on this genomic analysis. Key biosynthetic genes required for the production of 2 mycotoxins, ochratoxin A and fumonisin, are absent from this draft genome. Even though intergenic sequences of these mycotoxin biosynthetic pathways are present, they could not lead to the production of those mycotoxins by A. tubingensis G131. Functional and bioinformatics analyses of the A. tubingensis G131 genome highlight its potential for metabolite production, in particular for TAN-1612, asperazine and naphtho-gamma-pyrones, which present antioxidant, anticancer or antibiotic properties.
BASIC PENTACYSTEINE Proteins Mediate MADS Domain Complex Binding to the DNA for Tissue-Specific Expression of Target Genes in Arabidopsis

PubMed Central

Simonini, Sara; Roig-Villanova, Irma; Gregis, Veronica; Colombo, Bilitis; Colombo, Lucia; Kater, Martin M.

2012-01-01

BASIC PENTACYSTEINE (BPC) transcription factors have been identified in a large variety of plant species. In Arabidopsis thaliana there are seven BPC genes, which, except for BPC5, are expressed ubiquitously. BPC genes are functionally redundant in a wide range of developmental processes. Recently, we reported that BPC1 binds to guanine and adenine (GA)–rich consensus sequences in the SEEDSTICK (STK) promoter in vitro and induces conformational changes. Here we show by chromatin immunoprecipitation experiments that in vivo BPCs also bind to the consensus boxes, and when these were mutated, expression from the STK promoter was derepressed, resulting in ectopic expression in the inflorescence. We also reveal that SHORT VEGETATIVE PHASE (SVP) is a direct regulator of STK. SVP is a floral meristem identity gene belonging to the MADS box gene family. The SVP-APETALA1 (AP1) dimer recruits the SEUSS (SEU)-LEUNIG (LUG) transcriptional cosuppressor to repress floral homeotic gene expression in the floral meristem. Interestingly, we found that GA consensus sequences in the STK promoter to which BPCs bind are essential for recruitment of the corepressor complex to this promoter. Our data suggest that we have identified a new regulatory mechanism controlling plant gene expression that is probably generally used, when considering BPCs’ wide expression profile and the frequent presence of consensus binding sites in plant promoters. PMID:23054472
Modeling repetitive, non‐globular proteins

PubMed Central

Basu, Koli; Campbell, Robert L.; Guo, Shuaiqi; Sun, Tianjun

2016-01-01

While ab initio modeling of protein structures is not routine, certain types of proteins are more straightforward to model than others. Proteins with short repetitive sequences typically exhibit repetitive structures. These repetitive sequences can be more amenable to modeling if some information is known about the predominant secondary structure or other key features of the protein sequence. We have successfully built models of a number of repetitive structures with novel folds using knowledge of the consensus sequence within the sequence repeat and an understanding of the likely secondary structures that these may adopt. Our methods for achieving this success are reviewed here. PMID:26914323
Towards the Rational Design of a Candidate Vaccine against Pregnancy Associated Malaria: Conserved Sequences of the DBL6ε Domain of VAR2CSA

PubMed Central

Badaut, Cyril; Bertin, Gwladys; Rustico, Tatiana; Fievet, Nadine; Massougbodji, Achille; Gaye, Alioune; Deloron, Philippe

2010-01-01

Background: Placental malaria is a disease linked to the sequestration of Plasmodium falciparum infected red blood cells (IRBC) in the placenta, leading to reduced materno-fetal exchanges and to local inflammation. One of the virulence factors of P. falciparum involved in cytoadherence to chondroitin sulfate A, its placental receptor, is the adhesive protein VAR2CSA. Its localisation on the surface of IRBC makes it accessible to the immune system. VAR2CSA contains six DBL domains. The DBL6ε domain is the most variable. High variability constitutes a means for the parasite to evade the host immune response. The DBL6ε domain could constitute a very attractive basis for a vaccine candidate, but its reported variability necessitates, for antigenic characterisations, identifying and classifying commonalities across isolates. Methodology/Principal Findings: Local alignment analysis of the DBL6ε domain revealed that it is not as variable as previously described. Variability is concentrated in seven regions present on the surface of the DBL6ε domain. The main goal of our work is to classify and group variable sequences in a way that will simplify further research to determine dominant epitopes. Firstly, variable sequences were grouped according to their average percent pairwise identity (APPI). Groups comprising many variable sequences sharing low variability were found. Secondly, ELISA experiments following the IgG recognition of a recombinant DBL6ε domain, and of peptides mimicking its seven variable blocks, made it possible to determine an APPI cut-off and to isolate groups represented by a single consensus sequence. Conclusions/Significance: A new sequence approach is used to compare variable regions in sequences that have extensive segmental gene relationship. Using this approach, the VAR2CSA DBL6ε domain is composed of 7 variable blocks with limited polymorphism. Each variable block is composed of a limited number of consensus types. Based on peptide-based ELISA, variable blocks with 85% or greater sequence identity are expected to be recognized equally well by antibody and can be considered the same consensus type. Therefore, the analysis of the antibody response against the classified small number of sequences should be helpful to determine epitopes. PMID:20585655
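The grouping idea in the record above, average percent pairwise identity (APPI) with an 85% cut-off, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the greedy grouping rule, and the assumption that sequences are already aligned to equal length are all hypothetical choices made for illustration only.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns that match.

    Assumes a and b are pre-aligned and of equal length; columns where
    both sequences carry a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def appi(seqs: list[str]) -> float:
    """Average percent pairwise identity over all unordered pairs."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        return 100.0  # a single sequence is trivially identical to itself
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


def group_by_cutoff(seqs: list[str], cutoff: float = 85.0) -> list[list[str]]:
    """Greedy grouping (an illustrative assumption, not the published method).

    A sequence joins the first group in which it is at least `cutoff`
    percent identical to every existing member; otherwise it starts a
    new group.
    """
    groups: list[list[str]] = []
    for s in seqs:
        for g in groups:
            if all(percent_identity(s, m) >= cutoff for m in g):
                g.append(s)
                break
        else:
            groups.append([s])
    return groups
```

Under this sketch, each resulting group would correspond to one candidate "consensus type"; the 85% default mirrors the cut-off reported in the abstract, while the order-dependent greedy rule is only one of several possible clustering choices.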
