Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga
2015-01-01
Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy. PMID:25860802
Wu, Fengnian; Jiang, Hongyan; Beattie, G Andrew C; Holford, Paul; Chen, Jianchi; Wallis, Christopher M; Zheng, Zheng; Deng, Xiaoling; Cen, Yijing
2018-04-24
Diaphorina citri (Asian citrus psyllid; ACP) transmits 'Candidatus Liberibacter asiaticus' associated with citrus Huanglongbing (HLB). ACP has been reported in 11 provinces/regions in China, yet its population diversity remains unclear. In this study, we evaluated ACP population diversity in China using representative whole mitochondrial genome (mitogenome) sequences. Additional mitogenome sequences outside China were also acquired and evaluated. The sizes of the 27 ACP mitogenome sequences ranged from 14 986 to 15 030 bp. Along with three previously published mitogenome sequences, the 30 sequences formed three major mitochondrial groups (MGs): MG1, present in southwestern China and occurring at elevations above 1000 m; MG2, present in southeastern China and Southeast Asia (Cambodia, Indonesia, Malaysia, and Vietnam) and occurring at elevations below 180 m; and MG3, present in the USA and Pakistan. Single nucleotide polymorphisms in five genes (cox2, atp8, nad3, nad1 and rrnL) contributed mostly in the ACP diversity. Among these genes, rrnL had the most variation. Mitogenome sequences analyses revealed two major phylogenetic groups of ACP present in China as well as a possible unique group present currently in Pakistan and the USA. The information could have significant implications for current ACP control and HLB management. © 2018 Society of Chemical Industry. © 2018 Society of Chemical Industry.
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.
Sakai, Ryo; Aerts, Jan
2014-01-01
The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
NASA Astrophysics Data System (ADS)
Zhang, Xiao-Yong; Wang, Guang-Hua; Xu, Xin-Ya; Nong, Xu-Hua; Wang, Jie; Amin, Muhammad; Qi, Shu-Hua
2016-10-01
The present study investigated the fungal diversity in four different deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing of the nuclear ribosomal internal transcribed spacer-1 (ITS1). A total of 40,297 fungal ITS1 sequences clustered into 420 operational taxonomic units (OTUs) with 97% sequence similarity and 170 taxa were recovered from these sediments. Most ITS1 sequences (78%) belonged to the phylum Ascomycota, followed by Basidiomycota (17.3%), Zygomycota (1.5%) and Chytridiomycota (0.8%), and a small proportion (2.4%) belonged to unassigned fungal phyla. Compared with previous studies on fungal diversity of sediments from deep-sea environments by culture-dependent approach and clone library analysis, the present result suggested that Illumina sequencing had been dramatically accelerating the discovery of fungal community of deep-sea sediments. Furthermore, our results revealed that Sordariomycetes was the most diverse and abundant fungal class in this study, challenging the traditional view that the diversity of Sordariomycetes phylotypes was low in the deep-sea environments. In addition, more than 12 taxa accounted for 21.5% sequences were found to be rarely reported as deep-sea fungi, suggesting the deep-sea sediments from Okinawa Trough harbored a plethora of different fungal communities compared with other deep-sea environments. To our knowledge, this study is the first exploration of the fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing.
Aguilera-Mendoza, Longendri; Marrero-Ponce, Yovani; Tellez-Ibarra, Roberto; Llorente-Quesada, Monica T; Salgado, Jesús; Barigye, Stephen J; Liu, Jun
2015-08-01
The large variety of antimicrobial peptide (AMP) databases developed to date are characterized by a substantial overlap of data and similarity of sequences. Our goals are to analyze the levels of redundancy for all available AMP databases and use this information to build a new non-redundant sequence database. For this purpose, a new software tool is introduced. A comparative study of 25 AMP databases reveals the overlap and diversity among them and the internal diversity within each database. The overlap analysis shows that only one database (Peptaibol) contains exclusive data, not present in any other, whereas all sequences in the LAMP_Patent database are included in CAMP_Patent. However, the majority of databases have their own set of unique sequences, as well as some overlap with other databases. The complete set of non-duplicate sequences comprises 16 990 cases, which is almost half of the total number of reported peptides. On the other hand, the diversity analysis identifies the most and least diverse databases and proves that all databases exhibit some level of redundancy. Finally, we present a new parallel-free software, named Dover Analyzer, developed to compute the overlap and diversity between any number of databases and compile a set of non-redundant sequences. These results are useful for selecting or building a suitable representative set of AMPs, according to specific needs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Harris, Katherine E; Aldred, Shelley Force; Davison, Laura M; Ogana, Heather Anne N; Boudreau, Andrew; Brüggemann, Marianne; Osborn, Michael; Ma, Biao; Buelow, Benjamin; Clarke, Starlynn C; Dang, Kevin H; Iyer, Suhasini; Jorgensen, Brett; Pham, Duy T; Pratap, Payal P; Rangaswamy, Udaya S; Schellenberger, Ute; van Schooten, Wim C; Ugamraj, Harshad S; Vafa, Omid; Buelow, Roland; Trinklein, Nathan D
2018-01-01
We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1). This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs) isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.
Yu, Zhongtang; Yu, Marie; Morrison, Mark
2006-04-01
Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
Complete sequence and diversity of a maize-associated Polerovirus in East Africa
USDA-ARS?s Scientific Manuscript database
Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae,...
Broad Surveys of DNA Viral Diversity Obtained through Viral Metagenomics of Mosquitoes
Ng, Terry Fei Fan; Willner, Dana L.; Lim, Yan Wei; Schmieder, Robert; Chau, Betty; Nilsson, Christina; Anthony, Simon; Ruan, Yijun; Rohwer, Forest; Breitbart, Mya
2011-01-01
Viruses are the most abundant and diverse genetic entities on Earth; however, broad surveys of viral diversity are hindered by the lack of a universal assay for viruses and the inability to sample a sufficient number of individual hosts. This study utilized vector-enabled metagenomics (VEM) to provide a snapshot of the diversity of DNA viruses present in three mosquito samples from San Diego, California. The majority of the sequences were novel, suggesting that the viral community in mosquitoes, as well as the animal and plant hosts they feed on, is highly diverse and largely uncharacterized. Each mosquito sample contained a distinct viral community. The mosquito viromes contained sequences related to a broad range of animal, plant, insect and bacterial viruses. Animal viruses identified included anelloviruses, circoviruses, herpesviruses, poxviruses, and papillomaviruses, which mosquitoes may have obtained from vertebrate hosts during blood feeding. Notably, sequences related to human papillomaviruses were identified in one of the mosquito samples. Sequences similar to plant viruses were identified in all mosquito viromes, which were potentially acquired through feeding on plant nectar. Numerous bacteriophages and insect viruses were also detected, including a novel densovirus likely infecting Culex erythrothorax. Through sampling insect vectors, VEM enables broad survey of viral diversity and has significantly increased our knowledge of the DNA viruses present in mosquitoes. PMID:21674005
Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S
2015-09-01
The basis of the present study was to distinguish the existence of any genetic variability among populations of Culex quinquefasciatus which would be a valuable tool in the management of mosquito control programmes. In the present study, population of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito population collected at different locations is considered an important issue towards their influence and nature of vector potential of these mosquitoes.
Feranchuk, Sergey; Belkova, Natalia; Potapova, Ulyana; Kuzmin, Dmitry; Belikov, Sergei
2018-05-23
Several measures of biodiversity are commonly used to describe microbial communities, analyzed using 16S gene sequencing. A wide range of available experiments on 16S gene sequencing allows us to present a framework for a comparison of various diversity indices. The criterion for the comparison is the statistical significance of the difference in index values for microbial communities with different traits, within the same experiment. The results of the evaluation indicate that Shannon diversity is the most effective measure among the commonly used diversity indices. The results also indicate that, within the present framework, the Gini coefficient as a diversity index is comparable to Shannon diversity, despite the fact that the Gini coefficient, as a diversity estimator, is far less popular in microbiology than several other measures. Copyright © 2018 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity.
Rodriguez-R, Luis M; Gunturu, Santosh; Tiedje, James M; Cole, James R; Konstantinidis, Konstantinos T
2018-01-01
Estimations of microbial community diversity based on metagenomic data sets are affected, often to an unknown degree, by biases derived from insufficient coverage and reference database-dependent estimations of diversity. For instance, the completeness of reference databases cannot be generally estimated since it depends on the extant diversity sampled to date, which, with the exception of a few habitats such as the human gut, remains severely undersampled. Further, estimation of the degree of coverage of a microbial community by a metagenomic data set is prohibitively time-consuming for large data sets, and coverage values may not be directly comparable between data sets obtained with different sequencing technologies. Here, we extend Nonpareil, a database-independent tool for the estimation of coverage in metagenomic data sets, to a high-performance computing implementation that scales up to hundreds of cores and includes, in addition, a k -mer-based estimation as sensitive as the original alignment-based version but about three hundred times as fast. Further, we propose a metric of sequence diversity ( N d ) derived directly from Nonpareil curves that correlates well with alpha diversity assessed by traditional metrics. We use this metric in different experiments demonstrating the correlation with the Shannon index estimated on 16S rRNA gene profiles and show that N d additionally reveals seasonal patterns in marine samples that are not captured by the Shannon index and more precise rankings of the magnitude of diversity of microbial communities in different habitats. Therefore, the new version of Nonpareil, called Nonpareil 3, advances the toolbox for metagenomic analyses of microbiomes. IMPORTANCE Estimation of the coverage provided by a metagenomic data set, i.e., what fraction of the microbial community was sampled by DNA sequencing, represents an essential first step of every culture-independent genomic study that aims to robustly assess the sequence diversity present in a sample. However, estimation of coverage remains elusive because of several technical limitations associated with high computational requirements and limiting statistical approaches to quantify diversity. Here we described Nonpareil 3, a new bioinformatics algorithm that circumvents several of these limitations and thus can facilitate culture-independent studies in clinical or environmental settings, independent of the sequencing platform employed. In addition, we present a new metric of sequence diversity based on rarefied coverage and demonstrate its use in communities from diverse ecosystems.
Xiao, Fanshu; Yu, Yuhe; Li, Jinjin; Juneau, Philippe; Yan, Qingyun
2018-05-25
The 16S rRNA gene is one of the most commonly used molecular markers for estimating bacterial diversity during the past decades. However, there is no consistency about the sequencing depth (from thousand to millions of sequences per sample), and the clustering methods used to generate OTUs may also be different among studies. These inconsistent premises make effective comparisons among studies difficult or unreliable. This study aims to examine the necessary sequencing depth and clustering method that would be needed to ensure a stable diversity patterns for studying fish gut microbiota. A total number of 42 samples dataset of Siniperca chuatsi (carnivorous fish) gut microbiota were used to test how the sequencing depth and clustering may affect the alpha and beta diversity patterns of fish intestinal microbiota. Interestingly, we found that the sequencing depth (resampling 1000-11,000 per sample) and the clustering methods (UPARSE and UCLUST) did not bias the estimates of the diversity patterns during the fish development from larva to adult. Although we should acknowledge that a suitable sequencing depth may differ case by case, our finding indicates that a shallow sequencing such as 1000 sequences per sample may be also enough to reflect the general diversity patterns of fish gut microbiota. However, we have shown in the present study that strict pre-processing of the original sequences is required to ensure reliable results. This study provides evidences to help making a strong scientific choice of the sequencing depth and clustering method for future studies on fish gut microbiota patterns, but at the same time reducing as much as possible the costs related to the analysis.
Viral metagenomic analysis of feces of wild small carnivores
2014-01-01
Background Recent studies have clearly demonstrated the enormous virus diversity that exists among wild animals. This exemplifies the required expansion of our knowledge of the virus diversity present in wildlife, as well as the potential transmission of these viruses to domestic animals or humans. Methods In the present study we evaluated the viral diversity of fecal samples (n = 42) collected from 10 different species of wild small carnivores inhabiting the northern part of Spain using random PCR in combination with next-generation sequencing. Samples were collected from American mink (Neovison vison), European mink (Mustela lutreola), European polecat (Mustela putorius), European pine marten (Martes martes), stone marten (Martes foina), Eurasian otter (Lutra lutra) and Eurasian badger (Meles meles) of the family of Mustelidae; common genet (Genetta genetta) of the family of Viverridae; red fox (Vulpes vulpes) of the family of Canidae and European wild cat (Felis silvestris) of the family of Felidae. Results A number of sequences of possible novel viruses or virus variants were detected, including a theilovirus, phleboviruses, an amdovirus, a kobuvirus and picobirnaviruses. Conclusions Using random PCR in combination with next generation sequencing, sequences of various novel viruses or virus variants were detected in fecal samples collected from Spanish carnivores. Detected novel viruses highlight the viral diversity that is present in fecal material of wild carnivores. PMID:24886057
Diversity of Babesia bovis merozoite surface antigen genes in the Philippines.
Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Ybanez, Adrian Patalinghug; Ybanez, Rochelle Haidee Daclan; Perez, Zandro Obligado; Guswanto, Azirwan; Igarashi, Ikuo; Yokoyama, Naoaki
2014-02-01
Babesia bovis is the causative agent of fatal babesiosis in cattle. In the present study, we investigated the genetic diversity of B. bovis among Philippine cattle, based on the genes that encode merozoite surface antigens (MSAs). Forty-one B. bovis-positive blood DNA samples from cattle were used to amplify the msa-1, msa-2b, and msa-2c genes. In phylogenetic analyses, the msa-1, msa-2b, and msa-2c gene sequences generated from Philippine B. bovis-positive DNA samples were found in six, three, and four different clades, respectively. All of the msa-1 and most of the msa-2b sequences were found in clades that were formed only by Philippine msa sequences in the respective phylograms. While all the msa-1 sequences from the Philippines showed similarity to those formed by Australian msa-1 sequences, the msa-2b sequences showed similarity to either Australian or Mexican msa-2b sequences. In contrast, msa-2c sequences from the Philippines were distributed across all the clades of the phylogram, although one clade was formed exclusively by Philippine msa-2c sequences. Similarities among the deduced amino acid sequences of MSA-1, MSA-2b, and MSA-2c from the Philippines were 62.2-100, 73.1-100, and 67.3-100%, respectively. The present findings demonstrate that B. bovis populations are genetically diverse in the Philippines. This information will provide a good foundation for the future design and implementation of improved immunological preventive methodologies against bovine babesiosis in the Philippines. The study has also generated a set of data that will be useful for futher understanding of the global genetic diversity of this important parasite. © 2013.
Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing
Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken
2012-01-01
Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
Nagano, Daisuke; Sivakumar, Thillaiampalam; De De Macedo, Alane Caine Costa; Inpankaew, Tawin; Alhassan, Andy; Igarashi, Ikuo; Yokoyama, Naoaki
2013-11-01
In the present study, we screened blood DNA samples obtained from cattle bred in Brazil (n=164) and Ghana (n=80) for Babesia bovis using a diagnostic PCR assay and found prevalences of 14.6% and 46.3%, respectively. Subsequently, the genetic diversity of B. bovis in Thailand, Brazil and Ghana was analyzed, based on the DNA sequence of merozoite surface antigen-1 (MSA-1). In Thailand, MSA-1 sequences were relatively conserved and found in a single clade of the phylogram, while Brazilian MSA-1 sequences showed high genetic diversity and were dispersed across three different clades. In contrast, the sequences from Ghanaian samples were detected in two different clades, one of which contained only a single Ghanaian sequence. The identities among the MSA-1 sequences from Thailand, Brazil and Ghana were 99.0-100%, 57.5-99.4% and 60.3-100%, respectively, while the similarities among the deduced MSA-1 amino acid sequences within the respective countries were 98.4-100%, 59.4-99.7% and 58.7-100%, respectively. These observations suggested that the genetic diversity of B. bovis based on MSA-1 sequences was higher in Brazil and Ghana than in Thailand. The current data highlight the importance of conducting extensive studies on the genetic diversity of B. bovis before designing immune control strategies in each surveyed country.
A communal catalogue reveals Earth's multiscale microbial diversity.
Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob
2017-11-23
Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
Genetic Diversity of Bacterial Communities and Gene Transfer Agents in Northern South China Sea
Sun, Fu-Lin; Wang, You-Shao; Wu, Mei-Lin; Jiang, Zhao-Yu; Sun, Cui-Ci; Cheng, Hao
2014-01-01
Pyrosequencing of the 16S ribosomal RNA gene (rDNA) amplicons was performed to investigate the unique distribution of bacterial communities in northern South China Sea (nSCS) and evaluate community structure and spatial differences of bacterial diversity. Cyanobacteria, Proteobacteria, Actinobacteria, and Bacteroidetes constitute the majority of bacteria. The taxonomic description of bacterial communities revealed that more Chroococcales, SAR11 clade, Acidimicrobiales, Rhodobacterales, and Flavobacteriales are present in the nSCS waters than other bacterial groups. Rhodobacterales were less abundant in tropical water (nSCS) than in temperate and cold waters. Furthermore, the diversity of Rhodobacterales based on the gene transfer agent (GTA) major capsid gene (g5) was investigated. Four g5 gene clone libraries were constructed from samples representing different regions and yielded diverse sequences. Fourteen g5 clusters could be identified among 197 nSCS clones. These clusters were also related to known g5 sequences derived from genome-sequenced Rhodobacterales. The composition of g5 sequences in surface water varied with the g5 sequences in the sampling sites; this result indicated that the Rhodobacterales population could be highly diverse in nSCS. Phylogenetic tree analysis result indicated distinguishable diversity patterns among tropical (nSCS), temperate, and cold waters, thereby supporting the niche adaptation of specific Rhodobacterales members in unique environments. PMID:25364820
Davidsson, Marcus; Diaz-Fernandez, Paula; Schwich, Oliver D.; Torroba, Marcos; Wang, Gang; Björklund, Tomas
2016-01-01
Detailed characterization and mapping of oligonucleotide function in vivo is generally a very time consuming effort that only allows for hypothesis driven subsampling of the full sequence to be analysed. Recent advances in deep sequencing together with highly efficient parallel oligonucleotide synthesis and cloning techniques have, however, opened up for entirely new ways to map genetic function in vivo. Here we present a novel, optimized protocol for the generation of universally applicable, barcode labelled, plasmid libraries. The libraries are designed to enable the production of viral vector preparations assessing coding or non-coding RNA function in vivo. When generating high diversity libraries, it is a challenge to achieve efficient cloning, unambiguous barcoding and detailed characterization using low-cost sequencing technologies. With the presented protocol, diversity of above 3 million uniquely barcoded adeno-associated viral (AAV) plasmids can be achieved in a single reaction through a process achievable in any molecular biology laboratory. This approach opens up for a multitude of in vivo assessments from the evaluation of enhancer and promoter regions to the optimization of genome editing. The generated plasmid libraries are also useful for validation of sequencing clustering algorithms and we here validate the newly presented message passing clustering process named Starcode. PMID:27874090
The Diversity Present in 5140 Human Mitochondrial Genomes
Pereira, Luísa; Freitas, Fernando; Fernandes, Verónica; Pereira, Joana B.; Costa, Marta D.; Costa, Stephanie; Máximo, Valdemar; Macaulay, Vincent; Rocha, Ricardo; Samuels, David C.
2009-01-01
We analyzed the current status (as of the end of August 2008) of human mitochondrial genomes deposited in GenBank, amounting to 5140 complete or coding-region sequences, in order to present an overall picture of the diversity present in the mitochondrial DNA of the global human population. To perform this task, we developed mtDNA-GeneSyn, a computer tool that identifies and exhaustedly classifies the diversity present in large genetic data sets. The diversity observed in the 5140 human mitochondrial genomes was compared with all possible transitions and transversions from the standard human mitochondrial reference genome. This comparison showed that tRNA and rRNA secondary structures have a large effect in limiting the diversity of the human mitochondrial sequences, whereas for the protein-coding genes there is a bias toward less variation at the second codon positions. The analysis of the observed amino acid variations showed a tolerance of variations that convert between the amino acids V, I, A, M, and T. This defines a group of amino acids with similar chemical properties that can interconvert by a single transition. PMID:19426953
Sanz, Yolanda
2017-01-01
Abstract The miniaturized and portable DNA sequencer MinION™ has demonstrated great potential in different analyses such as genome-wide sequencing, pathogen outbreak detection and surveillance, human genome variability, and microbial diversity. In this study, we tested the ability of the MinION™ platform to perform long amplicon sequencing in order to design new approaches to study microbial diversity using a multi-locus approach. After compiling a robust database by parsing and extracting the rrn bacterial region from more than 67000 complete or draft bacterial genomes, we demonstrated that the data obtained during sequencing of the long amplicon in the MinION™ device using R9 and R9.4 chemistries were sufficient to study 2 mock microbial communities in a multiplex manner and to almost completely reconstruct the microbial diversity contained in the HM782D and D6305 mock communities. Although nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, we presented a novel approach consisting of multi-locus and long amplicon sequencing using the MinION™ MkIb DNA sequencer and R9 and R9.4 chemistries that help to overcome the main disadvantage of this portable sequencing platform. Furthermore, the nanopore sequencing library, constructed with the last releases of pore chemistry (R9.4) and sequencing kit (SQK-LSK108), permitted the retrieval of the higher level of 1D read accuracy sufficient to characterize the microbial species present in each mock community analysed. Improvements in nanopore chemistry, such as minimizing base-calling errors and new library protocols able to produce rapid 1D libraries, will provide more reliable information in the near future. Such data will be useful for more comprehensive and faster specific detection of microbial species and strains in complex ecosystems. PMID:28605506
Viral quasispecies inference from 454 pyrosequencing
2013-01-01
Background Many potentially life-threatening infectious viruses are highly mutable in nature. Characterizing the fittest variants within a quasispecies from infected patients is expected to allow unprecedented opportunities to investigate the relationship between quasispecies diversity and disease epidemiology. The advent of next-generation sequencing technologies has allowed the study of virus diversity with high-throughput sequencing, although these methods come with higher rates of errors which can artificially increase diversity. Results Here we introduce a novel computational approach that incorporates base quality scores from next-generation sequencers for reconstructing viral genome sequences that simultaneously infers the number of variants within a quasispecies that are present. Comparisons on simulated and clinical data on dengue virus suggest that the novel approach provides a more accurate inference of the underlying number of variants within the quasispecies, which is vital for clinical efforts in mapping the within-host viral diversity. Sequence alignments generated by our approach are also found to exhibit lower rates of error. Conclusions The ability to infer the viral quasispecies colony that is present within a human host provides the potential for a more accurate classification of the viral phenotype. Understanding the genomics of viruses will be relevant not just to studying how to control or even eradicate these viral infectious diseases, but also in learning about the innate protection in the human host against the viruses. PMID:24308284
[Current applications of high-throughput DNA sequencing technology in antibody drug research].
Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong
2012-03-01
Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.
Vergani, Stefano; Korsunsky, Ilya; Mazzarello, Andrea Nicola; Ferrer, Gerardo; Chiorazzi, Nicholas; Bagnara, Davide
2017-01-01
Efficient and accurate high-throughput DNA sequencing of the adaptive immune receptor repertoire (AIRR) is necessary to study immune diversity in healthy subjects and disease-related conditions. The high complexity and diversity of the AIRR coupled with the limited amount of starting material, which can compromise identification of the full biological diversity makes such sequencing particularly challenging. AIRR sequencing protocols often fail to fully capture the sampled AIRR diversity, especially for samples containing restricted numbers of B lymphocytes. Here, we describe a library preparation method for immunoglobulin sequencing that results in an exhaustive full-length repertoire where virtually every sampled B-cell is sequenced. This maximizes the likelihood of identifying and quantifying the entire IGHV-D-J repertoire of a sample, including the detection of rearrangements present in only one cell in the starting population. The methodology establishes the importance of circumventing genetic material dilution in the preamplification phases and incorporates the use of certain described concepts: (1) balancing the starting material amount and depth of sequencing, (2) avoiding IGHV gene-specific amplification, and (3) using Unique Molecular Identifier. Together, this methodology is highly efficient, in particular for detecting rare rearrangements in the sampled population and when only a limited amount of starting material is available.
Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.
Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai
2018-01-09
Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of LDH as a therapeutic drug target.
The Hidden Diversity of Flagellated Protists in Soil.
Venter, Paul Christiaan; Nitsche, Frank; Arndt, Hartmut
2018-07-01
Protists are among the most diverse and abundant eukaryotes in soil. However, gaps between described and sequenced protist morphospecies still present a pending problem when surveying environmental samples for known species using molecular methods. The number of sequences in the molecular PR 2 database (∼130,000) is limited compared to the species richness expected (>1 million protist species) - limiting the recovery rate. This is important, since high throughput sequencing (HTS) methods are used to find associative patterns between functional traits, taxa and environmental parameters. We performed HTS to survey soil flagellates in 150 grasslands of central Europe, and tested the recovery rate of ten previously isolated and cultivated cercomonad species, among locally found diversity. We recovered sequences for reference soil flagellate species, but also a great number of their phylogenetically evaluated genetic variants, among rare and dominant taxa with presumably own biogeography. This was recorded among dominant (cercozoans, Sandona), rare (apusozoans) and a large hidden diversity of predominantly aquatic protists in soil (choanoflagellates, bicosoecids) often forming novel clades associated with uncultured environmental sequences. Evaluating the reads, instead of the OTUs that individual reads are usually clustered into, we discovered that much of this hidden diversity may be lost due to clustering. Copyright © 2018 Elsevier GmbH. All rights reserved.
Escalante, Adelfo; Rodríguez, María Elena; Martínez, Alfredo; López-Munguía, Agustín; Bolívar, Francisco; Gosset, Guillermo
2004-06-15
The bacterial diversity in pulque, a traditional Mexican alcoholic fermented beverage, was studied in 16S rDNA clone libraries from three pulque samples. Sequenced clones identified as Lactobacillus acidophilus, Lactobacillus strain ASF360, L. kefir, L. acetotolerans, L. hilgardii, L. plantarum, Leuconostoc pseudomesenteroides, Microbacterium arborescens, Flavobacterium johnsoniae, Acetobacter pomorium, Gluconobacter oxydans, and Hafnia alvei, were detected for the first time in pulque. Identity of 16S rDNA sequenced clones showed that bacterial diversity present among pulque samples is dominated by Lactobacillus species (80.97%). Seventy-eight clones exhibited less than 95% of relatedness to NCBI database sequences, which may indicate the presence of new species in pulque samples.
Sequence diversity and evolution of antimicrobial peptides in invertebrates.
Tassanakajon, Anchalee; Somboonwiwat, Kunlaya; Amparyup, Piti
2015-02-01
Antimicrobial peptides (AMPs) are evolutionarily ancient molecules that act as the key components in the invertebrate innate immunity against invading pathogens. Several AMPs have been identified and characterized in invertebrates, and found to display considerable diversity in their amino acid sequence, structure and biological activity. AMP genes appear to have rapidly evolved, which might have arisen from the co-evolutionary arms race between host and pathogens, and enabled organisms to survive in different microbial environments. Here, the sequence diversity of invertebrate AMPs (defensins, cecropins, crustins and anti-lipopolysaccharide factors) are presented to provide a better understanding of the evolution pattern of these peptides that play a major role in host defense mechanisms. Copyright © 2014 Elsevier Ltd. All rights reserved.
He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang
2015-01-01
Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.
Leon, Carla G; Moraga, Ruben; Valenzuela, Cristian; Gugliandolo, Concetta; Lo Giudice, Angelina; Papale, Maria; Vilo, Claudia; Dong, Qunfeng; Smith, Carlos T; Rossello-Mora, Ramon; Yañez, Jorge; Campos, Victor L
2018-01-01
Arsenic (As), a highly toxic metalloid, naturally present in Camarones River (Atacama Desert, Chile) is a great health concern for the local population and authorities. In this study, the taxonomic and functional characterization of bacterial communities associated to metal-rich sediments from three sites of the river (sites M1, M2 and M3), showing different arsenic concentrations, were evaluated using a combination of approaches. Diversity of bacterial communities was evaluated by Illumina sequencing. Strains resistant to arsenic concentrations varying from 0.5 to 100 mM arsenite or arsenate were isolated and the presence of genes coding for enzymes involved in arsenic oxidation (aio) or reduction (arsC) investigated. Bacterial communities showed a moderate diversity which increased as arsenic concentrations decreased along the river. Sequences of the dominant taxonomic groups (abundances ≥1%) present in all three sites were affiliated to Proteobacteria (range 40.3-47.2%), Firmicutes (8.4-24.8%), Acidobacteria (10.4-17.1%), Actinobacteria (5.4-8.1%), Chloroflexi (3.9-7.5%), Planctomycetes (1.2-5.3%), Gemmatimonadetes (1.2-1.5%), and Nitrospirae (1.1-1.2%). Bacterial communities from sites M2 and M3 showed no significant differences in diversity between each other (p = 0.9753) but they were significantly more diverse than M1 (p<0.001 and p<0.001, respectively). Sequences affiliated with Proteobacteria, Firmicutes, Acidobacteria, Chloroflexi and Actinobacteria at M1 accounted for more than 89% of the total classified bacterial sequences present but these phyla were present in lesser proportions in M2 and M3 sites. Strains isolated from the sediment of sample M1, having the greatest arsenic concentration (498 mg kg-1), showed the largest percentages of arsenic oxidation and reduction. Genes aio were more frequently detected in isolates from M1 (54%), whereas arsC genes were present in almost all isolates from all three sediments, suggesting that bacterial communities play an important role in the arsenic biogeochemical cycle and detoxification of arsenical compounds. Overall, results provide further knowledge on the microbial diversity of arsenic contaminated fresh-water sediments.
The diversity of Klebsiella pneumoniae surface polysaccharides.
Follador, Rainer; Heinz, Eva; Wyres, Kelly L; Ellington, Matthew J; Kowarik, Michael; Holt, Kathryn E; Thomson, Nicholas R
2016-08-01
Klebsiella pneumoniae is considered an urgent health concern due to the emergence of multi-drug-resistant strains for which vaccination offers a potential remedy. Vaccines based on surface polysaccharides are highly promising but need to address the high diversity of surface-exposed polysaccharides, synthesized as O-antigens (lipopolysaccharide, LPS) and K-antigens (capsule polysaccharide, CPS), present in K. pneumoniae . We present a comprehensive and clinically relevant study of the diversity of O- and K-antigen biosynthesis gene clusters across a global collection of over 500 K. pneumoniae whole-genome sequences and the seroepidemiology of human isolates from different infection types. Our study defines the genetic diversity of O- and K-antigen biosynthesis cluster sequences across this collection, identifying sequences for known serotypes as well as identifying novel LPS and CPS gene clusters found in circulating contemporary isolates. Serotypes O1, O2 and O3 were most prevalent in our sample set, accounting for approximately 80 % of all infections. In contrast, K serotypes showed an order of magnitude higher diversity and differ among infection types. In addition we investigated a potential association of O or K serotypes with phylogenetic lineage, infection type and the presence of known virulence genes. K1 and K2 serotypes, which are associated with hypervirulent K. pneumoniae , were associated with a higher abundance of virulence genes and more diverse O serotypes compared to other common K serotypes.
The diversity of Klebsiella pneumoniae surface polysaccharides
Heinz, Eva; Wyres, Kelly L.; Ellington, Matthew J.; Kowarik, Michael; Holt, Kathryn E.; Thomson, Nicholas R.
2016-01-01
Klebsiella pneumoniae is considered an urgent health concern due to the emergence of multi-drug-resistant strains for which vaccination offers a potential remedy. Vaccines based on surface polysaccharides are highly promising but need to address the high diversity of surface-exposed polysaccharides, synthesized as O-antigens (lipopolysaccharide, LPS) and K-antigens (capsule polysaccharide, CPS), present in K. pneumoniae. We present a comprehensive and clinically relevant study of the diversity of O- and K-antigen biosynthesis gene clusters across a global collection of over 500 K. pneumoniae whole-genome sequences and the seroepidemiology of human isolates from different infection types. Our study defines the genetic diversity of O- and K-antigen biosynthesis cluster sequences across this collection, identifying sequences for known serotypes as well as identifying novel LPS and CPS gene clusters found in circulating contemporary isolates. Serotypes O1, O2 and O3 were most prevalent in our sample set, accounting for approximately 80 % of all infections. In contrast, K serotypes showed an order of magnitude higher diversity and differ among infection types. In addition we investigated a potential association of O or K serotypes with phylogenetic lineage, infection type and the presence of known virulence genes. K1 and K2 serotypes, which are associated with hypervirulent K. pneumoniae, were associated with a higher abundance of virulence genes and more diverse O serotypes compared to other common K serotypes. PMID:28348868
Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo
2018-06-01
In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs) with 25 of them being represented by a single strain indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study signifies that the multilocus sequence typing is a valuable tool to access the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor the evolutionary changes.
Zeng, Y H; Chen, X H; Jiao, N Z
2007-12-01
To assess how completely the diversity of anoxygenic phototrophic bacteria (APB) was sampled in natural environments. All nucleotide sequences of the APB marker gene pufM from cultures and environmental clones were retrieved from the GenBank database. A set of cutoff values (sequence distances 0.06, 0.15 and 0.48 for species, genus, and (sub)phylum levels, respectively) was established using a distance-based grouping program. Analysis of the environmental clones revealed that current efforts on APB isolation and sampling in natural environments are largely inadequate. Analysis of the average distance between each identified genus and an uncultured environmental pufM sequence indicated that the majority of cultured APB genera lack environmental representatives. The distance-based grouping method is fast and efficient for bulk functional gene sequences analysis. The results clearly show that we are at a relatively early stage in sampling the global richness of APB species. Periodical assessment will undoubtedly facilitate in-depth analysis of potential biogeographical distribution pattern of APB. This is the first attempt to assess the present understanding of APB diversity in natural environments. The method used is also useful for assessing the diversity of other functional genes.
Characterization of ciliate diversity in bromeliad tank waters from the Brazilian Atlantic Forest.
Simão, Taiz L L; Borges, Adriana Giongo; Gano, Kelsey A; Davis-Richardson, Austin G; Brown, Christopher T; Fagen, Jennie R; Triplett, Eric W; Dias, Raquel; Mondin, Claudio A; da Silva, Renata M; Eizirik, Eduardo; Utz, Laura R P
2017-10-01
Bromeliads are a diverse group of plants that includes many species whose individuals are capable of retaining water, forming habitats called phytotelmata. These habitats harbor a diversity of organisms including prokaryotes, unicellular eukaryotes, metazoans, and fungi. Among single-celled eukaryotic organisms, ciliates are generally the most abundant. In the present study, we used Illumina DNA sequencing to survey the eukaryotic communities, especially ciliates, inhabiting the tanks of the bromeliads Aechmea gamosepala and Vriesea platynema in the Atlantic Forest of southern Brazil. Filtered sequences were clustered into distinct OTUs using a 99% identity threshold, and then assigned to phylum and genus using a BLAST-based approach (implemented in QIIME) and the SILVA reference database. Both bromeliad species harbored very diverse eukaryotic communities, with Arthropoda and Ciliophora showing the highest abundance (as estimated by the number of sequence reads). The ciliate genus Tetrahymena was the most abundant among single-celled organisms, followed by apicomplexan gregarines and the ciliate genus Glaucoma. Another interesting finding was the presence and high abundance of Trypanosoma in these bromeliad tanks, demonstrating their occurrence in this type of environment. The results presented here demonstrate a hidden diversity of eukaryotes in bromeliad tank waters, opening up new avenues for their in-depth characterization. Copyright © 2017 The Authors. Published by Elsevier GmbH.. All rights reserved.
Diversity of phytases in the rumen.
Nakashima, Brenda A; McAllister, Tim A; Sharma, Ranjana; Selinger, L Brent
2007-01-01
Examples of a new class of phytase related to protein tyrosine phosphatases (PTP) were recently isolated from several anaerobic bacteria from the rumen of cattle. In this study, the diversity of PTP-like phytase gene sequences in the rumen was surveyed by using the polymerase chain reaction (PCR). Two sets of degenerate primers were used to amplify sequences from rumen fluid total community DNA and genomic DNA from nine bacterial isolates. Four novel PTP-like phytase sequences were retrieved from rumen fluid, whereas all nine of the anaerobic bacterial isolates investigated in this work contained PTP-like phytase sequences. One isolate, Selenomonas lacticifex, contained two distinct PTP-like phytase sequences, suggesting that multiple phytate hydrolyzing enzymes are present in this bacterium. The degenerate primer and PCR conditions described here, as well as novel sequences obtained in this study, will provide a valuable resource for future studies on this new class of phytase. The observed diversity of microbial phytases in the rumen may account for the ability of ruminants to derive a significant proportion of their phosphorus requirements from phytate.
Phylogenetic diversity in the genus Bacillus as seen by 16S rRNA sequencing studies
NASA Technical Reports Server (NTRS)
Rossler, D.; Ludwig, W.; Schleifer, K. H.; Lin, C.; McGill, T. J.; Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.
1991-01-01
Comparative sequence analysis of 16S ribosomal (r)RNAs or DNAs of Bacillus alvei, B. laterosporus, B. macerans, B. macquariensis, B. polymyxa and B. stearothermophilus revealed the phylogenetic diversity of the genus Bacillus. Based on the presently available data set of 16S rRNA sequences from bacilli and relatives at least four major "Bacillus clusters" can be defined: a "Bacillus subtilis cluster" including B. stearothermophilus, a "B. brevis cluster" including B. laterosporus, a "B. alvei cluster" including B. macerans, B. maquariensis and B. polymyxa and a "B. cycloheptanicus branch".
Whole genome sequencing data and de novo draft assemblies for 66 teleost species
Malmstrøm, Martin; Matschiner, Michael; Tørresen, Ole K.; Jakobsen, Kjetill S.; Jentoft, Sissel
2017-01-01
Teleost fishes comprise more than half of all vertebrate species, yet genomic data are only available for 0.2% of their diversity. Here, we present whole genome sequencing data for 66 new species of teleosts, vastly expanding the availability of genomic data for this important vertebrate group. We report on de novo assemblies based on low-coverage (9–39×) sequencing and present detailed methodology for all analyses. To facilitate further utilization of this data set, we present statistical analyses of the gene space completeness and verify the expected phylogenetic position of the sequenced genomes in a large mitogenomic context. We further present a nuclear marker set used for phylogenetic inference and evaluate each gene tree in relation to the species tree to test for homogeneity in the phylogenetic signal. Collectively, these analyses illustrate the robustness of this highly diverse data set and enable extensive reuse of the selected phylogenetic markers and the genomic data in general. This data set covers all major teleost lineages and provides unprecedented opportunities for comparative studies of teleosts. PMID:28094797
A Public Database of Memory and Naive B-Cell Receptor Sequences.
DeWitt, William S; Lindau, Paul; Snyder, Thomas M; Sherwood, Anna M; Vignali, Marissa; Carlson, Christopher S; Greenberg, Philip D; Duerkopp, Natalie; Emerson, Ryan O; Robins, Harlan S
2016-01-01
The vast diversity of B-cell receptors (BCR) and secreted antibodies enables the recognition of, and response to, a wide range of epitopes, but this diversity has also limited our understanding of humoral immunity. We present a public database of more than 37 million unique BCR sequences from three healthy adult donors that is many fold deeper than any existing resource, together with a set of online tools designed to facilitate the visualization and analysis of the annotated data. We estimate the clonal diversity of the naive and memory B-cell repertoires of healthy individuals, and provide a set of examples that illustrate the utility of the database, including several views of the basic properties of immunoglobulin heavy chain sequences, such as rearrangement length, subunit usage, and somatic hypermutation positions and dynamics.
Zhou, Bin; Lin, Xudong; Wang, Wei; Halpin, Rebecca A.; Bera, Jayati; Stockwell, Timothy B.; Barr, Ian G.
2014-01-01
Although human influenza B virus (IBV) is a significant human pathogen, its great genetic diversity has limited our ability to universally amplify the entire genome for subsequent sequencing or vaccine production. The generation of sequence data via next-generation approaches and the rapid cloning of viral genes are critical for basic research, diagnostics, antiviral drugs, and vaccines to combat IBV. To overcome the difficulty of amplifying the diverse and ever-changing IBV genome, we developed and optimized techniques that amplify the complete segmented negative-sense RNA genome from any IBV strain in a single tube/well (IBV genomic amplification [IBV-GA]). Amplicons for >1,000 diverse IBV genomes from different sample types (e.g., clinical specimens) were generated and sequenced using this robust technology. These approaches are sensitive, robust, and sequence independent (i.e., universally amplify past, present, and future IBVs), which facilitates next-generation sequencing and advanced genomic diagnostics. Importantly, special terminal sequences engineered into the optimized IBV-GA2 products also enable ligation-free cloning to rapidly generate reverse-genetics plasmids, which can be used for the rescue of recombinant viruses and/or the creation of vaccine seed stock. PMID:24501036
Bacterial Diversity in Microbial Mats and Sediments from the Atacama Desert.
Rasuk, Maria Cecilia; Fernández, Ana Beatriz; Kurth, Daniel; Contreras, Manuel; Novoa, Fernando; Poiré, Daniel; Farías, María Eugenia
2016-01-01
The Atacama Desert has extreme environmental conditions that allow the development of unique microbial communities. The present paper reports the bacterial diversity of microbial mats and sediments and its mineralogical components. Some physicochemical conditions of the water surrounding these ecosystems have also been studied trying to determine their influence on the diversity of these communities. In that way, mats and sediments distributed among different hypersaline lakes located in salt flats of the Atacama Desert were subjected to massive parallel sequencing of the V4 region of the 16S rRNA genes of Bacteria. A higher diversity in sediment than in mat samples have been found. Lakes that harbor microbial mats have higher salinity than lakes where mats are absent. Proteobacteria and/or Bacteroidetes are the major phyla represented in all samples. An interesting item is the finding of a low proportion or absence of Cyanobacteria sequences in the ecosystems studied, suggesting the possibility that other groups may be playing an essential role as primary producers in these extreme environments. Additionally, the large proportion of 16S rRNA gene sequences that could not be classified at the level of phylum indicates potential new phyla present in these ecosystems.
Lefèvre, Emilie; Bardot, Corinne; Noël, Christophe; Carrias, Jean-François; Viscogliosi, Eric; Amblard, Christian; Sime-Ngando, Télesphore
2007-01-01
This study presents an original 18S rRNA PCR survey of the freshwater picoeukaryote community, and was designed to detect unidentified heterotrophic picoflagellates (size range 0.6-5 microm) which are prevalent throughout the year within the heterotrophic flagellate assemblage in Lake Pavin. Four clone libraries were constructed from samples collected in two contrasting zones in the lake. Computerized statistic tools have suggested that sequence retrieval was representative of the in situ picoplankton diversity. The two sampling zones exhibited similar diversity patterns but shared only about 5% of the operational taxonomic units (OTUs). Phylogenetic analysis clustered our sequences into three taxonomic groups: Alveolates (30% of OTUs), Fungi (23%) and Cercozoa (19%). Fungi thus substantially contributed to the detected diversity, as was additionally supported by direct microscopic observations of fungal zoospores and sporangia. A large fraction of the sequences belonged to parasites, including Alveolate sequences affiliated to the genus Perkinsus known as zooparasites, and chytrids that include host-specific parasitic fungi of various freshwater phytoplankton species, primarily diatoms. Phylogenetic analysis revealed five novel clades that probably include typical freshwater environmental sequences. Overall, from the unsuspected fungal diversity unveiled, we think that fungal zooflagellates have been misidentified as phagotrophic nanoflagellates in previous studies. This is in agreement with a recent experimental demonstration that zoospore-producing fungi and parasitic activity may play an important role in aquatic food webs.
Insertion sequence diversity in archaea.
Filée, J; Siguier, P; Chandler, M
2007-03-01
Insertion sequences (ISs) can constitute an important component of prokaryotic (bacterial and archaeal) genomes. Over 1,500 individual ISs are included at present in the ISfinder database (www-is.biotoul.fr), and these represent only a small portion of those in the available prokaryotic genome sequences and those that are being discovered in ongoing sequencing projects. In spite of this diversity, the transposition mechanisms of only a few of these ubiquitous mobile genetic elements are known, and these are all restricted to those present in bacteria. This review presents an overview of ISs within the archaeal kingdom. We first provide a general historical summary of the known properties and behaviors of archaeal ISs. We then consider how transposition might be regulated in some cases by small antisense RNAs and by termination codon readthrough. This is followed by an extensive analysis of the IS content in the sequenced archaeal genomes present in the public databases as of June 2006, which provides an overview of their distribution among the major archaeal classes and species. We show that the diversity of archaeal ISs is very great and comparable to that of bacteria. We compare archaeal ISs to known bacterial ISs and find that most are clearly members of families first described for bacteria. Several cases of lateral gene transfer between bacteria and archaea are clearly documented, notably for methanogenic archaea. However, several archaeal ISs do not have bacterial equivalents but can be grouped into Archaea-specific groups or families. In addition to ISs, we identify and list nonautonomous IS-derived elements, such as miniature inverted-repeat transposable elements. Finally, we present a possible scenario for the evolutionary history of ISs in the Archaea.
Nemati, Sara; Fazaeli, Asghar; Hajjaran, Homa; Khamesipour, Ali; Anbaran, Mohsen Falahati; Bozorgomid, Arezoo; Zarei, Fatah
2017-08-01
Despite the broad distribution of leishmaniasis among Iranians and animals across the country, little is known about the genetic characteristics of the causative agents. Applying both HSP70 PCR-RFLP and sequence analyses, this study aimed to evaluate the genetic diversity and phylogenetic relationships among Leishmania spp. isolated from Iranian endemic foci and available reference strains. A total of 36 Leishmania isolates from almost all districts across the country were genetically analyzed for the HSP70 gene using both PCR-RFLP and sequence analysis. The original HSP70 gene sequences were aligned along with homologous Leishmania sequences retrieved from NCBI, and subjected to the phylogenetic analysis. Basic parameters of genetic diversity were also estimated. The HSP70 PCR-RFLP presented 3 different electrophoretic patterns, with no further intraspecific variation, corresponding to 3 Leishmania species available in the country, L. tropica, L. major, and L. infantum. Phylogenetic analyses presented 5 major clades, corresponding to 5 species complexes. Iranian lineages, including L. major, L. tropica, and L. infantum, were distributed among 3 complexes L. major, L. tropica, and L. donovani. However, within the L. major and L. donovani species complexes, the HSP70 phylogeny was not able to distinguish clearly between the L. major and L. turanica isolates, and between the L. infantum, L. donovani, and L. chagasi isolates, respectively. Our results indicated that both HSP70 PCR-RFLP and sequence analyses are medically applicable tools for identification of Leishmania species in Iranian patients. However, the reduced genetic diversity of the target gene makes it inevitable that its phylogeny only resolves the major groups, namely, the species complexes.
Jaratlerdsiri, Weerachai; Isberg, Sally R.; Higgins, Damien P.; Miles, Lee G.; Gongora, Jaime
2014-01-01
Major Histocompatibility Complex (MHC) class II genes encode for molecules that aid in the presentation of antigens to helper T cells. MHC characterisation within and between major vertebrate taxa has shed light on the evolutionary mechanisms shaping the diversity within this genomic region, though little characterisation has been performed within the Order Crocodylia. Here we investigate the extent and effect of selective pressures and trans-species polymorphism on MHC class II α and β evolution among 20 extant species of Crocodylia. Selection detection analyses showed that diversifying selection influenced MHC class II β diversity, whilst diversity within MHC class II α is the result of strong purifying selection. Comparison of translated sequences between species revealed the presence of twelve trans-species polymorphisms, some of which appear to be specific to the genera Crocodylus and Caiman. Phylogenetic reconstruction clustered MHC class II α sequences into two major clades representing the families Crocodilidae and Alligatoridae. However, no further subdivision within these clades was evident and, based on the observation that most MHC class II α sequences shared the same trans-species polymorphisms, it is possible that they correspond to the same gene lineage across species. In contrast, phylogenetic analyses of MHC class II β sequences showed a mixture of subclades containing sequences from Crocodilidae and/or Alligatoridae, illustrating orthologous relationships among those genes. Interestingly, two of the subclades containing sequences from both Crocodilidae and Alligatoridae shared specific trans-species polymorphisms, suggesting that they may belong to ancient lineages pre-dating the divergence of these two families from the common ancestor 85–90 million years ago. The results presented herein provide an immunogenetic resource that may be used to further assess MHC diversity and functionality in Crocodylia. PMID:24503938
USDA-ARS?s Scientific Manuscript database
Cowpea (Vigna unguiculata) is an important legume crop with diverse uses. The species is presently a minor crop, and evaluation of its genetic diversity has been very limited. In this study, a total of 200 genic and 100 genomic simple sequence repeat (SSR) markers were developed from cowpea unigene ...
Diverse molecular signatures for ribosomally ‘active’ Perkinsea in marine sediments
2014-01-01
Background Perkinsea are a parasitic lineage within the eukaryotic superphylum Alveolata. Recent studies making use of environmental small sub-unit ribosomal RNA gene (SSU rDNA) sequencing methodologies have detected a significant diversity and abundance of Perkinsea-like phylotypes in freshwater environments. In contrast only a few Perkinsea environmental sequences have been retrieved from marine samples and only two groups of Perkinsea have been cultured and morphologically described and these are parasites of marine molluscs or marine protists. These two marine groups form separate and distantly related phylogenetic clusters, composed of closely related lineages on SSU rDNA trees. Here, we test the hypothesis that Perkinsea are a hitherto under-sampled group in marine environments. Using 454 diversity ‘tag’ sequencing we investigate the diversity and distribution of these protists in marine sediments and water column samples taken from the Deep Chlorophyll Maximum (DCM) and sub-surface using both DNA and RNA as the source template and sampling four European offshore locations. Results We detected the presence of 265 sequences branching with known Perkinsea, the majority of them recovered from marine sediments. Moreover, 27% of these sequences were sampled from RNA derived cDNA libraries. Phylogenetic analyses classify a large proportion of these sequences into 38 cluster groups (including 30 novel marine cluster groups), which share less than 97% sequence similarity suggesting this diversity encompasses a range of biologically and ecologically distinct organisms. Conclusions These results demonstrate that the Perkinsea lineage is considerably more diverse than previously detected in marine environments. This wide diversity of Perkinsea-like protists is largely retrieved in marine sediment with a significant proportion detected in RNA derived libraries suggesting this diversity represents ribosomally ‘active’ and intact cells. Given the phylogenetic range of hosts infected by known Perkinsea parasites, these data suggest that Perkinsea either play a significant but hitherto unrecognized role as parasites in marine sediments and/or members of this group are present in the marine sediment possibly as part of the ‘seed bank’ microbial community. PMID:24779375
Genetic diversity of merozoite surface antigens in Babesia bovis detected from Sri Lankan cattle.
Sivakumar, Thillaiampalam; Okubo, Kazuhiro; Igarashi, Ikuo; de Silva, Weligodage Kumarawansa; Kothalawala, Hemal; Silva, Seekkuge Susil Priyantha; Vimalakumar, Singarayar Caniciyas; Meewewa, Asela Sanjeewa; Yokoyama, Naoaki
2013-10-01
Babesia bovis, the causative agent of severe bovine babesiosis, is endemic in Sri Lanka. The live attenuated vaccine (K-strain), which was introduced in the early 1990s, has been used to immunize cattle populations in endemic areas of the country. The present study was undertaken to determine the genetic diversity of merozoite surface antigens (MSAs) in B. bovis isolates from Sri Lankan cattle, and to compare the gene sequences obtained from such isolates against those of the K-strain. Forty-four bovine blood samples isolated from different geographical regions of Sri Lanka and judged to be B. bovis-positive by PCR screening were used to amplify MSAs (MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b), AMA-1, and 12D3 genes from parasite DNA. Although the AMA-1 and 12D3 gene sequences were highly conserved among the Sri Lankan isolates, the MSA gene sequences from the same isolates were highly diverse. Sri Lankan MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b sequences clustered within 5, 2, 4, 1, and 9 different clades in the gene phylograms, respectively, while the minimum similarity values among the deduced amino acid sequences of these genes were 36.8%, 68.7%, 80.3%, 100%, and 68.3%, respectively. In the phylograms, none of the Sri Lankan sequences fell within clades containing the respective K-strain sequences. Additionally, the similarity values for MSA-1 and MSA-2c were 40-61.8% and 90.9-93.2% between the Sri Lankan isolates and the K-strain, respectively, while the K-strain MSA-2a/b sequence shared 64.5-69.8%, 69.3%, and 70.5-80.3% similarities with the Sri Lankan MSA-2a1, MSA-2a2, and MSA-2b sequences, respectively. The present study has shown that genetic diversity among MSAs of Sri Lankan B. bovis isolates is very high, and that the sequences of field isolates diverged genetically from the K-strain. Copyright © 2013 Elsevier B.V. All rights reserved.
Early history of European domestic cattle as revealed by ancient DNA.
Bollongino, R; Edwards, C J; Alt, K W; Burger, J; Bradley, D G
2006-03-22
We present an extensive ancient DNA analysis of mainly Neolithic cattle bones sampled from archaeological sites along the route of Neolithic expansion, from Turkey to North-Central Europe and Britain. We place this first reasonable population sample of Neolithic cattle mitochondrial DNA sequence diversity in context to illustrate the continuity of haplotype variation patterns from the first European domestic cattle to the present. Interestingly, the dominant Central European pattern, a starburst phylogeny around the modal sequence, T3, has a Neolithic origin, and the reduced diversity within this cluster in the ancient samples accords with their shorter history of post-domestic accumulation of mutation.
Ning, Yi; Li, Yan-Ling; Zhou, Guo-Ying; Yang, Lu-Cun; Xu, Wen-Hua
2016-04-01
High throughput sequencing technology is also called Next Generation Sequencing (NGS), which can sequence hundreds and thousands sequences in different samples at the same time. In the present study, the culture-independent high throughput sequencing technology was applied to sequence the fungi metagenomic DNA of the fungal internal transcribed spacer 1(ITS 1) in the root of Sinopodophyllum hexandrum. Sequencing data suggested that after the quality control, 22 565 reads were remained. Cluster similarity analysis was done based on 97% sequence similarity, which obtained 517 OTUs for the three samples (LD1, LD2 and LD3). All the fungi which identified from all the reads of OTUs based on 0.8 classification thresholds using the software of RDP classifier were classified as 13 classes, 35 orders, 44 family, 55 genera. Among these genera, the genus of Tetracladium was the dominant genera in all samples(35.49%, 68.55% and 12.96%).The Shannon's diversity indices and the Simpson indices of the endophytic fungi in the samples ranged from 1.75-2.92, 0.11-0.32, respectively.This is the first time for applying high through put sequencing technol-ogyto analyze the community composition and diversity of endophytic fungi in the medicinal plant, and the results showed that there were hyper diver sity and high community composition complexity of endophytic fungi in the root of S. hexandrum. It is also proved that the high through put sequencing technology has great advantage for analyzing ecommunity composition and diversity of endophtye in the plant. Copyright© by the Chinese Pharmaceutical Association.
The"minimum information about an environmental sequence" (MIENS) specification
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yilmaz, P.; Kottmann, R.; Field, D.
We present the Genomic Standards Consortium's (GSC) 'Minimum Information about an ENvironmental Sequence' (MIENS) standard for describing marker genes. Adoption of MIENS will enhance our ability to analyze natural genetic diversity across the Tree of Life as it is currently being documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.
Mosaic HIV-1 vaccines expand the breadth and depth of cellular immune responses in rhesus monkeys.
Barouch, Dan H; O'Brien, Kara L; Simmons, Nathaniel L; King, Sharon L; Abbink, Peter; Maxfield, Lori F; Sun, Ying-Hua; La Porte, Annalena; Riggs, Ambryice M; Lynch, Diana M; Clark, Sarah L; Backus, Katherine; Perry, James R; Seaman, Michael S; Carville, Angela; Mansfield, Keith G; Szinger, James J; Fischer, Will; Muldoon, Mark; Korber, Bette
2010-03-01
The worldwide diversity of HIV-1 presents an unprecedented challenge for vaccine development. Antigens derived from natural HIV-1 sequences have elicited only a limited breadth of cellular immune responses in nonhuman primate studies and clinical trials to date. Polyvalent 'mosaic' antigens, in contrast, are designed to optimize cellular immunologic coverage of global HIV-1 sequence diversity. Here we show that mosaic HIV-1 Gag, Pol and Env antigens expressed by recombinant, replication-incompetent adenovirus serotype 26 vectors markedly augmented both the breadth and depth without compromising the magnitude of antigen-specific T lymphocyte responses as compared with consensus or natural sequence HIV-1 antigens in rhesus monkeys. Polyvalent mosaic antigens therefore represent a promising strategy to expand cellular immunologic vaccine coverage for genetically diverse pathogens such as HIV-1.
Graças, Diego A; Miranda, Paulo R; Baraúna, Rafael A; McCulloch, John A; Ghilardi, Rubens; Schneider, Maria Paula C; Silva, Artur
2011-11-01
Microbial diversity was evaluated in an anoxic zone of Tucuruí Hydroelectric Power Station reservoir in Brazilian Amazonia using a culture-independent approach by amplifying and sequencing fragments of the 16S rRNA gene using metagenomic DNA as a template. Samples obtained from the photic, aphotic (40 m) and sediment (60 m) layers were used to construct six 16S rDNA libraries containing a total of 1,152 clones. The sediment, aphotic and photic layers presented 64, 33 and 35 unique archaeal operational taxonomic units (OTUs). The estimated richness of these layers was evaluated to be 153, 106 and 79 archaeal OTUs, respectively, using the abundance-based coverage estimator (ACE) and 114, 83 and 77 OTUs using the Chao1 estimator. For bacterial sequences, 114, 69 and 57 OTUs were found in the sediment, aphotic and photic layers, which presented estimated richnesses of 1,414, 522 and 197 OTUs (ACE) and 1,059, 1,014 and 148 OTUs (Chao1), respectively. Phylogenetic analyses of the sequences obtained revealed a high richness of microorganisms which participate in the carbon cycle, namely, methanogenic archaea and methanotrophic proteobacteria. Most sequences obtained belong to non-culturable prokaryotes. The present study offers the first glimpse of the huge microbial diversity of an anoxic area of a man-made lacustrine environment in the tropics.
Ramos, Vitor M C; Castelo-Branco, Raquel; Leão, Pedro N; Martins, Joana; Carvalhal-Gomes, Sinda; Sobrinho da Silva, Frederico; Mendonça Filho, João G; Vasconcelos, Vitor M
2017-01-01
Microbial mats are complex, micro-scale ecosystems that can be found in a wide range of environments. In the top layer of photosynthetic mats from hypersaline environments, a large diversity of cyanobacteria typically predominates. With the aim of strengthening the knowledge on the cyanobacterial diversity present in the coastal lagoon system of Araruama (state of Rio de Janeiro, Brazil), we have characterized three mat samples by means of a polyphasic approach. We have used morphological and molecular data obtained by culture-dependent and -independent methods. Moreover, we have compared different classification methodologies and discussed the outcomes, challenges, and pitfalls of these methods. Overall, we show that Araruama's lagoons harbor a high cyanobacterial diversity. Thirty-six unique morphospecies could be differentiated, which increases by more than 15% the number of morphospecies and genera already reported for the entire Araruama system. Morphology-based data were compared with the 16S rRNA gene phylogeny derived from isolate sequences and environmental sequences obtained by PCR-DGGE and pyrosequencing. Most of the 48 phylotypes could be associated with the observed morphospecies at the order level. More than one third of the sequences demonstrated to be closely affiliated (best BLAST hit results of ≥99%) with cyanobacteria from ecologically similar habitats. Some sequences had no close relatives in the public databases, including one from an isolate, being placed as "loner" sequences within different orders. This hints at hidden cyanobacterial diversity in the mats of the Araruama system, while reinforcing the relevance of using complementary approaches to study cyanobacterial diversity.
Ramos, Vitor M. C.; Castelo-Branco, Raquel; Leão, Pedro N.; Martins, Joana; Carvalhal-Gomes, Sinda; Sobrinho da Silva, Frederico; Mendonça Filho, João G.; Vasconcelos, Vitor M.
2017-01-01
Microbial mats are complex, micro-scale ecosystems that can be found in a wide range of environments. In the top layer of photosynthetic mats from hypersaline environments, a large diversity of cyanobacteria typically predominates. With the aim of strengthening the knowledge on the cyanobacterial diversity present in the coastal lagoon system of Araruama (state of Rio de Janeiro, Brazil), we have characterized three mat samples by means of a polyphasic approach. We have used morphological and molecular data obtained by culture-dependent and -independent methods. Moreover, we have compared different classification methodologies and discussed the outcomes, challenges, and pitfalls of these methods. Overall, we show that Araruama's lagoons harbor a high cyanobacterial diversity. Thirty-six unique morphospecies could be differentiated, which increases by more than 15% the number of morphospecies and genera already reported for the entire Araruama system. Morphology-based data were compared with the 16S rRNA gene phylogeny derived from isolate sequences and environmental sequences obtained by PCR-DGGE and pyrosequencing. Most of the 48 phylotypes could be associated with the observed morphospecies at the order level. More than one third of the sequences demonstrated to be closely affiliated (best BLAST hit results of ≥99%) with cyanobacteria from ecologically similar habitats. Some sequences had no close relatives in the public databases, including one from an isolate, being placed as “loner” sequences within different orders. This hints at hidden cyanobacterial diversity in the mats of the Araruama system, while reinforcing the relevance of using complementary approaches to study cyanobacterial diversity. PMID:28713360
Xia, Xia-Yu; Ge, Meng; Hsi, Jenny H; He, Xiang; Ruan, Yu-Hua; Wang, Zhi-Xin; Shao, Yi-Ming; Pan, Xian-Ming
2014-01-01
Accurate estimates of HIV-1 incidence are essential for monitoring epidemic trends and evaluating intervention efforts. However, the long asymptomatic stage of HIV-1 infection makes it difficult to effectively distinguish incident infections from chronic ones. Current incidence assays based on serology or viral sequence diversity are both still lacking in accuracy. In the present work, a sequence clustering based diversity (SCBD) assay was devised by utilizing the fact that viral sequences derived from each transmitted/founder (T/F) strain tend to cluster together at early stage, and that only the intra-cluster diversity is correlated with the time since HIV-1 infection. The dot-matrix pairwise alignment was used to eliminate the disproportional impact of insertion/deletions (indels) and recombination events, and so was the proportion of clusterable sequences (Pc) as an index to identify late chronic infections with declined viral genetic diversity. Tested on a dataset containing 398 incident and 163 chronic infection cases collected from the Los Alamos HIV database (last modified 2/8/2012), our SCBD method achieved 99.5% sensitivity and 98.8% specificity, with an overall accuracy of 99.3%. Further analysis and evaluation also suggested its performance was not affected by host factors such as the viral subtypes and transmission routes. The SCBD method demonstrated the potential of sequencing based techniques to become useful for identifying incident infections. Its use may be most advantageous for settings with low to moderate incidence relative to available resources. The online service is available at http://www.bioinfo.tsinghua.edu.cn:8080/SCBD/index.jsp.
Ayyagari, Vijaya Sai; Sreerama, Krupanidhi
2017-08-01
Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.
Novel viruses in salivary glands of mosquitoes from sylvatic Cerrado, Midwestern Brazil
de Lara Pinto, Andressa Zelenski; Santos de Carvalho, Michellen; de Melo, Fernando Lucas; Ribeiro, Ana Lúcia Maria; Morais Ribeiro, Bergmann
2017-01-01
Viruses may represent the most diverse microorganisms on Earth. Novel viruses and variants continue to emerge. Mosquitoes are the most dangerous animals to humankind. This study aimed at identifying viral RNA diversity in salivary glands of mosquitoes captured in a sylvatic area of Cerrado at the Chapada dos Guimarães National Park, Mato Grosso, Brazil. In total, 66 Culicinae mosquitoes belonging to 16 species comprised 9 pools, subjected to viral RNA extraction, double-strand cDNA synthesis, random amplification and high-throughput sequencing, revealing the presence of seven insect-specific viruses, six of which represent new species of Rhabdoviridae (Lobeira virus), Chuviridae (Cumbaru and Croada viruses), Totiviridae (Murici virus) and Partitiviridae (Araticum and Angico viruses). In addition, two mosquito pools presented Kaiowa virus sequences that had already been reported in South Pantanal, Brazil. These findings amplify the understanding of viral diversity in wild-type Culicinae. Insect-specific viruses may present a broader diversity than previously imagined and future studies may address their possible role in mosquito vector competence. PMID:29117239
Brown, Steven D.; Podar, Mircea; Klingeman, Dawn M.; Johnson, Courtney M.; Yang, Zamin K.; Utturkar, Sagar M.; Land, Miriam L.; Mosher, Jennifer J.; Hurt, Richard A.; Phelps, Tommy J.; Palumbo, Anthony V.; Arkin, Adam P.; Hazen, Terry C.
2012-01-01
Pelosinus fermentans 16S rRNA gene sequences have been reported from diverse geographical sites since the recent isolation of the type strain. We present the genome sequence of the P. fermentans type strain R7 (DSM 17108) and genome sequences for two new strains with different abilities to reduce iron, chromate, and uranium. PMID:22933770
Leon, Carla G.; Moraga, Ruben; Valenzuela, Cristian; Gugliandolo, Concetta; Lo Giudice, Angelina; Papale, Maria; Vilo, Claudia; Dong, Qunfeng; Smith, Carlos T.; Rossello-Mora, Ramon; Yañez, Jorge
2018-01-01
Arsenic (As), a highly toxic metalloid, naturally present in Camarones River (Atacama Desert, Chile) is a great health concern for the local population and authorities. In this study, the taxonomic and functional characterization of bacterial communities associated to metal-rich sediments from three sites of the river (sites M1, M2 and M3), showing different arsenic concentrations, were evaluated using a combination of approaches. Diversity of bacterial communities was evaluated by Illumina sequencing. Strains resistant to arsenic concentrations varying from 0.5 to 100 mM arsenite or arsenate were isolated and the presence of genes coding for enzymes involved in arsenic oxidation (aio) or reduction (arsC) investigated. Bacterial communities showed a moderate diversity which increased as arsenic concentrations decreased along the river. Sequences of the dominant taxonomic groups (abundances ≥1%) present in all three sites were affiliated to Proteobacteria (range 40.3–47.2%), Firmicutes (8.4–24.8%), Acidobacteria (10.4–17.1%), Actinobacteria (5.4–8.1%), Chloroflexi (3.9–7.5%), Planctomycetes (1.2–5.3%), Gemmatimonadetes (1.2–1.5%), and Nitrospirae (1.1–1.2%). Bacterial communities from sites M2 and M3 showed no significant differences in diversity between each other (p = 0.9753) but they were significantly more diverse than M1 (p<0.001 and p<0.001, respectively). Sequences affiliated with Proteobacteria, Firmicutes, Acidobacteria, Chloroflexi and Actinobacteria at M1 accounted for more than 89% of the total classified bacterial sequences present but these phyla were present in lesser proportions in M2 and M3 sites. Strains isolated from the sediment of sample M1, having the greatest arsenic concentration (498 mg kg-1), showed the largest percentages of arsenic oxidation and reduction. Genes aio were more frequently detected in isolates from M1 (54%), whereas arsC genes were present in almost all isolates from all three sediments, suggesting that bacterial communities play an important role in the arsenic biogeochemical cycle and detoxification of arsenical compounds. Overall, results provide further knowledge on the microbial diversity of arsenic contaminated fresh-water sediments. PMID:29715297
A Pan-HIV Strategy for Complete Genome Sequencing
Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W.; Brennan, Catherine A.
2015-01-01
Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5′ end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702
Algorithms for optimizing cross-overs in DNA shuffling.
He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris
2012-03-21
DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.
Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred
2018-01-01
The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.
Gene sequences present in Citrullus sp. having been lost during domestication of watermelon
USDA-ARS?s Scientific Manuscript database
A wide genetic diversity exists among Citrullus species, while watermelon cultivars (Citrullus lanatus var. lanatus) share a narrow genetic base as a result of many years of domestication and selection for desirable fruit qualities. The recent international watermelon genome sequencing project reve...
Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.
Müller, Romy; Roberts, Charlotte A; Brown, Terence A
2014-04-22
The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.
Barbi, Florian; Bragalini, Claudia; Vallon, Laurent; Prudent, Elsa; Dubost, Audrey; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia
2014-01-01
Plant biomass degradation in soil is one of the key steps of carbon cycling in terrestrial ecosystems. Fungal saprotrophic communities play an essential role in this process by producing hydrolytic enzymes active on the main components of plant organic matter. Open questions in this field regard the diversity of the species involved, the major biochemical pathways implicated and how these are affected by external factors such as litter quality or climate changes. This can be tackled by environmental genomic approaches involving the systematic sequencing of key enzyme-coding gene families using soil-extracted RNA as material. Such an approach necessitates the design and evaluation of gene family-specific PCR primers producing sequence fragments compatible with high-throughput sequencing approaches. In the present study, we developed and evaluated PCR primers for the specific amplification of fungal CAZy Glycoside Hydrolase gene families GH5 (subfamily 5) and GH11 encoding endo-β-1,4-glucanases and endo-β-1,4-xylanases respectively as well as Basidiomycota class II peroxidases, corresponding to the CAZy Auxiliary Activity family 2 (AA2), active on lignin. These primers were experimentally validated using DNA extracted from a wide range of Ascomycota and Basidiomycota species including 27 with sequenced genomes. Along with the published primers for Glycoside Hydrolase GH7 encoding enzymes active on cellulose, the newly design primers were shown to be compatible with the Illumina MiSeq sequencing technology. Sequences obtained from RNA extracted from beech or spruce forest soils showed a high diversity and were uniformly distributed in gene trees featuring the global diversity of these gene families. This high-throughput sequencing approach using several degenerate primers constitutes a robust method, which allows the simultaneous characterization of the diversity of different fungal transcripts involved in plant organic matter degradation and may lead to the discovery of complex patterns in gene expression of soil fungal communities. PMID:25545363
Paulsrud, Per; Lindblad, Peter
1998-01-01
We examined the genetic diversity of Nostoc symbionts in some lichens by using the tRNALeu (UAA) intron as a genetic marker. The nucleotide sequence was analyzed in the context of the secondary structure of the transcribed intron. Cyanobacterial tRNALeu (UAA) introns were specifically amplified from freshly collected lichen samples without previous DNA extraction. The lichen species used in the present study were Nephroma arcticum, Peltigera aphthosa, P. membranacea, and P. canina. Introns with different sizes around 300 bp were consistently obtained. Multiple clones from single PCRs were screened by using their single-stranded conformational polymorphism pattern, and the nucleotide sequence was determined. No evidence for sample heterogenity was found. This implies that the symbiont in situ is not a diverse community of cyanobionts but, rather, one Nostoc strain. Furthermore, each lichen thallus contained only one intron type, indicating that each thallus is colonized only once or that there is a high degree of specificity. The same cyanobacterial intron sequence was also found in samples of one lichen species from different localities. In a phylogenetic analysis, the cyanobacterial lichen sequences grouped together with the sequences from two free-living Nostoc strains. The size differences in the intron were due to insertions and deletions in highly variable regions. The sequence data were used in discussions concerning specificity and biology of the lichen symbiosis. It is concluded that the tRNALeu (UAA) intron can be of great value when examining cyanobacterial diversity. PMID:9435083
Genetic variation in potential Giardia vaccine candidates cyst wall protein 2 and α1-giardin.
Radunovic, Matej; Klotz, Christian; Saghaug, Christina Skår; Brattbakk, Hans-Richard; Aebischer, Toni; Langeland, Nina; Hanevik, Kurt
2017-08-01
Giardia is a prevalent intestinal parasitic infection. The trophozoite structural protein a1-giardin (a1-g) and the cyst protein cyst wall protein 2 (CWP2) have shown promise as Giardia vaccine antigen candidates in murine models. The present study assesses the genetic diversity of a1-g and CWP2 between and within assemblages A and B in human clinical isolates. a1-g and CWP2 sequences were acquired from 15 Norwegian isolates by PCR amplification and 20 sequences from German cultured isolates by whole genome sequencing. Sequences were aligned to reference genomes from assemblage A2 and B to identify genetic variance. Genetic diversity was found between assemblage A and B reference sequences for both a1-g (90.8% nucleotide identity) and CWP2 (82.5% nucleotide identity). However, for a1-g, this translated into only 3 amino acid (aa) substitutions, while for CWP2 there were 41 aa substitutions, and also one aa deletion. Genetic diversity within assemblage B was larger; nucleotide identity 92.0% for a1-g and 94.3% for CWP2, than within assemblage A (nucleotide identity 99.0% for a1-g and 99.7% for CWP2). For CWP2, the diversity on both nucleotide and protein level was higher in the C-terminal end. Predicted antigenic epitopes were not affected for a1-g, but partially for CWP2. Despite genetic diversity in a1-g, we found aa sequence, characteristics, and antigenicity to be well preserved. CWP2 showed more aa variance and potential antigenic differences. Several CWP2 antigens might be necessary in a future Giardia vaccine to provide cross protection against both Giardia assemblages infecting humans.
Rhizobial characterization in revegetated areas after bauxite mining.
Borges, Wardsson Lustrino; Prin, Yves; Ducousso, Marc; Le Roux, Christine; de Faria, Sergio Miana
2016-01-01
Little is known regarding how the increased diversity of nitrogen-fixing bacteria contributes to the productivity and diversity of plants in complex communities. However, some authors have shown that the presence of a diverse group of nodulating bacteria is required for different plant species to coexist. A better understanding of the plant symbiotic organism diversity role in natural ecosystems can be extremely useful to define recovery strategies of environments that were degraded by human activities. This study used ARDRA, BOX-PCR fingerprinting and sequencing of the 16S rDNA gene to assess the diversity of root nodule nitrogen-fixing bacteria in former bauxite mining areas that were replanted in 1981, 1985, 1993, 1998, 2004 and 2006 and in a native forest. Among the 12 isolates for which the 16S rDNA gene was partially sequenced, eight, three and one isolate(s) presented similarity with sequences of the genera Bradyrhizobium, Rhizobium and Mesorhizobium, respectively. The richness, Shannon and evenness indices were the highest in the area that was replanted the earliest (1981) and the lowest in the area that was replanted most recently (2006). Copyright © 2016 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Helicobacter pylori Heat Shock Protein A: Serologic Responses and Genetic Diversity
Ng, Enders K. W.; Thompson, Stuart A.; Pérez-Pérez, Guillermo I.; Kansau, Imad; van der Ende, Arie; Labigne, Agnès; Sung, Joseph J. Y.; Chung, S. C. Sydney; Blaser, Martin J.
1999-01-01
Helicobacter pylori synthesizes an unusual GroES homolog, heat shock protein A (HspA). The present study was aimed at an assessment of the serological response to HspA in a group of Chinese patients with defined gastroduodenal pathologies and determination of whether diversity is present in the nucleotide sequences encoding HspA in isolates from these patients. Serum samples collected from 154 patients who had an upper gastrointestinal pathology and the presence of H. pylori defined by biopsy were tested for an immunoglobulin G (IgG) serologic response to H. pylori HspA by an enzyme linked immunosorbant assay. HspA-encoding nucleotide sequences in H. pylori isolates from 14 patients (7 seropositive and 7 seronegative for HspA) were analyzed by PCR and direct sequencing of the PCR products. The sequencing results were compared to those of 48 isolates from other parts of the world. Of the 154 known H. pylori-positive patients, 54 (35.1%) were seropositive for HspA. The A domain (GroES homology) of HspA was highly conserved in the 14 isolates tested. Although the B domain (metal-binding site unique to H. pylori) resembled that in the known major variant, particular amino acid substitutions allowed definition of an HspA variant associated with isolates from East Asia. There were no associations between patient characteristics and HspA seropositivity or amino acid sequences. We confirmed in this study that the clinical outcomes of H. pylori infection are not related to HspA antigenicity or to sequence variation. However, B-domain sequence variation may be a marker for the study of the genetic diversity of H. pylori strains of different geographic origins. PMID:10225839
Ling, Alison L.; Robertson, Charles E.; Harris, J. Kirk; Frank, Daniel N.; Kotter, Cassandra V.; Stevens, Mark J.; Pace, Norman R.; Hernandez, Mark T.
2015-01-01
Microbially-induced concrete corrosion in headspaces threatens wastewater infrastructure worldwide. Models for predicting corrosion rates in sewer pipe networks rely largely on information from culture-based investigations. In this study, the succession of microbes associated with corroding concrete was characterized over a one-year monitoring campaign using rRNA sequence-based phylogenetic methods. New concrete specimens were exposed in two highly corrosive manholes (high concentrations of hydrogen sulfide and carbon dioxide gas) on the Colorado Front Range for up to a year. Community succession on corroding surfaces was assessed using Illumina MiSeq sequencing of 16S bacterial rRNA amplicons and Sanger sequencing of 16S universal rRNA clones. Microbial communities associated with corrosion fronts presented distinct succession patterns which converged to markedly low α-diversity levels (< 10 taxa) in conjunction with decreasing pH. The microbial community succession pattern observed in this study agreed with culture-based models that implicate acidophilic sulfur-oxidizer Acidithiobacillus spp. in advanced communities, with two notable exceptions. Early communities exposed to alkaline surface pH presented relatively high α-diversity, including heterotrophic, nitrogen-fixing, and sulfur-oxidizing genera, and one community exposed to neutral surface pH presented a diverse transition community comprised of less than 20% sulfur-oxidizers. PMID:25748024
Ling, Alison L; Robertson, Charles E; Harris, J Kirk; Frank, Daniel N; Kotter, Cassandra V; Stevens, Mark J; Pace, Norman R; Hernandez, Mark T
2015-01-01
Microbially-induced concrete corrosion in headspaces threatens wastewater infrastructure worldwide. Models for predicting corrosion rates in sewer pipe networks rely largely on information from culture-based investigations. In this study, the succession of microbes associated with corroding concrete was characterized over a one-year monitoring campaign using rRNA sequence-based phylogenetic methods. New concrete specimens were exposed in two highly corrosive manholes (high concentrations of hydrogen sulfide and carbon dioxide gas) on the Colorado Front Range for up to a year. Community succession on corroding surfaces was assessed using Illumina MiSeq sequencing of 16S bacterial rRNA amplicons and Sanger sequencing of 16S universal rRNA clones. Microbial communities associated with corrosion fronts presented distinct succession patterns which converged to markedly low α-diversity levels (< 10 taxa) in conjunction with decreasing pH. The microbial community succession pattern observed in this study agreed with culture-based models that implicate acidophilic sulfur-oxidizer Acidithiobacillus spp. in advanced communities, with two notable exceptions. Early communities exposed to alkaline surface pH presented relatively high α-diversity, including heterotrophic, nitrogen-fixing, and sulfur-oxidizing genera, and one community exposed to neutral surface pH presented a diverse transition community comprised of less than 20% sulfur-oxidizers.
Reevaluating the serotype II capsular locus of Streptococcus agalactiae.
Martins, E R; Melo-Cristino, J; Ramirez, M
2007-10-01
We report a novel sequence of the serotype II capsular locus of group B streptococcus that resolves inconsistencies among the results of various groups and the sequence in GenBank. This locus was found in diverse lineages and presents genes consistent with the complete synthesis of the type II polysaccharide.
Lossius, Andreas; Johansen, Jorunn N; Vartdal, Frode; Robins, Harlan; Jūratė Šaltytė, Benth; Holmøy, Trygve; Olweus, Johanna
2014-11-01
Epstein-Barr virus (EBV) has long been suggested as a pathogen in multiple sclerosis (MS). Here, we used high-throughput sequencing to determine the diversity, compartmentalization, persistence, and EBV-reactivity of the T-cell receptor (TCR) repertoires in MS. TCR-β genes were sequenced in paired samples of cerebrospinal fluid (CSF) and blood from patients with MS and controls with other inflammatory neurological diseases. The TCR repertoires were highly diverse in both compartments and patient groups. Expanded T-cell clones, represented by TCR-β sequences >0.1%, were of different identity in CSF and blood of MS patients, and persisted for more than a year. Reference TCR-β libraries generated from peripheral blood T cells reactive against autologous EBV-transformed B cells were highly enriched for public EBV-specific sequences and were used to quantify EBV-reactive TCR-β sequences in CSF. TCR-β sequences of EBV-reactive CD8+ T cells, including several public EBV-specific sequences, were intrathecally enriched in MS patients only, whereas those of EBV-reactive CD4+ T cells were also enriched in CSF of controls. These data provide evidence for a clonally diverse, yet compartmentalized and persistent, intrathecal T-cell response in MS. The presented strategy links TCR sequence to intrathecal T-cell specificity, demonstrating enrichment of EBV-reactive CD8+ T cells in MS. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Alterations of microbiota in urine from women with interstitial cystitis
2012-01-01
Background Interstitial Cystitis (IC) is a chronic inflammatory condition of the bladder with unknown etiology. The aim of this study was to characterize the microbial community present in the urine from IC female patients by 454 high throughput sequencing of the 16S variable regions V1V2 and V6. The taxonomical composition, richness and diversity of the IC microbiota were determined and compared to the microbial profile of asymptomatic healthy female (HF) urine. Results The composition and distribution of bacterial sequences differed between the urine microbiota of IC patients and HFs. Reduced sequence richness and diversity were found in IC patient urine, and a significant difference in the community structure of IC urine in relation to HF urine was observed. More than 90% of the IC sequence reads were identified as belonging to the bacterial genus Lactobacillus, a marked increase compared to 60% in HF urine. Conclusion The 16S rDNA sequence data demonstrates a shift in the composition of the bacterial community in IC urine. The reduced microbial diversity and richness is accompanied by a higher abundance of the bacterial genus Lactobacillus, compared to HF urine. This study demonstrates that high throughput sequencing analysis of urine microbiota in IC patients is a powerful tool towards a better understanding of this enigmatic disease. PMID:22974186
Archaeal β diversity patterns under the seafloor along geochemical gradients
NASA Astrophysics Data System (ADS)
Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya
2014-09-01
Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.
Lumkul, Lalita; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai; Pattaradilokrat, Sittiporn
2018-01-01
Development of an effective vaccine is critically needed for the prevention of malaria. One of the key antigens for malaria vaccines is the apical membrane antigen 1 (AMA-1) of the human malaria parasite Plasmodium falciparum, the surface protein for erythrocyte invasion of the parasite. The gene encoding AMA-1 has been sequenced from populations of P. falciparum worldwide, but the haplotype diversity of the gene in P. falciparum populations in the Greater Mekong Subregion (GMS), including Thailand, remains to be characterized. In the present study, the AMA-1 gene was PCR amplified and sequenced from the genomic DNA of 65 P. falciparum isolates from 5 endemic areas in Thailand. The nearly full-length 1,848 nucleotide sequence of AMA-1 was subjected to molecular analyses, including nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity and neutrality tests. Phylogenetic analysis and pairwise population differentiation (Fst indices) were performed to infer the population structure. The analyses identified 60 single nucleotide polymorphic loci, predominately located in domain I of AMA-1. A total of 31 unique AMA-1 haplotypes were identified, which included 11 novel ones. The phylogenetic tree of the AMA-1 haplotypes revealed multiple clades of AMA-1, each of which contained parasites of multiple geographical origins, consistent with the Fst indices indicating genetic homogeneity or gene flow among geographically distinct populations of P. falciparum in Thailand’s borders with Myanmar, Laos and Cambodia. In summary, the study revealed novel haplotypes and population structure needed for the further advancement of AMA-1-based malaria vaccines in the GMS. PMID:29742870
Zhang, Haifang; Zhang, Xiaolei; Yan, Meiying; Pang, Bo; Kan, Biao; Xu, Huaxi; Huang, Xinxiang
2011-12-15
To determine the genotype of Salmonella enterica serovar Typhi (S. Typhi) strains in China and analyze their genetic diversity. We collected S. Typhi strains from 1959 to 2006 in five highly endemic Chinese provinces and chose 40 representative strains. Multilocus sequence typing was used to determine the genotypes or sequence types (ST) and microarray-based comparative genomic hybridization (M-CGH) to investigate the differences in gene content among these strains. Forty representative S. Typhi strains belonged to 4 sequence types (ST1, ST2, ST890, and ST892). The predominant S. Typhi genotype (31/40) was ST2 and it had a diverse geographic distribution. We discovered two novel STs - ST890 and ST892. M-CGH showed that 69 genes in these two novel STs were divergent from S. Typhi Ty2, which belongs to ST1. In addition, 5 representative Typhi strains of ST2 isolated from Guizhou province showed differences in divergent genes. We determined two novel sequence types, ST890 and ST892, and found that ST2 was the most prevalent genotype of S. Typhi in China. Genetic diversity was present even within a highly clonal bacterial population.
Al-Rshaidat, Mamoon M D; Snider, Allison; Rosebraugh, Sydney; Devine, Amanda M; Devine, Thomas D; Plaisance, Laetitia; Knowlton, Nancy; Leray, Matthieu
2016-09-01
High-throughput sequencing (HTS) of DNA barcodes (metabarcoding), particularly when combined with standardized sampling protocols, is one of the most promising approaches for censusing overlooked cryptic invertebrate communities. We present biodiversity estimates based on sequencing of the cytochrome c oxidase subunit 1 (COI) gene for coral reefs of the Gulf of Aqaba, a semi-enclosed system in the northern Red Sea. Samples were obtained from standardized sampling devices (Autonomous Reef Monitoring Structures (ARMS)) deployed for 18 months. DNA barcoding of non-sessile specimens >2 mm revealed 83 OTUs in six phyla, of which only 25% matched a reference sequence in public databases. Metabarcoding of the 2 mm - 500 μm and sessile bulk fractions revealed 1197 OTUs in 15 animal phyla, of which only 4.9% matched reference barcodes. These results highlight the scarcity of COI data for cryptobenthic organisms of the Red Sea. Compared with data obtained using similar methods, our results suggest that Gulf of Aqaba reefs are less diverse than two Pacific coral reefs but much more diverse than an Atlantic oyster reef at a similar latitude. The standardized approaches used here show promise for establishing baseline data on biodiversity, monitoring the impacts of environmental change, and quantifying patterns of diversity at regional and global scales.
Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod
2016-02-01
Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of 5.8S gene revealed a much conserved region in all the accessions of Naga King Chili. However, strong phylogenetic information of this species is the distinct 13 bp deletion in the 5.8S gene which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied a neutral variation, and population seems to be evolving at drift-mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed multimodal curve indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of star-like network of haplotypes suggested an ancient population expansion of this chili.
Al-Shahrani, Sarah A; Alajmi, Reem A; Ayaad, Tahany H; Al-Shahrani, Mohammed A; Shaurub, El-Sayed H
2017-10-01
The present work aimed at investigating the genetic diversity of the head louse Pediculus humanus capitis (P. humanus capitis) among infested primary school girls at Bisha governorate, Saudi Arabia, based on the sequence of mitochondrial cytochrome b (mt cyt b) gene of 121 P. humanus capitis adults. Additionally, the prevalence of pediculosis capitis was surveyed. The results of sequencing were compared with the sequence of human head lice that are genotyped previously. Phylogenetic tree analysis showed the presence of 100% identity (n = 26) of louse specimens with clade A (prevalent worldwide) of the GenBank data base. Louse individuals (n = 50) showed 99.8% similarity with the same clade A reference having a single base pair difference. Also, a number of 22 louse individuals revealed 99.8% identity with clade B reference (prevalent in North and Central Americas, Europe, and Australia) with individual diversity in two base pairs. Moreover, 14 louse individual sequences revealed 99.4% identity with three base pair differences. It was concluded that moderate pediculosis (~13%) prevailed among the female students of the primary schools. It was age-and hair texture (straight or curly)-dependent. P. humanus capitis prevalence diversity is of clades A and B genotyping.
Molecular diversity and distribution of marine fungi across 130 European environmental samples.
Richards, Thomas A; Leonard, Guy; Mahé, Frédéric; Del Campo, Javier; Romac, Sarah; Jones, Meredith D M; Maguire, Finlay; Dunthorn, Micah; De Vargas, Colomban; Massana, Ramon; Chambouvet, Aurélie
2015-11-22
Environmental DNA and culture-based analyses have suggested that fungi are present in low diversity and in low abundance in many marine environments, especially in the upper water column. Here, we use a dual approach involving high-throughput diversity tag sequencing from both DNA and RNA templates and fluorescent cell counts to evaluate the diversity and relative abundance of fungi across marine samples taken from six European near-shore sites. We removed very rare fungal operational taxonomic units (OTUs) selecting only OTUs recovered from multiple samples for a detailed analysis. This approach identified a set of 71 fungal 'OTU clusters' that account for 66% of all the sequences assigned to the Fungi. Phylogenetic analyses demonstrated that this diversity includes a significant number of chytrid-like lineages that had not been previously described, indicating that the marine environment encompasses a number of zoosporic fungi that are new to taxonomic inventories. Using the sequence datasets, we identified cases where fungal OTUs were sampled across multiple geographical sites and between different sampling depths. This was especially clear in one relatively abundant and diverse phylogroup tentatively named Novel Chytrid-Like-Clade 1 (NCLC1). For comparison, a subset of the water column samples was also investigated using fluorescent microscopy to examine the abundance of eukaryotes with chitin cell walls. Comparisons of relative abundance of RNA-derived fungal tag sequences and chitin cell-wall counts demonstrate that fungi constitute a low fraction of the eukaryotic community in these water column samples. Taken together, these results demonstrate the phylogenetic position and environmental distribution of 71 lineages, improving our understanding of the diversity and abundance of fungi in marine environments. © 2015 The Authors.
Molecular diversity and distribution of marine fungi across 130 European environmental samples
Richards, Thomas A.; Leonard, Guy; Mahé, Frédéric; del Campo, Javier; Romac, Sarah; Jones, Meredith D. M.; Maguire, Finlay; Dunthorn, Micah; De Vargas, Colomban; Massana, Ramon; Chambouvet, Aurélie
2015-01-01
Environmental DNA and culture-based analyses have suggested that fungi are present in low diversity and in low abundance in many marine environments, especially in the upper water column. Here, we use a dual approach involving high-throughput diversity tag sequencing from both DNA and RNA templates and fluorescent cell counts to evaluate the diversity and relative abundance of fungi across marine samples taken from six European near-shore sites. We removed very rare fungal operational taxonomic units (OTUs) selecting only OTUs recovered from multiple samples for a detailed analysis. This approach identified a set of 71 fungal ‘OTU clusters' that account for 66% of all the sequences assigned to the Fungi. Phylogenetic analyses demonstrated that this diversity includes a significant number of chytrid-like lineages that had not been previously described, indicating that the marine environment encompasses a number of zoosporic fungi that are new to taxonomic inventories. Using the sequence datasets, we identified cases where fungal OTUs were sampled across multiple geographical sites and between different sampling depths. This was especially clear in one relatively abundant and diverse phylogroup tentatively named Novel Chytrid-Like-Clade 1 (NCLC1). For comparison, a subset of the water column samples was also investigated using fluorescent microscopy to examine the abundance of eukaryotes with chitin cell walls. Comparisons of relative abundance of RNA-derived fungal tag sequences and chitin cell-wall counts demonstrate that fungi constitute a low fraction of the eukaryotic community in these water column samples. Taken together, these results demonstrate the phylogenetic position and environmental distribution of 71 lineages, improving our understanding of the diversity and abundance of fungi in marine environments. PMID:26582030
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes
Huang, Yongjie; Mrázek, Jan
2014-01-01
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Takemae, Hitoshi; Simking, Pacharathon; Jittapalapong, Sathaporn; Igarashi, Ikuo; Yokoyama, Naoaki
2016-07-01
Babesia bovis, an intraerythrocytic protozoan parasite, causes severe clinical disease in cattle worldwide. The genetic diversity of parasite antigens often results in different immune profiles in infected animals, hindering efforts to develop immune control methodologies against the B. bovis infection. In this study, we analyzed the genetic diversity of the merozoite surface antigen-1 (msa-1) gene using 162 B. bovis-positive blood DNA samples sourced from cattle populations reared in different geographical regions of Thailand. The identity scores shared among 93 msa-1 gene sequences isolated by PCR amplification were 43.5-100%, and the similarity values among the translated amino acid sequences were 42.8-100%. Of 23 total clades detected in our phylogenetic analysis, Thai msa-1 gene sequences occurred in 18 clades; seven among them were composed of sequences exclusively from Thailand. To investigate differential antigenicity of isolated MSA-1 proteins, we expressed and purified eight recombinant MSA-1 (rMSA-1) proteins, including an rMSA-1 from B. bovis Texas (T2Bo) strain and seven rMSA-1 proteins based on the Thai msa-1 sequences. When these antigens were analyzed in a western blot assay, anti-T2Bo cattle serum strongly reacted with the rMSA-1 from T2Bo, as well as with three other rMSA-1 proteins that shared 54.9-68.4% sequence similarity with T2Bo MSA-1. In contrast, no or weak reactivity was observed for the remaining rMSA-1 proteins, which shared low sequence similarity (35.0-39.7%) with T2Bo MSA-1. While demonstrating the high genetic diversity of the B. bovis msa-1 gene in Thailand, the present findings suggest that the genetic diversity results in antigenicity variations among the MSA-1 antigens of B. bovis in Thailand. Copyright © 2016 Elsevier B.V. All rights reserved.
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
High genetic variability of HIV-1 in female sex workers from Argentina.
Pando, María A; Eyzaguirre, Lindsay M; Carrion, Gladys; Montano, Silvia M; Sanchez, José L; Carr, Jean K; Avila, María M
2007-08-13
A cross-sectional study on 625 Female Sex Workers (FSWs) was conducted between 2000 and 2002 in 6 cities in Argentina. This study describes the genetic diversity and the resistance profile of the HIV-infected subjects. Seventeen samples from HIV positive FSWs were genotyped by env HMA, showing the presence of 9 subtype F, 6 subtype B and 2 subtype C. Sequence analysis of the protease/RT region on 16 of these showed that 10 were BF recombinants, three were subtype B, two were subtype C, and one sample presented a dual infection with subtype B and a BF recombinant. Full-length genomes of five of the protease/RT BF recombinants were also sequenced, showing that three of them were CRF12_BF. One FSW had a dual HIV-1 infection with subtype B and a BF recombinant. The B sections of the BF recombinant clustered closely with the pure B sequence isolated from the same patient. Major resistance mutations to antiretroviral drugs were found in 3 of 16 (18.8%) strains. The genetic diversity of HIV strains among FSWs in Argentina was extensive; about three-quarters of the samples were infected with diverse BF recombinants, near twenty percent had primary ART resistance and one sample presented a dual infection. Heterosexual transmission of genetically diverse, drug resistant strains among FSWs and their clients represents an important and underestimated threat, in Argentina.
Insights into the diversity of eukaryotes in acid mine drainage biofilm communities.
Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F
2009-04-01
Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH < 0.9) runoff within the Richmond Mine at Iron Mountain (northern California). Phylogenetic analysis revealed that the majority (68%) of the sequences belonged to fungi. Protists falling into the deeply branching lineage named the acidophilic protist clade (APC) and the class Heterolobosea were also present. The APC group represents kingdom-level novelty, with <76% sequence similarity to 18S rRNA gene sequences of organisms from other environments. Fluorescently labeled oligonucleotide rRNA probes were designed to target each of these groups in biofilm samples, enabling abundance and morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.
Phylogenetic Diversity of Koala Retrovirus within a Wild Koala Population.
Chappell, K J; Brealey, J C; Amarilla, A A; Watterson, D; Hulse, L; Palmieri, C; Johnston, S D; Holmes, E C; Meers, J; Young, P R
2017-02-01
Koala populations are in serious decline across many areas of mainland Australia, with infectious disease a contributing factor. Koala retrovirus (KoRV) is a gammaretrovirus present in most wild koala populations and captive colonies. Five subtypes of KoRV (A to E) have been identified based on amino acid sequence divergence in a hypervariable region of the receptor binding domain of the envelope protein. However, analysis of viral genetic diversity has been conducted primarily on KoRV in captive koalas housed in zoos in Japan, the United States, and Germany. Wild koalas within Australia have not been comparably assessed. Here we report a detailed analysis of KoRV genetic diversity in samples collected from 18 wild koalas from southeast Queensland. By employing deep sequencing we identified 108 novel KoRV envelope sequences and determined their phylogenetic diversity. Genetic diversity in KoRV was abundant and fell into three major groups; two comprised the previously identified subtypes A and B, while the third contained the remaining hypervariable region subtypes (C, D, and E) as well as four hypervariable region subtypes that we newly define here (F, G, H, and I). In addition to the ubiquitous presence of KoRV-A, which may represent an exclusively endogenous variant, subtypes B, D, and F were found to be at high prevalence, while subtypes G, H, and I were present in a smaller number of animals. Koala retrovirus (KoRV) is thought to be a significant contributor to koala disease and population decline across mainland Australia. This study is the first to determine KoRV subtype prevalence among a wild koala population, and it significantly expands the total number of KoRV sequences available, providing a more precise picture of genetic diversity. This understanding of KoRV subtype prevalence and genetic diversity will be important for conservation efforts attempting to limit the spread of KoRV. Furthermore, KoRV is one of the only retroviruses shown to exist in both endogenous (transmitted vertically to offspring in the germ line DNA) and exogenous (horizontally transmitted between infected individuals) forms, a division of fundamental evolutionary importance. Copyright © 2017 American Society for Microbiology.
Klingeman, Dawn M.; Utturkar, Sagar; Lu, Tse -Yuan S.; ...
2015-11-12
Draft genome sequences for four Actinobacteria from the genus Streptomyces are presented. Streptomyces is a metabolically diverse genus that is abundant in soils and has been reported in association with plants. The strains described in this study were isolated from the Populus trichocarpa endosphere and rhizosphere.
Development of phoH as a Novel Signature Gene for Assessing Marine Phage Diversity▿
Goldsmith, Dawn B.; Crosti, Giuseppe; Dwivedi, Bhakti; McDaniel, Lauren D.; Varsani, Arvind; Suttle, Curtis A.; Weinbauer, Markus G.; Sandaa, Ruth-Anne; Breitbart, Mya
2011-01-01
Phages play a key role in the marine environment by regulating the transfer of energy between trophic levels and influencing global carbon and nutrient cycles. The diversity of marine phage communities remains difficult to characterize because of the lack of a signature gene common to all phages. Recent studies have demonstrated the presence of host-derived auxiliary metabolic genes in phage genomes, such as those belonging to the Pho regulon, which regulates phosphate uptake and metabolism under low-phosphate conditions. Among the completely sequenced phage genomes in GenBank, this study identified Pho regulon genes in nearly 40% of the marine phage genomes, while only 4% of nonmarine phage genomes contained these genes. While several Pho regulon genes were identified, phoH was the most prevalent, appearing in 42 out of 602 completely sequenced phage genomes. Phylogenetic analysis demonstrated that phage phoH sequences formed a cluster distinct from those of their bacterial hosts. PCR primers designed to amplify a region of the phoH gene were used to determine the diversity of phage phoH sequences throughout a depth profile in the Sargasso Sea and at six locations worldwide. phoH was present at all sites examined, and a high diversity of phoH sequences was recovered. Most phoH sequences belonged to clusters without any cultured representatives. Each depth and geographic location had a distinct phoH composition, although most phoH clusters were recovered from multiple sites. Overall, phoH is an effective signature gene for examining phage diversity in the marine environment. PMID:21926220
Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth
2015-01-01
ABSTRACT Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. PMID:25968644
Fusarium diversity in soil using a specific molecular approach and a cultural approach.
Edel-Hermann, Véronique; Gautheron, Nadine; Mounier, Arnaud; Steinberg, Christian
2015-04-01
Fusarium species are ubiquitous in soil. They cause plant and human diseases and can produce mycotoxins. Surveys of Fusarium species diversity in environmental samples usually rely on laborious culture-based methods. In the present study, we have developed a molecular method to analyze Fusarium diversity directly from soil DNA. We designed primers targeting the translation elongation factor 1-alpha (EF-1α) gene and demonstrated their specificity toward Fusarium using a large collection of fungi. We used the specific primers to construct a clone library from three contrasting soils. Sequence analysis confirmed the specificity of the assay, with 750 clones identified as Fusarium and distributed among eight species or species complexes. The Fusarium oxysporum species complex (FOSC) was the most abundant one in the three soils, followed by the Fusarium solani species complex (FSSC). We then compared our molecular approach results with those obtained by isolating Fusarium colonies on two culture media and identifying species by sequencing part of the EF-1α gene. The 750 isolates were distributed into eight species or species complexes, with the same dominant species as with the cloning method. Sequence diversity was much higher in the clone library than in the isolate collection. The molecular approach proved to be a valuable tool to assess Fusarium diversity in environmental samples. Combined with high throughput sequencing, it will allow for in-depth analysis of large numbers of samples. Published by Elsevier B.V.
Baker, C S; Vant, M D; Dalebout, M L; Lento, G M; O'Brien, S J; Yuhki, N
2006-05-01
The molecular diversity and phylogenetic relationships of two class II genes of the baleen whale major histocompatibility complex were investigated and compared to toothed whales and out-groups. Amplification of the DQB exon 2 provided sequences showing high within-species and between-species nucleotide diversity and uninterrupted reading frames consistent with functional class II loci found in related mammals (e.g., ruminants). Cloning of amplified products indicated gene duplication in the humpback whale and triplication in the southern right whale, with average nucleotide diversity of 5.9 and 6.3%, respectively, for alleles of each species. Significantly higher nonsynonymous divergence at sites coding for peptide binding (32% for humpback and 40% for southern right) suggested that these loci were subject to positive (overdominant) selection. A population survey of humpback whales detected 23 alleles, differing by up to 21% of their inferred amino acid sequences. Amplification of the DRB exon 2 resulted in two groups of sequences. One was most similar to the DRB3 of the cow and present in all whales screened to date, including toothed whales. The second was most similar to the DRB2 of the cow and was found only in the bowhead and right whales. Both loci showed low diversity among species and apparent loss of function or altered function including interruption of reading frames. Finally, comparison of inferred protein sequence of the DRB3-like locus suggested convergence with the DQB, perhaps resulting from intergenic conversion or recombination.
Weiss, Eric R; Lamers, Susanna L; Henderson, Jennifer L; Melnikov, Alexandre; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Nusbaum, Chad; Luzuriaga, Katherine
2018-01-15
Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length EBV genome sequences obtained from matched oral wash and B cell fractions from a cohort of 10 AIM patients. Both intra- and interpatient diversity were observed across the length of the entire viral genome. Diversity was most pronounced in viral genes required for establishing latent infection and persistence, with appreciable levels of diversity also detected in structural genes, including envelope glycoproteins. Interestingly, intrapatient diversity declined significantly over time ( P < 0.01), and this was particularly evident on comparison of viral genomes sequenced from B cell fractions in early primary infection and convalescence ( P < 0.001). B cell-associated viral genomes were observed to converge, becoming nearly identical to the B95.8 reference genome over time (Spearman rank-order correlation test; r = -0.5589, P = 0.0264). The reduction in diversity was most marked in the EBV latency genes. In summary, our data suggest independent convergence of diverse viral genome sequences toward a reference-like strain within a relatively short period following primary EBV infection. IMPORTANCE Identification of viral proteins with low variability and high immunogenicity is important for the development of a protective vaccine. Knowledge of genome diversity within circulating viral populations is a key step in this process, as is the expansion of intrahost genomic variation during infection. We report full-length EBV genomes sequenced from the blood and oral wash of 10 individuals early in primary infection and during convalescence. Our data demonstrate considerable diversity within the pool of circulating EBV strains, as well as within individual patients. Overall viral diversity decreased from early to persistent infection, particularly in latently infected B cells, which serve as the viral reservoir. Reduction in B cell-associated viral genome diversity coincided with a convergence toward a reference-like EBV genotype. Greater convergence positively correlated with time after infection, suggesting that the reference-like genome is the result of selection. Copyright © 2018 American Society for Microbiology.
Using high throughput sequencing to explore the biodiversity in oral bacterial communities.
Diaz, P I; Dupuy, A K; Abusleme, L; Reese, B; Obergfell, C; Choquette, L; Dongari-Bagtzoglou, A; Peterson, D E; Terzi, E; Strausbaugh, L D
2012-06-01
High throughput sequencing of 16S ribosomal RNA gene amplicons is a cost-effective method for characterization of oral bacterial communities. However, before undertaking large-scale studies, it is necessary to understand the technique-associated limitations and intrinsic variability of the oral ecosystem. In this work we evaluated bias in species representation using an in vitro-assembled mock community of oral bacteria. We then characterized the bacterial communities in saliva and buccal mucosa of five healthy subjects to investigate the power of high throughput sequencing in revealing their diversity and biogeography patterns. Mock community analysis showed primer and DNA isolation biases and an overestimation of diversity that was reduced after eliminating singleton operational taxonomic units (OTUs). Sequencing of salivary and mucosal communities found a total of 455 OTUs (0.3% dissimilarity) with only 78 of these present in all subjects. We demonstrate that this variability was partly the result of incomplete richness coverage even at great sequencing depths, and so comparing communities by their structure was more effective than comparisons based solely on membership. With respect to oral biogeography, we found inter-subject variability in community structure was lower than site differences between salivary and mucosal communities within subjects. These differences were evident at very low sequencing depths and were mostly caused by the abundance of Streptococcus mitis and Gemella haemolysans in mucosa. In summary, we present an experimental and data analysis framework that will facilitate design and interpretation of pyrosequencing-based studies. Despite challenges associated with this technique, we demonstrate its power for evaluation of oral diversity and biogeography patterns. © 2012 John Wiley & Sons A/S.
Ceuppens, Siele; De Coninck, Dieter; Bottledoorn, Nadine; Van Nieuwerburgh, Filip; Uyttendaele, Mieke
2017-09-18
Application of 16S rRNA (gene) amplicon sequencing on food samples is increasingly applied for assessing microbial diversity but may as unintended advantage also enable simultaneous detection of any human pathogens without a priori definition. In the present study high-throughput next-generation sequencing (NGS) of the V1-V2-V3 regions of the 16S rRNA gene was applied to identify the bacteria present on fresh basil leaves. However, results were strongly impacted by variations in the bioinformatics analysis pipelines (MEGAN, SILVAngs, QIIME and MG-RAST), including the database choice (Greengenes, RDP and M5RNA) and the annotation algorithm (best hit, representative hit and lowest common ancestor). The use of pipelines with default parameters will lead to discrepancies. The estimate of microbial diversity of fresh basil using 16S rRNA (gene) amplicon sequencing is thus indicative but subject to biases. Salmonella enterica was detected at low frequencies, between 0.1% and 0.4% of bacterial sequences, corresponding with 37 to 166 reads. However, this result was dependent upon the pipeline used: Salmonella was detected by MEGAN, SILVAngs and MG-RAST, but not by QIIME. Confirmation of Salmonella sequences by real-time PCR was unsuccessful. It was shown that taxonomic resolution obtained from the short (500bp) sequence reads of the 16S rRNA gene containing the hypervariable regions V1-V3 cannot allow distinction of Salmonella with closely related enterobacterial species. In conclusion 16S amplicon sequencing, getting the status of standard method in microbial ecology studies of foods, needs expertise on both bioinformatics and microbiology for analysis of results. It is a powerful tool to estimate bacterial diversity but amenable to biases. Limitations concerning taxonomic resolution for some bacterial species or its inability to detect sub-dominant (pathogenic) species should be acknowledged in order to avoid overinterpretation of results. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Hager, K. W.; Fullerton, H.; Moyer, C. L.
2015-12-01
Hydrothermal vents along the Mariana Arc and back-arc represent a hotspot of microbial diversity that has not yet been fully recognized. The Mariana Arc and back-arc contain hydrothermal vents with varied vent effluent chemistry and temperature, which translates to diverse community composition. We have focused on iron-rich sites where the dominant primary producers are iron oxidizing bacteria. Because microbes from these environments have proven elusive in culturing efforts, we performed culture independent analysis among different microbial communities found at these hydrothermal vents. Terminal-restriction fragment length polymorphism (T-RFLP) and Illumina sequencing of small subunit ribosomal gene amplicons were used to characterize community members and identify samples for shotgun metagenomics. Used in combination, these methods will better elucidate the composition and characteristics of the bacterial communities at these hydrothermal vent systems. The overarching goal of this study is to evaluate and compare taxonomic and metabolic diversity among different communities of microbial mats. We compared communities collected on a fine scale to analyze the bacterial community based on gross mat morphology, geography, and nearby vent effluent chemistry. Taxa richness and evenness are compared with rarefaction curves to visualize diversity. As well as providing a survey of diversity this study also presents a juxtaposition of three methods in which ribosomal small subunit diversity is compared with T-RFLP, next generation amplicon sequencing, and metagenomic shotgun sequencing.
Singh, A K; Rai, V P; Chand, R; Singh, R P; Singh, M N
2013-01-01
Genetic diversity and identification of simple sequence repeat markers correlated with Fusarium wilt resistance was performed in a set of 36 elite cultivated pigeonpea genotypes differing in levels of resistance to Fusarium wilt. Twenty-four polymorphic sequence repeat markers were screened across these genotypes, and amplified a total of 59 alleles with an average high polymorphic information content value of 0.52. Cluster analysis, done by UPGMA and PCA, grouped the 36 pigeonpea genotypes into two main clusters according to their Fusarium wilt reaction. Based on the Kruskal-Wallis ANOVA and simple regression analysis, six simple sequence repeat markers were found to be significantly associated with Fusarium wilt resistance. The phenotypic variation explained by these markers ranged from 23.7 to 56.4%. The present study helps in finding out feasibility of prescreened SSR markers to be used in genetic diversity analysis and their potential association with disease resistance.
Lucero, Mary E.; Unc, Adrian; Cooke, Peter; Dowd, Scot; Sun, Shulei
2011-01-01
Microbial diversity associated with micropropagated Atriplex species was assessed using microscopy, isolate culturing, and sequencing. Light, electron, and confocal microscopy revealed microbial cells in aseptically regenerated leaves and roots. Clone libraries and tag-encoded FLX amplicon pyrosequencing (TEFAP) analysis amplified sequences from callus homologous to diverse fungal and bacterial taxa. Culturing isolated some seed borne endophyte taxa which could be readily propagated apart from the host. Microbial cells were observed within biofilm-like residues associated with plant cell surfaces and intercellular spaces. Various universal primers amplified both plant and microbial sequences, with different primers revealing different patterns of fungal diversity. Bacterial and fungal TEFAP followed by alignment with sequences from curated databases revealed 7 bacterial and 17 ascomycete taxa in A. canescens, and 5 bacterial taxa in A. torreyi. Additional diversity was observed among isolates and clone libraries. Micropropagated Atriplex retains a complex, intimately associated microbiome which includes diverse strains well poised to interact in manners that influence host physiology. Microbiome analysis was facilitated by high throughput sequencing methods, but primer biases continue to limit recovery of diverse sequences from even moderately complex communities. PMID:21437280
Chromosome ends: different sequences may provide conserved functions.
Louis, Edward J; Vershinin, Alexander V
2005-07-01
The structures of specific chromosome regions, centromeres and telomeres, present a number of puzzles. As functions performed by these regions are ubiquitous and essential, their DNA, proteins and chromatin structure are expected to be conserved. Recent studies of centromeric DNA from human, Drosophila and plant species have demonstrated that a hidden universal centromere-specific sequence is highly unlikely. The DNA of telomeres is more conserved consisting of a tandemly repeated 6-8 bp Arabidopsis-like sequence in a majority of organisms as diverse as protozoan, fungi, mammals and plants. However, there are alternatives to short DNA repeats at the ends of chromosomes and for telomere elongation by telomerase. Here we focus on the similarities and diversity that exist among the structural elements, DNA sequences and proteins, that make up terminal domains (telomeres and subtelomeres), and how organisms use these in different ways to fulfil the functions of end-replication and end-protection. Copyright (c) 2005 Wiley Periodicals, Inc.
You, M; Chan, Y; Lacap-Bugler, D C; Huo, Y-B; Gao, W; Leung, W K; Watt, R M
2017-12-01
Treponema denticola and other species (phylotypes) of oral spirochetes are widely considered to play important etiological roles in periodontitis and other oral infections. The major surface protein (Msp) of T. denticola is directly implicated in several pathological mechanisms. Here, we have analyzed msp sequence diversity across 68 strains of oral phylogroup 1 and 2 treponemes; including reference strains of T. denticola, Treponema putidum, Treponema medium, 'Treponema vincentii', and 'Treponema sinensis'. All encoded Msp proteins contained highly conserved, taxon-specific signal peptides, and shared a predicted 'three-domain' structure. A clone-based strategy employing 'msp-specific' polymerase chain reaction primers was used to analyze msp gene sequence diversity present in subgingival plaque samples collected from a group of individuals with chronic periodontitis (n=10), vs periodontitis-free controls (n=10). We obtained 626 clinical msp gene sequences, which were assigned to 21 distinct 'clinical msp genotypes' (95% sequence identity cut-off). The most frequently detected clinical msp genotype corresponded to T. denticola ATCC 35405 T , but this was not correlated to disease status. UniFrac and libshuff analysis revealed that individuals with periodontitis and periodontitis-free controls harbored significantly different communities of treponeme clinical msp genotypes (P<.001). Patients with periodontitis had higher levels of clinical msp genotype diversity than periodontitis-free controls (Mann-Whitney U-test, P<.05). The relative proportions of 'T. vincentii' clinical msp genotypes were significantly higher in the control group than in the periodontitis group (P=.018). In conclusion, our data clearly show that both healthy and diseased individuals commonly harbor a wide diversity of Treponema clinical msp genotypes within their subgingival niches. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.
Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju
2015-01-01
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
Walker, Sara Imari; Grover, Martha A.; Hud, Nicholas V.
2012-01-01
Many models for the origin of life have focused on understanding how evolution can drive the refinement of a preexisting enzyme, such as the evolution of efficient replicase activity. Here we present a model for what was, arguably, an even earlier stage of chemical evolution, when polymer sequence diversity was generated and sustained before, and during, the onset of functional selection. The model includes regular environmental cycles (e.g. hydration-dehydration cycles) that drive polymers between times of replication and functional activity, which coincide with times of different monomer and polymer diffusivity. Template-directed replication of informational polymers, which takes place during the dehydration stage of each cycle, is considered to be sequence-independent. New sequences are generated by spontaneous polymer formation, and all sequences compete for a finite monomer resource that is recycled via reversible polymerization. Kinetic Monte Carlo simulations demonstrate that this proposed prebiotic scenario provides a robust mechanism for the exploration of sequence space. Introduction of a polymer sequence with monomer synthetase activity illustrates that functional sequences can become established in a preexisting pool of otherwise non-functional sequences. Functional selection does not dominate system dynamics and sequence diversity remains high, permitting the emergence and spread of more than one functional sequence. It is also observed that polymers spontaneously form clusters in simulations where polymers diffuse more slowly than monomers, a feature that is reminiscent of a previous proposal that the earliest stages of life could have been defined by the collective evolution of a system-wide cooperation of polymer aggregates. Overall, the results presented demonstrate the merits of considering plausible prebiotic polymer chemistries and environments that would have allowed for the rapid turnover of monomer resources and for regularly varying monomer/polymer diffusivities. PMID:22493682
Manoharan, Lokeshwaran; Kushwaha, Sandeep K.; Hedlund, Katarina; Ahrén, Dag
2015-01-01
Microbial enzyme diversity is a key to understand many ecosystem processes. Whole metagenome sequencing (WMG) obtains information on functional genes, but it is costly and inefficient due to large amount of sequencing that is required. In this study, we have applied a captured metagenomics technique for functional genes in soil microorganisms, as an alternative to WMG. Large-scale targeting of functional genes, coding for enzymes related to organic matter degradation, was applied to two agricultural soil communities through captured metagenomics. Captured metagenomics uses custom-designed, hybridization-based oligonucleotide probes that enrich functional genes of interest in metagenomic libraries where only probe-bound DNA fragments are sequenced. The captured metagenomes were highly enriched with targeted genes while maintaining their target diversity and their taxonomic distribution correlated well with the traditional ribosomal sequencing. The captured metagenomes were highly enriched with genes related to organic matter degradation; at least five times more than similar, publicly available soil WMG projects. This target enrichment technique also preserves the functional representation of the soils, thereby facilitating comparative metagenomics projects. Here, we present the first study that applies the captured metagenomics approach in large scale, and this novel method allows deep investigations of central ecosystem processes by studying functional gene abundances. PMID:26490729
Estrada-Bárcenas, Daniel Alfonso; Vite-Garín, Tania; Navarro-Barranco, Hortensia; de la Torre-Arciniega, Raúl; Pérez-Mejía, Amelia; Rodríguez-Arellanes, Gabriela; Ramirez, Jose Antonio; Humberto Sahaza, Jorge; Taylor, Maria Lucia; Toriello, Conchita
2014-01-01
High sensitivity and specificity of molecular biology techniques have proven usefulness for the detection, identification and typing of different pathogens. The ITS (Internal Transcribed Spacer) regions of the ribosomal DNA are highly conserved non-coding regions, and have been widely used in different studies including the determination of the genetic diversity of human fungal pathogens. This article wants to contribute to the understanding of the intra- and interspecific genetic diversity of isolates of the Histoplasma capsulatum and Sporothrix schenckii species complexes by an analysis of the available sequences of the ITS regions from different sequence databases. ITS1-5.8S-ITS2 sequences of each fungus, either deposited in GenBank, or from our research groups (registered in the Fungi Barcode of Life Database), were analyzed using the maximum likelihood (ML) method. ML analysis of the ITS sequences discriminated isolates from distant geographic origins and particular wild hosts, depending on the fungal species analyzed. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.
USDA-ARS?s Scientific Manuscript database
The increase in the consumption of fresh produce in the United States has correlated with a rise in the number of reported foodborne illnesses. To identify potential risk factors associated with post-harvest practices, the present study employed multilocus sequence typing (MLST) for the genotypic c...
Fungal diversity in deep-sea sediments associated with asphalt seeps at the Sao Paulo Plateau
NASA Astrophysics Data System (ADS)
Nagano, Yuriko; Miura, Toshiko; Nishi, Shinro; Lima, Andre O.; Nakayama, Cristina; Pellizari, Vivian H.; Fujikura, Katsunori
2017-12-01
We investigated the fungal diversity in a total of 20 deep-sea sediment samples (of which 14 samples were associated with natural asphalt seeps and 6 samples were not associated) collected from two different sites at the Sao Paulo Plateau off Brazil by Ion Torrent PGM targeting ITS region of ribosomal RNA. Our results suggest that diverse fungi (113 operational taxonomic units (OTUs) based on clustering at 97% sequence similarity assigned into 9 classes and 31 genus) are present in deep-sea sediment samples collected at the Sao Paulo Plateau, dominated by Ascomycota (74.3%), followed by Basidiomycota (11.5%), unidentified fungi (7.1%), and sequences with no affiliation to any organisms in the public database (7.1%). However, it was revealed that only three species, namely Penicillium sp., Cadophora malorum and Rhodosporidium diobovatum, were dominant, with the majority of OTUs remaining a minor community. Unexpectedly, there was no significant difference in major fungal community structure between the asphalt seep and non-asphalt seep sites, despite the presence of mass hydrocarbon deposits and the high amount of macro organisms surrounding the asphalt seeps. However, there were some differences in the minor fungal communities, with possible asphalt degrading fungi present specifically in the asphalt seep sites. In contrast, some differences were found between the two different sampling sites. Classification of OTUs revealed that only 47 (41.6%) fungal OTUs exhibited >97% sequence similarity, in comparison with pre-existing ITS sequences in public databases, indicating that a majority of deep-sea inhabiting fungal taxa still remain undescribed. Although our knowledge on fungi and their role in deep-sea environments is still limited and scarce, this study increases our understanding of fungal diversity and community structure in deep-sea environments.
Shaik, Razia S; Zhu, Xiaocheng; Clements, David R; Weston, Leslie A
2016-01-01
Part of the challenge in dealing with invasive plant species is that they seldom represent a uniform, static entity. Often, an accurate understanding of the history of plant introduction and knowledge of the real levels of genetic diversity present in species and populations of importance is lacking. Currently, the role of genetic diversity in promoting the successful establishment of invasive plants is not well defined. Genetic profiling of invasive plants should enhance our understanding of the dynamics of colonization in the invaded range. Recent advances in DNA sequencing technology have greatly facilitated the rapid and complete assessment of plant population genetics. Here, we apply our current understanding of the genetics and ecophysiology of plant invasions to recent work on Australian plant invaders from the Cucurbitaceae and Boraginaceae. The Cucurbitaceae study showed that both prickly paddy melon ( Cucumis myriocarpus ) and camel melon ( Citrullus lanatus ) were represented by only a single genotype in Australia, implying that each was probably introduced as a single introduction event. In contrast, a third invasive melon, Citrullus colocynthis , possessed a moderate level of genetic diversity in Australia and was potentially introduced to the continent at least twice. The Boraginaceae study demonstrated the value of comparing two similar congeneric species; one, Echium plantagineum , is highly invasive and genetically diverse, whereas the other, Echium vulgare , exhibits less genetic diversity and occupies a more limited ecological niche. Sequence analysis provided precise identification of invasive plant species, as well as information on genetic diversity and phylogeographic history. Improved sequencing technologies will continue to allow greater resolution of genetic relationships among invasive plant populations, thereby potentially improving our ability to predict the impact of these relationships upon future spread and better manage invaders possessing potentially diverse biotypes and exhibiting diverse breeding systems, life histories and invasion histories.
Buttet, Géraldine F.; Holliger, Christof
2013-01-01
Reductive dehalogenases are the key enzymes involved in the anaerobic respiration of organohalides such as the widespread groundwater pollutant tetrachloroethene. The increasing number of available bacterial genomes and metagenomes gives access to hundreds of new putative reductive dehalogenase genes that display a high level of sequence diversity and for which substrate prediction remains very challenging. In this study, we present the development of a functional genotyping method targeting the diverse reductive dehalogenases present in Sulfurospirillum spp., which allowed us to unambiguously identify a new reductive dehalogenase from our tetrachloroethene-dechlorinating SL2 bacterial consortia. The new enzyme, named PceATCE, shows 92% sequence identity with the well-characterized PceA enzyme of Sulfurospirillum multivorans, but in contrast to the latter, it is restricted to tetrachloroethene as a substrate. Its apparent higher dechlorinating activity with tetrachloroethene likely allowed its selection and maintenance in the bacterial consortia among other enzymes showing broader substrate ranges. The sequence-substrate relationships within tetrachloroethene reductive dehalogenases are also discussed. PMID:23995945
Egge, Elianne; Bittner, Lucie; Andersen, Tom; Audic, Stéphane; de Vargas, Colomban; Edvardsen, Bente
2013-01-01
Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000–20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing. PMID:24069303
Archaeon and archaeal virus diversity classification via sequence entropy and fractal dimension
NASA Astrophysics Data System (ADS)
Tremberger, George, Jr.; Gallardo, Victor; Espinoza, Carola; Holden, Todd; Gadura, N.; Cheung, E.; Schneider, P.; Lieberman, D.; Cheung, T.
2010-09-01
Archaea are important potential candidates in astrobiology as their metabolism includes solar, inorganic and organic energy sources. Archaeal viruses would also be expected to be present in a sustainable archaeal exobiological community. Genetic sequence Shannon entropy and fractal dimension can be used to establish a two-dimensional measure for classification and phylogenetic study of these organisms. A sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. Archaeal 16S and 23S ribosomal RNA sequences were studied. Outliers in the 16S rRNA fractal dimension and entropy plot were found to be halophilic archaea. Positive correlation (R-square ~ 0.75, N = 18) was observed between fractal dimension and entropy across the studied species. The 16S ribosomal RNA sequence entropy correlates with the 23S ribosomal RNA sequence entropy across species with R-square 0.93, N = 18. Entropy values correspond positively with branch lengths of a published phylogeny. The studied archaeal virus sequences have high fractal dimensions of 2.02 or more. A comparison of selected extremophile sequences with archaeal sequences from the Humboldt Marine Ecosystem database (Wood-Hull Oceanography Institute, MIT) suggests the presence of continuous sequence expression as inferred from distributions of entropy and fractal dimension, consistent with the diversity expected in an exobiological archaeal community.
Souza, Renata Carolini; Mendes, Iêda Carvalho; Reis-Junior, Fábio Bueno; Carvalho, Fabíola Marques; Nogueira, Marco Antonio; Vasconcelos, Ana Tereza Ribeiro; Vicente, Vânia Aparecida; Hungria, Mariangela
2016-03-16
The Cerrado--an edaphic type of savannah--comprises the second largest biome of the Brazilian territory and is the main area for grain production in the country, but information about the impact of land conversion to agriculture on microbial diversity is still scarce. We used a shotgun metagenomic approach to compare undisturbed (native) soil and soils cropped for 23 years with soybean/maize under conservation tillage--"no-till" (NT)--and conventional tillage (CT) systems in the Cerrado biome. Soil management and fertilizer inputs with the introduction of agriculture improved chemical properties, but decreased soil macroporosity and microbial biomass of carbon and nitrogen. Principal coordinates analyses confirmed different taxonomic and functional profiles for each treatment. There was predominance of the Bacteria domain, especially the phylum Proteobacteria, with higher numbers of sequences in the NT and CT treatments; Archaea and Viruses also had lower numbers of sequences in the undisturbed soil. Within the Alphaproteobacteria, there was dominance of Rhizobiales and of the genus Bradyrhizobium in the NT and CT systems, attributed to massive inoculation of soybean, and also of Burkholderiales. In contrast, Rhizobium, Azospirillum, Xanthomonas, Pseudomonas and Acidobacterium predominated in the native Cerrado. More Eukaryota, especially of the phylum Ascomycota were detected in the NT. The functional analysis revealed lower numbers of sequences in the five dominant categories for the CT system, whereas the undisturbed Cerrado presented higher abundance. High impact of agriculture in taxonomic and functional microbial diversity in the biome Cerrado was confirmed. Functional diversity was not necessarily associated with taxonomic diversity, as the less conservationist treatment (CT) presented increased taxonomic sequences and reduced functional profiles, indicating a strategy to try to maintain soil functioning by favoring taxa that are probably not the most efficient for some functions. Our results highlight that underneath the rustic appearance of the Cerrado vegetation there is a fragile soil microbial community.
Samanta, Brajogopal; Bhadury, Punyasloke
2016-01-01
Marine chromophytes are taxonomically diverse group of algae and contribute approximately half of the total oceanic primary production. To understand the global patterns of functional diversity of chromophytic phytoplankton, robust bioinformatics and statistical analyses including deep phylogeny based on 2476 form ID rbcL gene sequences representing seven ecologically significant oceanographic ecoregions were undertaken. In addition, 12 form ID rbcL clone libraries were generated and analyzed (148 sequences) from Sundarbans Biosphere Reserve representing the world’s largest mangrove ecosystem as part of this study. Global phylogenetic analyses recovered 11 major clades of chromophytic phytoplankton in varying proportions with several novel rbcL sequences in each of the seven targeted ecoregions. Majority of OTUs was found to be exclusive to each ecoregion, whereas some were shared by two or more ecoregions based on beta-diversity analysis. Present phylogenetic and bioinformatics analyses provide a strong statistical support for the hypothesis that different oceanographic regimes harbor distinct and coherent groups of chromophytic phytoplankton. It has been also shown as part of this study that varying natural selection pressure on form ID rbcL gene under different environmental conditions could lead to functional differences and overall fitness of chromophytic phytoplankton populations. PMID:26861415
Species and hybrids in the genus Diaphanosoma Fischer, 1850 (Crustacea: Branchiopoda: Cladocera).
Liu, Ping; Xu, Lei; Xu, Shao-Lin; Martínez, Alejandro; Chen, Hua; Cheng, Dan; Dumont, Henri J; Han, Bo-Ping; Fontaneto, Diego
2018-01-01
Cladocerans are well-studied planktonic crustaceans, especially those of the genus Daphnia in which interesting evolutionary questions have been addressed on speciation processes. The aim of the present study is to demonstrate that other genera of cladocerans show similar levels of cryptic diversity, intraspecific gene flow, and thus become useful model systems for comparison. In order to do so, we chose the genus Diaphanosoma, widespread in tropical and temperate areas. We started with a survey of species diversity in the genus Diaphanosoma in Asia using a morphological approach, then obtained sequences from a mitochondrial and a nuclear marker from multiple individuals of different species, performed tests on DNA taxonomy and molecular phylogenies, and assessed the role of hybridization in explaining the cases of mitonuclear discordance. The results are that cryptic diversity occurs in Diaphanosoma, and mitonuclear discordance was found in about 6% of the sequenced animals. Past hybridization is supported as the most likely explanation for the discordance: no evidence was found of first generation hybrids with heterozygous sequences. Our analysis on patterns of genetic diversity in Diaphanosoma supports similarities and differences with what is known in Daphnia. Copyright © 2017 Elsevier Inc. All rights reserved.
Cousins, Matthew M.; Ou, San-San; Wawer, Maria J.; Munshaw, Supriya; Swan, David; Magaret, Craig A.; Mullis, Caroline E.; Serwadda, David; Porcella, Stephen F.; Gray, Ronald H.; Quinn, Thomas C.; Donnell, Deborah; Eshleman, Susan H.
2012-01-01
Next-generation sequencing (NGS) has recently been used for analysis of HIV diversity, but this method is labor-intensive, costly, and requires complex protocols for data analysis. We compared diversity measures obtained using NGS data to those obtained using a diversity assay based on high-resolution melting (HRM) of DNA duplexes. The HRM diversity assay provides a single numeric score that reflects the level of diversity in the region analyzed. HIV gag and env from individuals in Rakai, Uganda, were analyzed in a previous study using NGS (n = 220 samples from 110 individuals). Three sequence-based diversity measures were calculated from the NGS sequence data (percent diversity, percent complexity, and Shannon entropy). The amplicon pools used for NGS were analyzed with the HRM diversity assay. HRM scores were significantly associated with sequence-based measures of HIV diversity for both gag and env (P < 0.001 for all measures). The level of diversity measured by the HRM diversity assay and NGS increased over time in both regions analyzed (P < 0.001 for all measures except for percent complexity in gag), and similar amounts of diversification were observed with both methods (P < 0.001 for all measures except for percent complexity in gag). Diversity measures obtained using the HRM diversity assay were significantly associated with those from NGS, and similar increases in diversity over time were detected by both methods. The HRM diversity assay is faster and less expensive than NGS, facilitating rapid analysis of large studies of HIV diversity and evolution. PMID:22785188
Gao, Lihai; Lin, Weitie
2011-01-01
In order to study the diversity of ammonia-oxidizing bacteria (AOB) and ammonia-oxidizing archaea (AOA) in shrimp farm sediment. Total microbial DNA was directly extracted from the shrimp farm sediment. The clone library of amoA genes were constructed with beta-Proteobacterial-AOB and AOA specific primers. The library was screened by PCR-restriction fragment length polymorphism (RFLP) analysis and clones with unique RFLP patterns were sequenced. Phylogenetic analyses of the amoA gene fragments showed that all AOB sequences from shrimp farm sediment were affiliated with Nitrosomonas (61.54%) or Nitrosomonas-like (38. 46%) species and grouped into Nitrosomonas communis cluster, Nitrosomonas sp. Nm148 cluster, Nitrosomonas oligotropha cluster. All AOA sequences belonged to the kingdom Crenarchaeote except that one Operational Taxa Unit (OTU) sequence was Unclassified-Archaea and fell within cluster S (soil origin). AOB and AOA species composition included 13 OTUs and 9 OTUs. The clone coverage of bacterial and archaeal amoA genes was 73.47% and 90.43%. The Shannon-Wiener index, Evenness index, Simpson index and Richness index of AOB were higher than those of AOA. These findings represent the first detailed examination of archaeal amoA diversity in shrimp farm sediment and demonstrate that diverse communities of Crenarchaeote capable of ammonia oxidation are present within shrimp farm sediment, where they may be actively involved in nitrification.
How many novel eukaryotic 'kingdoms'? Pitfalls and limitations of environmental DNA surveys
Berney, Cédric; Fahrni, José; Pawlowski, Jan
2004-01-01
Background Over the past few years, the use of molecular techniques to detect cultivation-independent, eukaryotic diversity has proven to be a powerful approach. Based on small-subunit ribosomal RNA (SSU rRNA) gene analyses, these studies have revealed the existence of an unexpected variety of new phylotypes. Some of them represent novel diversity in known eukaryotic groups, mainly stramenopiles and alveolates. Others do not seem to be related to any molecularly described lineage, and have been proposed to represent novel eukaryotic kingdoms. In order to review the evolutionary importance of this novel high-level eukaryotic diversity critically, and to test the potential technical and analytical pitfalls and limitations of eukaryotic environmental DNA surveys (EES), we analysed 484 environmental SSU rRNA gene sequences, including 81 new sequences from sediments of the small river, the Seymaz (Geneva, Switzerland). Results Based on a detailed screening of an exhaustive alignment of eukaryotic SSU rRNA gene sequences and the phylogenetic re-analysis of previously published environmental sequences using Bayesian methods, our results suggest that the number of novel higher-level taxa revealed by previously published EES was overestimated. Three main sources of errors are responsible for this situation: (1) the presence of undetected chimeric sequences; (2) the misplacement of several fast-evolving sequences; and (3) the incomplete sampling of described, but yet unsequenced eukaryotes. Additionally, EES give a biased view of the diversity present in a given biotope because of the difficult amplification of SSU rRNA genes in some taxonomic groups. Conclusions Environmental DNA surveys undoubtedly contribute to reveal many novel eukaryotic lineages, but there is no clear evidence for a spectacular increase of the diversity at the kingdom level. After re-analysis of previously published data, we found only five candidate lineages of possible novel high-level eukaryotic taxa, two of which comprise several phylotypes that were found independently in different studies. To ascertain their taxonomic status, however, the organisms themselves have now to be identified. PMID:15176975
Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying
2014-01-01
A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php.
Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying
2014-01-01
A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php. PMID:24892935
Population Structure in Nontypeable Haemophilus influenzae
LaCross, Nathan C.; Marrs, Carl F.; Gilsdorf, Janet R.
2013-01-01
Nontypeable Haemophilus influenzae (NTHi) frequently colonize the human pharynx asymptomatically, and are an important cause of otitis media in children. Past studies have identified typeable H. influenzae as being clonal, but the population structure of NTHi has not been extensively characterized. The research presented here investigated the diversity and population structure in a well-characterized collection of NTHi isolated from the middle ears of children with otitis media or the pharynges of healthy children in three disparate geographic regions. Multilocus sequence typing identified 109 unique sequence types among 170 commensal and otitis media-associated NTHi isolates from Finland, Israel, and the US. The largest clonal complex contained only five sequence types, indicating a high level of genetic diversity. The eBURST v3, ClonalFrame 1.1, and structure 2.3.3 programs were used to further characterize diversity and population structure from the sequence typing data. Little clustering was apparent by either disease state (otitis media or commensalism) or geography in the ClonalFrame phylogeny. Population structure was clearly evident, with support for eight populations when all 170 isolates were analyzed. Interestingly, one population contained only commensal isolates, while two others consisted solely of otitis media isolates, suggesting associations between population structure and disease. PMID:23266487
Human IgG repertoire of malaria antigen-immunized human immune system (HIS) mice.
Nogueira, Raquel Tayar; Sahi, Vincent; Huang, Jing; Tsuji, Moriya
2017-08-01
Humanized mouse models present an important tool for preclinical evaluation of new vaccines and therapeutics. Here we show the human variable repertoire of antibody sequences cloned from a previously described human immune system (HIS) mouse model that possesses functional human CD4+ T cells and B cells, namely HIS-CD4/B mice. We sequenced variable IgG genes from single memory B-cell and plasma-cell sorted from splenocytes or whole blood lymphocytes of HIS-CD4/B mice that were vaccinated with a human plasmodial antigen, a recombinant Plasmodium falciparum circumsporozoite protein (rPfCSP). We demonstrate that rPfCSP immunization triggers a diverse B-cell IgG repertoire composed of various human VH family genes and distinct V(D)J recombinations that constitute diverse CDR3 sequences similar to humans, although low hypermutated sequences were generated. These results demonstrate the substantial genetic diversity of responding human B cells of HIS-CD4/B mice and their capacity to mount human IgG class-switched antibody response upon vaccination. Copyright © 2017 European Federation of Immunological Societies. Published by Elsevier B.V. All rights reserved.
Reading biological processes from nucleotide sequences
NASA Astrophysics Data System (ADS)
Murugan, Anand
Cellular processes have traditionally been investigated by techniques of imaging and biochemical analysis of the molecules involved. The recent rapid progress in our ability to manipulate and read nucleic acid sequences gives us direct access to the genetic information that directs and constrains biological processes. While sequence data is being used widely to investigate genotype-phenotype relationships and population structure, here we use sequencing to understand biophysical mechanisms. We present work on two different systems. First, in chapter 2, we characterize the stochastic genetic editing mechanism that produces diverse T-cell receptors in the human immune system. We do this by inferring statistical distributions of the underlying biochemical events that generate T-cell receptor coding sequences from the statistics of the observed sequences. This inferred model quantitatively describes the potential repertoire of T-cell receptors that can be produced by an individual, providing insight into its potential diversity and the probability of generation of any specific T-cell receptor. Then in chapter 3, we present work on understanding the functioning of regulatory DNA sequences in both prokaryotes and eukaryotes. Here we use experiments that measure the transcriptional activity of large libraries of mutagenized promoters and enhancers and infer models of the sequence-function relationship from this data. For the bacterial promoter, we infer a physically motivated 'thermodynamic' model of the interaction of DNA-binding proteins and RNA polymerase determining the transcription rate of the downstream gene. For the eukaryotic enhancers, we infer heuristic models of the sequence-function relationship and use these models to find synthetic enhancer sequences that optimize inducibility of expression. Both projects demonstrate the utility of sequence information in conjunction with sophisticated statistical inference techniques for dissecting underlying biophysical mechanisms.
DeMaere, Matthew Z.
2016-01-01
Background Chromosome conformation capture, coupled with high throughput DNA sequencing in protocols like Hi-C and 3C-seq, has been proposed as a viable means of generating data to resolve the genomes of microorganisms living in naturally occuring environments. Metagenomic Hi-C and 3C-seq datasets have begun to emerge, but the feasibility of resolving genomes when closely related organisms (strain-level diversity) are present in the sample has not yet been systematically characterised. Methods We developed a computational simulation pipeline for metagenomic 3C and Hi-C sequencing to evaluate the accuracy of genomic reconstructions at, above, and below an operationally defined species boundary. We simulated datasets and measured accuracy over a wide range of parameters. Five clustering algorithms were evaluated (2 hard, 3 soft) using an adaptation of the extended B-cubed validation measure. Results When all genomes in a sample are below 95% sequence identity, all of the tested clustering algorithms performed well. When sequence data contains genomes above 95% identity (our operational definition of strain-level diversity), a naive soft-clustering extension of the Louvain method achieves the highest performance. Discussion Previously, only hard-clustering algorithms have been applied to metagenomic 3C and Hi-C data, yet none of these perform well when strain-level diversity exists in a metagenomic sample. Our simple extension of the Louvain method performed the best in these scenarios, however, accuracy remained well below the levels observed for samples without strain-level diversity. Strain resolution is also highly dependent on the amount of available 3C sequence data, suggesting that depth of sequencing must be carefully considered during experimental design. Finally, there appears to be great scope to improve the accuracy of strain resolution through further algorithm development. PMID:27843713
Takeet, Michael I; Oyewusi, Adeoye J; Abakpa, Simon A V; Daramola, Olukayode O; Peters, Sunday O
2017-03-01
Adequate knowledge of the genetic diversity among Babesia species infecting dogs is necessary for a better understanding of the epidemiology and control of canine babesiosis. Hence, this study determined the genetic diversity among the Babesia rossi detected in dogs presented for routine examination in Veterinary Hospitals in Abeokuta, Nigeria. Blood were randomly collected from 209 dogs. Field-stained thin smears were made and DNA extracted from the blood. Partial region of the 18S small subunit ribosomal RNA (rRNA) gene was amplified, sequenced and analysed. Babesia species was detected in 16 (7.7%) of the dogs by microscopy. Electrophoresed PCR products from 39 (18.66%) dogs revealed band size of 450 bp and 2 (0.95%) dogs had band size of 430 bp. The sequences obtained from 450 bp amplicon displayed homology of 99.74% (387/388) with partial sequences of 18S rRNA gene of Babesia rossi in the GeneBank. Of the two sequences that had 430 bp amplicon, one was identified as T. annulata and second as T. ovis. A significantly (p<0.05) higher prevalence of B. rossi was detected by PCR compared to microscopy. The mean PCV of Babesia infected dogs was significantly (p<0.05) lower than non-infected dogs. Phylogenetic analysis revealed minimal diversity among B. rossi with the exception of one sequence that was greatly divergent from the others. This study suggests that more than one genotype of B. rossi may be in circulation among the dog population in the study area and this may have potential implication on clinical outcome of canine babesiosis.
Lee, Wonhoon; Park, Jongsun; Choi, Jaeyoung; Jung, Kyongyong; Park, Bongsoo; Kim, Donghan; Lee, Jaeyoung; Ahn, Kyohun; Song, Wonho; Kang, Seogchan; Lee, Yong-Hwan; Lee, Seunghwan
2009-01-01
Background Sequences and organization of the mitochondrial genome have been used as markers to investigate evolutionary history and relationships in many taxonomic groups. The rapidly increasing mitochondrial genome sequences from diverse insects provide ample opportunities to explore various global evolutionary questions in the superclass Hexapoda. To adequately support such questions, it is imperative to establish an informatics platform that facilitates the retrieval and utilization of available mitochondrial genome sequence data. Results The Insect Mitochondrial Genome Database (IMGD) is a new integrated platform that archives the mitochondrial genome sequences from 25,747 hexapod species, including 112 completely sequenced and 20 nearly completed genomes and 113,985 partially sequenced mitochondrial genomes. The Species-driven User Interface (SUI) of IMGD supports data retrieval and diverse analyses at multi-taxon levels. The Phyloviewer implemented in IMGD provides three methods for drawing phylogenetic trees and displays the resulting trees on the web. The SNP database incorporated to IMGD presents the distribution of SNPs and INDELs in the mitochondrial genomes of multiple isolates within eight species. A newly developed comparative SNU Genome Browser supports the graphical presentation and interactive interface for the identified SNPs/INDELs. Conclusion The IMGD provides a solid foundation for the comparative mitochondrial genomics and phylogenetics of insects. All data and functions described here are available at the web site . PMID:19351385
Yilmaz, Pelin; Kottmann, Renzo; Field, Dawn; Knight, Rob; Cole, James R; Amaral-Zettler, Linda; Gilbert, Jack A; Karsch-Mizrachi, Ilene; Johnston, Anjanette; Cochrane, Guy; Vaughan, Robert; Hunter, Christopher; Park, Joonhong; Morrison, Norman; Rocca-Serra, Philippe; Sterk, Peter; Arumugam, Manimozhiyan; Bailey, Mark; Baumgartner, Laura; Birren, Bruce W; Blaser, Martin J; Bonazzi, Vivien; Booth, Tim; Bork, Peer; Bushman, Frederic D; Buttigieg, Pier Luigi; Chain, Patrick S G; Charlson, Emily; Costello, Elizabeth K; Huot-Creasy, Heather; Dawyndt, Peter; DeSantis, Todd; Fierer, Noah; Fuhrman, Jed A; Gallery, Rachel E; Gevers, Dirk; Gibbs, Richard A; Gil, Inigo San; Gonzalez, Antonio; Gordon, Jeffrey I; Guralnick, Robert; Hankeln, Wolfgang; Highlander, Sarah; Hugenholtz, Philip; Jansson, Janet; Kau, Andrew L; Kelley, Scott T; Kennedy, Jerry; Knights, Dan; Koren, Omry; Kuczynski, Justin; Kyrpides, Nikos; Larsen, Robert; Lauber, Christian L; Legg, Teresa; Ley, Ruth E; Lozupone, Catherine A; Ludwig, Wolfgang; Lyons, Donna; Maguire, Eamonn; Methé, Barbara A; Meyer, Folker; Muegge, Brian; Nakielny, Sara; Nelson, Karen E; Nemergut, Diana; Neufeld, Josh D; Newbold, Lindsay K; Oliver, Anna E; Pace, Norman R; Palanisamy, Giriprakash; Peplies, Jörg; Petrosino, Joseph; Proctor, Lita; Pruesse, Elmar; Quast, Christian; Raes, Jeroen; Ratnasingham, Sujeevan; Ravel, Jacques; Relman, David A; Assunta-Sansone, Susanna; Schloss, Patrick D; Schriml, Lynn; Sinha, Rohini; Smith, Michelle I; Sodergren, Erica; Spor, Aymé; Stombaugh, Jesse; Tiedje, James M; Ward, Doyle V; Weinstock, George M; Wendel, Doug; White, Owen; Whiteley, Andrew; Wilke, Andreas; Wortman, Jennifer R; Yatsunenko, Tanya; Glöckner, Frank Oliver
2012-01-01
Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences—the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The ‘environmental packages’ apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere. PMID:21552244
MHC class II genes in European wolves: a comparison with dogs.
Seddon, Jennifer M; Ellegren, Hans
2002-10-01
The genome of the grey wolf, one of the most widely distributed land mammal species, has been subjected to both stochastic factors, including biogeographical subdivision and population fragmentation, and strong selection during the domestication of the dog. To explore the effects of drift and selection on the partitioning of MHC variation in the diversification of species, we present nine DQA, 10 DQB, and 17 DRB1 sequences of the second exon for European wolves and compare them with sequences of North American wolves and dogs. The relatively large number of class II alleles present in both European and North American wolves attests to their large historical population sizes, yet there are few alleles shared between these regions at DQB and DRB1. Similarly, the dog has an extensive array of class II MHC alleles, a consequence of a genetically diverse origin, but allelic overlap with wolves only at DQA. Although we might expect a progression from shared alleles to shared allelic lineages during differentiation, the partitioning of diversity between wolves and dogs at DQB and DRB1 differs from that at DQA. Furthermore, an extensive region of nucleotide sequence shared between DRB1 and DQB alleles and a shared motif suggests intergenic recombination may have contributed to MHC diversity in the Canidae.
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.
Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario
2011-01-01
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Genetic diversity in the feline leukemia virus gag gene.
Kawamura, Maki; Watanabe, Shinya; Odahara, Yuka; Nakagawa, So; Endo, Yasuyuki; Tsujimoto, Hajime; Nishigaki, Kazuo
2015-06-02
Feline leukemia virus (FeLV) belongs to the Gammaretrovirus genus and is horizontally transmitted among cats. FeLV is known to undergo recombination with endogenous retroviruses already present in the host during FeLV-subgroup A infection. Such recombinant FeLVs, designated FeLV-subgroup B or FeLV-subgroup D, can be generated by transduced endogenous retroviral env sequences encoding the viral envelope. These recombinant viruses have biologically distinct properties and may mediate different disease outcomes. The generation of such recombinant viruses resulted in structural diversity of the FeLV particle and genetic diversity of the virus itself. FeLV env diversity through mutation and recombination has been studied, while gag diversity and its possible effects are less well understood. In this study, we investigated recombination events in the gag genes of FeLVs isolated from naturally infected cats and reference isolates. Recombination and phylogenetic analyses indicated that the gag genes often contain endogenous FeLV sequences and were occasionally replaced by entire endogenous FeLV gag genes. Phylogenetic reconstructions of FeLV gag sequences allowed for classification into three distinct clusters, similar to those previously established for the env gene. Analysis of the recombination junctions in FeLV gag indicated that these variants have similar recombination patterns within the same genotypes, indicating that the recombinant viruses were horizontally transmitted among cats. It remains to be investigated whether the recombinant sequences affect the molecular mechanism of FeLV transmission. These findings extend our understanding of gammaretrovirus evolutionary patterns in the field. Copyright © 2015 Elsevier B.V. All rights reserved.
Knief, Claudia
2015-01-01
Methane-oxidizing bacteria are characterized by their capability to grow on methane as sole source of carbon and energy. Cultivation-dependent and -independent methods have revealed that this functional guild of bacteria comprises a substantial diversity of organisms. In particular the use of cultivation-independent methods targeting a subunit of the particulate methane monooxygenase (pmoA) as functional marker for the detection of aerobic methanotrophs has resulted in thousands of sequences representing “unknown methanotrophic bacteria.” This limits data interpretation due to restricted information about these uncultured methanotrophs. A few groups of uncultivated methanotrophs are assumed to play important roles in methane oxidation in specific habitats, while the biology behind other sequence clusters remains still largely unknown. The discovery of evolutionary related monooxygenases in non-methanotrophic bacteria and of pmoA paralogs in methanotrophs requires that sequence clusters of uncultivated organisms have to be interpreted with care. This review article describes the present diversity of cultivated and uncultivated aerobic methanotrophic bacteria based on pmoA gene sequence diversity. It summarizes current knowledge about cultivated and major clusters of uncultivated methanotrophic bacteria and evaluates habitat specificity of these bacteria at different levels of taxonomic resolution. Habitat specificity exists for diverse lineages and at different taxonomic levels. Methanotrophic genera such as Methylocystis and Methylocaldum are identified as generalists, but they harbor habitat specific methanotrophs at species level. This finding implies that future studies should consider these diverging preferences at different taxonomic levels when analyzing methanotrophic communities. PMID:26696968
Bào, Yīmíng; Amarasinghe, Gaya K; Basler, Christopher F; Bavari, Sina; Bukreyev, Alexander; Chandran, Kartik; Dolnik, Olga; Dye, John M; Ebihara, Hideki; Formenty, Pierre; Hewson, Roger; Kobinger, Gary P; Leroy, Eric M; Mühlberger, Elke; Netesov, Sergey V; Patterson, Jean L; Paweska, Janusz T; Smither, Sophie J; Takada, Ayato; Towner, Jonathan S; Volchkov, Viktor E; Wahl-Jensen, Victoria; Kuhn, Jens H
2017-05-11
The mononegaviral family Filoviridae has eight members assigned to three genera and seven species. Until now, genus and species demarcation were based on arbitrarily chosen filovirus genome sequence divergence values (≈50% for genera, ≈30% for species) and arbitrarily chosen phenotypic virus or virion characteristics. Here we report filovirus genome sequence-based taxon demarcation criteria using the publicly accessible PAirwise Sequencing Comparison (PASC) tool of the US National Center for Biotechnology Information (Bethesda, MD, USA). Comparison of all available filovirus genomes in GenBank using PASC revealed optimal genus demarcation at the 55-58% sequence diversity threshold range for genera and at the 23-36% sequence diversity threshold range for species. Because these thresholds do not change the current official filovirus classification, these values are now implemented as filovirus taxon demarcation criteria that may solely be used for filovirus classification in case additional data are absent. A near-complete, coding-complete, or complete filovirus genome sequence will now be required to allow official classification of any novel "filovirus." Classification of filoviruses into existing taxa or determining the need for novel taxa is now straightforward and could even become automated using a presented algorithm/flowchart rooted in RefSeq (type) sequences.
Shubin, Li; Juan, Huang; RenChao, Zhou; ShiRu, Xu; YuanXiao, Jin
2014-01-01
In the present study, the terminal-restriction fragment length polymorphism (T-RFLP) technique, combined with the use of a clone library, was applied to assess the baseline diversity of fungal endophyte communities associated with rhizomes of Alpinia officinarum Hance, a medicinal plant with a long history of use. A total of 46 distinct T-RFLP fragment peaks were detected using HhaI or MspI mono-digestion-targeted, amplified fungal rDNA ITS sequences from A. officinarum rhizomes. Cloning and sequencing of representative sequences resulted in the detection of members of 10 fungal genera: Pestalotiopsis, Sebacina, Penicillium, Marasmius, Fusarium, Exserohilum, Mycoleptodiscus, Colletotrichum, Meyerozyma, and Scopulariopsis. The T-RFLP profiles revealed an influence of growth year of the host plant on fungal endophyte communities in rhizomes of this plant species; whereas, the geographic location where A. officinarum was grown contributed to only limited variation in the fungal endophyte communities of the host tissue. Furthermore, non-metric multidimensional scaling (NMDS) analysis across all of the rhizome samples showed that the fungal endophyte community assemblages in the rhizome samples could be grouped according to the presence of two types of active indicator chemicals: total volatile oils and galangin. Our present results, for the first time, address a diverse fungal endophyte community is able to internally colonize the rhizome tissue of A. officinarum. The diversity of the fungal endophytes found in the A. officinarum rhizome appeared to be closely correlated with the accumulation of active chemicals in the host plant tissue. The present study also provides the first systematic overview of the fungal endophyte communities in plant rhizome tissue using a culture-independent method. PMID:25536070
Shubin, Li; Juan, Huang; RenChao, Zhou; ShiRu, Xu; YuanXiao, Jin
2014-01-01
In the present study, the terminal-restriction fragment length polymorphism (T-RFLP) technique, combined with the use of a clone library, was applied to assess the baseline diversity of fungal endophyte communities associated with rhizomes of Alpinia officinarum Hance, a medicinal plant with a long history of use. A total of 46 distinct T-RFLP fragment peaks were detected using HhaI or MspI mono-digestion-targeted, amplified fungal rDNA ITS sequences from A. officinarum rhizomes. Cloning and sequencing of representative sequences resulted in the detection of members of 10 fungal genera: Pestalotiopsis, Sebacina, Penicillium, Marasmius, Fusarium, Exserohilum, Mycoleptodiscus, Colletotrichum, Meyerozyma, and Scopulariopsis. The T-RFLP profiles revealed an influence of growth year of the host plant on fungal endophyte communities in rhizomes of this plant species; whereas, the geographic location where A. officinarum was grown contributed to only limited variation in the fungal endophyte communities of the host tissue. Furthermore, non-metric multidimensional scaling (NMDS) analysis across all of the rhizome samples showed that the fungal endophyte community assemblages in the rhizome samples could be grouped according to the presence of two types of active indicator chemicals: total volatile oils and galangin. Our present results, for the first time, address a diverse fungal endophyte community is able to internally colonize the rhizome tissue of A. officinarum. The diversity of the fungal endophytes found in the A. officinarum rhizome appeared to be closely correlated with the accumulation of active chemicals in the host plant tissue. The present study also provides the first systematic overview of the fungal endophyte communities in plant rhizome tissue using a culture-independent method.
Wong, Lai-Ping; Lai, Jason Kuan-Han; Saw, Woei-Yuh; Ong, Rick Twee-Hee; Cheng, Anthony Youzhi; Pillai, Nisha Esakimuthu; Liu, Xuanyao; Xu, Wenting; Chen, Peng; Foo, Jia-Nee; Tan, Linda Wei-Lin; Koo, Seok-Hwee; Soong, Richie; Wenk, Markus Rene; Lim, Wei-Yen; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying
2014-05-01
South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.
Briney, Bryan S; Willis, Jordan R; Hicar, Mark D; Thomas, James W; Crowe, James E
2012-09-01
Antibody heavy-chain recombination that results in the incorporation of multiple diversity (D) genes, although uncommon, contributes substantially to the diversity of the human antibody repertoire. Such recombination allows the generation of heavy chain complementarity determining region 3 (HCDR3) regions of extreme length and enables junctional regions that, because of the nucleotide bias of N-addition regions, are difficult to produce through normal V(D)J recombination. Although this non-classical recombination process has been observed infrequently, comprehensive analysis of the frequency and genetic characteristics of such events in the human peripheral blood antibody repertoire has not been possible because of the rarity of such recombinants and the limitations of traditional sequencing technologies. Here, through the use of high-throughput sequencing of the normal human peripheral blood antibody repertoire, we analysed the frequency and genetic characteristics of V(DD)J recombinants. We found that these recombinations were present in approximately 1 in 800 circulating B cells, and that the frequency was severely reduced in memory cell subsets. We also found that V(DD)J recombination can occur across the spectrum of diversity genes, indicating that virtually all recombination signal sequences that flank diversity genes are amenable to V(DD)J recombination. Finally, we observed a repertoire bias in the diversity gene repertoire at the upstream (5') position, and discovered that this bias was primarily attributable to the order of diversity genes in the genomic locus. © 2012 The Authors. Immunology © 2012 Blackwell Publishing Ltd.
High Diversity of CTX-M Extended-Spectrum β-Lactamases in Municipal Wastewater and Urban Wetlands
Borgogna, Timothy R.; Borgogna, Joanna-Lynn; Mielke, Jenna A.; Brown, Celeste J.; Top, Eva M.; Botts, Ryan T.
2016-01-01
The CTX-M-type extended-spectrum β-lactamases (ESBLs) present a serious public health threat as they have become nearly ubiquitous among clinical gram-negative pathogens, particularly the enterobacteria. To aid in the understanding and eventual control of the spread of such resistance genes, we sought to determine the diversity of CTX-M ESBLs not among clinical isolates, but in the environment, where weaker and more diverse selective pressures may allow greater enzyme diversification. This was done by examining the CTX-M diversity in municipal wastewater and urban coastal wetlands in southern California, United States, by Sanger sequencing of polymerase chain reaction amplicons. Of the five known CTX-M phylogroups (1, 2, 8, 9, and 25), only genes from groups 1 and 2 were detected in both wastewater treatment plants (WWTPs), and group 1 genes were also detected in one of the two wetlands after a winter rain. The highest relative abundance of blaCTX-M group 1 genes was in the sludge of one WWTP (2.1 × 10−4 blaCTX-M copies/16S rRNA gene copy). Gene libraries revealed surprisingly high nucleotide sequence diversity, with 157 new variants not found in GenBank, representing 99 novel amino acid sequences. Our results indicate that the resistomes of WWTPs and urban wetlands contain diverse blaCTX-M ESBLs, which may constitute a mobile reservoir of clinically relevant resistance genes. PMID:26670020
Exploring bacterial diversity in hospital environments by GS-FLX Titanium pyrosequencing.
Poza, Margarita; Gayoso, Carmen; Gómez, Manuel J; Rumbo-Feal, Soraya; Tomás, María; Aranda, Jesús; Fernández, Ana; Bou, Germán
2012-01-01
Understanding microbial populations in hospital environments is crucial for improving human health. Hospital-acquired infections are an increasing problem in intensive care units (ICU). In this work we present an exploration of bacterial diversity at inanimate surfaces of the ICU wards of the University Hospital A Coruña (Spain), as an example of confined hospital environment subjected to selective pressure, taking the entrance hall of the hospital, an open and crowded environment, as reference. Surface swab samples were collected from both locations and recovered DNA used as template to amplify a hypervariable region of the bacterial 16S rRNA gene. Sequencing of the amplicons was performed at the Roche 454 Sequencing Center using GS-FLX Titanium procedures. Reads were pre-processed and clustered into OTUs (operational taxonomic units), which were further classified. A total of 16 canonical bacterial phyla were detected in both locations. Members of the phyla Firmicutes (mainly Staphylococcus and Streptococcus) and Actinobacteria (mainly Micrococcaceae, Corynebacteriaceae and Brevibacteriaceae) were over-represented in the ICU with respect to the Hall. The phyllum Proteobacteria was also well represented in the ICU, mainly by members of the families Enterobacteriaceae, Methylobacteriaceae and Sphingomonadaceae. In the Hall sample, the phyla Proteobacteria, Bacteroidetes, Deinococcus-Thermus and Cyanobacteria were over-represented with respect to the ICU. Over-representation of Proteobacteria was mainly due to the high abundance of Enterobacteriaceae members. The presented results demonstrate that bacterial diversity differs at the ICU and entrance hall locations. Reduced diversity detected at ICU, relative to the entrance hall, can be explained by its confined character and by the existence of antimicrobial selective pressure. This is the first study using deep sequencing techniques made in hospital wards showing substantial hospital microbial diversity.
Dinoflagellates associated with freshwater sponges from the ancient lake baikal.
Annenkova, Natalia V; Lavrov, Dennis V; Belikov, Sergey I
2011-04-01
Dinoflagellates are a diverse group of protists that are common in both marine and freshwater environments. While the biology of marine dinoflagellates has been the focus of several recent studies, their freshwater relatives remain little-investigated. In the present study we explore the diversity of dinoflagellates in Lake Baikal by identifying and analyzing dinoflagellate sequences for 18S rDNA and ITS-2 from total DNA extracted from three species of endemic Baikalian sponges (Baikalospongia intermedia,Baikalospongia rectaand Lubomirskia incrustans). Phylogenetic analyses of these sequences revealed extensive dinoflagellate diversity in Lake Baikal. We found two groups of sequences clustering within the order Suessiales, known for its symbiotic relationships with various invertebrates. Thus they may be regarded as potential symbionts of Baikalian sponges. In addition,Gyrodinium helveticum, representatives from the genus Gymnodinium, dinoflagellates close to the family Pfiesteriaceae, and a few dinoflagellates without definite affiliation were detected. No pronounced difference in the distribution of dinoflagellates among the studied sponges was found, except for the absence of the Piscinoodinium-like dinoflagellates inL. incrustans. To the best of our knowledge, this is the first study of the diversity of dinoflagellates in freshwater sponges, the first systematic investigation of dinoflagellate molecular diversity in Lake Baikal and the first finding of members of the order Suessiales as symbionts of freshwater invertebrates. Copyright © 2010 Elsevier GmbH. All rights reserved.
Genotypic analysis of Mucor from the platypus in Australia.
Connolly, J H; Stodart, B J; Ash, G J
2010-01-01
Mucor amphibiorum is the only pathogen known to cause significant morbidity and mortality in the free-living platypus (Ornithorhynchus anatinus) in Tasmania. Infection has also been reported in free-ranging cane toads (Bufo marinus) and green tree frogs (Litoria caerulea) from mainland Australia but has not been confirmed in platypuses from the mainland. To date, there has been little genotyping specifically conducted on M. amphibiorum. A collection of 21 Mucor isolates representing isolates from the platypus, frogs and toads, and environmental samples were obtained for genotypic analysis. Internal transcribed spacer (ITS) region sequencing and GenBank comparison confirmed the identity of most of the isolates. Representative isolates from infected platypuses formed a clade containing the reference isolates of M. amphibiorum from the Centraal Bureau voor Schimmelcultures repository. The M. amphibiorum isolates showed a close sequence identity with Mucor indicus and consisted of two haplotypes, differentiated by single nucleotide polymorphisms within the ITS1 and ITS2 regions. With the exception of isolate 96-4049, all isolates from platypuses were in one haplotype. Multilocus fingerprinting via the use of intersimple sequence repeats polymerase chain reaction identified 19 genotypes. Two major clusters were evident: 1) M. amphibiorum and Mucor racemosus; and 2) Mucor circinelloides, Mucor ramosissimus, and Mucor fragilis. Seven M. amphibiorum isolates from platypuses were present in two subclusters, with isolate 96-4053 appearing genetically distinct from all other isolates. Isolates classified as M. circinelloides by sequence analysis formed a separate subcluster, distinct from other Mucor spp. The combination of sequencing and multilocus fingerprinting has the potential to provide the tools for rapid identification of M. amphibiorum. Data presented on the diversity of the pathogen and further work in linking genetic diversity to functional diversity will provide critical information for its management in Tasmanian river systems.
Prevalence and Identity of Taenia multiceps cysts "Coenurus cerebralis" in Sheep in Egypt.
Amer, Said; ElKhatam, Ahmed; Fukuda, Yasuhiro; Bakr, Lamia I; Zidan, Shereif; Elsify, Ahmed; Mohamed, Mostafa A; Tada, Chika; Nakai, Yutaka
2017-12-01
Coenurosis is a parasitic disease caused by the larval stage (Coenurus cerebralis) of the canids cestode Taenia multiceps. C. cerebralis particularly infects sheep and goats, and pose a public health concerns. The present study aimed to determine the occurrence and molecular identity of C. cerebralis infecting sheep in Egypt. Infection rate was determined by postmortem inspection of heads of the cases that showed neurological manifestations. Species identification and genetic diversity were analyzed based on PCR-sequence analysis of nuclear ITS1 and mitochondrial cytochrome oxidase (COI) and nicotinamide adenine dinucleotide dehydrogenase (ND1) gene markers. Out of 3668 animals distributed in 50 herds at localities of Ashmoun and El Sadat cities, El Menoufia Province, Egypt, 420 (11.45%) sheep showed neurological disorders. Postmortem examination of these animals after slaughter at local abattoirs indicated to occurrence of C. cerebralis cysts in the brain of 111 out of 420 (26.4%), with overall infection rate 3.03% of the involved sheep population. Molecular analysis of representative samples of coenuri at ITS1 gene marker showed extensive intra- and inter-sequence diversity due to deletions/insertions in the microsatellite regions. On contrast to the nuclear gene marker, considerably low genetic diversity was seen in the analyzed mitochondrial gene markers. Phylogenetic analysis based on COI and ND1 gene sequences indicated that the generated sequences in the present study and the reference sequences in the database clustered in 4 haplogroups, with more or less similar topologies. Clustering pattern of the phylogenetic tree showed no effect for the geographic location or the host species. Copyright © 2017 Elsevier B.V. All rights reserved.
de Oliveira, Thais C.; Rodrigues, Priscila T.; Menezes, Maria José; Gonçalves-Lopes, Raquel M.; Bastos, Melissa S.; Lima, Nathália F.; Barbosa, Susana; Gerber, Alexandra L.; Loss de Morais, Guilherme; Berná, Luisa; Phelan, Jody; Robello, Carlos; de Vasconcelos, Ana Tereza R.
2017-01-01
Background The Americas were the last continent colonized by humans carrying malaria parasites. Plasmodium falciparum from the New World shows very little genetic diversity and greater linkage disequilibrium, compared with its African counterparts, and is clearly subdivided into local, highly divergent populations. However, limited available data have revealed extensive genetic diversity in American populations of another major human malaria parasite, P. vivax. Methods We used an improved sample preparation strategy and next-generation sequencing to characterize 9 high-quality P. vivax genome sequences from northwestern Brazil. These new data were compared with publicly available sequences from recently sampled clinical P. vivax isolates from Brazil (BRA, total n = 11 sequences), Peru (PER, n = 23), Colombia (COL, n = 31), and Mexico (MEX, n = 19). Principal findings/Conclusions We found that New World populations of P. vivax are as diverse (nucleotide diversity π between 5.2 × 10−4 and 6.2 × 10−4) as P. vivax populations from Southeast Asia, where malaria transmission is substantially more intense. They display several non-synonymous nucleotide substitutions (some of them previously undescribed) in genes known or suspected to be involved in antimalarial drug resistance, such as dhfr, dhps, mdr1, mrp1, and mrp-2, but not in the chloroquine resistance transporter ortholog (crt-o) gene. Moreover, P. vivax in the Americas is much less geographically substructured than local P. falciparum populations, with relatively little between-population genome-wide differentiation (pairwise FST values ranging between 0.025 and 0.092). Finally, P. vivax populations show a rapid decline in linkage disequilibrium with increasing distance between pairs of polymorphic sites, consistent with very frequent outcrossing. We hypothesize that the high diversity of present-day P. vivax lineages in the Americas originated from successive migratory waves and subsequent admixture between parasite lineages from geographically diverse sites. Further genome-wide analyses are required to test the demographic scenario suggested by our data. PMID:28759591
pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.
Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael
2013-08-01
With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Huang, Kailong; Zhang, Xu-Xiang; Shi, Peng; Wu, Bing; Ren, Hongqiang
2014-11-01
In order to comprehensively investigate bacterial virulence in drinking water, 454 pyrosequencing and Illumina high-throughput sequencing were used to detect potential pathogenic bacteria and virulence factors (VFs) in a full-scale drinking water treatment and distribution system. 16S rRNA gene pyrosequencing revealed high bacterial diversity in the drinking water (441-586 operational taxonomic units). Bacterial diversity decreased after chlorine disinfection, but increased after pipeline distribution. α-Proteobacteria was the most dominant taxonomic class. Alignment against the established pathogen database showed that several types of putative pathogens were present in the drinking water and Pseudomonas aeruginosa had the highest abundance (over 11‰ of total sequencing reads). Many pathogens disappeared after chlorine disinfection, but P. aeruginosa and Leptospira interrogans were still detected in the tap water. High-throughput sequencing revealed prevalence of various pathogenicity islands and virulence proteins in the drinking water, and translocases, transposons, Clp proteases and flagellar motor switch proteins were the predominant VFs. Both diversity and abundance of the detectable VFs increased after the chlorination, and decreased after the pipeline distribution. This study indicates that joint use of 454 pyrosequencing and Illumina sequencing can comprehensively characterize environmental pathogenesis, and several types of putative pathogens and various VFs are prevalent in drinking water. Copyright © 2014 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Results of the present study reveal that members of the Fusarium incarnatum-equiseti (FIESC) and F. chlamydosporum species complexes (FCSC) collectively account for approximately 15% of all fusarial infections of humans and other animals within the U. S. Moreover, the diverse toxins these fungi pro...
High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.
Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A
2006-08-01
The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.
Pissard, A; Ghislain, M; Bertin, P
2006-01-01
The Andean tuber-bearing species, Oxalis tuberosa Mol., is a vegetatively propagated crop cultivated in the uplands of the Andes. Its genetic diversity was investigated in the present study using the inter-simple sequence repeat (ISSR) technique. Thirty-two accessions originating from South America (Argentina, Bolivia, Chile, and Peru) and maintained in vitro were chosen to represent the ecogeographic diversity of its cultivation area. Twenty-two primers were tested and 9 were selected according to fingerprinting quality and reproducibility. Genetic diversity analysis was performed with 90 markers. Jaccard's genetic distance between accessions ranged from 0 to 0.49 with an average of 0.28 +/- 0.08 (mean +/- SD). Dendrogram (UPGMA (unweighted pair-group method with arithmetic averaging)) and factorial correspondence analysis (FCA) showed that the genetic structure was influenced by the collection site. The two most distant clusters contained all of the Peruvian accessions, one from Bolivia, none from Argentina or Chile. Analysis by country revealed that Peru presented the greatest genetic distances from the other countries and possessed the highest intra-country genetic distance (0.30 +/- 0.08). This suggests that the Peruvian oca accessions form a distinct genetic group. The relatively low level of genetic diversity in the oca species may be related to its predominating reproduction strategy, i.e., vegetative propagation. The extent and structure of the genetic diversity of the species detailed here should help the establishment of conservation strategies.
Statistical inference of the generation probability of T-cell receptors from sequence repertoires.
Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G
2012-10-02
Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
Identification, validation and high-throughput genotyping of transcribed gene SNPs in cassava.
Ferguson, Morag E; Hearne, Sarah J; Close, Timothy J; Wanamaker, Steve; Moskal, William A; Town, Christopher D; de Young, Joe; Marri, Pradeep Reddy; Rabbi, Ismail Yusuf; de Villiers, Etienne P
2012-03-01
The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina's GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.
mtDNA sequence diversity of Hazara ethnic group from Pakistan.
Rakha, Allah; Fatima; Peng, Min-Sheng; Adan, Atif; Bi, Rui; Yasmin, Memona; Yao, Yong-Gang
2017-09-01
The present study was undertaken to investigate mitochondrial DNA (mtDNA) control region sequences of Hazaras from Pakistan, so as to generate mtDNA reference database for forensic casework in Pakistan and to analyze phylogenetic relationship of this particular ethnic group with geographically proximal populations. Complete mtDNA control region (nt 16024-576) sequences were generated through Sanger Sequencing for 319 Hazara individuals from Quetta, Baluchistan. The population sample set showed a total of 189 distinct haplotypes, belonging mainly to West Eurasian (51.72%), East & Southeast Asian (29.78%) and South Asian (18.50%) haplogroups. Compared with other populations from Pakistan, the Hazara population had a relatively high haplotype diversity (0.9945) and a lower random match probability (0.0085). The dataset has been incorporated into EMPOP database under accession number EMP00680. The data herein comprises the largest, and likely most thoroughly examined, control region mtDNA dataset from Hazaras of Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.
Oluwayelu, D O; Todd, D; Olaleye, O D
2008-12-01
This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
PHYLOSCANNER: Inferring Transmission from Within- and Between-Host Pathogen Genetic Diversity
Hall, Matthew; Ratmann, Oliver; Bonsall, David; Golubchik, Tanya; de Cesare, Mariateresa; Gall, Astrid; Cornelissen, Marion; Fraser, Christophe
2018-01-01
Abstract A central feature of pathogen genomics is that different infectious particles (virions and bacterial cells) within an infected individual may be genetically distinct, with patterns of relatedness among infectious particles being the result of both within-host evolution and transmission from one host to the next. Here, we present a new software tool, phyloscanner, which analyses pathogen diversity from multiple infected hosts. phyloscanner provides unprecedented resolution into the transmission process, allowing inference of the direction of transmission from sequence data alone. Multiply infected individuals are also identified, as they harbor subpopulations of infectious particles that are not connected by within-host evolution, except where recombinant types emerge. Low-level contamination is flagged and removed. We illustrate phyloscanner on both viral and bacterial pathogens, namely HIV-1 sequenced on Illumina and Roche 454 platforms, HCV sequenced with the Oxford Nanopore MinION platform, and Streptococcus pneumoniae with sequences from multiple colonies per individual. phyloscanner is available from https://github.com/BDI-pathogens/phyloscanner. PMID:29186559
de Vries, Ronald P; Riley, Robert; Wiebenga, Ad; Aguilar-Osorio, Guillermo; Amillis, Sotiris; Uchima, Cristiane Akemi; Anderluh, Gregor; Asadollahi, Mojtaba; Askin, Marion; Barry, Kerrie; Battaglia, Evy; Bayram, Özgür; Benocci, Tiziano; Braus-Stromeyer, Susanna A; Caldana, Camila; Cánovas, David; Cerqueira, Gustavo C; Chen, Fusheng; Chen, Wanping; Choi, Cindy; Clum, Alicia; Dos Santos, Renato Augusto Corrêa; Damásio, André Ricardo de Lima; Diallinas, George; Emri, Tamás; Fekete, Erzsébet; Flipphi, Michel; Freyberg, Susanne; Gallo, Antonia; Gournas, Christos; Habgood, Rob; Hainaut, Matthieu; Harispe, María Laura; Henrissat, Bernard; Hildén, Kristiina S; Hope, Ryan; Hossain, Abeer; Karabika, Eugenia; Karaffa, Levente; Karányi, Zsolt; Kraševec, Nada; Kuo, Alan; Kusch, Harald; LaButti, Kurt; Lagendijk, Ellen L; Lapidus, Alla; Levasseur, Anthony; Lindquist, Erika; Lipzen, Anna; Logrieco, Antonio F; MacCabe, Andrew; Mäkelä, Miia R; Malavazi, Iran; Melin, Petter; Meyer, Vera; Mielnichuk, Natalia; Miskei, Márton; Molnár, Ákos P; Mulé, Giuseppina; Ngan, Chew Yee; Orejas, Margarita; Orosz, Erzsébet; Ouedraogo, Jean Paul; Overkamp, Karin M; Park, Hee-Soo; Perrone, Giancarlo; Piumi, Francois; Punt, Peter J; Ram, Arthur F J; Ramón, Ana; Rauscher, Stefan; Record, Eric; Riaño-Pachón, Diego Mauricio; Robert, Vincent; Röhrig, Julian; Ruller, Roberto; Salamov, Asaf; Salih, Nadhira S; Samson, Rob A; Sándor, Erzsébet; Sanguinetti, Manuel; Schütze, Tabea; Sepčić, Kristina; Shelest, Ekaterina; Sherlock, Gavin; Sophianopoulou, Vicky; Squina, Fabio M; Sun, Hui; Susca, Antonia; Todd, Richard B; Tsang, Adrian; Unkles, Shiela E; van de Wiele, Nathalie; van Rossen-Uffink, Diana; Oliveira, Juliana Velasco de Castro; Vesth, Tammi C; Visser, Jaap; Yu, Jae-Hyuk; Zhou, Miaomiao; Andersen, Mikael R; Archer, David B; Baker, Scott E; Benoit, Isabelle; Brakhage, Axel A; Braus, Gerhard H; Fischer, Reinhard; Frisvad, Jens C; Goldman, Gustavo H; Houbraken, Jos; Oakley, Berl; Pócsi, István; Scazzocchio, Claudio; Seiboth, Bernhard; vanKuyk, Patricia A; Wortman, Jennifer; Dyer, Paul S; Grigoriev, Igor V
2017-02-14
The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eight aspergilli have already been explored to investigate aspects of fungal biology, raising questions about evolution and specialization within this genus. We have generated genome sequences for ten novel, highly diverse Aspergillus species and compared these in detail to sister and more distant genera. Comparative studies of key aspects of fungal biology, including primary and secondary metabolism, stress response, biomass degradation, and signal transduction, revealed both conservation and diversity among the species. Observed genomic differences were validated with experimental studies. This revealed several highlights, such as the potential for sex in asexual species, organic acid production genes being a key feature of black aspergilli, alternative approaches for degrading plant biomass, and indications for the genetic basis of stress response. A genome-wide phylogenetic analysis demonstrated in detail the relationship of the newly genome sequenced species with other aspergilli. Many aspects of biological differences between fungal species cannot be explained by current knowledge obtained from genome sequences. The comparative genomics and experimental study, presented here, allows for the first time a genus-wide view of the biological diversity of the aspergilli and in many, but not all, cases linked genome differences to phenotype. Insights gained could be exploited for biotechnological and medical applications of fungi.
Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; ...
2015-05-12
Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with anymore » transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.
Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with anymore » transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.« less
Reduced Mtdna Diversity in the Ngobe Amerinds of Panama
Kolman, C. J.; Bermingham, E.; Cooke, R.; Ward, R. H.; Arias, T. D.; Guionneau-Sinclair, F.
1995-01-01
Mitochondrial DNA (mtDNA) haplotype diversity was determined for 46 Ngobe Amerinds sampled widely across their geographic range in western Panama. The Ngobe data were compared with mtDNA control region I sequences from two additional Amerind groups located at the northern and southern extremes of Amerind distribution, the Nuu-Chah-Nulth of the Pacific Northwest and the Chilean Mapuche and from one Na-Dene group, the Haida of the Pacific Northwest. The Ngobe exhibit the lowest mtDNA control region sequence diversity yet reported for an Amerind group. Moreover, they carry only two of the four Amerind founding lineages first described by Wallace and coworkers. We posit that the Ngobe passed through a population bottleneck caused by ethnogenesis from a small founding population and/or European conquest and colonization. Dating of the Ngobe population expansion using the HARPENDING et al. approach to the analysis of pairwise genetic differences indicates a Ngobe expansion at roughly 6800 years before present (range: 1850-14,000 years before present), a date more consistent with a bottleneck at Chibcha ethnogenesis than a conquest-based event. PMID:7635293
Barik, Suvakanta; SarkarDas, Shabari; Singh, Archita; Gautam, Vibhav; Kumar, Pramod; Majee, Manoj; Sarkar, Ananda K
2014-01-01
Similar to the majority of the microRNAs, mature miR166s are derived from multiple members of MIR166 genes (precursors) and regulate various aspects of plant development by negatively regulating their target genes (Class III HD-ZIP). The evolutionary conservation or functional diversification of miRNA166 family members remains elusive. Here, we show the phylogenetic relationships among MIR166 precursor and mature sequences from three diverse model plant species. Despite strong conservation, some mature miR166 sequences, such as ppt-miR166m, have undergone sequence variation. Critical sequence variation in ppt-miR166m has led to functional diversification, as it targets non-HD-ZIPIII gene transcript (s). MIR166 precursor sequences have diverged in a lineage specific manner, and both precursors and mature osa-miR166i/j are highly conserved. Interestingly, polycistronic MIR166s were present in Physcomitrella and Oryza but not in Arabidopsis. The nature of cis-regulatory motifs on the upstream promoter sequences of MIR166 genes indicates their possible contribution to the functional variation observed among miR166 species. Copyright © 2013 Elsevier Inc. All rights reserved.
Impact of sequencing depth on the characterization of the microbiome and resistome.
Zaheer, Rahat; Noyes, Noelle; Ortega Polo, Rodrigo; Cook, Shaun R; Marinier, Eric; Van Domselaar, Gary; Belk, Keith E; Morley, Paul S; McAllister, Tim A
2018-04-12
Developments in high-throughput next generation sequencing (NGS) technology have rapidly advanced the understanding of overall microbial ecology as well as occurrence and diversity of specific genes within diverse environments. In the present study, we compared the ability of varying sequencing depths to generate meaningful information about the taxonomic structure and prevalence of antimicrobial resistance genes (ARGs) in the bovine fecal microbial community. Metagenomic sequencing was conducted on eight composite fecal samples originating from four beef cattle feedlots. Metagenomic DNA was sequenced to various depths, D1, D0.5 and D0.25, with average sample read counts of 117, 59 and 26 million, respectively. A comparative analysis of the relative abundance of reads aligning to different phyla and antimicrobial classes indicated that the relative proportions of read assignments remained fairly constant regardless of depth. However, the number of reads being assigned to ARGs as well as to microbial taxa increased significantly with increasing depth. We found a depth of D0.5 was suitable to describe the microbiome and resistome of cattle fecal samples. This study helps define a balance between cost and required sequencing depth to acquire meaningful results.
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
Population genetic of seahorses is confidently influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses under complex marine environments.
Liu, Jiang; Deng, Jun-cai; Yang, Cai-qiong; Huang, Ni; Chang, Xiao-li; Zhang, Jing; Yang, Feng; Liu, Wei-guo; Wang, Xiao-chun; Yong, Tai-wen; Du, Jun-bo; Shu, Kai; Yang, Wen-yu
2017-01-01
Continuous rain and an abnormally wet climate during harvest can easily lead to soybean plants being damaged by field mold (FM), which can reduce seed yield and quality. However, to date, the underlying pathogen and its resistance mechanism have remained unclear. The objective of the present study was to investigate the fungal diversity of various soybean varieties and to identify and confirm the FM pathogenic fungi. A total of 62,382 fungal ITS1 sequences clustered into 164 operational taxonomic units (OTUs) with 97% sequence similarity; 69 taxa were recovered from the samples by internal transcribed spacer (ITS) region sequencing. The fungal community compositions differed among the tested soybeans, with 42 OTUs being amplified from all varieties. The quadratic relationships between fungal diversity and organ-specific mildew indexes were analyzed, confirming that mildew on soybean pods can mitigate FM damage to the seeds. In addition, four potentially pathogenic fungi were isolated from FM-damaged soybean fruits; morphological and molecular identification confirmed these fungi as Aspergillus flavus, A. niger, Fusarium moniliforme, and Penicillium chrysogenum. Further re-inoculation experiments demonstrated that F. moniliforme is dominant among these FM pathogenic fungi. These results lay the foundation for future studies on mitigating or preventing FM damage to soybean. PMID:28515718
Chen, Tingtao; Shi, Yan; Wang, Xiaolei; Wang, Xin; Meng, Fanjing; Yang, Shaoguo; Yang, Jian; Xin, Hongbo
2017-07-01
Recurrence of oral diseases caused by antibiotics has brought about an urgent requirement to explore the oral microbial diversity in the human oral cavity. In the present study, the high‑throughput sequencing method was adopted to compare the microbial diversity of healthy people and oral patients and sequence analysis was performed by UPARSE software package. The Venn results indicated that a mean of 315 operational taxonomic units (OTUs) was obtained, and 73, 64, 53, 19 and 18 common OTUs belonging to Firmicutes, Bacteroidetes, Proteobacteria, Actinobacteria and Fusobacteria, respectively, were identified in healthy people. Moreover, the reduction of Firmicutes and the increase of Proteobacteria in the children group, and the increase of Firmicutes and the reduction of Proteobacteria in the youth and adult groups, indicated that the age bracket and oral disease had largely influenced the tooth development and microbial development in the oral cavity. In addition, the traditional 'pathogenic bacteria' of Firmicutes, Proteobacteria and Bacteroidetes (accounted for >95% of the total sequencing number in each group) indicated that the 'harmful' bacteria may exert beneficial effects on oral health. Therefore, the data will provide certain clues for curing some oral diseases by the strategy of adjusting the disturbed microbial compositions in oral disease to healthy level.
Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio
2015-07-30
Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.
Romanchuk, Artur; Chang, Jeff H.; Mukhtar, M. Shahid; Cherkis, Karen; Roach, Jeff; Grant, Sarah R.; Jones, Corbin D.; Dangl, Jeffery L.
2011-01-01
Closely related pathogens may differ dramatically in host range, but the molecular, genetic, and evolutionary basis for these differences remains unclear. In many Gram- negative bacteria, including the phytopathogen Pseudomonas syringae, type III effectors (TTEs) are essential for pathogenicity, instrumental in structuring host range, and exhibit wide diversity between strains. To capture the dynamic nature of virulence gene repertoires across P. syringae, we screened 11 diverse strains for novel TTE families and coupled this nearly saturating screen with the sequencing and assembly of 14 phylogenetically diverse isolates from a broad collection of diseased host plants. TTE repertoires vary dramatically in size and content across all P. syringae clades; surprisingly few TTEs are conserved and present in all strains. Those that are likely provide basal requirements for pathogenicity. We demonstrate that functional divergence within one conserved locus, hopM1, leads to dramatic differences in pathogenicity, and we demonstrate that phylogenetics-informed mutagenesis can be used to identify functionally critical residues of TTEs. The dynamism of the TTE repertoire is mirrored by diversity in pathways affecting the synthesis of secreted phytotoxins, highlighting the likely role of both types of virulence factors in determination of host range. We used these 14 draft genome sequences, plus five additional genome sequences previously reported, to identify the core genome for P. syringae and we compared this core to that of two closely related non-pathogenic pseudomonad species. These data revealed the recent acquisition of a 1 Mb megaplasmid by a sub-clade of cucumber pathogens. This megaplasmid encodes a type IV secretion system and a diverse set of unknown proteins, which dramatically increases both the genomic content of these strains and the pan-genome of the species. PMID:21799664
Fraga, Aline Padilha de; Gräf, Tiago; Pereira, Cleiton Schneider; Ikuta, Nilo; Fonseca, André Salvador Kazantzi; Lunge, Vagner Ricardo
2018-07-01
Avian infectious bronchitis virus (IBV) is the etiological agent of a highly contagious disease, which results in severe economic losses to the poultry industry. The spike protein (S1 subunit) is responsible for the molecular diversity of the virus and many sero/genotypes are described around the world. Recently a new standardized classification of the IBV molecular diversity was conducted, based on phylogenetic analysis of the S1 gene sequences sampled worldwide. Brazil is one of the biggest poultry producers in the world and the present study aimed to review the molecular diversity and reconstruct the evolutionary history of IBV in the country. All IBV S1 gene sequences, with local and year of collection information available on GenBank, were retrieved. Phylogenetic analyses were carried out based on a maximum likelihood method for the classification of genotypes occurring in Brazil, according to the new classification. Bayesian phylogenetic analyses were performed with the Brazilian clade and related international sequences to determine the evolutionary history of IBV in Brazil. A total of 143 Brazilian sequences were classified as GI-11 and 46 as GI-1 (Mass). Within the GI-11 clade, we have identified a potential recombinant strain circulating in Brazil. Phylodynamic analysis demonstrated that IBV GI-11 lineage was introduced in Brazil in the 1950s (1951, 1917-1975 95% HPD) and population dynamics was mostly constant throughout the time. Despite the national vaccination protocols, our results show the widespread dissemination and maintenance of the IBV GI-11 lineage in Brazil and highlight the importance of continuous surveillance to evaluate the impact of currently used vaccine strains on the observed viral diversity of the country. Copyright © 2018 Elsevier B.V. All rights reserved.
Jacob, Jacob H; Hussein, Emad I; Shakhatreh, Muhamad Ali K; Cornelison, Christopher T
2017-10-01
Amplicon sequencing using next-generation technology (bTEFAP ® ) has been utilized in describing the diversity of Dead Sea microbiota. The investigated area is a well-known salt lake in the western part of Jordan found in the lowest geographical location in the world (more than 420 m below sea level) and characterized by extreme salinity (approximately, 34%) in addition to other extreme conditions (low pH, unique ionic composition different from sea water). DNA was extracted from Dead Sea water. A total of 314,310 small subunit RNA (SSU rRNA) sequences were parsed, and 288,452 sequences were then clustered. For alpha diversity analysis, sample was rarefied to 3,000 sequences. The Shannon-Wiener index curve plot reached a plateau at approximately 3,000 sequences indicating that sequencing depth was sufficient to capture the full scope of microbial diversity. Archaea was found to be dominating the sequences (52%), whereas Bacteria constitute 45% of the sequences. Altogether, prokaryotic sequences (which constitute 97% of all sequences) were found to predominate. The findings expand on previous studies by using high-throughput amplicon sequencing to describe the microbial community in an environment which in recent years has been shown to hide some interesting diversity. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, orderedmore » restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.« less
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga
2015-01-01
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.
Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.; ...
2015-03-20
Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, orderedmore » restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.« less
PCR Primers for Metazoan Nuclear 18S and 28S Ribosomal DNA Sequences
Machida, Ryuji J.; Knowlton, Nancy
2012-01-01
Background Metagenetic analyses, which amplify and sequence target marker DNA regions from environmental samples, are increasingly employed to assess the biodiversity of communities of small organisms. Using this approach, our understanding of microbial diversity has expanded greatly. In contrast, only a few studies using this approach to characterize metazoan diversity have been reported, despite the fact that many metazoan species are small and difficult to identify or are undescribed. One of the reasons for this discrepancy is the availability of universal primers for the target taxa. In microbial studies, analysis of the 16S ribosomal DNA is standard. In contrast, the best gene for metazoan metagenetics is less clear. In the present study, we have designed primers that amplify the nuclear 18S and 28S ribosomal DNA sequences of most metazoan species with the goal of providing effective approaches for metagenetic analyses of metazoan diversity in environmental samples, with a particular emphasis on marine biodiversity. Methodology/Principal Findings Conserved regions suitable for designing PCR primers were identified using 14,503 and 1,072 metazoan sequences of the nuclear 18S and 28S rDNA regions, respectively. The sequence similarity of both these newly designed and the previously reported primers to the target regions of these primers were compared for each phylum to determine the expected amplification efficacy. The nucleotide diversity of the flanking regions of the primers was also estimated for genera or higher taxonomic groups of 11 phyla to determine the variable regions within the genes. Conclusions/Significance The identified nuclear ribosomal DNA primers (five primer pairs for 18S and eleven for 28S) and the results of the nucleotide diversity analyses provide options for primer combinations for metazoan metagenetic analyses. Additionally, advantages and disadvantages of not only the 18S and 28S ribosomal DNA, but also other marker regions as targets for metazoan metagenetic analyses, are discussed. PMID:23049971
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing
Alana Alexander; Debbie Steel; Beth Slikas; Kendra Hoekzema; Colm Carraher; Matthew Parks; Richard Cronn; C. Scott Baker
2012-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20...
Cyanobacterial diversity in extreme environments in Baja California, Mexico: a polyphasic study.
López-Cortés, A; García-Pichel, F; Nübel, U; Vázquez-Juárez, R
2001-12-01
Cyanobacterial diversity from two geographical areas of Baja California Sur, Mexico, were studied: Bahia Concepcion, and Ensenada de Aripez. The sites included hypersaline ecosystems, sea bottom, hydrothermal springs, and a shrimp farm. In this report we describe four new morphotypes, two are marine epilithic from Bahia Concepcion, Dermocarpa sp. and Hyella sp. The third, Geitlerinema sp., occurs in thermal springs and in shrimp ponds, and the fourth, Tychonema sp., is from a shrimp pond. The partial sequences of the 16S rRNA genes and the phylogenetic relationship of four cyanobacterial strains (Synechococcus cf. elongatus, Leptolyngbya cf. thermalis, Leptolyngbya sp., and Geitlerinema sp.) are also presented. Polyphasic studies that include the combination of light microscopy, cultures and the comparative analysis of 16S rRNA gene sequences provide the most powerful approach currently available to establish the diversity of these oxygenic photosynthetic microorganisms in culture and in nature.
Jang, Yeongseon; Jang, Seokyoon; Min, Mihee; Hong, Joo-Hyun; Lee, Hanbyul; Lee, Hwanhwi; Lim, Young Woon; Kim, Jae-Jin
2015-10-01
In this study, three different methods (fruiting body collection, mycelial isolation, and 454 sequencing) were implemented to determine the diversity of wood-inhabiting basidiomycetes from dead Manchurian fir (Abies holophylla). The three methods recovered similar species richness (26 species from fruiting bodies, 32 species from mycelia, and 32 species from 454 sequencing), but Fisher's alpha, Shannon-Wiener, Simpson's diversity indices of fungal communities indicated fruiting body collection and mycelial isolation displayed higher diversity compared with 454 sequencing. In total, 75 wood-inhabiting basidiomycetes were detected. The most frequently observed species were Heterobasidion orientale (fruiting body collection), Bjerkandera adusta (mycelial isolation), and Trichaptum fusco-violaceum (454 sequencing). Only two species, Hymenochaete yasudae and Hypochnicium karstenii, were detected by all three methods. This result indicated that Manchurian fir harbors a diverse basidiomycetous fungal community and for complete estimation of fungal diversity, multiple methods should be used. Further studies are required to understand their ecology in the context of forest ecosystems.
Feng, Hui; Gupta, Bhavna; Wang, Meilian; Zheng, Wenqi; Zheng, Li; Zhu, Xiaotong; Yang, Yimei; Fang, Qiang; Luo, Enjie; Fan, Qi; Tsuboi, Takafumi; Cao, Yaming; Cui, Liwang
2015-12-01
The male gamete fertilization factor P48/45 in malaria parasites is a prime transmission-blocking vaccine (TBV) candidate. Efforts to develop antimalarial vaccines are often thwarted by genetic diversity of the target antigens. Here we evaluated the genetic diversity of Pvs48/45 gene in global Plasmodium vivax populations. We determined 200 Pvs48/45 sequences collected from temperate and subtropical parasite populations in China. Population genetic and evolutionary analyses were performed to determine the levels of genetic diversity, potential signature of selection, and population differentiation. Analysis of the Pvs48/45 sequences from 200 P. vivax parasites collected in a temperate and a tropical region revealed a low level of genetic diversity (π = 0.0012) with 14 single nucleotide polymorphisms, of which 11 were nonsynonymous. Analysis of 344 Pvs48/45 sequences from nine worldwide P. vivax populations detected a total of 38 haplotypes, of which 13 haplotypes were present only once. Multiple tests for selection confirmed a signature of positive selection on Pvs48/45 with selection skewed to the second cysteine domain. Haplotype network analysis and Wright's fixation index showed large geographical differentiation with the presence of continent-or region-specific mutations in this gene. Pvs48/45 displays low levels of genetic diversity with the presence of region-specific mutations. Some of the mutations may be potential epitope targets based on their positions in the predicted structure, highlighting the need for future evaluation of these mutations in designing Pvs48/45-based TBV.
Hou, Weiguo; Wang, Shang; Briggs, Brandon R; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Hou, Weiguo; Wang, Shang; Briggs, Brandon R.; Li, Gaoyuan; Xie, Wei; Dong, Hailiang
2018-01-01
Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Rusch, Douglas B; Halpern, Aaron L; Sutton, Granger; Heidelberg, Karla B; Williamson, Shannon; Yooseph, Shibu; Wu, Dongying; Eisen, Jonathan A; Hoffman, Jeff M; Remington, Karin; Beeson, Karen; Tran, Bao; Smith, Hamilton; Baden-Tillson, Holly; Stewart, Clare; Thorpe, Joyce; Freeman, Jason; Andrews-Pfannkoch, Cynthia; Venter, Joseph E; Li, Kelvin; Kravitz, Saul; Heidelberg, John F; Utterback, Terry; Rogers, Yu-Hui; Falcón, Luisa I; Souza, Valeria; Bonilla-Rosso, Germán; Eguiarte, Luis E; Karl, David M; Sathyendranath, Shubha; Platt, Trevor; Bermingham, Eldredge; Gallardo, Victor; Tamayo-Castillo, Giselle; Ferrari, Michael R; Strausberg, Robert L; Nealson, Kenneth; Friedman, Robert; Frazier, Marvin; Venter, J. Craig
2007-01-01
The world's oceans contain a complex mixture of micro-organisms that are for the most part, uncharacterized both genetically and biochemically. We report here a metagenomic study of the marine planktonic microbiota in which surface (mostly marine) water samples were analyzed as part of the Sorcerer II Global Ocean Sampling expedition. These samples, collected across a several-thousand km transect from the North Atlantic through the Panama Canal and ending in the South Pacific yielded an extensive dataset consisting of 7.7 million sequencing reads (6.3 billion bp). Though a few major microbial clades dominate the planktonic marine niche, the dataset contains great diversity with 85% of the assembled sequence and 57% of the unassembled data being unique at a 98% sequence identity cutoff. Using the metadata associated with each sample and sequencing library, we developed new comparative genomic and assembly methods. One comparative genomic method, termed “fragment recruitment,” addressed questions of genome structure, evolution, and taxonomic or phylogenetic diversity, as well as the biochemical diversity of genes and gene families. A second method, termed “extreme assembly,” made possible the assembly and reconstruction of large segments of abundant but clearly nonclonal organisms. Within all abundant populations analyzed, we found extensive intra-ribotype diversity in several forms: (1) extensive sequence variation within orthologous regions throughout a given genome; despite coverage of individual ribotypes approaching 500-fold, most individual sequencing reads are unique; (2) numerous changes in gene content some with direct adaptive implications; and (3) hypervariable genomic islands that are too variable to assemble. The intra-ribotype diversity is organized into genetically isolated populations that have overlapping but independent distributions, implying distinct environmental preference. We present novel methods for measuring the genomic similarity between metagenomic samples and show how they may be grouped into several community types. Specific functional adaptations can be identified both within individual ribotypes and across the entire community, including proteorhodopsin spectral tuning and the presence or absence of the phosphate-binding gene PstS. PMID:17355176
Téllez-Sosa, Juan; Rodríguez, Mario Henry; Gómez-Barreto, Rosa E.; Valdovinos-Torres, Humberto; Hidalgo, Ana Cecilia; Cruz-Hervert, Pablo; Luna, René Santos; Carrillo-Valenzo, Erik; Ramos, Celso; García-García, Lourdes; Martínez-Barnetche, Jesús
2013-01-01
Background Influenza viruses display a high mutation rate and complex evolutionary patterns. Next-generation sequencing (NGS) has been widely used for qualitative and semi-quantitative assessment of genetic diversity in complex biological samples. The “deep sequencing” approach, enabled by the enormous throughput of current NGS platforms, allows the identification of rare genetic viral variants in targeted genetic regions, but is usually limited to a small number of samples. Methodology and Principal Findings We designed a proof-of-principle study to test whether redistributing sequencing throughput from a high depth-small sample number towards a low depth-large sample number approach is feasible and contributes to influenza epidemiological surveillance. Using 454-Roche sequencing, we sequenced at a rather low depth, a 307 bp amplicon of the neuraminidase gene of the Influenza A(H1N1) pandemic (A(H1N1)pdm) virus from cDNA amplicons pooled in 48 barcoded libraries obtained from nasal swab samples of infected patients (n = 299) taken from May to November, 2009 pandemic period in Mexico. This approach revealed that during the transition from the first (May-July) to second wave (September-November) of the pandemic, the initial genetic variants were replaced by the N248D mutation in the NA gene, and enabled the establishment of temporal and geographic associations with genetic diversity and the identification of mutations associated with oseltamivir resistance. Conclusions NGS sequencing of a short amplicon from the NA gene at low sequencing depth allowed genetic screening of a large number of samples, providing insights to viral genetic diversity dynamics and the identification of genetic variants associated with oseltamivir resistance. Further research is needed to explain the observed replacement of the genetic variants seen during the second wave. As sequencing throughput rises and library multiplexing and automation improves, we foresee that the approach presented here can be scaled up for global genetic surveillance of influenza and other infectious diseases. PMID:23843978
Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Nyström, Veronica; Harjula, Janne; Taavitsainen, Jussi-Pekka; Storå, Jan; Lidén, Kerstin; Kantanen, Juha
2013-01-22
Several molecular and population genetic studies have focused on the native sheep breeds of Finland. In this work, we investigated their ancestral sheep populations from Iron Age, Medieval and Post-Medieval periods by sequencing a partial mitochondrial DNA D-loop and the 5'-promoter region of the SRY gene. We compared the maternal (mitochondrial DNA haplotypes) and paternal (SNP oY1) genetic diversity of ancient sheep in Finland with modern domestic sheep populations in Europe and Asia to study temporal changes in genetic variation and affinities between ancient and modern populations. A 523-bp mitochondrial DNA sequence was successfully amplified for 26 of 36 sheep ancient samples i.e. five, seven and 14 samples representative of Iron Age, Medieval and Post-Medieval sheep, respectively. Genetic diversity was analyzed within the cohorts. This ancient dataset was compared with present-day data consisting of 94 animals from 10 contemporary European breeds and with GenBank DNA sequence data to carry out a haplotype sharing analysis. Among the 18 ancient mitochondrial DNA haplotypes identified, 14 were present in the modern breeds. Ancient haplotypes were assigned to the highly divergent ovine haplogroups A and B, haplogroup B being the major lineage within the cohorts. Only two haplotypes were detected in the Iron Age samples, while the genetic diversity of the Medieval and Post-Medieval cohorts was higher. For three of the ancient DNA samples, Y-chromosome SRY gene sequences were amplified indicating that they originated from rams. The SRY gene of these three ancient ram samples contained SNP G-oY1, which is frequent in modern north-European sheep breeds. Our study did not reveal any sign of major population replacement of native sheep in Finland since the Iron Age. Variations in the availability of archaeological remains may explain differences in genetic diversity estimates and patterns within the cohorts rather than demographic events that occurred in the past. Our ancient DNA results fit well with the genetic context of domestic sheep as determined by analyses of modern north-European sheep breeds.
2013-01-01
Background Several molecular and population genetic studies have focused on the native sheep breeds of Finland. In this work, we investigated their ancestral sheep populations from Iron Age, Medieval and Post-Medieval periods by sequencing a partial mitochondrial DNA D-loop and the 5’-promoter region of the SRY gene. We compared the maternal (mitochondrial DNA haplotypes) and paternal (SNP oY1) genetic diversity of ancient sheep in Finland with modern domestic sheep populations in Europe and Asia to study temporal changes in genetic variation and affinities between ancient and modern populations. Results A 523-bp mitochondrial DNA sequence was successfully amplified for 26 of 36 sheep ancient samples i.e. five, seven and 14 samples representative of Iron Age, Medieval and Post-Medieval sheep, respectively. Genetic diversity was analyzed within the cohorts. This ancient dataset was compared with present-day data consisting of 94 animals from 10 contemporary European breeds and with GenBank DNA sequence data to carry out a haplotype sharing analysis. Among the 18 ancient mitochondrial DNA haplotypes identified, 14 were present in the modern breeds. Ancient haplotypes were assigned to the highly divergent ovine haplogroups A and B, haplogroup B being the major lineage within the cohorts. Only two haplotypes were detected in the Iron Age samples, while the genetic diversity of the Medieval and Post-Medieval cohorts was higher. For three of the ancient DNA samples, Y-chromosome SRY gene sequences were amplified indicating that they originated from rams. The SRY gene of these three ancient ram samples contained SNP G-oY1, which is frequent in modern north-European sheep breeds. Conclusions Our study did not reveal any sign of major population replacement of native sheep in Finland since the Iron Age. Variations in the availability of archaeological remains may explain differences in genetic diversity estimates and patterns within the cohorts rather than demographic events that occurred in the past. Our ancient DNA results fit well with the genetic context of domestic sheep as determined by analyses of modern north-European sheep breeds. PMID:23339395
Vite-Garín, Tania; Estrada-Bárcenas, Daniel Alfonso; Cifuentes, Joaquín; Taylor, Maria Lucia
2014-01-01
Advances in the classification of the human pathogen Histoplasma capsulatum (H. capsulatum) (ascomycete) are sustained by the results of several genetic analyses that support the high diversity of this dimorphic fungus. The present mini-review highlights the great genetic plasticity of H. capsulatum. Important records with different molecular tools, mainly single- or multi-locus sequence analyses developed with this fungus, are discussed. Recent phylogenetic data with a multi-locus sequence analysis using 5 polymorphic loci support a new clade and/or phylogenetic species of H. capsulatum for the Americas, which was associated with fungal isolates obtained from the migratory bat Tadarida brasiliensis. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.
Microbial community structure in three deep-sea carbonate crusts.
Heijs, S K; Aloisi, G; Bouloubassi, I; Pancost, R D; Pierre, C; Sinninghe Damsté, J S; Gottschal, J C; van Elsas, J D; Forney, L J
2006-10-01
Carbonate crusts in marine environments can act as sinks for carbon dioxide. Therefore, understanding carbonate crust formation could be important for understanding global warming. In the present study, the microbial communities of three carbonate crust samples from deep-sea mud volcanoes in the eastern Mediterranean were characterized by sequencing 16S ribosomal RNA (rRNA) genes amplified from DNA directly retrieved from the samples. In combination with the mineralogical composition of the crusts and lipid analyses, sequence data were used to assess the possible role of prokaryotes in crust formation. Collectively, the obtained data showed the presence of highly diverse communities, which were distinct in each of the carbonate crusts studied. Bacterial 16S rRNA gene sequences were found in all crusts and the majority was classified as alpha-, gamma-, and delta- Proteobacteria. Interestingly, sequences of Proteobacteria related to Halomonas and Halovibrio sp., which can play an active role in carbonate mineral formation, were present in all crusts. Archaeal 16S rRNA gene sequences were retrieved from two of the crusts studied. Several of those were closely related to archaeal sequences of organisms that have previously been linked to the anaerobic oxidation of methane (AOM). However, the majority of archaeal sequences were not related to sequences of organisms known to be involved in AOM. In combination with the strongly negative delta 13C values of archaeal lipids, these results open the possibility that organisms with a role in AOM may be more diverse within the Archaea than previously suggested. Different communities found in the crusts could carry out similar processes that might play a role in carbonate crust formation.
Jeanbille, M; Buée, M; Bach, C; Cébron, A; Frey-Klett, P; Turpault, M P; Uroz, S
2016-02-01
Soil and climatic conditions as well as land cover and land management have been shown to strongly impact the structure and diversity of the soil bacterial communities. Here, we addressed under a same land cover the potential effect of the edaphic parameters on the soil bacterial communities, excluding potential confounding factors as climate. To do this, we characterized two natural soil sequences occurring in the Montiers experimental site. Spatially distant soil samples were collected below Fagus sylvatica tree stands to assess the effect of soil sequences on the edaphic parameters, as well as the structure and diversity of the bacterial communities. Soil analyses revealed that the two soil sequences were characterized by higher pH and calcium and magnesium contents in the lower plots. Metabolic assays based on Biolog Ecoplates highlighted higher intensity and richness in usable carbon substrates in the lower plots than in the middle and upper plots, although no significant differences occurred in the abundance of bacterial and fungal communities along the soil sequences as assessed using quantitative PCR. Pyrosequencing analysis of 16S ribosomal RNA (rRNA) gene amplicons revealed that Proteobacteria, Acidobacteria and Bacteroidetes were the most abundantly represented phyla. Acidobacteria, Proteobacteria and Chlamydiae were significantly enriched in the most acidic and nutrient-poor soils compared to the Bacteroidetes, which were significantly enriched in the soils presenting the higher pH and nutrient contents. Interestingly, aluminium, nitrogen, calcium, nutrient availability and pH appeared to be the best predictors of the bacterial community structures along the soil sequences.
Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.
Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R
2017-07-01
The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.
Loux, Valentin; Coeuret, Gwendoline; Zagorec, Monique; Champomier Vergès, Marie-Christine; Chaillou, Stéphane
2018-04-19
We present here the complete and draft genome sequences of nine Lactobacillus sakei strains, selected from the entire range of clonal complexes from the three known lineages of the species. The strains were chosen to provide a wide view of pangenomic and plasmidic diversity for this important foodborne species. Copyright © 2018 Loux et al.
A global view of structure–function relationships in the tautomerase superfamily
Davidson, Rebecca; Baas, Bert-Jan; Akiva, Eyal; Holliday, Gemma L.; Polacco, Benjamin J.; LeVieux, Jake A.; Pullara, Collin R.; Zhang, Yan Jessie; Whitman, Christian P.
2018-01-01
The tautomerase superfamily (TSF) consists of more than 11,000 nonredundant sequences present throughout the biosphere. Characterized members have attracted much attention because of the unusual and key catalytic role of an N-terminal proline. These few characterized members catalyze a diverse range of chemical reactions, but the full scale of their chemical capabilities and biological functions remains unknown. To gain new insight into TSF structure–function relationships, we performed a global analysis of similarities across the entire superfamily and computed a sequence similarity network to guide classification into distinct subgroups. Our results indicate that TSF members are found in all domains of life, with most being present in bacteria. The eukaryotic members of the cis-3-chloroacrylic acid dehalogenase subgroup are limited to fungal species, whereas the macrophage migration inhibitory factor subgroup has wide eukaryotic representation (including mammals). Unexpectedly, we found that 346 TSF sequences lack Pro-1, of which 85% are present in the malonate semialdehyde decarboxylase subgroup. The computed network also enabled the identification of similarity paths, namely sequences that link functionally diverse subgroups and exhibit transitional structural features that may help explain reaction divergence. A structure-guided comparison of these linker proteins identified conserved transitions between them, and kinetic analysis paralleled these observations. Phylogenetic reconstruction of the linker set was consistent with these findings. Our results also suggest that contemporary TSF members may have evolved from a short 4-oxalocrotonate tautomerase–like ancestor followed by gene duplication and fusion. Our new linker-guided strategy can be used to enrich the discovery of sequence/structure/function transitions in other enzyme superfamilies. PMID:29184004
Fu, Xiaoran; Apgar, James R.; Keating, Amy E.
2007-01-01
Computational protein design can be used to select sequences that are compatible with a fixed-backbone template. This strategy has been used in numerous instances to engineer novel proteins. However, the fixed-backbone assumption severely restricts the sequence space that is accessible via design. For challenging problems, such as the design of functional proteins, this may not be acceptable. In this paper, we present a method for introducing backbone flexibility into protein design calculations and apply it to the design of diverse helical BH3 ligands that bind to the anti-apoptotic protein Bcl-xL, a member of the Bcl-2 protein family. We demonstrate how normal mode analysis can be used to sample different BH3 backbones, and show that this leads to a larger and more diverse set of low-energy solutions than can be achieved using a native high-resolution Bcl-xL complex crystal structure as a template. We tested several of the designed solutions experimentally and found that this approach worked well when normal mode calculations were used to deform a native BH3 helix structure, but less well when they were used to deform an idealized helix. A subsequent round of design and testing identified a likely source of the problem as inadequate sampling of the helix pitch. In all, we tested seventeen designed BH3 peptide sequences, including several point mutants. Of these, eight bound well to Bcl-xL and four others showed weak but detectable binding. The successful designs showed a diversity of sequences that would have been difficult or impossible to achieve using only a fixed backbone. Thus, introducing backbone flexibility via normal mode analysis effectively broadened the set of sequences identified by computational design, and provided insight into positions important for binding Bcl-xL. PMID:17597151
Genetic diversity of Plasmodium Vivax revealed by the merozoite surface protein-1 icb5-6 fragment.
Ruan, Wei; Zhang, Ling-Ling; Feng, Yan; Zhang, Xuan; Chen, Hua-Liang; Lu, Qiao-Yi; Yao, Li-Nong; Hu, Wei
2017-06-05
Plasmodium vivax remains a potential cause of morbidity and mortality for people living in its endemic areas. Understanding the genetic diversity of P. vivax from different regions is valuable for studying population dynamics and tracing the origins of parasites. The PvMSP-1 gene is highly polymorphic and has been used as a marker in many P. vivax population studies. The aim of this study was to investigate the genetic diversity of the PvMSP-1 gene icb5-6 fragment and to provide more genetic polymorphism data for further studies on P. vivax population structure and tracking of the origin of clinical cases. Nested PCR and sequencing of the PvMSP-1 icb5-6 marker were performed to obtain the nucleotide sequences of 95 P. vivax isolates collected from Zhejiang province, China. To investigate the genetic diversity of PvMSP-1, the 95 nucleotide sequences of the PvMSP-1 icb5-6 fragment were genotyped and analyzed using DnaSP v5, MEGA software. The 95 P. vivax isolates collected from Zhejiang province were either indigenous cases or imported cases from different regions around the world. A total of 95 sequences ranging from 390 to 460 bp were obtained. The 95 sequences were genotyped into four allele-types (Sal I, Belem, R-III and R-IV) and 17 unique haplotypes. R-III and Sal I were the predominant allele-types. The haplotype diversity (Hd) and nucleotide diversity (Pi) were estimated to be 0.729 and 0.062, indicating that the PvMSP-1 icb5-6 fragment had the highest level of polymorphism due to frequent recombination processes and single nucleotide polymorphism. The values of dN/dS and Tajima's D both suggested neutral selection for the PvMSP-1icb5-6 fragment. In addition, a rare recombinant style of R-IV type was identified. This study presented high genetic diversity in the PvMSP-1 marker among P. vivax strains from around the world. The genetic data is valuable for expanding the polymorphism information on P. vivax, which could be helpful for further study on population dynamics and tracking the origin of P. vivax.
Ottesen, Elizabeth A.; Leadbetter, Jared R.
2011-01-01
In this study, we examine gene diversity for formyl-tetrahydrofolate synthetase (FTHFS), a key enzyme in homoacetogenesis, recovered from the gut microbiota of six species of higher termites. The “higher” termites (family Termitidae), which represent the majority of extant termite species and genera, engage in a broader diversity of feeding and nesting styles than the “lower” termites. Previous studies of termite gut homoacetogenesis have focused on wood-feeding lower termites, from which the preponderance of FTHFS sequences recovered were related to those from acetogenic treponemes. While sequences belonging to this group were present in the guts of all six higher termites examined, treponeme-like FTHFS sequences represented the majority of recovered sequences in only two species (a wood-feeding Nasutitermes sp. and a palm-feeding Microcerotermes sp.). The remaining four termite species analyzed (a Gnathamitermes sp. and two Amitermes spp. that were recovered from subterranean nests with indeterminate feeding strategies and a litter-feeding Rhynchotermes sp.) yielded novel FTHFS clades not observed in lower termites. These termites yielded two distinct clusters of probable purinolytic Firmicutes and a large group of potential homoacetogens related to sequences previously recovered from the guts of omnivorous cockroaches. These findings suggest that the gut environments of different higher termite species may select for different groups of homoacetogens, with some species hosting treponeme-dominated homoacetogen populations similar to those of wood-feeding, lower termites while others host Firmicutes-dominated communities more similar to those of omnivorous cockroaches. PMID:21441328
Silas, Sukrit; Makarova, Kira S; Shmakov, Sergey; Páez-Espino, David; Mohr, Georg; Liu, Yi; Davison, Michelle; Roux, Simon; Krishnamurthy, Siddharth R; Fu, Becky Xu Hua; Hansen, Loren L; Wang, David; Sullivan, Matthew B; Millard, Andrew; Clokie, Martha R; Bhaya, Devaki; Lambowitz, Alan M; Kyrpides, Nikos C; Koonin, Eugene V; Fire, Andrew Z
2017-07-11
Cas1 integrase is the key enzyme of the clustered regularly interspaced short palindromic repeat (CRISPR)-Cas adaptation module that mediates acquisition of spacers derived from foreign DNA by CRISPR arrays. In diverse bacteria, the cas1 gene is fused (or adjacent) to a gene encoding a reverse transcriptase (RT) related to group II intron RTs. An RT-Cas1 fusion protein has been recently shown to enable acquisition of CRISPR spacers from RNA. Phylogenetic analysis of the CRISPR-associated RTs demonstrates monophyly of the RT-Cas1 fusion, and coevolution of the RT and Cas1 domains. Nearly all such RTs are present within type III CRISPR-Cas loci, but their phylogeny does not parallel the CRISPR-Cas type classification, indicating that RT-Cas1 is an autonomous functional module that is disseminated by horizontal gene transfer and can function with diverse type III systems. To compare the sequence pools sampled by RT-Cas1-associated and RT-lacking CRISPR-Cas systems, we obtained samples of a commercially grown cyanobacterium- Arthrospira platensis Sequencing of the CRISPR arrays uncovered a highly diverse population of spacers. Spacer diversity was particularly striking for the RT-Cas1-containing type III-B system, where no saturation was evident even with millions of sequences analyzed. In contrast, analysis of the RT-lacking type III-D system yielded a highly diverse pool but reached a point where fewer novel spacers were recovered as sequencing depth was increased. Matches could be identified for a small fraction of the non-RT-Cas1-associated spacers, and for only a single RT-Cas1-associated spacer. Thus, the principal source(s) of the spacers, particularly the hypervariable spacer repertoire of the RT-associated arrays, remains unknown. IMPORTANCE While the majority of CRISPR-Cas immune systems adapt to foreign genetic elements by capturing segments of invasive DNA, some systems carry reverse transcriptases (RTs) that enable adaptation to RNA molecules. From analysis of available bacterial sequence data, we find evidence that RT-based RNA adaptation machinery has been able to join with CRISPR-Cas immune systems in many, diverse bacterial species. To investigate whether the abilities to adapt to DNA and RNA molecules are utilized for defense against distinct classes of invaders in nature, we sequenced CRISPR arrays from samples of commercial-scale open-air cultures of Arthrospira platensis , a cyanobacterium that contains both RT-lacking and RT-containing CRISPR-Cas systems. We uncovered a diverse pool of naturally occurring immune memories, with the RT-lacking locus acquiring a number of segments matching known viral or bacterial genes, while the RT-containing locus has acquired spacers from a distinct sequence pool for which the source remains enigmatic. Copyright © 2017 Silas et al.
It's all relative: ranking the diversity of aquatic bacterial communities.
Shaw, Allison K; Halpern, Aaron L; Beeson, Karen; Tran, Bao; Venter, J Craig; Martiny, Jennifer B H
2008-09-01
The study of microbial diversity patterns is hampered by the enormous diversity of microbial communities and the lack of resources to sample them exhaustively. For many questions about richness and evenness, however, one only needs to know the relative order of diversity among samples rather than total diversity. We used 16S libraries from the Global Ocean Survey to investigate the ability of 10 diversity statistics (including rarefaction, non-parametric, parametric, curve extrapolation and diversity indices) to assess the relative diversity of six aquatic bacterial communities. Overall, we found that the statistics yielded remarkably similar rankings of the samples for a given sequence similarity cut-off. This correspondence, despite the different underlying assumptions of the statistics, suggests that diversity statistics are a useful tool for ranking samples of microbial diversity. In addition, sequence similarity cut-off influenced the diversity ranking of the samples, demonstrating that diversity statistics can also be used to detect differences in phylogenetic structure among microbial communities. Finally, a subsampling analysis suggests that further sequencing from these particular clone libraries would not have substantially changed the richness rankings of the samples.
Pan, Yuezhi; Wang, Xueqin; Sun, Guiling; Li, Fusheng; Gong, Xun
2016-01-01
Panax notoginseng, a traditional Chinese medicinal plant, has been cultivated and domesticated for approximately 400 years, mainly in Yunnan and Guangxi, two provinces in southwest China. This species was named according to cultivated rather than wild individuals, and no wild populations had been found until now. The genetic resources available on farms are important for both breeding practices and resource conservation. In the present study, the recently developed technology RADseq, which is based on next-generation sequencing, was used to analyze the genetic variation and differentiation of P. notoginseng. The nucleotide diversity and heterozygosity results indicated that P. notoginseng had low genetic diversity at both the species and population levels. Almost no genetic differentiation has been detected, and all populations were genetically similar due to strong gene flow and insufficient splitting time. Although the genetic diversity of P. notoginseng was low at both species and population levels, several traditional plantations had relatively high genetic diversity, as revealed by the He and π values and by the private allele numbers. These valuable genetic resources should be protected as soon as possible to facilitate future breeding projects. The possible geographical origin of Sanqi domestication was discussed based on the results of the genetic diversity analysis. PMID:27846268
Comprehensive phylogenetic analysis of bacterial reverse transcriptases.
Toro, Nicolás; Nisa-Martínez, Rafael
2014-01-01
Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology.
Comprehensive Phylogenetic Analysis of Bacterial Reverse Transcriptases
Toro, Nicolás; Nisa-Martínez, Rafael
2014-01-01
Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology. PMID:25423096
Dacheux, Laurent; Cervantes-Gonzalez, Minerva; Guigon, Ghislaine; Thiberge, Jean-Michel; Vandenbogaert, Mathias; Maufrais, Corinne
2014-01-01
The prediction of viral zoonosis epidemics has become a major public health issue. A profound understanding of the viral population in key animal species acting as reservoirs represents an important step towards this goal. Bats harbor diverse viruses, some of which are of particular interest because they cause severe human diseases. However, little is known about the diversity of the global population of viruses found in bats (virome). We determined the viral diversity of five different French insectivorous bat species (nine specimens in total) in close contact with humans. Sequence-independent amplification, high-throughput sequencing with Illumina technology and a dedicated bioinformatics analysis pipeline were used on pooled tissues (brain, liver and lungs). Comparisons of the sequences of contigs and unassembled reads provided a global taxonomic distribution of virus-related sequences for each sample, highlighting differences both within and between bat species. Many viral families were present in these viromes, including viruses known to infect bacteria, plants/fungi, insects or vertebrates, the most relevant being those infecting mammals (Retroviridae, Herpesviridae, Bunyaviridae, Poxviridae, Flaviviridae, Reoviridae, Bornaviridae, Picobirnaviridae). In particular, we detected several new mammalian viruses, including rotaviruses, gammaretroviruses, bornaviruses and bunyaviruses with the identification of the first bat nairovirus. These observations demonstrate that bats naturally harbor viruses from many different families, most of which infect mammals. They may therefore constitute a major reservoir of viral diversity that should be analyzed carefully, to determine the role played by bats in the spread of zoonotic viral infections. PMID:24489870
Sharma, Rahul; Prakash, Om; Sonawane, Mahesh S; Nimonkar, Yogesh; Golellu, Priyanka B; Sharma, Rohit
2016-01-01
Soda lake is hyper alkaline and saline habitat located in closed craters with high evaporation rate. In current study fungal diversity from water and sediment samples of a soda lake (Lonar lake) located in Buldhana district of Maharashtra, India was investigated using extensive culturomics approach and mimicking the natural conditions of Lonar lake in culture media. A total of 104 diverse isolates of extremophilic fungi were recovered from this study and phylogenetically characterized by internal transcribed spacer (ITS) region sequencing. In addition, due to important role of phenol oxidase, and peroxidase in degradation of toxic phenol, lignin, etc., all isolated pure cultures were also screened for extracellular phenol oxidase and peroxidase production potential. Diversity analysis indicated that different groups of extremophilic fungi are present in the water and sediment samples of Lonar lake. A total of 38 species of fungi belonging to 18-different genera were recovered. Out of 104 isolates 32 showed ≤97% sequences similarity, which were morphologically different and could be potential novel isolates of extremophilic fungi. However, out of 104 isolates only 14 showed the extracellular phenol oxidase production potentials at alkaline pH. Curvularia sp. strain MEF018 showed highest phenol oxidase production at alkaline condition and had low sequence similarity with previously characterized species (96% with Curvularia pseudorobusta ). Taxonomic characterization (morphological and physiological) and multi locus sequence analysis (MLSA) using combined alignment of ITS-LSU- gpd of strain MEF018 showed that it is a novel species of the genus Curvularia and hence proposed as Curvularia lonarensis sp. nov.
Sharma, Rahul; Prakash, Om; Sonawane, Mahesh S.; Nimonkar, Yogesh; Golellu, Priyanka B.; Sharma, Rohit
2016-01-01
Soda lake is hyper alkaline and saline habitat located in closed craters with high evaporation rate. In current study fungal diversity from water and sediment samples of a soda lake (Lonar lake) located in Buldhana district of Maharashtra, India was investigated using extensive culturomics approach and mimicking the natural conditions of Lonar lake in culture media. A total of 104 diverse isolates of extremophilic fungi were recovered from this study and phylogenetically characterized by internal transcribed spacer (ITS) region sequencing. In addition, due to important role of phenol oxidase, and peroxidase in degradation of toxic phenol, lignin, etc., all isolated pure cultures were also screened for extracellular phenol oxidase and peroxidase production potential. Diversity analysis indicated that different groups of extremophilic fungi are present in the water and sediment samples of Lonar lake. A total of 38 species of fungi belonging to 18-different genera were recovered. Out of 104 isolates 32 showed ≤97% sequences similarity, which were morphologically different and could be potential novel isolates of extremophilic fungi. However, out of 104 isolates only 14 showed the extracellular phenol oxidase production potentials at alkaline pH. Curvularia sp. strain MEF018 showed highest phenol oxidase production at alkaline condition and had low sequence similarity with previously characterized species (96% with Curvularia pseudorobusta). Taxonomic characterization (morphological and physiological) and multi locus sequence analysis (MLSA) using combined alignment of ITS-LSU-gpd of strain MEF018 showed that it is a novel species of the genus Curvularia and hence proposed as Curvularia lonarensis sp. nov. PMID:27920761
A Diverse Range of Novel RNA Viruses in Geographically Distinct Honey Bee Populations
Shi, Mang; Buchmann, Gabriele; Blacquière, Tjeerd; Beekman, Madeleine; Ashe, Alyson
2017-01-01
ABSTRACT Understanding the diversity and consequences of viruses present in honey bees is critical for maintaining pollinator health and managing the spread of disease. The viral landscape of honey bees (Apis mellifera) has changed dramatically since the emergence of the parasitic mite Varroa destructor, which increased the spread of virulent variants of viruses such as deformed wing virus. Previous genomic studies have focused on colonies suffering from infections by Varroa and virulent viruses, which could mask other viral species present in honey bees, resulting in a distorted view of viral diversity. To capture the viral diversity within colonies that are exposed to mites but do not suffer the ultimate consequences of the infestation, we examined populations of honey bees that have evolved naturally or have been selected for resistance to Varroa. This analysis revealed seven novel viruses isolated from honey bees sampled globally, including the first identification of negative-sense RNA viruses in honey bees. Notably, two rhabdoviruses were present in three geographically diverse locations and were also present in Varroa mites parasitizing the bees. To characterize the antiviral response, we performed deep sequencing of small RNA populations in honey bees and mites. This provided evidence of a Dicer-mediated immune response in honey bees, while the viral small RNA profile in Varroa mites was novel and distinct from the response observed in bees. Overall, we show that viral diversity in honey bee colonies is greater than previously thought, which encourages additional studies of the bee virome on a global scale and which may ultimately improve disease management. IMPORTANCE Honey bee populations have become increasingly susceptible to colony losses due to pathogenic viruses spread by parasitic Varroa mites. To date, 24 viruses have been described in honey bees, with most belonging to the order Picornavirales. Collapsing Varroa-infected colonies are often overwhelmed with high levels of picornaviruses. To examine the underlying viral diversity in honey bees, we employed viral metatranscriptomics analyses on three geographically diverse Varroa-resistant populations from Europe, Africa, and the Pacific. We describe seven novel viruses from a range of diverse viral families, including two viruses that are present in all three locations. In honey bees, small RNA sequences indicate that these viruses are processed by Dicer and the RNA interference pathway, whereas Varroa mites produce strikingly novel small RNA patterns. This work increases the number and diversity of known honey bee viruses and will ultimately contribute to improved disease management in our most important agricultural pollinator. PMID:28515299
A Diverse Range of Novel RNA Viruses in Geographically Distinct Honey Bee Populations.
Remnant, Emily J; Shi, Mang; Buchmann, Gabriele; Blacquière, Tjeerd; Holmes, Edward C; Beekman, Madeleine; Ashe, Alyson
2017-08-15
Understanding the diversity and consequences of viruses present in honey bees is critical for maintaining pollinator health and managing the spread of disease. The viral landscape of honey bees ( Apis mellifera ) has changed dramatically since the emergence of the parasitic mite Varroa destructor , which increased the spread of virulent variants of viruses such as deformed wing virus. Previous genomic studies have focused on colonies suffering from infections by Varroa and virulent viruses, which could mask other viral species present in honey bees, resulting in a distorted view of viral diversity. To capture the viral diversity within colonies that are exposed to mites but do not suffer the ultimate consequences of the infestation, we examined populations of honey bees that have evolved naturally or have been selected for resistance to Varroa This analysis revealed seven novel viruses isolated from honey bees sampled globally, including the first identification of negative-sense RNA viruses in honey bees. Notably, two rhabdoviruses were present in three geographically diverse locations and were also present in Varroa mites parasitizing the bees. To characterize the antiviral response, we performed deep sequencing of small RNA populations in honey bees and mites. This provided evidence of a Dicer-mediated immune response in honey bees, while the viral small RNA profile in Varroa mites was novel and distinct from the response observed in bees. Overall, we show that viral diversity in honey bee colonies is greater than previously thought, which encourages additional studies of the bee virome on a global scale and which may ultimately improve disease management. IMPORTANCE Honey bee populations have become increasingly susceptible to colony losses due to pathogenic viruses spread by parasitic Varroa mites. To date, 24 viruses have been described in honey bees, with most belonging to the order Picornavirales Collapsing Varroa -infected colonies are often overwhelmed with high levels of picornaviruses. To examine the underlying viral diversity in honey bees, we employed viral metatranscriptomics analyses on three geographically diverse Varroa- resistant populations from Europe, Africa, and the Pacific. We describe seven novel viruses from a range of diverse viral families, including two viruses that are present in all three locations. In honey bees, small RNA sequences indicate that these viruses are processed by Dicer and the RNA interference pathway, whereas Varroa mites produce strikingly novel small RNA patterns. This work increases the number and diversity of known honey bee viruses and will ultimately contribute to improved disease management in our most important agricultural pollinator. Copyright © 2017 Remnant et al.
Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions
Birtel, Julia; Walser, Jean-Claude; Pichon, Samuel; Bürgmann, Helmut; Matthews, Blake
2015-01-01
Methods to estimate microbial diversity have developed rapidly in an effort to understand the distribution and diversity of microorganisms in natural environments. For bacterial communities, the 16S rRNA gene is the phylogenetic marker gene of choice, but most studies select only a specific region of the 16S rRNA to estimate bacterial diversity. Whereas biases derived from from DNA extraction, primer choice and PCR amplification are well documented, we here address how the choice of variable region can influence a wide range of standard ecological metrics, such as species richness, phylogenetic diversity, β-diversity and rank-abundance distributions. We have used Illumina paired-end sequencing to estimate the bacterial diversity of 20 natural lakes across Switzerland derived from three trimmed variable 16S rRNA regions (V3, V4, V5). Species richness, phylogenetic diversity, community composition, β-diversity, and rank-abundance distributions differed significantly between 16S rRNA regions. Overall, patterns of diversity quantified by the V3 and V5 regions were more similar to one another than those assessed by the V4 region. Similar results were obtained when analyzing the datasets with different sequence similarity thresholds used during sequences clustering and when the same analysis was used on a reference dataset of sequences from the Greengenes database. In addition we also measured species richness from the same lake samples using ARISA Fingerprinting, but did not find a strong relationship between species richness estimated by Illumina and ARISA. We conclude that the selection of 16S rRNA region significantly influences the estimation of bacterial diversity and species distributions and that caution is warranted when comparing data from different variable regions as well as when using different sequencing techniques. PMID:25915756
Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai
2013-08-01
To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
Mourier, Tobias; Mollerup, Sarah; Vinner, Lasse; Hansen, Thomas Arn; Kjartansdóttir, Kristín Rós; Guldberg Frøslev, Tobias; Snogdal Boutrup, Torsten; Nielsen, Lars Peter; Willerslev, Eske; Hansen, Anders J.
2015-01-01
From Illumina sequencing of DNA from brain and liver tissue from the lion, Panthera leo, and tumor samples from the pike-perch, Sander lucioperca, we obtained two assembled sequence contigs with similarity to known retroviruses. Phylogenetic analyses suggest that the pike-perch retrovirus belongs to the epsilonretroviruses, and the lion retrovirus to the gammaretroviruses. To determine if these novel retroviral sequences originate from an endogenous retrovirus or from a recently integrated exogenous retrovirus, we assessed the genetic diversity of the parental sequences from which the short Illumina reads are derived. First, we showed by simulations that we can robustly infer the level of genetic diversity from short sequence reads. Second, we find that the measures of nucleotide diversity inferred from our retroviral sequences significantly exceed the level observed from Human Immunodeficiency Virus infections, prompting us to conclude that the novel retroviruses are both of endogenous origin. Through further simulations, we rule out the possibility that the observed elevated levels of nucleotide diversity are the result of co-infection with two closely related exogenous retroviruses. PMID:26493184
Comparative Study on the Genetic Diversity of GHR Gene in Tibetan Cattle and Holstein Cows.
Deng, Feilong; Xia, Chenyang; Jia, Xianbo; Song, Tianzeng; Liu, Jianzhi; Lai, Song-Jia; Chen, Shi-Yi
2015-01-01
Due to the phenotype-based artificial selection in domestic cattle, the underlying functional genes may be indirectly selected and show decreasing diversity in theory. The growth hormone receptor (GHR) gene has been widely proposed to significantly associate with critical economic traits in cattle. In the present study, we comparatively studied the genetic diversity of GHR in Tibetan cattle (a traditional unselected breed, n = 93) and Chinese Holstein cow (the intensively selected breed, n = 94). The Tibetan yak (n = 38) was also included as an outgroup breed. A total of 21 variants were detected by sequencing 1279 bp genomic fragments encompassing the largest exon 9. Twelve haplotypes (H1∼H12) constructed by 15 coding SNPs were presented as a star-like network profile, in which haplotype H2 was located at the central position and almost occupied by Tibetan yaks. Furthermore, H2 was also identical to the formerly reported sequence specific to African cattle. Only haplotype H5 was simultaneously shared by all three breeds. Tibetan cattle showed higher nucleotide diversity (0.00215 ± 0.00015) and haplotype diversity (0.678 ± 0.026) than Holstein cow. Conclusively, we found Tibetan cattle have retained relatively high genetic variation of GHR. The predominant presence of African cattle specific H2 in the outgroup yak breed would highlight its ancestral relationship, which may be used as one informative molecular marker in the phylogenetic studies.
Metagenomics and the protein universe
Godzik, Adam
2011-01-01
Metagenomics sequencing projects have dramatically increased our knowledge of the protein universe and provided over one-half of currently known protein sequences; they have also introduced a much broader phylogenetic diversity into the protein databases. The full analysis of metagenomic datasets is only beginning, but it has already led to the discovery of thousands of new protein families, likely representing novel functions specific to given environments. At the same time, a deeper analysis of such novel families, including experimental structure determination of some representatives, suggests that most of them represent distant homologs of already characterized protein families, and thus most of the protein diversity present in the new environments are due to functional divergence of the known protein families rather than the emergence of new ones. PMID:21497084
Analysis of genetic diversity using SNP markers in oat
USDA-ARS?s Scientific Manuscript database
A large-scale single nucleotide polymorphism (SNP) discovery was carried out in cultivated oat using Roche 454 sequencing methods. DNA sequences were generated from cDNAs originating from a panel of 20 diverse oat cultivars, and from Diversity Array Technology (DArT) genomic complexity reductions fr...
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Leite, A M O; Mayo, B; Rachid, C T C C; Peixoto, R S; Silva, J T; Paschoalin, V M F; Delgado, S
2012-09-01
The microbial diversity and community structure of three different kefir grains from different parts of Brazil were examined via the combination of two culture-independent methods: PCR-denaturing gradient gel electrophoresis (PCR-DGGE) and pyrosequencing. PCR-DGGE showed Lactobacillus kefiranofaciens and Lactobacillus kefiri to be the major bacterial populations in all three grains. The yeast community was dominated by Saccharomyces cerevisiae. Pyrosequencing produced a total of 14,314 partial 16S rDNA sequence reads from the three grains. Sequence analysis grouped the reads into three phyla, of which Firmicutes was dominant. Members of the genus Lactobacillus were the most abundant operational taxonomic units (OTUs) in all samples, accounting for up to 96% of the sequences. OTUs belonging to other lactic and acetic acid bacteria genera, such as Lactococcus, Leuconostoc, Streptococcus and Acetobacter, were also identified at low levels. Two of the grains showed identical DGGE profiles and a similar number of OTUs, while the third sample showed the highest diversity by both techniques. Pyrosequencing allowed the identification of bacteria that were present in small numbers and rarely associated with the microbial community of this complex ecosystem. Copyright © 2012 Elsevier Ltd. All rights reserved.
Characterization of the Gut Microbiome Using 16S or Shotgun Metagenomics
Jovel, Juan; Patterson, Jordan; Wang, Weiwei; Hotte, Naomi; O'Keefe, Sandra; Mitchel, Troy; Perry, Troy; Kao, Dina; Mason, Andrew L.; Madsen, Karen L.; Wong, Gane K.-S.
2016-01-01
The advent of next generation sequencing (NGS) has enabled investigations of the gut microbiome with unprecedented resolution and throughput. This has stimulated the development of sophisticated bioinformatics tools to analyze the massive amounts of data generated. Researchers therefore need a clear understanding of the key concepts required for the design, execution and interpretation of NGS experiments on microbiomes. We conducted a literature review and used our own data to determine which approaches work best. The two main approaches for analyzing the microbiome, 16S ribosomal RNA (rRNA) gene amplicons and shotgun metagenomics, are illustrated with analyses of libraries designed to highlight their strengths and weaknesses. Several methods for taxonomic classification of bacterial sequences are discussed. We present simulations to assess the number of sequences that are required to perform reliable appraisals of bacterial community structure. To the extent that fluctuations in the diversity of gut bacterial populations correlate with health and disease, we emphasize various techniques for the analysis of bacterial communities within samples (α-diversity) and between samples (β-diversity). Finally, we demonstrate techniques to infer the metabolic capabilities of a bacteria community from these 16S and shotgun data. PMID:27148170
Specificity, Privacy, and Degeneracy in the CD4 T Cell Receptor Repertoire Following Immunization
Sun, Yuxin; Best, Katharine; Cinelli, Mattia; Heather, James M.; Reich-Zeliger, Shlomit; Shifrut, Eric; Friedman, Nir; Shawe-Taylor, John; Chain, Benny
2017-01-01
T cells recognize antigen using a large and diverse set of antigen-specific receptors created by a complex process of imprecise somatic cell gene rearrangements. In response to antigen-/receptor-binding-specific T cells then divide to form memory and effector populations. We apply high-throughput sequencing to investigate the global changes in T cell receptor sequences following immunization with ovalbumin (OVA) and adjuvant, to understand how adaptive immunity achieves specificity. Each immunized mouse contained a predominantly private but related set of expanded CDR3β sequences. We used machine learning to identify common patterns which distinguished repertoires from mice immunized with adjuvant with and without OVA. The CDR3β sequences were deconstructed into sets of overlapping contiguous amino acid triplets. The frequencies of these motifs were used to train the linear programming boosting (LPBoost) algorithm LPBoost to classify between TCR repertoires. LPBoost could distinguish between the two classes of repertoire with accuracies above 80%, using a small subset of triplet sequences present at defined positions along the CDR3. The results suggest a model in which such motifs confer degenerate antigen specificity in the context of a highly diverse and largely private set of T cell receptors. PMID:28450864
Hwang, Jonathan; Zhao, Qi; Yang, Zhu L; Wang, Zheng; Townsend, Jeffrey P
2015-08-01
The relation between ecological and genetic divergence of Helvella species (saddle fungi) has been perplexing. While a few species have been clearly demonstrated to be ectomycorrhizal fungi, ecological roles of many other species have been controversial, alternately considered as either saprotrophic or mycorrhizal. We applied SATé to build an inclusive deoxyribonucleic acid sequence alignment for the internal transcribed spacers (ITS) of annotated Helvella species and related environmental sequences. Phylogenetic informativeness of ITS and its regions were assessed using PhyDesign. Mycorrhizal lineages present a diversity of ecology, host type and geographic distribution. In two Helvella clades, no Helvella ITS sequences were recovered from root tips. Inclusion of environmental sequences in the ITS phylogeny from these sequences has the potential to link these data and reveal Helvella ecology. This study can serve as a model for revealing the diversity of relationships between unculturable fungi and their potential plant hosts. How non-mycorrhizal life styles within Helvella evolved will require expanded metagenomic investigation of soil and other environmental samples along with study of Helvella genomes. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai
2012-01-01
The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release2.3 Predicted CDS (SL2.40) discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2%) of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei’s genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis. PMID:23166835
Wei, Jingli; Hu, Xiaorong; Yang, Jingjing; Yang, Wencai
2012-01-01
The genus Physalis includes a number of commercially important edible and ornamental species. Its high nutritional value and potential medicinal properties leads to the increased commercial interest in the products of this genus worldwide. However, lack of molecular markers prevents the detailed study of genetics and phylogeny in Physalis, which limits the progress of breeding. In the present study, we compared the DNA sequences between Physalis and tomato, and attempted to analyze genetic diversity in Physalis using tomato markers. Blasting 23180 DNA sequences derived from Physalis against the International Tomato Annotation Group (ITAG) Release2.3 Predicted CDS (SL2.40) discovered 3356 single-copy orthologous genes between them. A total of 38 accessions from at least six species of Physalis were subjected to genetic diversity analysis using 97 tomato markers and 25 SSR markers derived from P. peruviana. Majority (73.2%) of tomato markers could amplify DNA fragments from at least one accession of Physalis. Diversity in Physalis at molecular level was also detected. The average Nei's genetic distance between accessions was 0.3806 with a range of 0.2865 to 0.7091. These results indicated Physalis and tomato had similarity at both molecular marker and DNA sequence levels. Therefore, the molecular markers developed in tomato can be used in genetic study in Physalis.
Raw Sewage Harbors Diverse Viral Populations
Cantalupo, Paul G.; Calgua, Byron; Zhao, Guoyan; Hundesa, Ayalkibet; Wier, Adam D.; Katz, Josh P.; Grabe, Michael; Hendrix, Roger W.; Girones, Rosina; Wang, David; Pipas, James M.
2011-01-01
ABSTRACT At this time, about 3,000 different viruses are recognized, but metagenomic studies suggest that these viruses are a small fraction of the viruses that exist in nature. We have explored viral diversity by deep sequencing nucleic acids obtained from virion populations enriched from raw sewage. We identified 234 known viruses, including 17 that infect humans. Plant, insect, and algal viruses as well as bacteriophages were also present. These viruses represented 26 taxonomic families and included viruses with single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), positive-sense ssRNA [ssRNA(+)], and dsRNA genomes. Novel viruses that could be placed in specific taxa represented 51 different families, making untreated wastewater the most diverse viral metagenome (genetic material recovered directly from environmental samples) examined thus far. However, the vast majority of sequence reads bore little or no sequence relation to known viruses and thus could not be placed into specific taxa. These results show that the vast majority of the viruses on Earth have not yet been characterized. Untreated wastewater provides a rich matrix for identifying novel viruses and for studying virus diversity. Importance At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that the viral universe is far more vast and diverse than previously suspected. PMID:21972239
Behnke, Anke; Bunge, John; Barger, Kathryn; Breiner, Hans-Werner; Alla, Victoria; Stoeck, Thorsten
2006-01-01
To resolve the fine-scale architecture of anoxic protistan communities, we conducted a cultivation-independent 18S rRNA survey in the superanoxic Framvaren Fjord in Norway. We generated three clone libraries along the steep O2/H2S gradient, using the multiple-primer approach. Of 1,100 clones analyzed, 753 proved to be high-quality protistan target sequences. These sequences were grouped into 92 phylotypes, which displayed high protistan diversity in the fjord (17 major eukaryotic phyla). Only a few were closely related to known taxa. Several sequences were dissimilar to all previously described sequences and occupied a basal position in the inferred phylogenies, suggesting that the sequences recovered were derived from novel, deeply divergent eukaryotes. We detected sequence clades with evolutionary importance (for example, clades in the euglenozoa) and clades that seem to be specifically adapted to anoxic environments, challenging the hypothesis that the global dispersal of protists is uniform. Moreover, with the detection of clones affiliated with jakobid flagellates, we present evidence that primitive descendants of early eukaryotes are present in this anoxic environment. To estimate sample coverage and phylotype richness, we used parametric and nonparametric statistical methods. The results show that although our data set is one of the largest published inventories, our sample missed a substantial proportion of the protistan diversity. Nevertheless, statistical and phylogenetic analyses of the three libraries revealed the fine-scale architecture of anoxic protistan communities, which may exhibit adaptation to different environmental conditions along the O2/H2S gradient. PMID:16672511
Asha, Srinivasan; Sreekumar, Sweda; Soniya, E V
2016-01-01
Analysis of high-throughput small RNA deep sequencing data, in combination with black pepper transcriptome sequences revealed microRNA-mediated gene regulation in black pepper ( Piper nigrum L.). Black pepper is an important spice crop and its berries are used worldwide as a natural food additive that contributes unique flavour to foods. In the present study to characterize microRNAs from black pepper, we generated a small RNA library from black pepper leaf and sequenced it by Illumina high-throughput sequencing technology. MicroRNAs belonging to a total of 303 conserved miRNA families were identified from the sRNAome data. Subsequent analysis from recently sequenced black pepper transcriptome confirmed precursor sequences of 50 conserved miRNAs and four potential novel miRNA candidates. Stem-loop qRT-PCR experiments demonstrated differential expression of eight conserved miRNAs in black pepper. Computational analysis of targets of the miRNAs showed 223 potential black pepper unigene targets that encode diverse transcription factors and enzymes involved in plant development, disease resistance, metabolic and signalling pathways. RLM-RACE experiments further mapped miRNA-mediated cleavage at five of the mRNA targets. In addition, miRNA isoforms corresponding to 18 miRNA families were also identified from black pepper. This study presents the first large-scale identification of microRNAs from black pepper and provides the foundation for the future studies of miRNA-mediated gene regulation of stress responses and diverse metabolic processes in black pepper.
Tang, Kai; Lin, Dan; Zheng, Qiang; Liu, Keshao; Yang, Yujie; Han, Yu; Jiao, Nianzhi
2017-06-27
Marine phages are spectacularly diverse in nature. Dozens of roseophages infecting members of Roseobacter clade bacteria were isolated and characterized, exhibiting a very high degree of genetic diversity. In the present study, the induction of two temperate bacteriophages, namely, vB_ThpS-P1 and vB_PeaS-P1, was performed in Roseobacter clade bacteria isolated from the deep-sea water, Thiobacimonas profunda JLT2016 and Pelagibaca abyssi JLT2014, respectively. Two novel phages in morphological, genomic and proteomic features were presented, and their phylogeny and evolutionary relationships were explored by bioinformatic analysis. Electron microscopy showed that the morphology of the two phages were similar to that of siphoviruses. Genome sequencing indicated that the two phages were similar in size, organization, and content, thereby suggesting that these shared a common ancestor. Despite the presence of Mu-like phage head genes, the phages are more closely related to Rhodobacter phage RC1 than Mu phages in terms of gene content and sequence similarity. Based on comparative genomic and phylogenetic analysis, we propose a Mu-like head phage group to allow for the inclusion of Mu-like phages and two newly phages. The sequences of the Mu-like head phage group were widespread, occurring in each investigated metagenomes. Furthermore, the horizontal exchange of genetic material within the Mu-like head phage group might have involved a gene that was associated with phage phenotypic characteristics. This study is the first report on the complete genome sequences of temperate phages that infect deep-sea roseobacters, belonging to the Mu-like head phage group. The Mu-like head phage group might represent a small but ubiquitous fraction of marine viral diversity.
Characterization of the bacterial biodiversity in Pico cheese (an artisanal Azorean food).
Riquelme, Cristina; Câmara, Sandra; Dapkevicius, Maria de Lurdes N Enes; Vinuesa, Pablo; da Silva, Célia Costa Gomes; Malcata, F Xavier; Rego, Oldemiro A
2015-01-02
This work presents the first study on the bacterial communities in Pico cheese, a traditional cheese of the Azores (Portugal), made from raw cow's milk. Pyrosequencing of tagged amplicons of the V3-V4 regions of the 16S rDNA and Operational Taxonomic Unit-based (OTU-based) analysis were applied to obtain an overall idea of the microbiota in Pico cheese and to elucidate possible differences between cheese-makers (A, B and C) and maturation times. Pyrosequencing revealed a high bacterial diversity in Pico cheese. Four phyla (Firmicutes, Proteobacteria, Actinobacteria and Bacteroidetes) and 54 genera were identified. The predominant genus was Lactococcus (77% of the sequences). Sequences belonging to major cheese-borne pathogens were not found. Staphylococcus accounted for 0.5% of the sequences. Significant differences in bacterial community composition were observed between cheese-maker B and the other two units that participated in the study. However, OTU analysis identified a set of taxa (Lactococcus, Streptococcus, Acinetobacter, Enterococcus, Lactobacillus, Staphylococcus, Rothia, Pantoea and unclassified genera belonging to the Enterobacteriaceae family) that would represent the core components of artisanal Pico cheese microbiota. A diverse bacterial community was present at early maturation, with an increase in the number of phylotypes up to 2 weeks, followed by a decrease at the end of ripening. The most remarkable trend in abundance patterns throughout ripening was an increase in the number of sequences belonging to the Lactobacillus genus, with a concomitant decrease in Acinetobacter, and Stenotrophomonas. Microbial rank abundance curves showed that Pico cheese's bacterial communities are characterized by a few dominant taxa and many low-abundance, highly diverse taxa that integrate the so-called "rare biosphere". Copyright © 2014 Elsevier B.V. All rights reserved.
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila
Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.
1980-01-01
The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
Legault, Boris A; Lopez-Lopez, Arantxa; Alba-Casado, Jose Carlos; Doolittle, W Ford; Bolhuis, Henk; Rodriguez-Valera, Francisco; Papke, R Thane
2006-01-01
Background Mature saturated brine (crystallizers) communities are largely dominated (>80% of cells) by the square halophilic archaeon "Haloquadratum walsbyi". The recent cultivation of the strain HBSQ001 and thesequencing of its genome allows comparison with the metagenome of this taxonomically simplified environment. Similar studies carried out in other extreme environments have revealed very little diversity in gene content among the cell lineages present. Results The metagenome of the microbial community of a crystallizer pond has been analyzed by end sequencing a 2000 clone fosmid library and comparing the sequences obtained with the genome sequence of "Haloquadratum walsbyi". The genome of the sequenced strain was retrieved nearly complete within this environmental DNA library. However, many ORF's that could be ascribed to the "Haloquadratum" metapopulation by common genome characteristics or scaffolding to the strain genome were not present in the specific sequenced isolate. Particularly, three regions of the sequenced genome were associated with multiple rearrangements and the presence of different genes from the metapopulation. Many transposition and phage related genes were found within this pool which, together with the associated atypical GC content in these areas, supports lateral gene transfer mediated by these elements as the most probable genetic cause of this variability. Additionally, these sequences were highly enriched in putative regulatory and signal transduction functions. Conclusion These results point to a large pan-genome (total gene repertoire of the genus/species) even in this highly specialized extremophile and at a single geographic location. The extensive gene repertoire is what might be expected of a population that exploits a diverse nutrient pool, resulting from the degradation of biomass produced at lower salinities. PMID:16820057
Identification and Characterization of Domesticated Bacterial Transposases
Gallie, Jenna; Rainey, Paul B.
2017-01-01
Abstract Selfish genetic elements, such as insertion sequences and transposons are found in most genomes. Transposons are usually identifiable by their high copy number within genomes. In contrast, REP-associated tyrosine transposases (RAYTs), a recently described class of bacterial transposase, are typically present at just one copy per genome. This suggests that RAYTs no longer copy themselves and thus they no longer function as a typical transposase. Motivated by this possibility we interrogated thousands of fully sequenced bacterial genomes in order to determine patterns of RAYT diversity, their distribution across chromosomes and accessory elements, and rate of duplication. RAYTs encompass exceptional diversity and are divisible into at least five distinct groups. They possess features more similar to housekeeping genes than insertion sequences, are predominantly vertically transmitted and have persisted through evolutionary time to the point where they are now found in 24% of all species for which at least one fully sequenced genome is available. Overall, the genomic distribution of RAYTs suggests that they have been coopted by host genomes to perform a function that benefits the host cell. PMID:28910967
Microbes in deep marine sediments viewed through amplicon sequencing and metagenomics
NASA Astrophysics Data System (ADS)
Biddle, J.; Leon, Z. R.; Russell, J. A., III; Martino, A. J.
2016-12-01
Nearly twenty percent of microbial biomass on Earth can be found in the marine subsurface. The majority of this is concentrated on continental margins, which have been investigated by scientific drilling. On the Costa Rica Margin, Iberian Margin and Peru Margins, sediment samples have been investigated through DNA extraction followed by amplicon and metagenomic sequencing. Overall samples show a high degree of microbial diversity, including many lineages of newly defined groups. In this talk, metagenome assembled genomes of unusual lineages will be presented, including their relationships to shallower relatives. From Costa Rica, in particular, we have retrieved deep relatives of Lokiarchaeota and Thorarchaeota, as well as other deeply branching archaeal relatives. We discuss their genome similarities to both other archaea and eukaryotes. From the Iberian Margin, relatives of Atribacteria and Aerophobetes will be discussed. Finally, we will detail the knowledge lost or gained depending on whether samples are studied via amplicon sequencing or total metagenomics, as studies in other environments have shown that up to 15% of microbial diversity is ignored when samples are studied via amplicon sequencing alone.
Elisa, Mwega; Hasan, Salih Dia; Moses, Njahira; Elpidius, Rukambile; Skilton, Robert; Gwakisa, Paul
2015-04-01
This study investigated the genetic and antigenic diversity of Theileria parva in cattle from the Eastern and Southern zones of Tanzania. Thirty-nine (62%) positive samples were genotyped using 14 mini- and microsatellite markers with coverage of all four T. parva chromosomes. Wright's F index (F(ST) = 0 × 094) indicated a high level of panmixis. Linkage equilibrium was observed in the two zones studied, suggesting existence of a panmyctic population. In addition, sequence analysis of CD8+ T-cell target antigen genes Tp1 revealed a single protein sequence in all samples analysed, which is also present in the T. parva Muguga strain, which is a component of the FAO1 vaccine. All Tp2 epitope sequences were identical to those in the T. parva Muguga strain, except for one variant of a Tp2 epitope, which is found in T. parva Kiambu 5 strain, also a component the FAO1 vaccine. Neighbour joining tree of the nucleotide sequences of Tp2 showed clustering according to geographical origin. Our results show low genetic and antigenic diversity of T. parva within the populations analysed. This has very important implications for the development of sustainable control measures for T. parva in Eastern and Southern zones of Tanzania, where East Coast fever is endemic.
Zhang, Yanhong; Pham, Nancy Kim; Zhang, Huixian; Lin, Junda; Lin, Qiang
2014-01-01
Population genetic of seahorses is confidently influenced by their species-specific ecological requirements and life-history traits. In the present study, partial sequences of mitochondrial cytochrome b (cytb) and control region (CR) were obtained from 50 Hippocampus mohnikei and 92 H. trimaculatus from four zoogeographical zones. A total of 780 base pairs of cytb gene were sequenced to characterize mitochondrial DNA (mtDNA) diversity. The mtDNA marker revealed high haplotype diversity, low nucleotide diversity, and a lack of population structure across both populations of H. mohnikei and H. trimaculatus. A neighbour-joining (NJ) tree of cytb gene sequences showed that H. mohnikei haplotypes formed one cluster. A maximum likelihood (ML) tree of cytb gene sequences showed that H. trimaculatus belonged to one lineage. The star-like pattern median-joining network of cytb and CR markers indicated a previous demographic expansion of H. mohnikei and H. trimaculatus. The cytb and CR data sets exhibited a unimodal mismatch distribution, which may have resulted from population expansion. Mismatch analysis suggested that the expansion was initiated about 276,000 years ago for H. mohnikei and about 230,000 years ago for H. trimaculatus during the middle Pleistocene period. This study indicates a possible signature of genetic variation and population expansion in two seahorses under complex marine environments. PMID:25144384
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons
Haas, Brian J.; Gevers, Dirk; Earl, Ashlee M.; Feldgarden, Mike; Ward, Doyle V.; Giannoukos, Georgia; Ciulla, Dawn; Tabbaa, Diana; Highlander, Sarah K.; Sodergren, Erica; Methé, Barbara; DeSantis, Todd Z.; Petrosino, Joseph F.; Knight, Rob; Birren, Bruce W.
2011-01-01
Bacterial diversity among environmental samples is commonly assessed with PCR-amplified 16S rRNA gene (16S) sequences. Perceived diversity, however, can be influenced by sample preparation, primer selection, and formation of chimeric 16S amplification products. Chimeras are hybrid products between multiple parent sequences that can be falsely interpreted as novel organisms, thus inflating apparent diversity. We developed a new chimera detection tool called Chimera Slayer (CS). CS detects chimeras with greater sensitivity than previous methods, performs well on short sequences such as those produced by the 454 Life Sciences (Roche) Genome Sequencer, and can scale to large data sets. By benchmarking CS performance against sequences derived from a controlled DNA mixture of known organisms and a simulated chimera set, we provide insights into the factors that affect chimera formation such as sequence abundance, the extent of similarity between 16S genes, and PCR conditions. Chimeras were found to reproducibly form among independent amplifications and contributed to false perceptions of sample diversity and the false identification of novel taxa, with less-abundant species exhibiting chimera rates exceeding 70%. Shotgun metagenomic sequences of our mock community appear to be devoid of 16S chimeras, supporting a role for shotgun metagenomics in validating novel organisms discovered in targeted sequence surveys. PMID:21212162
Exome Sequencing in the Clinical Diagnosis of Sporadic or Familial Cerebellar Ataxia
Fogel, Brent L.; Lee, Hane; Deignan, Joshua L.; Strom, Samuel P.; Kantarci, Sibel; Wang, Xizhe; Quintero-Rivera, Fabiola; Vilain, Eric; Grody, Wayne W.; Perlman, Susan; Geschwind, Daniel H.; Nelson, Stanley F.
2015-01-01
IMPORTANCE Cerebellar ataxias are a diverse collection of neurologic disorders with causes ranging from common acquired etiologies to rare genetic conditions. Numerous genetic disorders have been associated with chronic progressive ataxia and this consequently presents a diagnostic challenge for the clinician regarding how to approach and prioritize genetic testing in patients with such clinically heterogeneous phenotypes. Additionally, while the value of genetic testing in early-onset and/or familial cases seems clear, many patients with ataxia present sporadically with adult onset of symptoms and the contribution of genetic variation to the phenotype of these patients has not yet been established. OBJECTIVE To investigate the contribution of genetic disease in a population of patients with predominantly adult- and sporadic-onset cerebellar ataxia. DESIGN, SETTING, AND PARTICIPANTS We examined a consecutive series of 76 patients presenting to a tertiary referral center for evaluation of chronic progressive cerebellar ataxia. MAIN OUTCOMES AND MEASURES Next-generation exome sequencing coupled with comprehensive bioinformatic analysis, phenotypic analysis, and clinical correlation. RESULTS We identified clinically relevant genetic information in more than 60% of patients studied (n = 46), including diagnostic pathogenic gene variants in 21% (n = 16), a notable yield given the diverse genetics and clinical heterogeneity of the cerebellar ataxias. CONCLUSIONS AND RELEVANCE This study demonstrated that clinical exome sequencing in patients with adult-onset and sporadic presentations of ataxia is a high-yield test, providing a definitive diagnosis in more than one-fifth of patients and suggesting a potential diagnosis in more than one-third to guide additional phenotyping and diagnostic evaluation. Therefore, clinical exome sequencing is an appropriate consideration in the routine genetic evaluation of all patients presenting with chronic progressive cerebellar ataxia. PMID:25133958
USDA-ARS?s Scientific Manuscript database
Genetic diversity is an essential resource for breeders to improve new cultivars with desirable characteristics. Recently genotyping-by-sequencing (GBS), a next generation sequencing (NGS) based technology that can simplify complex genomes, has been used as a high-throughput and cost-effective molec...
USDA-ARS?s Scientific Manuscript database
Next generation sequencing technologies and improved bioinformatics methods have provided opportunities to study sequence variability in complex polyploid transcriptomes. In this study, we used a diverse panel of twenty-two Arachis accessions representing seven Arachis hypogaea market classes, A-, B...
Historically low mitochondrial DNA diversity in koalas (Phascolarctos cinereus)
2012-01-01
Background The koala (Phascolarctos cinereus) is an arboreal marsupial that was historically widespread across eastern Australia until the end of the 19th century when it suffered a steep population decline. Hunting for the fur trade, habitat conversion, and disease contributed to a precipitous reduction in koala population size during the late 1800s and early 1900s. To examine the effects of these reductions in population size on koala genetic diversity, we sequenced part of the hypervariable region of mitochondrial DNA (mtDNA) in koala museum specimens collected in the 19th and 20th centuries, hypothesizing that the historical samples would exhibit greater genetic diversity. Results The mtDNA haplotypes present in historical museum samples were identical to haplotypes found in modern koala populations, and no novel haplotypes were detected. Rarefaction analyses suggested that the mtDNA genetic diversity present in the museum samples was similar to that of modern koalas. Conclusions Low mtDNA diversity may have been present in koala populations prior to recent population declines. When considering management strategies, low genetic diversity of the mtDNA hypervariable region may not indicate recent inbreeding or founder events but may reflect an older historical pattern for koalas. PMID:23095716
What can we learn about lyssavirus genomes using 454 sequencing?
Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin
2012-01-01
The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.
Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn M; Johnson, Courtney M; Martin, Stanton L; Land, Miriam L; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A
2012-11-01
To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.
HIV-1 sequence variation between isolates from mother-infant transmission pairs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wike, C.M.; Daniels, M.R.; Furtado, M.
1991-12-31
To examine the sequence diversity of human immunodeficiency virus type 1 (HIV-1) between known transmission sets, sequences from the V3 and V4-V5 region of the env gene from 4 mother-infant pairs were analyzed. The mean interpatient sequence variation between isolates from linked mother-infant pairs was comparable to the sequence diversity found between isolates from other close contacts. The mean intrapatient variation was significantly less in the infants` isolates then the isolates from both their mothers and other characterized intrapatient sequence sets. In addition, a distinct and characteristic difference in the glycosylation pattern preceding the V3 loop was found between eachmore » linked transmission pair. These findings indicate that selection of specific genotypic variants, which may play a role in some direct transmission sets, and the duration of infection are important factors in the degree of diversity seen between the sequence sets.« less
Nyaga, Martin M; Tan, Yi; Seheri, Mapaseka L; Halpin, Rebecca A; Akopov, Asmik; Stucker, Karla M; Fedorova, Nadia B; Shrivastava, Susmita; Duncan Steele, A; Mwenda, Jason M; Pickett, Brett E; Das, Suman R; Jeffrey Mphahlele, M
2018-05-18
Rotavirus A (RVA) exhibits a wide genotype diversity globally. Little is known about the genetic composition of genotype P[6] from Africa. This study investigated possible evolutionary mechanisms leading to genetic diversity of genotype P[6] VP4 sequences. Phylogenetic analyses on 167 P[6] VP4 full-length sequences were conducted, which included six porcine-origin sequences. Of the 167 sequences, 57 were newly acquired through whole genome sequencing as part of this study. The other 110 sequences were all publicly-available global P[6] VP4 full-length sequences downloaded from GenBank. The strength of association between the phenotypic features and the phylogeny was also determined. A number of reassortment and mixed infections of RVA genotype P[6] strains were observed in this study. Phylogenetic analyses demostrated the extensive genetic diversity that exists among human P[6] strains, porcine-like strains, their concomitant clades/subclades and estimated that P[6] VP4 gene has a higher substitution rate with the mean of 1.05E-3 substitutions/site/year. Further, the phylogenetic analyses indicated that genotype P[6] strains were endemic in Africa, characterised by an extensive genetic diversity and long-time local evolution of the viruses. This was also supported by phylogeographic clustering and G-genotype clustering of the P[6] strains when Bayesian Tip-association Significance testing (BaTS) was applied, clearly supporting that the viruses evolved locally in Africa instead of spatial mixing among different regions. Overall, the results demonstrated that multiple mechanisms such as reassortment events, various mutations and possibly interspecies transmission account for the enormous diversity of genotype P[6] strains in Africa. These findings highlight the need for continued global surveillance of rotavirus diversity. Copyright © 2018 Elsevier B.V. All rights reserved.
Comeau, André M; Arbiol, Christine; Krisch, Henry M
2014-06-19
The diverse T4-like phages (Tquatrovirinae) infect a wide array of gram-negative bacterial hosts. The genome architecture of these phages is generally well conserved, most of the phylogenetically variable genes being grouped together in a series hyperplastic regions (HPRs) that are interspersed among large blocks of conserved core genes. Recent evidence from a pair of closely related T4-like phages has suggested that small, composite terminator/promoter sequences (promoterearly stem loop [PeSLs]) were implicated in mediating the high levels of genetic plasticity by indels occurring within the HPRs. Here, we present the genome sequence analysis of two T4-like phages, PST (168 kb, 272 open reading frames [ORFs]) and nt-1 (248 kb, 405 ORFs). These two phages were chosen for comparative sequence analysis because, although they are closely related to phages that have been previously sequenced (T4 and KVP40, respectively), they have different host ranges. In each case, one member of the pair infects a bacterial strain that is a human pathogen, whereas the other phage's host is a nonpathogen. Despite belonging to phylogenetically distant branches of the T4-likes, these pairs of phage have diverged from each other in part by a mechanism apparently involving PeSL-mediated recombination. This analysis confirms a role of PeSL sequences in the generation of genomic diversity by serving as a point of genetic exchange between otherwise unrelated sequences within the HPRs. Finally, the palette of divergent genes swapped by PeSL-mediated homologous recombination is discussed in the context of the PeSLs' potentially important role in facilitating phage adaption to new hosts and environments. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.
MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M
2015-09-14
The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Open-Source Sequence Clustering Methods Improve the State Of the Art.
Kopylova, Evguenia; Navas-Molina, Jose A; Mercier, Céline; Xu, Zhenjiang Zech; Mahé, Frédéric; He, Yan; Zhou, Hong-Wei; Rognes, Torbjørn; Caporaso, J Gregory; Knight, Rob
2016-01-01
Sequence clustering is a common early step in amplicon-based microbial community analysis, when raw sequencing reads are clustered into operational taxonomic units (OTUs) to reduce the run time of subsequent analysis steps. Here, we evaluated the performance of recently released state-of-the-art open-source clustering software products, namely, OTUCLUST, Swarm, SUMACLUST, and SortMeRNA, against current principal options (UCLUST and USEARCH) in QIIME, hierarchical clustering methods in mothur, and USEARCH's most recent clustering algorithm, UPARSE. All the latest open-source tools showed promising results, reporting up to 60% fewer spurious OTUs than UCLUST, indicating that the underlying clustering algorithm can vastly reduce the number of these derived OTUs. Furthermore, we observed that stringent quality filtering, such as is done in UPARSE, can cause a significant underestimation of species abundance and diversity, leading to incorrect biological results. Swarm, SUMACLUST, and SortMeRNA have been included in the QIIME 1.9.0 release. IMPORTANCE Massive collections of next-generation sequencing data call for fast, accurate, and easily accessible bioinformatics algorithms to perform sequence clustering. A comprehensive benchmark is presented, including open-source tools and the popular USEARCH suite. Simulated, mock, and environmental communities were used to analyze sensitivity, selectivity, species diversity (alpha and beta), and taxonomic composition. The results demonstrate that recent clustering algorithms can significantly improve accuracy and preserve estimated diversity without the application of aggressive filtering. Moreover, these tools are all open source, apply multiple levels of multithreading, and scale to the demands of modern next-generation sequencing data, which is essential for the analysis of massive multidisciplinary studies such as the Earth Microbiome Project (EMP) (J. A. Gilbert, J. K. Jansson, and R. Knight, BMC Biol 12:69, 2014, http://dx.doi.org/10.1186/s12915-014-0069-1).
Molecular diversity of early foraminifera
NASA Astrophysics Data System (ADS)
Holzmann, Maria; Pawlowski, Jan
2017-04-01
Monothalamid foraminifera are a diverse group that is characterized by single-chambered agglutinated or organic test. They occur in all marine habitats and are also present in terrestrial and freshwater environments. Monothalamids branch at the base of foraminiferal tree, as a paraphyletic group with some clades branching at the base of Globothalamea and Tubothalamea. We have currently more than 1500 sequences of monothalamids in our database that can be divided in at least 20 clades among which certain are particularly well presented by sequence numbers and/or number of different species. These are members of clade BM that contain Bathysiphon and Micrometula, clade C that contains among others xenophyophorans, saccaminids, and a large variety of organic-walled or agglutinated genera, clade E that contains the genera Psammophaga, Vellaria and Nellya and four clades that contain freshwater foraminifera. In general, the monothalamid clades comprise both agglutinated and organic-walled genera. Some common genera, such as Crithionina, Saccammina, Hippocrepina, are polyphyletic. Our results clearly show that monothalamids are highly diverse and their molecular diversity by far surpasses their morphological variety. Based on phylogenomic studies, monothalamids evolved early in the evolution of eukaryotes, as a part of the supergroup of Rhizaria, comprising also radiolarians and other amoeboid protists. The monothalamids have diverged from ancestral radiolarians, probably about 1000 million years ago, but the exact time is difficult to infer because of the uncertainties concerning a calibration of a eukaryotic phylogenomic tree.
Genomic analysis of bluetongue virus episystems in Australia and Indonesia.
Firth, Cadhla; Blasdell, Kim R; Amos-Ritchie, Rachel; Sendow, Indrawati; Agnihotri, Kalpana; Boyle, David B; Daniels, Peter; Kirkland, Peter D; Walker, Peter J
2017-11-23
The distribution of bluetongue viruses (BTV) in Australia is represented by two distinct and interconnected epidemiological systems (episystems)-one distributed primarily in the north and one in the east. The northern episystem is characterised by substantially greater antigenic diversity than the eastern episystem; yet the forces that act to limit the diversity present in the east remain unclear. Previous work has indicated that the northern episystem is linked to that of island South East Asia and Melanesia, and that BTV present in Indonesia, Papua New Guinea and East Timor, may act as source populations for new serotypes and genotypes of BTV to enter Australia's north. In this study, the genomes of 49 bluetongue viruses from the eastern episystem and 13 from Indonesia were sequenced and analysed along with 27 previously published genome sequences from the northern Australian episystem. The results of this analysis confirm that the Australian BTV population has its origins in the South East Asian/Melanesian episystem, and that incursions into northern Australia occur with some regularity. In addition, the presence of limited genetic diversity in the eastern episystem relative to that found in the north supports the presence of substantial, but not complete, barriers to gene flow between the northern and eastern Australian episystems. Genetic bottlenecks between each successive episystem are evident, and appear to be responsible for the reduction in BTV genetic diversity observed in the north to south-east direction.
Ancient DNA evidence for the loss of a highly divergent brown bear clade during historical times.
Calvignac, Sebastien; Hughes, Sandrine; Tougard, Christelle; Michaux, Jacques; Thevenot, Michel; Philippe, Michel; Hamdine, Watik; Hänni, Catherine
2008-04-01
The genetic diversity of present-day brown bears (Ursus arctos) has been extensively studied over the years and appears to be geographically structured into five main clades. The question of the past diversity of the species has been recently addressed by ancient DNA studies that concluded to a relative genetic stability over the last 35,000 years. However, the post-last glacial maximum genetic diversity of the species still remains poorly documented, notably in the Old World. Here, we analyse Atlas brown bears, which became extinct during the Holocene period. A divergent brown bear mitochondrial DNA lineage not present in any of the previously studied modern or ancient bear samples was uncovered, suggesting that the diversity of U. arctos was larger in the past than it is now. Specifically, a significant portion (with respect to sequence divergence) of the intraspecific diversity of the brown bear was lost with the extinction of the Atlas brown bear after the Pleistocene/Holocene transition.
Bacterial diversity in a glacier foreland of the high Arctic.
Schütte, Ursel M E; Abdo, Zaid; Foster, James; Ravel, Jacques; Bunge, John; Solheim, Bjørn; Forney, Larry J
2010-03-01
Over the past 100 years, Arctic temperatures have increased at almost twice the global average rate. One consequence is the acceleration of glacier retreat, exposing new habitats that are colonized by microorganisms whose diversity and function are unknown. Here, we characterized bacterial diversity along two approximately parallel chronosequences in an Arctic glacier forefield that span six time points following glacier retreat. We assessed changes in phylotype richness, evenness and turnover rate through the analysis of 16S rRNA gene sequences recovered from 52 samples taken from surface layers along the chronosequences. An average of 4500 sequences was obtained from each sample by 454 pyrosequencing. Using parametric methods, it was estimated that bacterial phylotype richness was high, and that it increased significantly from an average of 4000 (at a threshold of 97% sequence similarity) at locations exposed for 5 years to an average of 7050 phylotypes per 0.5 g of soil at sites that had been exposed for 150 years. Phylotype evenness also increased over time, with an evenness of 0.74 for 150 years since glacier retreat reflecting large proportions of rare phylotypes. The bacterial species turnover rate was especially high between sites exposed for 5 and 19 years. The level of bacterial diversity present in this High Arctic glacier foreland was comparable with that found in temperate and tropical soils, raising the question whether global patterns of bacterial species diversity parallel that of plants and animals, which have been found to form a latitudinal gradient and be lower in polar regions compared with the tropics.
Möbius, Petra; Hölzer, Martin; Felder, Marius; Nordsiek, Gabriele; Groth, Marco; Köhler, Heike; Reichwald, Kathrin; Platzer, Matthias; Marz, Manja
2015-01-01
Mycobacterium avium (M. a.) subsp. paratuberculosis (MAP)—the etiologic agent of Johne’s disease—affects cattle, sheep, and other ruminants worldwide. To decipher phenotypic differences among sheep and cattle strains (belonging to MAP-S [Type-I/III], respectively, MAP-C [Type-II]), comparative genome analysis needs data from diverse isolates originating from different geographic regions of the world. This study presents the so far best assembled genome of a MAP-S-strain: Sheep isolate JIII-386 from Germany. One newly sequenced cattle isolate (JII-1961, Germany), four published MAP strains of MAP-C and MAP-S from the United States and Australia, and M. a. subsp. hominissuis (MAH) strain 104 were used for assembly improvement and comparisons. All genomes were annotated by BacProt and results compared with NCBI (National Center for Biotechnology Information) annotation. Corresponding protein-coding sequences (CDSs) were detected, but also CDSs that were exclusively determined by either NCBI or BacProt. A new Shine–Dalgarno sequence motif (5′-AGCTGG-3′) was extracted. Novel CDSs including PE-PGRS family protein genes and about 80 noncoding RNAs exhibiting high sequence conservation are presented. Previously found genetic differences between MAP-types are partially revised. Four of ten assumed MAP-S-specific large sequence polymorphism regions (LSPSs) are still present in MAP-C strains; new LSPSs were identified. Independently of the regional origin of the strains, the number of individual CDSs and single nucleotide variants confirms the strong similarity of MAP-C strains and shows higher diversity among MAP-S strains. This study gives ambiguous results regarding the hypothesis that MAP-S is the evolutionary intermediate between MAH and MAP-C, but it clearly shows a higher similarity of MAP to MAH than to Mycobacterium intracellulare. PMID:26384038
Wettstein, P J; States, J S
1986-01-01
The extent of polymorphism and the rate of divergence of class I and class II sequences mapping to the mammalian major histocompatibility complex (MHC) have been the subject of experimentation and speculation. To provide further insight into the evolution of the MHC we have initiated the analysis of two geographically isolated subspecies of tassel-eared squirrels. In the preceding communication we described the number and polymorphism of TSLA class I and class II sequences in Kaibab squirrels (S. aberti kaibabensis), which live north of the Grand Canyon. In this report we present a parallel analysis of Abert squirrels (S. aberti aberti), which live south of the Grand Canyon in northern Arizona. Genomic DNA from 12 Abert squirrels was digested with restriction enzymes, electrophoresed, blotted, and hybridized with DR alpha, DR beta, DQ alpha, DQ beta, and HLA-B7 probes. The results of these hybridizations were remarkably similar to those obtained in Kaibab squirrels. The majority of class I and class II bands were identical in size and number, suggesting that Abert and Kaibab squirrels have not significantly diverged in the TSLA complex despite their geographical separation. Relative polymorphism of class II sequences was similar to that observed with Kaibab squirrels: beta sequences exhibited higher polymorphism than alpha sequences. As in Kaibab squirrels, a number of alpha and beta sequences were apparently carried on the same fragments. In comparison to class II beta sequences, there was limited polymorphism in class I sequences, although a diverse number of class I genotypes were observed. Attempts to identify segregating TSLA haplotypes were futile in that the only families of sequences with concordant distributions were DQ alpha and DQ beta. These observations and those obtained with Kaibab squirrels suggest that the present-day TSLA haplotypes of both subspecies are derived from a limited number of common, progenitor haplotypes through repeated intra-TSLA recombination.
Monier, Adam; Worden, Alexandra Z; Richards, Thomas A
2016-08-01
High-throughput diversity amplicon sequencing of marine microbial samples has revealed that members of the Mamiellophyceae lineage are successful phytoplankton in many oceanic habitats. Indeed, these eukaryotic green algae can dominate the picoplanktonic biomass, however, given the broad expanses of the oceans, their geographical distributions and the phylogenetic diversity of some groups remain poorly characterized. As these algae play a foundational role in marine food webs, it is crucial to assess their global distribution in order to better predict potential changes in abundance and community structure. To this end, we analyzed the V9-18S small subunit rDNA sequences deposited from the Tara Oceans expedition to evaluate the diversity and biogeography of these phytoplankton. Our results show that the phylogenetic composition of Mamiellophyceae communities is in part determined by geographical provenance, and do not appear to be influenced - in the samples recovered - by water depth, at least at the resolution possible with the V9-18S. Phylogenetic classification of Mamiellophyceae sequences revealed that the Dolichomastigales order encompasses more sequence diversity than other orders in this lineage. These results indicate that a large fraction of the Mamiellophyceae diversity has been hitherto overlooked, likely because of a combination of size fraction, sequencing and geographical limitations. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Takai, Ken; Horikoshi, Koki
1999-01-01
Molecular phylogenetic analysis of a naturally occurring microbial community in a deep-subsurface geothermal environment indicated that the phylogenetic diversity of the microbial population in the environment was extremely limited and that only hyperthermophilic archaeal members closely related to Pyrobaculum were present. All archaeal ribosomal DNA sequences contained intron-like sequences, some of which had open reading frames with repeated homing-endonuclease motifs. The sequence similarity analysis and the phylogenetic analysis of these homing endonucleases suggested the possible phylogenetic relationship among archaeal rRNA-encoded homing endonucleases. PMID:10584021
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes
Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok
2018-01-01
Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas. PMID:29872447
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes.
Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok
2018-01-01
Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas.
Paul, Fiona; Otte, Jürgen; Schmitt, Imke; Dal Grande, Francesco
2018-06-05
The implementation of HTS (high-throughput sequencing) approaches is rapidly changing our understanding of the lichen symbiosis, by uncovering high bacterial and fungal diversity, which is often host-specific. Recently, HTS methods revealed the presence of multiple photobionts inside a single thallus in several lichen species. This differs from Sanger technology, which typically yields a single, unambiguous algal sequence per individual. Here we compared HTS and Sanger methods for estimating the diversity of green algal symbionts within lichen thalli using 240 lichen individuals belonging to two species of lichen-forming fungi. According to HTS data, Sanger technology consistently yielded the most abundant photobiont sequence in the sample. However, if the second most abundant photobiont exceeded 30% of the total HTS reads in a sample, Sanger sequencing generally failed. Our results suggest that most lichen individuals in the two analyzed species, Lasallia hispanica and L. pustulata, indeed contain a single, predominant green algal photobiont. We conclude that Sanger sequencing is a valid approach to detect the dominant photobionts in lichen individuals and populations. We discuss which research areas in lichen ecology and evolution will continue to benefit from Sanger sequencing, and which areas will profit from HTS approaches to assessing symbiont diversity.
Population genetics of Ice Age brown bears
Leonard, Jennifer A.; Wayne, Robert K.; Cooper, Alan
2000-01-01
The Pleistocene was a dynamic period for Holarctic mammal species, complicated by episodes of glaciation, local extinctions, and intercontinental migration. The genetic consequences of these events are difficult to resolve from the study of present-day populations. To provide a direct view of population genetics in the late Pleistocene, we measured mitochondrial DNA sequence variation in seven permafrost-preserved brown bear (Ursus arctos) specimens, dated from 14,000 to 42,000 years ago. Approximately 36,000 years ago, the Beringian brown bear population had a higher genetic diversity than any extant North American population, but by 15,000 years ago genetic diversity appears similar to the modern day. The older, genetically diverse, Beringian population contained sequences from three clades now restricted to local regions within North America, indicating that current phylogeographic patterns may provide misleading data for evolutionary studies and conservation management. The late Pleistocene phylogeographic data also indicate possible colonization routes to areas south of the Cordilleran ice sheet. PMID:10677513
Sola, Christophe
2015-06-01
The natural history of tuberculosis may be tackled by various means, among which the record of molecular scars that have been registered by the Mycobacterium tuberculosis complex (MTBC) genomes transmitted from patient to patient for tens of thousands years and possibly more. Recently discovered polymorphic loci, the CRISPR sequences, are indirect witnesses of the historical phage-bacteria struggle, and may be related to the time when the ancestor of today's tubercle bacilli were environmental bacteria, i.e. before becoming intracellular parasites. In this article, we present what are CRISPRs and try to summarize almost 20 years of research results obtained using the genetic diversity of the CRISPR loci in MTBC as a perspective for studying new models. We show that the study of the diversity of CRISPR sequences, thanks to «spoligotyping», has played a great role in our global understanding of the population structure of MTBC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Motility and Flagellar Glycosylation in Clostridium difficile▿ †
Twine, Susan M.; Reid, Christopher W.; Aubry, Annie; McMullin, David R.; Fulton, Kelly M.; Austin, John; Logan, Susan M.
2009-01-01
In this study, intact flagellin proteins were purified from strains of Clostridium difficile and analyzed using quadrupole time of flight and linear ion trap mass spectrometers. Top-down studies showed the flagellin proteins to have a mass greater than that predicted from the corresponding gene sequence. These top-down studies revealed marker ions characteristic of glycan modifications. Additionally, diversity in the observed masses of glycan modifications was seen between strains. Electron transfer dissociation mass spectrometry was used to demonstrate that the glycan was attached to the flagellin protein backbone in O linkage via a HexNAc residue in all strains examined. Bioinformatic analysis of C. difficile genomes revealed diversity with respect to glycan biosynthesis gene content within the flagellar biosynthesis locus, likely reflected by the observed flagellar glycan diversity. In C. difficile strain 630, insertional inactivation of a glycosyltransferase gene (CD0240) present in all sequenced genomes resulted in an inability to produce flagellar filaments at the cell surface and only minor amounts of unmodified flagellin protein. PMID:19749038
Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.
2010-01-01
Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B
2010-04-01
Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10-20% nucleotide deviation from the canonical ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.
Utturkar, Sagar M.; Klingeman, Dawn M.; Johnson, Courtney M.; Martin, Stanton L.; Land, Miriam L.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.
2012-01-01
To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated. PMID:23045501
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn Marie
To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.
Johansson, Anders; Aspan, Anna; Bagge, Elisabeth; Båverud, Viveca; Engström, Björn E; Johansson, Karl-Erik
2006-01-01
Background Clostridium perfringens, a serious pathogen, causes enteric diseases in domestic animals and food poisoning in humans. The epidemiological relationship between C. perfringens isolates from the same source has previously been investigated chiefly by pulsed-field gel electrophoresis (PFGE). In this study the genetic diversity of C. perfringens isolated from various animals, from food poisoning outbreaks and from sludge was investigated. Results We used PFGE to examine the genetic diversity of 95 C. perfringens type A isolates from eight different sources. The isolates were also examined for the presence of the beta2 toxin gene (cpb2) and the enterotoxin gene (cpe). The cpb2 gene from the 28 cpb2-positive isolates was also partially sequenced (519 bp, corresponding to positions 188 to 706 in the consensus cpb2 sequence). The results of PFGE revealed a wide genetic diversity among the C. perfringens type A isolates. The genetic relatedness of the isolates ranged from 58 to 100% and 56 distinct PFGE types were identified. Almost all clusters with similar patterns comprised isolates with a known epidemiological correlation. Most of the isolates from pig, horse and sheep carried the cpb2 gene. All isolates originating from food poisoning outbreaks carried the cpe gene and three of these also carried cpb2. Two evolutionary different populations were identified by sequence analysis of the partially sequenced cpb2 genes from our study and cpb2 sequences previously deposited in GenBank. Conclusion As revealed by PFGE, there was a wide genetic diversity among C. perfringens isolates from different sources. Epidemiologically related isolates showed a high genetic similarity, as expected, while isolates with no obvious epidemiological relationship expressed a lesser degree of genetic similarity. The wide diversity revealed by PFGE was not reflected in the 16S rRNA sequences, which had a considerable degree of sequence similarity. Sequence comparison of the partially sequenced cpb2 gene revealed two genetically different populations. This is to our knowledge the first study in which the genetic diversity of C. perfringens isolates both from different animals species, from food poisoning outbreaks and from sludge has been investigated. PMID:16737528
Rosa, Rafael D; Santini, Adrien; Fievet, Julie; Bulet, Philippe; Destoumieux-Garzón, Delphine; Bachère, Evelyne
2011-01-01
Big defensin is an antimicrobial peptide composed of a highly hydrophobic N-terminal region and a cationic C-terminal region containing six cysteine residues involved in three internal disulfide bridges. While big defensin sequences have been reported in various mollusk species, few studies have been devoted to their sequence diversity, gene organization and their expression in response to microbial infections. Using the high-throughput Digital Gene Expression approach, we have identified in Crassostrea gigas oysters several sequences coding for big defensins induced in response to a Vibrio infection. We showed that the oyster big defensin family is composed of three members (named Cg-BigDef1, Cg-BigDef2 and Cg-BigDef3) that are encoded by distinct genomic sequences. All Cg-BigDefs contain a hydrophobic N-terminal domain and a cationic C-terminal domain that resembles vertebrate β-defensins. Both domains are encoded by separate exons. We found that big defensins form a group predominantly present in mollusks and closer to vertebrate defensins than to invertebrate and fungi CSαβ-containing defensins. Moreover, we showed that Cg-BigDefs are expressed in oyster hemocytes only and follow different patterns of gene expression. While Cg-BigDef3 is non-regulated, both Cg-BigDef1 and Cg-BigDef2 transcripts are strongly induced in response to bacterial challenge. Induction was dependent on pathogen associated molecular patterns but not damage-dependent. The inducibility of Cg-BigDef1 was confirmed by HPLC and mass spectrometry, since ions with a molecular mass compatible with mature Cg-BigDef1 (10.7 kDa) were present in immune-challenged oysters only. From our biochemical data, native Cg-BigDef1 would result from the elimination of a prepropeptide sequence and the cyclization of the resulting N-terminal glutamine residue into a pyroglutamic acid. We provide here the first report showing that big defensins form a family of antimicrobial peptides diverse not only in terms of sequences but also in terms of genomic organization and regulation of gene expression.
Rosa, Rafael D.; Santini, Adrien; Fievet, Julie; Bulet, Philippe; Destoumieux-Garzón, Delphine; Bachère, Evelyne
2011-01-01
Background Big defensin is an antimicrobial peptide composed of a highly hydrophobic N-terminal region and a cationic C-terminal region containing six cysteine residues involved in three internal disulfide bridges. While big defensin sequences have been reported in various mollusk species, few studies have been devoted to their sequence diversity, gene organization and their expression in response to microbial infections. Findings Using the high-throughput Digital Gene Expression approach, we have identified in Crassostrea gigas oysters several sequences coding for big defensins induced in response to a Vibrio infection. We showed that the oyster big defensin family is composed of three members (named Cg-BigDef1, Cg-BigDef2 and Cg-BigDef3) that are encoded by distinct genomic sequences. All Cg-BigDefs contain a hydrophobic N-terminal domain and a cationic C-terminal domain that resembles vertebrate β-defensins. Both domains are encoded by separate exons. We found that big defensins form a group predominantly present in mollusks and closer to vertebrate defensins than to invertebrate and fungi CSαβ-containing defensins. Moreover, we showed that Cg-BigDefs are expressed in oyster hemocytes only and follow different patterns of gene expression. While Cg-BigDef3 is non-regulated, both Cg-BigDef1 and Cg-BigDef2 transcripts are strongly induced in response to bacterial challenge. Induction was dependent on pathogen associated molecular patterns but not damage-dependent. The inducibility of Cg-BigDef1 was confirmed by HPLC and mass spectrometry, since ions with a molecular mass compatible with mature Cg-BigDef1 (10.7 kDa) were present in immune-challenged oysters only. From our biochemical data, native Cg-BigDef1 would result from the elimination of a prepropeptide sequence and the cyclization of the resulting N-terminal glutamine residue into a pyroglutamic acid. Conclusions We provide here the first report showing that big defensins form a family of antimicrobial peptides diverse not only in terms of sequences but also in terms of genomic organization and regulation of gene expression. PMID:21980497
Defining the healthy "core microbiome" of oral microbial communities
2009-01-01
Background Most studies examining the commensal human oral microbiome are focused on disease or are limited in methodology. In order to diagnose and treat diseases at an early and reversible stage an in-depth definition of health is indispensible. The aim of this study therefore was to define the healthy oral microbiome using recent advances in sequencing technology (454 pyrosequencing). Results We sampled and sequenced microbiomes from several intraoral niches (dental surfaces, cheek, hard palate, tongue and saliva) in three healthy individuals. Within an individual oral cavity, we found over 3600 unique sequences, over 500 different OTUs or "species-level" phylotypes (sequences that clustered at 3% genetic difference) and 88 - 104 higher taxa (genus or more inclusive taxon). The predominant taxa belonged to Firmicutes (genus Streptococcus, family Veillonellaceae, genus Granulicatella), Proteobacteria (genus Neisseria, Haemophilus), Actinobacteria (genus Corynebacterium, Rothia, Actinomyces), Bacteroidetes (genus Prevotella, Capnocytophaga, Porphyromonas) and Fusobacteria (genus Fusobacterium). Each individual sample harboured on average 266 "species-level" phylotypes (SD 67; range 123 - 326) with cheek samples being the least diverse and the dental samples from approximal surfaces showing the highest diversity. Principal component analysis discriminated the profiles of the samples originating from shedding surfaces (mucosa of tongue, cheek and palate) from the samples that were obtained from solid surfaces (teeth). There was a large overlap in the higher taxa, "species-level" phylotypes and unique sequences among the three microbiomes: 84% of the higher taxa, 75% of the OTUs and 65% of the unique sequences were present in at least two of the three microbiomes. The three individuals shared 1660 of 6315 unique sequences. These 1660 sequences (the "core microbiome") contributed 66% of the reads. The overlapping OTUs contributed to 94% of the reads, while nearly all reads (99.8%) belonged to the shared higher taxa. Conclusions We obtained the first insight into the diversity and uniqueness of individual oral microbiomes at a resolution of next-generation sequencing. We showed that a major proportion of bacterial sequences of unrelated healthy individuals is identical, supporting the concept of a core microbiome at health. PMID:20003481
Cho, Otomi; Sugita, Takashi
2016-12-01
As DNA sequences of the intergenic spacer (IGS) region in the rRNA gene show remarkable intraspecies diversity compared with the small subunit, large subunit, and internal transcribed spacer region, the IGS region has been used as an epidemiological tool in studies on Malassezia globosa and M. restricta, which are responsible for the exacerbation of atopic dermatitis (AD) and seborrheic dermatitis (SD). However, the IGS regions of M. sympodialis and M. dermatis obtained from the skin of patients with AD and SD, as well as healthy subjects, lacked sequence diversity. Of the 105 M. sympodialis strains and the 40 M. dermatis strains, the sequences of 103 (98.1 %) and 39 (97.5 %), respectively, were identical. Thus, given the lack of intraspecies diversity in the IGS regions of M. sympodialis and M. dermatis, studies of the diversity of these species should be performed using appropriate genes and not the IGS.
Bulgari, Daniela; Casati, Paola; Brusetti, Lorenzo; Quaglino, Fabio; Brasca, Milena; Daffonchio, Daniele; Bianco, Piero Attilio
2009-08-01
Diversity of bacterial endophytes associated with grapevine leaf tissues was analyzed by cultivation and cultivation-independent methods. In order to identify bacterial endophytes directly from metagenome, a protocol for bacteria enrichment and DNA extraction was optimized. Sequence analysis of 16S rRNA gene libraries underscored five diverse Operational Taxonomic Units (OTUs), showing best sequence matches with gamma-Proteobacteria, family Enterobacteriaceae, with a dominance of the genus Pantoea. Bacteria isolation through cultivation revealed the presence of six OTUs, showing best sequence matches with Actinobacteria, genus Curtobacterium, and with Firmicutes genera Bacillus and Enterococcus. Length Heterogeneity-PCR (LH-PCR) electrophoretic peaks from single bacterial clones were used to setup a database representing the bacterial endophytes identified in association with grapevine tissues. Analysis of healthy and phytoplasma-infected grapevine plants showed that LH-PCR could be a useful complementary tool for examining the diversity of bacterial endophytes especially for diversity survey on a large number of samples.
1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life
Mukherjee, Supratim; Seshadri, Rekha; Varghese, Neha J.; ...
2017-06-12
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster withmore » potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.« less
Bagwell, Christopher E; Liu, Xuaduan; Wu, Liyou; Zhou, Jizhong
2006-03-01
The impact of legacy nuclear waste on the compositional diversity and distribution of sulfate-reducing bacteria in a heavily contaminated subsurface aquifer was examined. dsrAB clone libraries were constructed and restriction fragment length polymorphism (RFLP) analysis used to evaluate genetic variation between sampling wells. Principal component analysis identified nickel, nitrate, technetium, and organic carbon as the primary variables contributing to well-to-well geochemical variability, although comparative sequence analysis showed the sulfate-reducing bacteria community structure to be consistent throughout contaminated and uncontaminated regions of the aquifer. Only 3% of recovered dsrAB gene sequences showed apparent membership to the Deltaproteobacteria. The remainder of recovered sequences may represent novel, deep-branching lineages that, to our knowledge, do not presently contain any cultivated members; although corresponding phylotypes have recently been reported from several different marine ecosystems. These findings imply resiliency and adaptability of sulfate-reducing bacteria to extremes in environmental conditions, although the possibility for horizontal transfer of dsrAB is also discussed.
1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mukherjee, Supratim; Seshadri, Rekha; Varghese, Neha J.
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster withmore » potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.« less
Özdemir, Ebru; Altındağ, Ahmet; Kandemir, İrfan
2017-05-01
Daphnia is a freshwater zooplankton species with controversial taxonomy due to its high morphological variation linked to environmental factors and inter-specific hybridization and polyploidy in some groups. The aim of the present study is to examine molecular diversity of some Daphnia species in Turkey and to establish DNA barcodes of Turkish Daphnia species. Sequence analysis was performed using 540 bp region of cytochrome oxidase subunit I gene of mitochondrial DNA. A total of 34 haplotypes have been identified for Turkey. Daphnia pulex complex was divided into two clades with 16.1% sequence divergence according to molecular taxonomy based on Kimura 2-parameter. The clade which was molecularly diverged from Daphnia pulex with 16.1% sequence divergence was found to show 99% similarity with Daphnia cf. pulicaria (sensu Alonso 1996) instead of Daphnia pulicaria Forbes, 1893. Furthermore, this study has contributed to Turkish zoogeography by demonstrating the distribution of Daphnia species in Turkey.
Núñez, Andrés; Amo de Paz, Guillermo; Ferencova, Zuzana; Rastrojo, Alberto; Guantes, Raúl; García, Ana M; Alcamí, Antonio; Gutiérrez-Bustillo, A Montserrat; Moreno, Diego A
2017-07-01
Pollen, fungi, and bacteria are the main microscopic biological entities present in outdoor air, causing allergy symptoms and disease transmission and having a significant role in atmosphere dynamics. Despite their relevance, a method for monitoring simultaneously these biological particles in metropolitan environments has not yet been developed. Here, we assessed the use of the Hirst-type spore trap to characterize the global airborne biota by high-throughput DNA sequencing, selecting regions of the 16S rRNA gene and internal transcribed spacer for the taxonomic assignment. We showed that aerobiological communities are well represented by this approach. The operational taxonomic units (OTUs) of two traps working synchronically compiled >87% of the total relative abundance for bacterial diversity collected in each sampler, >89% for fungi, and >97% for pollen. We found a good correspondence between traditional characterization by microscopy and genetic identification, obtaining more-accurate taxonomic assignments and detecting a greater diversity using the latter. We also demonstrated that DNA sequencing accurately detects differences in biodiversity between samples. We concluded that high-throughput DNA sequencing applied to aerobiological samples obtained with Hirst spore traps provides reliable results and can be easily implemented for monitoring prokaryotic and eukaryotic entities present in the air of urban areas. IMPORTANCE Detection, monitoring, and characterization of the wide diversity of biological entities present in the air are difficult tasks that require time and expertise in different disciplines. We have evaluated the use of the Hirst spore trap (an instrument broadly employed in aerobiological studies) to detect and identify these organisms by DNA-based analyses. Our results showed a consistent collection of DNA and a good concordance with traditional methods for identification, suggesting that these devices can be used as a tool for continuous monitoring of the airborne biodiversity, improving taxonomic resolution and characterization together. They are also suitable for acquiring novel DNA amplicon-based information in order to gain a better understanding of the biological particles present in a scarcely known environment such as the air. Copyright © 2017 American Society for Microbiology.
Núñez, Andrés; Amo de Paz, Guillermo; Ferencova, Zuzana; Rastrojo, Alberto; Guantes, Raúl; García, Ana M.; Alcamí, Antonio; Gutiérrez-Bustillo, A. Montserrat
2017-01-01
ABSTRACT Pollen, fungi, and bacteria are the main microscopic biological entities present in outdoor air, causing allergy symptoms and disease transmission and having a significant role in atmosphere dynamics. Despite their relevance, a method for monitoring simultaneously these biological particles in metropolitan environments has not yet been developed. Here, we assessed the use of the Hirst-type spore trap to characterize the global airborne biota by high-throughput DNA sequencing, selecting regions of the 16S rRNA gene and internal transcribed spacer for the taxonomic assignment. We showed that aerobiological communities are well represented by this approach. The operational taxonomic units (OTUs) of two traps working synchronically compiled >87% of the total relative abundance for bacterial diversity collected in each sampler, >89% for fungi, and >97% for pollen. We found a good correspondence between traditional characterization by microscopy and genetic identification, obtaining more-accurate taxonomic assignments and detecting a greater diversity using the latter. We also demonstrated that DNA sequencing accurately detects differences in biodiversity between samples. We concluded that high-throughput DNA sequencing applied to aerobiological samples obtained with Hirst spore traps provides reliable results and can be easily implemented for monitoring prokaryotic and eukaryotic entities present in the air of urban areas. IMPORTANCE Detection, monitoring, and characterization of the wide diversity of biological entities present in the air are difficult tasks that require time and expertise in different disciplines. We have evaluated the use of the Hirst spore trap (an instrument broadly employed in aerobiological studies) to detect and identify these organisms by DNA-based analyses. Our results showed a consistent collection of DNA and a good concordance with traditional methods for identification, suggesting that these devices can be used as a tool for continuous monitoring of the airborne biodiversity, improving taxonomic resolution and characterization together. They are also suitable for acquiring novel DNA amplicon-based information in order to gain a better understanding of the biological particles present in a scarcely known environment such as the air. PMID:28455334
Will, Christiane; Thürmer, Andrea; Wollherr, Antje; Nacke, Heiko; Herold, Nadine; Schrumpf, Marion; Gutknecht, Jessica; Wubet, Tesfaye; Buscot, François; Daniel, Rolf
2010-01-01
The diversity of bacteria in soil is enormous, and soil bacterial communities can vary greatly in structure. Here, we employed a pyrosequencing-based analysis of the V2-V3 16S rRNA gene region to characterize the overall and horizon-specific (A and B horizons) bacterial community compositions in nine grassland soils, which covered three different land use types. The entire data set comprised 752,838 sequences, 600,544 of which could be classified below the domain level. The average number of sequences per horizon was 41,824. The dominant taxonomic groups present in all samples and horizons were the Acidobacteria, Betaproteobacteria, Actinobacteria, Gammaproteobacteria, Alphaproteobacteria, Deltaproteobacteria, Chloroflexi, Firmicutes, and Bacteroidetes. Despite these overarching dominant taxa, the abundance, diversity, and composition of bacterial communities were horizon specific. In almost all cases, the estimated bacterial diversity (H′) was higher in the A horizons than in the corresponding B horizons. In addition, the H′ was positively correlated with the organic carbon content, the total nitrogen content, and the C-to-N ratio, which decreased with soil depth. It appeared that lower land use intensity results in higher bacterial diversity. The majority of sequences affiliated with the Actinobacteria, Bacteroidetes, Cyanobacteria, Fibrobacteres, Firmicutes, Spirochaetes, Verrucomicrobia, Alphaproteobacteria, Betaproteobacteria, and Gammaproteobacteria were derived from A horizons, whereas the majority of the sequences related to Acidobacteria, Chloroflexi, Gemmatimonadetes, Nitrospira, TM7, and WS3 originated from B horizons. The distribution of some bacterial phylogenetic groups and subgroups in the different horizons correlated with soil properties such as organic carbon content, total nitrogen content, or microbial biomass. PMID:20729324
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.
Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T
2016-02-24
Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic methods for larger effect sizes. The Silva/UNITE-based ghost tree presented here can be easily integrated into existing fungal analysis pipelines to enhance the resolution of fungal community differences and improve understanding of these communities in built environments. The ghost-tree software package can also be used to develop phylogenetic trees for other marker gene sets that afford different taxonomic resolution, or for bridging genome trees with amplicon trees. ghost-tree is pip-installable. All source code, documentation, and test code are available under the BSD license at https://github.com/JTFouquier/ghost-tree .
Zhang, Li; Cham, Jason; Paciorek, Alan; Trager, James; Sheikh, Nadeem; Fong, Lawrence
2017-02-27
Cancer immunotherapy has demonstrated significant clinical activity in different cancers. T cells represent a crucial component of the adaptive immune system and are thought to mediate anti-tumoral immunity. Antigen-specific recognition by T cells is via the T cell receptor (TCR) which is unique for each T cell. Next generation sequencing (NGS) of the TCRs can be used as a platform to profile the T cell repertoire. Though there are a number of software tools available for processing repertoire data by mapping antigen receptor segments to sequencing reads and assembling the clonotypes, most of them are not designed to track and examine the dynamic nature of the TCR repertoire across multiple time points or between different biologic compartments (e.g., blood and tissue samples) in a clinical context. We integrated different diversity measures to assess the T cell repertoire diversity and examined the robustness of the diversity indices. Among those tested, Clonality was identified for its robustness as a key metric for study design and the first choice to measure TCR repertoire diversity. To evaluate the dynamic nature of T cell clonotypes across time, we utilized several binary similarity measures (such as Baroni-Urbani and Buser overlap index), relative clonality and Morisita's overlap index, as well as the intraclass correlation coefficient, and performed fold change analysis, which was further extended to investigate the transition of clonotypes among different biological compartments. Furthermore, the application of differential testing enabled the detection of clonotypes which were significantly changed across time. By applying the proposed "3D" analysis pipeline to the real example of prostate cancer subjects who received sipuleucel-T, an FDA-approved immunotherapy, we were able to detect changes in TCR sequence frequency and diversity thus demonstrating that sipuleucel-T treatment affected TCR repertoire in blood and in prostate tissue. We also found that the increase in common TCR sequences between tissue and blood after sipuleucel-T treatment supported the hypothesis that treatment-induced T cell migrated into the prostate tissue. In addition, a second example of prostate cancer subjects treated with Ipilimumab and granulocyte macrophage colony stimulating factor (GM-CSF) was presented in the supplementary documents to further illustrate assessing the treatment-associated change in a clinical context by the proposed workflow. Our paper provides guidance to study the diversity and dynamics of NGS-based TCR repertoire profiling in a clinical context to ensure consistency and reproducibility of post-analysis. This analysis pipeline will provide an initial workflow for TCR sequencing data with serial time points and for comparing T cells in multiple compartments for a clinical study.
Poimenidou, Sofia V; Dalmasso, Marion; Papadimitriou, Konstantinos; Fox, Edward M; Skandamis, Panagiotis N; Jordan, Kieran
2018-01-01
The prfA -virulence gene cluster ( p VGC) is the main pathogenicity island in Listeria monocytogenes , comprising the prfA, plcA, hly, mpl, actA , and plcB genes. In this study, the p VGC of 36 L. monocytogenes isolates with respect to different serotypes (1/2a or 4b), geographical origin (Australia, Greece or Ireland) and isolation source (food-associated or clinical) was characterized. The most conserved genes were prfA and hly , with the lowest nucleotide diversity (π) among all genes ( P < 0.05), and the lowest number of alleles, substitutions and non-synonymous substitutions for prfA . Conversely, the most diverse gene was actA , which presented the highest number of alleles ( n = 20) and showed the highest nucleotide diversity. Grouping by serotype had a significantly lower π value ( P < 0.0001) compared to isolation source or geographical origin, suggesting a distinct and well-defined unit compared to other groupings. Among all tested genes, only hly and mpl were those with lower nucleotide diversity in 1/2a serotype than 4b serotype, reflecting a high within-1/2a serotype divergence compared to 4b serotype. Geographical divergence was noted with respect to the hly gene, where serotype 4b Irish strains were distinct from Greek and Australian strains. Australian strains showed less diversity in plcB and mpl relative to Irish or Greek strains. Notable differences regarding sequence mutations were identified between food-associated and clinical isolates in prfA, actA , and plcB sequences. Overall, these results indicate that virulence genes follow different evolutionary pathways, which are affected by a strain's origin and serotype and may influence virulence and/or epidemiological dominance of certain subgroups.
Sierra-Garcia, Isabel Natalia; Dellagnezze, Bruna M; Santos, Viviane P; Chaves B, Michel R; Capilla, Ramsés; Santos Neto, Eugenio V; Gray, Neil; Oliveira, Valeria M
2017-01-01
Microorganisms have shown their ability to colonize extreme environments including deep subsurface petroleum reservoirs. Physicochemical parameters may vary greatly among petroleum reservoirs worldwide and so do the microbial communities inhabiting these different environments. The present work aimed at the characterization of the microbiota in biodegraded and non-degraded petroleum samples from three Brazilian reservoirs and the comparison of microbial community diversity across oil reservoirs at local and global scales using 16S rRNA clone libraries. The analysis of 620 16S rRNA bacterial and archaeal sequences obtained from Brazilian oil samples revealed 42 bacterial OTUs and 21 archaeal OTUs. The bacterial community from the degraded oil was more diverse than the non-degraded samples. Non-degraded oil samples were overwhelmingly dominated by gammaproteobacterial sequences with a predominance of the genera Marinobacter and Marinobacterium. Comparisons of microbial diversity among oil reservoirs worldwide suggested an apparent correlation of prokaryotic communities with reservoir temperature and depth and no influence of geographic distance among reservoirs. The detailed analysis of the phylogenetic diversity across reservoirs allowed us to define a core microbiome encompassing three bacterial classes (Gammaproteobacteria, Clostridia, and Bacteroidia) and one archaeal class (Methanomicrobia) ubiquitous in petroleum reservoirs and presumably owning the abilities to sustain life in these environments.
Geleta, Mulatu; Herrera, Isabel; Monzón, Arnulfo; Bryngelsson, Tomas
2012-01-01
Coffea arabica L. (arabica coffee), the only tetraploid species in the genus Coffea, represents the majority of the world's coffee production and has a significant contribution to Nicaragua's economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR) markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei's gene diversity (H T) and the within-population gene diversity (H S) were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA) revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (F ST = 0.13; P < 0.001). The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety. PMID:22701376
Bondici, V F; Lawrence, J R; Khan, N H; Hill, J E; Yergeau, E; Wolfaardt, G M; Warner, J; Korber, D R
2013-06-01
To describe the diversity and metabolic potential of microbial communities in uranium mine tailings characterized by high pH, high metal concentration and low permeability. To assess microbial diversity and their potential to influence the geochemistry of uranium mine tailings using aerobic and anaerobic culture-based methods, in conjunction with next generation sequencing and clone library sequencing targeting two universal bacterial markers (the 16S rRNA and cpn60 genes). Growth assays revealed that 69% of the 59 distinct culturable isolates evaluated were multiple-metal resistant, with 15% exhibiting dual-metal hypertolerance. There was a moderately positive correlation coefficient (R = 0·43, P < 0·05) between multiple-metal resistance of the isolates and their enzyme expression profile. Of the isolates tested, 17 reduced amorphous iron, 22 reduced molybdate and seven oxidized arsenite. Based on next generation sequencing, tailings depth was shown to influence bacterial community composition, with the difference in the microbial diversity of the upper (0-20 m) and middle (20-40 m) tailings zones being highly significant (P < 0·01) from the lower zone (40-60 m) and the difference in diversity of the upper and middle tailings zone being significant (P < 0·05). Phylotypes closely related to well-known sulfate-reducing and iron-reducing bacteria were identified with low abundance, yet relatively high diversity. The presence of a population of metabolically-diverse, metal-resistant micro-organisms within the tailings environment, along with their demonstrated capacity for transforming metal elements, suggests that these organisms have the potential to influence the long-term geochemistry of the tailings. This study is the first investigation of the diversity and functional potential of micro-organisms present in low permeability, high pH uranium mine tailings. © 2013 The Society for Applied Microbiology.
Actinobacterial Diversity in Volcanic Caves and Associated Geomicrobiological Interactions
Riquelme, Cristina; Marshall Hathaway, Jennifer J.; Enes Dapkevicius, Maria de L. N.; Miller, Ana Z.; Kooser, Ara; Northup, Diana E.; Jurado, Valme; Fernandez, Octavio; Saiz-Jimenez, Cesareo; Cheeptham, Naowarat
2015-01-01
Volcanic caves are filled with colorful microbial mats on the walls and ceilings. These volcanic caves are found worldwide, and studies are finding vast bacteria diversity within these caves. One group of bacteria that can be abundant in volcanic caves, as well as other caves, is Actinobacteria. As Actinobacteria are valued for their ability to produce a variety of secondary metabolites, rare and novel Actinobacteria are being sought in underexplored environments. The abundance of novel Actinobacteria in volcanic caves makes this environment an excellent location to study these bacteria. Scanning electron microscopy (SEM) from several volcanic caves worldwide revealed diversity in the morphologies present. Spores, coccoid, and filamentous cells, many with hair-like or knobby extensions, were some of the microbial structures observed within the microbial mat samples. In addition, the SEM study pointed out that these features figure prominently in both constructive and destructive mineral processes. To further investigate this diversity, we conducted both Sanger sequencing and 454 pyrosequencing of the Actinobacteria in volcanic caves from four locations, two islands in the Azores, Portugal, and Hawai'i and New Mexico, USA. This comparison represents one of the largest sequencing efforts of Actinobacteria in volcanic caves to date. The diversity was shown to be dominated by Actinomycetales, but also included several newly described orders, such as Euzebyales, and Gaiellales. Sixty-two percent of the clones from the four locations shared less than 97% similarity to known sequences, and nearly 71% of the clones were singletons, supporting the commonly held belief that volcanic caves are an untapped resource for novel and rare Actinobacteria. The amplicon libraries depicted a wider view of the microbial diversity in Azorean volcanic caves revealing three additional orders, Rubrobacterales, Solirubrobacterales, and Coriobacteriales. Studies of microbial ecology in volcanic caves are still very limited. To rectify this deficiency, the results from our study help fill in the gaps in our knowledge of actinobacterial diversity and their potential roles in the volcanic cave ecosystems. PMID:26696966
Modelling the Dust Around Vega-Like Stars
NASA Technical Reports Server (NTRS)
Sylvester, Roger J.; Skinner, C. J.; Barlow, M. J.
1996-01-01
Models are presented of four Vega-like stars: main-sequence stars with infrared emission from circumstellar dust. The dusty environments of the four stars are rather diverse, as shown by their spectral energy distributions. Good fits to the observations were obtained for all four stars.
Amaral, Wellington Z; Lubach, Gabriele R; Kapoor, Amita; Proctor, Alexandra; Phillips, Gregory J; Lyte, Mark; Coe, Christopher L
2017-10-01
The lower reproductive tract of nonhuman primates is colonized with a diverse microbiota, resembling bacterial vaginosis (BV), a gynecological condition associated with negative reproductive outcomes in women. Our 4 aims were to: (i) assess the prevalence of low Lactobacilli and a BV-like profile in female rhesus monkeys; (ii) quantify cytokines in their cervicovaginal fluid (CVF); (iii) examine the composition and structure of their mucosal microbiota with culture-independent sequencing methods; and (iv) evaluate the potential influence on reproductive success. CVF specimens were obtained from 27 female rhesus monkeys for Gram's staining, and to determine acidity (pH), and quantify proinflammatory cytokines. Based on Nugent's classification, 40% had a score of 7 or higher, which would be indicative of BV in women. Nugent scores were significantly correlated with the pH of the CVF. Interleukin-1ß was present at high concentrations, but not further elevated by high Nugent scores. Vaginal swabs were obtained from eight additional females to determine microbial diversity by rRNA gene amplicon sequencing. At the phylum level, the Firmicutes/Bacteroidetes ratio was low. The relative abundance of Lactobacilli was also low (between 3% and 17%), and 11 other genera were present at >1%. However, neither the microbial diversity in the community structure, nor high Nugent scores, was associated with reduced fecundity. Female monkeys provide an opportunity to understand how reproductive success can be sustained in the presence of a diverse polymicrobial community in the reproductive tract. © 2017 Wiley Periodicals, Inc.
Serotype and genetic diversity of human rhinovirus strains that circulated in Kenya in 2008.
Milanoi, Sylvia; Ongus, Juliette R; Gachara, George; Coldren, Rodney; Bulimo, Wallace
2016-05-01
Human rhinoviruses (HRVs) are a well-established cause of the common cold and recent studies indicated that they may be associated with severe acute respiratory illnesses (SARIs) like pneumonia, asthma, and bronchiolitis. Despite global studies on the genetic diversity of the virus, the serotype diversity of these viruses across diverse geographic regions in Kenya has not been characterized. This study sought to characterize the serotype diversity of HRV strains that circulated in Kenya in 2008. A total of 517 archived nasopharyngeal samples collected in a previous respiratory virus surveillance program across Kenya in 2008 were selected. Participants enrolled were outpatients who presented with influenza-like (ILI) symptoms. Real-time RT-PCR was employed for preliminary HRV detection. HRV-positive samples were amplified using RT-PCR and thereafter the nucleotide sequences of the amplicons were determined followed by phylogenetic analysis. Twenty-five percent of the samples tested positive for HRV. Phylogenetic analysis revealed that the Kenyan HRVs clustered into three main species comprising HRV-A (54%), HRV-B (12%), and HRV-C (35%). Overall, 20 different serotypes were identified. Intrastrain sequence homology among the Kenyan strains ranged from 58% to 100% at the nucleotide level and 55% to 100% at the amino acid level. These results show that a wide range of HRV serotypes with different levels of nucleotide variation were present in Kenya. Furthermore, our data show that HRVs contributed substantially to influenza-like illness in Kenya in 2008. © 2016 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
Zhang, Haihan; Huang, Tinglin; Liu, Tingting
2013-01-01
Drinking water reservoir plays a vital role in the security of urban water supply, yet little is known about microbial community diversity harbored in the sediment of this oligotrophic freshwater environmental ecosystem. In the present study, integrating community level physiological profiles (CLPPs), nested polymerase chain reaction (PCR)-denaturing gradient gel electrophoresis (DGGE) and clone sequence technologies, we examined the sediment urease and protease activities, bacterial community functional diversity, genetic diversity of bacterial and fungal communities in sediments from six sampling sites of Zhou cun drinking water reservoir, eastern China. The results showed that sediment urease activity was markedly distinct along the sites, ranged from 2.48 to 11.81 mg NH3-N/(g·24h). The highest average well color development (AWCD) was found in site C, indicating the highest metabolic activity of heterotrophic bacterial community. Principal component analysis (PCA) revealed tremendous differences in the functional (metabolic) diversity patterns of the sediment bacterial communities from different sites. Meanwhile, DGGE fingerprints also indicated spatial changes of genetic diversity of sediment bacterial and fungal communities. The sequence BLAST analysis of all the sediment samples found that Comamonas sp. was the dominant bacterial species harbored in site A. Alternaria alternate, Allomyces macrogynus and Rhizophydium sp. were most commonly detected fungal species in sediments of the Zhou cun drinking water reservoir. The results from this work provide new insights about the heterogeneity of sediment microbial community metabolic activity and genetic diversity in the oligotrophic drinking water reservoir. PMID:24205265
Diversity of immunoglobulin lambda light chain gene usage over developmental stages in the horse.
Tallmadge, Rebecca L; Tseng, Chia T; Felippe, M Julia B
2014-10-01
To further studies of neonatal immune responses to pathogens and vaccination, we investigated the dynamics of B lymphocyte development and immunoglobulin (Ig) gene diversity. Previously we demonstrated that equine fetal Ig VDJ sequences exhibit combinatorial and junctional diversity levels comparable to those of adult Ig VDJ sequences. Herein, RACE clones from fetal, neonatal, foal, and adult lymphoid tissue were assessed for Ig lambda light chain combinatorial, junctional, and sequence diversity. Remarkably, more lambda variable genes (IGLV) were used during fetal life than later stages and IGLV gene usage differed significantly with time, in contrast to the Ig heavy chain. Junctional diversity measured by CDR3L length was constant over time. Comparison of Ig lambda transcripts to germline revealed significant increases in nucleotide diversity over time, even during fetal life. These results suggest that the Ig lambda light chain provides an additional dimension of diversity to the equine Ig repertoire. Copyright © 2014 Elsevier Ltd. All rights reserved.
Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan
2012-01-01
This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques.
Pitkäranta, Miia; Meklin, Teija; Hyvärinen, Anne; Nevalainen, Aino; Paulin, Lars; Auvinen, Petri; Lignell, Ulla; Rintala, Helena
2011-10-21
Indoor microbial contamination due to excess moisture is an important contributor to human illness in both residential and occupational settings. However, the census of microorganisms in the indoor environment is limited by the use of selective, culture-based detection techniques. By using clone library sequencing of full-length internal transcribed spacer region combined with quantitative polymerase chain reaction (qPCR) for 69 fungal species or assay groups and cultivation, we have been able to generate a more comprehensive description of the total indoor mycoflora. Using this suite of methods, we assessed the impact of moisture damage on the fungal community composition of settled dust and building material samples (n = 8 and 16, correspondingly). Water-damaged buildings (n = 2) were examined pre- and post- remediation, and compared with undamaged reference buildings (n = 2). Culture-dependent and independent methods were consistent in the dominant fungal taxa in dust, but sequencing revealed a five to ten times higher diversity at the genus level than culture or qPCR. Previously unknown, verified fungal phylotypes were detected in dust, accounting for 12% of all diversity. Fungal diversity, especially within classes Dothideomycetes and Agaricomycetes tended to be higher in the water damaged buildings. Fungal phylotypes detected in building materials were present in dust samples, but their proportion of total fungi was similar for damaged and reference buildings. The quantitative correlation between clone library phylotype frequencies and qPCR counts was moderate (r = 0.59, p < 0.01). We examined a small number of target buildings and found indications of elevated fungal diversity associated with water damage. Some of the fungi in dust were attributable to building growth, but more information on the material-associated communities is needed in order to understand the dynamics of microbial communities between building structures and dust. The sequencing-based method proved indispensable for describing the true fungal diversity in indoor environments. However, making conclusions concerning the effect of building conditions on building mycobiota using this methodology was complicated by the wide natural diversity in the dust samples, the incomplete knowledge of material-associated fungi fungi and the semiquantitative nature of sequencing based methods.
Jeffery, Nicholas W; Elías-Gutiérrez, Manuel; Adamowicz, Sarah J
2011-01-01
The region of Churchill, Manitoba, contains a wide variety of habitats representative of both the boreal forest and arctic tundra and has been used as a model site for biodiversity studies for nearly seven decades within Canada. Much previous work has been done in Churchill to study the Daphnia pulex species complex in particular, but no study has completed a wide-scale survey on the crustacean species that inhabit Churchill's aquatic ecosystems using molecular markers. We have employed DNA barcoding to study the diversity of the Branchiopoda (Crustacea) in a wide variety of freshwater habitats and to determine the likely origins of the Churchill fauna following the last glaciation. The standard animal barcode marker (COI) was sequenced for 327 specimens, and a 3% divergence threshold was used to delineate potential species. We found 42 provisional and valid branchiopod species from this survey alone, including several cryptic lineages, in comparison with the 25 previously recorded from previous ecological works. Using published sequence data, we explored the phylogeographic affinities of Churchill's branchiopods, finding that the Churchill fauna apparently originated from all directions from multiple glacial refugia (including southern, Beringian, and high arctic regions). Overall, these microcrustaceans are very diverse in Churchill and contain multiple species complexes. The present study introduces among the first sequences for some understudied genera, for which further work is required to delineate species boundaries and develop a more complete understanding of branchiopod diversity over a larger spatial scale.
Pretzer, Carina; Druzhinina, Irina S; Amaro, Carmen; Benediktsdóttir, Eva; Hedenström, Ingela; Hervio-Heath, Dominique; Huhulescu, Steliana; Schets, Franciska M; Farnleitner, Andreas H; Kirschner, Alexander K T
2017-01-01
Coastal marine Vibrio cholerae populations usually exhibit high genetic diversity. To assess the genetic diversity of abundant V. cholerae non-O1/non-O139 populations in the Central European lake Neusiedler See, we performed a phylogenetic analysis based on recA, toxR, gyrB and pyrH loci sequenced for 472 strains. The strains were isolated from three ecologically different habitats in a lake that is a hot-spot of migrating birds and an important bathing water. We also analyzed 76 environmental and human V. cholerae non-O1/non-O139 isolates from Austria and other European countries and added sequences of seven genome-sequenced strains. Phylogenetic analysis showed that the lake supports a unique endemic diversity of V. cholerae that is particularly rich in the reed stand. Phylogenetic trees revealed that many V. cholerae isolates from European countries were genetically related to the strains present in the lake belonging to statistically supported monophyletic clades. We hypothesize that the observed phenomena can be explained by the high degree of genetic recombination that is particularly intensive in the reed stand, acting along with the long distance transfer of strains most probably via birds and/or humans. Thus, the Neusiedler See may serve as a bioreactor for the appearance of new strains with new (pathogenic) properties. © 2016 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Burrell, A Millie; Pepper, Alan E; Hodnett, George; Goolsby, John A; Overholt, William A; Racelis, Alexis E; Diaz, Rodrigo; Klein, Patricia E
2015-05-01
Imperata cylindrica (Cogongrass, Speargrass) is a diploid C4 grass that is a noxious weed in 73 countries and constitutes a significant threat to global biodiversity and sustainable agriculture. We used a cost-effective genotyping-by-sequencing (GBS) approach to identify the reproductive system, genetic diversity and geographic origins of invasions in the south-eastern United States. In this work, we demonstrated the advantage of employing the closely related, fully sequenced crop species Sorghum bicolor (L.) Moench as a proxy reference genome to identify a set of 2320 informative single nucleotide and insertion-deletion polymorphisms. Genetic analyses identified four clonal lineages of cogongrass and one clonal lineage of Imperata brasiliensis Trin. in the United States. Each lineage was highly homogeneous, and we found no evidence of hybridization among the different lineages, despite geographical overlap. We found evidence that at least three of these lineages showed clonal reproduction prior to introduction to the United States. These results indicate that cogongrass has limited evolutionary potential to adapt to novel environments and further suggest that upon arrival to its invaded range, this species did not require local adaptation through hybridization/introgression or selection of favourable alleles from a broad genetic base. Thus, cogongrass presents a clear case of broad invasive success, across a diversity of environments, in a clonal organism with limited genetic diversity. © 2015 John Wiley & Sons Ltd.
ITS2 sequence-structure phylogeny reveals diverse endophytic Pseudocercospora fungi on poplars.
Yan, Dong-Hui; Gao, Qian; Sun, Xiaoming; Song, Xiaoyu; Li, Hongchang
2018-04-01
For matching the new fungal nomenclature to abolish pleomorphic names for a fungus, a genus Pseudocercospora s. str. was suggested to host holomorphic Pseudocercosproa fungi. But the Pseudocercosproa fungi need extra phylogenetic loci to clarify their taxonomy and diversity for their existing and coming species. Internal transcribed spacer 2 (ITS2) secondary structures have been promising in charactering species phylogeny in plants, animals and fungi. In present study, a conserved model of ITS2 secondary structures was confirmed on fungi in Pseudocercospora s. str. genus using RNAshape program. The model has a typical eukaryotic four-helix ITS2 secondary structure. But a single U base occurred in conserved motif of U-U mismatch in Helix 2, and a UG emerged in UGGU motif in Helix 3 to Pseudocercospora fungi. The phylogeny analyses based on the ITS2 sequence-secondary structures with compensatory base change characterizations are able to delimit more species for Pseudocercospora s. str. than phylogenic inferences of traditional multi-loci alignments do. The model was employed to explore the diversity of endophytic Pseudocercospora fungi in poplar trees. The analysis results also showed that endophytic Pseudocercospora fungi were diverse in species and evolved a specific lineage in poplar trees. This work suggested that ITS2 sequence-structures could become as additionally significant loci for species phylogenetic and taxonomic studies on Pseudocerospora fungi, and that Pseudocercospora endophytes could be important roles to Pseudocercospora fungi's evolution and function in ecology.
Jeffery, Nicholas W.; Elías-Gutiérrez, Manuel; Adamowicz, Sarah J.
2011-01-01
The region of Churchill, Manitoba, contains a wide variety of habitats representative of both the boreal forest and arctic tundra and has been used as a model site for biodiversity studies for nearly seven decades within Canada. Much previous work has been done in Churchill to study the Daphnia pulex species complex in particular, but no study has completed a wide-scale survey on the crustacean species that inhabit Churchill's aquatic ecosystems using molecular markers. We have employed DNA barcoding to study the diversity of the Branchiopoda (Crustacea) in a wide variety of freshwater habitats and to determine the likely origins of the Churchill fauna following the last glaciation. The standard animal barcode marker (COI) was sequenced for 327 specimens, and a 3% divergence threshold was used to delineate potential species. We found 42 provisional and valid branchiopod species from this survey alone, including several cryptic lineages, in comparison with the 25 previously recorded from previous ecological works. Using published sequence data, we explored the phylogeographic affinities of Churchill's branchiopods, finding that the Churchill fauna apparently originated from all directions from multiple glacial refugia (including southern, Beringian, and high arctic regions). Overall, these microcrustaceans are very diverse in Churchill and contain multiple species complexes. The present study introduces among the first sequences for some understudied genera, for which further work is required to delineate species boundaries and develop a more complete understanding of branchiopod diversity over a larger spatial scale. PMID:21610864
Oh, Jeongsu; Choi, Chi-Hwan; Park, Min-Kyu; Kim, Byung Kwon; Hwang, Kyuin; Lee, Sang-Heon; Hong, Soon Gyu; Nasir, Arshan; Cho, Wan-Sup; Kim, Kyung Mo
2016-01-01
High-throughput sequencing can produce hundreds of thousands of 16S rRNA sequence reads corresponding to different organisms present in the environmental samples. Typically, analysis of microbial diversity in bioinformatics starts from pre-processing followed by clustering 16S rRNA reads into relatively fewer operational taxonomic units (OTUs). The OTUs are reliable indicators of microbial diversity and greatly accelerate the downstream analysis time. However, existing hierarchical clustering algorithms that are generally more accurate than greedy heuristic algorithms struggle with large sequence datasets. To keep pace with the rapid rise in sequencing data, we present CLUSTOM-CLOUD, which is the first distributed sequence clustering program based on In-Memory Data Grid (IMDG) technology-a distributed data structure to store all data in the main memory of multiple computing nodes. The IMDG technology helps CLUSTOM-CLOUD to enhance both its capability of handling larger datasets and its computational scalability better than its ancestor, CLUSTOM, while maintaining high accuracy. Clustering speed of CLUSTOM-CLOUD was evaluated on published 16S rRNA human microbiome sequence datasets using the small laboratory cluster (10 nodes) and under the Amazon EC2 cloud-computing environments. Under the laboratory environment, it required only ~3 hours to process dataset of size 200 K reads regardless of the complexity of the human microbiome data. In turn, one million reads were processed in approximately 20, 14, and 11 hours when utilizing 20, 30, and 40 nodes on the Amazon EC2 cloud-computing environment. The running time evaluation indicates that CLUSTOM-CLOUD can handle much larger sequence datasets than CLUSTOM and is also a scalable distributed processing system. The comparative accuracy test using 16S rRNA pyrosequences of a mock community shows that CLUSTOM-CLOUD achieves higher accuracy than DOTUR, mothur, ESPRIT-Tree, UCLUST and Swarm. CLUSTOM-CLOUD is written in JAVA and is freely available at http://clustomcloud.kopri.re.kr.
Park, Min-Kyu; Kim, Byung Kwon; Hwang, Kyuin; Lee, Sang-Heon; Hong, Soon Gyu; Nasir, Arshan; Cho, Wan-Sup; Kim, Kyung Mo
2016-01-01
High-throughput sequencing can produce hundreds of thousands of 16S rRNA sequence reads corresponding to different organisms present in the environmental samples. Typically, analysis of microbial diversity in bioinformatics starts from pre-processing followed by clustering 16S rRNA reads into relatively fewer operational taxonomic units (OTUs). The OTUs are reliable indicators of microbial diversity and greatly accelerate the downstream analysis time. However, existing hierarchical clustering algorithms that are generally more accurate than greedy heuristic algorithms struggle with large sequence datasets. To keep pace with the rapid rise in sequencing data, we present CLUSTOM-CLOUD, which is the first distributed sequence clustering program based on In-Memory Data Grid (IMDG) technology–a distributed data structure to store all data in the main memory of multiple computing nodes. The IMDG technology helps CLUSTOM-CLOUD to enhance both its capability of handling larger datasets and its computational scalability better than its ancestor, CLUSTOM, while maintaining high accuracy. Clustering speed of CLUSTOM-CLOUD was evaluated on published 16S rRNA human microbiome sequence datasets using the small laboratory cluster (10 nodes) and under the Amazon EC2 cloud-computing environments. Under the laboratory environment, it required only ~3 hours to process dataset of size 200 K reads regardless of the complexity of the human microbiome data. In turn, one million reads were processed in approximately 20, 14, and 11 hours when utilizing 20, 30, and 40 nodes on the Amazon EC2 cloud-computing environment. The running time evaluation indicates that CLUSTOM-CLOUD can handle much larger sequence datasets than CLUSTOM and is also a scalable distributed processing system. The comparative accuracy test using 16S rRNA pyrosequences of a mock community shows that CLUSTOM-CLOUD achieves higher accuracy than DOTUR, mothur, ESPRIT-Tree, UCLUST and Swarm. CLUSTOM-CLOUD is written in JAVA and is freely available at http://clustomcloud.kopri.re.kr. PMID:26954507
Low Diversity in the Mitogenome of Sperm Whales Revealed by Next-Generation Sequencing
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C. Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity. PMID:23254394
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing.
Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C Scott
2013-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity.
High-throughput sequencing reveals unprecedented diversities of Aspergillus species in outdoor air.
Lee, S; An, C; Xu, S; Lee, S; Yamamoto, N
2016-09-01
This study used the Illumina MiSeq to analyse compositions and diversities of Aspergillus species in outdoor air. The seasonal air samplings were performed at two locations in Seoul, South Korea. The results showed the relative abundances of all Aspergillus species combined ranging from 0·20 to 18% and from 0·19 to 21% based on the number of the internal transcribed spacer 1 (ITS1) and β-tubulin (BenA) gene sequences respectively. Aspergillus fumigatus was the most dominant species with the mean relative abundances of 1·2 and 5·5% based on the number of the ITS1 and BenA sequences respectively. A total of 29 Aspergillus species were detected and identified down to the species rank, among which nine species were known opportunistic pathogens. Remarkably, eight of the nine pathogenic species were detected by either one of the two markers, suggesting the need of using multiple markers and/or primer pairs when the assessments are made based on the high-throughput sequencing. Due to diversity of species within the genus Aspergillus, the high-throughput sequencing was useful to characterize their compositions and diversities in outdoor air, which are thought to be difficult to be accurately characterized by conventional culture and/or Sanger sequencing-based techniques. Aspergillus is a diverse genus of fungi with more than 300 species reported in literature. Aspergillus is important since some species are known allergens and opportunistic human pathogens. Traditionally, growth-dependent methods have been used to detect Aspergillus species in air. However, these methods are limited in the number of isolates that can be analysed for their identities, resulting in inaccurate characterizations of Aspergillus diversities. This study used the high-throughput sequencing to explore Aspergillus diversities in outdoor, which are thought to be difficult to be accurately characterized by traditional growth-dependent techniques. © 2016 The Society for Applied Microbiology.
Paparini, Andrea; Yang, Rongchang; Chen, Linda; Tong, Kaising; Gibson-Kueh, Susan; Lymbery, Alan; Ryan, Una M
2017-11-01
Currently, the systematics, biology and epidemiology of piscine Cryptosporidium species are poorly understood. Here, we compared Sanger ‒ and next-generation ‒ sequencing (NGS), of piscine Cryptosporidium, at the 18S rRNA and actin genes. The hosts comprised 11 ornamental fish species, spanning four orders and eight families. The objectives were: to (i) confirm the rich genetic diversity of the parasite and the high frequency of mixed infections; and (ii) explore the potential of NGS in the presence of complex genetic mixtures. By Sanger sequencing, four main genotypes were obtained at the actin locus, while for the 18S locus, seven genotypes were identified. At both loci, NGS revealed frequent mixed infections, consisting of one highly dominant variant plus substantially rarer genotypes. Both sequencing methods detected novel Cryptosporidium genotypes at both loci, including a novel and highly abundant actin genotype that was identified by both Sanger sequencing and NGS. Importantly, this genotype accounted for 68·9% of all NGS reads from all samples (249 585/362 372). The present study confirms that aquarium fish can harbour a large and unexplored Cryptosporidium genetic diversity. Although commonly used in molecular parasitology studies, nested PCR prevents quantitative comparisons and thwarts the advantages of NGS, when this latter approach is used to investigate multiple infections.
Tsuchiaka, Shinobu; Naoi, Yuki; Imai, Ryo; Masuda, Tsuneyuki; Ito, Mika; Akagami, Masataka; Ouchi, Yoshinao; Ishii, Kazuo; Sakaguchi, Shoichi; Omatsu, Tsutomu; Katayama, Yukie; Oba, Mami; Shirai, Junsuke; Satani, Yuki; Takashima, Yasuhiro; Taniguchi, Yuji; Takasu, Masaki; Madarame, Hiroo; Sunaga, Fujiko; Aoki, Hiroshi; Makino, Shinji; Mizutani, Tetsuya; Nagai, Makoto
2018-01-01
To study the genetic diversity of enterovirus G (EV-G) among Japanese pigs, metagenomics sequencing was performed on fecal samples from pigs with or without diarrhea, collected between 2014 and 2016. Fifty-nine EV-G sequences, which were >5,000 nucleotides long, were obtained. By complete VP1 sequence analysis, Japanese EV-G isolates were classified into G1 (17 strains), G2 (four strains), G3 (22 strains), G4 (two strains), G6 (two strains), G9 (six strains), G10 (five strains), and a new genotype (one strain). Remarkably, 16 G1 and one G2 strain identified in diarrheic (23.5%; four strains) or normal (76.5%; 13 strains) fecal samples possessed a papain-like cysteine protease (PL-CP) sequence, which was recently found in the USA and Belgium in the EV-G genome, at the 2C-3A junction site. This paper presents the first report of the high prevalence of viruses carrying PL-CP in the EV-G population. Furthermore, possible inter- and intragenotype recombination events were found among EV-G strains, including G1-PL-CP strains. Our findings may advance the understanding of the molecular epidemiology and genetic evolution of EV-Gs.
Zhang, Gengxin; Dong, Hailiang; Xu, Zhiqin; Zhao, Donggao; Zhang, Chuanlun
2005-06-01
Microbial communities in ultra-high-pressure (UHP) rocks and drilling fluids from the Chinese Continental Scientific Drilling Project were characterized. The rocks had a porosity of 1 to 3.5% and a permeability of approximately 0.5 mDarcy. Abundant fluid and gas inclusions were present in the minerals. The rocks contained significant amounts of Fe2O3, FeO, P2O5, and nitrate (3 to 16 ppm). Acridine orange direct counting and phospholipid fatty acid analysis indicated that the total counts in the rocks and the fluids were 5.2 x 10(3) to 2.4 x 10(4) cells/g and 3.5 x 10(8) to 4.2 x 10(9) cells/g, respectively. Enrichment assays resulted in successful growth of thermophilic and alkaliphilic bacteria from the fluids, and some of these bacteria reduced Fe(III) to magnetite. 16S rRNA gene analyses indicated that the rocks were dominated by sequences similar to sequences of Proteobacteria and that most organisms were related to nitrate reducers from a saline, alkaline, cold habitat; however, some phylotypes were either members of a novel lineage or closely related to uncultured clones. The bacterial communities in the fluids were more diverse and included Proteobacteria, Bacteroidetes, gram-positive bacteria, Planctomycetes, and Candidatus taxa. The archaeal diversity was lower, and most sequences were not related to any known cultivated species. Some archaeal sequences were 90 to 95% similar to sequences recovered from ocean sediments or other subsurface environments. Some archaeal sequences from the drilling fluids were >93% similar to sequences of Sulfolobus solfataricus, and the thermophilic nature was consistent with the in situ temperature. We inferred that the microbes in the UHP rocks reside in fluid and gas inclusions, whereas those in the drilling fluids may be derived from subsurface fluids.
Zhang, Gengxin; Dong, Hailiang; Xu, Zhiqin; Zhao, Donggao; Zhang, Chuanlun
2005-01-01
Microbial communities in ultra-high-pressure (UHP) rocks and drilling fluids from the Chinese Continental Scientific Drilling Project were characterized. The rocks had a porosity of 1 to 3.5% and a permeability of ∼0.5 mDarcy. Abundant fluid and gas inclusions were present in the minerals. The rocks contained significant amounts of Fe2O3, FeO, P2O5, and nitrate (3 to 16 ppm). Acridine orange direct counting and phospholipid fatty acid analysis indicated that the total counts in the rocks and the fluids were 5.2 × 103 to 2.4 × 104 cells/g and 3.5 × 108 to 4.2 × 109 cells/g, respectively. Enrichment assays resulted in successful growth of thermophilic and alkaliphilic bacteria from the fluids, and some of these bacteria reduced Fe(III) to magnetite. 16S rRNA gene analyses indicated that the rocks were dominated by sequences similar to sequences of Proteobacteria and that most organisms were related to nitrate reducers from a saline, alkaline, cold habitat; however, some phylotypes were either members of a novel lineage or closely related to uncultured clones. The bacterial communities in the fluids were more diverse and included Proteobacteria, Bacteroidetes, gram-positive bacteria, Planctomycetes, and Candidatus taxa. The archaeal diversity was lower, and most sequences were not related to any known cultivated species. Some archaeal sequences were 90 to 95% similar to sequences recovered from ocean sediments or other subsurface environments. Some archaeal sequences from the drilling fluids were >93% similar to sequences of Sulfolobus solfataricus, and the thermophilic nature was consistent with the in situ temperature. We inferred that the microbes in the UHP rocks reside in fluid and gas inclusions, whereas those in the drilling fluids may be derived from subsurface fluids. PMID:15933024
Rhizosphere bacteriome of the medicinal plant Sapindus saponaria L. revealed by pyrosequencing.
Garcia, A; Polonio, J C; Polli, A D; Santos, C M; Rhoden, S A; Quecine, M C; Azevedo, J L; Pamphile, J A
2016-11-03
Sapindus saponaria L. of Sapindaceae family is popularly known as soldier soap and is found in Central and South America. A study of such medicinal plants might reveal a more complex diversity of microorganisms as compared to non-medicinal plants, considering their metabolic potential and the chemical communication between their natural microbiota. Rhizosphere is a highly diverse microbial habitat with respect to both the diversity of species and the size of the community. Rhizosphere bacteriome associated with medicinal plant S. saponaria is still poorly known. The objective of this study was to assess the rhizosphere microbiome of the medicinal plant S. saponaria using pyrosequencing, a culture-independent approach that is increasingly being used to estimate the number of bacterial species present in different environments. In their rhizosphere microbiome, 26 phyla were identified from 5089 sequences of 16S rRNA gene, with a predominance of Actinobacteria (33.54%), Acidobacteria (22.62%), and Proteobacteria (24.72%). The rarefaction curve showed a linear increase, with 2660 operational taxonomic units at 3% distance sequence dissimilarity, indicating that the rhizosphere microbiome associated with S. saponaria was highly diverse with groups of bacteria important for soil management, which could be further exploited for agricultural and biotechnological purposes.
Diversity and Evolution in the Genome of Clostridium difficile
Knight, Daniel R.; Elliott, Briony; Chang, Barbara J.; Perkins, Timothy T.
2015-01-01
SUMMARY Clostridium difficile infection (CDI) is the leading cause of antimicrobial and health care-associated diarrhea in humans, presenting a significant burden to global health care systems. In the last 2 decades, PCR- and sequence-based techniques, particularly whole-genome sequencing (WGS), have significantly furthered our knowledge of the genetic diversity, evolution, epidemiology, and pathogenicity of this once enigmatic pathogen. C. difficile is taxonomically distinct from many other well-known clostridia, with a diverse population structure comprising hundreds of strain types spread across at least 6 phylogenetic clades. The C. difficile species is defined by a large diverse pangenome with extreme levels of evolutionary plasticity that has been shaped over long time periods by gene flux and recombination, often between divergent lineages. These evolutionary events are in response to environmental and anthropogenic activities and have led to the rapid emergence and worldwide dissemination of virulent clonal lineages. Moreover, genome analysis of large clinically relevant data sets has improved our understanding of CDI outbreaks, transmission, and recurrence. The epidemiology of CDI has changed dramatically over the last 15 years, and CDI may have a foodborne or zoonotic etiology. The WGS era promises to continue to redefine our view of this significant pathogen. PMID:26085550
The Epigenomic Landscape of Prokaryotes
Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.; ...
2016-02-12
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities ofmore » 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.« less
The Epigenomic Landscape of Prokaryotes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities ofmore » 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.« less
Troyer, Ryan M.; LaPatra, Scott E.; Kurath, Gael
2000-01-01
Infectious haematopoietic necrosis virus (IHNV) is the most significant virus pathogen of salmon and trout in North America. Previous studies have shown relatively low genetic diversity of IHNV within large geographical regions. In this study, the genetic heterogeneity of 84 IHNV isolates sampled from rainbow trout (Oncorhynchus mykiss) over a 20 year period at four aquaculture facilities within a 12 mile stretch of the Snake River in Idaho, USA was investigated. The virus isolates were characterized using an RNase protection assay (RPA) and nucleotide sequence analyses. Among the 84 isolates analysed, 46 RPA haplotypes were found and analyses revealed a high level of genetic heterogeneity relative to that detected in other regions. Sequence analyses revealed up to 7·6% nucleotide divergence, which is the highest level of diversity reported for IHNV to date. Phylogenetic analyses identified four distinct monophyletic clades representing four virus lineages. These lineages were distributed across facilities, and individual facilities contained multiple lineages. These results suggest that co-circulating IHNV lineages of relatively high genetic diversity are present in the IHNV populations in this rainbow trout culture study site. Three of the four lineages exhibited temporal trends consistent with rapid evolution.
Assessing Species Diversity Using Metavirome Data: Methods and Challenges.
Herath, Damayanthi; Jayasundara, Duleepa; Ackland, David; Saeed, Isaam; Tang, Sen-Lin; Halgamuge, Saman
2017-01-01
Assessing biodiversity is an important step in the study of microbial ecology associated with a given environment. Multiple indices have been used to quantify species diversity, which is a key biodiversity measure. Measuring species diversity of viruses in different environments remains a challenge relative to measuring the diversity of other microbial communities. Metagenomics has played an important role in elucidating viral diversity by conducting metavirome studies; however, metavirome data are of high complexity requiring robust data preprocessing and analysis methods. In this review, existing bioinformatics methods for measuring species diversity using metavirome data are categorised broadly as either sequence similarity-dependent methods or sequence similarity-independent methods. The former includes a comparison of DNA fragments or assemblies generated in the experiment against reference databases for quantifying species diversity, whereas estimates from the latter are independent of the knowledge of existing sequence data. Current methods and tools are discussed in detail, including their applications and limitations. Drawbacks of the state-of-the-art method are demonstrated through results from a simulation. In addition, alternative approaches are proposed to overcome the challenges in estimating species diversity measures using metavirome data.
Hu, Jian; Zhang, Xiaoyun; Jiang, Zhilin; Zhang, Feifei; Liu, Yuanyuan; Li, Zhan; Zhang, Zhongkai
2018-04-01
The whitefly Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) is a cryptic species complex and widely distributed throughout tropical and subtropical regions. To understand the B. tabaci cryptic species diversity in China more comprehensively, in the year 2014 and 2016, a large-scale sampling was conducted from the famous biodiversity hotspot of China, Yunnan province. Mitochondrial cytochrome oxidase I gene sequences were used to identify new putative cryptic species. Phylogenetic analyses were performed using Bayesian methods to evaluate the position of new cryptic species in the context of the B. tabaci diversity in Asia. Two new cryptic species, China 5 and Asia V were identified. In total, 19 B. tabaci cryptic species are present in China, two invasive (MED and MEAM1) and 17 indigenous. A new sibling species of B. tabaci was first defined and reported. Based on the mtCOI sequences and haplotype network analyses, the genetic diversity of MED was far higher than MEAM1. We confirmed the exotic MED was originated from the western Mediterranean regions and first invaded into Yunnan, China. The genetic structures of other four indigenous species (Asia I, Asia II 1, Asia II 6, and China 1) with relatively wide distribution ranges in China were also discussed.
Investigation of the bottleneck leading to the domestication of maize
Eyre-Walker, Adam; Gaut, Rebecca L.; Hilton, Holly; Feldman, Dawn L.; Gaut, Brandon S.
1998-01-01
Maize (Zea mays ssp. mays) is genetically diverse, yet it is also morphologically distinct from its wild relatives. These two observations are somewhat contradictory: the first observation is consistent with a large historical population size for maize, but the latter observation is consistent with strong, diversity-limiting selection during maize domestication. In this study, we sampled sequence diversity, coupled with simulations of the coalescent process, to study the dynamics of a population bottleneck during the domestication of maize. To do this, we determined the DNA sequence of a 1,400-bp region of the Adh1 locus from 19 individuals representing maize, its presumed progenitor (Z. mays ssp. parviglumis), and a more distant relative (Zea luxurians). The sequence data were used to guide coalescent simulations of population bottlenecks associated with domestication. Our study confirms high genetic diversity in maize—maize contains 75% of the variation found in its progenitor and is more diverse than its wild relative, Z. luxurians—but it also suggests that sequence diversity in maize can be explained by a bottleneck of short duration and very small size. For example, the breadth of genetic diversity in maize is consistent with a founding population of only 20 individuals when the domestication event is 10 generations in length. PMID:9539756
Merson, Samuel D.; Ouwerkerk, Diane; Gulino, Lisa-Maree; Klieve, Athol; Bonde, Robert K.; Burgess, Elizabeth A.; Lanyon, Janet M.
2014-01-01
The Florida manatee, Trichechus manatus latirostris, is a hindgut-fermenting herbivore. In winter, manatees migrate to warm water overwintering sites where they undergo dietary shifts and may suffer from cold-induced stress. Given these seasonally induced changes in diet, the present study aimed to examine variation in the hindgut bacterial communities of wild manatees overwintering at Crystal River, west Florida. Faeces were sampled from 36 manatees of known sex and body size in early winter when manatees were newly arrived and then in mid-winter and late winter when diet had probably changed and environmental stress may have increased. Concentrations of faecal cortisol metabolite, an indicator of a stress response, were measured by enzyme immunoassay. Using 454-pyrosequencing, 2027 bacterial operational taxonomic units were identified in manatee faeces following amplicon pyrosequencing of the 16S rRNA gene V3/V4 region. Classified sequences were assigned to eight previously described bacterial phyla; only 0.36% of sequences could not be classified to phylum level. Five core phyla were identified in all samples. The majority (96.8%) of sequences were classified as Firmicutes (77.3 ± 11.1% of total sequences) or Bacteroidetes (19.5 ± 10.6%). Alpha-diversity measures trended towards higher diversity of hindgut microbiota in manatees in mid-winter compared to early and late winter. Beta-diversity measures, analysed through permanova, also indicated significant differences in bacterial communities based on the season.
2012-01-01
Background Microbial anaerobic digestion (AD) is used as a waste treatment process to degrade complex organic compounds into methane. The archaeal and bacterial taxa involved in AD are well known, whereas composition of the fungal community in the process has been less studied. The present study aimed to reveal the composition of archaeal, bacterial and fungal communities in response to increasing organic loading in mesophilic and thermophilic AD processes by applying 454 amplicon sequencing technology. Furthermore, a DNA microarray method was evaluated in order to develop a tool for monitoring the microbiological status of AD. Results The 454 sequencing showed that the diversity and number of bacterial taxa decreased with increasing organic load, while archaeal i.e. methanogenic taxa remained more constant. The number and diversity of fungal taxa increased during the process and varied less in composition with process temperature than bacterial and archaeal taxa, even though the fungal diversity increased with temperature as well. Evaluation of the microarray using AD sample DNA showed correlation of signal intensities with sequence read numbers of corresponding target groups. The sensitivity of the test was found to be about 1%. Conclusions The fungal community survives in anoxic conditions and grows with increasing organic loading, suggesting that Fungi may contribute to the digestion by metabolising organic nutrients for bacterial and methanogenic groups. The microarray proof of principle tests suggest that the method has the potential for semiquantitative detection of target microbial groups given that comprehensive sequence data is available for probe design. PMID:22727142
Marine Fungi: Their Ecology and Molecular Diversity
NASA Astrophysics Data System (ADS)
Richards, Thomas A.; Jones, Meredith D. M.; Leonard, Guy; Bass, David
2012-01-01
Fungi appear to be rare in marine environments. There are relatively few marine isolates in culture, and fungal small subunit ribosomal DNA (SSU rDNA) sequences are rarely recovered in marine clone library experiments (i.e., culture-independent sequence surveys of eukaryotic microbial diversity from environmental DNA samples). To explore the diversity of marine fungi, we took a broad selection of SSU rDNA data sets and calculated a summary phylogeny. Bringing these data together identified a diverse collection of marine fungi, including sequences branching close to chytrids (flagellated fungi), filamentous hypha-forming fungi, and multicellular fungi. However, the majority of the sequences branched with ascomycete and basidiomycete yeasts. We discuss evidence for 36 novel marine lineages, the majority and most divergent of which branch with the chytrids. We then investigate what these data mean for the evolutionary history of the Fungi and specifically marine-terrestrial transitions. Finally, we discuss the roles of fungi in marine ecosystems.
Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi
2018-02-02
Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible in the N transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named as N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely Nitrogen Cycle Evaluation (NiCE) chip by using microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene transcriptions in wastewater treatment bioreactors. The NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is a room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
Diversity of virus-host systems in hypersaline Lake Retba, Senegal.
Sime-Ngando, Télesphore; Lucas, Soizick; Robin, Agnès; Tucker, Kimberly Pause; Colombet, Jonathan; Bettarel, Yvan; Desmond, Elie; Gribaldo, Simonetta; Forterre, Patrick; Breitbart, Mya; Prangishvili, David
2011-08-01
Remarkable morphological diversity of virus-like particles was observed by transmission electron microscopy in a hypersaline water sample from Lake Retba, Senegal. The majority of particles morphologically resembled hyperthermophilic archaeal DNA viruses isolated from extreme geothermal environments. Some hypersaline viral morphotypes have not been previously observed in nature, and less than 1% of observed particles had a head-and-tail morphology, which is typical for bacterial DNA viruses. Culture-independent analysis of the microbial diversity in the sample suggested the dominance of extremely halophilic archaea. Few of the 16S sequences corresponded to known archeal genera (Haloquadratum, Halorubrum and Natronomonas), whereas the majority represented novel archaeal clades. Three sequences corresponded to a new basal lineage of the haloarchaea. Bacteria belonged to four major phyla, consistent with the known diversity in saline environments. Metagenomic sequencing of DNA from the purified virus-like particles revealed very few similarities to the NCBI non-redundant database at either the nucleotide or amino acid level. Some of the identifiable virus sequences were most similar to previously described haloarchaeal viruses, but no sequence similarities were found to archaeal viruses from extreme geothermal environments. A large proportion of the sequences had similarity to previously sequenced viral metagenomes from solar salterns. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
A user's guide to quantitative and comparative analysis of metagenomic datasets.
Luo, Chengwei; Rodriguez-R, Luis M; Konstantinidis, Konstantinos T
2013-01-01
Metagenomics has revolutionized microbiological studies during the past decade and provided new insights into the diversity, dynamics, and metabolic potential of natural microbial communities. However, metagenomics still represents a field in development, and standardized tools and approaches to handle and compare metagenomes have not been established yet. An important reason accounting for the latter is the continuous changes in the type of sequencing data available, for example, long versus short sequencing reads. Here, we provide a guide to bioinformatic pipelines developed to accomplish the following tasks, focusing primarily on those developed by our team: (i) assemble a metagenomic dataset; (ii) determine the level of sequence coverage obtained and the amount of sequencing required to obtain complete coverage; (iii) identify the taxonomic affiliation of a metagenomic read or assembled contig; and (iv) determine differentially abundant genes, pathways, and species between different datasets. Most of these pipelines do not depend on the type of sequences available or can be easily adjusted to fit different types of sequences, and are freely available (for instance, through our lab Web site: http://www.enve-omics.gatech.edu/). The limitations of current approaches, as well as the computational aspects that can be further improved, will also be briefly discussed. The work presented here provides practical guidelines on how to perform metagenomic analysis of microbial communities characterized by varied levels of diversity and establishes approaches to handle the resulting data, independent of the sequencing platform employed. © 2013 Elsevier Inc. All rights reserved.
Niira, Kazutaka; Ito, Mika; Masuda, Tsuneyuki; Saitou, Toshiya; Abe, Tadatsugu; Komoto, Satoshi; Sato, Mitsuo; Yamasato, Hiroshi; Kishimoto, Mai; Naoi, Yuki; Sano, Kaori; Tuchiaka, Shinobu; Okada, Takashi; Omatsu, Tsutomu; Furuya, Tetsuya; Aoki, Hiroshi; Katayama, Yukie; Oba, Mami; Shirai, Junsuke; Taniguchi, Koki; Mizutani, Tetsuya; Nagai, Makoto
2016-10-01
Porcine rotavirus C (RVC) is distributed throughout the world and is thought to be a pathogenic agent of diarrhea in piglets. Although, the VP7, VP4, and VP6 gene sequences of Japanese porcine RVCs are currently available, there is no whole-genome sequence data of Japanese RVC. Furthermore, only one to three sequences are available for porcine RVC VP1-VP3 and NSP1-NSP3 genes. Therefore, we determined nearly full-length whole-genome sequences of nine Japanese porcine RVCs from seven piglets with diarrhea and two healthy pigs and compared them with published RVC sequences from a database. The VP7 genes of two Japanese RVCs from healthy pigs were highly divergent from other known RVC strains and were provisionally classified as G12 and G13 based on the 86% nucleotide identity cut-off value. Pairwise sequence identity calculations and phylogenetic analyses revealed that candidate novel genotypes of porcine Japanese RVC were identified in the NSP1, NSP2 and NSP3 encoding genes, respectively. Furthermore, VP3 of Japanese porcine RVCs was shown to be closely related to human RVCs, suggesting a gene reassortment event between porcine and human RVCs and past interspecies transmission. The present study demonstrated that porcine RVCs show greater genetic diversity among strains than human and bovine RVCs. Copyright © 2016 Elsevier B.V. All rights reserved.
Llewellyn, Martin S; Messenger, Louisa A; Luquetti, Alejandro O; Garcia, Lineth; Torrico, Faustino; Tavares, Suelene B N; Cheaib, Bachar; Derome, Nicolas; Delepine, Marc; Baulard, Céline; Deleuze, Jean-Francois; Sauer, Sascha; Miles, Michael A
2015-04-01
Chagas disease results from infection with the diploid protozoan parasite Trypanosoma cruzi. T. cruzi is highly genetically diverse, and multiclonal infections in individual hosts are common, but little studied. In this study, we explore T. cruzi infection multiclonality in the context of age, sex and clinical profile among a cohort of chronic patients, as well as paired congenital cases from Cochabamba, Bolivia and Goias, Brazil using amplicon deep sequencing technology. A 450bp fragment of the trypomastigote TcGP63I surface protease gene was amplified and sequenced across 70 chronic and 22 congenital cases on the Illumina MiSeq platform. In addition, a second, mitochondrial target--ND5--was sequenced across the same cohort of cases. Several million reads were generated, and sequencing read depths were normalized within patient cohorts (Goias chronic, n = 43, Goias congenital n = 2, Bolivia chronic, n = 27; Bolivia congenital, n = 20), Among chronic cases, analyses of variance indicated no clear correlation between intra-host sequence diversity and age, sex or symptoms, while principal coordinate analyses showed no clustering by symptoms between patients. Between congenital pairs, we found evidence for the transmission of multiple sequence types from mother to infant, as well as widespread instances of novel genotypes in infants. Finally, non-synonymous to synonymous (dn:ds) nucleotide substitution ratios among sequences of TcGP63Ia and TcGP63Ib subfamilies within each cohort provided powerful evidence of strong diversifying selection at this locus. Our results shed light on the diversity of parasite DTUs within each patient, as well as the extent to which parasite strains pass between mother and foetus in congenital cases. Although we were unable to find any evidence that parasite diversity accumulates with age in our study cohorts, putative diversifying selection within members of the TcGP63I gene family suggests a link between genetic diversity within this gene family and survival in the mammalian host.
Diversity and phylogenetic relationships among Bartonella strains from Thai bats.
McKee, Clifton D; Kosoy, Michael Y; Bai, Ying; Osikowicz, Lynn M; Franka, Richard; Gilbert, Amy T; Boonmar, Sumalee; Rupprecht, Charles E; Peruski, Leonard F
2017-01-01
Bartonellae are phylogenetically diverse, intracellular bacteria commonly found in mammals. Previous studies have demonstrated that bats have a high prevalence and diversity of Bartonella infections globally. Isolates (n = 42) were obtained from five bat species in four provinces of Thailand and analyzed using sequences of the citrate synthase gene (gltA). Sequences clustered into seven distinct genogroups; four of these genogroups displayed similarity with Bartonella spp. sequences from other bats in Southeast Asia, Africa, and Eastern Europe. Thirty of the isolates representing these seven genogroups were further characterized by sequencing four additional loci (ftsZ, nuoG, rpoB, and ITS) to clarify their evolutionary relationships with other Bartonella species and to assess patterns of diversity among strains. Among the seven genogroups, there were differences in the number of sequence variants, ranging from 1-5, and the amount of nucleotide divergence, ranging from 0.035-3.9%. Overall, these seven genogroups meet the criteria for distinction as novel Bartonella species, with sequence divergence among genogroups ranging from 6.4-15.8%. Evidence of intra- and intercontinental phylogenetic relationships and instances of homologous recombination among Bartonella genogroups in related bat species were found in Thai bats.
Ben Chobba, Ines; Elleuch, Amine; Ayadi, Imen; Khannous, Lamia; Namsi, Ahmed; Cerqueira, Frederique; Drira, Noureddine; Gharsallah, Néji; Vallaeys, Tatiana
2013-01-01
Endophytic flora plays a vital role in the colonization and survival of host plants, especially in harsh environments, such as arid regions. This flora may, however, contain pathogenic species responsible for various troublesome host diseases. The present study is aimed at investigating the diversity of both cultivable and non-cultivable endophytic fungal floras in the internal tissues (roots and leaves) of Tunisian date palm trees (Phoenix dactylifera). Accordingly, 13 isolates from both root and leaf samples, exhibiting distinct colony morphology, were selected from potato dextrose agar (PDA) medium and identified by a sequence match search wherein their 18S–28S internal transcribed spacer (ITS) sequences were compared to those available in public databases. These findings revealed that the cultivable root and leaf isolates fell into two groups, namely Nectriaceae and Pleosporaceae. Additionally, total DNA from palm roots and leaves was further extracted and ITS fragments were amplified. Restriction fragment length polymorphism (RFLP) analysis of the ITS from 200 fungal clones (leaves: 100; roots: 100) using HaeIII restriction enzyme revealed 13 distinct patterns that were further sequenced and led to the identification of Alternaria, Cladosporium, Davidiella (Cladosporium teleomorph), Pythium, Curvularia, and uncharacterized fungal endophytes. Both approaches confirmed that while the roots were predominantly colonized by Fusaria (members of the Nectriaceae family), the leaves were essentially colonized by Alternaria (members of the Pleosporaceae family). Overall, the findings of the present study constitute, to the authors’ knowledge, the first extensive report on the diversity of endophytic fungal flora associated with date palm trees (P. dactylifera). PMID:24302709
Castillo, Daniel; Pérez-Reytor, Diliana; Plaza, Nicolás; Ramírez-Araya, Sebastián; Blondel, Carlos J.; Corsini, Gino; Bastías, Roberto; Loyola, David E.; Jaña, Víctor; Pavez, Leonardo; García, Katherine
2018-01-01
Vibrio parahaemolyticus is the leading cause of seafood-borne gastroenteritis worldwide. As reported in other countries, after the rise and fall of the pandemic strain in Chile, other post-pandemic strains have been associated with clinical cases, including strains lacking the major toxins TDH and TRH. Since the presence or absence of tdh and trh genes has been used for diagnostic purposes and as a proxy of the virulence of V. parahaemolyticus isolates, the understanding of virulence in V. parahaemolyticus strains lacking toxins is essential to detect these strains present in water and marine products to avoid possible food-borne infection. In this study, we characterized the genome of four environmental and two clinical non-toxigenic strains (tdh-, trh-, and T3SS2-). Using whole-genome sequencing, phylogenetic, and comparative genome analysis, we identified the core and pan-genome of V. parahaemolyticus of strains of southern Chile. The phylogenetic tree based on the core genome showed low genetic diversity but the analysis of the pan-genome revealed that all strains harbored genomic islands carrying diverse virulence and fitness factors or prophage-like elements that encode toxins like Zot and RTX. Interestingly, the three strains carrying Zot-like toxin have a different sequence, although the alignment showed some conserved areas with the zot sequence found in V. cholerae. In addition, we identified an unexpected diversity in the genetic architecture of the T3SS1 gene cluster and the presence of the T3SS2 gene cluster in a non-pandemic environmental strain. Our study sheds light on the diversity of V. parahaemolyticus strains from the southern Pacific which increases our current knowledge regarding the global diversity of this organism. PMID:29472910
Expansion of the Preimmune Antibody Repertoire by Junctional Diversity in Bos taurus
Liljavirta, Jenni; Niku, Mikael; Pessa-Morikawa, Tiina; Ekman, Anna; Iivanainen, Antti
2014-01-01
Cattle have a limited range of immunoglobulin genes which are further diversified by antigen independent somatic hypermutation in fetuses. Junctional diversity generated during somatic recombination contributes to antibody diversity but its relative significance has not been comprehensively studied. We have investigated the importance of terminal deoxynucleotidyl transferase (TdT) -mediated junctional diversity to the bovine immunoglobulin repertoire. We also searched for new bovine heavy chain diversity (IGHD) genes as the information of the germline sequences is essential to define the junctional boundaries between gene segments. New heavy chain variable genes (IGHV) were explored to address the gene usage in the fetal recombinations. Our bioinformatics search revealed five new IGHD genes, which included the longest IGHD reported so far, 154 bp. By genomic sequencing we found 26 new IGHV sequences that represent potentially new IGHV genes or allelic variants. Sequence analysis of immunoglobulin heavy chain cDNA libraries of fetal bone marrow, ileum and spleen showed 0 to 36 nontemplated N-nucleotide additions between variable, diversity and joining genes. A maximum of 8 N nucleotides were also identified in the light chains. The junctional base profile was biased towards A and T nucleotide additions (64% in heavy chain VD, 52% in heavy chain DJ and 61% in light chain VJ junctions) in contrast to the high G/C content which is usually observed in mice. Sequence analysis also revealed extensive exonuclease activity, providing additional diversity. B-lymphocyte specific TdT expression was detected in bovine fetal bone marrow by reverse transcription-qPCR and immunofluorescence. These results suggest that TdT-mediated junctional diversity and exonuclease activity contribute significantly to the size of the cattle preimmune antibody repertoire already in the fetal period. PMID:24926997
A HIGH COVERAGE GENOME SEQUENCE FROM AN ARCHAIC DENISOVAN INDIVIDUAL
Meyer, Matthias; Kircher, Martin; Gansauge, Marie-Theres; Li, Heng; Racimo, Fernando; Mallick, Swapan; Schraiber, Joshua G.; Jay, Flora; Prüfer, Kay; de Filippo, Cesare; Sudmant, Peter H.; Alkan, Can; Fu, Qiaomei; Do, Ron; Rohland, Nadin; Tandon, Arti; Siebauer, Michael; Green, Richard E.; Bryc, Katarzyna; Briggs, Adrian W.; Stenzel, Udo; Dabney, Jesse; Shendure, Jay; Kitzman, Jacob; Hammer, Michael F.; Shunkov, Michael V.; Derevianko, Anatoli P.; Patterson, Nick; Andrés, Aida M.; Eichler, Evan E.; Slatkin, Montgomery; Reich, David; Kelso, Janet; Pääbo, Svante
2013-01-01
We present a DNA library preparation method that has allowed us to reconstruct a high coverage (30X) genome sequence of a Denisovan, an extinct relative of Neandertals. The quality of this genome allows a direct estimation of Denisovan heterozygosity indicating that genetic diversity in these archaic hominins was extremely low. It also allows tentative dating of the specimen on the basis of “missing evolution” in its genome, detailed measurements of Denisovan and Neandertal admixture into present-day human populations, and the generation of a near-complete catalog of genetic changes that swept to high frequency in modern humans since their divergence from Denisovans. PMID:22936568
Núñez, Andrés; Amo de Paz, Guillermo; Rastrojo, Alberto; García, Ana M; Alcamí, Antonio; Gutiérrez-Bustillo, A Montserrat; Moreno, Diego A
2016-06-01
The air we breathe contains microscopic biological particles such as viruses, bacteria, fungi and pollen, some of them with relevant clinic importance. These organisms and/or their propagules have been traditionally studied by different disciplines and diverse methodologies like culture and microscopy. These techniques require time, expertise and also have some important biases. As a consequence, our knowledge on the total diversity and the relationships between the different biological entities present in the air is far from being complete. Currently, metagenomics and next-generation sequencing (NGS) may resolve this shortage of information and have been recently applied to metropolitan areas. Although the procedures and methods are not totally standardized yet, the first studies from urban air samples confirm the previous results obtained by culture and microscopy regarding abundance and variation of these biological particles. However, DNA-sequence analyses call into question some preceding ideas and also provide new interesting insights into diversity and their spatial distribution inside the cities. Here, we review the procedures, results and perspectives of the recent works that apply NGS to study the main biological particles present in the air of urban environments. [Int Microbiol 19(2):69-80(2016)]. Copyright© by the Spanish Society for Microbiology and Institute for Catalan Studies.
Information-Theoretic Uncertainty of SCFG-Modeled Folding Space of The Non-coding RNA
Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I.; Malmberg, Russell L.
2012-01-01
RNA secondary structure ensembles define probability distributions for alternative equilibrium secondary structures of an RNA sequence. Shannon’s Entropy is a measure for the amount of diversity present in any ensemble. In this work, Shannon’s entropy of the SCFG ensemble on an RNA sequence is derived and implemented in polynomial time for both structurally ambiguous and unambiguous grammars. Micro RNA sequences generally have low folding entropy, as previously discovered. Surprisingly, signs of significantly high folding entropy were observed in certain ncRNA families. More effective models coupled with targeted randomization tests can lead to a better insight into folding features of these families. PMID:23160142
Futami, K; Valderrama, A; Baldi, M; Minakawa, N; Marín Rodríguez, R; Chaves, L F
2015-04-01
The Asian tiger mosquito, Aedes albopictus (Skuse) (Diptera: Culicidae), is a vector of several human pathogens. Ae. albopictus is also an invasive species that, over recent years, has expanded its range out of its native Asia. Ae. albopictus was suspected to be present in Central America since the 1990s, and its presence was confirmed by most Central American nations by 2010. Recently, this species has been regularly found, yet in low numbers, in limited areas of Panamá and Costa Rica (CR). Here, we report that short sequences (∼558 bp) of the mitochondrial cytochrome oxidase subunit 1 (COI) and NADH dehydrogenase subunit 5 genes of Ae. albopictus, had no haplotype diversity. Instead, there was a common haplotype for each gene in both CR and Panamá. In contrast, a long COI sequence (∼1,390 bp) revealed that haplotype diversity (±SD) was relatively high in CR (0.72±0.04) when compared with Panamá (0.33±0.13), below the global estimate for reported samples (0.89±0.01). The long COI sequence allowed us to identify seven (five new) haplotypes in CR and two (one new) in Panamá. A haplotype network for the long COI gene sequence showed that samples from CR and Panamá belong to a single large group. The long COI gene sequences suggest that haplotypes in Panamá and CR, although similar to each other, had a significant geographic differentiation (Kst=1.33; P<0.001). Thus, most of our results suggest a recent range expansion in CR and Panamá. © The Authors 2015. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Pattaradilokrat, Sittiporn; Trakoolsoontorn, Chawinya; Simpalipan, Phumin; Warrit, Natapot; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai
2018-01-22
The glutamate-rich protein (GLURP) of the malaria parasite Plasmodium falciparum is a key surface antigen that serves as a component of a clinical vaccine. Moreover, the GLURP gene is also employed routinely as a genetic marker for malarial genotyping in epidemiological studies. While extensive size polymorphisms in GLURP are well recorded, the extent of the sequence diversity of this gene is rarely investigated. The present study aimed to explore the genetic diversity of GLURP in natural populations of P. falciparum. The polymorphic C-terminal repetitive R2 region of GLURP sequences from 65 P. falciparum isolates in Thailand were generated and combined with the data from 103 worldwide isolates to generate a GLURP database. The collection was comprised of 168 alleles, encoding 105 unique GLURP subtypes, characterized by 18 types of amino acid repeat units (AAU). Of these, 28 GLURP subtypes, formed by 10 AAU types, were detected in P. falciparum in Thailand. Among them, 19 GLURP subtypes and 2 AAU types are described for the first time in the Thai parasite population. The AAU sequences were highly conserved, which is likely due to negative selection. Standard Fst analysis revealed the shared distributions of GLURP types among the P. falciparum populations, providing evidence of gene flow among the different demographic populations. Sequence diversity causing size variations in GLURP in Thai P. falciparum populations were detected, and caused by non-synonymous substitutions in repeat units and some insertion/deletion of aspartic acid or glutamic acid codons between repeat units. The P. falciparum population structure based on GLURP showed promising implications for the development of GLURP-based vaccines and for monitoring vaccine efficacy.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guibert, Lilian M.; Loviso, Claudia L.; Borglin, Sharon
We aimed to gain insight into the alkane degradation potential of microbial communities from chronically polluted sediments of a subantarctic coastal environment using a combination of metagenomic approaches. A total of 6178 sequences annotated as alkane-1-monooxygenases (EC 1.14.15.3) were retrieved from a shotgun metagenomic dataset that included two sites analyzed in triplicate. The majority of the sequences binned with AlkB described in Bacteroidetes (32 ± 13 %) or Proteobacteria (29 ± 7 %), although a large proportion remained unclassified at the phylum level. Operational taxonomic unit (OTU)-based analyses showed small differences in AlkB distribution among samples that could be correlatedmore » with alkane concentrations, as well as with site-specific variations in pH and salinity. A number of low-abundance OTUs, mostly affiliated with Actinobacterial sequences, were found to be only present in the most contaminated samples. On the other hand, the molecular screening of a large-insert metagenomic library of intertidal sediments from one of the sampling sites identified two genomic fragments containing novel alkB gene sequences, as well as various contiguous genes related to lipid metabolism. Both genomic fragments were affiliated with the phylum Planctomycetes, and one could be further assigned to the genus Rhodopirellula due to the presence of a partial sequence of the 23S ribosomal RNA (rRNA) gene. This work highlights the diversity of bacterial groups contributing to the alkane degradation potential and reveals patterns of functional diversity in relation with environmental stressors in a chronically polluted, high-latitude coastal environment. In addition, alkane biodegradation genes are described for the first time in members of Planctomycetes.« less
Al-Jarbou, Ahmed Nasser
2012-01-01
Bacterial pathogenesis presents an astounding arsenal of virulence factors that allow them to conquer many different niches throughout the course of infection. Principally fascinating is the fact that some bacterial species are able to induce different diseases by expression of different combinations of virulence factors. Nevertheless, studies aiming at screening for the presence of bacteriophages in humans have been limited. Such screening procedures would eventually lead to identification of phage-encoded properties that impart increased bacterial fitness and/or virulence in a particular niche, and hence, would potentially be used to reverse the course of bacterial infections. As the human oral cavity represents a rich and dynamic ecosystem for several upper respiratory tract pathogens. However, little is known about virus diversity in human dental plaque which is an important reservoir. We applied the culture-independent approach to characterize virus diversity in human dental plaque making a library from a virus DNA fraction amplified using a multiple displacement method and sequenced 80 clones. The resulting sequence showed 44% significant identities to GenBank databases by TBLASTX analysis. TBLAST homology comparisons showed that 66% was viral; 18% eukarya; 10% bacterial; 6% mobile elements. These sequences were sorted into 6 contigs and 45 single sequences in which 4 contigs and a single sequence showed significant identity to a small region of a putative prophage in the Corynebacterium diphtheria genome. These findings interestingly highlight the uniqueness of over half of the sequences, whilst the dominance of a pathogen-specific prophage sequences imply their role in virulence.
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.
Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N
2014-07-01
Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Wu, Nicholas C; Xie, Jia; Zheng, Tianqing; Nycholat, Corwin M; Grande, Geramie; Paulson, James C; Lerner, Richard A; Wilson, Ian A
2017-06-14
Influenza A virus hemagglutinin (HA) initiates viral entry by engaging host receptor sialylated glycans via its receptor-binding site (RBS). The amino acid sequence of the RBS naturally varies across avian and human influenza virus subtypes and is also evolvable. However, functional sequence diversity in the RBS has not been fully explored. Here, we performed a large-scale mutational analysis of the RBS of A/WSN/33 (H1N1) and A/Hong Kong/1/1968 (H3N2) HAs. Many replication-competent mutants not yet observed in nature were identified, including some that could escape from an RBS-targeted broadly neutralizing antibody. This functional sequence diversity is made possible by pervasive epistasis in the RBS 220-loop and can be buffered by avidity in viral receptor binding. Overall, our study reveals that the HA RBS can accommodate a much greater range of sequence diversity than previously thought, which has significant implications for the complex evolutionary interrelationships between receptor specificity and immune escape. Copyright © 2017 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Bacterial spot of tomato (BST) is a major constraint to tomato production in Ethiopia and many other countries leading to significant crop losses. In the present study, using pathogenicity tests, sensitivity to copper and streptomycin, and multilocus sequence analysis, a diverse group of Xanthomonas...
USDA-ARS?s Scientific Manuscript database
Switchgrass (Panicum virgatum L.) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuels feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype composed of tetraploid and octoploid accessions, and a so...
Genomics of peanut leaf-spot pathogens; and RNA-interference-mediated control of aflatoxins
USDA-ARS?s Scientific Manuscript database
An overview update of the research done at USDA-ARS National Peanut Research Laboratory will be presented: including: the release of the Cercospora arachidicola genome, sequencing of Cercosporidium personatum, a workflow to study genetic diversity of aflatoxigenic Aspergillus, and progress on the us...
Substrates of Peltigera Lichens as a Potential Source of Cyanobionts.
Zúñiga, Catalina; Leiva, Diego; Carú, Margarita; Orlando, Julieta
2017-10-01
Photobiont availability is one of the main factors determining the success of the lichenization process. Although multiple sources of photobionts have been proposed, there is no substantial evidence confirming that the substrates on which lichens grow are one of them. In this work, we obtained cyanobacterial 16S ribosomal RNA gene sequences from the substrates underlying 186 terricolous Peltigera cyanolichens from localities in Southern Chile and maritime Antarctica and compared them with the sequences of the cyanobionts of these lichens, in order to determine if cyanobacteria potentially available for lichenization were present in the substrates. A phylogenetic analysis of the sequences showed that Nostoc phylotypes dominated the cyanobacterial communities of the substrates in all sites. Among them, an overlap was observed between the phylotypes of the lichen cyanobionts and those of the cyanobacteria present in their substrates, suggesting that they could be a possible source of lichen photobionts. Also, in most cases, higher Nostoc diversity was observed in the lichens than in the substrates from each site. A better understanding of cyanobacterial diversity in lichen substrates and their relatives in the lichens would bring insights into mycobiont selection and the distribution patterns of lichens, providing a background for hypothesis testing and theory development for future studies of the lichenization process.
Hughes, Joseph; Biek, Roman; Litster, Annette; Willett, Brian J.; Hosie, Margaret J.
2015-01-01
Analysing the evolution of feline immunodeficiency virus (FIV) at the intra-host level is important in order to address whether the diversity and composition of viral quasispecies affect disease progression. We examined the intra-host diversity and the evolutionary rates of the entire env and structural fragments of the env sequences obtained from sequential blood samples in 43 naturally infected domestic cats that displayed different clinical outcomes. We observed in the majority of cats that FIV env showed very low levels of intra-host diversity. We estimated that env evolved at a rate of 1.16×10−3 substitutions per site per year and demonstrated that recombinant sequences evolved faster than non-recombinant sequences. It was evident that the V3–V5 fragment of FIV env displayed higher evolutionary rates in healthy cats than in those with terminal illness. Our study provided the first evidence that the leader sequence of env, rather than the V3–V5 sequence, had the highest intra-host diversity and the highest evolutionary rate of all env fragments, consistent with this region being under a strong selective pressure for genetic variation. Overall, FIV env displayed relatively low intra-host diversity and evolved slowly in naturally infected cats. The maximum evolutionary rate was observed in the leader sequence of env. Although genetic stability is not necessarily a prerequisite for clinical stability, the higher genetic stability of FIV compared with human immunodeficiency virus might explain why many naturally infected cats do not progress rapidly to AIDS. PMID:25535323
Nanba, K.; King, G. M.; Dunfield, K.
2004-01-01
A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass. PMID:15066819
Nanba, K; King, G M; Dunfield, K
2004-04-01
A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass.
Pryde, S E; Richardson, A J; Stewart, C S; Flint, H J
1999-12-01
Random clones of 16S ribosomal DNA gene sequences were isolated after PCR amplification with eubacterial primers from total genomic DNA recovered from samples of the colonic lumen, colonic wall, and cecal lumen from a pig. Sequences were also obtained for cultures isolated anaerobically from the same colonic-wall sample. Phylogenetic analysis showed that many sequences were related to those of Lactobacillus or Streptococcus spp. or fell into clusters IX, XIVa, and XI of gram-positive bacteria. In addition, 59% of randomly cloned sequences showed less than 95% similarity to database entries or sequences from cultivated organisms. Cultivation bias is also suggested by the fact that the majority of isolates (54%) recovered from the colon wall by culturing were related to Lactobacillus and Streptococcus, whereas this group accounted for only one-third of the sequence variation for the same sample from random cloning. The remaining cultured isolates were mainly Selenomonas related. A higher proportion of Lactobacillus reuteri-related sequences than of Lactobacillus acidophilus- and Lactobacillus amylovorus-related sequences were present in the colonic-wall sample. Since the majority of bacterial ribosomal sequences recovered from the colon wall are less than 95% related to known organisms, the roles of many of the predominant wall-associated bacteria remain to be defined.
Pryde, Susan E.; Richardson, Anthony J.; Stewart, Colin S.; Flint, Harry J.
1999-01-01
Random clones of 16S ribosomal DNA gene sequences were isolated after PCR amplification with eubacterial primers from total genomic DNA recovered from samples of the colonic lumen, colonic wall, and cecal lumen from a pig. Sequences were also obtained for cultures isolated anaerobically from the same colonic-wall sample. Phylogenetic analysis showed that many sequences were related to those of Lactobacillus or Streptococcus spp. or fell into clusters IX, XIVa, and XI of gram-positive bacteria. In addition, 59% of randomly cloned sequences showed less than 95% similarity to database entries or sequences from cultivated organisms. Cultivation bias is also suggested by the fact that the majority of isolates (54%) recovered from the colon wall by culturing were related to Lactobacillus and Streptococcus, whereas this group accounted for only one-third of the sequence variation for the same sample from random cloning. The remaining cultured isolates were mainly Selenomonas related. A higher proportion of Lactobacillus reuteri-related sequences than of Lactobacillus acidophilus- and Lactobacillus amylovorus-related sequences were present in the colonic-wall sample. Since the majority of bacterial ribosomal sequences recovered from the colon wall are less than 95% related to known organisms, the roles of many of the predominant wall-associated bacteria remain to be defined. PMID:10583991
Hernández-Martínez, Miguel Ángel; Escalante, Ananías A.; Arévalo-Herrera, Myriam; Herrera, Sócrates
2011-01-01
Circumsporozoite (CS) protein is a malaria antigen involved in sporozoite invasion of hepatocytes, and thus considered to have good vaccine potential. We evaluated the polymorphism of the Plasmodium vivax CS gene in 24 parasite isolates collected from malaria-endemic areas of Colombia. We sequenced 27 alleles, most of which (25/27) corresponded to the VK247 genotype and the remainder to the VK210 type. All VK247 alleles presented a mutation (Gly → Asn) at position 28 in the N-terminal region, whereas the C-terminal presented three insertions: the ANKKAGDAG, which is common in all VK247 isolates; 12 alleles presented the insertion GAGGQAAGGNAANKKAGDAG; and 5 alleles presented the insertion GGNAGGNA. Both repeat regions were polymorphic in gene sequence and size. Sequences coding for B-, T-CD4+, and T-CD8+ cell epitopes were found to be conserved. This study confirms the high polymorphism of the repeat domain and the highly conserved nature of the flanking regions. PMID:21292878
NASA Technical Reports Server (NTRS)
Nakayama, S.; Kretsinger, R. H.
1993-01-01
In the first report in this series we presented dendrograms based on 152 individual proteins of the EF-hand family. In the second we used sequences from 228 proteins, containing 835 domains, and showed that eight of the 29 subfamilies are congruent and that the EF-hand domains of the remaining 21 subfamilies have diverse evolutionary histories. In this study we have computed dendrograms within and among the EF-hand subfamilies using the encoding DNA sequences. In most instances the dendrograms based on protein and on DNA sequences are very similar. Significant differences between protein and DNA trees for calmodulin remain unexplained. In our fourth report we evaluate the sequences and the distribution of introns within the EF-hand family and conclude that exon shuffling did not play a significant role in its evolution.
Li, Yuanyuan; Chen, Longqian; Wen, Hongyu; Zhou, Tianjian; Zhang, Ting; Gao, Xiali
2014-03-28
Significant alteration in the microbial community can occur across reclamation areas suffering subsidence from mining. A reclamation site undergoing fertilization practices and an adjacent coal-excavated subsidence site (sites A and B, respectively) were examined to characterize the bacterial diversity using 454 high-throughput 16S rDNA sequencing. The dominant taxonomic groups in both the sites were Proteobacteria, Acidobacteria, Bacteroidetes, Betaproteobacteria, Actinobacteria, Gammaproteobacteria, Alphaproteobacteria, Deltaproteobacteria, Chloroflexi, and Firmicutes. However, the bacterial communities' abundance, diversity, and composition differed significantly between the sites. Site A presented higher bacterial diversity and more complex community structures than site B. The majority of sequences related to Proteobacteria, Gemmatimonadetes, Chloroflexi, Nitrospirae, Firmicutes, Betaproteobacteria, Deltaproteobacteria, and Anaerolineae were from site A; whereas those related to Actinobacteria, Planctomycetes, Bacteroidetes, Verrucomicrobia, Gammaproteobacteria, Nitriliruptoria, Alphaproteobacteria, and Phycisphaerae originated from site B. The distribution of some bacterial groups and subgroups in the two sites correlated with soil properties and vegetation due to reclamation practice. Site A exhibited enriched bacterial community, soil organic matter (SOM), and total nitrogen (TN), suggesting the presence of relatively diverse microorganisms. SOM and TN were important factors shaping the underlying microbial communities. Furthermore, the specific plant functional group (legumes) was also an important factor influencing soil microbial community composition. Thus, the effectiveness of 454 pyrosequencing in analyzing soil bacterial diversity was validated and an association between land ecological system restoration, mostly mediated by microbial communities, and an improvement in soil properties in coalmining reclamation areas was suggested.
McConnell, Sean C.; Hernandez, Kyle M.; Wcisel, Dustin J.; Kettleborough, Ross N.; Stemple, Derek L.; Andrade, Jorge; de Jong, Jill L. O.
2016-01-01
Antigen processing and presentation genes found within the MHC are among the most highly polymorphic genes of vertebrate genomes, providing populations with diverse immune responses to a wide array of pathogens. Here, we describe transcriptome, exome, and whole-genome sequencing of clonal zebrafish, uncovering the most extensive diversity within the antigen processing and presentation genes of any species yet examined. Our CG2 clonal zebrafish assembly provides genomic context within a remarkably divergent haplotype of the core MHC region on chromosome 19 for six expressed genes not found in the zebrafish reference genome: mhc1uga, proteasome-β 9b (psmb9b), psmb8f, and previously unknown genes psmb13b, tap2d, and tap2e. We identify ancient lineages for Psmb13 within a proteasome branch previously thought to be monomorphic and provide evidence of substantial lineage diversity within each of three major trifurcations of catalytic-type proteasome subunits in vertebrates: Psmb5/Psmb8/Psmb11, Psmb6/Psmb9/Psmb12, and Psmb7/Psmb10/Psmb13. Strikingly, nearby tap2 and MHC class I genes also retain ancient sequence lineages, indicating that alternative lineages may have been preserved throughout the entire MHC pathway since early diversification of the adaptive immune system ∼500 Mya. Furthermore, polymorphisms within the three MHC pathway steps (antigen cleavage, transport, and presentation) are each predicted to alter peptide specificity. Lastly, comparative analysis shows that antigen processing gene diversity is far more extensive than previously realized (with ancient coelacanth psmb8 lineages, shark psmb13, and tap2t and psmb10 outside the teleost MHC), implying distinct immune functions and conserved roles in shaping MHC pathway evolution throughout vertebrates. PMID:27493218
Genetic Diversity among Clostridium botulinum Strains Harboring bont/A2 and bont/A3 Genes
Raphael, Brian H.; Joseph, Lavin A.; Meno, Sarah R.; Fernández, Rafael A.; Maslanka, Susan E.
2012-01-01
Clostridium botulinum type A strains are known to be genetically diverse and widespread throughout the world. Genetic diversity studies have focused mainly on strains harboring one type A botulinum toxin gene, bont/A1, although all reported bont/A gene variants have been associated with botulism cases. Our study provides insight into the genetic diversity of C. botulinum type A strains, which contain bont/A2 (n = 42) and bont/A3 (n = 4) genes, isolated from diverse samples and geographic origins. Genetic diversity was assessed by using bont nucleotide sequencing, content analysis of the bont gene clusters, multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). Sequences of bont genes obtained in this study showed 99.9 to 100% identity with other bont/A2 or bont/A3 gene sequences available in public databases. The neurotoxin gene clusters of the subtype A2 and A3 strains analyzed in this study were similar in gene content. C. botulinum strains harboring bont/A2 and bont/A3 genes were divided into six and two MLST profiles, respectively. Four groups of strains shared a similarity of at least 95% by PFGE; the largest group included 21 out of 46 strains. The strains analyzed in this study showed relatively limited genetic diversity using either MLST or PFGE. PMID:23042179
Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip
2018-05-06
The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Lasserre, Moira; Fresia, Pablo; Greif, Gonzalo; Iraola, Gregorio; Castro-Ramos, Miguel; Juambeltz, Arturo; Nuñez, Álvaro; Naya, Hugo; Robello, Carlos; Berná, Luisa
2018-01-02
Bovine tuberculosis (bTB) poses serious risks to animal welfare and economy, as well as to public health as a zoonosis. Its etiological agent, Mycobacterium bovis, belongs to the Mycobacterium tuberculosis complex (MTBC), a group of genetically monomorphic organisms featured by a remarkably high overall nucleotide identity (99.9%). Indeed, this characteristic is of major concern for correct typing and determination of strain-specific traits based on sequence diversity. Due to its historical economic dependence on cattle production, Uruguay is deeply affected by the prevailing incidence of Mycobacterium bovis. With the world's highest number of cattle per human, and its intensive cattle production, Uruguay represents a particularly suited setting to evaluate genomic variability among isolates, and the diversity traits associated to this pathogen. We compared 186 genomes from MTBC strains isolated worldwide, and found a highly structured population in M. bovis. The analysis of 23 new M. bovis genomes, belonging to strains isolated in Uruguay evidenced three groups present in the country. Despite presenting an expected highly conserved genomic structure and sequence, these strains segregate into a clustered manner within the worldwide phylogeny. Analysis of the non-pe/ppe differential areas against a reference genome defined four main sources of variability, namely: regions of difference (RD), variable genes, duplications and novel genes. RDs and variant analysis segregated the strains into clusters that are concordant with their spoligotype identities. Due to its high homoplasy rate, spoligotyping failed to reflect the true genomic diversity among worldwide representative strains, however, it remains a good indicator for closely related populations. This study introduces a comprehensive population structure analysis of worldwide M. bovis isolates. The incorporation and analysis of 23 novel Uruguayan M. bovis genomes, sheds light onto the genomic diversity of this pathogen, evidencing the existence of greater genetic variability among strains than previously contemplated.
Belda, Eugeni; Pedrola, Laia; Peretó, Juli; Martínez-Blanch, Juan F.; Montagud, Arnau; Navarro, Emilio; Urchueguía, Javier; Ramón, Daniel; Moya, Andrés; Porcar, Manuel
2011-01-01
Background Insects are associated with microorganisms that contribute to the digestion and processing of nutrients. The European Corn Borer (ECB) is a moth present world-wide, causing severe economical damage as a pest on corn and other crops. In the present work, we give a detailed view of the complexity of the microorganisms forming the ECB midgut microbiota with the objective of comparing the biodiversity of the midgut-associated microbiota and explore their potential as a source of genes and enzymes with biotechnological applications. Methodological/Principal Findings A high-throughput sequencing approach has been used to identify bacterial species, genes and metabolic pathways, particularly those involved in plant-matter degradation, in two different ECB populations (field-collected vs. lab-reared population with artificial diet). Analysis of the resulting sequences revealed the massive presence of Staphylococcus warneri and Weissella paramesenteroides in the lab-reared sample. This enabled us to reconstruct both genomes almost completely. Despite the apparently low diversity, 208 different genera were detected in the sample, although most of them at very low frequency. By contrast, the natural population exhibited an even higher taxonomic diversity along with a wider array of cellulolytic enzyme families. However, in spite of the differences in relative abundance of major taxonomic groups, not only did both metagenomes share a similar functional profile but also a similar distribution of non-redundant genes in different functional categories. Conclusions/Significance Our results reveal a highly diverse pool of bacterial species in both O. nubilalis populations, with major differences: The lab-reared sample is rich in gram-positive species (two of which have almost fully sequenced genomes) while the field sample harbors mainly gram-negative species and has a larger set of cellulolytic enzymes. We have found a clear relationship between the diet and the midgut microbiota, which reveals the selection pressure of food on the community of intestinal bacteria. PMID:21738787
Visualization of Genome Diversity in German Shepherd Dogs.
Mortlock, Sally-Anne; Booth, Rachel; Mazrier, Hamutal; Khatkar, Mehar S; Williamson, Peter
2015-01-01
A loss of genetic diversity may lead to increased disease risks in subpopulations of dogs. The canine breed structure has contributed to relatively small effective population size in many breeds and can limit the options for selective breeding strategies to maintain diversity. With the completion of the canine genome sequencing project, and the subsequent reduction in the cost of genotyping on a genomic scale, evaluating diversity in dogs has become much more accurate and accessible. This provides a potential tool for advising dog breeders and developing breeding programs within a breed. A challenge in doing this is to present complex relationship data in a form that can be readily utilized. Here, we demonstrate the use of a pipeline, known as NetView, to visualize the network of relationships in a subpopulation of German Shepherd Dogs.
Ling, Shaoping; Hu, Zheng; Yang, Zuyu; Yang, Fang; Li, Yawei; Lin, Pei; Chen, Ke; Dong, Lili; Cao, Lihua; Tao, Yong; Hao, Lingtong; Chen, Qingjian; Gong, Qiang; Wu, Dafei; Li, Wenjie; Zhao, Wenming; Tian, Xiuyun; Hao, Chunyi; Hungate, Eric A; Catenacci, Daniel V T; Hudson, Richard R; Li, Wen-Hsiung; Lu, Xuemei; Wu, Chung-I
2015-11-24
The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 "polymorphic" SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors.
Wessels, Jocelyn M.; Lajoie, Julie; Vitali, Danielle; Omollo, Kenneth; Kimani, Joshua; Oyugi, Julius; Cheruiyot, Juliana; Kimani, Makubo; Mungai, John N.; Akolo, Maureen; Stearns, Jennifer C.; Surette, Michael G.; Fowke, Keith R.
2017-01-01
Objective To compare the vaginal microbiota of women engaged in high-risk sexual behaviour (sex work) with women who are not engaged in high-risk sexual behaviour. Diverse vaginal microbiota, low in Lactobacillus species, like those in bacterial vaginosis (BV), are associated with increased prevalence of sexually transmitted infections (STIs) and human immunodeficiency virus (HIV) acquisition. Although high-risk sexual behaviour increases risk for STIs, the vaginal microbiota of sex workers is understudied. Methods A retrospective cross-sectional study was conducted comparing vaginal microbiota of women who are not engaged in sex work (non-sex worker controls, NSW, N = 19) and women engaged in sex work (female sex workers, FSW, N = 48), using Illumina sequencing (16S rRNA, V3 region). Results Bacterial richness and diversity were significantly less in controls, than FSW. Controls were more likely to have Lactobacillus as the most abundant genus (58% vs. 17%; P = 0.002) and composition of their vaginal microbiota differed from FSW (PERMANOVA, P = 0.001). Six microbiota clusters were detected, including a high diversity cluster with three sub-clusters, and 55% of women with low Nugent Scores fell within this cluster. High diversity was observed by 16S sequencing in FSW, regardless of Nugent Scores, suggesting that Nugent Score may not be capable of capturing the diversity present in the FSW vaginal microbiota. Conclusions High-risk sexual behaviour is associated with diversity of the vaginal microbiota and lack of Lactobacillus. These factors could contribute to increased risk of STIs and HIV in women engaged in high-risk sexual behaviour. PMID:29095928
Wessels, Jocelyn M; Lajoie, Julie; Vitali, Danielle; Omollo, Kenneth; Kimani, Joshua; Oyugi, Julius; Cheruiyot, Juliana; Kimani, Makubo; Mungai, John N; Akolo, Maureen; Stearns, Jennifer C; Surette, Michael G; Fowke, Keith R; Kaushic, Charu
2017-01-01
To compare the vaginal microbiota of women engaged in high-risk sexual behaviour (sex work) with women who are not engaged in high-risk sexual behaviour. Diverse vaginal microbiota, low in Lactobacillus species, like those in bacterial vaginosis (BV), are associated with increased prevalence of sexually transmitted infections (STIs) and human immunodeficiency virus (HIV) acquisition. Although high-risk sexual behaviour increases risk for STIs, the vaginal microbiota of sex workers is understudied. A retrospective cross-sectional study was conducted comparing vaginal microbiota of women who are not engaged in sex work (non-sex worker controls, NSW, N = 19) and women engaged in sex work (female sex workers, FSW, N = 48), using Illumina sequencing (16S rRNA, V3 region). Bacterial richness and diversity were significantly less in controls, than FSW. Controls were more likely to have Lactobacillus as the most abundant genus (58% vs. 17%; P = 0.002) and composition of their vaginal microbiota differed from FSW (PERMANOVA, P = 0.001). Six microbiota clusters were detected, including a high diversity cluster with three sub-clusters, and 55% of women with low Nugent Scores fell within this cluster. High diversity was observed by 16S sequencing in FSW, regardless of Nugent Scores, suggesting that Nugent Score may not be capable of capturing the diversity present in the FSW vaginal microbiota. High-risk sexual behaviour is associated with diversity of the vaginal microbiota and lack of Lactobacillus. These factors could contribute to increased risk of STIs and HIV in women engaged in high-risk sexual behaviour.
Ling, Shaoping; Hu, Zheng; Yang, Zuyu; Yang, Fang; Li, Yawei; Lin, Pei; Chen, Ke; Dong, Lili; Cao, Lihua; Tao, Yong; Hao, Lingtong; Chen, Qingjian; Gong, Qiang; Wu, Dafei; Li, Wenjie; Zhao, Wenming; Tian, Xiuyun; Hao, Chunyi; Hungate, Eric A.; Catenacci, Daniel V. T.; Hudson, Richard R.; Li, Wen-Hsiung; Lu, Xuemei; Wu, Chung-I
2015-01-01
The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 “polymorphic” SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors. PMID:26561581
Yokoyama, Naoaki; Sivakumar, Thillaiampalam; Tuvshintulga, Bumduuren; Hayashida, Kyoko; Igarashi, Ikuo; Inoue, Noboru; Long, Phung Thang; Lan, Dinh Thi Bich
2015-03-01
The genes that encode merozoite surface antigens (MSAs) in Babesia bovis are genetically diverse. In this study, we analyzed the genetic diversity of B. bovis MSA-1, MSA-2b, and MSA-2c genes in Vietnamese cattle and water buffaloes. Blood DNA samples from 258 cattle and 49 water buffaloes reared in the Thua Thien Hue province of Vietnam were screened with a B. bovis-specific diagnostic PCR assay. The B. bovis-positive DNA samples (23 cattle and 16 water buffaloes) were then subjected to PCR assays to amplify the MSA-1, MSA-2b, and MSA-2c genes. Sequencing analyses showed that the Vietnamese MSA-1 and MSA-2b sequences are genetically diverse, whereas MSA-2c is relatively conserved. The nucleotide identity values for these MSA gene sequences were similar in the cattle and water buffaloes. Consistent with the sequencing data, the Vietnamese MSA-1 and MSA-2b sequences were dispersed across several clades in the corresponding phylogenetic trees, whereas the MSA-2c sequences occurred in a single clade. Cattle- and water-buffalo-derived sequences also often clustered together on the phylogenetic trees. The Vietnamese MSA-1, MSA-2b, and MSA-2c sequences were then screened for recombination with automated methods. Of the seven recombination events detected, five and two were associated with the MSA-2b and MSA-2c recombinant sequences, respectively, whereas no MSA-1 recombinants were detected among the sequences analyzed. Recombination between the sequences derived from cattle and water buffaloes was very common, and the resultant recombinant sequences were found in both host animals. These data indicate that the genetic diversity of the MSA sequences does not differ between cattle and water buffaloes in Vietnam. They also suggest that recombination between the B. bovis MSA sequences in both cattle and water buffaloes might contribute to the genetic variation in these genes in Vietnam. Copyright © 2015 Elsevier B.V. All rights reserved.
Sánchez-Sevilla, José F.; Horvath, Aniko; Botella, Miguel A.; Gaston, Amèlia; Folta, Kevin; Kilian, Andrzej; Denoyes, Beatrice; Amaya, Iraida
2015-01-01
Cultivated strawberry (Fragaria × ananassa) is a genetically complex allo-octoploid crop with 28 pairs of chromosomes (2n = 8x = 56) for which a genome sequence is not yet available. The diploid Fragaria vesca is considered the donor species of one of the octoploid sub-genomes and its available genome sequence can be used as a reference for genomic studies. A wide number of strawberry cultivars are stored in ex situ germplasm collections world-wide but a number of previous studies have addressed the genetic diversity present within a limited number of these collections. Here, we report the development and application of two platforms based on the implementation of Diversity Array Technology (DArT) markers for high-throughput genotyping in strawberry. The first DArT microarray was used to evaluate the genetic diversity of 62 strawberry cultivars that represent a wide range of variation based on phenotype, geographical and temporal origin and pedigrees. A total of 603 DArT markers were used to evaluate the diversity and structure of the population and their cluster analyses revealed that these markers were highly efficient in classifying the accessions in groups based on historical, geographical and pedigree-based cues. The second DArTseq platform took benefit of the complexity reduction method optimized for strawberry and the development of next generation sequencing technologies. The strawberry DArTseq was used to generate a total of 9,386 SNP markers in the previously developed ‘232’ × ‘1392’ mapping population, of which, 4,242 high quality markers were further selected to saturate this map after several filtering steps. The high-throughput platforms here developed for genotyping strawberry will facilitate genome-wide characterizations of large accessions sets and complement other available options. PMID:26675207
Bacteriophages of Gordonia spp. Display a Spectrum of Diversity and Genetic Relationships.
Pope, Welkin H; Mavrich, Travis N; Garlena, Rebecca A; Guerrero-Bustamante, Carlos A; Jacobs-Sera, Deborah; Montgomery, Matthew T; Russell, Daniel A; Warner, Marcie H; Hatfull, Graham F
2017-08-15
The global bacteriophage population is large, dynamic, old, and highly diverse genetically. Many phages are tailed and contain double-stranded DNA, but these remain poorly characterized genomically. A collection of over 1,000 phages infecting Mycobacterium smegmatis reveals the diversity of phages of a common bacterial host, but their relationships to phages of phylogenetically proximal hosts are not known. Comparative sequence analysis of 79 phages isolated on Gordonia shows these also to be diverse and that the phages can be grouped into 14 clusters of related genomes, with an additional 14 phages that are "singletons" with no closely related genomes. One group of six phages is closely related to Cluster A mycobacteriophages, but the other Gordonia phages are distant relatives and share only 10% of their genes with the mycobacteriophages. The Gordonia phage genomes vary in genome length (17.1 to 103.4 kb), percentage of GC content (47 to 68.8%), and genome architecture and contain a variety of features not seen in other phage genomes. Like the mycobacteriophages, the highly mosaic Gordonia phages demonstrate a spectrum of genetic relationships. We show this is a general property of bacteriophages and suggest that any barriers to genetic exchange are soft and readily violable. IMPORTANCE Despite the numerical dominance of bacteriophages in the biosphere, there is a dearth of complete genomic sequences. Current genomic information reveals that phages are highly diverse genomically and have mosaic architectures formed by extensive horizontal genetic exchange. Comparative analysis of 79 phages of Gordonia shows them to not only be highly diverse, but to present a spectrum of relatedness. Most are distantly related to phages of the phylogenetically proximal host Mycobacterium smegmatis , although one group of Gordonia phages is more closely related to mycobacteriophages than to the other Gordonia phages. Phage genome sequence space remains largely unexplored, but further isolation and genomic comparison of phages targeted at related groups of hosts promise to reveal pathways of bacteriophage evolution. Copyright © 2017 Pope et al.
Ubiquity and Diversity of Heterotrophic Bacterial nasA Genes in Diverse Marine Environments
Jiang, Xuexia; Dang, Hongyue; Jiao, Nianzhi
2015-01-01
Nitrate uptake by heterotrophic bacteria plays an important role in marine N cycling. However, few studies have investigated the diversity of environmental nitrate assimilating bacteria (NAB). In this study, the diversity and biogeographical distribution of NAB in several global oceans and particularly in the western Pacific marginal seas were investigated using both cultivation and culture-independent molecular approaches. Phylogenetic analyses based on 16S rRNA and nasA (encoding the large subunit of the assimilatory nitrate reductase) gene sequences indicated that the cultivable NAB in South China Sea belonged to the α-Proteobacteria, γ-Proteobacteria and CFB (Cytophaga-Flavobacteria-Bacteroides) bacterial groups. In all the environmental samples of the present study, α-Proteobacteria, γ-Proteobacteria and Bacteroidetes were found to be the dominant nasA-harboring bacteria. Almost all of the α-Proteobacteria OTUs were classified into three Roseobacter-like groups (I to III). Clone library analysis revealed previously underestimated nasA diversity; e.g. the nasA gene sequences affiliated with β-Proteobacteria, ε-Proteobacteria and Lentisphaerae were observed in the field investigation for the first time, to the best of our knowledge. The geographical and vertical distributions of seawater nasA-harboring bacteria indicated that NAB were highly diverse and ubiquitously distributed in the studied marginal seas and world oceans. Niche adaptation and separation and/or limited dispersal might mediate the NAB composition and community structure in different water bodies. In the shallow-water Kueishantao hydrothermal vent environment, chemolithoautotrophic sulfur-oxidizing bacteria were the primary NAB, indicating a unique nitrate-assimilating community in this extreme environment. In the coastal water of the East China Sea, the relative abundance of Alteromonas and Roseobacter-like nasA gene sequences responded closely to algal blooms, indicating that NAB may be active participants contributing to the bloom dynamics. Our statistical results suggested that salinity, temperature and nitrate may be some of the key environmental factors controlling the composition and dynamics of the marine NAB communities. PMID:25647610
Beyond Bacteria: A Study of the Enteric Microbial Consortium in Extremely Low Birth Weight Infants
Cotton, Charles Michael; Goldberg, Ronald N.; Wynn, James L.; Jackson, Robert B.; Seed, Patrick C.
2011-01-01
Extremely low birth weight (ELBW) infants have high morbidity and mortality, frequently due to invasive infections from bacteria, fungi, and viruses. The microbial communities present in the gastrointestinal tracts of preterm infants may serve as a reservoir for invasive organisms and remain poorly characterized. We used deep pyrosequencing to examine the gut-associated microbiome of 11 ELBW infants in the first postnatal month, with a first time determination of the eukaryote microbiota such as fungi and nematodes, including bacteria and viruses that have not been previously described. Among the fungi observed, Candida sp. and Clavispora sp. dominated the sequences, but a range of environmental molds were also observed. Surprisingly, seventy-one percent of the infant fecal samples tested contained ribosomal sequences corresponding to the parasitic organism Trichinella. Ribosomal DNA sequences for the roundworm symbiont Xenorhabdus accompanied these sequences in the infant with the greatest proportion of Trichinella sequences. When examining ribosomal DNA sequences in aggregate, Enterobacteriales, Pseudomonas, Staphylococcus, and Enterococcus were the most abundant bacterial taxa in a low diversity bacterial community (mean Shannon-Weaver Index of 1.02±0.69), with relatively little change within individual infants through time. To supplement the ribosomal sequence data, shotgun sequencing was performed on DNA from multiple displacement amplification (MDA) of total fecal genomic DNA from two infants. In addition to the organisms mentioned previously, the metagenome also revealed sequences for gram positive and gram negative bacteriophages, as well as human adenovirus C. Together, these data reveal surprising eukaryotic and viral microbial diversity in ELBW enteric microbiota dominated bytypes of bacteria known to cause invasive disease in these infants. PMID:22174751
Mucosal and Cutaneous Human Papillomaviruses Detected in Raw Sewages
La Rosa, Giuseppina; Fratini, Marta; Accardi, Luisa; D'Oro, Graziana; Della Libera, Simonetta; Muscillo, Michele; Di Bonito, Paola
2013-01-01
Epitheliotropic viruses can find their way into sewage. The aim of the present study was to investigate the occurrence, distribution, and genetic diversity of Human Papillomaviruses (HPVs) in urban wastewaters. Sewage samples were collected from treatment plants distributed throughout Italy. The DNA extracted from these samples was analyzed by PCR using five PV-specific sets of primers targeting the L1 (GP5/GP6, MY09/MY11, FAP59/64, SKF/SKR) and E1 regions (PM-A/PM-B), according to the protocols previously validated for the detection of mucosal and cutaneous HPV genotypes. PCR products underwent sequencing analysis and the sequences were aligned to reference genomes from the Papillomavirus Episteme database. Phylogenetic analysis was then performed to assess the genetic relationships among the different sequences and between the sequences of the samples and those of the prototype strains. A broad spectrum of sequences related to mucosal and cutaneous HPV types was detected in 81% of the sewage samples analyzed. Surprisingly, sequences related to the anogenital HPV6 and 11 were detected in 19% of the samples, and sequences related to the “high risk” oncogenic HPV16 were identified in two samples. Sequences related to HPV9, HPV20, HPV25, HPV76, HPV80, HPV104, HPV110, HPV111, HPV120 and HPV145 beta Papillomaviruses were detected in 76% of the samples. In addition, similarity searches and phylogenetic analysis of some sequences suggest that they could belong to putative new genotypes of the beta genus. In this study, for the first time, the presence of HPV viruses strongly related to human cancer is reported in sewage samples. Our data increases the knowledge of HPV genomic diversity and suggests that virological analysis of urban sewage can provide key information useful in supporting epidemiological studies. PMID:23341898
Tzanetakis, Giorgos N; Azcarate-Peril, M Andrea; Zachaki, Sophia; Panopoulos, Panos; Kontakiotis, Evangelos G; Madianos, Phoebus N; Divaris, Kimon
2015-08-01
Elucidating the microbial ecology of endodontic infections (EIs) is a necessary step in developing effective intracanal antimicrobials. The aim of the present study was to investigate the bacterial composition of symptomatic and asymptomatic primary and persistent infections in a Greek population using high-throughput sequencing methods. 16S amplicon pyrosequencing of 48 root canal bacterial samples was conducted, and sequencing data were analyzed using an oral microbiome-specific and a generic (Greengenes) database. Bacterial abundance and diversity were examined by EI type (primary or persistent), and statistical analysis was performed by using non-parametric and parametric tests accounting for clustered data. Bacteroidetes was the most abundant phylum in both infection groups. Significant, albeit weak associations of bacterial diversity were found, as measured by UniFrac distances with infection type (analyses of similarity, R = 0.087, P = .005) and symptoms (analyses of similarity, R = 0.055, P = .047). Persistent infections were significantly enriched for Proteobacteria and Tenericutes compared with primary ones; at the genus level, significant differences were noted for 14 taxa, including increased enrichment of persistent infections for Lactobacillus, Streptococcus, and Sphingomonas. More but less abundant phyla were identified using the Greengenes database; among those, Cyanobacteria (0.018%) and Acidobacteria (0.007%) were significantly enriched among persistent infections. Persistent infections showed higher phylogenetic diversity (PD) (asymptomatic: PD = 9.2, standard error [SE] = 1.3; symptomatic: PD = 8.2, SE = 0.7) compared with primary infections (asymptomatic: PD = 5.9, SE = 0.8; symptomatic: PD = 7.4, SE = 1.0). The present study revealed a high bacterial diversity of EI and suggests that persistent infections may have more diverse bacterial communities than primary infections. Copyright © 2015 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Foliar fungi of Betula pendula: impact of tree species mixtures and assessment methods
Nguyen, Diem; Boberg, Johanna; Cleary, Michelle; Bruelheide, Helge; Hönig, Lydia; Koricheva, Julia; Stenlid, Jan
2017-01-01
Foliar fungi of silver birch (Betula pendula) in an experimental Finnish forest were investigated across a gradient of tree species richness using molecular high-throughput sequencing and visual macroscopic assessment. We hypothesized that the molecular approach detects more fungal taxa than visual assessment, and that there is a relationship among the most common fungal taxa detected by both techniques. Furthermore, we hypothesized that the fungal community composition, diversity, and distribution patterns are affected by changes in tree diversity. Sequencing revealed greater diversity of fungi on birch leaves than the visual assessment method. One species showed a linear relationship between the methods. Species-specific variation in fungal community composition could be partially explained by tree diversity, though overall fungal diversity was not affected by tree diversity. Analysis of specific fungal taxa indicated tree diversity effects at the local neighbourhood scale, where the proportion of birch among neighbouring trees varied, but not at the plot scale. In conclusion, both methods may be used to determine tree diversity effects on the foliar fungal community. However, high-throughput sequencing provided higher resolution of the fungal community, while the visual macroscopic assessment detected functionally active fungal species. PMID:28150710
Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas; ...
2017-08-08
Here, we present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a MetagenomeAssembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Genemore » Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas
Here, we present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a MetagenomeAssembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Genemore » Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.« less
Integrative Clinical Genomics of Metastatic Cancer
Robinson, Dan R.; Wu, Yi-Mi; Lonigro, Robert J.; Vats, Pankaj; Cobain, Erin; Everett, Jessica; Cao, Xuhong; Rabban, Erica; Kumar-Sinha, Chandan; Raymond, Victoria; Schuetze, Scott; Alva, Ajjai; Siddiqui, Javed; Chugh, Rashmi; Worden, Francis; Zalupski, Mark M.; Innis, Jeffrey; Mody, Rajen J.; Tomlins, Scott A.; Lucas, David; Baker, Laurence H.; Ramnath, Nithya; Schott, Ann F.; Hayes, Daniel F.; Vijai, Joseph; Offit, Kenneth; Stoffel, Elena M.; Roberts, J. Scott; Smith, David C.; Kunju, Lakshmi P.; Talpaz, Moshe; Cieslik, Marcin; Chinnaiyan, Arul M.
2017-01-01
SUMMARY Metastasis is the primary cause of cancer-related deaths. While The Cancer Genome Atlas (TCGA) has sequenced primary tumor types obtained from surgical resections, much less comprehensive molecular analysis is available from clinically acquired metastatic cancers. Here, we perform whole exome and transcriptome sequencing of 500 adult patients with metastatic solid tumors of diverse lineage and biopsy site. The most prevalent genes somatically altered in metastatic cancer included TP53, CDKN2A, PTEN, PIK3CA, and RB1. Putative pathogenic germline variants were present in 12.2% of cases of which 75% were related to defects in DNA repair. RNA sequencing complemented DNA sequencing for the identification of gene fusions, pathway activation, and immune profiling. Integrative sequence analysis provides a clinically relevant, multi-dimensional view of the complex molecular landscape and microenvironment of metastatic cancers. PMID:28783718
Jensen, Peter D; Zhang, Yuanji; Wiggins, B Elizabeth; Petrick, Jay S; Zhu, Jin; Kerstetter, Randall A; Heck, Gregory R; Ivashuta, Sergey I
2013-01-01
Long double-stranded RNAs (long dsRNAs) are precursors for the effector molecules of sequence-specific RNA-based gene silencing in eukaryotes. Plant cells can contain numerous endogenous long dsRNAs. This study demonstrates that such endogenous long dsRNAs in plants have sequence complementarity to human genes. Many of these complementary long dsRNAs have perfect sequence complementarity of at least 21 nucleotides to human genes; enough complementarity to potentially trigger gene silencing in targeted human cells if delivered in functional form. However, the number and diversity of long dsRNA molecules in plant tissue from crops such as lettuce, tomato, corn, soy and rice with complementarity to human genes that have a long history of safe consumption supports a conclusion that long dsRNAs do not present a significant dietary risk.
Chang, Yu C; Scaria, Joy; Ibraham, Mariamma; Doiphode, Sanjay; Chang, Yung-Fu; Sultan, Ali; Mohammed, Hussni O
2016-01-01
Salmonella enterica is one of the most commonly reported causes of bacterial foodborne illness around the world. Understanding the sources of this pathogen and the associated factors that exacerbate its risk to humans will help in developing risk mitigation strategies. The genetic relatedness among Salmonella isolates recovered from human gastroenteritis cases and food animals in Qatar were investigated in the hope of shedding light on these sources, their possible transmission routes, and any associated factors. A repeat cross-sectional study was conducted in which the samples and associated data were collected from both populations (gastroenteritis cases and animals). Salmonella isolates were initially analyzed using multi-locus sequence typing (MLST) to investigate the genetic diversity and clonality. The relatedness among the isolates was assessed using the minimum spanning tree (MST). Twenty-seven different sequence types (STs) were identified in this study; among them, seven were novel, including ST1695, ST1696, ST1697, ST1698, ST1699, ST1702, and ST1703. The pattern of overall ST distribution was diverse; in particular, it was revealed that ST11 and ST19 were the most common sequence types, presenting 29.5% and 11.5% within the whole population. In addition, 20 eBurst Groups (eBGs) were identified in our data, which indicates that ST11 and ST19 belonged to eBG4 and eBG1, respectively. In addition, the potential association between the putative risk factors and eBGs were evaluated. There was no significant clustering of these eBGs by season; however, a significant association was identified in terms of nationality in that Qataris were six times more likely to present with eBG1 compared to non-Qataris. In the MST analysis, four major clusters were presented, namely, ST11, ST19, ST16, and ST31. The linkages between the clusters alluded to a possible transmission route. The results of the study have provided insight into the ST distributions of S. enterica and their possible zoonotic associations in Qatar. Published by Elsevier Ltd.
HIV-1 envelope sequence-based diversity measures for identifying recent infections
Kafando, Alexis; Fournier, Eric; Serhir, Bouchra; Martineau, Christine; Doualla-Bell, Florence; Sangaré, Mohamed Ndongo; Sylla, Mohamed; Chamberland, Annie; El-Far, Mohamed; Charest, Hugues
2017-01-01
Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections) env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS) using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC) of the receiver operating characteristic (ROC). Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001). Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806), gp120 C2_3 (AUC = 0.805) and gp120 V3 (AUC = 0.812). Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency. PMID:29284009
HIV-1 envelope sequence-based diversity measures for identifying recent infections.
Kafando, Alexis; Fournier, Eric; Serhir, Bouchra; Martineau, Christine; Doualla-Bell, Florence; Sangaré, Mohamed Ndongo; Sylla, Mohamed; Chamberland, Annie; El-Far, Mohamed; Charest, Hugues; Tremblay, Cécile L
2017-01-01
Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections) env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS) using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC) of the receiver operating characteristic (ROC). Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001). Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806), gp120 C2_3 (AUC = 0.805) and gp120 V3 (AUC = 0.812). Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency.
Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo
2017-01-01
Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections, also based on recently developed markers.
He, Yan; Caporaso, J Gregory; Jiang, Xiao-Tao; Sheng, Hua-Fang; Huse, Susan M; Rideout, Jai Ram; Edgar, Robert C; Kopylova, Evguenia; Walters, William A; Knight, Rob; Zhou, Hong-Wei
2015-01-01
The operational taxonomic unit (OTU) is widely used in microbial ecology. Reproducibility in microbial ecology research depends on the reliability of OTU-based 16S ribosomal subunit RNA (rRNA) analyses. Here, we report that many hierarchical and greedy clustering methods produce unstable OTUs, with membership that depends on the number of sequences clustered. If OTUs are regenerated with additional sequences or samples, sequences originally assigned to a given OTU can be split into different OTUs. Alternatively, sequences assigned to different OTUs can be merged into a single OTU. This OTU instability affects alpha-diversity analyses such as rarefaction curves, beta-diversity analyses such as distance-based ordination (for example, Principal Coordinate Analysis (PCoA)), and the identification of differentially represented OTUs. Our results show that the proportion of unstable OTUs varies for different clustering methods. We found that the closed-reference method is the only one that produces completely stable OTUs, with the caveat that sequences that do not match a pre-existing reference sequence collection are discarded. As a compromise to the factors listed above, we propose using an open-reference method to enhance OTU stability. This type of method clusters sequences against a database and includes unmatched sequences by clustering them via a relatively stable de novo clustering method. OTU stability is an important consideration when analyzing microbial diversity and is a feature that should be taken into account during the development of novel OTU clustering methods.
Tiago, Igor; Veríssimo, António
2013-06-01
Microbial and functional diversity were assessed, from a serpentinization-driven subterrestrial alkaline aquifer - Cabeço de Vide Aquifer (CVA) in Portugal. DGGE analyses revealed the presence of a stable microbial community. By 16S rRNA gene libraries and pyrosequencing analyses, a diverse bacterial composition was determined, contrasting with low archaeal diversity. Within Bacteria the majority of the populations were related to organisms or sequences affiliated to class Clostridia, but members of classes Acidobacteria, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Deinococci, Gammaproteobacteria and of the phyla Bacteroidetes, Chloroflexi and Nitrospira were also detected. Domain Archaea encompassed mainly sequences affiliated to Euryarchaeota. Only form I RuBisCO - cbbL was detected. Autotrophic carbon fixation via the rTCA, 3-HP and 3-HP/4H-B cycles could not be confirmed. The detected APS reductase alpha subunit - aprA sequences were phylogenetically related to sequences of sulfate-reducing bacteria belonging to Clostridia, and also to sequences of chemolithoautothrophic sulfur-oxidizing bacteria belonging to Betaproteobacteria. Sequences of methyl coenzyme M reductase - mcrA were phylogenetically affiliated to sequences belonging to Anaerobic Methanotroph group 1 (ANME-1). The populations found and the functional key markers detected in CVA suggest that metabolisms related to H2 , methane and/or sulfur may be the major driving forces in this environment. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
József Geml; Gary A. Laursen; Ian C. Herriott; Jack M. McFarland; Michael G. Booth; Niall Lennon; H. Chad Nusbaum; D. Lee Taylor
2010-01-01
Although critical for the functioning of ecosystems, fungi are poorly known in high-latitude regions. Here, we provide the first genetic diversity assessment of one of the most diverse and abundant ectomycorrhizal genera in Alaska: Russula. We analyzed internal transcribed spacer rDNA sequences from sporocarps and soil samples using phylogenetic...
Loss of arbuscular mycorrhizal fungal diversity in trap cultures during long-term subculturing.
Trejo-Aguilar, Dora; Lara-Capistrán, Liliana; Maldonado-Mendoza, Ignacio E; Zulueta-Rodríguez, Ramón; Sangabriel-Conde, Wendy; Mancera-López, María Elena; Negrete-Yankelevich, Simoneta; Barois, Isabelle
2013-12-01
Long-term successional dynamics of an inoculum of arbuscular mycorrhizal fungi (AMF) associated with the maize rhizosphere (from traditionally managed agroecosystems in Los Tuxtlas, Veracruz, Mexico), was followed in Bracchiaria comata trap cultures for almost eight years. The results indicate that AMF diversity is lost following long-term subculturing of a single plant host species. Only the dominant species, Claroideoglomus etunicatum, persisted in pot cultures after 13 cycles. The absence of other morphotypes was demonstrated by an 18S rDNA survey, which confirmed that the sequences present solely belonged to C. etunicatum. Members of Diversisporales were the first to decrease in diversity, and the most persistent species belonged to Glomerales.
TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders
Tarr, D Ellen K; Alexander, Helen M
2009-01-01
Background Plant resistance (R) gene products recognize pathogen effector molecules. Many R genes code for proteins containing nucleotide binding site (NBS) and C-terminal leucine-rich repeat (LRR) domains. NBS-LRR proteins can be divided into two groups, TIR-NBS-LRR and non-TIR-NBS-LRR, based on the structure of the N-terminal domain. Although both classes are clearly present in gymnosperms and eudicots, only non-TIR sequences have been found consistently in monocots. Since most studies in monocots have been limited to agriculturally important grasses, it is difficult to draw conclusions. The purpose of our study was to look for evidence of these sequences in additional monocot orders. Findings Using degenerate PCR, we amplified NBS sequences from four monocot species (C. blanda, D. marginata, S. trifasciata, and Spathiphyllum sp.), a gymnosperm (C. revoluta) and a eudicot (C. canephora). We successfully amplified TIR-NBS-LRR sequences from dicot and gymnosperm DNA, but not from monocot DNA. Using databases, we obtained NBS sequences from additional monocots, magnoliids and basal angiosperms. TIR-type sequences were not present in monocot or magnoliid sequences, but were present in the basal angiosperms. Phylogenetic analysis supported a single TIR clade and multiple non-TIR clades. Conclusion We were unable to find monocot TIR-NBS-LRR sequences by PCR amplification or database searches. In contrast to previous studies, our results represent five monocot orders (Poales, Zingiberales, Arecales, Asparagales, and Alismatales). Our results establish the presence of TIR-NBS-LRR sequences in basal angiosperms and suggest that although these sequences were present in early land plants, they have been reduced significantly in monocots and magnoliids. PMID:19785756
Characterization of an endogenous retrovirus class in elephants and their relatives
Greenwood, Alex D; Englbrecht, Claudia C; MacPhee, Ross DE
2004-01-01
Background Endogenous retrovirus-like elements (ERV-Ls, primed with tRNA leucine) are a diverse group of reiterated sequences related to foamy viruses and widely distributed among mammals. As shown in previous investigations, in many primates and rodents this class of elements has remained transpositionally active, as reflected by increased copy number and high sequence diversity within and among taxa. Results Here we examine whether proviral-like sequences may be suitable molecular probes for investigating the phylogeny of groups known to have high element diversity. As a test we characterized ERV-Ls occurring in a sample of extant members of superorder Uranotheria (Asian and African elephants, manatees, and hyraxes). The ERV-L complement in this group is even more diverse than previously suspected, and there is sequence evidence for active expansion, particularly in elephantids. Many of the elements characterized have protein coding potential suggestive of activity. Conclusions In general, the evidence supports the hypothesis that the complement had a single origin within basal Uranotheria. PMID:15476555
Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai
2016-10-21
An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (F st ) and neutrality tests (Tajima's D, Fu and Li D* and Fu and Li' F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
Pardal, Sara; Drews, Anna; Alves, José A; Ramos, Jaime A; Westerdahl, Helena
2017-07-01
The major histocompatibility complex (MHC) encodes proteins that are central for antigen presentation and pathogen elimination. MHC class I (MHC-I) genes have attracted a great deal of interest among researchers in ecology and evolution and have been partly characterized in a wide range of bird species. So far, the main focus has been on species within the bird orders Galliformes and Passeriformes, while Charadriiformes remain vastly underrepresented with only two species studied to date. These two Charadriiformes species exhibit striking differences in MHC-I characteristics and MHC-I diversity. We therefore set out to study a third species within Charadriiformes, the Icelandic subspecies of black-tailed godwits (Limosa limosa islandica). This subspecies is normally confined to parasite-poor environments, and we hence expected low MHC diversity. MHC-I was partially characterized first using Sanger sequencing and then using high-throughput sequencing (MiSeq) in 84 individuals. We verified 47 nucleotide alleles in open reading frame with classical MHC-I characteristics, and each individual godwit had two to seven putatively classical MHC alleles. However, in contrast to previous MHC-I data within Charadriiformes, we did not find any evidence of alleles with low sequence diversity, believed to represent non-classical MHC genes. The diversity and divergence of the godwits MHC-I genes to a large extent fell between the previous estimates within Charadriiformes. However, the MHC genes of the migratory godwits had few sites subject to positive selection, and one possible explanation could be a low exposure to pathogens.
Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C
2010-12-01
Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.
Strain-Level Diversity of Secondary Metabolism in Streptomyces albus
Seipke, Ryan F.
2015-01-01
Streptomyces spp. are robust producers of medicinally-, industrially- and agriculturally-important small molecules. Increased resistance to antibacterial agents and the lack of new antibiotics in the pipeline have led to a renaissance in natural product discovery. This endeavor has benefited from inexpensive high quality DNA sequencing technology, which has generated more than 140 genome sequences for taxonomic type strains and environmental Streptomyces spp. isolates. Many of the sequenced streptomycetes belong to the same species. For instance, Streptomyces albus has been isolated from diverse environmental niches and seven strains have been sequenced, consequently this species has been sequenced more than any other streptomycete, allowing valuable analyses of strain-level diversity in secondary metabolism. Bioinformatics analyses identified a total of 48 unique biosynthetic gene clusters harboured by Streptomyces albus strains. Eighteen of these gene clusters specify the core secondary metabolome of the species. Fourteen of the gene clusters are contained by one or more strain and are considered auxiliary, while 16 of the gene clusters encode the production of putative strain-specific secondary metabolites. Analysis of Streptomyces albus strains suggests that each strain of a Streptomyces species likely harbours at least one strain-specific biosynthetic gene cluster. Importantly, this implies that deep sequencing of a species will not exhaust gene cluster diversity and will continue to yield novelty. PMID:25635820
Ginger, Michael L.; Fritz-Laylin, Lillian K.; Fulton, Chandler; Cande, W. Zacheus; Dawson, Scott C.
2011-01-01
Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2–3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H2 in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. PMID:21036663
Identifying airborne fungi in Seoul, Korea using metagenomics.
Oh, Seung-Yoon; Fong, Jonathan J; Park, Myung Soo; Chang, Limseok; Lim, Young Woon
2014-06-01
Fungal spores are widespread and common in the atmosphere. In this study, we use a metagenomic approach to study the fungal diversity in six total air samples collected from April to May 2012 in Seoul, Korea. This springtime period is important in Korea because of the peak in fungal spore concentration and Asian dust storms, although the year of this study (2012) was unique in that were no major Asian dust events. Clustering sequences for operational taxonomic unit (OTU) identification recovered 1,266 unique OTUs in the combined dataset, with between 223᾿96 OTUs present in individual samples. OTUs from three fungal phyla were identified. For Ascomycota, Davidiella (anamorph: Cladosporium) was the most common genus in all samples, often accounting for more than 50% of all sequences in a sample. Other common Ascomycota genera identified were Alternaria, Didymella, Khuskia, Geosmitha, Penicillium, and Aspergillus. While several Basidiomycota genera were observed, Chytridiomycota OTUs were only present in one sample. Consistency was observed within sampling days, but there was a large shift in species composition from Ascomycota dominant to Basidiomycota dominant in the middle of the sampling period. This marked change may have been caused by meteorological events. A potential set of 40 allergy-inducing genera were identified, accounting for a large proportion of the diversity present (22.5᾿7.2%). Our study identifies high fungal diversity and potentially high levels of fungal allergens in springtime air of Korea, and provides a good baseline for future comparisons with Asian dust storms.
Next generation sequencing for molecular confirmation of hereditary sudden cardiac death syndromes.
Márquez, Manlio F; Cruz-Robles, David; Ines-Real, Selene; Vargas-Alarcón, Gilberto; Cárdenas, Manuel
2015-01-01
Hereditary sudden cardiac death syndromes comprise a wide range of diseases resulting from alteration in cardiac ion channels. Genes involved in these syndromes represent diverse mutations that cause the altered encoding of the diverse proteins constituting these channels, thus affecting directly the currents of the corresponding ions. In the present article we will briefly review how to arrive to a clinical diagnosis and we will present the results of molecular genetic studies made in Mexican subjects attending the SCD Syndromes Clinic of the National Institute of Cardiology of Mexico City. Copyright © 2014 Instituto Nacional de Cardiología Ignacio Chávez. Published by Masson Doyma México S.A. All rights reserved.
The African Genome Variation Project shapes medical genetics in Africa
NASA Astrophysics Data System (ADS)
Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O.; Choudhury, Ananyo; Ritchie, Graham R. S.; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N.; Young, Elizabeth H.; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P.; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A.; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S.
2015-01-01
Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.
Meczker, Katalin; Dömötör, Dóra; Vass, János; Rákhely, Gábor; Schneider, György; Kovács, Tamás
2014-01-01
The enterobacterium Erwinia amylovora is the causal agent of fire blight. This study presents the analysis of the complete genome of phage PhiEaH1, isolated from the soil surrounding an E. amylovora-infected apple tree in Hungary. Its genome is 218 kb in size, containing 244 ORFs. PhiEaH1 is the second E. amylovora infecting phage from the Siphoviridae family whose complete genome sequence was determined. Beside PhiEaH2, PhiEaH1 is the other active component of Erwiphage, the first bacteriophage-based pesticide on the market against E. amylovora. Comparative genome analysis in this study has revealed that PhiEaH1 not only differs from the 10 formerly sequenced E. amylovora bacteriophages belonging to other phage families, but also from PhiEaH2. Sequencing of more Siphoviridae phage genomes might reveal further diversity, providing opportunities for the development of even more effective biological control agents, phage cocktails against Erwinia fire blight disease of commercial fruit crops.
Patil, Tejas Suresh; Tamboli, Asif Shabodin; Patil, Swapnil Mahadeo; Bhosale, Amrut Ravindra; Govindwar, Sanjay Prabhu; Muley, Dipak Vishwanathrao
2016-01-01
Genus Nemacheilus, Nemachilichthys and Schistura belong to the family Nemacheilidae of the order Cypriniformes. The present investigation was undertaken to observe genetic diversity, phylogenetic relationship and to develop a molecular-based tool for taxonomic identification. For this purpose, four different types of molecular markers were utilized in which 29 random amplified polymorphic DNA (RAPD), 25 inter-simple sequence repeat (ISSR) markers, and 10 amplified fragment length polymorphism (AFLP) marker sets were screened and mitochondrial COI gene was sequenced. This study added COI barcodes for the identification of Nemacheilus anguilla, Nemachilichthys rueppelli and Schistura denisoni. RAPD showed higher polymorphism (100%) than the ISSR (93.75-100%) and AFLP (93.86-98.96%). The polymorphic information content (PIC), heterozygosity, multiplex ratio, and gene diversity was observed highest for AFLP primers, whereas the major allele frequency was observed higher for RAPD (0.5556) and lowest for AFLP (0.1667). The COI region of all individuals was successfully amplified and sequenced, which gave a 100% species resolution. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
The African Genome Variation Project shapes medical genetics in Africa.
Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O; Choudhury, Ananyo; Ritchie, Graham R S; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N; Young, Elizabeth H; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S
2015-01-15
Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.
Hemmink, Johanneke D; Sitt, Tatjana; Pelle, Roger; de Klerk-Lorist, Lin-Mari; Shiels, Brian; Toye, Philip G; Morrison, W Ivan; Weir, William
2018-03-01
An infection and treatment protocol involving infection with a mixture of three parasite isolates and simultaneous treatment with oxytetracycline is currently used to vaccinate cattle against Theileria parva. While vaccination results in high levels of protection in some regions, little or no protection is observed in areas where animals are challenged predominantly by parasites of buffalo origin. A previous study involving sequencing of two antigen-encoding genes from a series of parasite isolates indicated that this is associated with greater antigenic diversity in buffalo-derived T. parva. The current study set out to extend these analyses by applying high-throughput sequencing to ex vivo samples from naturally infected buffalo to determine the extent of diversity in a set of antigen-encoding genes. Samples from two populations of buffalo, one in Kenya and the other in South Africa, were examined to investigate the effect of geographical distance on the nature of sequence diversity. The results revealed a number of significant findings. First, there was a variable degree of nucleotide sequence diversity in all gene segments examined, with the percentage of polymorphic nucleotides ranging from 10% to 69%. Second, large numbers of allelic variants of each gene were found in individual animals, indicating multiple infection events. Third, despite the observed diversity in nucleotide sequences, several of the gene products had highly conserved amino acid sequences, and thus represent potential candidates for vaccine development. Fourth, although compelling evidence for population differentiation between the Kenyan and South African T. parva parasites was identified, analysis of molecular variance for each gene revealed that the majority of the underlying nucleotide sequence polymorphism was common to both areas, indicating that much of this aspect of genetic variation in the parasite population arose prior to geographic separation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter
2017-01-01
The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230
Bacterial diversity in permanently cold and alkaline ikaite columns from Greenland.
Schmidt, Mariane; Priemé, Anders; Stougaard, Peter
2006-12-01
Bacterial diversity in alkaline (pH 10.4) and permanently cold (4 degrees C) ikaite tufa columns from the Ikka Fjord, SW Greenland, was investigated using growth characterization of cultured bacterial isolates with Terminal-restriction fragment length polymorphism (T-RFLP) and sequence analysis of bacterial 16S rRNA gene fragments. More than 200 bacterial isolates were characterized with respect to pH and temperature tolerance, and it was shown that the majority were cold-active alkaliphiles. T-RFLP analysis revealed distinct bacterial communities in different fractions of three ikaite columns, and, along with sequence analysis, it showed the presence of rich and diverse bacterial communities. Rarefaction analysis showed that the 109 sequenced clones in the 16S rRNA gene library represented between 25 and 65% of the predicted species richness in the three ikaite columns investigated. Phylogenetic analysis of the 16S rRNA gene sequences revealed many sequences with similarity to alkaliphilic or psychrophilic bacteria, and showed that 33% of the cloned sequences and 33% of the cultured bacteria showed less than 97% sequence identity to known sequences in databases, and may therefore represent yet unknown species.
Comparative analysis of the feline immunoglobulin repertoire.
Steiniger, Sebastian C J; Glanville, Jacob; Harris, Douglas W; Wilson, Thomas L; Ippolito, Gregory C; Dunham, Steven A
2017-03-01
Next-Generation Sequencing combined with bioinformatics is a powerful tool for analyzing the large number of DNA sequences present in the expressed antibody repertoire and these data sets can be used to advance a number of research areas including antibody discovery and engineering. The accurate measurement of the immune repertoire sequence composition, diversity and abundance is important for understanding the repertoire response in infections, vaccinations and cancer immunology and could also be useful for elucidating novel molecular targets. In this study 4 individual domestic cats (Felis catus) were subjected to antibody repertoire sequencing with total number of sequences generated 1079863 for VH for IgG, 1050824 VH for IgM, 569518 for VK and 450195 for VL. Our analysis suggests that a similar VDJ expression patterns exists across all cats. Similar to the canine repertoire, the feline repertoire is dominated by a single subgroup, namely VH3. The antibody paratope of felines showed similar amino acid variation when compared to human, mouse and canine counterparts. All animals show a similarly skewed VH CDR-H3 profile and, when compared to canine, human and mouse, distinct differences are observed. Our study represents the first attempt to characterize sequence diversity in the expressed feline antibody repertoire and this demonstrates the utility of using NGS to elucidate entire antibody repertoires from individual animals. These data provide significant insight into understanding the feline immune system function. Copyright © 2017 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M. G.; Mourier, Tobias; Nielsen, Lars Peter
2016-01-01
Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667
A Window Into Clinical Next-Generation Sequencing-Based Oncology Testing Practices.
Nagarajan, Rakesh; Bartley, Angela N; Bridge, Julia A; Jennings, Lawrence J; Kamel-Reid, Suzanne; Kim, Annette; Lazar, Alexander J; Lindeman, Neal I; Moncur, Joel; Rai, Alex J; Routbort, Mark J; Vasalos, Patricia; Merker, Jason D
2017-12-01
- Detection of acquired variants in cancer is a paradigm of precision medicine, yet little has been reported about clinical laboratory practices across a broad range of laboratories. - To use College of American Pathologists proficiency testing survey results to report on the results from surveys on next-generation sequencing-based oncology testing practices. - College of American Pathologists proficiency testing survey results from more than 250 laboratories currently performing molecular oncology testing were used to determine laboratory trends in next-generation sequencing-based oncology testing. - These presented data provide key information about the number of laboratories that currently offer or are planning to offer next-generation sequencing-based oncology testing. Furthermore, we present data from 60 laboratories performing next-generation sequencing-based oncology testing regarding specimen requirements and assay characteristics. The findings indicate that most laboratories are performing tumor-only targeted sequencing to detect single-nucleotide variants and small insertions and deletions, using desktop sequencers and predesigned commercial kits. Despite these trends, a diversity of approaches to testing exists. - This information should be useful to further inform a variety of topics, including national discussions involving clinical laboratory quality systems, regulation and oversight of next-generation sequencing-based oncology testing, and precision oncology efforts in a data-driven manner.
CRISPR Diversity and Microevolution in Clostridium difficile
Andersen, Joakim M.; Shoup, Madelyn; Robinson, Cathy; Britton, Robert; Olsen, Katharina E.P.; Barrangou, Rodolphe
2016-01-01
Abstract Virulent strains of Clostridium difficile have become a global health problem associated with morbidity and mortality. Traditional typing methods do not provide ideal resolution to track outbreak strains, ascertain genetic diversity between isolates, or monitor the phylogeny of this species on a global basis. Here, we investigate the occurrence and diversity of clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (cas) in C. difficile to assess the potential of CRISPR-based phylogeny and high-resolution genotyping. A single Type-IB CRISPR-Cas system was identified in 217 analyzed genomes with cas gene clusters present at conserved chromosomal locations, suggesting vertical evolution of the system, assessing a total of 1,865 CRISPR arrays. The CRISPR arrays, markedly enriched (8.5 arrays/genome) compared with other species, occur both at conserved and variable locations across strains, and thus provide a basis for typing based on locus occurrence and spacer polymorphism. Clustering of strains by array composition correlated with sequence type (ST) analysis. Spacer content and polymorphism within conserved CRISPR arrays revealed phylogenetic relationship across clades and within ST. Spacer polymorphisms of conserved arrays were instrumental for differentiating closely related strains, e.g., ST1/RT027/B1 strains and pathogenicity locus encoding ST3/RT001 strains. CRISPR spacers showed sequence similarity to phage sequences, which is consistent with the native role of CRISPR-Cas as adaptive immune systems in bacteria. Overall, CRISPR-Cas sequences constitute a valuable basis for genotyping of C. difficile isolates, provide insights into the micro-evolutionary events that occur between closely related strains, and reflect the evolutionary trajectory of these genomes. PMID:27576538
Kumar, Anil; Sharma, Divya; Tiwari, Apoorv; Jaiswal, J P; Singh, N K; Sood, Salej
2016-07-01
Finger millet [ (L.) Gaertn.] is grown mainly by subsistence farmers in arid and semiarid regions of the world. To broaden its genetic base and to boost its production, it is of paramount importance to characterize and genotype the diverse gene pool of this important food and nutritional security crop. However, as a result of nonavailability of the genome sequence of finger millet, the progress could not be made in realizing the molecular basis of unique qualities of the crop. In the present investigation, attempts have been made to characterize the genetically diverse collection of 113 finger millet accessions through whole-genome genotyping-by-sequencing (GBS), which resulted in a genome-wide set of 23,000 single-nucleotide polymorphisms (SNPs) segregating across the entire collection and several thousand SNPs segregating within every accession. A model-based population structure analysis reveals the presence of three subpopulations among the finger millet accessions, which are in parallel with the results of phylogenetic analysis. The observed population structure is consistent with the hypothesis that finger millet was domesticated first in Africa, and from there it was introduced to India some 3000 yr ago. A total of 1128 gene ontology (GO) terms were assigned to SNP-carrying genes for three main categories: biological process, cellular component, and molecular function. Facilitated access to high-throughput genotyping and sequencing technologies are likely to improve the breeding process in developing countries, and as such, this data will be very useful to breeders who are working for the genetic improvement of finger millet. Copyright © 2016 Crop Science Society of America.
Complete sequence and diversity of a maize-associated Polerovirus in East Africa.
Massawe, Deogracious P; Stewart, Lucy R; Kamatenesi, Jovia; Asiimwe, Theodore; Redinbaugh, Margaret G
2018-06-01
Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae, primarily sugarcane mosaic virus (SCMV), and maize chlorotic mottle virus (MCMV) (Tombusviridae), which are known to cause MLN by synergistic co-infection. In addition to these expected viruses, we identified a virus in the genus Polerovirus (family Luteoviridae) in 104/172 samples selected for MLN or other potential virus symptoms from Kenya, Uganda, Rwanda, and Tanzania. This polerovirus (MF974579) nucleotide sequence is 97% identical to maize-associated viruses recently reported in China, termed 'maize yellow mosaic virus' (MaYMV) and maize yellow dwarf virus (MaYMV; KU291101, KU291107, MYDV-RMV2; KT992824); and 99% identical to MaYMV (KY684356) infecting sugarcane and itch grass in Nigeria; 83% identical to a barley-associated polerovirus recently identified in Korea (BVG; KT962089); and 79% identical to the U.S. maize-infecting polerovirus maize yellow dwarf virus (MYDV-RMV; KT992824). Nucleotide sequences from ORF0 of 20 individual East African isolates collected from Kenya, Uganda, Rwanda, and Tanzania shared 98% or higher identity, and were detected in 104/172 (60.5%) of samples collected for virus-like symptoms, indicating extensive prevalence but limited diversity of this virus in East Africa. We refer to this virus as "MYDV-like polerovirus" until symptoms of the virus in maize are known.
Bacterioplankton diversity and community composition in the Southern Lagoon of Venice.
Simonato, Francesca; Gómez-Pereira, Paola R; Fuchs, Bernhard M; Amann, Rudolf
2010-04-01
The Lagoon of Venice is a large water basin that exchanges water with the Northern Adriatic Sea through three large inlets. In this study, the 16S rRNA approach was used to investigate the bacterial diversity and community composition within the southern basin of the Lagoon of Venice and at one inlet in October 2007 and June 2008. Comparative sequence analysis of 645 mostly partial 16S rRNA gene sequences indicated high diversity and dominance of Alphaproteobacteria, Gammaproteobacteria and Bacteroidetes at the lagoon as well as at the inlet station, therefore pointing to significant mixing. Many of these sequences were close to the 16S rRNA of marine, often coastal, bacterioplankton, such as the Roseobacter clade, the family Vibrionaceae, and class Flavobacteria. Sequences of Actinobacteria were indicators of a freshwater input. The composition of the bacterioplankton was quantified by catalyzed reporter deposition fluorescence in situ hybridization (CARD-FISH) with a set of rRNA-targeted oligonucleotide probes. CARD-FISH counts corroborated the dominance of members of the phyla Alphaproteobacteria, Gammaproteobacteria and Bacteroidetes. When assessed by a probe set for the quantification of selected clades within Alphaproteobacteria and Gammaproteobacteria, bacterioplankton composition differed between October 2007 and June 2008, and also between the inlet and the lagoon. In particular, members of the readily culturable copiotrophic gammaproteobacterial genera Vibrio, Alteromonas and Pseudoalteromonas were enriched in the southern basin of the Lagoon of Venice. Interestingly, the alphaproteobacterial SAR11 clade and related clusters were also present in high abundances at the inlet and within the lagoon, which was indicative of inflow of water from the open sea.
Mendes, Lucas William; Taketani, Rodrigo Gouvêa; Navarrete, Acácio Aparecido; Tsai, Siu Mui
2012-06-01
This study focused on the structure and composition of archaeal communities in sediments of tropical mangroves in order to obtain sufficient insight into two Brazilian sites from different locations (one pristine and another located in an urban area) and at different depth levels from the surface. Terminal restriction fragment length polymorphism (T-RFLP) of PCR-amplified 16S rRNA gene fragments was used to scan the archaeal community structure, and 16S rRNA gene clone libraries were used to determine the community composition. Redundancy analysis of T-RFLP patterns revealed differences in archaeal community structure according to location, depth and soil attributes. Parameters such as pH, organic matter, potassium and magnesium presented significant correlation with general community structure. Furthermore, phylogenetic analysis revealed a community composition distributed differently according to depth where, in shallow samples, 74.3% of sequences were affiliated with Euryarchaeota and 25.7% were shared between Crenarchaeota and Thaumarchaeota, while for the deeper samples, 24.3% of the sequences were affiliated with Euryarchaeota and 75.7% with Crenarchaeota and Thaumarchaeota. Archaeal diversity measurements based on 16S rRNA gene clone libraries decreased with increasing depth and there was a greater difference between depths (<18% of sequences shared) than sites (>25% of sequences shared). Taken together, our findings indicate that mangrove ecosystems support a diverse archaeal community; it might possibly be involved in nutrient cycles and are affected by sediment properties, depth and distinct locations. Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Merson, Samuel D; Ouwerkerk, Diane; Gulino, Lisa-Maree; Klieve, Athol; Bonde, Robert K; Burgess, Elizabeth A; Lanyon, Janet M
2014-03-01
The Florida manatee, Trichechus manatus latirostris, is a hindgut-fermenting herbivore. In winter, manatees migrate to warm water overwintering sites where they undergo dietary shifts and may suffer from cold-induced stress. Given these seasonally induced changes in diet, the present study aimed to examine variation in the hindgut bacterial communities of wild manatees overwintering at Crystal River, west Florida. Faeces were sampled from 36 manatees of known sex and body size in early winter when manatees were newly arrived and then in mid-winter and late winter when diet had probably changed and environmental stress may have increased. Concentrations of faecal cortisol metabolite, an indicator of a stress response, were measured by enzyme immunoassay. Using 454-pyrosequencing, 2027 bacterial operational taxonomic units were identified in manatee faeces following amplicon pyrosequencing of the 16S rRNA gene V3/V4 region. Classified sequences were assigned to eight previously described bacterial phyla; only 0.36% of sequences could not be classified to phylum level. Five core phyla were identified in all samples. The majority (96.8%) of sequences were classified as Firmicutes (77.3 ± 11.1% of total sequences) or Bacteroidetes (19.5 ± 10.6%). Alpha-diversity measures trended towards higher diversity of hindgut microbiota in manatees in mid-winter compared to early and late winter. Beta-diversity measures, analysed through PERMANOVA, also indicated significant differences in bacterial communities based on the season. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Wuchter, Cornelia; Banning, Erin; Mincer, Tracy J.; Drenzek, Nicholas J.; Coolen, Marco J. L.
2013-01-01
The Antrim Shale in the Michigan Basin is one of the most productive shale gas formations in the U.S., but optimal resource recovery strategies must rely on a thorough understanding of the complex biogeochemical, microbial, and physical interdependencies in this and similar systems. We used Illumina MiSeq 16S rDNA sequencing to analyze the diversity and relative abundance of prokaryotic communities present in Antrim shale formation water of three closely spaced recently fractured gas-producing wells. In addition, the well waters were incubated with a suite of fermentative and methanogenic substrates in an effort to stimulate microbial methane generation. The three wells exhibited substantial differences in their community structure that may arise from their different drilling and fracturing histories. Bacterial sequences greatly outnumbered those of archaea and shared highest similarity to previously described cultures of mesophiles and moderate halophiles within the Firmicutes, Bacteroidetes, and δ- and ε-Proteobacteria. The majority of archaeal sequences shared highest sequence similarity to uncultured euryarchaeotal environmental clones. Some sequences closely related to cultured methylotrophic and hydrogenotrophic methanogens were also present in the initial well water. Incubation with methanol and trimethylamine stimulated methylotrophic methanogens and resulted in the largest increase in methane production in the formation waters, while fermentation triggered by the addition of yeast extract and formate indirectly stimulated hydrogenotrophic methanogens. The addition of sterile powdered shale as a complex natural substrate stimulated the rate of methane production without affecting total methane yields. Depletion of methane indicative of anaerobic methane oxidation (AMO) was observed over the course of incubation with some substrates. This process could constitute a substantial loss of methane in the shale formation. PMID:24367357
Henry, Kevin A
2018-01-01
Immunogenetic analyses of expressed antibody repertoires are becoming increasingly common experimental investigations and are critical to furthering our understanding of autoimmunity, infectious disease, and cancer. Next-generation DNA sequencing (NGS) technologies have now made it possible to interrogate antibody repertoires to unprecedented depths, typically by sequencing of cDNAs encoding immunoglobulin variable domains. In this chapter, we describe simple, fast, and reliable methods for producing and sequencing multiplex PCR amplicons derived from the variable regions (V H , V H H or V L ) of rearranged immunoglobulin heavy and light chain genes using the Illumina MiSeq platform. We include complete protocols and primer sets for amplicon sequencing of V H /V H H/V L repertoires directly from human, mouse, and llama lymphocytes as well as from phage-displayed V H /V H H/V L libraries; these can be easily be adapted to other types of amplicons with little modification. The resulting amplicons are diverse and representative, even using as few as 10 3 input B cells, and their generation is relatively inexpensive, requiring no special equipment and only a limited set of primers. In the absence of heavy-light chain pairing, single-domain antibodies are uniquely amenable to NGS analyses. We present a number of applications of NGS technology useful in discovery of single-domain antibodies from phage display libraries, including: (i) assessment of library functionality; (ii) confirmation of desired library randomization; (iii) estimation of library diversity; and (iv) monitoring the progress of panning experiments. While the case studies presented here are of phage-displayed single-domain antibody libraries, the principles extend to other types of in vitro display libraries.
Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind
2016-09-01
In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
Turnbaugh, Peter J.; Quince, Christopher; Faith, Jeremiah J.; McHardy, Alice C.; Yatsunenko, Tanya; Niazi, Faheem; Affourtit, Jason; Egholm, Michael; Henrissat, Bernard; Knight, Rob; Gordon, Jeffrey I.
2010-01-01
We deeply sampled the organismal, genetic, and transcriptional diversity in fecal samples collected from a monozygotic (MZ) twin pair and compared the results to 1,095 communities from the gut and other body habitats of related and unrelated individuals. Using a new scheme for noise reduction in pyrosequencing data, we estimated the total diversity of species-level bacterial phylotypes in the 1.2-1.5 million bacterial 16S rRNA reads obtained from each deeply sampled cotwin to be ~800 (35.9%, 49.1% detected in both). A combined 1.1 million read 16S rRNA dataset representing 281 shallowly sequenced fecal samples from 54 twin pairs and their mothers contained an estimated 4,018 species-level phylotypes, with each sample having a unique species assemblage (53.4 ± 0.6% and 50.3 ± 0.5% overlap with the deeply sampled cotwins). Of the 134 phylotypes with a relative abundance of >0.1% in the combined dataset, only 37 appeared in >50% of the samples, with one phylotype in the Lachnospiraceae family present in 99%. Nongut communities had significantly reduced overlap with the deeply sequenced twins’ fecal microbiota (18.3 ± 0.3%, 15.3 ± 0.3%). The MZ cotwins’ fecal DNA was deeply sequenced (3.8-6.3 Gbp/sample) and assembled reads were assigned to 25 genus-level phylogenetic bins. Only 17% of the genes in these bins were shared between the cotwins. Bins exhibited differences in their degree of sequence variation, gene content including the repertoire of carbohydrate active enzymes present within and between twins (e.g., predicted cellulases, dockerins), and transcriptional activities. These results provide an expanded perspective about features that make each of us unique life forms and directions for future characterization of our gut ecosystems. PMID:20363958
Severe chronic osteomyelitis caused by Morganella morganii with high population diversity.
Zhu, Jialiang; Li, Haifeng; Feng, Li; Yang, Min; Yang, Ronggong; Yang, Lin; Li, Li; Li, Ruoyan; Liu, Minshan; Hou, Shuxun; Ke, Yuehua; Li, Wenfeng; Bai, Fan
2016-09-01
A case of chronic osteomyelitis probably caused by Morganella morganii, occurring over a period of 30 years, is reported. The organism was identified through a combination of sample culture, direct sequencing, and 16S RNA gene amplicon sequencing. Further whole-genome sequencing and population structure analysis of the isolates from the patient showed the bacterial population to be highly diverse. This case provides a valuable example of a long-term infection caused by an opportunistic pathogen, M. morganii, with high diversity, which might evolve during replication within the host. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersen, Mikael R.; Salazar, Margarita; Schaap, Peter
2011-06-01
The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compels additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regionsmore » have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases and protein transporters.« less
Valiadi, Martha; Iglesias-Rodriguez, Maria Debora
2014-01-01
Dinoflagellate bioluminescence systems operate with or without a luciferin binding protein, representing two distinct modes of light production. However, the distribution, diversity, and evolution of the luciferin binding protein gene within bioluminescent dinoflagellates are not well known. We used PCR to detect and partially sequence this gene from the heterotrophic dinoflagellate Noctiluca scintillans and a group of ecologically important gonyaulacoid species. We report an additional luciferin binding protein gene in N. scintillans which is not attached to luciferase, further to its typical combined bioluminescence gene. This supports the hypothesis that a profound re-organization of the bioluminescence system has taken place in this organism. We also show that the luciferin binding protein gene is present in the genera Ceratocorys, Gonyaulax, and Protoceratium, and is prevalent in bioluminescent species of Alexandrium. Therefore, this gene is an integral component of the standard molecular bioluminescence machinery in dinoflagellates. Nucleotide sequences showed high within-strain variation among gene copies, revealing a highly diverse gene family comprising multiple gene types in some organisms. Phylogenetic analyses showed that, in some species, the evolution of the luciferin binding protein gene was different from the organism's general phylogenies, highlighting the complex evolutionary history of dinoflagellate bioluminescence systems. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.
Robarts, Daniel W H; Wolfe, Andrea D
2014-07-01
In the past few decades, many investigations in the field of plant biology have employed selectively neutral, multilocus, dominant markers such as inter-simple sequence repeat (ISSR), random-amplified polymorphic DNA (RAPD), and amplified fragment length polymorphism (AFLP) to address hypotheses at lower taxonomic levels. More recently, sequence-related amplified polymorphism (SRAP) markers have been developed, which are used to amplify coding regions of DNA with primers targeting open reading frames. These markers have proven to be robust and highly variable, on par with AFLP, and are attained through a significantly less technically demanding process. SRAP markers have been used primarily for agronomic and horticultural purposes, developing quantitative trait loci in advanced hybrids and assessing genetic diversity of large germplasm collections. Here, we suggest that SRAP markers should be employed for research addressing hypotheses in plant systematics, biogeography, conservation, ecology, and beyond. We provide an overview of the SRAP literature to date, review descriptive statistics of SRAP markers in a subset of 171 publications, and present relevant case studies to demonstrate the applicability of SRAP markers to the diverse field of plant biology. Results of these selected works indicate that SRAP markers have the potential to enhance the current suite of molecular tools in a diversity of fields by providing an easy-to-use, highly variable marker with inherent biological significance.
Robarts, Daniel W. H.; Wolfe, Andrea D.
2014-01-01
In the past few decades, many investigations in the field of plant biology have employed selectively neutral, multilocus, dominant markers such as inter-simple sequence repeat (ISSR), random-amplified polymorphic DNA (RAPD), and amplified fragment length polymorphism (AFLP) to address hypotheses at lower taxonomic levels. More recently, sequence-related amplified polymorphism (SRAP) markers have been developed, which are used to amplify coding regions of DNA with primers targeting open reading frames. These markers have proven to be robust and highly variable, on par with AFLP, and are attained through a significantly less technically demanding process. SRAP markers have been used primarily for agronomic and horticultural purposes, developing quantitative trait loci in advanced hybrids and assessing genetic diversity of large germplasm collections. Here, we suggest that SRAP markers should be employed for research addressing hypotheses in plant systematics, biogeography, conservation, ecology, and beyond. We provide an overview of the SRAP literature to date, review descriptive statistics of SRAP markers in a subset of 171 publications, and present relevant case studies to demonstrate the applicability of SRAP markers to the diverse field of plant biology. Results of these selected works indicate that SRAP markers have the potential to enhance the current suite of molecular tools in a diversity of fields by providing an easy-to-use, highly variable marker with inherent biological significance. PMID:25202637
Cantanhêde, Lilian Motta; Fernandes, Flavia Gonçalves; Ferreira, Gabriel Eduardo Melim; Porrozzi, Renato; Ferreira, Ricardo de Godoi Mattos; Cupolillo, Elisa
2018-01-01
Cutaneous leishmaniasis is a neglected parasitic disease that manifests in infected individuals under different phenotypes, with a range of factors contributing to its broad clinical spectrum. One factor, Leishmania RNA Virus 1 (LRV1), has been described as an endosymbiont present in different species of Leishmania. LRV1 significantly worsens the lesion, exacerbating the immune response in both experimentally infected animals and infected individuals. Little is known about the composition and genetic diversity of these viruses. Here, we investigated the relationship between the genetic composition of LRV1 detected in strains of Leishmania (Viannia) braziliensis and L. (V.) guyanensis and the interaction between the endosymbiont and the parasitic species, analyzing an approximately 850 base pair region of the viral genome. We also included one LRV1 sequence detected in L. (V.) shawi, representing the first report of LRV1 in a species other than L. braziliensis and L. guyanensis. The results illustrate the genetic diversity of the LRV1 strains analyzed here, with smaller divergences detected among viral sequences from the same parasite species. Phylogenetic analyses showed that the LRV1 sequences are grouped according to the parasite species and possibly according to the population of the parasite in which the virus was detected, corroborating the hypothesis of joint evolution of the viruses with the speciation of Leishmania parasites.
Strategies for Improving Achievement within Diversity.
ERIC Educational Resources Information Center
Thomson, Scott D.
Understanding students' learning styles is one of the first steps to providing an effective education. Four elements must be present in schools for mastery of content to occur. These four elements operate in sequence and each supports those that follow. The elements are (1) diagnosis of student traits and skills, (2) development of specific…
USDA-ARS?s Scientific Manuscript database
Enterohaemorrhagic E. coli 0157 is a zoonotic pathogen for which colonisation of cattle and virulence in humans is associated with the expression of multiple horizontally acquired genes, the majority present in active or cryptic prophages. Our understanding of the evolution and phylogeny of E. coli ...
Analysis of pig genomes provide insight into porcine demography and evolution
USDA-ARS?s Scientific Manuscript database
For nearly 8,000 years pigs and humans have shared a close and complex relationship, and through domestication and breeding, humans have shaped the genomes of current diverse pig breeds. Here we present the assembly and analysis of the genome sequence of a female domestic pig from the European Duroc...
Development and use of molecular markers: past and present.
Grover, Atul; Sharma, P C
2016-01-01
Molecular markers, due to their stability, cost-effectiveness and ease of use provide an immensely popular tool for a variety of applications including genome mapping, gene tagging, genetic diversity diversity, phylogenetic analysis and forensic investigations. In the last three decades, a number of molecular marker techniques have been developed and exploited worldwide in different systems. However, only a handful of these techniques, namely RFLPs, RAPDs, AFLPs, ISSRs, SSRs and SNPs have received global acceptance. A recent revolution in DNA sequencing techniques has taken the discovery and application of molecular markers to high-throughput and ultrahigh-throughput levels. Although, the choice of marker will obviously depend on the targeted use, microsatellites, SNPs and genotyping by sequencing (GBS) largely fulfill most of the user requirements. Further, modern transcriptomic and functional markers will lead the ventures onto high-density genetic map construction, identification of QTLs, breeding and conservation strategies in times to come in combination with other high throughput techniques. This review presents an overview of different marker technologies and their variants with a comparative account of their characteristic features and applications.
Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar
2013-06-01
In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency is much greater in Nostocales than the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
Towards a molecular taxonomy for protists: benefits, risks, and applications in plankton ecology.
Caron, David A
2013-01-01
The increasing use of genetic information for the development of methods to study the diversity, distributions, and activities of protists in nature has spawned a new generation of powerful tools. For ecologists, one lure of these approaches lies in the potential for DNA sequences to provide the only immediately obvious means of normalizing the diverse criteria that presently exist for identifying and counting protists. A single, molecular taxonomy would allow studies of diversity across a broad range of species, as well as the detection and quantification of particular species of interest within complex, natural assemblages; goals that are not feasible using traditional methods. However, these advantages are not without their potential pitfalls and problems. Conflicts involving the species concept, disagreements over the true (physiological/ecological) meaning of genetic diversity, and a perceived threat by some that sequence information will displace knowledge regarding the morphologies, functions and physiologies of protistan taxa, have created debate and doubt regarding the efficacy and appropriateness of some genetic approaches. These concerns need continued discussion and eventual resolution as we move toward the irresistible attraction, and potentially enormous benefits, of the application of genetic approaches to protistan ecology. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.
Abdelfattah, Ahmed; Wisniewski, Michael; Li Destri Nicosia, Maria Giulia; Cacciola, Santa Olga
2016-01-01
An amplicon metagenomic approach based on the ITS2 region of fungal rDNA was used to identify the composition of fungal communities associated with different strawberry organs (leaves, flowers, immature and mature fruits), grown on a farm using management practices that entailed the routine use of various chemical pesticides. ITS2 sequences clustered into 316 OTUs and Ascomycota was the dominant phyla (95.6%) followed by Basidiomycota (3.9%). Strawberry plants supported a high diversity of microbial organisms, but two genera, Botrytis and Cladosporium, were the most abundant, representing 70–99% of the relative abundance (RA) of all detected sequences. According to alpha and beta diversity analyses, strawberry organs displayed significantly different fungal communities with leaves having the most diverse fungal community, followed by flowers, and fruit. The interruption of chemical treatments for one month resulted in a significant modification in the structure of the fungal community of leaves and flowers while immature and mature fruit were not significantly affected. Several plant pathogens of other plant species, that would not be intuitively expected to be present on strawberry plants such as Erysiphe, were detected, while some common strawberry pathogens, such as Rhizoctonia, were less evident or absent. PMID:27490110
Genetic Diversity of Ascaris in China Assessed Using Simple Sequence Repeat Markers.
Zhou, Chunhua; Jian, Shaoqing; Peng, Weidong; Li, Min
2018-04-01
The giant roundworm Ascaris infects pigs and people worldwide and causes serious diseases. The taxonomic relationship between Ascaris suum and Ascaris lumbricoides is still unclear. The purpose of the present study was to investigate the genetic diversity and population genetic structure of 258 Ascaris specimens from humans and pigs from 6 sympatric regions in Ascaris -endemic regions of China using existing simple sequence repeat data. The microsatellite markers showed a high level of allelic richness and genetic diversity in the samples. Each of the populations demonstrated excess homozygosity (Ho
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution
Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian
2015-01-01
Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. PMID:26286928
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution.
Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian
2015-01-01
Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. © The Author(s) 2015. Published by Oxford University Press.
Mejia-Velasquez, Paula J; Dilcher, David L; Jaramillo, Carlos A; Fortini, Lucas B; Manchester, Steven R
2012-11-01
Reconstruction of floristic patterns during the early diversification of angiosperms is impeded by the scarce fossil record, especially in tropical latitudes. Here we collected quantitative palynological data from a stratigraphic sequence in tropical South America to provide floristic and climatic insights into such tropical environments during the Early Cretaceous. We reconstructed the floristic composition of an Aptian-Albian tropical sequence from central Colombia using quantitative palynology (rarefied species richness and abundance) and used it to infer its predominant climatic conditions. Additionally, we compared our results with available quantitative data from three other sequences encompassing 70 floristic assemblages to determine latitudinal diversity patterns. Abundance of humidity indicators was higher than that of aridity indicators (61% vs. 10%). Additionally, we found an angiosperm latitudinal diversity gradient (LDG) for the Aptian, but not for the Albian, and an inverted LDG of the overall diversity for the Albian. Angiosperm species turnover during the Albian, however, was higher in humid tropics. There were humid climates in northwestern South America during the Aptian-Albian interval contrary to the widespread aridity expected for the tropical belt. The Albian inverted overall LDG is produced by a faster increase in per-sample angiosperm and pteridophyte diversity in temperate latitudes. However, humid tropical sequences had higher rates of floristic turnover suggesting a higher degree of morphological variation than in temperate regions.
Mejia-Velasquez, Paula J.; Dilcher, David L.; Jaramillo, Carlos A.; Fortini, Lucas B.; Manchester, Steven R.
2012-01-01
Premise of the study: Reconstruction of floristic patterns during the early diversification of angiosperms is impeded by the scarce fossil record, especially in tropical latitudes. Here we collected quantitative palynological data from a stratigraphic sequence in tropical South America to provide floristic and climatic insights into such tropical environments during the Early Cretaceous. Methods: We reconstructed the floristic composition of an Aptian-Albian tropical sequence from central Colombia using quantitative palynology (rarefied species richness and abundance) and used it to infer its predominant climatic conditions. Additionally, we compared our results with available quantitative data from three other sequences encompassing 70 floristic assemblages to determine latitudinal diversity patterns. Key results: Abundance of humidity indicators was higher than that of aridity indicators (61% vs. 10%). Additionally, we found an angiosperm latitudinal diversity gradient (LDG) for the Aptian, but not for the Albian, and an inverted LDG of the overall diversity for the Albian. Angiosperm species turnover during the Albian, however, was higher in humid tropics. Conclusions: There were humid climates in northwestern South America during the Aptian-Albian interval contrary to the widespread aridity expected for the tropical belt. The Albian inverted overall LDG is produced by a faster increase in per-sample angiosperm and pteridophyte diversity in temperate latitudes. However, humid tropical sequences had higher rates of floristic turnover suggesting a higher degree of morphological variation than in temperate regions.
Large-Scale Sequencing: The Future of Genomic Sciences Colloquium
DOE Office of Scientific and Technical Information (OSTI.GOV)
Margaret Riley; Merry Buckley
2009-01-01
Genetic sequencing and the various molecular techniques it has enabled have revolutionized the field of microbiology. Examining and comparing the genetic sequences borne by microbes - including bacteria, archaea, viruses, and microbial eukaryotes - provides researchers insights into the processes microbes carry out, their pathogenic traits, and new ways to use microorganisms in medicine and manufacturing. Until recently, sequencing entire microbial genomes has been laborious and expensive, and the decision to sequence the genome of an organism was made on a case-by-case basis by individual researchers and funding agencies. Now, thanks to new technologies, the cost and effort of sequencingmore » is within reach for even the smallest facilities, and the ability to sequence the genomes of a significant fraction of microbial life may be possible. The availability of numerous microbial genomes will enable unprecedented insights into microbial evolution, function, and physiology. However, the current ad hoc approach to gathering sequence data has resulted in an unbalanced and highly biased sampling of microbial diversity. A well-coordinated, large-scale effort to target the breadth and depth of microbial diversity would result in the greatest impact. The American Academy of Microbiology convened a colloquium to discuss the scientific benefits of engaging in a large-scale, taxonomically-based sequencing project. A group of individuals with expertise in microbiology, genomics, informatics, ecology, and evolution deliberated on the issues inherent in such an effort and generated a set of specific recommendations for how best to proceed. The vast majority of microbes are presently uncultured and, thus, pose significant challenges to such a taxonomically-based approach to sampling genome diversity. However, we have yet to even scratch the surface of the genomic diversity among cultured microbes. A coordinated sequencing effort of cultured organisms is an appropriate place to begin, since not only are their genomes available, but they are also accompanied by data on environment and physiology that can be used to understand the resulting data. As single cell isolation methods improve, there should be a shift toward incorporating uncultured organisms and communities into this effort. Efforts to sequence cultivated isolates should target characterized isolates from culture collections for which biochemical data are available, as well as other cultures of lasting value from personal collections. The genomes of type strains should be among the first targets for sequencing, but creative culture methods, novel cell isolation, and sorting methods would all be helpful in obtaining organisms we have not yet been able to cultivate for sequencing. The data that should be provided for strains targeted for sequencing will depend on the phylogenetic context of the organism and the amount of information available about its nearest relatives. Annotation is an important part of transforming genome sequences into useful resources, but it represents the most significant bottleneck to the field of comparative genomics right now and must be addressed. Furthermore, there is a need for more consistency in both annotation and achieving annotation data. As new annotation tools become available over time, re-annotation of genomes should be implemented, taking advantage of advancements in annotation techniques in order to capitalize on the genome sequences and increase both the societal and scientific benefit of genomics work. Given the proper resources, the knowledge and ability exist to be able to select model systems, some simple, some less so, and dissect them so that we may understand the processes and interactions at work in them. Colloquium participants suggest a five-pronged, coordinated initiative to exhaustively describe six different microbial ecosystems, designed to describe all the gene diversity, across genomes. In this effort, sequencing should be complemented by other experimental data, particularly transcriptomics and metabolomics data, all of which should be gathered and curated continuously. Systematic genomics efforts like the ones outlined in this document would significantly broaden our view of biological diversity and have major effects on science. This has to be backed up with examples. Considering these potential impacts and the need for acquiescence from both the public and scientists to get such projects funded and functioning, education and training will be crucial. New collaborations within the scientific community will also be necessary.« less
Mbareche, Hamza; Veillette, Marc; Bonifait, Laetitia; Dubuis, Marie-Eve; Benard, Yves; Marchand, Geneviève; Bilodeau, Guillaume J; Duchaine, Caroline
2017-12-01
Composting is used all over the world to transform different types of organic matter through the actions of complex microbial communities. Moving and handling composting material may lead to the emission of high concentrations of bioaerosols. High exposure levels are associated with adverse health effects among compost industry workers. Fungal spores are suspected to play a role in many respiratory illnesses. There is a paucity of information related to the detailed fungal diversity in compost as well as in the aerosols emitted through composting activities. The aim of this study was to analyze the fungal diversity of both organic matter and aerosols present in facilities that process domestic compost and facilities that process pig carcasses. This was accomplished using a next generation sequencing approach that targets the ITS1 genomic region. Multivariate analyses revealed differences in the fungal community present in samples coming from compost treating both raw materials. Furthermore, results show that the compost type affects the fungal diversity of aerosols emitted. Although 8 classes were evenly distributed in all samples, Eurotiomycetes were more dominant in carcass compost while Sordariomycetes were dominant in domestic compost. A large diversity profile was observed in bioaerosols from both compost types showing the presence of a number of pathogenic fungi newly identified in bioaerosols emitted from composting plants. Members of the family Herpotrichiellaceae and Gymnoascaceae which have been shown to cause human diseases were detected in compost and air samples. Moreover, some fungi were identified in higher proportion in air compared to compost. This is the first study to identify a high level of fungal diversity in bioaerosols present in composting plants suggesting a potential exposure risk for workers. This study suggests the need for creating guidelines that address human exposure to bioaerosols. The implementation of technical and organizational measure should be a top priority. However, skin and respiratory protection for compost workers could be used to reduce the exposure as a second resort. Copyright © 2017 Elsevier B.V. All rights reserved.
spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.
Dellicour, Simon; Mardulyn, Patrick
2014-05-01
SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, spads was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and r functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to multiple pie charts maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. © 2013 John Wiley & Sons Ltd.
Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan
2014-04-23
Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.
Land, language, and loci: mtDNA in Native Americans and the genetic history of Peru.
Lewis, Cecil M; Tito, Raúl Y; Lizárraga, Beatriz; Stone, Anne C
2005-07-01
Despite a long history of complex societies and despite extensive present-day linguistic and ethnic diversity, relatively few populations in Peru have been sampled for population genetic investigations. In order to address questions about the relationships between South American populations and about the extent of correlation between genetic distance, language, and geography in the region, mitochondrial DNA (mtDNA) hypervariable region I sequences and mtDNA haplogroup markers were examined in 33 individuals from the state of Ancash, Peru. These sequences were compared to those from 19 American Indian populations using diversity estimates, AMOVA tests, mismatch distributions, a multidimensional scaling plot, and regressions. The results show correlations between genetics, linguistics, and geographical affinities, with stronger correlations between genetics and language. Additionally, the results suggest a pattern of differential gene flow and drift in western vs. eastern South America, supporting previous mtDNA and Y chromosome investigations. (c) 2004 Wiley-Liss, Inc
Rosas-Pérez, Tania; Rosenblueth, Mónica; Rincón-Rosales, Reiner; Mora, Jaime; Martínez-Romero, Esperanza
2014-01-01
Scale insects (Hemiptera: Coccoidae) constitute a very diverse group of sap-feeding insects with a large diversity of symbiotic associations with bacteria. Here, we present the complete genome sequence, metabolic reconstruction, and comparative genomics of the flavobacterial endosymbiont of the giant scale insect Llaveia axin axin. The gene repertoire of its 309,299 bp genome was similar to that of other flavobacterial insect endosymbionts though not syntenic. According to its genetic content, essential amino acid biosynthesis is likely to be the flavobacterial endosymbiont's principal contribution to the symbiotic association with its insect host. We also report the presence of a γ-proteobacterial symbiont that may be involved in waste nitrogen recycling and also has amino acid biosynthetic capabilities that may provide metabolic precursors to the flavobacterial endosymbiont. We propose “Candidatus Walczuchella monophlebidarum” as the name of the flavobacterial endosymbiont of insects from the Monophlebidae family. PMID:24610838
The African Genome Variation Project shapes medical genetics in Africa
Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O.; Choudhury, Ananyo; Ritchie, Graham R. S.; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N.; Young, Elizabeth H.; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P.; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A.; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S.
2014-01-01
Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterisation of African genetic diversity is needed. The African Genome Variation Project (AGVP) provides a resource to help design, implement and interpret genomic studies in sub-Saharan Africa (SSA) and worldwide. The AGVP represents dense genotypes from 1,481 and whole genome sequences (WGS) from 320 individuals across SSA. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across SSA. We identify new loci under selection, including for malaria and hypertension. We show that modern imputation panels can identify association signals at highly differentiated loci across populations in SSA. Using WGS, we show further improvement in imputation accuracy supporting efforts for large-scale sequencing of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa, showing for the first time that such designs are feasible. PMID:25470054
The rise and fall of a human recombination hot spot.
Jeffreys, Alec J; Neumann, Rita
2009-05-01
Human meiotic crossovers mainly cluster into narrow hot spots that profoundly influence patterns of haplotype diversity and that may also affect genome instability and sequence evolution. Hot spots also seem to be ephemeral, but processes of hot-spot activation and their subsequent evolutionary dynamics remain unknown. We now analyze the life cycle of a recombination hot spot. Sperm typing revealed a polymorphic hot spot that was activated in cis by a single base change, providing evidence for a primary sequence determinant necessary, though not sufficient, to activate recombination. This activating mutation occurred roughly 70,000 y ago and has persisted to the present, most likely fortuitously through genetic drift despite its systematic elimination by biased gene conversion. Nonetheless, this self-destructive conversion will eventually lead to hot-spot extinction. These findings define a subclass of highly transient hot spots and highlight the importance of understanding hot-spot turnover and how it influences haplotype diversity.
Llewellyn, Martin S.; Messenger, Louisa A.; Luquetti, Alejandro O.; Garcia, Lineth; Torrico, Faustino; Tavares, Suelene B. N.; Cheaib, Bachar; Derome, Nicolas; Delepine, Marc; Baulard, Céline; Deleuze, Jean-Francois; Sauer, Sascha; Miles, Michael A.
2015-01-01
Background Chagas disease results from infection with the diploid protozoan parasite Trypanosoma cruzi. T. cruzi is highly genetically diverse, and multiclonal infections in individual hosts are common, but little studied. In this study, we explore T. cruzi infection multiclonality in the context of age, sex and clinical profile among a cohort of chronic patients, as well as paired congenital cases from Cochabamba, Bolivia and Goias, Brazil using amplicon deep sequencing technology. Methodology/ Principal Findings A 450bp fragment of the trypomastigote TcGP63I surface protease gene was amplified and sequenced across 70 chronic and 22 congenital cases on the Illumina MiSeq platform. In addition, a second, mitochondrial target—ND5—was sequenced across the same cohort of cases. Several million reads were generated, and sequencing read depths were normalized within patient cohorts (Goias chronic, n = 43, Goias congenital n = 2, Bolivia chronic, n = 27; Bolivia congenital, n = 20), Among chronic cases, analyses of variance indicated no clear correlation between intra-host sequence diversity and age, sex or symptoms, while principal coordinate analyses showed no clustering by symptoms between patients. Between congenital pairs, we found evidence for the transmission of multiple sequence types from mother to infant, as well as widespread instances of novel genotypes in infants. Finally, non-synonymous to synonymous (dn:ds) nucleotide substitution ratios among sequences of TcGP63Ia and TcGP63Ib subfamilies within each cohort provided powerful evidence of strong diversifying selection at this locus. Conclusions/Significance Our results shed light on the diversity of parasite DTUs within each patient, as well as the extent to which parasite strains pass between mother and foetus in congenital cases. Although we were unable to find any evidence that parasite diversity accumulates with age in our study cohorts, putative diversifying selection within members of the TcGP63I gene family suggests a link between genetic diversity within this gene family and survival in the mammalian host. PMID:25849488
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
Połka, Justyna; Rebecchi, Annalisa; Pisacane, Vincenza; Morelli, Lorenzo; Puglisi, Edoardo
2015-04-01
The bacterial diversity involved in food fermentations is one of the most important factors shaping the final characteristics of traditional foods. Knowledge about this diversity can be greatly improved by the application of high-throughput sequencing technologies (HTS) coupled to the PCR amplification of the 16S rRNA subunit. Here we investigated the bacterial diversity in batches of Salame Piacentino PDO (Protected Designation of Origin), a dry fermented sausage that is typical of a regional area of Northern Italy. Salami samples from 6 different local factories were analysed at 0, 21, 49 and 63 days of ripening; raw meat at time 0 and casing samples at 21 days of ripening where also analysed, and the effect of starter addition was included in the experimental set-up. Culture-based microbiological analyses and PCR-DGGE were carried out in order to be compared with HTS results. A total of 722,196 high quality sequences were obtained after trimming, paired-reads assembly and quality screening of raw reads obtained by Illumina MiSeq sequencing of the two bacterial 16S hypervariable regions V3 and V4; manual curation of 16S database allowed a correct taxonomical classification at the species for 99.5% of these reads. Results confirmed the presence of main bacterial species involved in the fermentation of salami as assessed by PCR-DGGE, but with a greater extent of resolution and quantitative assessments that are not possible by the mere analyses of gel banding patterns. Thirty-two different Staphylococcus and 33 Lactobacillus species where identified in the salami from different producers, while the whole data set obtained accounted for 13 main families and 98 rare ones, 23 of which were present in at least 10% of the investigated samples, with casings being the major sources of the observed diversity. Multivariate analyses also showed that batches from 6 local producers tend to cluster altogether after 21 days of ripening, thus indicating that HTS has the potential for fine scale differentiation of local fermented foods. Copyright © 2014 Elsevier Ltd. All rights reserved.
Frantzen, Cyril A; Kleppen, Hans Petter; Holo, Helge
2018-02-01
Undefined mesophilic mixed (DL) starter cultures are used in the production of continental cheeses and contain unknown strain mixtures of Lactococcus lactis and leuconostocs. The choice of starter culture affects the taste, aroma, and quality of the final product. To gain insight into the diversity of Lactococcus lactis strains in starter cultures, we whole-genome sequenced 95 isolates from three different starter cultures. Pan-genomic analyses, which included 30 publically available complete genomes, grouped the strains into 21 L. lactis subsp . lactis and 28 L. lactis subsp. cremoris lineages. Only one of the 95 isolates grouped with previously sequenced strains, and the three starter cultures showed no overlap in lineage distributions. The culture diversity was assessed by targeted amplicon sequencing using purR , a core gene, and epsD , present in 93 of the 95 starter culture isolates but absent in most of the reference strains. This enabled an unprecedented discrimination of starter culture Lactococcus lactis and revealed substantial differences between the three starter cultures and compositional shifts during the cultivation of cultures in milk. IMPORTANCE In contemporary cheese production, standardized frozen seed stock starter cultures are used to ensure production stability, reproducibility, and quality control of the product. The dairy industry experiences significant disruptions of cheese production due to phage attacks, and one commonly used countermeasure to phage attack is to employ a starter rotation strategy, in which two or more starters with minimal overlap in phage sensitivity are used alternately. A culture-independent analysis of the lactococcal diversity in complex undefined starter cultures revealed large differences between the three starter cultures and temporal shifts in lactococcal composition during the production of bulk starters. A better understanding of the lactococcal diversity in starter cultures will enable the development of more robust starter cultures and assist in maintaining the efficiency and stability of the production process by ensuring the presence of key bacteria that are important to the characteristics of the product. Copyright © 2018 American Society for Microbiology.
Heitlinger, Emanuel; Ferreira, Susana C M; Thierer, Dagmar; Hofer, Heribert; East, Marion L
2017-01-01
In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena ( Crocuta crocuta ), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes.
Heitlinger, Emanuel; Ferreira, Susana C. M.; Thierer, Dagmar; Hofer, Heribert; East, Marion L.
2017-01-01
In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena (Crocuta crocuta), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes. PMID:28670573
Measuring the diversity of the human microbiota with targeted next-generation sequencing.
Finotello, Francesca; Mastrorilli, Eleonora; Di Camillo, Barbara
2016-12-26
The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Characterization of bovine MHC DRB3 diversity in Latin American Creole cattle breeds.
Giovambattista, Guillermo; Takeshima, Shin-nosuke; Ripoli, Maria Veronica; Matsumoto, Yuki; Franco, Luz Angela Alvarez; Saito, Hideki; Onuma, Misao; Aida, Yoko
2013-04-25
In cattle, bovine leukocyte antigens (BoLAs) have been extensively used as markers for diseases and immunological traits. However, none of the highly adapted Latin American Creole breeds have been characterized for BoLA gene polymorphism by high resolution typing methods. In this work, we sequenced exon 2 of the BoLA class II DRB3 gene from 179 cattle (113 Bolivian Yacumeño cattle and 66 Colombian Hartón del Valle cattle breeds) using a polymerase chain reaction sequence-based typing (PCR-SBT) method. We identified 36 previously reported alleles and three novel alleles. Thirty-five (32 reported and three new) and 24 alleles (22 reported and two new) were detected in Yacumeño and Hartón del Valle breeds, respectively. Interestingly, Latin American Creole cattle showed a high degree of gene diversity despite their small population sizes, and 10 alleles including three new alleles were found only in these two Creole breeds. We next compared the degree of genetic variability at the population and sequence levels and the genetic distance in the two breeds with those previously reported in five other breeds: Holstein, Japanese Shorthorn, Japanese Black, Jersey, and Hanwoo. Both Creole breeds presented gene diversity higher than 0.90, a nucleotide diversity higher than 0.07, and mean number of pairwise differences higher than 19, indicating that Creole cattle had similar genetic diversity at BoLA-DRB3 to the other breeds. A neutrality test showed that the high degree of genetic variability may be maintained by balancing selection. The FST index and the exact G test showed significant differences across all cattle populations (FST=0.0478; p<0.001). Results from the principal components analysis and the phylogenetic tree showed that Yacumeño and Hartón del Valle breeds were closely related to each other. Collectively, our results suggest that the high level of genetic diversity could be explained by the multiple origins of the Creole germplasm (European, African and Indicus), and this diversity might be maintained by balancing selection. Copyright © 2013 Elsevier B.V. All rights reserved.
A genomic scale map of genetic diversity in Trypanosoma cruzi
2012-01-01
Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the population, providing an essential resource for future studies on the development of new drugs and diagnostics, for Chagas Disease. These data is available through the TcSNP database (http://snps.tcruzi.org). PMID:23270511
Bacterial Community Analysis of Drinking Water Biofilms in Southern Sweden
Lührig, Katharina; Canbäck, Björn; Paul, Catherine J.; Johansson, Tomas; Persson, Kenneth M.; Rådström, Peter
2015-01-01
Next-generation sequencing of the V1–V2 and V3 variable regions of the 16S rRNA gene generated a total of 674,116 reads that described six distinct bacterial biofilm communities from both water meters and pipes. A high degree of reproducibility was demonstrated for the experimental and analytical work-flow by analyzing the communities present in parallel water meters, the rare occurrence of biological replicates within a working drinking water distribution system. The communities observed in water meters from households that did not complain about their drinking water were defined by sequences representing Proteobacteria (82–87%), with 22–40% of all sequences being classified as Sphingomonadaceae. However, a water meter biofilm community from a household with consumer reports of red water and flowing water containing elevated levels of iron and manganese had fewer sequences representing Proteobacteria (44%); only 0.6% of all sequences were classified as Sphingomonadaceae; and, in contrast to the other water meter communities, markedly more sequences represented Nitrospira and Pedomicrobium. The biofilm communities in pipes were distinct from those in water meters, and contained sequences that were identified as Mycobacterium, Nocardia, Desulfovibrio, and Sulfuricurvum. The approach employed in the present study resolved the bacterial diversity present in these biofilm communities as well as the differences that occurred in biofilms within a single distribution system, and suggests that next-generation sequencing of 16S rRNA amplicons can show changes in bacterial biofilm communities associated with different water qualities. PMID:25739379
Bacterial community analysis of drinking water biofilms in southern Sweden.
Lührig, Katharina; Canbäck, Björn; Paul, Catherine J; Johansson, Tomas; Persson, Kenneth M; Rådström, Peter
2015-01-01
Next-generation sequencing of the V1-V2 and V3 variable regions of the 16S rRNA gene generated a total of 674,116 reads that described six distinct bacterial biofilm communities from both water meters and pipes. A high degree of reproducibility was demonstrated for the experimental and analytical work-flow by analyzing the communities present in parallel water meters, the rare occurrence of biological replicates within a working drinking water distribution system. The communities observed in water meters from households that did not complain about their drinking water were defined by sequences representing Proteobacteria (82-87%), with 22-40% of all sequences being classified as Sphingomonadaceae. However, a water meter biofilm community from a household with consumer reports of red water and flowing water containing elevated levels of iron and manganese had fewer sequences representing Proteobacteria (44%); only 0.6% of all sequences were classified as Sphingomonadaceae; and, in contrast to the other water meter communities, markedly more sequences represented Nitrospira and Pedomicrobium. The biofilm communities in pipes were distinct from those in water meters, and contained sequences that were identified as Mycobacterium, Nocardia, Desulfovibrio, and Sulfuricurvum. The approach employed in the present study resolved the bacterial diversity present in these biofilm communities as well as the differences that occurred in biofilms within a single distribution system, and suggests that next-generation sequencing of 16S rRNA amplicons can show changes in bacterial biofilm communities associated with different water qualities.
Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.
Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo
2009-07-06
In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.
De Silva, Jeremy Ryan; Lau, Yee Ling; Fong, Mun Yik
2017-01-03
The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infection in South East Asia. Its merozoite surface protein-3 (MSP3) is a protein that belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated the potential of P. falciparum MSP3 as a potential vaccine candidate. However, to date no detailed studies have been carried out on P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity, and haplotypes groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia. Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes. From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups. A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.
AlZahal, Ousama; Valdes, Eduardo V; McBride, Brian W
2016-01-01
The objective of this study was to characterize the structure of the fecal bacterial community of five giraffes (Giraffa camelopardalis) at Disney's Animal Kingdom, FL. Fecal genomic DNA was extracted and variable regions 1-3 of the 16S rRNA gene was PCR-amplified and then sequenced. The MOTHUR software-program was used for sequence processing, diversity analysis, and classification. A total of 181,689 non-chimeric bacterial sequences were obtained, and average number of sequences per sample was 36,338 -± 8,818. Sequences were assigned to 8,284 operational taxonomic units (OTU) with 95% of genetic similarity, which included 2,942 singletons (36%). Number of OTUs per sample was 2,554 ± 264. Samples were normalized and alpha (intra-sample) diversity indices; Chao1, Inverse Simpson, Shannon, and coverage were estimated as 3,712 ± 430, 116 -± 70, 6.1 ± 0.4, and 96 ± 1%, respectively. Thirteen phyla were detected and Firmicutes, Bacteroidetes, and Spirochaetes were the most dominant phyla (more than 2% of total sequences), and constituted 92% of the classified sequences, 66% of total sequences, and 43% of total OTUs. Our computation predicted that three OTUs were likely to be present in at least three of the five samples at greater than 1% dominance rate. These OTUs were Treponema, an unidentified OTU belonging to the order Bacteroidales, and Ruminococcus. This report was the first to characterize the bacterial community of the distal gut in giraffes utilizing fecal samples, and it demonstrated that the distal gut of giraffes is likely a potential reservoir for a number of undocumented species of bacteria. © 2015 Wiley Periodicals, Inc.
Error correction and diversity analysis of population mixtures determined by NGS
Burroughs, Nigel J.; Evans, David J.; Ryabov, Eugene V.
2014-01-01
The impetus for this work was the need to analyse nucleotide diversity in a viral mix taken from honeybees. The paper has two findings. First, a method for correction of next generation sequencing error in the distribution of nucleotides at a site is developed. Second, a package of methods for assessment of nucleotide diversity is assembled. The error correction method is statistically based and works at the level of the nucleotide distribution rather than the level of individual nucleotides. The method relies on an error model and a sample of known viral genotypes that is used for model calibration. A compendium of existing and new diversity analysis tools is also presented, allowing hypotheses about diversity and mean diversity to be tested and associated confidence intervals to be calculated. The methods are illustrated using honeybee viral samples. Software in both Excel and Matlab and a guide are available at http://www2.warwick.ac.uk/fac/sci/systemsbiology/research/software/, the Warwick University Systems Biology Centre software download site. PMID:25405074
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.
2014-01-01
ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed. PMID:24429365
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine
2014-04-01
We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed.
The complete genome sequences of 65 Campylobacter jejuni and C. coli strains
USDA-ARS?s Scientific Manuscript database
Campylobacter jejuni (Cj) and C. coli (Cc) are genetically highly diverse based on various molecular methods including MLST, microarray-based comparisons and the whole genome sequences of a few strains. Cj and Cc diversity is also exhibited by variable capsular polysaccharides (CPS) that are the maj...
Maize HapMap2 identifies extant variation from a genome in flux
USDA-ARS?s Scientific Manuscript database
The maize genome is the largest, most diverse and complex plant genome sequenced to date. Using high-throughput sequencing to access genetic variation and a population genetics model to score the polymorphisms, we characterize and unite the diversity of the world’s key breeding germplasm, wild rela...
USDA-ARS?s Scientific Manuscript database
Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...
2011-01-01
Background Indoor microbial contamination due to excess moisture is an important contributor to human illness in both residential and occupational settings. However, the census of microorganisms in the indoor environment is limited by the use of selective, culture-based detection techniques. By using clone library sequencing of full-length internal transcribed spacer region combined with quantitative polymerase chain reaction (qPCR) for 69 fungal species or assay groups and cultivation, we have been able to generate a more comprehensive description of the total indoor mycoflora. Using this suite of methods, we assessed the impact of moisture damage on the fungal community composition of settled dust and building material samples (n = 8 and 16, correspondingly). Water-damaged buildings (n = 2) were examined pre- and post- remediation, and compared with undamaged reference buildings (n = 2). Results Culture-dependent and independent methods were consistent in the dominant fungal taxa in dust, but sequencing revealed a five to ten times higher diversity at the genus level than culture or qPCR. Previously unknown, verified fungal phylotypes were detected in dust, accounting for 12% of all diversity. Fungal diversity, especially within classes Dothideomycetes and Agaricomycetes tended to be higher in the water damaged buildings. Fungal phylotypes detected in building materials were present in dust samples, but their proportion of total fungi was similar for damaged and reference buildings. The quantitative correlation between clone library phylotype frequencies and qPCR counts was moderate (r = 0.59, p < 0.01). Conclusions We examined a small number of target buildings and found indications of elevated fungal diversity associated with water damage. Some of the fungi in dust were attributable to building growth, but more information on the material-associated communities is needed in order to understand the dynamics of microbial communities between building structures and dust. The sequencing-based method proved indispensable for describing the true fungal diversity in indoor environments. However, making conclusions concerning the effect of building conditions on building mycobiota using this methodology was complicated by the wide natural diversity in the dust samples, the incomplete knowledge of material-associated fungi fungi and the semiquantitative nature of sequencing based methods. PMID:22017920
2011-01-01
Background Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galápagos Rift. Results Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. PMID:21489281
Coykendall, D.K.; Johnson, S.B.; Karl, S.A.; Lutz, R.A.; Vrijenhoek, R.C.
2011-01-01
Background: Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galpagos Rift. Results: Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions: Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. ?? 2011 Coykendall et al; licensee BioMed Central Ltd.
MHC class I diversity in chimpanzees and bonobos.
Maibach, Vincent; Hans, Jörg B; Hvilsom, Christina; Marques-Bonet, Tomas; Vigilant, Linda
2017-10-01
Major histocompatibility complex (MHC) class I genes are critically involved in the defense against intracellular pathogens. MHC diversity comparisons among samples of closely related taxa may reveal traces of past or ongoing selective processes. The bonobo and chimpanzee are the closest living evolutionary relatives of humans and last shared a common ancestor some 1 mya. However, little is known concerning MHC class I diversity in bonobos or in central chimpanzees, the most numerous and genetically diverse chimpanzee subspecies. Here, we used a long-read sequencing technology (PacBio) to sequence the classical MHC class I genes A, B, C, and A-like in 20 and 30 wild-born bonobos and chimpanzees, respectively, with a main focus on central chimpanzees to assess and compare diversity in those two species. We describe in total 21 and 42 novel coding region sequences for the two species, respectively. In addition, we found evidence for a reduced MHC class I diversity in bonobos as compared to central chimpanzees as well as to western chimpanzees and humans. The reduced bonobo MHC class I diversity may be the result of a selective process in their evolutionary past since their split from chimpanzees.
Ashfaq, Muhammad; Hebert, Paul D N; Mirza, M Sajjad; Khan, Arif M; Mansoor, Shahid; Shah, Ghulam S; Zafar, Yusuf
2014-01-01
Although whiteflies (Bemisia tabaci complex) are an important pest of cotton in Pakistan, its taxonomic diversity is poorly understood. As DNA barcoding is an effective tool for resolving species complexes and analyzing species distributions, we used this approach to analyze genetic diversity in the B. tabaci complex and map the distribution of B. tabaci lineages in cotton growing areas of Pakistan. Sequence diversity in the DNA barcode region (mtCOI-5') was examined in 593 whiteflies from Pakistan to determine the number of whitefly species and their distributions in the cotton-growing areas of Punjab and Sindh provinces. These new records were integrated with another 173 barcode sequences for B. tabaci, most from India, to better understand regional whitefly diversity. The Barcode Index Number (BIN) System assigned the 766 sequences to 15 BINs, including nine from Pakistan. Representative specimens of each Pakistan BIN were analyzed for mtCOI-3' to allow their assignment to one of the putative species in the B. tabaci complex recognized on the basis of sequence variation in this gene region. This analysis revealed the presence of Asia II 1, Middle East-Asia Minor 1, Asia 1, Asia II 5, Asia II 7, and a new lineage "Pakistan". The first two taxa were found in both Punjab and Sindh, but Asia 1 was only detected in Sindh, while Asia II 5, Asia II 7 and "Pakistan" were only present in Punjab. The haplotype networks showed that most haplotypes of Asia II 1, a species implicated in transmission of the cotton leaf curl virus, occurred in both India and Pakistan. DNA barcodes successfully discriminated cryptic species in B. tabaci complex. The dominant haplotypes in the B. tabaci complex were shared by India and Pakistan. Asia II 1 was previously restricted to Punjab, but is now the dominant lineage in southern Sindh; its southward spread may have serious implications for cotton plantations in this region.
Molecular Analysis of Methanogen Richness in Landfill and Marshland Targeting 16S rDNA Sequences
Yadav, Shailendra; Kundu, Sharbadeb; Ghosh, Sankar K.; Maitra, S. S.
2015-01-01
Methanogens, a key contributor in global carbon cycling, methane emission, and alternative energy production, generate methane gas via anaerobic digestion of organic matter. The methane emission potential depends upon methanogenic diversity and activity. Since they are anaerobes and difficult to isolate and culture, their diversity present in the landfill sites of Delhi and marshlands of Southern Assam, India, was analyzed using molecular techniques like 16S rDNA sequencing, DGGE, and qPCR. The sequencing results indicated the presence of methanogens belonging to the seventh order and also the order Methanomicrobiales in the Ghazipur and Bhalsawa landfill sites of Delhi. Sequences, related to the phyla Crenarchaeota (thermophilic) and Thaumarchaeota (mesophilic), were detected from marshland sites of Southern Assam, India. Jaccard analysis of DGGE gel using Gel2K showed three main clusters depending on the number and similarity of band patterns. The copy number analysis of hydrogenotrophic methanogens using qPCR indicates higher abundance in landfill sites of Delhi as compared to the marshlands of Southern Assam. The knowledge about “methanogenic archaea composition” and “abundance” in the contrasting ecosystems like “landfill” and “marshland” may reorient our understanding of the Archaea inhabitants. This study could shed light on the relationship between methane-dynamics and the global warming process. PMID:26568700
Molecular Cloning of Drebrin: Progress and Perspectives.
Kojima, Nobuhiko
2017-01-01
Chicken drebrin isoforms were first identified in the optic tectum of developing brain. Although the time course of protein expression was different in each drebrin isoform, the similarity between their protein structures was suggested by biochemical analysis of purified protein. To determine their protein structures, the cloning of drebrin cDNAs was conducted. Comparison between the cDNA sequences shows that all drebrin cDNAs are identical except that the internal insertion sequences are present or absent in their sequences. Chicken drebrin are now classified into three isoforms, namely, drebrins E1, E2, and A. Genomic cloning demonstrated that the three isoforms are generated by an alternative splicing of individual exons encoding the insertion sequences from single drebrin gene. The mechanism should be precisely regulated in cell-type-specific and developmental stage-specific fashion. Drebrin protein, which is well conserved in various vertebrate species, although mammalian drebrin has only two isoforms, namely, drebrin E and drebrin A, is different from chicken drebrin that has three isoforms. Drebrin belongs to an actin-depolymerizing factor homology (ADF-H) domain protein family. Besides the ADF-H domain, drebrin has other domains, including the actin-binding domain and Homer-binding motifs. Diversity of protein isoform and multiple domains of drebrin could interact differentially with the actin cytoskeleton and other intracellular proteins and regulate diverse cellular processes.
Naveed, Muhammad; Mubeen, Samavia; Khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad
2014-01-01
In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization.
Phylogenetic diversity and position of the genus Campylobacter
NASA Technical Reports Server (NTRS)
Lau, P. P.; DeBrunner-Vossbrinck, B.; Dunn, B.; Miotto, K.; MacDonnell, M. T.; Rollins, D. M.; Pillidge, C. J.; Hespell, R. B.; Colwell, R. R.; Sogin, M. L.;
1987-01-01
RNA sequence analysis has been used to examine the phylogenetic position and structure of the genus Campylobacter. A complete 5S rRNA sequence was determined for two strains of Campylobacter jejuni and extensive partial sequences of the 16S rRNA were obtained for several strains of C. jejuni and Wolinella succinogenes. In addition limited partial sequence data were obtained from the 16S rRNAs of isolates of C. coli, C. laridis, C. fetus, C. fecalis, and C. pyloridis. It was found that W. succinogenes is specifically related to, but not included, in the genus Campylobacter as presently constituted. Within the genus significant diversity was noted. C. jejuni, C. coli and C. laridis are very closely related but the other species are distinctly different from one another. C. pyloridis is without question the most divergent of the Campylobacter isolates examined here and is sufficiently distinct to warrant inclusion in a separate genus. In terms of overall position in bacterial phylogeny, the Campylobacter/Wolinella cluster represents a deep branching most probably located within an expanded version of the Division containing the purple photosynthetic bacteria and their relatives. The Campylobacter/Wolinella cluster is not specifically includable in either the alpha, beta or gamma subdivisions of the purple bacteria.
Naveed, Muhammad; Mubeen, Samavia; khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad
2014-01-01
In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization. PMID:25477935
Conservation and variability of West Nile virus proteins.
Koo, Qi Ying; Khan, Asif M; Jung, Keun-Ok; Ramdas, Shweta; Miotto, Olivo; Tan, Tin Wee; Brusic, Vladimir; Salmon, Jerome; August, J Thomas
2009-01-01
West Nile virus (WNV) has emerged globally as an increasingly important pathogen for humans and domestic animals. Studies of the evolutionary diversity of the virus over its known history will help to elucidate conserved sites, and characterize their correspondence to other pathogens and their relevance to the immune system. We describe a large-scale analysis of the entire WNV proteome, aimed at identifying and characterizing evolutionarily conserved amino acid sequences. This study, which used 2,746 WNV protein sequences collected from the NCBI GenPept database, focused on analysis of peptides of length 9 amino acids or more, which are immunologically relevant as potential T-cell epitopes. Entropy-based analysis of the diversity of WNV sequences, revealed the presence of numerous evolutionarily stable nonamer positions across the proteome (entropy value of < or = 1). The representation (frequency) of nonamers variant to the predominant peptide at these stable positions was, generally, low (< or = 10% of the WNV sequences analyzed). Eighty-eight fragments of length 9-29 amino acids, representing approximately 34% of the WNV polyprotein length, were identified to be identical and evolutionarily stable in all analyzed WNV sequences. Of the 88 completely conserved sequences, 67 are also present in other flaviviruses, and several have been associated with the functional and structural properties of viral proteins. Immunoinformatic analysis revealed that the majority (78/88) of conserved sequences are potentially immunogenic, while 44 contained experimentally confirmed human T-cell epitopes. This study identified a comprehensive catalogue of completely conserved WNV sequences, many of which are shared by other flaviviruses, and majority are potential epitopes. The complete conservation of these immunologically relevant sequences through the entire recorded WNV history suggests they will be valuable as components of peptide-specific vaccines or other therapeutic applications, for sequence-specific diagnosis of a wide-range of Flavivirus infections, and for studies of homologous sequences among other flaviviruses.
Construction of a scFv Library with Synthetic, Non-combinatorial CDR Diversity.
Bai, Xuelian; Shim, Hyunbo
2017-01-01
Many large synthetic antibody libraries have been designed, constructed, and successfully generated high-quality antibodies suitable for various demanding applications. While synthetic antibody libraries have many advantages such as optimized framework sequences and a broader sequence landscape than natural antibodies, their sequence diversities typically are generated by random combinatorial synthetic processes which cause the incorporation of many undesired CDR sequences. Here, we describe the construction of a synthetic scFv library using oligonucleotide mixtures that contain predefined, non-combinatorially synthesized CDR sequences. Each CDR is first inserted to a master scFv framework sequence and the resulting single-CDR libraries are subjected to a round of proofread panning. The proofread CDR sequences are assembled to produce the final scFv library with six diversified CDRs.
Genome survey sequencing of red swamp crayfish Procambarus clarkii.
Shi, Linlin; Yi, Shaokui; Li, Yanhe
2018-06-21
Red swamp crayfish, Procambarus clarkii, presently is an important aquatic commercial species in China. The crayfish is a hot area of research focus, and its genetic improvement is quite urgent for the crayfish aquaculture in China. However, the knowledge of its genomic landscape is limited. In this study, a survey of P. clarkii genome was investigated based on Illumina's Solexa sequencing platform. Meanwhile, its genome size was estimated using flow cytometry. Interestingly, the genome size estimated is about 8.50 Gb by flow cytometry and 1.86 Gb with genome survey sequencing. Based on the assembled genome sequences, total of 136,962 genes and 152,268 exons were predicted, and the predicted genes ranged from 150 to 12,807 bp in length. The survey sequences could help accelerate the progress of gene discovery involved in genetic diversity and evolutionary analysis, even though it could not successfully applied for estimation of P. clarkii genome size.
Protein Science by DNA Sequencing: How Advances in Molecular Biology Are Accelerating Biochemistry.
Higgins, Sean A; Savage, David F
2018-01-09
A fundamental goal of protein biochemistry is to determine the sequence-function relationship, but the vastness of sequence space makes comprehensive evaluation of this landscape difficult. However, advances in DNA synthesis and sequencing now allow researchers to assess the functional impact of every single mutation in many proteins, but challenges remain in library construction and the development of general assays applicable to a diverse range of protein functions. This Perspective briefly outlines the technical innovations in DNA manipulation that allow massively parallel protein biochemistry and then summarizes the methods currently available for library construction and the functional assays of protein variants. Areas in need of future innovation are highlighted with a particular focus on assay development and the use of computational analysis with machine learning to effectively traverse the sequence-function landscape. Finally, applications in the fundamentals of protein biochemistry, disease prediction, and protein engineering are presented.
Sobti, Ranbir Chander; Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh
2009-11-01
The present study was aimed to get the nucleotide sequences of a part of COII mitochondrial gene amplified from individuals of five species of Termites (Isoptera: Termitidae: Macrotermitinae). Four of them belonged to the genus Odontotermes (O. obesus, O. horni, O. bhagwatii and Odontotermes sp.) and one to Microtermes (M. obesi). Partial COII gene fragments were amplified by using specific primers. The sequences so obtained were characterized to calculate the frequencies of each nucleotide bases and a high A + T content was observed. The interspecific pairwise sequence divergence in Odontotermes species ranged from 6.5% to 17.1% across COII fragment. M. obesi sequence diversity ranged from 2.5 with Odontotermes sp. to 19.0% with O. bhagwatii. Phylogenetic trees drawn on the basis of distance neighbour-joining method revealed three main clades clustering all the individuals according to their genera and families.
Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S
1994-01-01
To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra
2015-01-01
Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743
Mottawea, Walid; Duceppe, Marc-Olivier; Dupras, Andrée A; Usongo, Valentine; Jeukens, Julie; Freschi, Luca; Emond-Rheault, Jean-Guillaume; Hamel, Jeremie; Kukavica-Ibrulj, Irena; Boyle, Brian; Gill, Alexander; Burnett, Elton; Franz, Eelco; Arya, Gitanjali; Weadge, Joel T; Gruenheid, Samantha; Wiedmann, Martin; Huang, Hongsheng; Daigle, France; Moineau, Sylvain; Bekal, Sadjia; Levesque, Roger C; Goodridge, Lawrence D; Ogunremi, Dele
2018-01-01
Non-typhoidal Salmonella is a leading cause of foodborne illness worldwide. Prompt and accurate identification of the sources of Salmonella responsible for disease outbreaks is crucial to minimize infections and eliminate ongoing sources of contamination. Current subtyping tools including single nucleotide polymorphism (SNP) typing may be inadequate, in some instances, to provide the required discrimination among epidemiologically unrelated Salmonella strains. Prophage genes represent the majority of the accessory genes in bacteria genomes and have potential to be used as high discrimination markers in Salmonella . In this study, the prophage sequence diversity in different Salmonella serovars and genetically related strains was investigated. Using whole genome sequences of 1,760 isolates of S. enterica representing 151 Salmonella serovars and 66 closely related bacteria, prophage sequences were identified from assembled contigs using PHASTER. We detected 154 different prophages in S. enterica genomes. Prophage sequences were highly variable among S. enterica serovars with a median ± interquartile range (IQR) of 5 ± 3 prophage regions per genome. While some prophage sequences were highly conserved among the strains of specific serovars, few regions were lineage specific. Therefore, strains belonging to each serovar could be clustered separately based on their prophage content. Analysis of S . Enteritidis isolates from seven outbreaks generated distinct prophage profiles for each outbreak. Taken altogether, the diversity of the prophage sequences correlates with genome diversity. Prophage repertoires provide an additional marker for differentiating S. enterica subtypes during foodborne outbreaks.
Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field.
Crépeau, Valentin; Cambon Bonavita, Marie-Anne; Lesongeur, Françoise; Randrianalivelo, Henintsoa; Sarradin, Pierre-Marie; Sarrazin, Jozée; Godfroy, Anne
2011-06-01
Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field (Mid-Atlantic Ridge) were investigated using molecular approaches. DNA and RNA were extracted from mat samples overlaying hydrothermal deposits and Bathymodiolus azoricus mussel assemblages. We constructed and analyzed libraries of 16S rRNA gene sequences and sequences of functional genes involved in autotrophic carbon fixation [forms I and II RuBisCO (cbbL/M), ATP-citrate lyase B (aclB)]; methane oxidation [particulate methane monooxygenase (pmoA)] and sulfur oxidation [adenosine-5'-phosphosulfate reductase (aprA) and soxB]. To gain new insights into the relationships between mats and mussels, we also used new domain-specific 16S rRNA gene primers targeting Bathymodiolus sp. symbionts. All identified archaeal sequences were affiliated with a single group: the marine group 1 Thaumarchaeota. In contrast, analyses of bacterial sequences revealed much higher diversity, although two phyla Proteobacteria and Bacteroidetes were largely dominant. The 16S rRNA gene sequence library revealed that species affiliated to Beggiatoa Gammaproteobacteria were the dominant active population. Analyses of DNA and RNA functional gene libraries revealed a diverse and active chemolithoautotrophic population. Most of these sequences were affiliated with Gammaproteobacteria, including hydrothermal fauna symbionts, Thiotrichales and Methylococcales. PCR and reverse transcription-PCR using 16S rRNA gene primers targeted to Bathymodiolus sp. symbionts revealed sequences affiliated with both methanotrophic and thiotrophic endosymbionts. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Genetic diversity and gene differentiation among ten species of Zingiberaceae from Eastern India.
Mohanty, Sujata; Panda, Manoj Kumar; Acharya, Laxmikanta; Nayak, Sanghamitra
2014-08-01
In the present study, genetic fingerprints of ten species of Zingiberaceae from eastern India were developed using PCR-based markers. 19 RAPD (Rapid Amplified polymorphic DNA), 8 ISSR (Inter Simple Sequence Repeats) and 8 SSR (Simple Sequence Repeats) primers were used to elucidate genetic diversity important for utilization, management and conservation. These primers produced 789 loci, out of which 773 loci were polymorphic (including 220 unique loci) and 16 monomorphic loci. Highest number of bands amplified (263) in Curcuma caesia whereas lowest (209) in Zingiber cassumunar. Though all the markers discriminated the species effectively, analysis of combined data of all markers resulted in better distinction of individual species. Highest number of loci was amplified with SSR primers with resolving power in a range of 17.4-39. Dendrogram based on three molecular data using unweighted pair group method with arithmetic mean classified all the species into two clusters. Mantle matrix correspondence test revealed high matrix correlation in all the cases. Correlation values for RAPD, ISSR and SSR were 0.797, 0.84 and 0.8, respectively, with combined data. In both the genera wild and cultivated species were completely separated from each other at genomic level. It also revealed distinct genetic identity between species of Curcuma and Zingiber. High genetic diversity documented in the present study provides a baseline data for optimization of conservation and breeding programme of the studied zingiberacious species.
Genus-Specific Primers for Study of Fusarium Communities in Field Samples
Edel-Hermann, Véronique; Gautheron, Nadine; Durling, Mikael Brandström; Kolseth, Anna-Karin; Steinberg, Christian; Persson, Paula; Friberg, Hanna
2015-01-01
Fusarium is a large and diverse genus of fungi of great agricultural and economic importance, containing many plant pathogens and mycotoxin producers. To date, high-throughput sequencing of Fusarium communities has been limited by the lack of genus-specific primers targeting regions with high discriminatory power at the species level. In the present study, we evaluated two Fusarium-specific primer pairs targeting translation elongation factor 1 (TEF1). We also present the new primer pair Fa+7/Ra+6. Mock Fusarium communities reflecting phylogenetic diversity were used to evaluate the accuracy of the primers in reflecting the relative abundance of the species. TEF1 amplicons were subjected to 454 high-throughput sequencing to characterize Fusarium communities. Field samples from soil and wheat kernels were included to test the method on more-complex material. For kernel samples, a single PCR was sufficient, while for soil samples, nested PCR was necessary. The newly developed primer pairs Fa+7/Ra+6 and Fa/Ra accurately reflected Fusarium species composition in mock DNA communities. In field samples, 47 Fusarium operational taxonomic units were identified, with the highest Fusarium diversity in soil. The Fusarium community in soil was dominated by members of the Fusarium incarnatum-Fusarium equiseti species complex, contradicting findings in previous studies. The method was successfully applied to analyze Fusarium communities in soil and plant material and can facilitate further studies of Fusarium ecology. PMID:26519387
Mehetre, Gajanan T.; Paranjpe, Aditi; Dastager, Syed G.
2016-01-01
Microbial diversity in geothermal waters of the Unkeshwar hot springs in Maharashtra, India, was studied using 16S rRNA amplicon metagenomic sequencing. Taxonomic analysis revealed the presence of Bacteroidetes, Proteobacteria, Cyanobacteria, Actinobacteria, Archeae, and OD1 phyla. Metabolic function prediction analysis indicated a battery of biological information systems indicating rich and novel microbial diversity, with potential biotechnological applications in this niche. PMID:26950332
Compound haplotypes at Xp11.23 and human population growth in Eurasia.
Alonso, S; Armour, J A L
2004-09-01
To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.
Genomic Diversity and Evolution of the Lyssaviruses
Delmas, Olivier; Holmes, Edward C.; Talbi, Chiraz; Larrous, Florence; Dacheux, Laurent; Bouchier, Christiane; Bourhy, Hervé
2008-01-01
Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses. PMID:18446239
Genetic diversity of Babesia bovis in virulent and attenuated strains.
Mazuz, M L; Molad, T; Fish, L; Leibovitz, B; Wolkomirsky, R; Fleiderovitz, L; Shkap, V
2012-03-01
The aim of this study was to compare the genetic diversity of the single copy Bv80 gene sequences of Babesia bovis in populations of attenuated and virulent parasites. PCR/ RT-PCR followed by cloning and sequence analyses of 4 attenuated and 4 virulent strains were performed. Multiple fragments in the range of 420 to 744 bp were amplified by PCR or RT-PCR. Cloning of the PCR fragments and sequence analyses revealed the presence of mixed subpopulations in either virulent or attenuated parasites with a total of 19 variants with 12 different sequences that differed in number and type of tandem repeats. High levels of intra- and inter-strain diversity of the Bv80 gene, with the presence of mixed populations of parasites were found in both the virulent field isolates and the attenuated vaccine strains. In addition, during the attenuation process, sequence analyses showed changes in the pattern of the parasite subpopulations. Despite high polymorphism found by sequence analyses, the patterns observed and the number of repeats, order, or motifs found could not discriminate between virulent field isolates and attenuated vaccine strains of the parasite.
Novel chytrid lineages dominate fungal sequences in diverse marine and freshwater habitats
NASA Astrophysics Data System (ADS)
Comeau, André M.; Vincent, Warwick F.; Bernier, Louis; Lovejoy, Connie
2016-07-01
In aquatic environments, fungal communities remain little studied despite their taxonomic and functional diversity. To extend the ecological coverage of this group, we conducted an in-depth analysis of fungal sequences within our collection of 3.6 million V4 18S rRNA pyrosequences originating from 319 individual marine (including sea-ice) and freshwater samples from libraries generated within diverse projects studying Arctic and temperate biomes in the past decade. Among the ~1.7 million post-filtered reads of highest taxonomic and phylogenetic quality, 23,263 fungal sequences were identified. The overall mean proportion was 1.35%, but with large variability; for example, from 0.01 to 59% of total sequences for Arctic seawater samples. Almost all sample types were dominated by Chytridiomycota-like sequences, followed by moderate-to-minor contributions of Ascomycota, Cryptomycota and Basidiomycota. Species and/or strain richness was high, with many novel sequences and high niche separation. The affinity of the most common reads to phytoplankton parasites suggests that aquatic fungi deserve renewed attention for their role in algal succession and carbon cycling.
Cao, Guojie; Allard, Marc; Hoffmann, Maria; Muruvanda, Tim; Luo, Yan; Payne, Justin; Meng, Kevin; Zhao, Shaohua; McDermott, Patrick; Brown, Eric; Meng, Jianghong
2018-06-01
Multidrug-resistant (MDR) plasmids play an important role in disseminating antimicrobial resistance genes. To elucidate the antimicrobial resistance gene compositions in A/C incompatibility complex (IncA/C) plasmids carried by animal-derived MDR Salmonella Newport, and to investigate the spread mechanism of IncA/C plasmids, this study characterizes the complete nucleotide sequences of IncA/C plasmids by comparative analysis. Complete nucleotide sequencing of plasmids and chromosomes of six MDR Salmonella Newport strains was performed using PacBio RSII. Open reading frames were assigned using prokaryotic genome annotation pipeline (PGAP). To understand genomic diversity and evolutionary relationships among Salmonella Newport IncA/C plasmids, we included three complete IncA/C plasmid sequences with similar backbones from Salmonella Newport and Escherichia coli: pSN254, pAM04528, and peH4H, and additional 200 draft chromosomes. With the exception of canine isolate CVM22462, which contained an additional IncI1 plasmid, each of the six MDR Salmonella Newport strains contained only the IncA/C plasmid. These IncA/C plasmids (including references) ranged in size from 80.1 (pCVM21538) to 176.5 kb (pSN254) and carried various resistance genes. Resistance genes floR, tetA, tetR, strA, strB, sul, and mer were identified in all IncA/C plasmids. Additionally, bla CMY-2 and sugE were present in all IncA/C plasmids, excepting pCVM21538. Plasmid pCVM22462 was capable of being transferred by conjugation. The IncI1 plasmid pCVM22462b in CVM22462 carried bla CMY-2 and sugE. Our data showed that MDR Salmonella Newport strains carrying similar IncA/C plasmids clustered together in the phylogenetic tree using chromosome sequences and the IncA/C plasmids from animal-derived Salmonella Newport contained diverse resistance genes. In the current study, we analyzed genomic diversities and phylogenetic relationships among MDR Salmonella Newport using complete plasmids and chromosome sequences and provided possible spread mechanism of IncA/C plasmids in Salmonella Newport Lineage II.
Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan
2011-07-01
A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
Investigating Holocene human population history in North Asia using ancient mitogenomes.
Kılınç, Gülşah Merve; Kashuba, Natalija; Yaka, Reyhan; Sümer, Arev Pelin; Yüncü, Eren; Shergin, Dmitrij; Ivanov, Grigorij Leonidovich; Kichigin, Dmitrii; Pestereva, Kjunnej; Volkov, Denis; Mandryka, Pavel; Kharinskii, Artur; Tishkin, Alexey; Ineshin, Evgenij; Kovychev, Evgeniy; Stepanov, Aleksandr; Alekseev, Aanatolij; Fedoseeva, Svetlana Aleksandrovna; Somel, Mehmet; Jakobsson, Mattias; Krzewińska, Maja; Storå, Jan; Götherström, Anders
2018-06-12
Archaeogenomic studies have largely elucidated human population history in West Eurasia during the Stone Age. However, despite being a broad geographical region of significant cultural and linguistic diversity, little is known about the population history in North Asia. We present complete mitochondrial genome sequences together with stable isotope data for 41 serially sampled ancient individuals from North Asia, dated between c.13,790 BP and c.1,380 BP extending from the Palaeolithic to the Iron Age. Analyses of mitochondrial DNA sequences and haplogroup data of these individuals revealed the highest genetic affinity to present-day North Asian populations of the same geographical region suggesting a possible long-term maternal genetic continuity in the region. We observed a decrease in genetic diversity over time and a reduction of maternal effective population size (N e ) approximately seven thousand years before present. Coalescent simulations were consistent with genetic continuity between present day individuals and individuals dating to 7,000 BP, 4,800 BP or 3,000 BP. Meanwhile, genetic differences observed between 7,000 BP and 3,000 BP as well as between 4,800 BP and 3,000 BP were inconsistent with genetic drift alone, suggesting gene flow into the region from distant gene pools or structure within the population. These results indicate that despite some level of continuity between ancient groups and present-day populations, the region exhibits a complex demographic history during the Holocene.
d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius
2015-01-01
The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872
De Cremer, Koen; Piérard, Denis; Hendrickx, Marijke
2016-01-01
Recently, the Fusarium genus has been narrowed based upon phylogenetic analyses and a Fusarium-like clade was adopted. The few species of the Fusarium-like clade were moved to new, re-installed or existing genera or provisionally retained as "Fusarium." Only a limited number of reference strains and DNA marker sequences are available for this clade and not much is known about its actual species diversity. Here, we report six strains, preserved by the Belgian fungal culture collection BCCM/IHEM as a Fusarium species, that belong to the Fusarium-like clade. They showed a slow growth and produced pionnotes, typical morphological characteristics of many Fusarium-like species. Multilocus sequencing with comparative sequence analyses in GenBank and phylogenetic analyses, using reference sequences of type material, confirmed that they were indeed member of the Fusarium-like clade. One strain was identified as "Fusarium" ciliatum whereas another strain was identified as Fusicolla merismoides. The four remaining strains were shown to represent a unique phylogenetic lineage in the Fusarium-like clade and were also found morphologically distinct from other members of the Fusarium-like clade. Based upon phylogenetic considerations, a new genus, Pseudofusicolla gen. nov., and a new species, Pseudofusicolla belgica sp. nov., were installed for this lineage. A formal description is provided in this study. Additional sampling will be required to gather isolates other than the historical strains presented in the present study as well as to further reveal the actual species diversity in the Fusarium-like clade. PMID:27790062
Distribution, functional impact, and origin mechanisms of copy number variation in the barley genome
2013-01-01
Background There is growing evidence for the prevalence of copy number variation (CNV) and its role in phenotypic variation in many eukaryotic species. Here we use array comparative genomic hybridization to explore the extent of this type of structural variation in domesticated barley cultivars and wild barleys. Results A collection of 14 barley genotypes including eight cultivars and six wild barleys were used for comparative genomic hybridization. CNV affects 14.9% of all the sequences that were assessed. Higher levels of CNV diversity are present in the wild accessions relative to cultivated barley. CNVs are enriched near the ends of all chromosomes except 4H, which exhibits the lowest frequency of CNVs. CNV affects 9.5% of the coding sequences represented on the array and the genes affected by CNV are enriched for sequences annotated as disease-resistance proteins and protein kinases. Sequence-based comparisons of CNV between cultivars Barke and Morex provided evidence that DNA repair mechanisms of double-strand breaks via single-stranded annealing and synthesis-dependent strand annealing play an important role in the origin of CNV in barley. Conclusions We present the first catalog of CNVs in a diploid Triticeae species, which opens the door for future genome diversity research in a tribe that comprises the economically important cereal species wheat, barley, and rye. Our findings constitute a valuable resource for the identification of CNV affecting genes of agronomic importance. We also identify potential mechanisms that can generate variation in copy number in plant genomes. PMID:23758725
Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay
2016-01-01
There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both with-in and across the countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggest the dispersion of B.pseudomallei across the south western coastal part of India with limited geographical clustering. However, majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.
Santana, Priscila Bessa; Junior, Rubens Ghilardi; Alves, Claudio Nahum; Silva, Jeronimo Lameira; McCulloch, John Anthony; Schneider, Maria Paula Cruz; da Costa da Silva, Artur
2012-01-01
Methanogenic archaeans are organisms of considerable ecological and biotechnological interest that produce methane through a restricted metabolic pathway, which culminates in the reaction catalyzed by the Methyl-coenzyme M reductase (Mcr) enzyme, and results in the release of methane. Using a metagenomic approach, the gene of the α subunit of mcr (mcrα) was isolated from sediment sample from an anoxic zone, rich in decomposing organic material, obtained from the Tucuruí hydroelectric dam reservoir in eastern Brazilian Amazonia. The partial nucleotide sequences obtained were 83 to 95% similar to those available in databases, indicating a low diversity of archaeans in the reservoir. Two orders were identified - the Methanomicrobiales, and a unique Operational Taxonomic Unit (OTU) forming a clade with the Methanosarcinales according to low bootstrap values. Homology modeling was used to determine the three-dimensional (3D) structures, for this the partial nucleotide sequence of the mcrα were isolated and translated on their partial amino acid sequences. The 3D structures of the archaean Mcrα observed in the present study varied little, and presented approximately 70% identity in comparison with the Mcrα of Methanopyrus klanderi. The results demonstrated that the community of methanogenic archaeans of the anoxic C1 region of the Tucurui reservoir is relatively homogeneous. PMID:22481885
Hemosporidian parasites in forest birds from Venezuela: genetic lineage analyses.
Mijares, Alfredo; Rosales, Romel; Silva-Iturriza, Adriana
2012-09-01
Avian hemosporidian parasites of the genera Haemoproteus, Plasmodium, and Leucocytozoon are transmitted by different dipteran vectors. In the present work, we looked for the presence of these parasites in 47 birds from 12 families, which were sampled in the migratory corridor Paso de Portachuelo, located at the Henri Pittier National Park, Venezuela. The presence of the parasites was evidenced by amplification of a region of 471 bp of their cytochrome b gene. This region of the marker presents enough polymorphism to identify most of the mitochondrial lineages. Therefore, the obtained amplicons were sequenced, not only to identify the genus of the parasites sampled, but also to analyze their genetic diversity in the study area. The overall parasite prevalence was low (11%). We reported, for the first time, Plasmodium in birds of the species Formicarius analis and Chamaeza campanisona (Formicariidae) and Haemoproteus in Geotrygon linearis (Columbidae). A phylogenetic tree was generated using the Haemoproteus, Plasmodium, and Leucocytozoon sequences obtained in this study, together with representative sequences from previous studies. The highest genetic diversities between the two Haemoproteus lineages (11.70%) and among the three Plasmodium lineages (7.86%) found in this study are also similar to those found when lineages reported in the literature were used. These results indicate that in the migratory corridor Paso de Portachuleo, representative parasite lineages are found, making this location an attractive location for future studies.
Jacinto, R C; Gomes, B P F A; Desai, M; Rajendram, D; Shah, H N
2007-12-01
The aim of this study was to examine the diversity of bacterial species in the infected root canals of teeth associated with endodontic abscesses by cloning and sequencing techniques in concert with denaturing high-performance liquid chromatography. Samples collected from five infected root canals were subjected to polymerase chain reaction (PCR) with universal 16S ribosomal DNA primers. Products of these PCRs were cloned and sequenced. Denaturing high-performance liquid chromatography (DHPLC) was used as a screening method to reduce the number of clones necessary for DNA sequencing. All samples were positive for the presence of bacteria and a range of 7-13 different bacteria were found per root canal sample. In total, 48 different oral clones were detected among the five root canal samples. Olsenella profusa was the only species present in all samples. Porphyromonas gingivalis, Dialister pneumosintes, Dialister invisus, Lachnospiraceae oral clone, Staphylococcus aureus, Pseudoramibacter alactolyticus, Peptostreptococcus micros and Enterococcus faecalis were found in two of the five samples. The majority of the taxa were present in only one sample, for example Tannerella forsythia, Shuttleworthia satelles and Filifactor alocis. Some facultative anaerobes that are frequently isolated from endodontic infections such as E. faecalis, Streptococcus anginosus and Lactobacillus spp. were also found in this study. Clonal analysis of the microflora associated with endodontic infections revealed a wide diversity of oral species.
Investigating intra-host and intra-herd sequence diversity of foot-and-mouth disease virus.
King, David J; Freimanis, Graham L; Orton, Richard J; Waters, Ryan A; Haydon, Daniel T; King, Donald P
2016-10-01
Due to the poor-fidelity of the enzymes involved in RNA genome replication, foot-and-mouth disease (FMD) virus samples comprise of unique polymorphic populations. In this study, deep sequencing was utilised to characterise the diversity of FMD virus (FMDV) populations in 6 infected cattle present on a single farm during the series of outbreaks in the UK in 2007. A novel RT-PCR method was developed to amplify a 7.6kb nucleotide fragment encompassing the polyprotein coding region of the FMDV genome. Illumina sequencing of each sample identified the fine polymorphic structures at each nucleotide position, from consensus level changes to variants present at a 0.24% frequency. These data were used to investigate population dynamics of FMDV at both herd and host levels, evaluate the impact of host on the viral swarm structure and to identify transmission links with viruses recovered from other farms in the same series of outbreaks. In 7 samples, from 6 different animals, a total of 5 consensus level variants were identified, in addition to 104 sub-consensus variants of which 22 were shared between 2 or more animals. Further analysis revealed differences in swarm structures from samples derived from the same animal suggesting the presence of distinct viral populations evolving independently at different lesion sites within the same infected animal. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Howe, Adina; Yang, Fan; Williams, Ryan J.
Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the “core” set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved inmore » the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. As a result, in soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.« less
Li, Xiaofang; Zhu, Yong-Guan; Shaban, Babak; Bruxner, Timothy J. C.; Bond, Philip L.; Huang, Longbin
2015-01-01
Characterizing the genetic diversity of microbial copper (Cu) resistance at the community level remains challenging, mainly due to the polymorphism of the core functional gene copA. In this study, a local BLASTN method using a copA database built in this study was developed to recover full-length putative copA sequences from an assembled tailings metagenome; these sequences were then screened for potentially functioning CopA using conserved metal-binding motifs, inferred by evolutionary trace analysis of CopA sequences from known Cu resistant microorganisms. In total, 99 putative copA sequences were recovered from the tailings metagenome, out of which 70 were found with high potential to be functioning in Cu resistance. Phylogenetic analysis of selected copA sequences detected in the tailings metagenome showed that topology of the copA phylogeny is largely congruent with that of the 16S-based phylogeny of the tailings microbial community obtained in our previous study, indicating that the development of copA diversity in the tailings might be mainly through vertical descent with few lateral gene transfer events. The method established here can be used to explore copA (and potentially other metal resistance genes) diversity in any metagenome and has the potential to exhaust the full-length gene sequences for downstream analyses. PMID:26286020
Zhang, Likui; Kang, Manyu; Huang, Yangchao; Yang, Lixiang
2016-05-01
The diversity and ecological significance of bacteria and archaea in deep-sea environments have been thoroughly investigated, but eukaryotic microorganisms in these areas, such as fungi, are poorly understood. To elucidate fungal diversity in calcareous deep-sea sediments in the Southwest India Ridge (SWIR), the internal transcribed spacer (ITS) regions of rRNA genes from two sediment metagenomic DNA samples were amplified and sequenced using the Illumina sequencing platform. The results revealed that 58-63 % and 36-42 % of the ITS sequences (97 % similarity) belonged to Basidiomycota and Ascomycota, respectively. These findings suggest that Basidiomycota and Ascomycota are the predominant fungal phyla in the two samples. We also found that Agaricomycetes, Leotiomycetes, and Pezizomycetes were the major fungal classes in the two samples. At the species level, Thelephoraceae sp. and Phialocephala fortinii were major fungal species in the two samples. Despite the low relative abundance, unidentified fungal sequences were also observed in the two samples. Furthermore, we found that there were slight differences in fungal diversity between the two sediment samples, although both were collected from the SWIR. Thus, our results demonstrate that calcareous deep-sea sediments in the SWIR harbor diverse fungi, which augment the fungal groups in deep-sea sediments. This is the first report of fungal communities in calcareous deep-sea sediments in the SWIR revealed by Illumina sequencing.
Genotyping and Bioforensics of Ricinus communis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hinckley, Aubree Christine
The castor bean plant (Ricinus communis) is a member of the family Euphorbiaceae. In spite of its common name, the castor plant is not a true bean (i.e., leguminous plants belonging to the family, Fabaceae). Ricinus communis is native to tropical Africa, but because the plant was recognized for its production of oil with many desirable properties, it has been introduced and cultivated in warm temperate regions throughout the world (Armstrong 1999 and Brown 2005). Castor bean plants have also been valued by gardeners as an ornamental plant and, historically, as a natural rodenticide. Today, escaped plants grow like weedsmore » throughout much of the southwestern United States, and castor seeds are even widely available to the public for order through the Internet. In this study, multiple loci of chloroplast noncoding sequence data and a few nuclear noncoding regions were examined to identify DNA polymorphisms present among representatives from a geographically diverse panel of Ricinus communis cultivated varieties. The primary objectives for this research were (1) to successfully cultivate castor plants and extract sufficient yields of high quality DNA from an assortment of castor cultivated varieties, (2) to use PCR and sequencing to screen available universal oligos against a small panel of castor cultivars, (3) to identify DNA polymorphisms within the amplified regions, and (4) to evaluate these DNA polymorphisms as appropriate candidates for assay development (see Figure 1). Additional goals were to design, test and optimize assays targeting any DNA polymorphisms that were discovered and to rapidly screen many castor cultivars to determine the amount of diversity present at that particular locus. Ultimately, the goal of this study was to construct a phylogeographic tree representing the genetic relationships present among Ricinus communis cultivars from diverse geographic regions. These research objectives were designed to test the hypothesis that cultivated varieties of Ricinus communis from various geographic regions can be distinguished from one another based on differences present at the genetic level. In addition, the present study sought to determine the amount of diversity present among Ricinus communis cultivars.« less
Fungal genome sequencing: basic biology to biotechnology.
Sharma, Krishna Kant
2016-08-01
The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.
Effects of 16S rDNA sampling on estimates of the number of endosymbiont lineages in sucking lice
Burleigh, J. Gordon; Light, Jessica E.; Reed, David L.
2016-01-01
Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthirapatera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain. PMID:27547523
NASA Astrophysics Data System (ADS)
Zhao, Feng; Xu, Kuidong
2016-10-01
In comparison with the macrobenthos and prokaryotes, patterns of diversity and distribution of microbial eukaryotes in deep-sea hydrothermal vents are poorly known. The widely used high-throughput sequencing of 18S rDNA has revealed a high diversity of microeukaryotes yielded from both living organisms and buried DNA in marine sediments. More recently, cDNA surveys have been utilized to uncover the diversity of active organisms. However, both methods have never been used to evaluate the diversity of ciliates in hydrothermal vents. By using high-throughput DNA and cDNA sequencing of 18S rDNA, we evaluated the molecular diversity of ciliates, a representative group of microbial eukaryotes, from the sediments of deep-sea hydrothermal vents in the Okinawa Trough and compared it with that of an adjacent deep-sea area about 15 km away and that of an offshore area of the Yellow Sea about 500 km away. The results of DNA sequencing showed that Spirotrichea and Oligohymenophorea were the most diverse and abundant groups in all the three habitats. The proportion of sequences of Oligohymenophorea was the highest in the hydrothermal vents whereas Spirotrichea was the most diverse group at all three habitats. Plagiopyleans were found only in the hydrothermal vents but with low diversity and abundance. By contrast, the cDNA sequencing showed that Plagiopylea was the most diverse and most abundant group in the hydrothermal vents, followed by Spirotrichea in terms of diversity and Oligohymenophorea in terms of relative abundance. A novel group of ciliates, distinctly separate from the 12 known classes, was detected in the hydrothermal vents, indicating undescribed, possibly highly divergent ciliates may inhabit this environment. Statistical analyses showed that: (i) the three habitats differed significantly from one another in terms of diversity of both the rare and the total ciliate taxa, and; (ii) the adjacent deep sea was more similar to the offshore area than to the hydrothermal vents. In terms of the diversity of abundant taxa, however, there was no significant difference between the hydrothermal vents and the adjacent deep sea, both of which differed significantly from the offshore area. As abundant ciliate taxa can be found in several sampling sites, they are likely adapted to large environmental variations, while rare taxa are found in specific habitat and thus are potentially more sensitive to varying environmental conditions.
Dillon, Jesse G.; Carlin, Mark; Gutierrez, Abraham; Nguyen, Vivian; McLain, Nathan
2013-01-01
The goal of this study was to use environmental sequencing of 16S rRNA and bop genes to compare the diversity of planktonic bacteria and archaea across ponds with increasing salinity in the Exportadora de Sal (ESSA) evaporative saltern in Guerrero Negro, Baja CA S., Mexico. We hypothesized that diverse communities of heterotrophic bacteria and archaea would be found in the ESSA ponds, but that bacterial diversity would decrease relative to archaea at the highest salinities. Archaeal 16S rRNA diversity was higher in Ponds 11 and 12 (370 and 380 g l−1 total salts, respectively) compared to Pond 9 (180 g l−1 total salts). Both Pond 11 and 12 communities had high representation (47 and 45% of clones, respectively) by Haloquadratum walsbyi-like (99% similarity) lineages. The archaeal community in Pond 9 was dominated (79%) by a single uncultured phylotype with 99% similarity to sequences recovered from the Sfax saltern in Tunisia. This pattern was mirrored in bop gene diversity with greater numbers of highly supported phylotypes including many Haloquadratum-like sequences from the two highest salinity ponds. In Pond 9, most bop sequences, were not closely related to sequences in databases. Bacterial 16S rRNA diversity was higher than archaeal in both Pond 9 and Pond 12 samples, but not Pond 11, where a non-Salinibacter lineage within the Bacteroidetes >98% similar to environmental clones recovered from Lake Tuz in Turkey and a saltern in Chula Vista, CA was most abundant (69% of community). This OTU was also the most abundant in Pond 12, but only represented 14% of clones in the more diverse pond. The most abundant OTU in Pond 9 (33% of community) was 99% similar to an uncultured gammaproteobacterial clone from the Salton Sea. Results suggest that the communities of saltern bacteria and archaea vary even in ponds with similar salinity and further investigation into the ecology of diverse, uncultured halophile communities is warranted. PMID:24391633
Alonso-Vega, Pablo; Normand, Philippe; Bacigalupe, Rodrigo; Pujic, Petar; Lajus, Aurelie; Vallenet, David; Carro, Lorena; Coll, Pedro
2012-01-01
Micromonospora strains have been isolated from diverse niches, including soil, water, and marine sediments and root nodules of diverse symbiotic plants. In this work, we report the genome sequence of Micromonospora lupini Lupac 08 isolated from root nodules of the wild legume Lupinus angustifolious. PMID:22815450
USDA-ARS?s Scientific Manuscript database
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road was proposed based on evidence from diverse genomic analyses. Cultiva...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gordon, Sean
2013-03-01
Sean Gordon of the USDA on Natural variation in Brachypodium disctachyon: Deep Sequencing of Highly Diverse Natural Accessions at the 8th Annual Genomics of Energy Environment Meeting on March 27, 2013 in Walnut Creek, CA.
Allana K. Welsh; Jeffrey O. Dawson; Gerald J. Gottfried; Dittmar Hahn
2009-01-01
The diversity of uncultured Frankia populations in root nodules of Alnus oblongifolia trees geographically isolated on mountaintops of central Arizona was analyzed by comparative sequence analyses of nifH gene fragments. Sequences were retrieved from Frankia populations in nodules of four trees from each of...
Previous studies have shown that culture-based methods tend to underestimate the densities and diversity of bacterial populations inhabiting water distribution systems (WDS). In this study, the phylogenetic diversity of drinking water bacteria was assessed using sequence analysis...
[Study on Microbial Diversity of Peri-implantitis Subgingival by High-throughput Sequencing].
Li, Zhi-jie; Wang, Shao-guo; Li, Yue-hong; Tu, Dong-xiang; Liu, Shi-yun; Nie, Hong-bing; Li, Zhi-qiang; Zhang, Ju-mei
2015-07-01
To study microbial diversity of peri-implantitis subgingival with high-throughput sequencing, and investigate microbiological etiology of peri-implantitis. Subgingival plaques were sampled from the patients with peri-implantitis (D group) and non-peri-implantitis subjects (N group). The microbiological diversity of the subgingival plaques was detected by sequencing V4 region of 16S rRNA with Illumina Miseq platform. The diversity of the community structure was analyzed using Mothur software. A total of 156 507 gene sequences were detected in nine samples and 4 402 operational taxonomic units (OTUs) were found. Selenomonas, Pseudomonas, and Fusobacterium were dominant bacteria in D group, while Fusobacterium, Veillonella and Streptococcus were dominant bacteria in N group. Differences between peri-implantitis and non-peri-implantitis bacterial communities were observed at all phylogenetic levels by LEfSe, which was also found in PcoA test. The occurrence of peri-implantitis is not only related to periodontitis pathogenic microbe, but also related with the changes of oral microbial community structure. Treponema, Herbaspirillum, Butyricimonas and Phaeobacte may be closely related to the occurrence and development of peri-implantitis.
Thompson, Fabiano L; Bruce, Thiago; Gonzalez, Alessandra; Cardoso, Alexander; Clementino, Maysa; Costagliola, Marcela; Hozbor, Constanza; Otero, Ernesto; Piccini, Claudia; Peressutti, Silvia; Schmieder, Robert; Edwards, Robert; Smith, Mathew; Takiyama, Luis Roberto; Vieira, Ricardo; Paranhos, Rodolfo; Artigas, Luis Felipe
2011-02-01
The bacterioplankton diversity of coastal waters along a latitudinal gradient between Puerto Rico and Argentina was analyzed using a total of 134,197 high-quality sequences from the V6 hypervariable region of the small-subunit ribosomal RNA gene (16S rRNA) (mean length of 60 nt). Most of the OTUs were identified into Proteobacteria, Bacteriodetes, Cyanobacteria, and Actinobacteria, corresponding to approx. 80% of the total number of sequences. The number of OTUs corresponding to species varied between 937 and 1946 in the seven locations. Proteobacteria appeared at high frequency in the seven locations. An enrichment of Cyanobacteria was observed in Puerto Rico, whereas an enrichment of Bacteroidetes was detected in the Argentinian shelf and Uruguayan coastal lagoons. The highest number of sequences of Actinobacteria and Acidobacteria were obtained in the Amazon estuary mouth. The rarefaction curves and Good coverage estimator for species diversity suggested a significant coverage, with values ranging between 92 and 97% for Good coverage. Conserved taxa corresponded to aprox. 52% of all sequences. This study suggests that human-contaminated environments may influence bacterioplankton diversity.
Diversity of the Cronobacter Genus as Revealed by Multilocus Sequence Typing
Joseph, S.; Sonbol, H.; Hariri, S.; Desai, P.; McClelland, M.
2012-01-01
Cronobacter (previously known as Enterobacter sakazakii) is a diverse bacterial genus consisting of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. universalis, C. muytjensii, C. dublinensis, and C. condimenti. In this study, we have used a multilocus sequence typing (MLST) approach employing the alleles of 7 genes (atpD, fusA, glnS, gltB, gyrB, infB, and ppsA; total length, 3,036 bp) to investigate the phylogenetic relationship of 325 Cronobacter species isolates. Strains were chosen on the basis of their species, geographic and temporal distribution, source, and clinical outcome. The earliest strain was isolated from milk powder in 1950, and the earliest clinical strain was isolated in 1953. The existence of seven species was supported by MLST. Intraspecific variation ranged from low diversity in C. sakazakii to extensive diversity within some species, such as C. muytjensii and C. dublinensis, including evidence of gene conversion between species. The predominant species from clinical sources was found to be C. sakazakii. C. sakazakii sequence type 4 (ST4) was the predominant sequence type of cerebral spinal fluid isolates from cases of meningitis. PMID:22785185
HIV populations are large and accumulate high genetic diversity in a nonlinear fashion.
Maldarelli, Frank; Kearney, Mary; Palmer, Sarah; Stephens, Robert; Mican, JoAnn; Polis, Michael A; Davey, Richard T; Kovacs, Joseph; Shao, Wei; Rock-Kress, Diane; Metcalf, Julia A; Rehm, Catherine; Greer, Sarah E; Lucey, Daniel L; Danley, Kristen; Alter, Harvey; Mellors, John W; Coffin, John M
2013-09-01
HIV infection is characterized by rapid and error-prone viral replication resulting in genetically diverse virus populations. The rate of accumulation of diversity and the mechanisms involved are under intense study to provide useful information to understand immune evasion and the development of drug resistance. To characterize the development of viral diversity after infection, we carried out an in-depth analysis of single genome sequences of HIV pro-pol to assess diversity and divergence and to estimate replicating population sizes in a group of treatment-naive HIV-infected individuals sampled at single (n = 22) or multiple, longitudinal (n = 11) time points. Analysis of single genome sequences revealed nonlinear accumulation of sequence diversity during the course of infection. Diversity accumulated in recently infected individuals at rates 30-fold higher than in patients with chronic infection. Accumulation of synonymous changes accounted for most of the diversity during chronic infection. Accumulation of diversity resulted in population shifts, but the rates of change were low relative to estimated replication cycle times, consistent with relatively large population sizes. Analysis of changes in allele frequencies revealed effective population sizes that are substantially higher than previous estimates of approximately 1,000 infectious particles/infected individual. Taken together, these observations indicate that HIV populations are large, diverse, and slow to change in chronic infection and that the emergence of new mutations, including drug resistance mutations, is governed by both selection forces and drift.
Conceptual modeling of coincident failures in multiversion software
NASA Technical Reports Server (NTRS)
Littlewood, Bev; Miller, Douglas R.
1989-01-01
Recent work by Eckhardt and Lee (1985) shows that independently developed program versions fail dependently (specifically, simultaneous failure of several is greater than would be the case under true independence). The present authors show there is a precise duality between input choice and program choice in this model and consider a generalization in which different versions can be developed using diverse methodologies. The use of diverse methodologies is shown to decrease the probability of the simultaneous failure of several versions. Indeed, it is theoretically possible to obtain versions which exhibit better than independent failure behavior. The authors try to formalize the notion of methodological diversity by considering the sequence of decision outcomes that constitute a methodology. They show that diversity of decision implies likely diversity of behavior for the different verions developed under such forced diversity. For certain one-out-of-n systems the authors obtain an optimal method for allocating diversity between versions. For two-out-of-three systems there seem to be no simple optimality results which do not depend on constraints which cannot be verified in practice.
Logares, Ramiro; Haverkamp, Thomas H A; Kumar, Surendra; Lanzén, Anders; Nederbragt, Alexander J; Quince, Christopher; Kauserud, Håvard
2012-10-01
The incursion of High-Throughput Sequencing (HTS) in environmental microbiology brings unique opportunities and challenges. HTS now allows a high-resolution exploration of the vast taxonomic and metabolic diversity present in the microbial world, which can provide an exceptional insight on global ecosystem functioning, ecological processes and evolution. This exploration has also economic potential, as we will have access to the evolutionary innovation present in microbial metabolisms, which could be used for biotechnological development. HTS is also challenging the research community, and the current bottleneck is present in the data analysis side. At the moment, researchers are in a sequence data deluge, with sequencing throughput advancing faster than the computer power needed for data analysis. However, new tools and approaches are being developed constantly and the whole process could be depicted as a fast co-evolution between sequencing technology, informatics and microbiologists. In this work, we examine the most popular and recently commercialized HTS platforms as well as bioinformatics methods for data handling and analysis used in microbial metagenomics. This non-exhaustive review is intended to serve as a broad state-of-the-art guide to researchers expanding into this rapidly evolving field. Copyright © 2012 Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
To assess diversity of Salmonella enterica serotypes present in poultry and their environment from Southern Brazil, the Kauffman-White-LeMinor (KWL) scheme was used to serotype a total of 155 isolates. Isolates were then re-examined with nested PCR and sequencing of the dkgB-linked Intergenic Sequ...
First Genome Sequence of a Mexican Multidrug-Resistant Acinetobacter baumannii Isolate
Graña-Miraglia, Lucía; Lozano, Luis; Castro-Jaimes, Semiramis; Cevallos, Miguel A.; Volkow, Patricia
2016-01-01
Acinetobacter baumannii has emerged as an important nosocomial pathogen worldwide. Here, we present the draft genome of the first multidrug-resistant A. baumannii isolate, sampled from a tertiary hospital in Mexico City. This genome will provide a starting point for studying the genomic diversity of this species in Mexico. PMID:27013043
ERIC Educational Resources Information Center
Montes, Adonay A.; Rodriguez-Valls, Fernando; Schroeder, Laurie
2014-01-01
This article presents an interpersonal methodology designed to increase the cultural awareness of counselor candidates. This methodology was implemented through a sequence of activities, which was part of a multicultural course in the counseling credential program in a university located in Southern California. The goal was to enrich future…