Science.gov

Sample records for bacterioplankton genomes inferred

  1. Inferring tumor progression from genomic heterogeneity

    PubMed Central

    Navin, Nicholas; Krasnitz, Alexander; Rodgers, Linda; Cook, Kerry; Meth, Jennifer; Kendall, Jude; Riggs, Michael; Eberling, Yvonne; Troge, Jennifer; Grubor, Vladimir; Levy, Dan; Lundin, Pär; Månér, Susanne; Zetterberg, Anders; Hicks, James; Wigler, Michael

    2010-01-01

    Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form of the phylogenetic relationships. We have developed a method we call Sector-Ploidy-Profiling (SPP) to study the clonal composition of breast tumors. SPP involves macro-dissecting tumors, flow-sorting genomic subpopulations by DNA content, and profiling genomes using comparative genomic hybridization (CGH). Breast carcinomas display two classes of genomic structural variation: (1) monogenomic and (2) polygenomic. Monogenomic tumors appear to contain a single major clonal subpopulation with a highly stable chromosome structure. Polygenomic tumors contain multiple clonal tumor subpopulations, which may occupy the same sectors, or separate anatomic locations. In polygenomic tumors, we show that heterogeneity can be ascribed to a few clonal subpopulations, rather than a series of gradual intermediates. By comparing multiple subpopulations from different anatomic locations, we have inferred pathways of cancer progression and the organization of tumor growth. PMID:19903760

  2. Inferring tumor progression from genomic heterogeneity.

    PubMed

    Navin, Nicholas; Krasnitz, Alexander; Rodgers, Linda; Cook, Kerry; Meth, Jennifer; Kendall, Jude; Riggs, Michael; Eberling, Yvonne; Troge, Jennifer; Grubor, Vladimir; Levy, Dan; Lundin, Pär; Månér, Susanne; Zetterberg, Anders; Hicks, James; Wigler, Michael

    2010-01-01

    Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form of the phylogenetic relationships. We have developed a method we call Sector-Ploidy-Profiling (SPP) to study the clonal composition of breast tumors. SPP involves macro-dissecting tumors, flow-sorting genomic subpopulations by DNA content, and profiling genomes using comparative genomic hybridization (CGH). Breast carcinomas display two classes of genomic structural variation: (1) monogenomic and (2) polygenomic. Monogenomic tumors appear to contain a single major clonal subpopulation with a highly stable chromosome structure. Polygenomic tumors contain multiple clonal tumor subpopulations, which may occupy the same sectors, or separate anatomic locations. In polygenomic tumors, we show that heterogeneity can be ascribed to a few clonal subpopulations, rather than a series of gradual intermediates. By comparing multiple subpopulations from different anatomic locations, we have inferred pathways of cancer progression and the organization of tumor growth.

  3. Single-cell genomics-based analysis of virus-host interactions in marine surface bacterioplankton.

    PubMed

    Labonté, Jessica M; Swan, Brandon K; Poulos, Bonnie; Luo, Haiwei; Koren, Sergey; Hallam, Steven J; Sullivan, Matthew B; Woyke, Tanja; Wommack, K Eric; Stepanauskas, Ramunas

    2015-11-01

    Viral infections dynamically alter the composition and metabolic potential of marine microbial communities and the evolutionary trajectories of host populations with resulting feedback on biogeochemical cycles. It is quite possible that all microbial populations in the ocean are impacted by viral infections. Our knowledge of virus-host relationships, however, has been limited to a minute fraction of cultivated host groups. Here, we utilized single-cell sequencing to obtain genomic blueprints of viruses inside or attached to individual bacterial and archaeal cells captured in their native environment, circumventing the need for host and virus cultivation. A combination of comparative genomics, metagenomic fragment recruitment, sequence anomalies and irregularities in sequence coverage depth and genome recovery were utilized to detect viruses and to decipher modes of virus-host interactions. Members of all three tailed phage families were identified in 20 out of 58 phylogenetically and geographically diverse single amplified genomes (SAGs) of marine bacteria and archaea. At least four phage-host interactions had the characteristics of late lytic infections, all of which were found in metabolically active cells. One virus had genetic potential for lysogeny. Our findings include first known viruses of Thaumarchaeota, Marinimicrobia, Verrucomicrobia and Gammaproteobacteria clusters SAR86 and SAR92. Viruses were also found in SAGs of Alphaproteobacteria and Bacteroidetes. A high fragment recruitment of viral metagenomic reads confirmed that most of the SAG-associated viruses are abundant in the ocean. Our study demonstrates that single-cell genomics, in conjunction with sequence-based computational tools, enable in situ, cultivation-independent insights into host-virus interactions in complex microbial communities. PMID:25848873

  4. Single-cell genomics-based analysis of virus–host interactions in marine surface bacterioplankton

    PubMed Central

    Labonté, Jessica M; Swan, Brandon K; Poulos, Bonnie; Luo, Haiwei; Koren, Sergey; Hallam, Steven J; Sullivan, Matthew B; Woyke, Tanja; Eric Wommack, K; Stepanauskas, Ramunas

    2015-01-01

    Viral infections dynamically alter the composition and metabolic potential of marine microbial communities and the evolutionary trajectories of host populations with resulting feedback on biogeochemical cycles. It is quite possible that all microbial populations in the ocean are impacted by viral infections. Our knowledge of virus–host relationships, however, has been limited to a minute fraction of cultivated host groups. Here, we utilized single-cell sequencing to obtain genomic blueprints of viruses inside or attached to individual bacterial and archaeal cells captured in their native environment, circumventing the need for host and virus cultivation. A combination of comparative genomics, metagenomic fragment recruitment, sequence anomalies and irregularities in sequence coverage depth and genome recovery were utilized to detect viruses and to decipher modes of virus–host interactions. Members of all three tailed phage families were identified in 20 out of 58 phylogenetically and geographically diverse single amplified genomes (SAGs) of marine bacteria and archaea. At least four phage–host interactions had the characteristics of late lytic infections, all of which were found in metabolically active cells. One virus had genetic potential for lysogeny. Our findings include first known viruses of Thaumarchaeota, Marinimicrobia, Verrucomicrobia and Gammaproteobacteria clusters SAR86 and SAR92. Viruses were also found in SAGs of Alphaproteobacteria and Bacteroidetes. A high fragment recruitment of viral metagenomic reads confirmed that most of the SAG-associated viruses are abundant in the ocean. Our study demonstrates that single-cell genomics, in conjunction with sequence-based computational tools, enable in situ, cultivation-independent insights into host–virus interactions in complex microbial communities. PMID:25848873

  5. A metagenomic assessment of winter and summer bacterioplankton from Antarctica Peninsula coastal surface waters.

    PubMed

    Grzymski, Joseph J; Riesenfeld, Christian S; Williams, Timothy J; Dussaq, Alex M; Ducklow, Hugh; Erickson, Matthew; Cavicchioli, Ricardo; Murray, Alison E

    2012-10-01

    Antarctic surface oceans are well-studied during summer when irradiance levels are high, sea ice is melting and primary productivity is at a maximum. Coincident with this timing, the bacterioplankton respond with significant increases in secondary productivity. Little is known about bacterioplankton in winter when darkness and sea-ice cover inhibit photoautotrophic primary production. We report here an environmental genomic and small subunit ribosomal RNA (SSU rRNA) analysis of winter and summer Antarctic Peninsula coastal seawater bacterioplankton. Intense inter-seasonal differences were reflected through shifts in community composition and functional capacities encoded in winter and summer environmental genomes with significantly higher phylogenetic and functional diversity in winter. In general, inferred metabolisms of summer bacterioplankton were characterized by chemoheterotrophy, photoheterotrophy and aerobic anoxygenic photosynthesis while the winter community included the capacity for bacterial and archaeal chemolithoautotrophy. Chemolithoautotrophic pathways were dominant in winter and were similar to those recently reported in global 'dark ocean' mesopelagic waters. If chemolithoautotrophy is widespread in the Southern Ocean in winter, this process may be a previously unaccounted carbon sink and may help account for the unexplained anomalies in surface inorganic nitrogen content. PMID:22534611

  6. A metagenomic assessment of winter and summer bacterioplankton from Antarctica Peninsula coastal surface waters

    PubMed Central

    Grzymski, Joseph J; Riesenfeld, Christian S; Williams, Timothy J; Dussaq, Alex M; Ducklow, Hugh; Erickson, Matthew; Cavicchioli, Ricardo; Murray, Alison E

    2012-01-01

    Antarctic surface oceans are well-studied during summer when irradiance levels are high, sea ice is melting and primary productivity is at a maximum. Coincident with this timing, the bacterioplankton respond with significant increases in secondary productivity. Little is known about bacterioplankton in winter when darkness and sea-ice cover inhibit photoautotrophic primary production. We report here an environmental genomic and small subunit ribosomal RNA (SSU rRNA) analysis of winter and summer Antarctic Peninsula coastal seawater bacterioplankton. Intense inter-seasonal differences were reflected through shifts in community composition and functional capacities encoded in winter and summer environmental genomes with significantly higher phylogenetic and functional diversity in winter. In general, inferred metabolisms of summer bacterioplankton were characterized by chemoheterotrophy, photoheterotrophy and aerobic anoxygenic photosynthesis while the winter community included the capacity for bacterial and archaeal chemolithoautotrophy. Chemolithoautotrophic pathways were dominant in winter and were similar to those recently reported in global ‘dark ocean' mesopelagic waters. If chemolithoautotrophy is widespread in the Southern Ocean in winter, this process may be a previously unaccounted carbon sink and may help account for the unexplained anomalies in surface inorganic nitrogen content. PMID:22534611

  7. Inferring correlation networks from genomic survey data.

    PubMed

    Friedman, Jonathan; Alm, Eric J

    2012-01-01

    High-throughput sequencing based techniques, such as 16S rRNA gene profiling, have the potential to elucidate the complex inner workings of natural microbial communities - be they from the world's oceans or the human gut. A key step in exploring such data is the identification of dependencies between members of these communities, which is commonly achieved by correlation analysis. However, it has been known since the days of Karl Pearson that the analysis of the type of data generated by such techniques (referred to as compositional data) can produce unreliable results since the observed data take the form of relative fractions of genes or species, rather than their absolute abundances. Using simulated and real data from the Human Microbiome Project, we show that such compositional effects can be widespread and severe: in some real data sets many of the correlations among taxa can be artifactual, and true correlations may even appear with opposite sign. Additionally, we show that community diversity is the key factor that modulates the acuteness of such compositional effects, and develop a new approach, called SparCC (available at https://bitbucket.org/yonatanf/sparcc), which is capable of estimating correlation values from compositional data. To illustrate a potential application of SparCC, we infer a rich ecological network connecting hundreds of interacting species across 18 sites on the human body. Using the SparCC network as a reference, we estimated that the standard approach yields 3 spurious species-species interactions for each true interaction and misses 60% of the true interactions in the human microbiome data, and, as predicted, most of the erroneous links are found in the samples with the lowest diversity. PMID:23028285

  8. Inferring Correlation Networks from Genomic Survey Data

    PubMed Central

    Friedman, Jonathan; Alm, Eric J.

    2012-01-01

    High-throughput sequencing based techniques, such as 16S rRNA gene profiling, have the potential to elucidate the complex inner workings of natural microbial communities - be they from the world's oceans or the human gut. A key step in exploring such data is the identification of dependencies between members of these communities, which is commonly achieved by correlation analysis. However, it has been known since the days of Karl Pearson that the analysis of the type of data generated by such techniques (referred to as compositional data) can produce unreliable results since the observed data take the form of relative fractions of genes or species, rather than their absolute abundances. Using simulated and real data from the Human Microbiome Project, we show that such compositional effects can be widespread and severe: in some real data sets many of the correlations among taxa can be artifactual, and true correlations may even appear with opposite sign. Additionally, we show that community diversity is the key factor that modulates the acuteness of such compositional effects, and develop a new approach, called SparCC (available at https://bitbucket.org/yonatanf/sparcc), which is capable of estimating correlation values from compositional data. To illustrate a potential application of SparCC, we infer a rich ecological network connecting hundreds of interacting species across 18 sites on the human body. Using the SparCC network as a reference, we estimated that the standard approach yields 3 spurious species-species interactions for each true interaction and misses 60% of the true interactions in the human microbiome data, and, as predicted, most of the erroneous links are found in the samples with the lowest diversity. PMID:23028285

  9. Robust demographic inference from genomic and SNP data.

    PubMed

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C; Foll, Matthieu

    2013-10-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with ∂a∂i, the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets.

  10. Robust Demographic Inference from Genomic and SNP Data

    PubMed Central

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu

    2013-01-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with , the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310

  11. Ancestral genome inference using a genetic algorithm approach.

    PubMed

    Gao, Nan; Yang, Ning; Tang, Jijun

    2013-01-01

    Recent advancement of technologies has now made it routine to obtain and compare gene orders within genomes. Rearrangements of gene orders by operations such as reversal and transposition are rare events that enable researchers to reconstruct deep evolutionary histories. An important application of genome rearrangement analysis is to infer gene orders of ancestral genomes, which is valuable for identifying patterns of evolution and for modeling the evolutionary processes. Among various available methods, parsimony-based methods (including GRAPPA and MGR) are the most widely used. Since the core algorithms of these methods are solvers for the so called median problem, providing efficient and accurate median solver has attracted lots of attention in this field. The "double-cut-and-join" (DCJ) model uses the single DCJ operation to account for all genome rearrangement events. Because mathematically it is much simpler than handling events directly, parsimony methods using DCJ median solvers has better speed and accuracy. However, the DCJ median problem is NP-hard and although several exact algorithms are available, they all have great difficulties when given genomes are distant. In this paper, we present a new algorithm that combines genetic algorithm (GA) with genomic sorting to produce a new method which can solve the DCJ median problem in limited time and space, especially in large and distant datasets. Our experimental results show that this new GA-based method can find optimal or near optimal results for problems ranging from easy to very difficult. Compared to existing parsimony methods which may severely underestimate the true number of evolutionary events, the sorting-based approach can infer ancestral genomes which are much closer to their true ancestors. The code is available at http://phylo.cse.sc.edu. PMID:23658708

  12. Genome-wide inference of ancestral recombination graphs.

    PubMed

    Rasmussen, Matthew D; Hubisz, Melissa J; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the "ancestral recombination graph" (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of [Formula: see text] chromosomes conditional on an ARG of [Formula: see text] chromosomes, an operation we call "threading." Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  13. Genome-Wide Inference of Ancestral Recombination Graphs

    PubMed Central

    Rasmussen, Matthew D.; Hubisz, Melissa J.; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the “ancestral recombination graph” (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of chromosomes conditional on an ARG of chromosomes, an operation we call “threading.” Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  14. Adaptive gene expression divergence inferred from population genomics.

    PubMed

    Holloway, Alisha K; Lawniczak, Mara K N; Mezey, Jason G; Begun, David J; Jones, Corbin D

    2007-10-01

    Detailed studies of individual genes have shown that gene expression divergence often results from adaptive evolution of regulatory sequence. Genome-wide analyses, however, have yet to unite patterns of gene expression with polymorphism and divergence to infer population genetic mechanisms underlying expression evolution. Here, we combined genomic expression data--analyzed in a phylogenetic context--with whole genome light-shotgun sequence data from six Drosophila simulans lines and reference sequences from D. melanogaster and D. yakuba. These data allowed us to use molecular population genetics to test for neutral versus adaptive gene expression divergence on a genomic scale. We identified recent and recurrent adaptive evolution along the D. simulans lineage by contrasting sequence polymorphism within D. simulans to divergence from D. melanogaster and D. yakuba. Genes that evolved higher levels of expression in D. simulans have experienced adaptive evolution of the associated 3' flanking and amino acid sequence. Concomitantly, these genes are also decelerating in their rates of protein evolution, which is in agreement with the finding that highly expressed genes evolve slowly. Interestingly, adaptive evolution in 5' cis-regulatory regions did not correspond strongly with expression evolution. Our results provide a genomic view of the intimate link between selection acting on a phenotype and associated genic evolution.

  15. Genomic inference of the metabolism of cosmopolitan subsurface Archaea, Hadesarchaea.

    PubMed

    Baker, Brett J; Saw, Jimmy H; Lind, Anders E; Lazar, Cassandre Sara; Hinrichs, Kai-Uwe; Teske, Andreas P; Ettema, Thijs J G

    2016-02-15

    The subsurface biosphere is largely unexplored and contains a broad diversity of uncultured microbes(1). Despite being one of the few prokaryotic lineages that is cosmopolitan in both the terrestrial and marine subsurface(2-4), the physiological and ecological roles of SAGMEG (South-African Gold Mine Miscellaneous Euryarchaeal Group) Archaea are unknown. Here, we report the metabolic capabilities of this enigmatic group as inferred from genomic reconstructions. Four high-quality (63-90% complete) genomes were obtained from White Oak River estuary and Yellowstone National Park hot spring sediment metagenomes. Phylogenomic analyses place SAGMEG Archaea as a deeply rooting sister clade of the Thermococci, leading us to propose the name Hadesarchaea for this new Archaeal class. With an estimated genome size of around 1.5 Mbp, the genomes of Hadesarchaea are distinctly streamlined, yet metabolically versatile. They share several physiological mechanisms with strict anaerobic Euryarchaeota. Several metabolic characteristics make them successful in the subsurface, including genes involved in CO and H2 oxidation (or H2 production), with potential coupling to nitrite reduction to ammonia (DNRA). This first glimpse into the metabolic capabilities of these cosmopolitan Archaea suggests they are mediating key geochemical processes and are specialized for survival in the subsurface biosphere.

  16. Genomic inference of the metabolism of cosmopolitan subsurface Archaea, Hadesarchaea.

    PubMed

    Baker, Brett J; Saw, Jimmy H; Lind, Anders E; Lazar, Cassandre Sara; Hinrichs, Kai-Uwe; Teske, Andreas P; Ettema, Thijs J G

    2016-01-01

    The subsurface biosphere is largely unexplored and contains a broad diversity of uncultured microbes(1). Despite being one of the few prokaryotic lineages that is cosmopolitan in both the terrestrial and marine subsurface(2-4), the physiological and ecological roles of SAGMEG (South-African Gold Mine Miscellaneous Euryarchaeal Group) Archaea are unknown. Here, we report the metabolic capabilities of this enigmatic group as inferred from genomic reconstructions. Four high-quality (63-90% complete) genomes were obtained from White Oak River estuary and Yellowstone National Park hot spring sediment metagenomes. Phylogenomic analyses place SAGMEG Archaea as a deeply rooting sister clade of the Thermococci, leading us to propose the name Hadesarchaea for this new Archaeal class. With an estimated genome size of around 1.5 Mbp, the genomes of Hadesarchaea are distinctly streamlined, yet metabolically versatile. They share several physiological mechanisms with strict anaerobic Euryarchaeota. Several metabolic characteristics make them successful in the subsurface, including genes involved in CO and H2 oxidation (or H2 production), with potential coupling to nitrite reduction to ammonia (DNRA). This first glimpse into the metabolic capabilities of these cosmopolitan Archaea suggests they are mediating key geochemical processes and are specialized for survival in the subsurface biosphere. PMID:27572167

  17. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

    PubMed

    Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

    2012-10-01

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago.

  18. Population Genetic Inference from Personal Genome Data: Impact of Ancestry and Admixture on Human Genomic Variation

    PubMed Central

    Kidd, Jeffrey M.; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D.; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F.; Peckham, Heather E.; Omberg, Larsson; Bormann Chung, Christina A.; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G.; Russell, Archie; Reynolds, Andy; Clark, Andrew G.; Reese, Martin G.; Lincoln, Stephen E.; Butte, Atul J.; De La Vega, Francisco M.; Bustamante, Carlos D.

    2012-01-01

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas—70% of the European ancestry in today’s African Americans dates back to European gene flow happening only 7–8 generations ago. PMID:23040495

  19. Uncertainty in homology inferences: Assessing and improving genomic sequence alignment

    PubMed Central

    Lunter, Gerton; Rocco, Andrea; Mimouni, Naila; Heger, Andreas; Caldeira, Alexandre; Hein, Jotun

    2008-01-01

    Sequence alignment underpins all of comparative genomics, yet it remains an incompletely solved problem. In particular, the statistical uncertainty within inferred alignments is often disregarded, while parametric or phylogenetic inferences are considered meaningless without confidence estimates. Here, we report on a theoretical and simulation study of pairwise alignments of genomic DNA at human–mouse divergence. We find that >15% of aligned bases are incorrect in existing whole-genome alignments, and we identify three types of alignment error, each leading to systematic biases in all algorithms considered. Careful modeling of the evolutionary process improves alignment quality; however, these improvements are modest compared with the remaining alignment errors, even with exact knowledge of the evolutionary model, emphasizing the need for statistical approaches to account for uncertainty. We develop a new algorithm, Marginalized Posterior Decoding (MPD), which explicitly accounts for uncertainties, is less biased and more accurate than other algorithms we consider, and reduces the proportion of misaligned bases by a third compared with the best existing algorithm. To our knowledge, this is the first nonheuristic algorithm for DNA sequence alignment to show robust improvements over the classic Needleman–Wunsch algorithm. Despite this, considerable uncertainty remains even in the improved alignments. We conclude that a probabilistic treatment is essential, both to improve alignment quality and to quantify the remaining uncertainty. This is becoming increasingly relevant with the growing appreciation of the importance of noncoding DNA, whose study relies heavily on alignments. Alignment errors are inevitable, and should be considered when drawing conclusions from alignments. Software and alignments to assist researchers in doing this are provided at http://genserv.anat.ox.ac.uk/grape/. PMID:18073381

  20. Inferring the Dynamics of Effective Population Size Using Autosomal Genomes

    PubMed Central

    Hou, Zheng; Luo, Yin; Wang, Zhisheng; Zheng, Hong-Xiang; Wang, Yi; Zhou, Hang; Wu, Leqin; Jin, Li

    2016-01-01

    Next-generation sequencing technology has provided a great opportunity for inferring human demographic history by investigating changes in the effective population size (Ne). In this report, we introduce a strategy for estimating Ne dynamics, allowing the exploration of large multi-locus SNP datasets. We applied this strategy to the Phase 1 Han Chinese samples from the 1000 Genomes Project. The Han Chinese population has undergone a continuous expansion since 25,000 years ago, at first slowly from about 7,300 to 9,800 (at the end of the last glacial maximum about 15,000 YBP), then more quickly to about 46,000 (at the beginning of the Neolithic about 8,000 YBP), and then even more quickly to reach a population size of about 140,000 (recently). PMID:26832887

  1. Bayesian population genomic inference of crossing over and gene conversion.

    PubMed

    Padhukasahasram, Badri; Rannala, Bruce

    2011-10-01

    Meiotic recombination is a fundamental cellular mechanism in sexually reproducing organisms and its different forms, crossing over and gene conversion both play an important role in shaping genetic variation in populations. Here, we describe a coalescent-based full-likelihood Markov chain Monte Carlo (MCMC) method for jointly estimating the crossing-over, gene-conversion, and mean tract length parameters from population genomic data under a Bayesian framework. Although computationally more expensive than methods that use approximate likelihoods, the relative efficiency of our method is expected to be optimal in theory. Furthermore, it is also possible to obtain a posterior sample of genealogies for the data using this method. We first check the performance of the new method on simulated data and verify its correctness. We also extend the method for inference under models with variable gene-conversion and crossing-over rates and demonstrate its ability to identify recombination hotspots. Then, we apply the method to two empirical data sets that were sequenced in the telomeric regions of the X chromosome of Drosophila melanogaster. Our results indicate that gene conversion occurs more frequently than crossing over in the su-w and su-s gene sequences while the local rates of crossing over as inferred by our program are not low. The mean tract lengths for gene-conversion events are estimated to be ∼70 bp and 430 bp, respectively, for these data sets. Finally, we discuss ideas and optimizations for reducing the execution time of our algorithm.

  2. Catchment-scale biogeography of riverine bacterioplankton

    PubMed Central

    Read, Daniel S; Gweon, Hyun S; Bowes, Michael J; Newbold, Lindsay K; Field, Dawn; Bailey, Mark J; Griffiths, Robert I

    2015-01-01

    Lotic ecosystems such as rivers and streams are unique in that they represent a continuum of both space and time during the transition from headwaters to the river mouth. As microbes have very different controls over their ecology, distribution and dispersion compared with macrobiota, we wished to explore biogeographical patterns within a river catchment and uncover the major drivers structuring bacterioplankton communities. Water samples collected across the River Thames Basin, UK, covering the transition from headwater tributaries to the lower reaches of the main river channel were characterised using 16S rRNA gene pyrosequencing. This approach revealed an ecological succession in the bacterial community composition along the river continuum, moving from a community dominated by Bacteroidetes in the headwaters to Actinobacteria-dominated downstream. Location of the sampling point in the river network (measured as the cumulative water channel distance upstream) was found to be the most predictive spatial feature; inferring that ecological processes pertaining to temporal community succession are of prime importance in driving the assemblages of riverine bacterioplankton communities. A decrease in bacterial activity rates and an increase in the abundance of low nucleic acid bacteria relative to high nucleic acid bacteria were found to correspond with these downstream changes in community structure, suggesting corresponding functional changes. Our findings show that bacterial communities across the Thames basin exhibit an ecological succession along the river continuum, and that this is primarily driven by water residence time rather than the physico-chemical status of the river. PMID:25238398

  3. Catchment-scale biogeography of riverine bacterioplankton.

    PubMed

    Read, Daniel S; Gweon, Hyun S; Bowes, Michael J; Newbold, Lindsay K; Field, Dawn; Bailey, Mark J; Griffiths, Robert I

    2015-02-01

    Lotic ecosystems such as rivers and streams are unique in that they represent a continuum of both space and time during the transition from headwaters to the river mouth. As microbes have very different controls over their ecology, distribution and dispersion compared with macrobiota, we wished to explore biogeographical patterns within a river catchment and uncover the major drivers structuring bacterioplankton communities. Water samples collected across the River Thames Basin, UK, covering the transition from headwater tributaries to the lower reaches of the main river channel were characterised using 16S rRNA gene pyrosequencing. This approach revealed an ecological succession in the bacterial community composition along the river continuum, moving from a community dominated by Bacteroidetes in the headwaters to Actinobacteria-dominated downstream. Location of the sampling point in the river network (measured as the cumulative water channel distance upstream) was found to be the most predictive spatial feature; inferring that ecological processes pertaining to temporal community succession are of prime importance in driving the assemblages of riverine bacterioplankton communities. A decrease in bacterial activity rates and an increase in the abundance of low nucleic acid bacteria relative to high nucleic acid bacteria were found to correspond with these downstream changes in community structure, suggesting corresponding functional changes. Our findings show that bacterial communities across the Thames basin exhibit an ecological succession along the river continuum, and that this is primarily driven by water residence time rather than the physico-chemical status of the river. PMID:25238398

  4. Evolution of mammalian genome organization inferred from comparative gene mapping

    PubMed Central

    Murphy, William J; Stanyon, Roscoe; O'Brien, Stephen J

    2001-01-01

    Comparative genome analyses, including chromosome painting in over 40 diverse mammalian species, ordered gene maps from several representatives of different mammalian and vertebrate orders, and large-scale sequencing of the human and mouse genomes are beginning to provide insight into the rates and patterns of chromosomal evolution on a whole-genome scale, as well as into the forces that have sculpted the genomes of extant mammalian species. PMID:11423011

  5. Systems Biology and Ecology of Streamlined Bacterioplankton

    NASA Astrophysics Data System (ADS)

    Giovannoni, S. J.

    2014-12-01

    The salient feature of streamlined cells is their small genome size, but "streamlining" refers more generally to selection that favors minimization of cell size and complexity. The essence of streamlining theory is that selection is most efficient in organisms that have large effective population sizes, and, in nutrient-limited systems, favors cell architecture that minimizes resources required for replication. Regardless of the cause of genome reduction, lost coding potential eventually dictates loss of function, raising the questions, what genome features are expendable, and how do cells become highly successful with a minimal genomic repertoire? One consequence of reductive evolution in streamlined organisms is atypical patterns of prototrophy, for example the recent discovery of a requirement for the thiamin precursor 4-amino-5-hydroxymethyl-2-methylpyrimidine in some plankton taxa. Examples such as this fit within the framework of the Black Queen Hypothesis, which describes genome reduction that results in reliance on community goods and increased community connectivity. Other examples of genome reduction include losses of regulatory functions, or replacement with simpler regulatory systems, and increased metabolic integration. In one such case, in the order Pelagibacterales, the PII system for regulating responses to N limitation has been replaced with a simpler system composed of fewer genes. Both the absence of common regulatory systems and atypical patterns of prototrophy have been linked to difficulty in culturing Pelagibacterales, lending credibility to the idea that streamlining might broadly explain the phenomenon of the uncultured microbial majority. The success of streamlined osmotrophic bacterioplankton suggests that they successfully compete for labile organic matter and capture a large share of this resource, but an alternative theory postulates they are not good resource competitors and instead prosper by avoiding predation. The answers to these

  6. Genome-wide inference of regulatory networks in Streptomyces coelicolor

    PubMed Central

    2010-01-01

    Background The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. Results In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Conclusions Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification. PMID:20955611

  7. OMA 2011: orthology inference among 1000 complete genomes.

    PubMed

    Altenhoff, Adrian M; Schneider, Adrian; Gonnet, Gaston H; Dessimoz, Christophe

    2011-01-01

    OMA (Orthologous MAtrix) is a database that identifies orthologs among publicly available, complete genomes. Initiated in 2004, the project is at its 11th release. It now includes 1000 genomes, making it one of the largest resources of its kind. Here, we describe recent developments in terms of species covered; the algorithmic pipeline--in particular regarding the treatment of alternative splicing, and new features of the web (OMA Browser) and programming interface (SOAP API). In the second part, we review the various representations provided by OMA and their typical applications. The database is publicly accessible at http://omabrowser.org.

  8. Inferring Demography from Runs of Homozygosity in Whole-Genome Sequence, with Correction for Sequence Errors

    PubMed Central

    MacLeod, Iona M.; Larkin, Denis M.; Lewin, Harris A.; Hayes, Ben J.; Goddard, Mike E.

    2013-01-01

    Whole-genome sequence is potentially the richest source of genetic data for inferring ancestral demography. However, full sequence also presents significant challenges to fully utilize such large data sets and to ensure that sequencing errors do not introduce bias into the inferred demography. Using whole-genome sequence data from two Holstein cattle, we demonstrate a new method to correct for bias caused by hidden errors and then infer stepwise changes in ancestral demography up to present. There was a strong upward bias in estimates of recent effective population size (Ne) if the correction method was not applied to the data, both for our method and the Li and Durbin (Inference of human population history from individual whole-genome sequences. Nature 475:493–496) pairwise sequentially Markovian coalescent method. To infer demography, we use an analytical predictor of multiloci linkage disequilibrium (LD) based on a simple coalescent model that allows for changes in Ne. The LD statistic summarizes the distribution of runs of homozygosity for any given demography. We infer a best fit demography as one that predicts a match with the observed distribution of runs of homozygosity in the corrected sequence data. We use multiloci LD because it potentially holds more information about ancestral demography than pairwise LD. The inferred demography indicates a strong reduction in the Ne around 170,000 years ago, possibly related to the divergence of African and European Bos taurus cattle. This is followed by a further reduction coinciding with the period of cattle domestication, with Ne of between 3,500 and 6,000. The most recent reduction of Ne to approximately 100 in the Holstein breed agrees well with estimates from pedigrees. Our approach can be applied to whole-genome sequence from any diploid species and can be scaled up to use sequence from multiple individuals. PMID:23842528

  9. Using Genetic Distance to Infer the Accuracy of Genomic Prediction.

    PubMed

    Scutari, Marco; Mackay, Ian; Balding, David

    2016-09-01

    The prediction of phenotypic traits using high-density genomic data has many applications such as the selection of plants and animals of commercial interest; and it is expected to play an increasing role in medical diagnostics. Statistical models used for this task are usually tested using cross-validation, which implicitly assumes that new individuals (whose phenotypes we would like to predict) originate from the same population the genomic prediction model is trained on. In this paper we propose an approach based on clustering and resampling to investigate the effect of increasing genetic distance between training and target populations when predicting quantitative traits. This is important for plant and animal genetics, where genomic selection programs rely on the precision of predictions in future rounds of breeding. Therefore, estimating how quickly predictive accuracy decays is important in deciding which training population to use and how often the model has to be recalibrated. We find that the correlation between true and predicted values decays approximately linearly with respect to either FST or mean kinship between the training and the target populations. We illustrate this relationship using simulations and a collection of data sets from mice, wheat and human genetics. PMID:27589268

  10. Using Genetic Distance to Infer the Accuracy of Genomic Prediction

    PubMed Central

    Scutari, Marco; Mackay, Ian

    2016-01-01

    The prediction of phenotypic traits using high-density genomic data has many applications such as the selection of plants and animals of commercial interest; and it is expected to play an increasing role in medical diagnostics. Statistical models used for this task are usually tested using cross-validation, which implicitly assumes that new individuals (whose phenotypes we would like to predict) originate from the same population the genomic prediction model is trained on. In this paper we propose an approach based on clustering and resampling to investigate the effect of increasing genetic distance between training and target populations when predicting quantitative traits. This is important for plant and animal genetics, where genomic selection programs rely on the precision of predictions in future rounds of breeding. Therefore, estimating how quickly predictive accuracy decays is important in deciding which training population to use and how often the model has to be recalibrated. We find that the correlation between true and predicted values decays approximately linearly with respect to either FST or mean kinship between the training and the target populations. We illustrate this relationship using simulations and a collection of data sets from mice, wheat and human genetics. PMID:27589268

  11. Alignment-free genome tree inference by learning group-specific distance metrics.

    PubMed

    Patil, Kaustubh R; McHardy, Alice C

    2013-01-01

    Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.

  12. A Cooperative Co-Evolutionary Genetic Algorithm for Tree Scoring and Ancestral Genome Inference.

    PubMed

    Gao, Nan; Zhang, Yan; Feng, Bing; Tang, Jijun

    2015-01-01

    Recent advances of technology have made it easy to obtain and compare whole genomes. Rearrangements of genomes through operations such as reversals and transpositions are rare events that enable researchers to reconstruct deep evolutionary history among species. Some of the popular methods need to search a large tree space for the best scored tree, thus it is desirable to have a fast and accurate method that can score a given tree efficiently. During the tree scoring procedure, the genomic structures of internal tree nodes are also provided, which provide important information for inferring ancestral genomes and for modeling the evolutionary processes. However, computing tree scores and ancestral genomes are very difficult and a lot of researchers have to rely on heuristic methods which have various disadvantages. In this paper, we describe the first genetic algorithm for tree scoring and ancestor inference, which uses a fitness function considering co-evolution, adopts different initial seeding methods to initialize the first population pool, and utilizes a sorting-based approach to realize evolution. Our extensive experiments show that compared with other existing algorithms, this new method is more accurate and can infer ancestral genomes that are much closer to the true ancestors. PMID:26671797

  13. The History of Slavs Inferred from Complete Mitochondrial Genome Sequences

    PubMed Central

    Mielnik-Sikorska, Marta; Daca, Patrycja; Malyarchuk, Boris; Derenko, Miroslava; Skonieczna, Katarzyna; Perkova, Maria; Dobosz, Tadeusz; Grzybowski, Tomasz

    2013-01-01

    To shed more light on the processes leading to crystallization of a Slavic identity, we investigated variability of complete mitochondrial genomes belonging to haplogroups H5 and H6 (63 mtDNA genomes) from the populations of Eastern and Western Slavs, including new samples of Poles, Ukrainians and Czechs presented here. Molecular dating implies formation of H5 approximately 11.5–16 thousand years ago (kya) in the areas of southern Europe. Within ancient haplogroup H6, dated at around 15–28 kya, there is a subhaplogroup H6c, which probably survived the last glaciation in Europe and has undergone expansion only 3–4 kya, together with the ancestors of some European groups, including the Slavs, because H6c has been detected in Czechs, Poles and Slovaks. Detailed analysis of complete mtDNAs allowed us to identify a number of lineages that seem specific for Central and Eastern Europe (H5a1f, H5a2, H5a1r, H5a1s, H5b4, H5e1a, H5u1, some subbranches of H5a1a and H6a1a9). Some of them could possibly be traced back to at least ∼4 kya, which indicates that some of the ancestors of today's Slavs (Poles, Czechs, Slovaks, Ukrainians and Russians) inhabited areas of Central and Eastern Europe much earlier than it was estimated on the basis of archaeological and historical data. We also sequenced entire mitochondrial genomes of several non-European lineages (A, C, D, G, L) found in contemporary populations of Poland and Ukraine. The analysis of these haplogroups confirms the presence of Siberian (C5c1, A8a1) and Ashkenazi-specific (L2a1l2a) mtDNA lineages in Slavic populations. Moreover, we were able to pinpoint some lineages which could possibly reflect the relatively recent contacts of Slavs with nomadic Altaic peoples (C4a1a, G2a, D5a2a1a1). PMID:23342138

  14. A molecular phylogeny of Hemiptera inferred from mitochondrial genome sequences.

    PubMed

    Song, Nan; Liang, Ai-Ping; Bu, Cui-Ping

    2012-01-01

    Classically, Hemiptera is comprised of two suborders: Homoptera and Heteroptera. Homoptera includes Cicadomorpha, Fulgoromorpha and Sternorrhyncha. However, according to previous molecular phylogenetic studies based on 18S rDNA, Fulgoromorpha has a closer relationship to Heteroptera than to other hemipterans, leaving Homoptera as paraphyletic. Therefore, the position of Fulgoromorpha is important for studying phylogenetic structure of Hemiptera. We inferred the evolutionary affiliations of twenty-five superfamilies of Hemiptera using mitochondrial protein-coding genes and rRNAs. We sequenced three mitogenomes, from Pyrops candelaria, Lycorma delicatula and Ricania marginalis, representing two additional families in Fulgoromorpha. Pyrops and Lycorma are representatives of an additional major family Fulgoridae in Fulgoromorpha, whereas Ricania is a second representative of the highly derived clade Ricaniidae. The organization and size of these mitogenomes are similar to those of the sequenced fulgoroid species. Our consensus phylogeny of Hemiptera largely supported the relationships (((Fulgoromorpha,Sternorrhyncha),Cicadomorpha),Heteroptera), and thus supported the classic phylogeny of Hemiptera. Selection of optimal evolutionary models (exclusion and inclusion of two rRNA genes or of third codon positions of protein-coding genes) demonstrated that rapidly evolving and saturated sites should be removed from the analyses.

  15. Inference of gorilla demographic and selective history from whole-genome sequence data.

    PubMed

    McManus, Kimberly F; Kelley, Joanna L; Song, Shiya; Veeramah, Krishna R; Woerner, August E; Stevison, Laurie S; Ryder, Oliver A; Ape Genome Project, Great; Kidd, Jeffrey M; Wall, Jeffrey D; Bustamante, Carlos D; Hammer, Michael F

    2015-03-01

    Although population-level genomic sequence data have been gathered extensively for humans, similar data from our closest living relatives are just beginning to emerge. Examination of genomic variation within great apes offers many opportunities to increase our understanding of the forces that have differentially shaped the evolutionary history of hominid taxa. Here, we expand upon the work of the Great Ape Genome Project by analyzing medium to high coverage whole-genome sequences from 14 western lowland gorillas (Gorilla gorilla gorilla), 2 eastern lowland gorillas (G. beringei graueri), and a single Cross River individual (G. gorilla diehli). We infer that the ancestors of western and eastern lowland gorillas diverged from a common ancestor approximately 261 ka, and that the ancestors of the Cross River population diverged from the western lowland gorilla lineage approximately 68 ka. Using a diffusion approximation approach to model the genome-wide site frequency spectrum, we infer a history of western lowland gorillas that includes an ancestral population expansion of 1.4-fold around 970 ka and a recent 5.6-fold contraction in population size 23 ka. The latter may correspond to a major reduction in African equatorial forests around the Last Glacial Maximum. We also analyze patterns of variation among western lowland gorillas to identify several genomic regions with strong signatures of recent selective sweeps. We find that processes related to taste, pancreatic and saliva secretion, sodium ion transmembrane transport, and cardiac muscle function are overrepresented in genomic regions predicted to have experienced recent positive selection.

  16. Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance

    SciTech Connect

    Ahn, Tae-Hyuk; Chai, Juanjuan; Pan, Chongle

    2014-09-29

    Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level identification and quantification of pathogens using their reference genomes based on metagenomic analysis. Results: Sigma provides not only accurate strain-level inferences, but also three unique capabilities: (i) Sigma quantifies the statistical uncertainty of its inferences, which includes hypothesis testing of identified genomes and confidence interval estimation of their relative abundances; (ii) Sigma enables strain variant calling by assigning metagenomic reads to their most likely reference genomes; and (iii) Sigma supports parallel computing for fast analysis of large datasets. In conclusion, the algorithm performance was evaluated using simulated mock communities and fecal samples with spike-in pathogen strains. Availability and Implementation: Sigma was implemented in C++ with source codes and binaries freely available at http://sigma.omicsbio.org.

  17. How do divergent ecological strategies emerge among marine bacterioplankton lineages?

    PubMed

    Luo, Haiwei; Moran, Mary Ann

    2015-09-01

    Heterotrophic bacteria in pelagic marine environments are frequently categorized into two canonical ecological groups: patch-associated and free-living. This framework provides a conceptual basis for understanding bacterial utilization of oceanic organic matter. Some patch-associated bacteria are ecologically linked with eukaryotic phytoplankton, and this observation fits with predicted coincidence of their genome expansion with marine phytoplankton diversification. By contrast, free-living bacteria in today's oceans typically live singly with streamlined metabolic and regulatory functions that allow them to grow in nutrient-poor seawater. Recent analyses of marine Alphaproteobacteria suggest that some free-living bacterioplankton lineages evolved from patch-associated ancestors up to several hundred million years ago. While evolutionary analyses agree with the hypothesis that natural selection has maintained these distinct ecological strategies and genomic traits in present-day populations, they do not rule out a major role for genetic drift in driving ancient ecological switches. These two evolutionary forces may have acted on ocean bacteria at different geological time scales and under different geochemical constraints, with possible implications for future adaptations to a changing ocean. New evolutionary models and genomic data are leading to a more comprehensive understanding of marine bacterioplankton evolutionary history.

  18. Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data.

    PubMed

    O'Brien, John D; Iqbal, Zamin; Wendler, Jason; Amenga-Etego, Lucas

    2016-06-01

    We present a rigorous statistical model that infers the structure of P. falciparum mixtures-including the number of strains present, their proportion within the samples, and the amount of unexplained mixture-using whole genome sequence (WGS) data. Applied to simulation data, artificial laboratory mixtures, and field samples, the model provides reasonable inference with as few as 10 reads or 50 SNPs and works efficiently even with much larger data sets. Source code and example data for the model are provided in an open source fashion. We discuss the possible uses of this model as a window into within-host selection for clinical and epidemiological studies. PMID:27362949

  19. Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data

    PubMed Central

    O’Brien, John D.; Amenga-Etego, Lucas

    2016-01-01

    We present a rigorous statistical model that infers the structure of P. falciparum mixtures—including the number of strains present, their proportion within the samples, and the amount of unexplained mixture—using whole genome sequence (WGS) data. Applied to simulation data, artificial laboratory mixtures, and field samples, the model provides reasonable inference with as few as 10 reads or 50 SNPs and works efficiently even with much larger data sets. Source code and example data for the model are provided in an open source fashion. We discuss the possible uses of this model as a window into within-host selection for clinical and epidemiological studies. PMID:27362949

  20. Inference of population splits and mixtures from genome-wide allele frequency data.

    PubMed

    Pickrell, Joseph K; Pritchard, Jonathan K

    2012-01-01

    Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In our model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data, we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and "ancient" Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com. PMID:23166502

  1. Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data

    PubMed Central

    Pickrell, Joseph K.; Pritchard, Jonathan K.

    2012-01-01

    Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In our model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data, we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and “ancient” Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com. PMID:23166502

  2. Improved genome inference in the MHC using a population reference graph

    PubMed Central

    Dilthey, Alexander; Cox, Charles; Iqbal, Zamin; Nelson, Matthew R.; McVean, Gil

    2015-01-01

    While much is known about human genetic variation, such information is typically ignored in assembling novel genomes. Instead, reads are mapped to a single reference, which can lead to poor characterization of regions of high sequence or structural diversity. We introduce a population reference graph, which combines multiple reference sequences and catalogues of variation. The genomes of novel samples are reconstructed as paths through the graph using an efficient hidden Markov model, allowing for recombination between different haplotypes and additional variants. By applying the method to the 4.5Mb extended MHC region on human chromosome 6, combining eight assembled haplotypes, sequences of known classical HLA alleles and 87,640 SNP variants from the 1000 Genomes Project, we demonstrate, using simulations, SNP genotyping, short-read and long-read data, how the method improves the accuracy of genome inference and reveals regions where the current set of reference sequences is substantially incomplete. PMID:25915597

  3. How to infer relative fitness from a sample of genomic sequences.

    PubMed

    Dayarian, Adel; Shraiman, Boris I

    2014-07-01

    Mounting evidence suggests that natural populations can harbor extensive fitness diversity with numerous genomic loci under selection. It is also known that genealogical trees for populations under selection are quantifiably different from those expected under neutral evolution and described statistically by Kingman's coalescent. While differences in the statistical structure of genealogies have long been used as a test for the presence of selection, the full extent of the information that they contain has not been exploited. Here we demonstrate that the shape of the reconstructed genealogical tree for a moderately large number of random genomic samples taken from a fitness diverse, but otherwise unstructured, asexual population can be used to predict the relative fitness of individuals within the sample. To achieve this we define a heuristic algorithm, which we test in silico, using simulations of a Wright-Fisher model for a realistic range of mutation rates and selection strength. Our inferred fitness ranking is based on a linear discriminator that identifies rapidly coalescing lineages in the reconstructed tree. Inferred fitness ranking correlates strongly with actual fitness, with a genome in the top 10% ranked being in the top 20% fittest with false discovery rate of 0.1-0.3, depending on the mutation/selection parameters. The ranking also enables us to predict the genotypes that future populations inherit from the present one. While the inference accuracy increases monotonically with sample size, samples of 200 nearly saturate the performance. We propose that our approach can be used for inferring relative fitness of genomes obtained in single-cell sequencing of tumors and in monitoring viral outbreaks. PMID:24770330

  4. Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences

    PubMed Central

    Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi

    2015-01-01

    Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094

  5. Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences.

    PubMed

    Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi

    2015-02-23

    Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera.

  6. Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.

    PubMed

    Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas

    2016-09-19

    Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.

  7. Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories

    PubMed Central

    Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas

    2016-01-01

    Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp. PMID:27657141

  8. Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.

    PubMed

    Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas

    2016-01-01

    Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp. PMID:27657141

  9. Inference of gorilla demographic and selective history from whole-genome sequence data.

    PubMed

    McManus, Kimberly F; Kelley, Joanna L; Song, Shiya; Veeramah, Krishna R; Woerner, August E; Stevison, Laurie S; Ryder, Oliver A; Ape Genome Project, Great; Kidd, Jeffrey M; Wall, Jeffrey D; Bustamante, Carlos D; Hammer, Michael F

    2015-03-01

    Although population-level genomic sequence data have been gathered extensively for humans, similar data from our closest living relatives are just beginning to emerge. Examination of genomic variation within great apes offers many opportunities to increase our understanding of the forces that have differentially shaped the evolutionary history of hominid taxa. Here, we expand upon the work of the Great Ape Genome Project by analyzing medium to high coverage whole-genome sequences from 14 western lowland gorillas (Gorilla gorilla gorilla), 2 eastern lowland gorillas (G. beringei graueri), and a single Cross River individual (G. gorilla diehli). We infer that the ancestors of western and eastern lowland gorillas diverged from a common ancestor approximately 261 ka, and that the ancestors of the Cross River population diverged from the western lowland gorilla lineage approximately 68 ka. Using a diffusion approximation approach to model the genome-wide site frequency spectrum, we infer a history of western lowland gorillas that includes an ancestral population expansion of 1.4-fold around 970 ka and a recent 5.6-fold contraction in population size 23 ka. The latter may correspond to a major reduction in African equatorial forests around the Last Glacial Maximum. We also analyze patterns of variation among western lowland gorillas to identify several genomic regions with strong signatures of recent selective sweeps. We find that processes related to taste, pancreatic and saliva secretion, sodium ion transmembrane transport, and cardiac muscle function are overrepresented in genomic regions predicted to have experienced recent positive selection. PMID:25534031

  10. Inference of Gorilla Demographic and Selective History from Whole-Genome Sequence Data

    PubMed Central

    McManus, Kimberly F.; Kelley, Joanna L.; Song, Shiya; Veeramah, Krishna R.; Woerner, August E.; Stevison, Laurie S.; Ryder, Oliver A.; Ape Genome Project, Great; Kidd, Jeffrey M.; Wall, Jeffrey D.; Bustamante, Carlos D.; Hammer, Michael F.

    2015-01-01

    Although population-level genomic sequence data have been gathered extensively for humans, similar data from our closest living relatives are just beginning to emerge. Examination of genomic variation within great apes offers many opportunities to increase our understanding of the forces that have differentially shaped the evolutionary history of hominid taxa. Here, we expand upon the work of the Great Ape Genome Project by analyzing medium to high coverage whole-genome sequences from 14 western lowland gorillas (Gorilla gorilla gorilla), 2 eastern lowland gorillas (G. beringei graueri), and a single Cross River individual (G. gorilla diehli). We infer that the ancestors of western and eastern lowland gorillas diverged from a common ancestor approximately 261 ka, and that the ancestors of the Cross River population diverged from the western lowland gorilla lineage approximately 68 ka. Using a diffusion approximation approach to model the genome-wide site frequency spectrum, we infer a history of western lowland gorillas that includes an ancestral population expansion of 1.4-fold around 970 ka and a recent 5.6-fold contraction in population size 23 ka. The latter may correspond to a major reduction in African equatorial forests around the Last Glacial Maximum. We also analyze patterns of variation among western lowland gorillas to identify several genomic regions with strong signatures of recent selective sweeps. We find that processes related to taste, pancreatic and saliva secretion, sodium ion transmembrane transport, and cardiac muscle function are overrepresented in genomic regions predicted to have experienced recent positive selection. PMID:25534031

  11. Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center

    SciTech Connect

    Kim, Sung-Hou; Shin, Dong Hae; Hou, Jingtong; Chandonia, John-Marc; Das, Debanu; Choi, In-Geol; Kim, Rosalind; Kim, Sung-Hou

    2007-09-02

    Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.

  12. Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

    PubMed Central

    2012-01-01

    Background Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. Results We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. Conclusions Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem. PMID:22759433

  13. Inferring Bottlenecks from Genome-Wide Samples of Short Sequence Blocks.

    PubMed

    Bunnefeld, Lynsey; Frantz, Laurent A F; Lohse, Konrad

    2015-11-01

    The advent of the genomic era has necessitated the development of methods capable of analyzing large volumes of genomic data efficiently. Being able to reliably identify bottlenecks--extreme population size changes of short duration--not only is interesting in the context of speciation and extinction but also matters (as a null model) when inferring selection. Bottlenecks can be detected in polymorphism data via their distorting effect on the shape of the underlying genealogy. Here, we use the generating function of genealogies to derive the probability of mutational configurations in short sequence blocks under a simple bottleneck model. Given a large number of nonrecombining blocks, we can compute maximum-likelihood estimates of the time and strength of the bottleneck. Our method relies on a simple summary of the joint distribution of polymorphic sites. We extend the site frequency spectrum by counting mutations in frequency classes in short sequence blocks. Using linkage information over short distances in this way gives greater power to detect bottlenecks than the site frequency spectrum and potentially opens up a wide range of demographic histories to blockwise inference. Finally, we apply our method to genomic data from a species of pig (Sus cebifrons) endemic to islands in the center and west of the Philippines to estimate whether a bottleneck occurred upon island colonization and compare our scheme to Li and Durbin's pairwise sequentially Markovian coalescent (PSMC) both for the pig data and using simulations.

  14. Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance

    DOE PAGES

    Ahn, Tae-Hyuk; Chai, Juanjuan; Pan, Chongle

    2014-09-29

    Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level identification and quantification of pathogens using their reference genomes based on metagenomic analysis. Results: Sigma provides not only accurate strain-level inferences, but also three unique capabilities: (i) Sigma quantifies the statistical uncertainty of its inferences, which includes hypothesis testing of identified genomes and confidence interval estimation of their relative abundances; (ii) Sigma enables strain variant calling by assigning metagenomic readsmore » to their most likely reference genomes; and (iii) Sigma supports parallel computing for fast analysis of large datasets. In conclusion, the algorithm performance was evaluated using simulated mock communities and fecal samples with spike-in pathogen strains. Availability and Implementation: Sigma was implemented in C++ with source codes and binaries freely available at http://sigma.omicsbio.org.« less

  15. Inferring Bottlenecks from Genome-Wide Samples of Short Sequence Blocks.

    PubMed

    Bunnefeld, Lynsey; Frantz, Laurent A F; Lohse, Konrad

    2015-11-01

    The advent of the genomic era has necessitated the development of methods capable of analyzing large volumes of genomic data efficiently. Being able to reliably identify bottlenecks--extreme population size changes of short duration--not only is interesting in the context of speciation and extinction but also matters (as a null model) when inferring selection. Bottlenecks can be detected in polymorphism data via their distorting effect on the shape of the underlying genealogy. Here, we use the generating function of genealogies to derive the probability of mutational configurations in short sequence blocks under a simple bottleneck model. Given a large number of nonrecombining blocks, we can compute maximum-likelihood estimates of the time and strength of the bottleneck. Our method relies on a simple summary of the joint distribution of polymorphic sites. We extend the site frequency spectrum by counting mutations in frequency classes in short sequence blocks. Using linkage information over short distances in this way gives greater power to detect bottlenecks than the site frequency spectrum and potentially opens up a wide range of demographic histories to blockwise inference. Finally, we apply our method to genomic data from a species of pig (Sus cebifrons) endemic to islands in the center and west of the Philippines to estimate whether a bottleneck occurred upon island colonization and compare our scheme to Li and Durbin's pairwise sequentially Markovian coalescent (PSMC) both for the pig data and using simulations. PMID:26341659

  16. RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach

    SciTech Connect

    Novichkov, Pavel S.; Rodionov, Dmitry A.; Stavrovskaya, Elena D.; Novichkova, Elena S.; Kazakov, Alexey E.; Gelfand, Mikhail S.; Arkin, Adam P.; Mironov, Andrey A.; Dubchak, Inna

    2010-05-26

    RegPredict web server is designed to provide comparative genomics tools for reconstruction and analysis of microbial regulons using comparative genomics approach. The server allows the user to rapidly generate reference sets of regulons and regulatory motif profiles in a group of prokaryotic genomes. The new concept of a cluster of co-regulated orthologous operons allows the user to distribute the analysis of large regulons and to perform the comparative analysis of multiple clusters independently. Two major workflows currently implemented in RegPredict are: (i) regulon reconstruction for a known regulatory motif and (ii) ab initio inference of a novel regulon using several scenarios for the generation of starting gene sets. RegPredict provides a comprehensive collection of manually curated positional weight matrices of regulatory motifs. It is based on genomic sequences, ortholog and operon predictions from the MicrobesOnline. An interactive web interface of RegPredict integrates and presents diverse genomic and functional information about the candidate regulon members from several web resources. RegPredict is freely accessible at http://regpredict.lbl.gov.

  17. Inferring Where and When Replication Initiates from Genome-Wide Replication Timing Data

    NASA Astrophysics Data System (ADS)

    Baker, A.; Audit, B.; Yang, S. C.-H.; Bechhoefer, J.; Arneodo, A.

    2012-06-01

    Based on an analogy between DNA replication and one dimensional nucleation-and-growth processes, various attempts to infer the local initiation rate I(x,t) of DNA replication origins from replication timing data have been developed in the framework of phase transition kinetics theories. These works have all used curve-fit strategies to estimate I(x,t) from genome-wide replication timing data. Here, we show how to invert analytically the Kolmogorov-Johnson-Mehl-Avrami model and extract I(x,t) directly. Tests on both simulated and experimental budding-yeast data confirm the location and firing-time distribution of replication origins.

  18. Metatranscriptome of marine bacterioplankton during winter time in the North Sea assessed by total RNA sequencing.

    PubMed

    Kopf, Anna; Kostadinov, Ivaylo; Wichels, Antje; Quast, Christian; Glöckner, Frank Oliver

    2015-02-01

    Marine metatranscriptome data was generated as part of a study investigating the bacterioplankton communities towards the end of a diatom-dominated spring phytoplankton bloom. This genomic resource article reports a metatranscriptomic dataset from amidst the winter time prior to the occurrence of the spring diatom bloom. Up to 58% of all sequences could be assigned to predicted genes. Taxonomic analysis based on expressed 16S ribosomal RNA genes identified Alphaproteobacteria and Gammaproteobacteria as the most active community members.

  19. Genetic diversity in Sargasso Sea bacterioplankton.

    PubMed

    Giovannoni, S J; Britschgi, T B; Moyer, C L; Field, K G

    1990-05-01

    Bacterioplankton are recognized as important agents of biogeochemical change in marine ecosystems, yet relatively little is known about the species that make up these communities. Uncertainties about the genetic structure and diversity of natural bacterioplankton populations stem from the traditional difficulties associated with microbial cultivation techniques. Discrepancies between direct counts and plate counts are typically several orders of magnitude, raising doubts as to whether cultivated marine bacteria are actually representative of dominant planktonic species. We have phylogenetically analysed clone libraries of eubacterial 16S ribosomal RNA genes amplified from natural populations of Sargasso Sea picoplankton by the polymerase chain reaction. The analysis indicates the presence of a novel microbial group, the SAR11 cluster, which appears to be a significant component of this oligotrophic bacterioplankton community. A second cluster of lineages related to the oxygenic phototrophs--cyanobacteria, prochlorophytes and chloroplasts--was also observed. However, none of the genes matched the small subunit rRNA sequences of cultivated marine cyanobacteria from similar habitats. The diversity of 16S rRNA genes observed within the clusters suggests that these bacterioplankton may be consortia of independent lineages sharing surprisingly distant common ancestors.

  20. Inferring human population size and separation history from multiple genome sequences

    PubMed Central

    Schiffels, Stephan; Durbin, Richard

    2014-01-01

    The availability of complete human genome sequences from populations across the world has given rise to new population genetic inference methods that explicitly model their ancestral relationship under recombination and mutation. So far, application of these methods to evolutionary history more recent than 20-30 thousand years ago and to population separations has been limited. Here we present a new method that overcomes these shortcomings. The Multiple Sequentially Markovian Coalescent (MSMC) analyses the observed pattern of mutations in multiple individuals, focusing on the first coalescence between any two individuals. Results from applying MSMC to genome sequences from nine populations across the world suggest that the genetic separation of non-African ancestors from African Yoruban ancestors started long before 50,000 years ago, and give information about human population history as recently as 2,000 years ago, including the bottleneck in the peopling of the Americas, and separations within Africa, East Asia and Europe. PMID:24952747

  1. Proteomics-inferred genome typing (PIGT) demonstrates inter-populationrecombination as a strategy for environmental adaptation

    SciTech Connect

    Denef, Vincent; Verberkmoes, Nathan C; Shah, Manesh B; Abraham, Paul E; Lefsrud, Mark G; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    Analyses of ecological and evolutionary processes that shape microbial consortia are facilitated by comprehensive studies of ecosystems with low species richness. In the current study we evaluated the role of recombination in altering the fitness of chemoautotrophic bacteria in their natural environment. Proteomics-inferred genome typing (PIGT) was used to determine the genomic make-up of Leptospirillum group II populations in 27 biofilms sampled from six locations in the Richmond Mine acid mine drainage system (Iron Mountain, CA) over a four-year period. We observed six distinct genotypes that are recombinants comprised of segments from two parental genotypes. Community genomic analyses revealed additional low abundance recombinant variants. The dominance of some genotypes despite a larger available genome pool, and patterns of spatiotemporal distribution within the ecosystem, indicate selection for distinct recombinants. Genes involved in motility, signal transduction and transport were overrepresented in the tens to hundreds of kilobase recombinant blocks, whereas core metabolic functions were significantly underrepresented. Our findings demonstrate the power of PIGT and reveal that recombination is a mechanism for fine-scale adaptation in this system.

  2. Joint modeling of RNase footprint sequencing profiles for genome-wide inference of RNA structure

    PubMed Central

    Zou, Chenchen; Ouyang, Zhengqing

    2015-01-01

    Recent studies have revealed significant roles of RNA structure in almost every step of RNA processing, including transcription, splicing, transport and translation. RNase footprint sequencing (RNase-seq) has emerged to dissect RNA structures at the genome scale. However, it remains challenging to analyze RNase-seq data because of the issues of signal sparsity, variability and correlations among various RNases. We present a probabilistic framework, joint Poisson-gamma mixture (JPGM), for integrative modeling of multiple RNase-seq profiles. Combining JPGM with hidden Markov model allows genome-wide inference of RNA structures. We apply the joint modeling approach for inferring base pairing states on simulated data sets and RNase-seq profiles of the double-strand specific RNase V1 and single-strand specific RNase S1 in yeast. We demonstrate that joint analysis of V1 and S1 profiles outputs interpretable RNA structure states, while approaches that analyze each profile separately do not. The joint modeling approach predicts the structure states of all nucleotides in 3196 transcripts of yeast without compromising accuracy, while the simple thresholding approach misses 43% of the nucleotides. Furthermore, the posterior probabilities outputted by our model are able to resolve the structural ambiguity of ≈300 000 nucleotides with overlapping V1 and S1 cleavage sites. Our model also generates RNA accessibilities, which are associated with three-dimensional conformations. PMID:26400167

  3. High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs

    PubMed Central

    Dilthey, Alexander T.; Gourraud, Pierre-Antoine; McVean, Gil

    2016-01-01

    Genetic variation at the Human Leucocyte Antigen (HLA) genes is associated with many autoimmune and infectious disease phenotypes, is an important element of the immunological distinction between self and non-self, and shapes immune epitope repertoires. Determining the allelic state of the HLA genes (HLA typing) as a by-product of standard whole-genome sequencing data would therefore be highly desirable and enable the immunogenetic characterization of samples in currently ongoing population sequencing projects. Extensive hyperpolymorphism and sequence similarity between the HLA genes, however, pose problems for accurate read mapping and make HLA type inference from whole-genome sequencing data a challenging problem. We describe how to address these challenges in a Population Reference Graph (PRG) framework. First, we construct a PRG for 46 (mostly HLA) genes and pseudogenes, their genomic context and their characterized sequence variants, integrating a database of over 10,000 known allele sequences. Second, we present a sequence-to-PRG paired-end read mapping algorithm that enables accurate read mapping for the HLA genes. Third, we infer the most likely pair of underlying alleles at G group resolution from the IMGT/HLA database at each locus, employing a simple likelihood framework. We show that HLA*PRG, our algorithm, outperforms existing methods by a wide margin. We evaluate HLA*PRG on six classical class I and class II HLA genes (HLA-A, -B, -C, -DQA1, -DQB1, -DRB1) and on a set of 14 samples (3 samples with 2 x 100bp, 11 samples with 2 x 250bp Illumina HiSeq data). Of 158 alleles tested, we correctly infer 157 alleles (99.4%). We also identify and re-type two erroneous alleles in the original validation data. We conclude that HLA*PRG for the first time achieves accuracies comparable to gold-standard reference methods from standard whole-genome sequencing data, though high computational demands (currently ~30–250 CPU hours per sample) remain a significant

  4. Inferring causal genomic alterations in breast cancer using gene expression data

    PubMed Central

    2011-01-01

    Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811

  5. The Phylogeny and Evolutionary Timescale of Muscoidea (Diptera: Brachycera: Calyptratae) Inferred from Mitochondrial Genomes

    PubMed Central

    Wang, Ning; Cameron, Stephen L.; Mao, Meng; Wang, Yuyu; Xi, Yuqiang; Yang, Ding

    2015-01-01

    Muscoidea is a significant dipteran clade that includes house flies (Family Muscidae), latrine flies (F. Fannidae), dung flies (F. Scathophagidae) and root maggot flies (F. Anthomyiidae). It is comprised of approximately 7000 described species. The monophyly of the Muscoidea and the precise relationships of muscoids to the closest superfamily the Oestroidea (blow flies, flesh flies etc) are both unresolved. Until now mitochondrial (mt) genomes were available for only two of the four muscoid families precluding a thorough test of phylogenetic relationships using this data source. Here we present the first two mt genomes for the families Fanniidae (Euryomma sp.) (family Fanniidae) and Anthomyiidae (Delia platura (Meigen, 1826)). We also conducted phylogenetic analyses containing of these newly sequenced mt genomes plus 15 other species representative of dipteran diversity to address the internal relationship of Muscoidea and its systematic position. Both maximum-likelihood and Bayesian analyses suggested that Muscoidea was not a monophyletic group with the relationship: (Fanniidae + Muscidae) + ((Anthomyiidae + Scathophagidae) + (Calliphoridae + Sarcophagidae)), supported by the majority of analysed datasets. This also infers that Oestroidea was paraphyletic in the majority of analyses. Divergence time estimation suggested that the earliest split within the Calyptratae, separating (Tachinidae + Oestridae) from the remaining families, occurred in the Early Eocene. The main divergence within the paraphyletic muscoidea grade was between Fanniidae + Muscidae and the lineage ((Anthomyiidae + Scathophagidae) + (Calliphoridae + Sarcophagidae)) which occurred in the Late Eocene. PMID:26225760

  6. Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations

    PubMed Central

    Zhou, Meng; Cheng, Liang; Yang, Haixiu; Wang, Jing; Sun, Jie; Wang, Zhenzhen

    2016-01-01

    MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD. PMID:26849207

  7. Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations.

    PubMed

    Shi, Hongbo; Zhang, Guangde; Zhou, Meng; Cheng, Liang; Yang, Haixiu; Wang, Jing; Sun, Jie; Wang, Zhenzhen

    2016-01-01

    MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD.

  8. Integration of Multiple Genomic and Phenotype Data to Infer Novel miRNA-Disease Associations.

    PubMed

    Shi, Hongbo; Zhang, Guangde; Zhou, Meng; Cheng, Liang; Yang, Haixiu; Wang, Jing; Sun, Jie; Wang, Zhenzhen

    2016-01-01

    MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed. However, individual source of genomic data tends to be incomplete and noisy; therefore, the integration of various types of genomic data for inferring reliable miRNA-disease associations is urgently needed. In this study, we present a computational framework, CHNmiRD, for identifying miRNA-disease associations by integrating multiple genomic and phenotype data, including protein-protein interaction data, gene ontology data, experimentally verified miRNA-target relationships, disease phenotype information and known miRNA-disease connections. The performance of CHNmiRD was evaluated by experimentally verified miRNA-disease associations, which achieved an area under the ROC curve (AUC) of 0.834 for 5-fold cross-validation. In particular, CHNmiRD displayed excellent performance for diseases without any known related miRNAs. The results of case studies for three human diseases (glioblastoma, myocardial infarction and type 1 diabetes) showed that all of the top 10 ranked miRNAs having no known associations with these three diseases in existing miRNA-disease databases were directly or indirectly confirmed by our latest literature mining. All these results demonstrated the reliability and efficiency of CHNmiRD, and it is anticipated that CHNmiRD will serve as a powerful bioinformatics method for mining novel disease-related miRNAs and providing a new perspective into molecular mechanisms underlying human diseases at the post-transcriptional level. CHNmiRD is freely available at http://www.bio-bigdata.com/CHNmiRD. PMID:26849207

  9. Co-occurrence Analysis of Microbial Taxa in the Atlantic Ocean Reveals High Connectivity in the Free-Living Bacterioplankton

    PubMed Central

    Milici, Mathias; Deng, Zhi-Luo; Tomasch, Jürgen; Decelle, Johan; Wos-Oxley, Melissa L.; Wang, Hui; Jáuregui, Ruy; Plumeier, Iris; Giebel, Helge-Ansgar; Badewien, Thomas H.; Wurst, Mascha; Pieper, Dietmar H.; Simon, Meinhard; Wagner-Döbler, Irene

    2016-01-01

    We determined the taxonomic composition of the bacterioplankton of the epipelagic zone of the Atlantic Ocean along a latitudinal transect (51°S–47°N) using Illumina sequencing of the V5-V6 region of the 16S rRNA gene and inferred co-occurrence networks. Bacterioplankon community composition was distinct for Longhurstian provinces and water depth. Free-living microbial communities (between 0.22 and 3 μm) were dominated by highly abundant and ubiquitous taxa with streamlined genomes (e.g., SAR11, SAR86, OM1, Prochlorococcus) and could clearly be separated from particle-associated communities which were dominated by Bacteroidetes, Planktomycetes, Verrucomicrobia, and Roseobacters. From a total of 369 different communities we then inferred co-occurrence networks for each size fraction and depth layer of the plankton between bacteria and between bacteria and phototrophic micro-eukaryotes. The inferred networks showed a reduction of edges in the deepest layer of the photic zone. Networks comprised of free-living bacteria had a larger amount of connections per OTU when compared to the particle associated communities throughout the water column. Negative correlations accounted for roughly one third of the total edges in the free-living communities at all depths, while they decreased with depth in the particle associated communities where they amounted for roughly 10% of the total in the last part of the epipelagic zone. Co-occurrence networks of bacteria with phototrophic micro-eukaryotes were not taxon-specific, and dominated by mutual exclusion (~60%). The data show a high degree of specialization to micro-environments in the water column and highlight the importance of interdependencies particularly between free-living bacteria in the upper layers of the epipelagic zone. PMID:27199970

  10. Co-occurrence Analysis of Microbial Taxa in the Atlantic Ocean Reveals High Connectivity in the Free-Living Bacterioplankton.

    PubMed

    Milici, Mathias; Deng, Zhi-Luo; Tomasch, Jürgen; Decelle, Johan; Wos-Oxley, Melissa L; Wang, Hui; Jáuregui, Ruy; Plumeier, Iris; Giebel, Helge-Ansgar; Badewien, Thomas H; Wurst, Mascha; Pieper, Dietmar H; Simon, Meinhard; Wagner-Döbler, Irene

    2016-01-01

    We determined the taxonomic composition of the bacterioplankton of the epipelagic zone of the Atlantic Ocean along a latitudinal transect (51°S-47°N) using Illumina sequencing of the V5-V6 region of the 16S rRNA gene and inferred co-occurrence networks. Bacterioplankon community composition was distinct for Longhurstian provinces and water depth. Free-living microbial communities (between 0.22 and 3 μm) were dominated by highly abundant and ubiquitous taxa with streamlined genomes (e.g., SAR11, SAR86, OM1, Prochlorococcus) and could clearly be separated from particle-associated communities which were dominated by Bacteroidetes, Planktomycetes, Verrucomicrobia, and Roseobacters. From a total of 369 different communities we then inferred co-occurrence networks for each size fraction and depth layer of the plankton between bacteria and between bacteria and phototrophic micro-eukaryotes. The inferred networks showed a reduction of edges in the deepest layer of the photic zone. Networks comprised of free-living bacteria had a larger amount of connections per OTU when compared to the particle associated communities throughout the water column. Negative correlations accounted for roughly one third of the total edges in the free-living communities at all depths, while they decreased with depth in the particle associated communities where they amounted for roughly 10% of the total in the last part of the epipelagic zone. Co-occurrence networks of bacteria with phototrophic micro-eukaryotes were not taxon-specific, and dominated by mutual exclusion (~60%). The data show a high degree of specialization to micro-environments in the water column and highlight the importance of interdependencies particularly between free-living bacteria in the upper layers of the epipelagic zone.

  11. TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.

    PubMed

    Ha, Gavin; Roth, Andrew; Khattra, Jaswinder; Ho, Julie; Yap, Damian; Prentice, Leah M; Melnyk, Nataliya; McPherson, Andrew; Bashashati, Ali; Laks, Emma; Biele, Justina; Ding, Jiarui; Le, Alan; Rosner, Jamie; Shumansky, Karey; Marra, Marco A; Gilks, C Blake; Huntsman, David G; McAlpine, Jessica N; Aparicio, Samuel; Shah, Sohrab P

    2014-11-01

    The evolution of cancer genomes within a single tumor creates mixed cell populations with divergent somatic mutational landscapes. Inference of tumor subpopulations has been disproportionately focused on the assessment of somatic point mutations, whereas computational methods targeting evolutionary dynamics of copy number alterations (CNA) and loss of heterozygosity (LOH) in whole-genome sequencing data remain underdeveloped. We present a novel probabilistic model, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event. We evaluate TITAN on idealized mixtures, simulating clonal populations from whole-genome sequences taken from genomically heterogeneous ovarian tumor sites collected from the same patient. In addition, we show in 23 whole genomes of breast tumors that the inference of CNA and LOH using TITAN critically informs population structure and the nature of the evolving cancer genome. Finally, we experimentally validated subclonal predictions using fluorescence in situ hybridization (FISH) and single-cell sequencing from an ovarian cancer patient sample, thereby recapitulating the key modeling assumptions of TITAN. PMID:25060187

  12. TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data

    PubMed Central

    Roth, Andrew; Khattra, Jaswinder; Ho, Julie; Yap, Damian; Prentice, Leah M.; Melnyk, Nataliya; McPherson, Andrew; Bashashati, Ali; Laks, Emma; Biele, Justina; Ding, Jiarui; Le, Alan; Rosner, Jamie; Shumansky, Karey; Marra, Marco A.; Gilks, C. Blake; Huntsman, David G.; McAlpine, Jessica N.; Aparicio, Samuel

    2014-01-01

    The evolution of cancer genomes within a single tumor creates mixed cell populations with divergent somatic mutational landscapes. Inference of tumor subpopulations has been disproportionately focused on the assessment of somatic point mutations, whereas computational methods targeting evolutionary dynamics of copy number alterations (CNA) and loss of heterozygosity (LOH) in whole-genome sequencing data remain underdeveloped. We present a novel probabilistic model, TITAN, to infer CNA and LOH events while accounting for mixtures of cell populations, thereby estimating the proportion of cells harboring each event. We evaluate TITAN on idealized mixtures, simulating clonal populations from whole-genome sequences taken from genomically heterogeneous ovarian tumor sites collected from the same patient. In addition, we show in 23 whole genomes of breast tumors that the inference of CNA and LOH using TITAN critically informs population structure and the nature of the evolving cancer genome. Finally, we experimentally validated subclonal predictions using fluorescence in situ hybridization (FISH) and single-cell sequencing from an ovarian cancer patient sample, thereby recapitulating the key modeling assumptions of TITAN. PMID:25060187

  13. Genomic heterogeneity of historical gene flow between two species of newts inferred from transcriptome data.

    PubMed

    Stuglik, Michał T; Babik, Wiesław

    2016-07-01

    The role of gene flow in species formation is a major unresolved issue in speciation biology. Progress in this area requires information on the long-term patterns of gene flow between diverging species. Here, we used thousands of single-nucleotide polymorphisms derived from transcriptome resequencing and a method modeling the joint frequency spectrum of these polymorphisms to reconstruct patterns of historical gene flow between two Lissotriton newts: L. vulgaris (Lv) and L. montandoni (Lm). We tested several models of divergence including complete isolation and various scenarios of historical gene flow. The model of secondary contact received the highest support. According to this model, the species split from their common ancestor ca. 5.5 million years (MY) ago, evolved in isolation for ca. 2 MY, and have been exchanging genes for the last 3.5 MY Demographic changes have been inferred in both species, with the current effective population size of ca. 0.7 million in Lv and 0.2 million in Lm. The postdivergence gene flow resulted in two-directional introgression which affected the genomes of both species, but was more pronounced from Lv to Lm. Interestingly, we found evidence for genomic heterogeneity of interspecific gene flow. This study demonstrates the complexity of long-term gene flow between distinct but incompletely reproductively isolated taxa which divergence was initiated millions of years ago. PMID:27386093

  14. ecoPrimers: inference of new DNA barcode markers from whole genome sequence analysis

    PubMed Central

    Riaz, Tiayyba; Shehzad, Wasim; Viari, Alain; Pompanon, François; Taberlet, Pierre; Coissac, Eric

    2011-01-01

    Using non-conventional markers, DNA metabarcoding allows biodiversity assessment from complex substrates. In this article, we present ecoPrimers, a software for identifying new barcode markers and their associated PCR primers. ecoPrimers scans whole genomes to find such markers without a priori knowledge. ecoPrimers optimizes two quality indices measuring taxonomical range and discrimination to select the most efficient markers from a set of reference sequences, according to specific experimental constraints such as marker length or specifically targeted taxa. The key step of the algorithm is the identification of conserved regions among reference sequences for anchoring primers. We propose an efficient algorithm based on data mining, that allows the analysis of huge sets of sequences. We evaluate the efficiency of ecoPrimers by running it on three different sequence sets: mitochondrial, chloroplast and bacterial genomes. Identified barcode markers correspond either to barcode regions already in use for plants or animals, or to new potential barcodes. Results from empirical experiments carried out on a promising new barcode for analyzing vertebrate diversity fully agree with expectations based on bioinformatics analysis. These tests demonstrate the efficiency of ecoPrimers for inferring new barcodes fitting with diverse experimental contexts. ecoPrimers is available as an open source project at: http://www.grenoble.prabi.fr/trac/ecoPrimers. PMID:21930509

  15. Inferring Quantitative Trait Pathways Associated with Bull Fertility from a Genome-Wide Association Study

    PubMed Central

    Peñagaricano, Francisco; Weigel, Kent A.; Rosa, Guilherme J. M.; Khatib, Hasan

    2013-01-01

    Whole-genome association studies typically focus on genetic markers with the strongest evidence of association. However, single markers often explain only a small component of the genetic variance and hence offer a limited understanding of the trait under study. As such, the objective of this study was to perform a pathway-based association analysis in Holstein dairy cattle in order to identify relevant pathways involved in bull fertility. The results of a single-marker association analysis, using 1,755 bulls with sire conception rate data and genotypes for 38,650 single nucleotide polymorphisms (SNPs), were used in this study. A total of 16,819 annotated genes, including 2,767 significantly associated with bull fertility, were used to interrogate a total of 662 Gene Ontology (GO) terms and 248 InterPro (IP) entries using a test of proportions based on the cumulative hypergeometric distribution. After multiple-testing correction, 20 GO categories and one IP entry showed significant overrepresentation of genes statistically associated with bull fertility. Several of these functional categories such as small GTPases mediated signal transduction, neurogenesis, calcium ion binding, and cytoskeleton are known to be involved in biological processes closely related to male fertility. These results could provide insight into the genetic architecture of this complex trait in dairy cattle. In addition, this study shows that quantitative trait pathways inferred from single-marker analyses could enhance our interpretations of the results of genome-wide association studies. PMID:23335935

  16. The root of the mammalian tree inferred from whole mitochondrial genomes.

    PubMed

    Phillips, Matthew J; Penny, David

    2003-08-01

    Morphological and molecular data are currently contradictory over the position of monotremes with respect to marsupial and placental mammals. As part of a re-evaluation of both forms of data we examine complete mitochondrial genomes in more detail. There is a particularly large discrepancy in the frequencies of thymine and cytosine (T-C) between mitochondrial genomes that appears to affect some deep divergences in the mammalian tree. We report that recoding nucleotides to RY-characters, and partitioning maximum-likelihood analyses among subsets of data reduces such biases, and improves the fit of models to the data, respectively. RY-coding also increases the signal on the internal branches relative to external, and thus increases the phylogenetic signal. In contrast to previous analyses of mitochondrial data, our analyses favor Theria (marsupials plus placentals) over Marsupionta (monotremes plus marsupials). However, a short therian stem lineage is inferred, which is at variance with the traditionally deep placement of monotremes on morphological data. PMID:12878457

  17. Genomic heterogeneity of historical gene flow between two species of newts inferred from transcriptome data.

    PubMed

    Stuglik, Michał T; Babik, Wiesław

    2016-07-01

    The role of gene flow in species formation is a major unresolved issue in speciation biology. Progress in this area requires information on the long-term patterns of gene flow between diverging species. Here, we used thousands of single-nucleotide polymorphisms derived from transcriptome resequencing and a method modeling the joint frequency spectrum of these polymorphisms to reconstruct patterns of historical gene flow between two Lissotriton newts: L. vulgaris (Lv) and L. montandoni (Lm). We tested several models of divergence including complete isolation and various scenarios of historical gene flow. The model of secondary contact received the highest support. According to this model, the species split from their common ancestor ca. 5.5 million years (MY) ago, evolved in isolation for ca. 2 MY, and have been exchanging genes for the last 3.5 MY Demographic changes have been inferred in both species, with the current effective population size of ca. 0.7 million in Lv and 0.2 million in Lm. The postdivergence gene flow resulted in two-directional introgression which affected the genomes of both species, but was more pronounced from Lv to Lm. Interestingly, we found evidence for genomic heterogeneity of interspecific gene flow. This study demonstrates the complexity of long-term gene flow between distinct but incompletely reproductively isolated taxa which divergence was initiated millions of years ago.

  18. PICARA, an analytical pipeline providing probabilistic inference about a priori candidates genes underlying genome-wide association QTL in plants

    Technology Transfer Automated Retrieval System (TEKTRAN)

    PICARA is an analytical pipeline designed to systematically summarize observed SNP/trait associations identified by genome wide association studies (GWAS) and to identify candidate genes involved in the regulation of complex trait variation. The pipeline provides probabilistic inference about a prio...

  19. The evolutionary history of termites as inferred from 66 mitochondrial genomes.

    PubMed

    Bourguignon, Thomas; Lo, Nathan; Cameron, Stephen L; Šobotník, Jan; Hayashi, Yoshinobu; Shigenobu, Shuji; Watanabe, Dai; Roisin, Yves; Miura, Toru; Evans, Theodore A

    2015-02-01

    Termites have colonized many habitats and are among the most abundant animals in tropical ecosystems, which they modify considerably through their actions. The timing of their rise in abundance and of the dispersal events that gave rise to modern termite lineages is not well understood. To shed light on termite origins and diversification, we sequenced the mitochondrial genome of 48 termite species and combined them with 18 previously sequenced termite mitochondrial genomes for phylogenetic and molecular clock analyses using multiple fossil calibrations. The 66 genomes represent most major clades of termites. Unlike previous phylogenetic studies based on fewer molecular data, our phylogenetic tree is fully resolved for the lower termites. The phylogenetic positions of Macrotermitinae and Apicotermitinae are also resolved as the basal groups in the higher termites, but in the crown termitid groups, including Termitinae + Syntermitinae + Nasutitermitinae + Cubitermitinae, the position of some nodes remains uncertain. Our molecular clock tree indicates that the lineages leading to termites and Cryptocercus roaches diverged 170 Ma (153-196 Ma 95% confidence interval [CI]), that modern Termitidae arose 54 Ma (46-66 Ma 95% CI), and that the crown termitid group arose 40 Ma (35-49 Ma 95% CI). This indicates that the distribution of basal termite clades was influenced by the final stages of the breakup of Pangaea. Our inference of ancestral geographic ranges shows that the Termitidae, which includes more than 75% of extant termite species, most likely originated in Africa or Asia, and acquired their pantropical distribution after a series of dispersal and subsequent diversification events.

  20. Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families

    PubMed Central

    Wittenburg, Dörte; Teuscher, Friedrich; Klosa, Jan; Reinsch, Norbert

    2016-01-01

    In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10–22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes. PMID:27402363

  1. Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families.

    PubMed

    Wittenburg, Dörte; Teuscher, Friedrich; Klosa, Jan; Reinsch, Norbert

    2016-09-08

    In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10-22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes.

  2. Comparative analysis of mitochondrial genomes in Diplura (hexapoda, arthropoda): taxon sampling is crucial for phylogenetic inferences.

    PubMed

    Chen, Wan-Jun; Koch, Markus; Mallatt, Jon M; Luan, Yun-Xia

    2014-01-01

    Two-pronged bristletails (Diplura) are traditionally classified into three major superfamilies: Campodeoidea, Projapygoidea, and Japygoidea. The interrelationships of these three superfamilies and the monophyly of Diplura have been much debated. Few previous studies included Projapygoidea in their phylogenetic considerations, and its position within Diplura still is a puzzle from both morphological and molecular points of view. Until now, no mitochondrial genome has been sequenced for any projapygoid species. To fill in this gap, we determined and annotated the complete mitochondrial genome of Octostigma sinensis (Octostigmatidae, Projapygoidea), and of three more dipluran species, one each from the Campodeidae, Parajapygidae, and Japygidae. All four newly sequenced dipluran mtDNAs encode the same set of genes in the same gene order as shared by most crustaceans and hexapods. Secondary structure truncations have occurred in trnR, trnC, trnS1, and trnS2, and the reduction of transfer RNA D-arms was found to be taxonomically correlated, with Campodeoidea having experienced the most reduction. Partitioned phylogenetic analyses, based on both amino acids and nucleotides of the protein-coding genes plus the ribosomal RNA genes, retrieve significant support for a monophyletic Diplura within Pancrustacea, with Projapygoidea more closely related to Campodeoidea than to Japygoidea. Another key finding is that monophyly of Diplura cannot be recovered unless Projapygoidea is included in the phylogenetic analyses; this explains the dipluran polyphyly found by past mitogenomic studies. Including Projapygoidea increased the sampling density within Diplura and probably helped by breaking up a long-branch-attraction artifact. This finding provides an example of how proper sampling is significant for phylogenetic inference.

  3. Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data

    PubMed Central

    Bhaskar, Anand; Wang, Y.X. Rachel; Song, Yun S.

    2015-01-01

    With the recent increase in study sample sizes in human genetics, there has been growing interest in inferring historical population demography from genomic variation data. Here, we present an efficient inference method that can scale up to very large samples, with tens or hundreds of thousands of individuals. Specifically, by utilizing analytic results on the expected frequency spectrum under the coalescent and by leveraging the technique of automatic differentiation, which allows us to compute gradients exactly, we develop a very efficient algorithm to infer piecewise-exponential models of the historical effective population size from the distribution of sample allele frequencies. Our method is orders of magnitude faster than previous demographic inference methods based on the frequency spectrum. In addition to inferring demography, our method can also accurately estimate locus-specific mutation rates. We perform extensive validation of our method on simulated data and show that it can accurately infer multiple recent epochs of rapid exponential growth, a signal that is difficult to pick up with small sample sizes. Lastly, we use our method to analyze data from recent sequencing studies, including a large-sample exome-sequencing data set of tens of thousands of individuals assayed at a few hundred genic regions. PMID:25564017

  4. Photoheterotrophy of bacterioplankton is ubiquitous in the surface oligotrophic ocean

    NASA Astrophysics Data System (ADS)

    Evans, Claire; Gómez-Pereira, Paola R.; Martin, Adrian P.; Scanlan, David J.; Zubkov, Mikhail V.

    2015-06-01

    Accurate measurements in the Southern Hemisphere were obtained to test a hypothesis of the ubiquity of photoheterotrophy in the oligotrophic ocean. We present experimental results of light-enhanced uptake of methionine, leucine and ATP by bacterioplankton during two large-scale transects of the South Atlantic. Light increased the uptake of substrates by both dominant bacterioplankton groups, Prochlorococcus and SAR11, as well as for the bulk microbial community. Our consistent experimental evidence strongly indicates that photoheterotrophy is characteristic of dominant bacterioplankton populations in the global oligotrophic ocean.

  5. Genome Size Variation and Species Relationships in Hieracium Sub-genus Pilosella (Asteraceae) as Inferred by Flow Cytometry

    PubMed Central

    Suda, Jan; Krahulcová, Anna; Trávníček, Pavel; Rosenbaumová, Radka; Peckert, Tomáš; Krahulec, František

    2007-01-01

    Background and Aims Hieracium sub-genus Pilosella (hawkweeds) is a taxonomically complicated group of vascular plants, the structure of which is substantially influenced by frequent interspecific hybridization and polyploidization. Two kinds of species, ‘basic’ and ‘intermediate’ (i.e. hybridogenous), are usually recognized. In this study, genome size variation was investigated in a representative set of Central European hawkweeds in order to assess the value of such a data set for species delineation and inference of evolutionary relationships. Methods Holoploid and monoploid genome sizes (C- and Cx-values) were determined using propidium iodide flow cytometry for 376 homogeneously cultivated individuals of Hieracium sub-genus Pilosella, including 24 species (271 individuals), five recent natural hybrids (seven individuals) and experimental F1 hybrids from four parental combinations (98 individuals). Chromosome counts were available for more than half of the plant accessions. Base composition (proportion of AT/GC bases) was cytometrically estimated in 73 individuals. Key Results Seven different ploidy levels (2x–8x) were detected, with intraspecific ploidy polymorphism (up to four different cytotypes) occurring in 11 wild species. Mean 2C-values varied approx. 4·3-fold from 3·53 pg in diploid H. hoppeanum to 15·30 pg in octoploid H. brachiatum. 1Cx-values ranged from 1·72 pg in H. pilosella to 2·16 pg in H. echioides (1·26-fold). The DNA content of (high) polyploids was usually proportional to the DNA values of their diploid/low polyploid counterparts, indicating lack of processes altering genome size (i.e. genome down-sizing). Most species showed constant nuclear DNA amounts, exceptions being three hybridogenous taxa, in which introgressive hybridization was suggested as a presumable trigger for genome size variation. Monoploid genome sizes of hybridogenous species were always between the corresponding values of their putative parents. In addition

  6. BACTERIOPLANKTON DYNAMICS IN A SUBTROPICAL ESTUARY: EVIDENCE FOR SUBSTRATE LIMITATION

    EPA Science Inventory

    Bacterioplankton abundance and metabolic characteristics were measured along a transect in Pensacola Bay, Florida, USA, to examine the factors that control microbial water column processes in this subtropical estuary. The microbial measures included 3 H-L-leucine incorporation, e...

  7. Oceanic fronts: transition zones for bacterioplankton community composition.

    PubMed

    Baltar, Federico; Currie, Kim; Stuck, Esther; Roosa, Stéphanie; Morales, Sergio E

    2016-02-01

    Oceanic fronts are widespread mesoscale features that exist in the boundary between different water masses. Despite the recognized importance of bacterioplankton (including bacteria and archaea) on the marine biogeochemical cycles and the ubiquitousness of fronts, the effect of frontal zones on the distribution of bacterioplankton community remains unknown. Using 16S rRNA gene sequencing coupled with a high spatial resolution analysis of the physical properties of the water masses, we demonstrate strong shifts in bacterioplankton community composition (BCC) across the subtropical frontal zone off New Zealand. The transition between water masses resulted in a clear modification of the dominant taxa and a significant increase in community dissimilarity. Our results, linking physical oceanography and marine molecular ecology, support the strong role of oceanic frontal zones in delimiting the distribution of bacterioplankton in the ocean.

  8. Genome Alignment Spanning Major Poaceae Lineages Reveals Heterogeneous Evolutionary Rates and Alters Inferred Dates for Key Evolutionary Events.

    PubMed

    Wang, Xiyin; Wang, Jingpeng; Jin, Dianchuan; Guo, Hui; Lee, Tae-Ho; Liu, Tao; Paterson, Andrew H

    2015-06-01

    Multiple comparisons among genomes can clarify their evolution, speciation, and functional innovations. To date, the genome sequences of eight grasses representing the most economically important Poaceae (grass) clades have been published, and their genomic-level comparison is an essential foundation for evolutionary, functional, and translational research. Using a formal and conservative approach, we aligned these genomes. Direct comparison of paralogous gene pairs all duplicated simultaneously reveal striking variation in evolutionary rates among whole genomes, with nucleotide substitution slowest in rice and up to 48% faster in other grasses, adding a new dimension to the value of rice as a grass model. We reconstructed ancestral genome contents for major evolutionary nodes, potentially contributing to understanding the divergence and speciation of grasses. Recent fossil evidence suggests revisions of the estimated dates of key evolutionary events, implying that the pan-grass polyploidization occurred ∼96 million years ago and could not be related to the Cretaceous-Tertiary mass extinction as previously inferred. Adjusted dating to reflect both updated fossil evidence and lineage-specific evolutionary rates suggested that maize subgenome divergence and maize-sorghum divergence were virtually simultaneous, a coincidence that would be explained if polyploidization directly contributed to speciation. This work lays a solid foundation for Poaceae translational genomics.

  9. Diazotrophic bacterioplankton in a coral reef lagoon: phylogeny, diel nitrogenase expression and response to phosphate enrichment.

    PubMed

    Hewson, Ian; Moisander, Pia H; Morrison, Amanda E; Zehr, Jonathan P

    2007-05-01

    We investigated diazotrophic bacterioplankton assemblage composition in the Heron Reef lagoon (Great Barrier Reef, Australia) using culture-independent techniques targeting the nifH fragment of the nitrogenase gene. Seawater was collected at 3 h intervals over a period of 72 h (i.e. over diel as well as tidal cycles). An incubation experiment was also conducted to assess the impact of phosphate (PO(4)3*) availability on nifH expression patterns. DNA-based nifH libraries contained primarily sequences that were most similar to nifH from sediment, microbial mat and surface-associated microorganisms, with a few sequences that clustered with typical open ocean phylotypes. In contrast to genomic DNA sequences, libraries prepared from gene transcripts (mRNA amplified by reverse transcription-polymerase chain reaction) were entirely cyanobacterial and contained phylotypes similar to those observed in open ocean plankton. The abundance of Trichodesmium and two uncultured cyanobacterial phylotypes from previous studies (group A and group B) were studied by quantitative-polymerase chain reaction in the lagoon samples. These were detected as transcripts, but were not detected in genomic DNA. The gene transcript abundance of these phylotypes demonstrated variability over several diel cycles. The PO(4)3* enrichment experiment had a clearer pattern of gene expression over diel cycles than the lagoon sampling, however PO(4)3* additions did not result in enhanced transcript abundance relative to control incubations. The results suggest that a number of diazotrophs in bacterioplankton of the reef lagoon may originate from sediment, coral or beachrock surfaces, sloughing into plankton with the flooding tide. The presence of typical open ocean phylotype transcripts in lagoon bacterioplankton may indicate that they are an important component of the N cycle of the coral reef.

  10. Assignment of homoeologs to parental genomes in allopolyploids for species tree inference, with an example from Fumaria (papaveraceae).

    PubMed

    Bertrand, Yann J K; Scheen, Anne-Cathrine; Marcussen, Thomas; Pfeil, Bernard E; de Sousa, Filipe; Oxelman, Bengt

    2015-05-01

    There is a rising awareness that species trees are best inferred from multiple loci while taking into account processes affecting individual gene trees, such as substitution model error (failure of the model to account for the complexity of the data) and coalescent stochasticity (presence of incomplete lineage sorting [ILS]). Although most studies have been carried out in the context of dichotomous species trees, these processes operate also in more complex evolutionary histories involving multiple hybridizations and polyploidy. Recently, methods have been developed that accurately handle ILS in allopolyploids, but they are thus far restricted to networks of diploids and tetraploids. We propose a procedure that improves on this limitation by designing a workflow that assigns homoeologs to hypothetical diploid ancestral genomes prior to genome tree construction. Conflicting assignment hypotheses are evaluated against substitution model error and coalescent stochasticity. Incongruence that cannot be explained by stochastic mechanisms needs to be explained by other processes (e.g., homoploid hybridization or paralogy). The data can then be filtered to build multilabeled genome phylogenies using inference methods that can recover species trees, either in the face of substitution model error and coalescent stochasticity alone, or while simultaneously accounting for hybridization. Methods are already available for folding the resulting multilabeled genome phylogeny into a network. We apply the workflow to the reconstruction of the reticulate phylogeny of the plant genus Fumaria (Papaveraceae) with ploidal levels ranging from 2[Formula: see text] to 14[Formula: see text]. We describe the challenges in recovering nuclear NRPB2 homoeologs in high ploidy species while combining in vivo cloning and direct sequencing techniques. Using parametric bootstrapping simulations we assign nuclear homoeologs and chloroplast sequences (four concatenated loci) to their common

  11. Phylogeny of plastids based on cladistic analysis of gene loss inferred from complete plastid genome sequences.

    PubMed

    Nozaki, Hisayoshi; Ohta, Njij; Matsuzaki, Motomichi; Misumi, Osami; Kuroiwa, Tsuneyoshi

    2003-10-01

    Based on the recent hypothesis on the origin of eukaryotic phototrophs, red algae, green plants, and glaucophytes constitute the "primary photosynthetic eukaryotes" (whose plastids may have originated directly from a cyanobacterium-like prokaryote via primary endosymbiosis), whereas the plastids of other lineages of eukaryotic phototrophs appear to be the result of secondary or tertiary endosymbiotic events (involving a phototrophic eukaryote and a host cell). Although phylogenetic analyses using multiple plastid genes from a wide range of eukaryotic lineages have been carried out, some of the major phylogenetic relationships of plastids remain ambiguous or conflict between different phylogenetic methods used for nucleotide or amino acid substitutions. Therefore, an alternative methodology to infer the plastid phylogeny is needed. Here, we carried out a cladistic analysis of the "loss of plastid genes" after primary endosymbiosis using complete plastid genome sequences from a wide range of eukaryotic phototrophs. Since it is extremely unlikely that plastid genes are regained during plastid evolution, we used the irreversible Camin-Sokal model for our cladistic analysis of the loss of plastid genes. The cladistic analysis of the 274 plastid protein-coding genes resolved the 20 operational taxonomic units representing a wide range of eukaryotic lineages (including three secondary plastid-containing groups) into two large monophyletic groups with high bootstrap values: one corresponded to the red lineage and the other consisted of a large clade composed of the green lineage (green plants and Euglena) and the basal glaucophyte plastid. Although the sister relationship between the green lineage and the Glaucophyta was not resolved in recent phylogenetic studies using amino acid substitutions from multiple plastid genes, it is consistent with the rbcL gene phylogeny and with a recent phylogenetic study using multiple nuclear genes. In addition, our analysis robustly

  12. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy.

    PubMed

    Emms, David M; Kelly, Steven

    2015-08-06

    Identifying homology relationships between sequences is fundamental to biological research. Here we provide a novel orthogroup inference algorithm called OrthoFinder that solves a previously undetected gene length bias in orthogroup inference, resulting in significant improvements in accuracy. Using real benchmark datasets we demonstrate that OrthoFinder is more accurate than other orthogroup inference methods by between 8 % and 33 %. Furthermore, we demonstrate the utility of OrthoFinder by providing a complete classification of transcription factor gene families in plants revealing 6.9 million previously unobserved relationships.

  13. Comparative population genomics: power and principles for the inference of functionality.

    PubMed

    Lawrie, David S; Petrov, Dmitri A

    2014-04-01

    The availability of sequenced genomes from multiple related organisms allows the detection and localization of functional genomic elements based on the idea that such elements evolve more slowly than neutral sequences. Although such comparative genomics methods have proven useful in discovering functional elements and ascertaining levels of functional constraint in the genome as a whole, here we outline limitations intrinsic to this approach that cannot be overcome by sequencing more species. We argue that it is essential to supplement comparative genomics with ultra-deep sampling of populations from closely related species to enable substantially more powerful genomic scans for functional elements. The convergence of sequencing technology and population genetics theory has made such projects feasible and has exciting implications for functional genomics.

  14. Biogeography of bacterioplankton in the tropical seawaters of Singapore.

    PubMed

    Lau, Stanley C K; Zhang, Rui; Brodie, Eoin L; Piceno, Yvette M; Andersen, Gary; Liu, Wen-Tso

    2013-05-01

    Knowledge about the biogeography of marine bacterioplankton on the global scale in general and in Southeast Asia in particular has been scarce. This study investigated the biogeography of bacterioplankton community in Singapore seawaters. Twelve stations around Singapore island were sampled on different schedules over 1 year. Using PCR-DNA fingerprinting, DNA cloning and sequencing, and microarray hybridization of the 16S rRNA genes, we observed clear spatial variations of bacterioplankton diversity within the small area of the Singapore seas. Water samples collected from the Singapore Strait (south) throughout the year were dominated by DNA sequences affiliated with Cyanobacteria and Alphaproteobacteria that were believed to be associated with the influx of water from the open seas in Southeast Asia. On the contrary, water in the relatively polluted Johor Strait (north) were dominated by Betaproteobacteria, Gammaproteobacteria, and Bacteroidetes and that were presumably associated with river discharge and the relatively eutrophic conditions of the waterway. Bacterioplankton diversity was temporally stable, except for the episodic surge of Pseudoalteromonas, associated with algal blooms. Overall, these results provide valuable insights into the diversity of bacterioplankton communities in Singapore seas and the possible influences of hydrological conditions and anthropogenic activities on the dynamics of the communities.

  15. Reference set of regulons in Desulfovibrionales inferred by comparative genomics approach

    SciTech Connect

    Kazakov, A.E.; Rodionov, D.A.; Price, M.N.; Arkin, A.P.; Dubchak, I.; Novichkov, P.S.

    2010-11-15

    in this study, we carried out large-scale comparative genomics analysis of regulatory interactions in Desulfovibrio vulgaris and 12 related genomes from Desulfovibrionales order using our recently developed web server RegPredict (http://regpredict.lbl.gov). An overall reference collection of 26 Desulfovibrionales regulogs can be accessed through RegPrecise database (http://regpredict.lbl.gov).

  16. Inferring Selective Constraint from Population Genomic Data Suggests Recent Regulatory Turnover in the Human Brain

    PubMed Central

    Schrider, Daniel R.; Kern, Andrew D.

    2015-01-01

    The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods. PMID:26590212

  17. High-level phylogeny of the Coleoptera inferred with mitochondrial genome sequences.

    PubMed

    Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Guo, Zhong-Long; Liu, Yong-Jian; Shen, Yu-Ying; Shao, Renfu

    2016-11-01

    The Coleoptera (beetles) exhibits tremendous morphological, ecological, and behavioral diversity. To better understand the phylogenetics and evolution of beetles, we sequenced three complete mitogenomes from two families (Cleridae and Meloidae), which share conserved mitogenomic features with other completely sequenced beetles. We assessed the influence of six datasets and three inference methods on topology and nodal support within the Coleoptera. We found that both Bayesian inference and maximum likelihood with homogeneous-site models were greatly affected by nucleotide compositional heterogeneity, while the heterogeneous-site mixture model in PhyloBayes could provide better phylogenetic signals for the Coleoptera. The amino acid dataset generated more reliable tree topology at the higher taxonomic levels (i.e. suborders and series), where the inclusion of rRNA genes and the third positions of protein-coding genes improved phylogenetic inference at the superfamily level, especially under a heterogeneous-site model. We recovered the suborder relationships as (Archostemata+Adephaga)+(Myxophaga+Polyphaga). The series relationships within Polyphaga were recovered as (Scirtiformia+(Elateriformia+((Bostrichiformia+Scarabaeiformia+Staphyliniformia)+Cucujiformia))). All superfamilies within Cucujiformia were recovered as monophyletic. We obtained a cucujiform phylogeny of (Cleroidea+(Coccinelloidea+((Lymexyloidea+Tenebrionoidea)+(Cucujoidea+(Chrysomeloidea+Curculionoidea))))). This study showed that although tree topologies were sensitive to data types and inference methods, mitogenomic data could provide useful information for resolving the Coleoptera phylogeny at various taxonomic levels by using suitable datasets and heterogeneous-site models.

  18. High-level phylogeny of the Coleoptera inferred with mitochondrial genome sequences.

    PubMed

    Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Guo, Zhong-Long; Liu, Yong-Jian; Shen, Yu-Ying; Shao, Renfu

    2016-11-01

    The Coleoptera (beetles) exhibits tremendous morphological, ecological, and behavioral diversity. To better understand the phylogenetics and evolution of beetles, we sequenced three complete mitogenomes from two families (Cleridae and Meloidae), which share conserved mitogenomic features with other completely sequenced beetles. We assessed the influence of six datasets and three inference methods on topology and nodal support within the Coleoptera. We found that both Bayesian inference and maximum likelihood with homogeneous-site models were greatly affected by nucleotide compositional heterogeneity, while the heterogeneous-site mixture model in PhyloBayes could provide better phylogenetic signals for the Coleoptera. The amino acid dataset generated more reliable tree topology at the higher taxonomic levels (i.e. suborders and series), where the inclusion of rRNA genes and the third positions of protein-coding genes improved phylogenetic inference at the superfamily level, especially under a heterogeneous-site model. We recovered the suborder relationships as (Archostemata+Adephaga)+(Myxophaga+Polyphaga). The series relationships within Polyphaga were recovered as (Scirtiformia+(Elateriformia+((Bostrichiformia+Scarabaeiformia+Staphyliniformia)+Cucujiformia))). All superfamilies within Cucujiformia were recovered as monophyletic. We obtained a cucujiform phylogeny of (Cleroidea+(Coccinelloidea+((Lymexyloidea+Tenebrionoidea)+(Cucujoidea+(Chrysomeloidea+Curculionoidea))))). This study showed that although tree topologies were sensitive to data types and inference methods, mitogenomic data could provide useful information for resolving the Coleoptera phylogeny at various taxonomic levels by using suitable datasets and heterogeneous-site models. PMID:27497607

  19. Distribution and evolution of repeated sequences in genomes of Triatominae (Hemiptera-Reduviidae) inferred from genomic in situ hybridization.

    PubMed

    Pita, Sebastian; Panzera, Francisco; Sánchez, Antonio; Panzera, Yanina; Palomeque, Teresa; Lorite, Pedro

    2014-01-01

    The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by Genomic in situ Hybridization using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is constituted by highly repeated sequences, which is conserved among 10 species of Triatomini tribe suggesting be an ancestral character for this group. However, the Y chromosome in Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about autosomal repeated sequences distribution but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.

  20. Distribution and Evolution of Repeated Sequences in Genomes of Triatominae (Hemiptera-Reduviidae) Inferred from Genomic In Situ Hybridization

    PubMed Central

    Pita, Sebastian; Panzera, Francisco; Sánchez, Antonio; Panzera, Yanina; Palomeque, Teresa; Lorite, Pedro

    2014-01-01

    The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by Genomic in situ Hybridization using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is constituted by highly repeated sequences, which is conserved among 10 species of Triatomini tribe suggesting be an ancestral character for this group. However, the Y chromosome in Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about autosomal repeated sequences distribution but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily. PMID:25478792

  1. Data set of phylogenetic analysis inferred based on the complete genomes of the family Nodaviridae.

    PubMed

    Low, Chen-Fei; Bunawan, Hamidun

    2016-09-01

    In this article, nine complete genomes of viruses from the genus Alphanodavirus and Betanodavirus (Family Nodaviridae) were comparatively analyzed and the data of their evolutionary origins and relatedness are reported. The nucleotide sequence alignment of the complete genomes from all species and their deduced evolutionary relationships are presented. High sequence similarity within the genus Betanodavirus compared to the genus Alphanodavirus was revealed in multiple sequence alignment of the Nodaviridae genomes. The amino acid sequence similarity for both RNA1 and RNA2 ORF is more conserved in Betanodavirus, compared to Alphanodavirus. The conserved and variable regions within the virus genome that were defined based on the multiple sequence alignments are presented in this dataset. PMID:27617282

  2. New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing.

    PubMed

    Keller, Andreas; Graefen, Angela; Ball, Markus; Matzas, Mark; Boisguerin, Valesca; Maixner, Frank; Leidinger, Petra; Backes, Christina; Khairat, Rabab; Forster, Michael; Stade, Björn; Franke, Andre; Mayer, Jens; Spangler, Jessica; McLaughlin, Stephen; Shah, Minita; Lee, Clarence; Harkins, Timothy T; Sartori, Alexander; Moreno-Estrada, Andres; Henn, Brenna; Sikora, Martin; Semino, Ornella; Chiaroni, Jacques; Rootsi, Siiri; Myres, Natalie M; Cabrera, Vicente M; Underhill, Peter A; Bustamante, Carlos D; Vigl, Eduard Egarter; Samadelli, Marco; Cipollini, Giovanna; Haas, Jan; Katus, Hugo; O'Connor, Brian D; Carlson, Marc R J; Meder, Benjamin; Blin, Nikolaus; Meese, Eckart; Pusch, Carsten M; Zink, Albert

    2012-02-28

    The Tyrolean Iceman, a 5,300-year-old Copper age individual, was discovered in 1991 on the Tisenjoch Pass in the Italian part of the Ötztal Alps. Here we report the complete genome sequence of the Iceman and show 100% concordance between the previously reported mitochondrial genome sequence and the consensus sequence generated from our genomic data. We present indications for recent common ancestry between the Iceman and present-day inhabitants of the Tyrrhenian Sea, that the Iceman probably had brown eyes, belonged to blood group O and was lactose intolerant. His genetic predisposition shows an increased risk for coronary heart disease and may have contributed to the development of previously reported vascular calcifications. Sequences corresponding to ~60% of the genome of Borrelia burgdorferi are indicative of the earliest human case of infection with the pathogen for Lyme borreliosis.

  3. New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing.

    PubMed

    Keller, Andreas; Graefen, Angela; Ball, Markus; Matzas, Mark; Boisguerin, Valesca; Maixner, Frank; Leidinger, Petra; Backes, Christina; Khairat, Rabab; Forster, Michael; Stade, Björn; Franke, Andre; Mayer, Jens; Spangler, Jessica; McLaughlin, Stephen; Shah, Minita; Lee, Clarence; Harkins, Timothy T; Sartori, Alexander; Moreno-Estrada, Andres; Henn, Brenna; Sikora, Martin; Semino, Ornella; Chiaroni, Jacques; Rootsi, Siiri; Myres, Natalie M; Cabrera, Vicente M; Underhill, Peter A; Bustamante, Carlos D; Vigl, Eduard Egarter; Samadelli, Marco; Cipollini, Giovanna; Haas, Jan; Katus, Hugo; O'Connor, Brian D; Carlson, Marc R J; Meder, Benjamin; Blin, Nikolaus; Meese, Eckart; Pusch, Carsten M; Zink, Albert

    2012-01-01

    The Tyrolean Iceman, a 5,300-year-old Copper age individual, was discovered in 1991 on the Tisenjoch Pass in the Italian part of the Ötztal Alps. Here we report the complete genome sequence of the Iceman and show 100% concordance between the previously reported mitochondrial genome sequence and the consensus sequence generated from our genomic data. We present indications for recent common ancestry between the Iceman and present-day inhabitants of the Tyrrhenian Sea, that the Iceman probably had brown eyes, belonged to blood group O and was lactose intolerant. His genetic predisposition shows an increased risk for coronary heart disease and may have contributed to the development of previously reported vascular calcifications. Sequences corresponding to ~60% of the genome of Borrelia burgdorferi are indicative of the earliest human case of infection with the pathogen for Lyme borreliosis. PMID:22426219

  4. Inferring Speciation Processes from Patterns of Natural Variation in Microbial Genomes.

    PubMed

    Krause, David J; Whitaker, Rachel J

    2015-11-01

    Microbial species concepts have long been the focus of contentious debate, fueled by technological limitations to the genetic resolution of species, by the daunting task of investigating phenotypic variation among individual microscopic organisms, and by a lack of understanding of gene flow in reproductively asexual organisms that are prone to promiscuous horizontal gene transfer. Population genomics, the emerging approach of analyzing the complete genomes of a multitude of closely related organisms, is poised to overcome these limitations by providing a window into patterns of genome variation revealing the evolutionary processes through which species diverge. This new approach is more than just an extension of previous multilocus sequencing technologies, in that it provides a comprehensive view of interacting evolutionary processes. Here we argue that the application of population genomic tools in a rigorous population genetic framework will help to identify the processes of microbial speciation and ultimately lead to a general species concept based on the unique biology and ecology of microorganisms.

  5. Demographic History of the Genus Pan Inferred from Whole Mitochondrial Genome Reconstructions

    PubMed Central

    Tucci, Serena; de Manuel, Marc; Ghirotto, Silvia; Benazzo, Andrea; Prado-Martinez, Javier; Lorente-Galdos, Belen; Nam, Kiwoong; Dabad, Marc; Hernandez-Rodriguez, Jessica; Comas, David; Navarro, Arcadi; Schierup, Mikkel H.; Andres, Aida M.; Barbujani, Guido; Hvilsom, Christina; Marques-Bonet, Tomas

    2016-01-01

    The genus Pan is the closest genus to our own and it includes two species, Pan paniscus (bonobos) and Pan troglodytes (chimpanzees). The later is constituted by four subspecies, all highly endangered. The study of the Pan genera has been incessantly complicated by the intricate relationship among subspecies and the statistical limitations imposed by the reduced number of samples or genomic markers analyzed. Here, we present a new method to reconstruct complete mitochondrial genomes (mitogenomes) from whole genome shotgun (WGS) datasets, mtArchitect, showing that its reconstructions are highly accurate and consistent with long-range PCR mitogenomes. We used this approach to build the mitochondrial genomes of 20 newly sequenced samples which, together with available genomes, allowed us to analyze the hitherto most complete Pan mitochondrial genome dataset including 156 chimpanzee and 44 bonobo individuals, with a proportional contribution from all chimpanzee subspecies. We estimated the separation time between chimpanzees and bonobos around 1.15 million years ago (Mya) [0.81–1.49]. Further, we found that under the most probable genealogical model the two clades of chimpanzees, Western + Nigeria-Cameroon and Central + Eastern, separated at 0.59 Mya [0.41–0.78] with further internal separations at 0.32 Mya [0.22–0.43] and 0.16 Mya [0.17–0.34], respectively. Finally, for a subset of our samples, we compared nuclear versus mitochondrial genomes and we found that chimpanzee subspecies have different patterns of nuclear and mitochondrial diversity, which could be a result of either processes affecting the mitochondrial genome, such as hitchhiking or background selection, or a result of population dynamics. PMID:27345955

  6. Demographic History of the Genus Pan Inferred from Whole Mitochondrial Genome Reconstructions.

    PubMed

    Lobon, Irene; Tucci, Serena; de Manuel, Marc; Ghirotto, Silvia; Benazzo, Andrea; Prado-Martinez, Javier; Lorente-Galdos, Belen; Nam, Kiwoong; Dabad, Marc; Hernandez-Rodriguez, Jessica; Comas, David; Navarro, Arcadi; Schierup, Mikkel H; Andres, Aida M; Barbujani, Guido; Hvilsom, Christina; Marques-Bonet, Tomas

    2016-01-01

    The genus Pan is the closest genus to our own and it includes two species, Pan paniscus (bonobos) and Pan troglodytes (chimpanzees). The later is constituted by four subspecies, all highly endangered. The study of the Pan genera has been incessantly complicated by the intricate relationship among subspecies and the statistical limitations imposed by the reduced number of samples or genomic markers analyzed. Here, we present a new method to reconstruct complete mitochondrial genomes (mitogenomes) from whole genome shotgun (WGS) datasets, mtArchitect, showing that its reconstructions are highly accurate and consistent with long-range PCR mitogenomes. We used this approach to build the mitochondrial genomes of 20 newly sequenced samples which, together with available genomes, allowed us to analyze the hitherto most complete Pan mitochondrial genome dataset including 156 chimpanzee and 44 bonobo individuals, with a proportional contribution from all chimpanzee subspecies. We estimated the separation time between chimpanzees and bonobos around 1.15 million years ago (Mya) [0.81-1.49]. Further, we found that under the most probable genealogical model the two clades of chimpanzees, Western + Nigeria-Cameroon and Central + Eastern, separated at 0.59 Mya [0.41-0.78] with further internal separations at 0.32 Mya [0.22-0.43] and 0.16 Mya [0.17-0.34], respectively. Finally, for a subset of our samples, we compared nuclear versus mitochondrial genomes and we found that chimpanzee subspecies have different patterns of nuclear and mitochondrial diversity, which could be a result of either processes affecting the mitochondrial genome, such as hitchhiking or background selection, or a result of population dynamics. PMID:27345955

  7. Demographic History of the Genus Pan Inferred from Whole Mitochondrial Genome Reconstructions.

    PubMed

    Lobon, Irene; Tucci, Serena; de Manuel, Marc; Ghirotto, Silvia; Benazzo, Andrea; Prado-Martinez, Javier; Lorente-Galdos, Belen; Nam, Kiwoong; Dabad, Marc; Hernandez-Rodriguez, Jessica; Comas, David; Navarro, Arcadi; Schierup, Mikkel H; Andres, Aida M; Barbujani, Guido; Hvilsom, Christina; Marques-Bonet, Tomas

    2016-01-01

    The genus Pan is the closest genus to our own and it includes two species, Pan paniscus (bonobos) and Pan troglodytes (chimpanzees). The later is constituted by four subspecies, all highly endangered. The study of the Pan genera has been incessantly complicated by the intricate relationship among subspecies and the statistical limitations imposed by the reduced number of samples or genomic markers analyzed. Here, we present a new method to reconstruct complete mitochondrial genomes (mitogenomes) from whole genome shotgun (WGS) datasets, mtArchitect, showing that its reconstructions are highly accurate and consistent with long-range PCR mitogenomes. We used this approach to build the mitochondrial genomes of 20 newly sequenced samples which, together with available genomes, allowed us to analyze the hitherto most complete Pan mitochondrial genome dataset including 156 chimpanzee and 44 bonobo individuals, with a proportional contribution from all chimpanzee subspecies. We estimated the separation time between chimpanzees and bonobos around 1.15 million years ago (Mya) [0.81-1.49]. Further, we found that under the most probable genealogical model the two clades of chimpanzees, Western + Nigeria-Cameroon and Central + Eastern, separated at 0.59 Mya [0.41-0.78] with further internal separations at 0.32 Mya [0.22-0.43] and 0.16 Mya [0.17-0.34], respectively. Finally, for a subset of our samples, we compared nuclear versus mitochondrial genomes and we found that chimpanzee subspecies have different patterns of nuclear and mitochondrial diversity, which could be a result of either processes affecting the mitochondrial genome, such as hitchhiking or background selection, or a result of population dynamics.

  8. Adaptation, Ecology, and Evolution of the Halophilic Stromatolite Archaeon Halococcus hamelinensis Inferred through Genome Analyses

    PubMed Central

    Gudhka, Reema K.; Neilan, Brett A.; Burns, Brendan P.

    2015-01-01

    Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer. PMID:25709556

  9. Simultaneous inference of selection and population growth from patterns of variation in the human genome

    PubMed Central

    Williamson, Scott H.; Hernandez, Ryan; Fledel-Alon, Adi; Zhu, Lan; Nielsen, Rasmus; Bustamante, Carlos D.

    2005-01-01

    Natural selection and demographic forces can have similar effects on patterns of DNA polymorphism. Therefore, to infer selection from samples of DNA sequences, one must simultaneously account for demographic effects. Here we take a model-based approach to this problem by developing predictions for patterns of polymorphism in the presence of both population size change and natural selection. If data are available from different functional classes of variation, and a priori information suggests that mutations in one of those classes are selectively neutral, then the putatively neutral class can be used to infer demographic parameters, and inferences regarding selection on other classes can be performed given demographic parameter estimates. This procedure is more robust to assumptions regarding the true underlying demography than previous approaches to detecting and analyzing selection. We apply this method to a large polymorphism data set from 301 human genes and find (i) widespread negative selection acting on standing nonsynonymous variation, (ii) that the fitness effects of nonsynonymous mutations are well predicted by several measures of amino acid exchangeability, especially site-specific methods, and (iii) strong evidence for very recent population growth. PMID:15905331

  10. Higher-level phylogeny of the Hymenoptera inferred from mitochondrial genomes.

    PubMed

    Mao, Meng; Gibson, Tracey; Dowton, Mark

    2015-03-01

    Higher-level hymenopteran relationships remain unresolved in both morphological and molecular analyses. In this study, we present the most comprehensive analyses of hymenopteran relationships based on 48 mitochondrial (mt) genomes. One complete and two nearly complete mt genomes representing three hymenopteran superfamilies were newly sequenced. We assessed the influence of inclusion/exclusion of 3rd codon positions, alignment approaches, partition schemes and phylogenetic approaches on topology and nodal support within the Hymenoptera. The results showed that the topologies were sensitive to the variation of dataset and analytical approach. However, some robust and highly supported relationships were recovered: the Ichneumonomorpha was monophyletic; the Trigonalyoidea+Megalyroidea and the Diaprioidea+Chalcidoidea were consistently recovered; the Cynipoidea was generally recovered as the sister group to the Diaprioidea+Chalcidoidea. In addition, the monophyletic Aculeata and Proctotrupomorpha were recovered in some analyses. Several gene rearrangements were detected in each of the three newly sequenced mt genomes. Specifically, the Ibalia leucospoides mt genome harbors a large inversion of a gene block from trnE to trnS2. Inverted, duplicated A+T rich regions were detected in the Ibalia leucospoides mt genome, which probably played an important role during the formation of the large gene block inversion via recombination.

  11. Inferences of drug responses in cancer cells from cancer genomic features and compound chemical and therapeutic properties.

    PubMed

    Wang, Yongcui; Fang, Jianwen; Chen, Shilong

    2016-01-01

    Accurately predicting the response of a cancer patient to a therapeutic agent is a core goal of precision medicine. Existing approaches were mainly relied primarily on genomic alterations in cancer cells that have been treated with different drugs. Here we focus on predicting drug response based on integration of the heterogeneously pharmacogenomics data from both cell and drug sides. Through a systematical approach, named as PDRCC (Predict Drug Response in Cancer Cells), the cancer genomic alterations and compound chemical and therapeutic properties were incorporated to determine the chemotherapeutic response in cancer patients. Using the Cancer Cell Line Encyclopedia (CCLE) study as the benchmark dataset, all pharmacogenomics data exhibited their roles in inferring the relationships between cancer cells and drugs. When integrating both genomic resources and compound information, the prediction coverage was significantly increased. The validity of PDRCC was also supported by its effective in uncovering the unknown cell-drug associations with database and literature evidences. It set the stage for clinical testing of novel therapeutic strategies, such as the sensitive association between cancer cell 'A549_LUNG' and compound 'Topotecan'. In conclusion, PDRCC offers the possibility for faster, safer, and cheaper the development of novel anti-cancer therapeutics in the early-stage clinical trails. PMID:27645580

  12. Inferences of drug responses in cancer cells from cancer genomic features and compound chemical and therapeutic properties

    PubMed Central

    Wang, Yongcui; Fang, Jianwen; Chen, Shilong

    2016-01-01

    Accurately predicting the response of a cancer patient to a therapeutic agent is a core goal of precision medicine. Existing approaches were mainly relied primarily on genomic alterations in cancer cells that have been treated with different drugs. Here we focus on predicting drug response based on integration of the heterogeneously pharmacogenomics data from both cell and drug sides. Through a systematical approach, named as PDRCC (Predict Drug Response in Cancer Cells), the cancer genomic alterations and compound chemical and therapeutic properties were incorporated to determine the chemotherapeutic response in cancer patients. Using the Cancer Cell Line Encyclopedia (CCLE) study as the benchmark dataset, all pharmacogenomics data exhibited their roles in inferring the relationships between cancer cells and drugs. When integrating both genomic resources and compound information, the prediction coverage was significantly increased. The validity of PDRCC was also supported by its effective in uncovering the unknown cell-drug associations with database and literature evidences. It set the stage for clinical testing of novel therapeutic strategies, such as the sensitive association between cancer cell ‘A549_LUNG’ and compound ‘Topotecan’. In conclusion, PDRCC offers the possibility for faster, safer, and cheaper the development of novel anti-cancer therapeutics in the early-stage clinical trails. PMID:27645580

  13. Inferences of drug responses in cancer cells from cancer genomic features and compound chemical and therapeutic properties

    NASA Astrophysics Data System (ADS)

    Wang, Yongcui; Fang, Jianwen; Chen, Shilong

    2016-09-01

    Accurately predicting the response of a cancer patient to a therapeutic agent is a core goal of precision medicine. Existing approaches were mainly relied primarily on genomic alterations in cancer cells that have been treated with different drugs. Here we focus on predicting drug response based on integration of the heterogeneously pharmacogenomics data from both cell and drug sides. Through a systematical approach, named as PDRCC (Predict Drug Response in Cancer Cells), the cancer genomic alterations and compound chemical and therapeutic properties were incorporated to determine the chemotherapeutic response in cancer patients. Using the Cancer Cell Line Encyclopedia (CCLE) study as the benchmark dataset, all pharmacogenomics data exhibited their roles in inferring the relationships between cancer cells and drugs. When integrating both genomic resources and compound information, the prediction coverage was significantly increased. The validity of PDRCC was also supported by its effective in uncovering the unknown cell-drug associations with database and literature evidences. It set the stage for clinical testing of novel therapeutic strategies, such as the sensitive association between cancer cell ‘A549_LUNG’ and compound ‘Topotecan’. In conclusion, PDRCC offers the possibility for faster, safer, and cheaper the development of novel anti-cancer therapeutics in the early-stage clinical trails.

  14. Mining the semantics of genome super-blocks to infer ancestral architectures.

    PubMed

    Jean, Géraldine; Sherman, David James; Nikolski, Macha

    2009-09-01

    The study of evolutionary mechanisms is made more and more accurate by the increase in the number of fully sequenced genomes. One of the main problems is to reconstruct plausible ancestral genome architectures based on the comparison of contemporary genomes. Current methods have largely focused on finding complete architectures for ancestral genomes, and, due to the computational difficulty of the problem, stop after a small number of equivalent minimal solutions have been found. Recent results suggest, however, that the set of minimum complete architectures is very large and heterogeneous. In fact these solutions are collections of conserved blocks, freely rearranged. In this paper, we identify these conserved super-blocks, using a new method of analysis of ancestral architectures that reconciles both breakpoint and rearrangement analyses, as well as respects biological constraints. The resulting algorithms permit the first reliable reconstruction of plausible ancestral architectures for several non-WGD yeasts simultaneously, a problem hitherto intractable due to the extensive map reshuffling of these species. See online Supplementary Material at www.liebertonline.com. PMID:19772437

  15. Collinearity analysis of Brassica A and C genomes based on an updated inferred unigene order.

    PubMed

    Bancroft, Ian; Fraser, Fiona; Morgan, Colin; Trick, Martin

    2015-06-01

    This data article includes SNP scoring across lines of the Brassica napus TNDH population based on Illumina sequencing of mRNA, expanded to 75 lines. The 21, 323 mapped markers defined 887 recombination bins, representing an updated genetic linkage map for the species. Based on this new map, 5 genome sequence scaffolds were split and the order and orientation of scaffolds updated to establish a new pseudomolecule specification. The order of unigenes and SNP array probes within these pseudomolecules was determined. Unigenes were assessed for sequence similarity to the A and C genomes. The 57, 246 that mapped to both enabled the collinearity of the A and C genomes to be illustrated graphically. Although the great majority was in collinear positions, some were not. Analyses of 60 such instances are presented, suggesting that the breakdown in collinearity was largely due to either the absence of the homoeologue on one genome (resulting in sequence match to a paralogue) or multiple similar sequences being present. The mRNAseq datasets for the TNDH lines are available from the SRA repository (ERA283648); the remaining datasets are supplied with this article.

  16. Infer Metagenomic Abundance and Reveal Homologous Genomes Based on the Structure of Taxonomy Tree.

    PubMed

    Qiu, Yu-Qing; Tian, Xue; Zhang, Shihua

    2015-01-01

    Metagenomic research uses sequencing technologies to investigate the genetic biodiversity of microbiomes presented in various ecosystems or animal tissues. The composition of a microbial community is highly associated with the environment in which the organisms exist. As large amount of sequencing short reads of microorganism genomes obtained, accurately estimating the abundance of microorganisms within a metagenomic sample is becoming an increasing challenge in bioinformatics. In this paper, we describe a hierarchical taxonomy tree-based mixture model (HTTMM) for estimating the abundance of taxon within a microbial community by incorporating the structure of the taxonomy tree. In this model, genome-specific short reads and homologous short reads among genomes can be distinguished and represented by leaf and intermediate nodes in the taxonomy tree, respectively. We adopt an expectation-maximization algorithm to solve this model. Using simulated and real-world data, we demonstrate that the proposed method is superior to both flat mixture model and lowest common ancestry-based methods. Moreover, this model can reveal previously unaddressed homologous genomes.

  17. Structural features of conopeptide genes inferred from partial sequences of the Conus tribblei genome.

    PubMed

    Barghi, Neda; Concepcion, Gisela P; Olivera, Baldomero M; Lluisma, Arturo O

    2016-02-01

    The evolvability of venom components (in particular, the gene-encoded peptide toxins) in venomous species serves as an adaptive strategy allowing them to target new prey types or respond to changes in the prey field. The structure, organization, and expression of the venom peptide genes may provide insights into the molecular mechanisms that drive the evolution of such genes. Conus is a particularly interesting group given the high chemical diversity of their venom peptides, and the rapid evolution of the conopeptide-encoding genes. Conus genomes, however, are large and characterized by a high proportion of repetitive sequences. As a result, the structure and organization of conopeptide genes have remained poorly known. In this study, a survey of the genome of Conus tribblei was undertaken to address this gap. A partial assembly of C. tribblei genome was generated; the assembly, though consisting of a large number of fragments, accounted for 2160.5 Mb of sequence. A large number of repetitive genomic elements consisting of 642.6 Mb of retrotransposable elements, simple repeats, and novel interspersed repeats were observed. We characterized the structural organization and distribution of conotoxin genes in the genome. A significant number of conopeptide genes (estimated to be between 148 and 193) belonging to different superfamilies with complete or nearly complete exon regions were observed, ~60 % of which were expressed. The unexpressed conopeptide genes represent hidden but significant conotoxin diversity. The conotoxin genes also differed in the frequency and length of the introns. The interruption of exons by long introns in the conopeptide genes and the presence of repeats in the introns may indicate the importance of introns in facilitating recombination, evolution and diversification of conotoxins. These findings advance our understanding of the structural framework that promotes the gene-level molecular evolution of venom peptides. PMID:26423067

  18. Structural features of conopeptide genes inferred from partial sequences of the Conus tribblei genome.

    PubMed

    Barghi, Neda; Concepcion, Gisela P; Olivera, Baldomero M; Lluisma, Arturo O

    2016-02-01

    The evolvability of venom components (in particular, the gene-encoded peptide toxins) in venomous species serves as an adaptive strategy allowing them to target new prey types or respond to changes in the prey field. The structure, organization, and expression of the venom peptide genes may provide insights into the molecular mechanisms that drive the evolution of such genes. Conus is a particularly interesting group given the high chemical diversity of their venom peptides, and the rapid evolution of the conopeptide-encoding genes. Conus genomes, however, are large and characterized by a high proportion of repetitive sequences. As a result, the structure and organization of conopeptide genes have remained poorly known. In this study, a survey of the genome of Conus tribblei was undertaken to address this gap. A partial assembly of C. tribblei genome was generated; the assembly, though consisting of a large number of fragments, accounted for 2160.5 Mb of sequence. A large number of repetitive genomic elements consisting of 642.6 Mb of retrotransposable elements, simple repeats, and novel interspersed repeats were observed. We characterized the structural organization and distribution of conotoxin genes in the genome. A significant number of conopeptide genes (estimated to be between 148 and 193) belonging to different superfamilies with complete or nearly complete exon regions were observed, ~60 % of which were expressed. The unexpressed conopeptide genes represent hidden but significant conotoxin diversity. The conotoxin genes also differed in the frequency and length of the introns. The interruption of exons by long introns in the conopeptide genes and the presence of repeats in the introns may indicate the importance of introns in facilitating recombination, evolution and diversification of conotoxins. These findings advance our understanding of the structural framework that promotes the gene-level molecular evolution of venom peptides.

  19. Disentangling seasonal bacterioplankton population dynamics by high-frequency sampling.

    PubMed

    Lindh, Markus V; Sjöstedt, Johanna; Andersson, Anders F; Baltar, Federico; Hugerth, Luisa W; Lundin, Daniel; Muthusamy, Saraladevi; Legrand, Catherine; Pinhassi, Jarone

    2015-07-01

    Multiyear comparisons of bacterioplankton succession reveal that environmental conditions drive community shifts with repeatable patterns between years. However, corresponding insight into bacterioplankton dynamics at a temporal resolution relevant for detailed examination of variation and characteristics of specific populations within years is essentially lacking. During 1 year, we collected 46 samples in the Baltic Sea for assessing bacterial community composition by 16S rRNA gene pyrosequencing (nearly twice weekly during productive season). Beta-diversity analysis showed distinct clustering of samples, attributable to seemingly synchronous temporal transitions among populations (populations defined by 97% 16S rRNA gene sequence identity). A wide spectrum of bacterioplankton dynamics was evident, where divergent temporal patterns resulted both from pronounced differences in relative abundance and presence/absence of populations. Rates of change in relative abundance calculated for individual populations ranged from 0.23 to 1.79 day(-1) . Populations that were persistently dominant, transiently abundant or generally rare were found in several major bacterial groups, implying evolution has favoured a similar variety of life strategies within these groups. These findings suggest that high temporal resolution sampling allows constraining the timescales and frequencies at which distinct populations transition between being abundant or rare, thus potentially providing clues about physical, chemical or biological forcing on bacterioplankton community structure.

  20. Inferring the Phylogeny of Bovidae Using Mitochondrial DNA Sequences: Resolving Power of Individual Genes Relative to Complete Genomes

    PubMed Central

    Arif, Ibrahim A.; Bakir, Mohammad A.; Khan, Haseeb A.

    2012-01-01

    Molecular techniques that assess biodiversity through the analysis of a small segment of mitochondrial genome have been getting wide attention for inferring the mammalian diversity. Due to their highly conserved nature, specific mitochondrial genes offer a promising tool for phylogenetic analysis. However, there is no established criteria for selecting the typical mitochondrial DNA (mtDNA) segments to achieve a greater resolving power. We therefore chose the family Bovidae as a model and compared the tree-topologies resulting from the commonly used and phylogenetically-informative genes including 16S rRNA, 12S rRNA, COI, Cyt b and D-loop with respect to complete mitochondrial genome. The tree topologies from the whole mitochondrial genome of 12 species were not identical albeit similar with those resulting from the five individual genes mentioned above. High bootstrap values were observed for mtDNA compared with that of any single gene. The average pair-wise sequence divergence using different genetic modes was found to be: D-loop (0.229) > Cyt b (0.159) > COI or complete mtDNA (0.143) > 12S rRNA (0.094) > 16S rRNA (0.091). The tree resulting from complete mtDNA clearly separated the 12 taxa of Bovidae into 3 major clusters, one cluster each for subfamily Cervinae and Bovinae and the third cluster comprised the distinctive clades of Caprinae and Antilopinae. However, jumping clades of Antilopinae were observed while using the individual genes. This study showed that Bison bison and Bos Taurus have very close phylogenetic relationship compared to Bubalus bubalis (Bovinae), irrespective of the method used. Our findings suggest that complete mtDNA genome provides most reliable understanding of complex phylogenetic relationships while the reliability of individual gene trees should be verified with high bootstrap support. PMID:22399841

  1. Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach

    PubMed Central

    Boitard, Simon; Rodríguez, Willy; Jay, Flora; Mona, Stefano; Austerlitz, Frédéric

    2016-01-01

    Inferring the ancestral dynamics of effective population size is a long-standing question in population genetics, which can now be tackled much more accurately thanks to the massive genomic data available in many species. Several promising methods that take advantage of whole-genome sequences have been recently developed in this context. However, they can only be applied to rather small samples, which limits their ability to estimate recent population size history. Besides, they can be very sensitive to sequencing or phasing errors. Here we introduce a new approximate Bayesian computation approach named PopSizeABC that allows estimating the evolution of the effective population size through time, using a large sample of complete genomes. This sample is summarized using the folded allele frequency spectrum and the average zygotic linkage disequilibrium at different bins of physical distance, two classes of statistics that are widely used in population genetics and can be easily computed from unphased and unpolarized SNP data. Our approach provides accurate estimations of past population sizes, from the very first generations before present back to the expected time to the most recent common ancestor of the sample, as shown by simulations under a wide range of demographic scenarios. When applied to samples of 15 or 25 complete genomes in four cattle breeds (Angus, Fleckvieh, Holstein and Jersey), PopSizeABC revealed a series of population declines, related to historical events such as domestication or modern breed creation. We further highlight that our approach is robust to sequencing errors, provided summary statistics are computed from SNPs with common alleles. PMID:26943927

  2. Genome-Wide SNP Discovery, Genotyping and Their Preliminary Applications for Population Genetic Inference in Spotted Sea Bass (Lateolabrax maculatus)

    PubMed Central

    Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian

    2016-01-01

    Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus. PMID:27336696

  3. Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach.

    PubMed

    Boitard, Simon; Rodríguez, Willy; Jay, Flora; Mona, Stefano; Austerlitz, Frédéric

    2016-03-01

    Inferring the ancestral dynamics of effective population size is a long-standing question in population genetics, which can now be tackled much more accurately thanks to the massive genomic data available in many species. Several promising methods that take advantage of whole-genome sequences have been recently developed in this context. However, they can only be applied to rather small samples, which limits their ability to estimate recent population size history. Besides, they can be very sensitive to sequencing or phasing errors. Here we introduce a new approximate Bayesian computation approach named PopSizeABC that allows estimating the evolution of the effective population size through time, using a large sample of complete genomes. This sample is summarized using the folded allele frequency spectrum and the average zygotic linkage disequilibrium at different bins of physical distance, two classes of statistics that are widely used in population genetics and can be easily computed from unphased and unpolarized SNP data. Our approach provides accurate estimations of past population sizes, from the very first generations before present back to the expected time to the most recent common ancestor of the sample, as shown by simulations under a wide range of demographic scenarios. When applied to samples of 15 or 25 complete genomes in four cattle breeds (Angus, Fleckvieh, Holstein and Jersey), PopSizeABC revealed a series of population declines, related to historical events such as domestication or modern breed creation. We further highlight that our approach is robust to sequencing errors, provided summary statistics are computed from SNPs with common alleles.

  4. Genome-Wide SNP Discovery, Genotyping and Their Preliminary Applications for Population Genetic Inference in Spotted Sea Bass (Lateolabrax maculatus).

    PubMed

    Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian

    2016-01-01

    Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus. PMID:27336696

  5. Phylogeny and physiology of candidate phylum 'Atribacteria' (OP9/JS1) inferred from cultivation-independent genomics.

    PubMed

    Nobu, Masaru K; Dodsworth, Jeremy A; Murugapiran, Senthil K; Rinke, Christian; Gies, Esther A; Webster, Gordon; Schwientek, Patrick; Kille, Peter; Parkes, R John; Sass, Henrik; Jørgensen, Bo B; Weightman, Andrew J; Liu, Wen-Tso; Hallam, Steven J; Tsiamis, George; Woyke, Tanja; Hedlund, Brian P

    2016-02-01

    The 'Atribacteria' is a candidate phylum in the Bacteria recently proposed to include members of the OP9 and JS1 lineages. OP9 and JS1 are globally distributed, and in some cases abundant, in anaerobic marine sediments, geothermal environments, anaerobic digesters and reactors and petroleum reservoirs. However, the monophyly of OP9 and JS1 has been questioned and their physiology and ecology remain largely enigmatic due to a lack of cultivated representatives. Here cultivation-independent genomic approaches were used to provide a first comprehensive view of the phylogeny, conserved genomic features and metabolic potential of members of this ubiquitous candidate phylum. Previously available and heretofore unpublished OP9 and JS1 single-cell genomic data sets were used as recruitment platforms for the reconstruction of atribacterial metagenome bins from a terephthalate-degrading reactor biofilm and from the monimolimnion of meromictic Sakinaw Lake. The single-cell genomes and metagenome bins together comprise six species- to genus-level groups that represent most major lineages within OP9 and JS1. Phylogenomic analyses of these combined data sets confirmed the monophyly of the 'Atribacteria' inclusive of OP9 and JS1. Additional conserved features within the 'Atribacteria' were identified, including a gene cluster encoding putative bacterial microcompartments that may be involved in aldehyde and sugar metabolism, energy conservation and carbon storage. Comparative analysis of the metabolic potential inferred from these data sets revealed that members of the 'Atribacteria' are likely to be heterotrophic anaerobes that lack respiratory capacity, with some lineages predicted to specialize in either primary fermentation of carbohydrates or secondary fermentation of organic acids, such as propionate. PMID:26090992

  6. Phylogeny and physiology of candidate phylum 'Atribacteria' (OP9/JS1) inferred from cultivation-independent genomics.

    PubMed

    Nobu, Masaru K; Dodsworth, Jeremy A; Murugapiran, Senthil K; Rinke, Christian; Gies, Esther A; Webster, Gordon; Schwientek, Patrick; Kille, Peter; Parkes, R John; Sass, Henrik; Jørgensen, Bo B; Weightman, Andrew J; Liu, Wen-Tso; Hallam, Steven J; Tsiamis, George; Woyke, Tanja; Hedlund, Brian P

    2016-02-01

    The 'Atribacteria' is a candidate phylum in the Bacteria recently proposed to include members of the OP9 and JS1 lineages. OP9 and JS1 are globally distributed, and in some cases abundant, in anaerobic marine sediments, geothermal environments, anaerobic digesters and reactors and petroleum reservoirs. However, the monophyly of OP9 and JS1 has been questioned and their physiology and ecology remain largely enigmatic due to a lack of cultivated representatives. Here cultivation-independent genomic approaches were used to provide a first comprehensive view of the phylogeny, conserved genomic features and metabolic potential of members of this ubiquitous candidate phylum. Previously available and heretofore unpublished OP9 and JS1 single-cell genomic data sets were used as recruitment platforms for the reconstruction of atribacterial metagenome bins from a terephthalate-degrading reactor biofilm and from the monimolimnion of meromictic Sakinaw Lake. The single-cell genomes and metagenome bins together comprise six species- to genus-level groups that represent most major lineages within OP9 and JS1. Phylogenomic analyses of these combined data sets confirmed the monophyly of the 'Atribacteria' inclusive of OP9 and JS1. Additional conserved features within the 'Atribacteria' were identified, including a gene cluster encoding putative bacterial microcompartments that may be involved in aldehyde and sugar metabolism, energy conservation and carbon storage. Comparative analysis of the metabolic potential inferred from these data sets revealed that members of the 'Atribacteria' are likely to be heterotrophic anaerobes that lack respiratory capacity, with some lineages predicted to specialize in either primary fermentation of carbohydrates or secondary fermentation of organic acids, such as propionate.

  7. Causal inference of gene regulation with subnetwork assembly from genetical genomics data

    PubMed Central

    Peng, Chien-Hua; Jiang, Yi-Zhi; Tai, An-Shun; Liu, Chun-Bin; Peng, Shih-Chi; Liao, Chun-Ta; Yen, Tzu-Chen; Hsieh, Wen-Ping

    2014-01-01

    Deciphering the causal networks of gene interactions is critical for identifying disease pathways and disease-causing genes. We introduce a method to reconstruct causal networks based on exploring phenotype-specific modules in the human interactome and including the expression quantitative trait loci (eQTLs) that underlie the joint expression variation of each module. Closely associated eQTLs help anchor the orientation of the network. To overcome the inherent computational complexity of causal network reconstruction, we first deduce the local causality of individual subnetworks using the selected eQTLs and module transcripts. These subnetworks are then integrated to infer a global causal network using a random-field ranking method, which was motivated by animal sociology. We demonstrate how effectively the inferred causality restores the regulatory structure of the networks that mediate lymph node metastasis in oral cancer. Network rewiring clearly characterizes the dynamic regulatory systems of distinct disease states. This study is the first to associate an RXRB-causal network with increased risks of nodal metastasis, tumor relapse, distant metastases and poor survival for oral cancer. Thus, identifying crucial upstream drivers of a signal cascade can facilitate the discovery of potential biomarkers and effective therapeutic targets. PMID:24322297

  8. Karyotypic evolution of the family Sciuridae: inferences from the genome organizations of ground squirrels.

    PubMed

    Li, T; Wang, J; Su, W; Nie, W; Yang, F

    2006-01-01

    Cross-species chromosome painting has made a great contribution to our understanding of the evolution of karyotypes and genome organizations of mammals. Several recent papers of comparative painting between tree and flying squirrels have shed some light on the evolution of the family Sciuridae and the order Rodentia. In the present study we have extended the comparative painting to the Himalayan marmot (Marmotahimalayana) and the African ground squirrel (Xerus cf. erythropus), i.e. representative species from another important squirrel group--the ground squirrels--, and have established genome-wide comparative chromosome maps between human, eastern gray squirrel, and these two ground squirrels. The results show that 1) the squirrels so far studied all have conserved karyotypes that resemble the ancestral karyotype of the order Rodentia; 2) the African ground squirrels could have retained the ancestral karyotype of the family Sciuridae. Furthermore, we have mapped the evolutionary rearrangements onto a molecular-based consensus phylogenetic tree of the family Sciuridae.

  9. Simple Math is Enough: Two Examples of Inferring Functional Associations from Genomic Data

    NASA Technical Reports Server (NTRS)

    Liang, Shoudan

    2003-01-01

    Non-random features in the genomic data are usually biologically meaningful. The key is to choose the feature well. Having a p-value based score prioritizes the findings. If two proteins share a unusually large number of common interaction partners, they tend to be involved in the same biological process. We used this finding to predict the functions of 81 un-annotated proteins in yeast.

  10. Low rate of genomic repatterning in Xenarthra inferred from chromosome painting data.

    PubMed

    Dobigny, G; Yang, F; O'Brien, P C M; Volobouev, V; Kovács, A; Pieczarka, J C; Ferguson-Smith, M A; Robinson, T J

    2005-01-01

    Comparative cytogenetic studies on Xenarthra, one of the most basal mammalian clades in the Placentalia, are virtually absent, being restricted largely to descriptions of conventional karyotypes and diploid numbers. We present a molecular cytogenetic comparison of chromosomes from the two-toed (Choloepus didactylus, 2n = 65) and three-toed sloth species (Bradypus tridactylus, 2n = 52), an anteater (Tamandua tetradactyla, 2n = 54) which, together with some data on the six-banded armadillo (Euphractus sexcinctus, 2n = 58), collectively represent all the major xenarthran lineages. Our results, based on interspecific chromosome painting using flow-sorted two-toed sloth chromosomes as painting probes, show the sloth species to be karyotypically closely related but markedly different from the anteater. We also test the synteny disruptions and segmental associations identified within Pilosa (anteaters and sloths) against the chromosomes of the six-banded armadillo as outgroup taxon. We could thus polarize the 35 non-ambiguously identified chromosomal changes characterizing the evolution of the anteater and sloth genomes and map these to a published sequence-based phylogeny for the group. These data suggest a low rate of genomic repatterning when placed in the context of divergence estimates based on molecular and fossil data. Finally, our results provide a glimpse of a likely ancestral karyotype for the extant Xenarthra, a pivotal group for understanding eutherian genome evolution.

  11. Remarkable variation in maize genome structure inferred from haplotype diversity at the bz locus

    PubMed Central

    Wang, Qinghua; Dooner, Hugo K.

    2006-01-01

    Maize is probably the most diverse of all crop species. Unexpectedly large differences among haplotypes were first revealed in a comparison of the bz genomic regions of two different inbred lines, McC and B73. Retrotransposon clusters, which comprise most of the repetitive DNA in maize, varied markedly in makeup, and location relative to the genes in the region and genic sequences, later shown to be carried by two helitron transposons, also differed between the inbreds. Thus, the allelic bz regions of these Corn Belt inbreds shared only a minority of the total sequence. To investigate further the variation caused by retrotransposons, helitrons, and other insertions, we have analyzed the organization of the bz genomic region in five additional cultivars selected because of their geographic and genetic diversity: the inbreds A188, CML258, and I137TN, and the land races Coroico and NalTel. This vertical comparison has revealed the existence of several new helitrons, new retrotransposons, members of every superfamily of DNA transposons, numerous miniature elements, and novel insertions flanked at either end by TA repeats, which we call TAFTs (TA-flanked transposons). The extent of variation in the region is remarkable. In pairwise comparisons of eight bz haplotypes, the percentage of shared sequences ranges from 25% to 84%. Chimeric haplotypes were identified that combine retrotransposon clusters found in different haplotypes. We propose that recombination in the common gene space greatly amplifies the variability produced by the retrotransposition explosion in the maize ancestry, creating the heterogeneity in genome organization found in modern maize. PMID:17101975

  12. ARG-walker: inference of individual specific strengths of meiotic recombination hotspots by population genomics analysis

    PubMed Central

    2015-01-01

    Background Meiotic recombination hotspots play important roles in various aspects of genomics, but the underlying mechanisms for regulating the locations and strengths of recombination hotspots are not yet fully revealed. Most existing algorithms for estimating recombination rates from sequence polymorphism data can only output average recombination rates of a population, although there is evidence for the heterogeneity in recombination rates among individuals. For genome-wide association studies (GWAS) of recombination hotspots, an efficient algorithm that estimates the individualized strengths of recombination hotspots is highly desirable. Results In this work, we propose a novel graph mining algorithm named ARG-walker, based on random walks on ancestral recombination graphs (ARG), to estimate individual-specific recombination hotspot strengths. Extensive simulations demonstrate that ARG-walker is able to distinguish the hot allele of a recombination hotspot from the cold allele. Integrated with output of ARG-walker, we performed GWAS on the phased haplotype data of the 22 autosome chromosomes of the HapMap Asian population samples of Chinese and Japanese (JPT+CHB). Significant cis-regulatory signals have been detected, which is corroborated by the enrichment of the well-known 13-mer motif CCNCCNTNNCCNC of PRDM9 protein. Moreover, two new DNA motifs have been identified in the flanking regions of the significantly associated SNPs (single nucleotide polymorphisms), which are likely to be new cis-regulatory elements of meiotic recombination hotspots of the human genome. Conclusions Our results on both simulated and real data suggest that ARG-walker is a promising new method for estimating the individual recombination variations. In the future, it could be used to uncover the mechanisms of recombination regulation and human diseases related with recombination hotspots. PMID:26679564

  13. High-throughput single-cell sequencing identifies photoheterotrophs and chemoautotrophs in freshwater bacterioplankton

    PubMed Central

    Martinez-Garcia, Manuel; Swan, Brandon K; Poulton, Nicole J; Gomez, Monica Lluesma; Masland, Dashiell; Sieracki, Michael E; Stepanauskas, Ramunas

    2012-01-01

    Recent discoveries suggest that photoheterotrophs (rhodopsin-containing bacteria (RBs) and aerobic anoxygenic phototrophs (AAPs)) and chemoautotrophs may be significant for marine and freshwater ecosystem productivity. However, their abundance and taxonomic identities remain largely unknown. We used a combination of single-cell and metagenomic DNA sequencing to study the predominant photoheterotrophs and chemoautotrophs inhabiting the euphotic zone of temperate, physicochemically diverse freshwater lakes. Multi-locus sequencing of 712 single amplified genomes, generated by fluorescence-activated cell sorting and whole genome multiple displacement amplification, showed that most of the cosmopolitan freshwater clusters contain photoheterotrophs. These comprised at least 10–23% of bacterioplankton, and RBs were the dominant fraction. Our data demonstrate that Actinobacteria, including clusters acI, Luna and acSTL, are the predominant freshwater RBs. We significantly broaden the known taxonomic range of freshwater RBs, to include Alpha-, Beta-, Gamma- and Deltaproteobacteria, Verrucomicrobia and Sphingobacteria. By sequencing single cells, we found evidence for inter-phyla horizontal gene transfer and recombination of rhodopsin genes and identified specific taxonomic groups involved in these evolutionary processes. Our data suggest that members of the ubiquitous betaproteobacteria Polynucleobacter spp. are the dominant AAPs in temperate freshwater lakes. Furthermore, the RuBisCO (ribulose 1,5-bisphosphate carboxylase/oxygenase) gene was found in several single cells of Betaproteobacteria, Bacteroidetes and Gammaproteobacteria, suggesting that chemoautotrophs may be more prevalent among aerobic bacterioplankton than previously thought. This study demonstrates the power of single-cell DNA sequencing addressing previously unresolved questions about the metabolic potential and evolutionary histories of uncultured microorganisms, which dominate most natural environments

  14. The green impact: bacterioplankton response toward a phytoplankton spring bloom in the southern North Sea assessed by comparative metagenomic and metatranscriptomic approaches

    PubMed Central

    Wemheuer, Bernd; Wemheuer, Franziska; Hollensteiner, Jacqueline; Meyer, Frauke-Dorothee; Voget, Sonja; Daniel, Rolf

    2015-01-01

    Phytoplankton blooms exhibit a severe impact on bacterioplankton communities as they change nutrient availabilities and other environmental factors. In the current study, the response of a bacterioplankton community to a Phaeocystis globosa spring bloom was investigated in the southern North Sea. For this purpose, water samples were taken inside and reference samples outside of an algal spring bloom. Structural changes of the bacterioplankton community were assessed by amplicon-based analysis of 16S rRNA genes and transcripts generated from environmental DNA and RNA, respectively. Several marine groups responded to bloom presence. The abundance of the Roseobacter RCA cluster and the SAR92 clade significantly increased in bloom presence in the total and active fraction of the bacterial community. Functional changes were investigated by direct sequencing of environmental DNA and mRNA. The corresponding datasets comprised more than 500 million sequences across all samples. Metatranscriptomic data sets were mapped on representative genomes of abundant marine groups present in the samples and on assembled metagenomic and metatranscriptomic datasets. Differences in gene expression profiles between non-bloom and bloom samples were recorded. The genome-wide gene expression level of Planktomarina temperata, an abundant member of the Roseobacter RCA cluster, was higher inside the bloom. Genes that were differently expressed included transposases, which showed increased expression levels inside the bloom. This might contribute to the adaptation of this organism toward environmental stresses through genome reorganization. In addition, several genes affiliated to the SAR92 clade were significantly upregulated inside the bloom including genes encoding for proteins involved in isoleucine and leucine incorporation. Obtained results provide novel insights into compositional and functional variations of marine bacterioplankton communities as response to a phytoplankton bloom. PMID

  15. Karyotypic evolution of the family Sciuridae: inferences from the genome organizations of ground squirrels.

    PubMed

    Li, T; Wang, J; Su, W; Nie, W; Yang, F

    2006-01-01

    Cross-species chromosome painting has made a great contribution to our understanding of the evolution of karyotypes and genome organizations of mammals. Several recent papers of comparative painting between tree and flying squirrels have shed some light on the evolution of the family Sciuridae and the order Rodentia. In the present study we have extended the comparative painting to the Himalayan marmot (Marmotahimalayana) and the African ground squirrel (Xerus cf. erythropus), i.e. representative species from another important squirrel group--the ground squirrels--, and have established genome-wide comparative chromosome maps between human, eastern gray squirrel, and these two ground squirrels. The results show that 1) the squirrels so far studied all have conserved karyotypes that resemble the ancestral karyotype of the order Rodentia; 2) the African ground squirrels could have retained the ancestral karyotype of the family Sciuridae. Furthermore, we have mapped the evolutionary rearrangements onto a molecular-based consensus phylogenetic tree of the family Sciuridae. PMID:16484783

  16. Inferring Properties of Ancient Cyanobacteria from Biogeochemical Activity and Genomes of Siderophilic Cyanobacteria

    NASA Technical Reports Server (NTRS)

    McKay, David S.; Brown, I. I.; Tringe, S. G.; Thomas-Keprta, K. E.; Bryant, D. A.; Sarkisova, S. S.; Malley, K.; Sosa, O.; Klatt, C. G.; McKay, D. S.

    2010-01-01

    Interrelationships between life and the planetary system could have simultaneously left landmarks in genomes of microbes and physicochemical signatures in the lithosphere. Verifying the links between genomic features in living organisms and the mineralized signatures generated by these organisms will help to reveal traces of life on Earth and beyond. Among contemporary environments, iron-depositing hot springs (IDHS) may represent one of the most appropriate natural models [1] for insights into ancient life since organisms may have originated on Earth and probably Mars in association with hydrothermal activity [2,3]. IDHS also seem to be appropriate models for studying certain biogeochemical processes that could have taken place in the late Archean and,-or early Paleoproterozoic eras [4, 5]. It has been suggested that inorganic polyphosphate (PPi), in chains of tens to hundreds of phosphate residues linked by high-energy bonds, is environmentally ubiquitous and abundant [6]. Cyanobacteria (CB) react to increased heavy metal concentrations and UV by enhanced generation of PPi bodies (PPB) [7], which are believed to be signatures of life [8]. However, the role of PPi in oxygenic prokaryotes for the suppression of oxidative stress induced by high Fe is poorly studied. Here we present preliminary results of a new mechanism of Fe mineralization in oxygenic prokaryotes, the effect of Fe on the generation of PPi bodies in CB, as well as preliminary analysis of the diversity and phylogeny of proteins involved in the prevention of oxidative stress in phototrophs inhabiting IDHS.

  17. Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes

    PubMed Central

    Hodgson, Jason A.; Burrell, Andrew S.; Sterner, Kirstin N.; Raaum, Ryan L.; Disotell, Todd R.

    2014-01-01

    The origins and the divergence times of the most basal lineages within primates have been difficult to resolve mainly due to the incomplete sampling of early fossil taxa. The main source of contention is related to the discordance between molecular and fossil estimates: while there are no crown primate fossils older than 56 Ma, most molecule-based estimates extend the origins of crown primates into the Cretaceous. Here we present a comprehensive mitogenomic study of primates. We assembled 87 mammalian mitochondrial genomes, including 62 primate species representing all the families of the order. We newly sequenced eleven mitochondrial genomes, including eight Old World monkeys and three strepsirrhines. Phylogenetic analyses support a strong topology, confirming the monophyly for all the major primate clades. In contrast to previous mitogenomic studies, the positions of tarsiers and colugos relative to strepsirrhines and anthropoids are well resolved. In order to improve our understanding of how fossil calibrations affect age estimates within primates, we explore the effect of seventeen fossil calibrations across primates and other mammalian groups and we select a subset of calibrations to date our mitogenomic tree. The divergence date estimates of the Strepsirrhine/Haplorhine split support an origin of crown primates in the Late Cretaceous, at around 74 Ma. This result supports a short fuse model of primate origins, whereby relatively little time passed between the origin of the order and the diversification of its major clades. It also suggests that the early primate fossil record is likely poorly sampled. PMID:24583291

  18. King penguin demography since the last glaciation inferred from genome-wide data

    PubMed Central

    Trucchi, Emiliano; Gratton, Paolo; Whittington, Jason D.; Cristofari, Robin; Le Maho, Yvon; Stenseth, Nils Chr; Le Bohec, Céline

    2014-01-01

    How natural climate cycles, such as past glacial/interglacial patterns, have shaped species distributions at the high-latitude regions of the Southern Hemisphere is still largely unclear. Here, we show how the post-glacial warming following the Last Glacial Maximum (ca 18 000 years ago), allowed the (re)colonization of the fragmented sub-Antarctic habitat by an upper-level marine predator, the king penguin Aptenodytes patagonicus. Using restriction site-associated DNA sequencing and standard mitochondrial data, we tested the behaviour of subsets of anonymous nuclear loci in inferring past demography through coalescent-based and allele frequency spectrum analyses. Our results show that the king penguin population breeding on Crozet archipelago steeply increased in size, closely following the Holocene warming recorded in the Epica Dome C ice core. The following population growth can be explained by a threshold model in which the ecological requirements of this species (year-round ice-free habitat for breeding and access to a major source of food such as the Antarctic Polar Front) were met on Crozet soon after the Pleistocene/Holocene climatic transition. PMID:24920481

  19. King penguin demography since the last glaciation inferred from genome-wide data.

    PubMed

    Trucchi, Emiliano; Gratton, Paolo; Whittington, Jason D; Cristofari, Robin; Le Maho, Yvon; Stenseth, Nils Chr; Le Bohec, Céline

    2014-07-22

    How natural climate cycles, such as past glacial/interglacial patterns, have shaped species distributions at the high-latitude regions of the Southern Hemisphere is still largely unclear. Here, we show how the post-glacial warming following the Last Glacial Maximum (ca 18 000 years ago), allowed the (re)colonization of the fragmented sub-Antarctic habitat by an upper-level marine predator, the king penguin Aptenodytes patagonicus. Using restriction site-associated DNA sequencing and standard mitochondrial data, we tested the behaviour of subsets of anonymous nuclear loci in inferring past demography through coalescent-based and allele frequency spectrum analyses. Our results show that the king penguin population breeding on Crozet archipelago steeply increased in size, closely following the Holocene warming recorded in the Epica Dome C ice core. The following population growth can be explained by a threshold model in which the ecological requirements of this species (year-round ice-free habitat for breeding and access to a major source of food such as the Antarctic Polar Front) were met on Crozet soon after the Pleistocene/Holocene climatic transition. PMID:24920481

  20. King penguin demography since the last glaciation inferred from genome-wide data.

    PubMed

    Trucchi, Emiliano; Gratton, Paolo; Whittington, Jason D; Cristofari, Robin; Le Maho, Yvon; Stenseth, Nils Chr; Le Bohec, Céline

    2014-07-22

    How natural climate cycles, such as past glacial/interglacial patterns, have shaped species distributions at the high-latitude regions of the Southern Hemisphere is still largely unclear. Here, we show how the post-glacial warming following the Last Glacial Maximum (ca 18 000 years ago), allowed the (re)colonization of the fragmented sub-Antarctic habitat by an upper-level marine predator, the king penguin Aptenodytes patagonicus. Using restriction site-associated DNA sequencing and standard mitochondrial data, we tested the behaviour of subsets of anonymous nuclear loci in inferring past demography through coalescent-based and allele frequency spectrum analyses. Our results show that the king penguin population breeding on Crozet archipelago steeply increased in size, closely following the Holocene warming recorded in the Epica Dome C ice core. The following population growth can be explained by a threshold model in which the ecological requirements of this species (year-round ice-free habitat for breeding and access to a major source of food such as the Antarctic Polar Front) were met on Crozet soon after the Pleistocene/Holocene climatic transition.

  1. Integrating gene and protein expression data with genome-scale metabolic networks to infer functional pathways

    PubMed Central

    2013-01-01

    Background The study of cellular metabolism in the context of high-throughput -omics data has allowed us to decipher novel mechanisms of importance in biotechnology and health. To continue with this progress, it is essential to efficiently integrate experimental data into metabolic modeling. Results We present here an in-silico framework to infer relevant metabolic pathways for a particular phenotype under study based on its gene/protein expression data. This framework is based on the Carbon Flux Path (CFP) approach, a mixed-integer linear program that expands classical path finding techniques by considering additional biophysical constraints. In particular, the objective function of the CFP approach is amended to account for gene/protein expression data and influence obtained paths. This approach is termed integrative Carbon Flux Path (iCFP). We show that gene/protein expression data also influences the stoichiometric balancing of CFPs, which provides a more accurate picture of active metabolic pathways. This is illustrated in both a theoretical and real scenario. Finally, we apply this approach to find novel pathways relevant in the regulation of acetate overflow metabolism in Escherichia coli. As a result, several targets which could be relevant for better understanding of the phenomenon leading to impaired acetate overflow are proposed. Conclusions A novel mathematical framework that determines functional pathways based on gene/protein expression data is presented and validated. We show that our approach is able to provide new insights into complex biological scenarios such as acetate overflow in Escherichia coli. PMID:24314206

  2. Evolutionary landscape of amphibians emerging from ancient freshwater fish inferred from complete mitochondrial genomes.

    PubMed

    Wang, Xiao-Tong; Zhang, Yan-Feng; Wu, Qian; Zhang, Hao

    2012-05-01

    It is very interesting that the only extant marine amphibian is the marine frog, Fejervarya cancrivora. This study investigated the reasons for this apparent rarity by conducting a phylogenetic tree analysis of the complete mitochondrial genomes from 14 amphibians, 67 freshwater fishes, four migratory fishes, 35 saltwater fishes, and one hemichordate. The results showed that amphibians, living fossil fishes, and the common ancestors of modern fishes are phylogenetically separated. In general, amphibians, living fossil fishes, saltwater fishes, and freshwater fishes are clustered in different clades. This suggests that the ancestor of living amphibians arose from a type of primordial freshwater fish, rather than the coelacanth, lungfish, or modern saltwater fish. Modern freshwater fish and modern saltwater fish were probably separated from a common ancestor by a single event, caused by crustal movement.

  3. Conflicting genomic signals affect phylogenetic inference in four species of North American pines

    PubMed Central

    Koralewski, Tomasz E.; Mateos, Mariana; Krutovsky, Konstantin V.

    2016-01-01

    Adaptive evolutionary processes in plants may be accompanied by episodes of introgression, parallel evolution and incomplete lineage sorting that pose challenges in untangling species evolutionary history. Genus Pinus (pines) is one of the most abundant and most studied groups among gymnosperms, and a good example of a lineage where these phenomena have been observed. Pines are among the most ecologically and economically important plant species. Some, such as the pines of the southeastern USA (southern pines in subsection Australes), are subjects of intensive breeding programmes. Despite numerous published studies, the evolutionary history of Australes remains ambiguous and often controversial. We studied the phylogeny of four major southern pine species: shortleaf (Pinus echinata), slash (P. elliottii), longleaf (P. palustris) and loblolly (P. taeda), using sequences from 11 nuclear loci and maximum likelihood and Bayesian methods. Our analysis encountered resolution difficulties similar to earlier published studies. Although incomplete lineage sorting and introgression are two phenomena presumptively underlying our results, the phylogenetic inferences seem to be also influenced by the genes examined, with certain topologies supported by sets of genes sharing common putative functionalities. For example, genes involved in wood formation supported the clade echinata–taeda, genes linked to plant defence supported the clade echinata–elliottii and genes linked to water management properties supported the clade echinata–palustris. The support for these clades was very high and consistent across methods. We discuss the potential factors that could underlie these observations, including incomplete lineage sorting, hybridization and parallel or adaptive evolution. Our results likely reflect the relatively short evolutionary history of the subsection that is thought to have begun during the middle Miocene and has been influenced by climate fluctuations. PMID

  4. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding.

    PubMed

    Raj, Anil; Shim, Heejung; Gilad, Yoav; Pritchard, Jonathan K; Stephens, Matthew

    2015-01-01

    Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.

  5. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding

    PubMed Central

    Gilad, Yoav; Pritchard, Jonathan K.; Stephens, Matthew

    2015-01-01

    Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede. PMID:26406244

  6. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding.

    PubMed

    Raj, Anil; Shim, Heejung; Gilad, Yoav; Pritchard, Jonathan K; Stephens, Matthew

    2015-01-01

    Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede. PMID:26406244

  7. Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

    PubMed

    Lewis, Cecil M

    2010-02-01

    This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America.

  8. Rock, paper, scissors: harnessing complementarity in ortholog detection methods improves comparative genomic inference.

    PubMed

    Maher, M Cyrus; Hernandez, Ryan D

    2015-04-01

    Ortholog detection (OD) is a lynchpin of most statistical methods in comparative genomics. This task involves accurately identifying genes across species that descend from a common ancestral sequence. OD methods comprise a wide variety of approaches, each with their own benefits and costs under a variety of evolutionary and practical scenarios. In this article, we examine the proteomes of ten mammals by using four methodologically distinct, rigorously filtered OD methods. In head-to-head comparisons, we find that these algorithms significantly outperform one another for 38-45% of the genes analyzed. We leverage this high complementarity through the development MOSAIC, or Multiple Orthologous Sequence Analysis and Integration by Cluster optimization, the first tool for integrating methodologically diverse OD methods. Relative to the four methods examined, MOSAIC more than quintuples the number of alignments for which all species are present while simultaneously maintaining or improving functional-, phylogenetic-, and sequence identity-based measures of ortholog quality. Further, this improvement in alignment quality yields more confidently aligned sites and higher levels of overall conservation, while simultaneously detecting of up to 180% more positively selected sites. We close by highlighting a MOSAIC-specific positively selected sites near the active site of TPSAB1, an enzyme linked to asthma, heart disease, and irritable bowel disease. MOSAIC alignments, source code, and full documentation are available at http://pythonhosted.org/bio-MOSAIC.

  9. Morphological homoplasy, life history evolution, and historical biogeography of plethodontid salamanders inferred from complete mitochondrial genomes

    SciTech Connect

    Mueller, Rachel Lockridge; Macey, J. Robert; Jaekel, Martin; Wake, David B.; Boore, Jeffrey L.

    2004-08-01

    The evolutionary history of the largest salamander family (Plethodontidae) is characterized by extreme morphological homoplasy. Analysis of the mechanisms generating such homoplasy requires an independent, molecular phylogeny. To this end, we sequenced 24 complete mitochondrial genomes (22 plethodontids and two outgroup taxa), added data for three species from GenBank, and performed partitioned and unpartitioned Bayesian, ML, and MP phylogenetic analyses. We explored four dataset partitioning strategies to account for evolutionary process heterogeneity among genes and codon positions, all of which yielded increased model likelihoods and decreased numbers of supported nodes in the topologies (PP > 0.95) relative to the unpartitioned analysis. Our phylogenetic analyses yielded congruent trees that contrast with the traditional morphology-based taxonomy; the monophyly of three out of four major groups is rejected. Reanalysis of current hypotheses in light of these new evolutionary relationships suggests that (1) a larval life history stage re-evolved from a direct-developing ancestor multiple times, (2) there is no phylogenetic support for the ''Out of Appalachia'' hypothesis of plethodontid origins, and (3) novel scenarios must be reconstructed for the convergent evolution of projectile tongues, reduction in toe number, and specialization for defensive tail loss. Some of these novel scenarios imply morphological transformation series that proceed in the opposite direction than was previously thought. In addition, they suggest surprising evolutionary lability in traits previously interpreted to be conservative.

  10. Genome-scale co-evolutionary inference identifies functions and clients of bacterial Hsp90.

    PubMed

    Press, Maximilian O; Li, Hui; Creanza, Nicole; Kramer, Günter; Queitsch, Christine; Sourjik, Victor; Borenstein, Elhanan

    2013-01-01

    The molecular chaperone Hsp90 is essential in eukaryotes, in which it facilitates the folding of developmental regulators and signal transduction proteins known as Hsp90 clients. In contrast, Hsp90 is not essential in bacteria, and a broad characterization of its molecular and organismal function is lacking. To enable such characterization, we used a genome-scale phylogenetic analysis to identify genes that co-evolve with bacterial Hsp90. We find that genes whose gain and loss were coordinated with Hsp90 throughout bacterial evolution tended to function in flagellar assembly, chemotaxis, and bacterial secretion, suggesting that Hsp90 may aid assembly of protein complexes. To add to the limited set of known bacterial Hsp90 clients, we further developed a statistical method to predict putative clients. We validated our predictions by demonstrating that the flagellar protein FliN and the chemotaxis kinase CheA behaved as Hsp90 clients in Escherichia coli, confirming the predicted role of Hsp90 in chemotaxis and flagellar assembly. Furthermore, normal Hsp90 function is important for wild-type motility and/or chemotaxis in E. coli. This novel function of bacterial Hsp90 agreed with our subsequent finding that Hsp90 is associated with a preference for multiple habitats and may therefore face a complex selection regime. Taken together, our results reveal previously unknown functions of bacterial Hsp90 and open avenues for future experimental exploration by implicating Hsp90 in the assembly of membrane protein complexes and adaptation to novel environments. PMID:23874229

  11. [Characteristics of bacterioplankton in the Loess-containing Lake Khanka].

    PubMed

    Shchur, L A; Aponasenko, A D; Ladygina, V P; Lopatin, V N; Makarskaia, G V

    2000-01-01

    Some characteristics of bacterioplankton--generation time, daily (P) and specific (P/B) bacterioplankton production, and bacterial metabolic coefficient K2--in the loess-containing Lake Khanka were determined using five modifications of the bacterial-count procedure with the fluorescent dyes fluorescamin and erythrosin. Experiments showed that the organomineral complex (OMC) in this lake is broken down by chemoorganoheterotrophic bacteria. The increase in the loess content of the lake water intensified bacterial growth and the cycles of potassium, silicon, and other biogenic elements. The addition of starch to a loess suspension activated the breakdown of OMC due to the adsorption of starch on the OMC/water interface and stimulation of the metabolism of attached bacteria.

  12. Phylogeny and biogeography of the family Salamandridae (Amphibia: Caudata) inferred from complete mitochondrial genomes.

    PubMed

    Zhang, Peng; Papenfuss, Theodore J; Wake, Marvalee H; Qu, Lianghu; Wake, David B

    2008-11-01

    Phylogenetic relationships of members of the salamander family Salamandridae were examined using complete mitochondrial genomes collected from 42 species representing all 20 salamandrid genera and five outgroup taxa. Weighted maximum parsimony, partitioned maximum likelihood, and partitioned Bayesian approaches all produce an identical, well-resolved phylogeny; most branches are strongly supported with greater than 90% bootstrap values and 1.0 Bayesian posterior probabilities. Our results support recent taxonomic changes in finding the traditional genera Mertensiella, Euproctus, and Triturus to be non-monophyletic species assemblages. We successfully resolved the current polytomy at the base of the salamandrid tree: the Italian newt genus Salamandrina is sister to all remaining salamandrids. Beyond Salamandrina, a clade comprising all remaining newts is separated from a clade containing the true salamanders. Among these newts, the branching orders of well-supported clades are: primitive newts (Echinotriton, Pleurodeles, and Tylototriton), New World newts (Notophthalmus-Taricha), Corsica-Sardinia newts (Euproctus), and modern European newts (Calotriton, Lissotriton, Mesotriton, Neurergus, Ommatotriton, and Triturus) plus modern Asian newts (Cynops, Pachytriton, and Paramesotriton).Two alternative sets of calibration points and two Bayesian dating methods (BEAST and MultiDivTime) were used to estimate timescales for salamandrid evolution. The estimation difference by dating methods is slight and we propose two sets of timescales based on different calibration choices. The two timescales suggest that the initial diversification of extant salamandrids took place in Europe about 97 or 69Ma. North American salamandrids were derived from their European ancestors by dispersal through North Atlantic Land Bridges in the Late Cretaceous ( approximately 69Ma) or Middle Eocene ( approximately 43Ma). Ancestors of Asian salamandrids most probably dispersed to the eastern Asia

  13. Inferring the progression of multifocal liver cancer from spatial and temporal genomic heterogeneity

    PubMed Central

    Shi, Jie-Yi; Xing, Qingfeng; Duan, Meng; Wang, Zhi-Chao; Yang, Liu-Xiao; Zhao, Ying-Jun; Wang, Xiao-Ying; Liu, Yun; Deng, Minghua; Ding, Zhen-Bin; Ke, Ai-Wu; Zhou, Jian; Fan, Jia; Cao, Ya; Wang, Jiping; Xi, Ruibin; Gao, Qiang

    2016-01-01

    Multifocal tumors developed either as independent tumors or as intrahepatic metastases, are very common in primary liver cancer. However, their molecular pathogenesis remains elusive. Herein, a patient with synchronous two hepatocellular carcinoma (HCC, designated as HCC-A and HCC-B) and one intrahepatic cholangiocarcinoma (ICC), as well as two postoperative recurrent tumors, was enrolled. Multiregional whole-exome sequencing was applied to these tumors to delineate the clonality and heterogeneity. The three primary tumors showed almost no overlaps in mutations and copy number variations. Within each tumor, multiregional sequencing data showed varied intratumoral heterogeneity (21.6% in HCC-A, 20.4% in HCC-B, 53.2% in ICC). The mutational profile of two recurrent tumors showed obvious similarity with HCC-A (86.7% and 86.6% respectively), rather than others, indicating that they originated from HCC-A. The evolutionary history of the two recurrent tumors indicated that intrahepatic micro-metastasis could be an early event during HCC progression. Notably, FAT4 was the only gene mutated in two primary HCCs and the recurrences. Mutation prevalence screen and functional experiments showed that FAT4, harboring somatic coding mutations in 26.7% of HCC, could potently inhibit growth and invasion of HCC cells. In HCC patients, both FAT4 expression and FAT4 mutational status significantly correlated with patient prognosis. Together, our findings suggest that spatial and temporal dissection of genomic alterations during the progression of multifocal liver cancer may help to elucidate the basis for its dismal prognosis. FAT4 acts as a putative tumor suppressor that is frequently inactivated in human HCC. PMID:26672766

  14. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  15. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  16. Phylogenetic Diversity of the Enteric Pathogen Salmonella enterica subsp. enterica Inferred from Genome-Wide Reference-Free SNP Characters

    PubMed Central

    Timme, Ruth E.; Pettengill, James B.; Allard, Marc W.; Strain, Errol; Barrangou, Rodolphe; Wehnes, Chris; Van Kessel, JoAnn S.; Karns, Jeffrey S.; Musser, Steven M.; Brown, Eric W.

    2013-01-01

    The enteric pathogen Salmonella enterica is one of the leading causes of foodborne illness in the world. The species is extremely diverse, containing more than 2,500 named serovars that are designated for their unique antigen characters and pathogenicity profiles—some are known to be virulent pathogens, while others are not. Questions regarding the evolution of pathogenicity, significance of antigen characters, diversity of clustered regularly interspaced short palindromic repeat (CRISPR) loci, among others, will remain elusive until a strong evolutionary framework is established. We present the first large-scale S. enterica subsp. enterica phylogeny inferred from a new reference-free k-mer approach of gathering single nucleotide polymorphisms (SNPs) from whole genomes. The phylogeny of 156 isolates representing 78 serovars (102 were newly sequenced) reveals two major lineages, each with many strongly supported sublineages. One of these lineages is the S. Typhi group; well nested within the phylogeny. Lineage-through-time analyses suggest there have been two instances of accelerated rates of diversification within the subspecies. We also found that antigen characters and CRISPR loci reveal different evolutionary patterns than that of the phylogeny, suggesting that a horizontal gene transfer or possibly a shared environmental acquisition might have influenced the present character distribution. Our study also shows the ability to extract reference-free SNPs from a large set of genomes and then to use these SNPs for phylogenetic reconstruction. This automated, annotation-free approach is an important step forward for bacterial disease tracking and in efficiently elucidating the evolutionary history of highly clonal organisms. PMID:24158624

  17. Redox-Specialized Bacterioplankton Metacommunity in a Temperate Estuary

    PubMed Central

    Laas, Peeter; Simm, Jaak; Lips, Inga; Lips, Urmas; Kisand, Veljo; Metsis, Madis

    2015-01-01

    This study explored the spatiotemporal dynamics of the bacterioplankton community composition in the Gulf of Finland (easternmost sub-basin of the Baltic Sea) based on phylogenetic analysis of 16S rDNA sequences acquired from community samples via pyrosequencing. Investigations of bacterioplankton in hydrographically complex systems provide good insight into the strategies by which microbes deal with spatiotemporal hydrographic gradients, as demonstrated by our research. Many ribotypes were closely affiliated with sequences isolated from environments with similar steep physiochemical gradients and/or seasonal changes, including seasonally anoxic estuaries. Hence, one of the main conclusions of this study is that marine ecosystems where oxygen and salinity gradients co-occur can be considered a habitat for a cosmopolitan metacommunity consisting of specialized groups occupying niches universal to such environments throughout the world. These niches revolve around functional capabilities to utilize different electron receptors and donors (including trace metal and single carbon compounds). On the other hand, temporal shifts in the bacterioplankton community composition at the surface layer were mainly connected to the seasonal succession of phytoplankton and the inflow of freshwater species. We also conclude that many relatively abundant populations are indigenous and well-established in the area. PMID:25860812

  18. Redox-specialized bacterioplankton metacommunity in a temperate estuary.

    PubMed

    Laas, Peeter; Simm, Jaak; Lips, Inga; Lips, Urmas; Kisand, Veljo; Metsis, Madis

    2015-01-01

    This study explored the spatiotemporal dynamics of the bacterioplankton community composition in the Gulf of Finland (easternmost sub-basin of the Baltic Sea) based on phylogenetic analysis of 16S rDNA sequences acquired from community samples via pyrosequencing. Investigations of bacterioplankton in hydrographically complex systems provide good insight into the strategies by which microbes deal with spatiotemporal hydrographic gradients, as demonstrated by our research. Many ribotypes were closely affiliated with sequences isolated from environments with similar steep physiochemical gradients and/or seasonal changes, including seasonally anoxic estuaries. Hence, one of the main conclusions of this study is that marine ecosystems where oxygen and salinity gradients co-occur can be considered a habitat for a cosmopolitan metacommunity consisting of specialized groups occupying niches universal to such environments throughout the world. These niches revolve around functional capabilities to utilize different electron receptors and donors (including trace metal and single carbon compounds). On the other hand, temporal shifts in the bacterioplankton community composition at the surface layer were mainly connected to the seasonal succession of phytoplankton and the inflow of freshwater species. We also conclude that many relatively abundant populations are indigenous and well-established in the area.

  19. Limitations to estimating bacterial cross-species transmission using genetic and genomic markers: inferences from simulation modeling

    PubMed Central

    Benavides, Julio A; Cross, Paul C; Luikart, Gordon; Creel, Scott

    2014-01-01

    Cross-species transmission (CST) of bacterial pathogens has major implications for human health, livestock, and wildlife management because it determines whether control actions in one species may have subsequent effects on other potential host species. The study of bacterial transmission has benefitted from methods measuring two types of genetic variation: variable number of tandem repeats (VNTRs) and single nucleotide polymorphisms (SNPs). However, it is unclear whether these data can distinguish between different epidemiological scenarios. We used a simulation model with two host species and known transmission rates (within and between species) to evaluate the utility of these markers for inferring CST. We found that CST estimates are biased for a wide range of parameters when based on VNTRs and a most parsimonious reconstructed phylogeny. However, estimations of CST rates lower than 5% can be achieved with relatively low bias using as low as 250 SNPs. CST estimates are sensitive to several parameters, including the number of mutations accumulated since introduction, stochasticity, the genetic difference of strains introduced, and the sampling effort. Our results suggest that, even with whole-genome sequences, unbiased estimates of CST will be difficult when sampling is limited, mutation rates are low, or for pathogens that were recently introduced. PMID:25469159

  20. Genome at Juncture of Early Human Migration: A Systematic Analysis of Two Whole Genomes and Thirteen Exomes from Kuwaiti Population Subgroup of Inferred Saudi Arabian Tribe Ancestry

    PubMed Central

    Alsmadi, Osama; Hebbar, Prashantha; Antony, Dinu; Behbehani, Kazem; Thanaraj, Thangavel Alphonse

    2014-01-01

    Population of the State of Kuwait is composed of three genetic subgroups of inferred Persian, Saudi Arabian tribe and Bedouin ancestry. The Saudi Arabian tribe subgroup traces its origin to the Najd region of Saudi Arabia. By sequencing two whole genomes and thirteen exomes from this subgroup at high coverage (>40X), we identify 4,950,724 Single Nucleotide Polymorphisms (SNPs), 515,802 indels and 39,762 structural variations. Of the identified variants, 10,098 (8.3%) exomic SNPs, 139,923 (2.9%) non-exomic SNPs, 5,256 (54.3%) exomic indels, and 374,959 (74.08%) non-exomic indels are ‘novel’. Up to 8,070 (79.9%) of the reported novel biallelic exomic SNPs are seen in low frequency (minor allele frequency <5%). We observe 5,462 known and 1,004 novel potentially deleterious nonsynonymous SNPs. Allele frequencies of common SNPs from the 15 exomes is significantly correlated with those from genotype data of a larger cohort of 48 individuals (Pearson correlation coefficient, 0.91; p <2.2×10−16). A set of 2,485 SNPs show significantly different allele frequencies when compared to populations from other continents. Two notable variants having risk alleles in high frequencies in this subgroup are: a nonsynonymous deleterious SNP (rs2108622 [19:g.15990431C>T] from CYP4F2 gene [MIM:*604426]) associated with warfarin dosage levels [MIM:#122700] required to elicit normal anticoagulant response; and a 3′ UTR SNP (rs6151429 [22:g.51063477T>C]) from ARSA gene [MIM:*607574]) associated with Metachromatic Leukodystrophy [MIM:#250100]. Hemoglobin Riyadh variant (identified for the first time in a Saudi Arabian woman) is observed in the exome data. The mitochondrial haplogroup profiles of the 15 individuals are consistent with the haplogroup diversity seen in Saudi Arabian natives, who are believed to have received substantial gene flow from Africa and eastern provenance. We present the first genome resource imperative for designing future genetic studies in Saudi Arabian

  1. Tidal switch on metabolic activity: Salinity induced responses on bacterioplankton metabolic capabilities in a tropical estuary

    NASA Astrophysics Data System (ADS)

    Thottathil, Shoji D.; Balachandran, K. K.; Jayalakshmy, K. V.; Gupta, G. V. M.; Nair, Shanta

    2008-07-01

    "Biolog" plates were used to study the changes in the metabolic capabilities of bacterioplankton over a complete tidal cycle in a tropical ecosystem (Cochin Estuary) along southwest coast of India. The pattern of utilization of carbon sources showed a definite shift in the community metabolism along a salinity gradient. Multivariate statistical analysis revealed two communities, namely allochthonous bacterioplankton sensitive to salinity and autochthonous bacterioplankton, which are tolerant to wide salinity fluctuations. Regression analysis showed salinity as the most important parameter influencing the physiological profile of bacterioplankton, irrespective of tide. Apart from salinity, limno-tolerant retrievable counts and halo-tolerant retrievable counts also accounted for the metabolic variation of bacterioplankton during low and high tides, respectively. The shift in the substrate utilization from carbohydrates to amino acids appears to be due to the physiological adaptation or nitrogen limitation of bacterial community with increasing salinity.

  2. Phylogeny and biogeography of highly diverged freshwater fish species (Leuciscinae, Cyprinidae, Teleostei) inferred from mitochondrial genome analysis.

    PubMed

    Imoto, Junichi M; Saitoh, Kenji; Sasaki, Takeshi; Yonezawa, Takahiro; Adachi, Jun; Kartavtsev, Yuri P; Miya, Masaki; Nishida, Mutsumi; Hanzawa, Naoto

    2013-02-10

    The distribution of freshwater taxa is a good biogeographic model to study pattern and process of vicariance and dispersal. The subfamily Leuciscinae (Cyprinidae, Teleostei) consists of many species distributed widely in Eurasia and North America. Leuciscinae have been divided into two phyletic groups, leuciscin and phoxinin. The phylogenetic relationships between major clades within the subfamily are poorly understood, largely because of the overwhelming diversity of the group. The origin of the Far Eastern phoxinin is an interesting question regarding the evolutionary history of Leuciscinae. Here we present phylogenetic analysis of 31 species of Leuciscinae and outgroups based on complete mitochondrial genome sequences to clarify the phylogenetic relationships and to infer the evolutionary history of the subfamily. Phylogenetic analysis suggests that the Far Eastern phoxinin species comprised the monophyletic clades Tribolodon, Pseudaspius, Oreoleuciscus and Far Eastern Phoxinus. The Far Eastern phoxinin clade was independent of other Leuciscinae lineages and was closer to North American phoxinins than European leuciscins. All of our analysis also suggested that leuciscins and phoxinins each constituted monophyletic groups. Divergence time estimation suggested that Leuciscinae species diverged from outgroups such as Tincinae to be 83.3 million years ago (Mya) in the Late Cretaceous and leuciscin and phoxinin shared a common ancestor 70.7 Mya. Radiation of Leuciscinae lineages occurred during the Late Cretaceous to Paleocene. This period also witnessed the radiation of tetrapods. Reconstruction of ancestral areas indicates Leuciscinae species originated within Europe. Leuciscin species evolved in Europe and the ancestor of phoxinin was distributed in North America. The Far Eastern phoxinins would have dispersed from North America to Far East across the Beringia land bridge. The present study suggests important roles for the continental rearrangements during the

  3. Phylogeny and biogeography of highly diverged freshwater fish species (Leuciscinae, Cyprinidae, Teleostei) inferred from mitochondrial genome analysis.

    PubMed

    Imoto, Junichi M; Saitoh, Kenji; Sasaki, Takeshi; Yonezawa, Takahiro; Adachi, Jun; Kartavtsev, Yuri P; Miya, Masaki; Nishida, Mutsumi; Hanzawa, Naoto

    2013-02-10

    The distribution of freshwater taxa is a good biogeographic model to study pattern and process of vicariance and dispersal. The subfamily Leuciscinae (Cyprinidae, Teleostei) consists of many species distributed widely in Eurasia and North America. Leuciscinae have been divided into two phyletic groups, leuciscin and phoxinin. The phylogenetic relationships between major clades within the subfamily are poorly understood, largely because of the overwhelming diversity of the group. The origin of the Far Eastern phoxinin is an interesting question regarding the evolutionary history of Leuciscinae. Here we present phylogenetic analysis of 31 species of Leuciscinae and outgroups based on complete mitochondrial genome sequences to clarify the phylogenetic relationships and to infer the evolutionary history of the subfamily. Phylogenetic analysis suggests that the Far Eastern phoxinin species comprised the monophyletic clades Tribolodon, Pseudaspius, Oreoleuciscus and Far Eastern Phoxinus. The Far Eastern phoxinin clade was independent of other Leuciscinae lineages and was closer to North American phoxinins than European leuciscins. All of our analysis also suggested that leuciscins and phoxinins each constituted monophyletic groups. Divergence time estimation suggested that Leuciscinae species diverged from outgroups such as Tincinae to be 83.3 million years ago (Mya) in the Late Cretaceous and leuciscin and phoxinin shared a common ancestor 70.7 Mya. Radiation of Leuciscinae lineages occurred during the Late Cretaceous to Paleocene. This period also witnessed the radiation of tetrapods. Reconstruction of ancestral areas indicates Leuciscinae species originated within Europe. Leuciscin species evolved in Europe and the ancestor of phoxinin was distributed in North America. The Far Eastern phoxinins would have dispersed from North America to Far East across the Beringia land bridge. The present study suggests important roles for the continental rearrangements during the

  4. Unusual bacterioplankton community structure in ultra-oligotrophic Crater Lake

    USGS Publications Warehouse

    Urbach, Ena; Vergin, Kevin L.; Morse, Ariel

    2001-01-01

    The bacterioplankton assemblage in Crater Lake, Oregon (U.S.A.), is different from communities found in other oxygenated lakes, as demonstrated by four small subunit ribosomal ribonucleic acid (SSU rRNA) gene clone libraries and oligonucleotide probe hybridization to RNA from lake water. Populations in the euphotic zone of this deep (589 m), oligotrophic caldera lake are dominated by two phylogenetic clusters of currently uncultivated bacteria: CL120-10, a newly identified cluster in the verrucomicrobiales, and ACK4 actinomycetes, known as a minor constituent of bacterioplankton in other lakes. Deep-water populations at 300 and 500 m are dominated by a different pair of uncultivated taxa: CL500-11, a novel cluster in the green nonsulfur bacteria, and group I marine crenarchaeota. b-Proteobacteria, dominant in most other freshwater environments, are relatively rare in Crater Lake (<=16% of nonchloroplast bacterial rRNA at all depths). Other taxa identified in Crater Lake libraries include a newly identified candidate bacterial division, ABY1, and a newly identified subcluster, CL0-1, within candidate division OP10. Probe analyses confirmed vertical stratification of several microbial groups, similar to patterns observed in open-ocean systems. Additional similarities between Crater Lake and ocean microbial populations include aphotic zone dominance of group I marine crenarchaeota and green nonsulfur bacteria. Comparison of Crater Lake to other lakes studied by rRNA methods suggests that selective factors structuring Crater Lake bacterioplankton populations may include low concentrations of available trace metals and dissolved organic matter, chemistry of infiltrating hydrothermal waters, and irradiation by high levels of ultraviolet light.

  5. Bacterioplankton carbon cycling along the Subtropical Frontal Zone off New Zealand

    NASA Astrophysics Data System (ADS)

    Baltar, Federico; Stuck, Esther; Morales, Sergio; Currie, Kim

    2015-06-01

    Marine heterotrophic bacterioplankton (Bacteria and Archaea) play a central role in ocean carbon cycling. As such, identifying the factors controlling these microbial populations is crucial to fully understanding carbon fluxes. We studied bacterioplankton activities along a transect crossing three water masses (i.e., Subtropical waters [STW], Sub-Antarctic waters [SAW] and neritic waters [NW]) with contrasting nutrient regimes across the Subtropical Frontal Zone. In contrast to bacterioplankton production and community respiration, bacterioplankton respiration increased in the offshore SAW, causing a seaward increase in the contribution of bacteria to community respiration (from 7% to 100%). Cell-specific bacterioplankton respiration also increased in SAW, but cell-specific production did not, suggesting that prokaryotic cells in SAW were investing more energy towards respiration than growth. This was reflected in a 5-fold decline in bacterioplankton growth efficiency (BGE) towards SAW. One way to explain this decrease in BGE could be due to the observed reduction in phytoplankton biomass (and presumably organic matter concentration) towards SAW. However, this would not explain why bacterioplankton respiration was highest in SAW, where phytoplankton biomass was lowest. Another factor affecting BGE could be the iron limitation characteristic of high-nutrient low-chlorophyll (HNLC) regions like SAW. Our field-study based evidences would agree with previous laboratory experiments in which iron stress provoked a decrease in BGE of marine bacterial isolates. Our results suggest that there is a strong gradient in bacterioplankton carbon cycling rates along the Subtropical Frontal Zone, mainly due to the HNLC conditions of SAW. We suggest that Fe-induced reduction of BGE in HNLC regions like SAW could be relevant in marine carbon cycling, inducing bacterioplankton to act as a link or a sink of organic carbon by impacting on the quantity of organic carbon they incorporate

  6. Insights into archaeal evolution and symbiosis from the genomes of a nanoarchaeon and its inferred crenarchaeal host from Obsidian Pool, Yellowstone National Park

    PubMed Central

    2013-01-01

    Background A single cultured marine organism, Nanoarchaeum equitans, represents the Nanoarchaeota branch of symbiotic Archaea, with a highly reduced genome and unusual features such as multiple split genes. Results The first terrestrial hyperthermophilic member of the Nanoarchaeota was collected from Obsidian Pool, a thermal feature in Yellowstone National Park, separated by single cell isolation, and sequenced together with its putative host, a Sulfolobales archaeon. Both the new Nanoarchaeota (Nst1) and N. equitans lack most biosynthetic capabilities, and phylogenetic analysis of ribosomal RNA and protein sequences indicates that the two form a deep-branching archaeal lineage. However, the Nst1 genome is more than 20% larger, and encodes a complete gluconeogenesis pathway as well as the full complement of archaeal flagellum proteins. With a larger genome, a smaller repertoire of split protein encoding genes and no split non-contiguous tRNAs, Nst1 appears to have experienced less severe genome reduction than N. equitans. These findings imply that, rather than representing ancestral characters, the extremely compact genomes and multiple split genes of Nanoarchaeota are derived characters associated with their symbiotic or parasitic lifestyle. The inferred host of Nst1 is potentially autotrophic, with a streamlined genome and simplified central and energetic metabolism as compared to other Sulfolobales. Conclusions Comparison of the N. equitans and Nst1 genomes suggests that the marine and terrestrial lineages of Nanoarchaeota share a common ancestor that was already a symbiont of another archaeon. The two distinct Nanoarchaeota-host genomic data sets offer novel insights into the evolution of archaeal symbiosis and parasitism, enabling further studies of the cellular and molecular mechanisms of these relationships. Reviewers This article was reviewed by Patrick Forterre, Bettina Siebers (nominated by Michael Galperin) and Purification Lopez-Garcia PMID:23607440

  7. Detection of bacterioplankton in immersed cadavers using selective agar plates.

    PubMed

    Kakizaki, Eiji; Kozawa, Shuji; Tashiro, Noriko; Sakai, Masahiro; Yukawa, Nobuhiro

    2009-04-01

    We measured bacterioplankton in blood from cadavers retrieved from the sea (n=12), near estuaries (n=4), rivers (fresh water, n=8) and from bathtubs (n=4) as well as from non-drowned victims (n=10) discovered near aquatic environments. Blood from 11 victims drowned in seawater developed bioluminescent and/or blue colonies (oxidase test positive) on selective media containing 2-4% NaCl. Homology analyses of the 16S rRNA gene showed that all of them were marine bacteria (genera: Photobacterium, Vibrio, Shewanella, Psychrobacter). Blood from all victims drowned in rivers generated blue colonies on plates containing 3%, but not 4% NaCl. Homology analyses showed that the blue colonies were generated from bacteria that inhabit fresh water (Aeromonas). None of the blood samples from victims that drowned in bathtubs generated bioluminescent and blue colonies. However, all cadavers contained bacteria that produced unstained colonies (Staphylococcus, Bacillus, Enterobacter, Escherichia, etc.). Among non-drowned victims, blood from two gave rise to blue colonies on plates containing < or =3% NaCl (Pseudomonas). Of the cadavers found near estuaries, bioluminescent and blue colonies developed from two of them on media containing 2-4% NaCl (Photobacterium, Vibrio, Listonella), but not from two others on plates containing 4% NaCl (at < or =3%; blue colonies, Aeromonas; unstained colonies, Citrobacter, Vagococcus, Proteus, Enterobacter). These results suggested that the presence of numerous bacterioplankton in immersed cadavers could support a conclusion of death by drowning.

  8. Bacterioplankton community variation across river to ocean environmental gradients.

    PubMed

    Fortunato, Caroline S; Crump, Byron C

    2011-08-01

    Coastal zones encompass a complex spectrum of environmental gradients that each impact the composition of bacterioplankton communities. Few studies have attempted to address these gradients comprehensively. We generated a synoptic, 16S rRNA gene-based bacterioplankton community profile of a coastal zone by applying the fingerprinting technique denaturing gradient gel electrophoresis to water samples collected from the Columbia River, estuary, and plume, and along coastal transects covering 360 km of the Oregon and Washington coasts and extending to the deep ocean (>2,000 m). Communities were found to cluster into five distinct groups based on location in the system (ANOSIM, p < 0.003): estuary, plume, epipelagic, shelf bottom (depth < 150 m), and slope bottom (depth > 650 m). Across all environments, abiotic factors (salinity, temperature, depth) explained most of the community variability (ρ = 0.734). But within each coastal environment, biotic factors explained most of the variability. Thus, structuring physical factors in coastal zones, such as salinity and temperature, define the boundaries of many distinct microbial habitats, but within these habitats variability in microbial communities is explained by biological gradients in primary and secondary productivity.

  9. Distribution of biomass of heterotrophic bacterioplankton in the Bohai Sea

    NASA Astrophysics Data System (ADS)

    Bai, Jie; Li, Kuiran; Zhang, Jing; Li, Zhengyan; Gao, Huiwang; Zhang, Haofei

    2005-12-01

    Distribution, variation and impact factors of biomass of bacterioplankton from April to May 1999 in Bohai Sea were studied in DAPI method with epifluorescence microscopy. The biomass in surface waters showed a small day-night variation, varying from 0.13 to 2.51 μg/dm3 with an average of 0.84 μg/dm3. The biomass in bottom waters showed, however, a large variation, changing from 0.15 to 4.18 μg/dm3 with an average of 1.36 μg/dm3. The peak values occurred at 5 and 11 a.m. The bottom water biomass showed a significant correlation with particulate organic carbon ( r=0.639, P<0.05). Heterotrophic bacterioplankton biomass was high in nearshore waters and low in offshore areas with a high biomass zone around Huanghe (Yellow) River mouth, showing the same distribution of nutrients. In vertical distribution, heterotrophic bacteria biomass in bottom waters was higher than that in surface water.

  10. BACTERIOPLANKTON DYNAMICS IN NORTHERN SAN FRANCISCO BAY: ROLE OF PARTICLE ASSOCIATION AND SEASONAL FRESHWATER FLOW

    EPA Science Inventory

    Bacterioplankton abundance and metabolic characteristics were observed in northern San Francisco Bay, California, during spring and summer 1996 at three sites: Central Bay, Suisun Bay, and the Sacramento River. These sites spanned a salinity gradient from marine to freshwater, an...

  11. Bacterioplankton growth and production at the Louisiana hydrocarbon seeps

    NASA Astrophysics Data System (ADS)

    Larock, P. A.; Hyun, J.-H.; Bennison, B. W.

    1994-06-01

    The growth rate and potential production of bacterioplankton in cold hydrocarbon seeps located along the Louisiana coast were determined using a pulse-labeling technique. Surprisingly, community doubling times are on the order of 1.1 h, which compares to laboratory-grown cultures. We also found that there are differences in growth rates on relatively small geographic scales, suggesting the influence of site-specific geological features (e.g., gas hydrate mounds). Proceeding downslope to deeper waters, methane-oxidizing bacteria appeared to play a more significant role in community productivity. These preliminary experiments indicated, quite unexpectedly, that water column microbes are growing at a more rapid rate than in any other marine system so far studied and that methane may serve as a primary nutrient (carbon) source in these seep-associated microbial assemblages.

  12. Functional characterization of somatic mutations in cancer using network-based inference of protein activity | Office of Cancer Genomics

    Cancer.gov

    Identifying the multiple dysregulated oncoproteins that contribute to tumorigenesis in a given patient is crucial for developing personalized treatment plans. However, accurate inference of aberrant protein activity in biological samples is still challenging as genetic alterations are only partially predictive and direct measurements of protein activity are generally not feasible.

  13. Arthropod Phylogenetics in Light of Three Novel Millipede (Myriapoda: Diplopoda) Mitochondrial Genomes with Comments on the Appropriateness of Mitochondrial Genome Sequence Data for Inferring Deep Level Relationships

    PubMed Central

    Brewer, Michael S.; Swafford, Lynn; Spruill, Chad L.; Bond, Jason E.

    2013-01-01

    Background Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. Results The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. Conclusions The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. Lack of phylogenetic signal renders the

  14. The new physician as unwitting quantum mechanic: is adapting Dirac's inference system best practice for personalized medicine, genomics, and proteomics?

    PubMed

    Robson, Barry

    2007-08-01

    What is the Best Practice for automated inference in Medical Decision Support for personalized medicine? A known system already exists as Dirac's inference system from quantum mechanics (QM) using bra-kets and bras where A and B are states, events, or measurements representing, say, clinical and biomedical rules. Dirac's system should theoretically be the universal best practice for all inference, though QM is notorious as sometimes leading to bizarre conclusions that appear not to be applicable to the macroscopic world of everyday world human experience and medical practice. It is here argued that this apparent difficulty vanishes if QM is assigned one new multiplication function @, which conserves conditionality appropriately, making QM applicable to classical inference including a quantitative form of the predicate calculus. An alternative interpretation with the same consequences is if every i = radical-1 in Dirac's QM is replaced by h, an entity distinct from 1 and i and arguably a hidden root of 1 such that h2 = 1. With that exception, this paper is thus primarily a review of the application of Dirac's system, by application of linear algebra in the complex domain to help manipulate information about associations and ontology in complicated data. Any combined bra-ket can be shown to be composed only of the sum of QM-like bra and ket weights c(), times an exponential function of Fano's mutual information measure I(A; B) about the association between A and B, that is, an association rule from data mining. With the weights and Fano measure re-expressed as expectations on finite data using Riemann's Incomplete (i.e., Generalized) Zeta Functions, actual counts of observations for real world sparse data can be readily utilized. Finally, the paper compares identical character, distinguishability of states events or measurements, correlation, mutual information, and orthogonal character, important issues in data mining

  15. The new physician as unwitting quantum mechanic: is adapting Dirac's inference system best practice for personalized medicine, genomics, and proteomics?

    PubMed

    Robson, Barry

    2007-08-01

    What is the Best Practice for automated inference in Medical Decision Support for personalized medicine? A known system already exists as Dirac's inference system from quantum mechanics (QM) using bra-kets and bras where A and B are states, events, or measurements representing, say, clinical and biomedical rules. Dirac's system should theoretically be the universal best practice for all inference, though QM is notorious as sometimes leading to bizarre conclusions that appear not to be applicable to the macroscopic world of everyday world human experience and medical practice. It is here argued that this apparent difficulty vanishes if QM is assigned one new multiplication function @, which conserves conditionality appropriately, making QM applicable to classical inference including a quantitative form of the predicate calculus. An alternative interpretation with the same consequences is if every i = radical-1 in Dirac's QM is replaced by h, an entity distinct from 1 and i and arguably a hidden root of 1 such that h2 = 1. With that exception, this paper is thus primarily a review of the application of Dirac's system, by application of linear algebra in the complex domain to help manipulate information about associations and ontology in complicated data. Any combined bra-ket can be shown to be composed only of the sum of QM-like bra and ket weights c(), times an exponential function of Fano's mutual information measure I(A; B) about the association between A and B, that is, an association rule from data mining. With the weights and Fano measure re-expressed as expectations on finite data using Riemann's Incomplete (i.e., Generalized) Zeta Functions, actual counts of observations for real world sparse data can be readily utilized. Finally, the paper compares identical character, distinguishability of states events or measurements, correlation, mutual information, and orthogonal character, important issues in data mining

  16. Phylogeny and genetic history of the Siberian salamander (Salamandrella keyserlingii, Dybowski, 1870) inferred from complete mitochondrial genomes.

    PubMed

    Malyarchuk, Boris; Derenko, Miroslava; Denisova, Galina

    2013-05-01

    We assessed phylogeny of the Siberian salamander (Salamandrella keyserlingii, Dybowski, 1870), the most northern ectothermic, terrestrial vertebrate in Eurasia, by sequence analysis of complete mitochondrial genomes in 26 specimens from different localities (China, Khabarovsk region, Sakhalin, Yakutia, Magadan region, Chukotka, Kamchatka, Ural, European part of Russia). In addition, a complete mitochondrial genome of the Schrenck salamander, Salamandrella schrenckii, was determined for the first time. Bayesian phylogenetic analysis of the entire mtDNA genomes of S. keyserlingii demonstrates that two haplotype clades, AB and C, radiated about 1.4 million years ago (Mya). Bayesian skyline plots of population size change through time show an expansion around 250 thousand years ago (kya) and then a decline around the Last Glacial Maximum (25 kya) with subsequent restoration of population size. Climatic changes during the Quaternary period have dramatically affected the population genetic structure of the Siberian salamanders. In addition, complete mtDNA sequence analysis allowed us to recognize that the vast area of Northern Eurasia was colonized only by the Siberian salamander clade C1b during the last 150 kya. Meanwhile, we were unable to find evidence of molecular adaptation in this clade by analyzing the whole mitochondrial genomes of the Siberian salamanders.

  17. Phylogeny and genetic history of the Siberian salamander (Salamandrella keyserlingii, Dybowski, 1870) inferred from complete mitochondrial genomes.

    PubMed

    Malyarchuk, Boris; Derenko, Miroslava; Denisova, Galina

    2013-05-01

    We assessed phylogeny of the Siberian salamander (Salamandrella keyserlingii, Dybowski, 1870), the most northern ectothermic, terrestrial vertebrate in Eurasia, by sequence analysis of complete mitochondrial genomes in 26 specimens from different localities (China, Khabarovsk region, Sakhalin, Yakutia, Magadan region, Chukotka, Kamchatka, Ural, European part of Russia). In addition, a complete mitochondrial genome of the Schrenck salamander, Salamandrella schrenckii, was determined for the first time. Bayesian phylogenetic analysis of the entire mtDNA genomes of S. keyserlingii demonstrates that two haplotype clades, AB and C, radiated about 1.4 million years ago (Mya). Bayesian skyline plots of population size change through time show an expansion around 250 thousand years ago (kya) and then a decline around the Last Glacial Maximum (25 kya) with subsequent restoration of population size. Climatic changes during the Quaternary period have dramatically affected the population genetic structure of the Siberian salamanders. In addition, complete mtDNA sequence analysis allowed us to recognize that the vast area of Northern Eurasia was colonized only by the Siberian salamander clade C1b during the last 150 kya. Meanwhile, we were unable to find evidence of molecular adaptation in this clade by analyzing the whole mitochondrial genomes of the Siberian salamanders. PMID:23415986

  18. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference

    PubMed Central

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-01-01

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal

  19. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

    PubMed

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-01-01

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal

  20. Marine bacterioplankton biomass, activity and community structure in the vicinity of Antarctic icebergs

    NASA Astrophysics Data System (ADS)

    Murray, Alison E.; Peng, Vivian; Tyler, Charlotte; Wagh, Protima

    2011-06-01

    We studied marine bacterioplankton in the Scotia Sea in June 2008 and in the northwest Weddell Sea in March to mid April 2009 in waters proximal to three free-drifting icebergs (SS-1, A-43k, and C-18a), in a region with a high density of smaller icebergs (iceberg alley), and at stations that were upstream of the iceberg trajectories designated as far-field reference sites that were between 16-75 km away. Hydrographic parameters were used to define water masses in which comparisons between bacterioplankton-associated characteristics (abundance, leucine incorporation into protein, aminopeptidase activities and community structure) within and between water masses could be made. Early winter Scotia Sea bacterioplankton had low levels of cells and low heterotrophic production rates in the upper 50 m. Influences of the icebergs on bacterioplankton at this time of year were minimal, if not deleterious, as we found lower levels of heterotrophic production near A-43k in comparison to stations >16 km away. Additionally, the results point to small but significant differences in cell abundance, heterotrophic production, and community structure between the two icebergs studied. These icebergs differed greatly in size and the findings suggest that the larger iceberg had a greater effect. In the NW Weddell Sea in March-mid April bacterioplankton were twice as abundant and had heterotrophic productions rates that were 8-fold higher than what we determined in the Scotia Sea, though levels were still quite low, which is typical for autumn. We did not detect direct iceberg-related influences on the bacterioplankton characteristics studied here. Clues to understanding bacterioplankton responses may lie in the details of community structure, as there were some significant differences in community structure in the winter water and underlying upper circumpolar deep-water masses between stations occupied close to C-18a and at stations 18 km away (i.e. Polaribacter and Pelagibacter

  1. Coupling Bacterioplankton Populations and Environment to Community Function in Coastal Temperate Waters

    PubMed Central

    Traving, Sachia J.; Bentzon-Tilia, Mikkel; Knudsen-Leerbeck, Helle; Mantikci, Mustafa; Hansen, Jørgen L. S.; Stedmon, Colin A.; Sørensen, Helle; Markager, Stiig; Riemann, Lasse

    2016-01-01

    Bacterioplankton play a key role in marine waters facilitating processes important for carbon cycling. However, the influence of specific bacterial populations and environmental conditions on bacterioplankton community performance remains unclear. The aim of the present study was to identify drivers of bacterioplankton community functions, taking into account the variability in community composition and environmental conditions over seasons, in two contrasting coastal systems. A Least Absolute Shrinkage and Selection Operator (LASSO) analysis of the biological and chemical data obtained from surface waters over a full year indicated that specific bacterial populations were linked to measured functions. Namely, Synechococcus (Cyanobacteria) was strongly correlated with protease activity. Both function and community composition showed seasonal variation. However, the pattern of substrate utilization capacity could not be directly linked to the community dynamics. The overall importance of dissolved organic matter (DOM) parameters in the LASSO models indicate that bacterioplankton respond to the present substrate landscape, with a particular importance of nitrogenous DOM. The identification of common drivers of bacterioplankton community functions in two different systems indicates that the drivers may be of broader relevance in coastal temperate waters. PMID:27729909

  2. Role of bacterioplankton as producers in two high-productivity marine ecosystems

    SciTech Connect

    Krempin, D.W.

    1986-01-01

    This dissertation addresses the quantitative role of bacterioplankton as producers of particulate biomass in two marine planktonic environments. Some of the assumptions involved in using the incorporation of /sup 3/H-thymidine into cold trichloroacetic acid insoluble material to estimate bacterioplankton production may be improved, or eliminated, with use of /sup 3/H-thymidine or /sup 3/P-orthophosphate to label cells, followed by enzymatic hydrolysis with DNAse to measure newly synthesized DNA. Large increases in the rates of primary production and secondary production were observed during seasonal upwelling events examined for the Organization of Persistent Upwelling Structures (OPUS) study. While chlorophyll a concentration also increased bacterioplankton biomass remained relatively constant. Secondary production was roughly 10% of primary production. During the Antarctic Marine Ecosystem Research at the Ice Edge Zone (AMERIEZ) study, chlorophyll a, which varied by 2 orders of magnitude, and bacterioplankton biomass, which changed by less than 1 order of magnitude, were greatest near the ice edge, where maxima in the rates of primary production and secondary production also were observed. The covariability of microbial standing stocks and rate processes in different zones of an upwelling and an ice edge ecosystem was used to support the concepts of ecological succession, zonation and trophic coupling of phytoplankton and bacterioplankton assemblages in the OPUS and AMERIEZ studies.

  3. Thermal discharge-created increasing temperatures alter the bacterioplankton composition and functional redundancy.

    PubMed

    Xiong, Jinbo; Xiong, Shangling; Qian, Peng; Zhang, Demin; Liu, Lian; Fei, Yuejun

    2016-12-01

    Elevated seawater temperature has altered the coupling between coastal primary production and heterotrophic bacterioplankton respiration. This shift, in turn, could influence the feedback of ocean ecosystem to climate warming. However, little is known about how natural bacterioplankton community responds to increasing seawater temperature. To investigate warming effects on the bacterioplankton community, we collected water samples from temperature gradients (ranged from 15.0 to 18.6 °C) created by a thermal flume of a coal power plant. The results showed that increasing temperatures significantly stimulated bacterial abundance, grazing rate, and altered bacterioplankton community compositions (BCCs). The spatial distribution of bacterioplankton community followed a distance similarity decay relationship, with a turnover of 0.005. A variance partitioning analysis showed that temperature directly constrained 2.01 % variation in BCCs, while temperature-induced changes in water geochemical and grazing rate indirectly accounted for 4.03 and 12.8 % of the community variance, respectively. Furthermore, the relative abundances of 24 bacterial families were linearly increased or decreased (P < 0.05 in all cases) with increasing temperatures. Notably, the change pattern for a given bacterial family was in concert with its known functions. In addition, community functional redundancy consistently decreased along the temperature gradient. This study demonstrates that elevated temperature, combined with substrate supply and trophic interactions, dramatically alters BCCs, concomitant with decreases in functional redundancy. The responses of sensitive assemblages are temperature dependent, which could indicate temperature departures. PMID:27620732

  4. High temporal but low spatial heterogeneity of bacterioplankton in the Chesapeake Bay.

    PubMed

    Kan, Jinjun; Suzuki, Marcelino T; Wang, Kui; Evans, Sarah E; Chen, Feng

    2007-11-01

    Compared to freshwater and the open ocean, less is known about bacterioplankton community structure and spatiotemporal dynamics in estuaries, particularly those with long residence times. The Chesapeake Bay is the largest estuary in the United States, but despite its ecological and economic significance, little is known about its microbial community composition. A rapid screening approach, ITS (internal transcribed spacer)-LH (length heterogeneity)-PCR, was used to screen six rRNA operon (16S rRNA-ITS-23S rRNA) clone libraries constructed from bacterioplankton collected in three distinct regions of the Chesapeake Bay over two seasons. The natural length variation of the 16S-23S rRNA gene ITS region, as well as the presence and location of tRNA-alanine coding regions within the ITS, was determined for 576 clones. Clones representing unique ITS-LH-PCR sizes were sequenced and identified. Dramatic shifts in bacterial composition (changes within subgroups or clades) were observed for the Alphaproteobacteria (Roseobacter clade, SAR11), Cyanobacteria (Synechococcus), and Actinobacteria, suggesting strong seasonal variation within these taxonomic groups. Despite large gradients in salinity and phytoplankton parameters, a remarkably homogeneous bacterioplankton community was observed in the bay in each season. Stronger seasonal, rather than spatial, variation of the bacterioplankton population was also supported by denaturing gradient gel electrophoresis and LH-PCR analyses, indicating that environmental parameters with stronger seasonal, rather than regional, dynamics, such as temperature, might determine bacterioplankton community composition in the Chesapeake Bay.

  5. Prevalent genome streamlining and latitudinal divergence of planktonic bacteria in the surface ocean

    PubMed Central

    Swan, Brandon K.; Tupper, Ben; Sczyrba, Alexander; Lauro, Federico M.; Martinez-Garcia, Manuel; González, José M.; Luo, Haiwei; Wright, Jody J.; Landry, Zachary C.; Hanson, Niels W.; Thompson, Brian P.; Poulton, Nicole J.; Schwientek, Patrick; Acinas, Silvia G.; Giovannoni, Stephen J.; Moran, Mary Ann; Hallam, Steven J.; Cavicchioli, Ricardo; Woyke, Tanja; Stepanauskas, Ramunas

    2013-01-01

    Planktonic bacteria dominate surface ocean biomass and influence global biogeochemical processes, but remain poorly characterized owing to difficulties in cultivation. Using large-scale single cell genomics, we obtained insight into the genome content and biogeography of many bacterial lineages inhabiting the surface ocean. We found that, compared with existing cultures, natural bacterioplankton have smaller genomes, fewer gene duplications, and are depleted in guanine and cytosine, noncoding nucleotides, and genes encoding transcription, signal transduction, and noncytoplasmic proteins. These findings provide strong evidence that genome streamlining and oligotrophy are prevalent features among diverse, free-living bacterioplankton, whereas existing laboratory cultures consist primarily of copiotrophs. The apparent ubiquity of metabolic specialization and mixotrophy, as predicted from single cell genomes, also may contribute to the difficulty in bacterioplankton cultivation. Using metagenome fragment recruitment against single cell genomes, we show that the global distribution of surface ocean bacterioplankton correlates with temperature and latitude and is not limited by dispersal at the time scales required for nucleotide substitution to exceed the current operational definition of bacterial species. Single cell genomes with highly similar small subunit rRNA gene sequences exhibited significant genomic and biogeographic variability, highlighting challenges in the interpretation of individual gene surveys and metagenome assemblies in environmental microbiology. Our study demonstrates the utility of single cell genomics for gaining an improved understanding of the composition and dynamics of natural microbial assemblages. PMID:23801761

  6. Polytene Chromosomal Maps of 11 Drosophila Species: The Order of Genomic Scaffolds Inferred From Genetic and Physical Maps

    PubMed Central

    Schaeffer, Stephen W.; Bhutkar, Arjun; McAllister, Bryant F.; Matsuda, Muneo; Matzkin, Luciano M.; O'Grady, Patrick M.; Rohde, Claudia; Valente, Vera L. S.; Aguadé, Montserrat; Anderson, Wyatt W.; Edwards, Kevin; Garcia, Ana C. L.; Goodman, Josh; Hartigan, James; Kataoka, Eiko; Lapoint, Richard T.; Lozovsky, Elena R.; Machado, Carlos A.; Noor, Mohamed A. F.; Papaceit, Montserrat; Reed, Laura K.; Richards, Stephen; Rieger, Tania T.; Russo, Susan M.; Sato, Hajime; Segarra, Carmen; Smith, Douglas R.; Smith, Temple F.; Strelets, Victor; Tobari, Yoshiko N.; Tomimura, Yoshihiko; Wasserman, Marvin; Watts, Thomas; Wilson, Robert; Yoshida, Kiyohito; Markow, Therese A.; Gelbart, William M.; Kaufman, Thomas C.

    2008-01-01

    The sequencing of the 12 genomes of members of the genus Drosophila was taken as an opportunity to reevaluate the genetic and physical maps for 11 of the species, in part to aid in the mapping of assembled scaffolds. Here, we present an overview of the importance of cytogenetic maps to Drosophila biology and to the concepts of chromosomal evolution. Physical and genetic markers were used to anchor the genome assembly scaffolds to the polytene chromosomal maps for each species. In addition, a computational approach was used to anchor smaller scaffolds on the basis of the analysis of syntenic blocks. We present the chromosomal map data from each of the 11 sequenced non-Drosophila melanogaster species as a series of sections. Each section reviews the history of the polytene chromosome maps for each species, presents the new polytene chromosome maps, and anchors the genomic scaffolds to the cytological maps using genetic and physical markers. The mapping data agree with Muller's idea that the majority of Drosophila genes are syntenic. Despite the conservation of genes within homologous chromosome arms across species, the karyotypes of these species have changed through the fusion of chromosomal arms followed by subsequent rearrangement events. PMID:18622037

  7. Energetic differences between bacterioplankton trophic groups and coral reef resistance.

    PubMed

    McDole Somera, Tracey; Bailey, Barbara; Barott, Katie; Grasis, Juris; Hatay, Mark; Hilton, Brett J; Hisakawa, Nao; Nosrat, Bahador; Nulton, James; Silveira, Cynthia B; Sullivan, Chris; Brainard, Russell E; Rohwer, Forest

    2016-04-27

    Coral reefs are among the most productive and diverse marine ecosystems on the Earth. They are also particularly sensitive to changing energetic requirements by different trophic levels. Microbialization specifically refers to the increase in the energetic metabolic demands of microbes relative to macrobes and is significantly correlated with increasing human influence on coral reefs. In this study, metabolic theory of ecology is used to quantify the relative contributions of two broad bacterioplankton groups, autotrophs and heterotrophs, to energy flux on 27 Pacific coral reef ecosystems experiencing human impact to varying degrees. The effective activation energy required for photosynthesis is lower than the average energy of activation for the biochemical reactions of the Krebs cycle, and changes in the proportional abundance of these two groups can greatly affect rates of energy and materials cycling. We show that reef-water communities with a higher proportional abundance of microbial autotrophs expend more metabolic energy per gram of microbial biomass. Increased energy and materials flux through fast energy channels (i.e. water-column associated microbial autotrophs) may dampen the detrimental effects of increased heterotrophic loads (e.g. coral disease) on coral reef systems experiencing anthropogenic disturbance. PMID:27097927

  8. Energetic differences between bacterioplankton trophic groups and coral reef resistance.

    PubMed

    McDole Somera, Tracey; Bailey, Barbara; Barott, Katie; Grasis, Juris; Hatay, Mark; Hilton, Brett J; Hisakawa, Nao; Nosrat, Bahador; Nulton, James; Silveira, Cynthia B; Sullivan, Chris; Brainard, Russell E; Rohwer, Forest

    2016-04-27

    Coral reefs are among the most productive and diverse marine ecosystems on the Earth. They are also particularly sensitive to changing energetic requirements by different trophic levels. Microbialization specifically refers to the increase in the energetic metabolic demands of microbes relative to macrobes and is significantly correlated with increasing human influence on coral reefs. In this study, metabolic theory of ecology is used to quantify the relative contributions of two broad bacterioplankton groups, autotrophs and heterotrophs, to energy flux on 27 Pacific coral reef ecosystems experiencing human impact to varying degrees. The effective activation energy required for photosynthesis is lower than the average energy of activation for the biochemical reactions of the Krebs cycle, and changes in the proportional abundance of these two groups can greatly affect rates of energy and materials cycling. We show that reef-water communities with a higher proportional abundance of microbial autotrophs expend more metabolic energy per gram of microbial biomass. Increased energy and materials flux through fast energy channels (i.e. water-column associated microbial autotrophs) may dampen the detrimental effects of increased heterotrophic loads (e.g. coral disease) on coral reef systems experiencing anthropogenic disturbance.

  9. Effects of large river dam regulation on bacterioplankton community structure.

    PubMed

    Ruiz-González, Clara; Proia, Lorenzo; Ferrera, Isabel; Gasol, Josep M; Sabater, Sergi

    2013-05-01

    Large rivers are commonly regulated by damming, yet the effects of such disruption on prokaryotic communities have seldom been studied. We describe the effects of the three large reservoirs of the Ebro River (NE Iberian Peninsula) on bacterioplankton assemblages by comparing several sites located before and after the impoundments on three occasions. We monitored the abundances of several bacterial phylotypes identified by rRNA gene probing, and those of two functional groups (picocyanobacteria and aerobic anoxygenic phototrophic bacteria-AAPs). Much greater numbers of particles colonized by bacteria were found in upstream waters than downstream sites. Picocyanobacteria were found in negligible numbers at most sites, whereas AAPs constituted up to 14% of total prokaryotes, but there was no clear effect of reservoirs on the spatial dynamics of these two groups. Instead, damming caused a pronounced decline in Betaproteobacteria, Gammaproteobacteria and Bacteroidetes from upstream to downstream sites, whereas Alphaproteobacteria and Actinobacteria significantly increased after the reservoirs. Redundancy analysis revealed that conductivity, temperature and dissolved inorganic nitrogen were the environmental predictors that best explained the observed variability in bacterial community composition. Our data show that impoundments exerted significant impacts on bacterial riverine assemblages and call attention to the unforeseen ecological consequences of river regulation. PMID:23278359

  10. Reproducibility of Vibrionaceae population structure in coastal bacterioplankton

    PubMed Central

    Szabo, Gitta; Preheim, Sarah P; Kauffman, Kathryn M; David, Lawrence A; Shapiro, Jesse; Alm, Eric J; Polz, Martin F

    2013-01-01

    How reproducibly microbial populations assemble in the wild remains poorly understood. Here, we assess evidence for ecological specialization and predictability of fine-scale population structure and habitat association in coastal ocean Vibrionaceae across years. We compare Vibrionaceae lifestyles in the bacterioplankton (combinations of free-living, particle, or zooplankton associations) measured using the same sampling scheme in 2006 and 2009 to assess whether the same groups show the same environmental association year after year. This reveals complex dynamics with populations falling primarily into two categories: (i) nearly equally represented in each of the two samplings and (ii) highly skewed, often to an extent that they appear exclusive to one or the other sampling times. Importantly, populations recovered at the same abundance in both samplings occupied highly similar habitats suggesting predictable and robust environmental association while skewed abundances of some populations may be triggered by shifts in ecological conditions. The latter is supported by difference in the composition of large eukaryotic plankton between years, with samples in 2006 being dominated by copepods, and those in 2009 by diatoms. Overall, the comparison supports highly predictable population-habitat linkage but highlights the fact that complex, and often unmeasured, environmental dynamics in habitat occurrence may have strong effects on population dynamics. PMID:23178668

  11. Adaptive change inferred from genomic population analysis of the ST93 epidemic clone of community-associated methicillin-resistant Staphylococcus aureus.

    PubMed

    Stinear, Timothy P; Holt, Kathryn E; Chua, Kyra; Stepnell, Justin; Tuck, Kellie L; Coombs, Geoffrey; Harrison, Paul Francis; Seemann, Torsten; Howden, Benjamin P

    2014-02-01

    Community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) has emerged as a major public health problem around the world. In Australia, ST93-IV[2B] is the dominant CA-MRSA clone and displays significantly greater virulence than other S. aureus. Here, we have examined the evolution of ST93 via genomic analysis of 12 MSSA and 44 MRSA ST93 isolates, collected from around Australia over a 17-year period. Comparative analysis revealed a core genome of 2.6 Mb, sharing greater than 99.7% nucleotide identity. The accessory genome was 0.45 Mb and comprised additional mobile DNA elements, harboring resistance to erythromycin, trimethoprim, and tetracycline. Phylogenetic inference revealed a molecular clock and suggested that a single clone of methicillin susceptible, Panton-Valentine leukocidin (PVL) positive, ST93 S. aureus likely spread from North Western Australia in the early 1970s, acquiring methicillin resistance at least twice in the mid 1990s. We also explored associations between genotype and important MRSA phenotypes including oxacillin MIC and production of exotoxins (α-hemolysin [Hla], δ-hemolysin [Hld], PSMα3, and PVL). High-level expression of Hla is a signature feature of ST93 and reduced expression in eight isolates was readily explained by mutations in the agr locus. However, subtle but significant decreases in Hld were also noted over time that coincided with decreasing oxacillin resistance and were independent of agr mutations. The evolution of ST93 S. aureus is thus associated with a reduction in both exotoxin expression and oxacillin MIC, suggesting MRSA ST93 isolates are under pressure for adaptive change. PMID:24482534

  12. Adaptive Change Inferred from Genomic Population Analysis of the ST93 Epidemic Clone of Community-Associated Methicillin-Resistant Staphylococcus aureus

    PubMed Central

    Stinear, Timothy P.; Holt, Kathryn E.; Chua, Kyra; Stepnell, Justin; Tuck, Kellie L.; Coombs, Geoffrey; Harrison, Paul Francis; Seemann, Torsten; Howden, Benjamin P.

    2014-01-01

    Community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) has emerged as a major public health problem around the world. In Australia, ST93-IV[2B] is the dominant CA-MRSA clone and displays significantly greater virulence than other S. aureus. Here, we have examined the evolution of ST93 via genomic analysis of 12 MSSA and 44 MRSA ST93 isolates, collected from around Australia over a 17-year period. Comparative analysis revealed a core genome of 2.6 Mb, sharing greater than 99.7% nucleotide identity. The accessory genome was 0.45 Mb and comprised additional mobile DNA elements, harboring resistance to erythromycin, trimethoprim, and tetracycline. Phylogenetic inference revealed a molecular clock and suggested that a single clone of methicillin susceptible, Panton-Valentine leukocidin (PVL) positive, ST93 S. aureus likely spread from North Western Australia in the early 1970s, acquiring methicillin resistance at least twice in the mid 1990s. We also explored associations between genotype and important MRSA phenotypes including oxacillin MIC and production of exotoxins (α-hemolysin [Hla], δ-hemolysin [Hld], PSMα3, and PVL). High-level expression of Hla is a signature feature of ST93 and reduced expression in eight isolates was readily explained by mutations in the agr locus. However, subtle but significant decreases in Hld were also noted over time that coincided with decreasing oxacillin resistance and were independent of agr mutations. The evolution of ST93 S. aureus is thus associated with a reduction in both exotoxin expression and oxacillin MIC, suggesting MRSA ST93 isolates are under pressure for adaptive change. PMID:24482534

  13. Structure, expression profile and phylogenetic inference of chalcone isomerase-like genes from the narrow-leafed lupin (Lupinus angustifolius L.) genome

    PubMed Central

    Przysiecka, Łucja; Książkiewicz, Michał; Wolko, Bogdan; Naganowska, Barbara

    2015-01-01

    Lupins, like other legumes, have a unique biosynthesis scheme of 5-deoxy-type flavonoids and isoflavonoids. A key enzyme in this pathway is chalcone isomerase (CHI), a member of CHI-fold protein family, encompassing subfamilies of CHI1, CHI2, CHI-like (CHIL), and fatty acid-binding (FAP) proteins. Here, two Lupinus angustifolius (narrow-leafed lupin) CHILs, LangCHIL1 and LangCHIL2, were identified and characterized using DNA fingerprinting, cytogenetic and linkage mapping, sequencing and expression profiling. Clones carrying CHIL sequences were assembled into two contigs. Full gene sequences were obtained from these contigs, and mapped in two L. angustifolius linkage groups by gene-specific markers. Bacterial artificial chromosome fluorescence in situ hybridization approach confirmed the localization of two LangCHIL genes in distinct chromosomes. The expression profiles of both LangCHIL isoforms were very similar. The highest level of transcription was in the roots of the third week of plant growth; thereafter, expression declined. The expression of both LangCHIL genes in leaves and stems was similar and low. Comparative mapping to reference legume genome sequences revealed strong syntenic links; however, LangCHIL2 contig had a much more conserved structure than LangCHIL1. LangCHIL2 is assumed to be an ancestor gene, whereas LangCHIL1 probably appeared as a result of duplication. As both copies are transcriptionally active, questions arise concerning their hypothetical functional divergence. Screening of the narrow-leafed lupin genome and transcriptome with CHI-fold protein sequences, followed by Bayesian inference of phylogeny and cross-genera synteny survey, identified representatives of all but one (CHI1) main subfamilies. They are as follows: two copies of CHI2, FAPa2 and CHIL, and single copies of FAPb and FAPa1. Duplicated genes are remnants of whole genome duplication which is assumed to have occurred after the divergence of Lupinus, Arachis, and Glycine

  14. Adaptive change inferred from genomic population analysis of the ST93 epidemic clone of community-associated methicillin-resistant Staphylococcus aureus.

    PubMed

    Stinear, Timothy P; Holt, Kathryn E; Chua, Kyra; Stepnell, Justin; Tuck, Kellie L; Coombs, Geoffrey; Harrison, Paul Francis; Seemann, Torsten; Howden, Benjamin P

    2014-02-01

    Community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) has emerged as a major public health problem around the world. In Australia, ST93-IV[2B] is the dominant CA-MRSA clone and displays significantly greater virulence than other S. aureus. Here, we have examined the evolution of ST93 via genomic analysis of 12 MSSA and 44 MRSA ST93 isolates, collected from around Australia over a 17-year period. Comparative analysis revealed a core genome of 2.6 Mb, sharing greater than 99.7% nucleotide identity. The accessory genome was 0.45 Mb and comprised additional mobile DNA elements, harboring resistance to erythromycin, trimethoprim, and tetracycline. Phylogenetic inference revealed a molecular clock and suggested that a single clone of methicillin susceptible, Panton-Valentine leukocidin (PVL) positive, ST93 S. aureus likely spread from North Western Australia in the early 1970s, acquiring methicillin resistance at least twice in the mid 1990s. We also explored associations between genotype and important MRSA phenotypes including oxacillin MIC and production of exotoxins (α-hemolysin [Hla], δ-hemolysin [Hld], PSMα3, and PVL). High-level expression of Hla is a signature feature of ST93 and reduced expression in eight isolates was readily explained by mutations in the agr locus. However, subtle but significant decreases in Hld were also noted over time that coincided with decreasing oxacillin resistance and were independent of agr mutations. The evolution of ST93 S. aureus is thus associated with a reduction in both exotoxin expression and oxacillin MIC, suggesting MRSA ST93 isolates are under pressure for adaptive change.

  15. Complete genome and molecular epidemiological data infer the maintenance of rabies among kudu (Tragelaphus strepsiceros) in Namibia.

    PubMed

    Scott, Terence P; Fischer, Melina; Khaiseb, Siegfried; Freuling, Conrad; Höper, Dirk; Hoffmann, Bernd; Markotter, Wanda; Müller, Thomas; Nel, Louis H

    2013-01-01

    Rabies in kudu is unique to Namibia and two major peaks in the epizootic have occurred since it was first noted in 1977. Due to the large numbers of kudu that were affected, it was suspected that horizontal transmission of rabies occurs among kudu and that rabies was being maintained independently within the Namibian kudu population - separate from canid cycles, despite geographic overlap. In this study, it was our aim to show, through phylogenetic analyses, that rabies was being maintained independently within the Namibian kudu population. We also tested, through complete genome sequencing of four rabies virus isolates from jackal and kudu, whether specific mutations occurred in the virus genome due to host adaptation. We found the separate grouping of all rabies isolates from kudu to those of any other canid species in Namibia, suggesting that rabies was being maintained independently in kudu. Additionally, we noted several mutations unique to isolates from kudu, suggesting that these mutations may be due to the adaptation of rabies to a new host. In conclusion, we show clear evidence that rabies is being maintained independently in the Namibian kudu population - a unique phenomenon with ecological and economic impacts. PMID:23527015

  16. Complete mitochondrial genome of Otis tarda (Gruiformes: Otididae) and phylogeny of Gruiformes inferred from mitochondrial DNA sequences.

    PubMed

    Yang, Rong; Wu, Xiaobing; Yan, Peng; Su, Xia; Yang, Banghe

    2010-10-01

    The complete nucleotide sequence of mitochondrial genome of the Great bustard (Otis tarda) was determined by using polymerase chain reaction (PCR) method. The genome is 16,849 bp in size, containing 13 protein-coding, 2 ribosomal and 22 transfer RNA genes. Sequences of the tRNA genes can be folded into canonical cloverleaf secondary structure except for tRNA-Cys and tRNA-Ser (AGY), which lose "DHU" arm. Sequence analysis showed that the O. tarda mitochondrial control region (mtCR) contained many elements in common with other avian mtCRs. A microsatellite repeat was found in the 3'-peripheral domain of the O. tarda mtCR. Based on the mitochondrial DNA sequences of 12S rRNA, 16S rRNA and tRNA-Val, a phylogenetic study of Gruiformes was performed. The result showed that Otididae was a sister group to "core Gruiformes" and Charadriiformes with strong support (97% posterior probability values) in Bayesian analysis. The taxonomic status of Rhynochetidae, Mesitornithidae, Pedionomidae and Turnicidae that traditionally belonged to Gruiformes was also discussed in this paper. PMID:19823949

  17. Complete genome and molecular epidemiological data infer the maintenance of rabies among kudu (Tragelaphus strepsiceros) in Namibia.

    PubMed

    Scott, Terence P; Fischer, Melina; Khaiseb, Siegfried; Freuling, Conrad; Höper, Dirk; Hoffmann, Bernd; Markotter, Wanda; Müller, Thomas; Nel, Louis H

    2013-01-01

    Rabies in kudu is unique to Namibia and two major peaks in the epizootic have occurred since it was first noted in 1977. Due to the large numbers of kudu that were affected, it was suspected that horizontal transmission of rabies occurs among kudu and that rabies was being maintained independently within the Namibian kudu population - separate from canid cycles, despite geographic overlap. In this study, it was our aim to show, through phylogenetic analyses, that rabies was being maintained independently within the Namibian kudu population. We also tested, through complete genome sequencing of four rabies virus isolates from jackal and kudu, whether specific mutations occurred in the virus genome due to host adaptation. We found the separate grouping of all rabies isolates from kudu to those of any other canid species in Namibia, suggesting that rabies was being maintained independently in kudu. Additionally, we noted several mutations unique to isolates from kudu, suggesting that these mutations may be due to the adaptation of rabies to a new host. In conclusion, we show clear evidence that rabies is being maintained independently in the Namibian kudu population - a unique phenomenon with ecological and economic impacts.

  18. Molecular phylogeny of the nettle family (Urticaceae) inferred from multiple loci of three genomes and extensive generic sampling.

    PubMed

    Wu, Zeng-Yuan; Monro, Alex K; Milne, Richard I; Wang, Hong; Yi, Ting-Shuang; Liu, Jie; Li, De-Zhu

    2013-12-01

    Urticaceae is one of the larger Angiosperm families, but relationships within it remain poorly known. This study presents the first densely sampled molecular phylogeny of Urticaceae, using maximum likelihood (ML), maximum parsimony (MP) and Bayesian inference (BI) to analyze the DNA sequence data from two nuclear (ITS and 18S), four chloroplast (matK, rbcL, rpll4-rps8-infA-rpl36, trnL-trnF) and one mitochondrial (matR) loci. We sampled 169 accessions representing 122 species, representing 47 of the 54 recognized genera within Urticaceae, including four of the six sometimes separated as Cecropiaceae. Major results included: (1) Urticaceae including Cecropiaceae was monophyletic; (2) Cecropiaceae was biphyletic, with both lineages nested within Urticaceae; (3) Urticaceae can be divided into four well-supported clades; (4) previously erected tribes or subfamilies were broadly supported, with some additions and alterations; (5) the monophyly of many genera was supported, whereas Boehmeria, Pellionia, Pouzolzia and Urera were clearly polyphyletic, while Urtica and Pilea each had a small genus nested within them; (6) relationships between genera were clarified, mostly with substantial support. These results clarify that some morphological characters have been overstated and others understated in previous classifications of the family, and provide a strong foundation for future studies on biogeography, character evolution, and circumscription of difficult genera.

  19. Inferring genome-wide functional modulatory network: a case study on NF-κB/RelA transcription factor.

    PubMed

    Li, Xueling; Zhu, Min; Brasier, Allan R; Kudlicki, Andrzej S

    2015-04-01

    How different pathways lead to the activation of a specific transcription factor (TF) with specific effects is not fully understood. We model context-specific transcriptional regulation as a modulatory network: triplets composed of a TF, target gene, and modulator. Modulators usually affect the activity of a specific TF at the posttranscriptional level in a target gene-specific action mode. This action may be classified as enhancement, attenuation, or inversion of either activation or inhibition. As a case study, we inferred, from a large collection of expression profiles, all potential modulations of NF-κB/RelA. The predicted modulators include many proteins previously not reported as physically binding to RelA but with relevant functions, such as RNA processing, cell cycle, mitochondrion, ubiquitin-dependent proteolysis, and chromatin modification. Modulators from different processes exert specific prevalent action modes on distinct pathways. Modulators from noncoding RNA, RNA-binding proteins, TFs, and kinases modulate the NF-κB/RelA activity with specific action modes consistent with their molecular functions and modulation level. The modulatory networks of NF-κB/RelA in the context epithelial-mesenchymal transition (EMT) and burn injury have different modulators, including those involved in extracellular matrix (FBN1), cytoskeletal regulation (ACTN1), and metastasis-associated lung adenocarcinoma transcript 1 (MALAT1), a long intergenic nonprotein coding RNA, and tumor suppression (FOXP1) for EMT, and TXNIP, GAPDH, PKM2, IFIT5, LDHA, NID1, and TPP1 for burn injury.

  20. The phylogenetic position of the giant devil ray Mobula mobular (Bonnaterre, 1788) (Myliobatiformes, Myliobatidae) inferred from the mitochondrial genome.

    PubMed

    Bustamante, Carlos; Barría, Claudio; Vargas-Caro, Carolina; Ovenden, Jennifer R; Bennett, Michael B

    2016-09-01

    The giant devil ray, Mobula mobular, is a member of one of the most distinct groups of cartilaginous fishes, the Mobulidae (manta and devil rays), and is the only mobulid assessed as Endangered due its restricted distribution, high bycatch mortality and suspected population decline. The complete mitochondrial genome is 18 913 base pairs in length and comprises 2 rRNAs, 13 protein-coding genes, 22 tRNAs and 2 non-coding regions. Comparison with the partial mitogenome of M. japanica suggests a sister-cryptic species complex and two different taxonomic units. However, the limited divergence within the species (>99.9% genetic identity) may be the result of a geographically and numerically restricted population of M. mobular within the Mediterranean Sea.

  1. [Genetic Differentiation among Natural Populations of the Lizard Complex Darevskia raddei as Inferred from Genome Microsatellite Marking].

    PubMed

    Omelchenko, A V; Girnykh, A E; Osipov, F A; Vergub, A A; Petrosyan, V G; Danielyan, F D; Arakelyan, M S; Ryskov, A P

    2016-02-01

    The article presents the genetic parameters of the populations of lizards of the Darevskia raddei complex (D. raddei nairensis and D. raddei raddei) and the populations of D. valentini calculated on the basis of the analysis of variability of 50 allelic variants of the three nuclear genome microsatellite-containing loci of 83 individuals. It was demonstrated that the F(st) genetic distances between the populations of D. raddei nairensis and D. raddei raddei were not statistically significantly different from the F(st) genetic distances between the populations of different species, D. raddei and D. valentini. At the same time, these distances were statistically significantly higher than the F(st) distances between the populations belonging to one species within the genus Darevskia. These data suggest deep divergence between the populations of D. raddei raddei and D. raddei nairensis of the D. raddei complex and there arises the question on considering them as separate species.

  2. pH influences the importance of niche-related and neutral processes in lacustrine bacterioplankton assembly.

    PubMed

    Ren, Lijuan; Jeppesen, Erik; He, Dan; Wang, Jianjun; Liboriussen, Lone; Xing, Peng; Wu, Qinglong L

    2015-05-01

    pH is an important factor that shapes the structure of bacterial communities. However, we have very limited information about the patterns and processes by which overall bacterioplankton communities assemble across wide pH gradients in natural freshwater lakes. Here, we used pyrosequencing to analyze the bacterioplankton communities in 25 discrete freshwater lakes in Denmark with pH levels ranging from 3.8 to 8.8. We found that pH was the key factor impacting lacustrine bacterioplankton community assembly. More acidic lakes imposed stronger environmental filtering, which decreased the richness and evenness of bacterioplankton operational taxonomic units (OTUs) and largely shifted community composition. Although environmental filtering was determined to be the most important determinant of bacterioplankton community assembly, the importance of neutral assembly processes must also be considered, notably in acidic lakes, where the species (OTU) diversity was low. We observed that the strong effect of environmental filtering in more acidic lakes was weakened by the enhanced relative importance of neutral community assembly, and bacterioplankton communities tended to be less phylogenetically clustered in more acidic lakes. In summary, we propose that pH is a major environmental determinant in freshwater lakes, regulating the relative importance and interplay between niche-related and neutral processes and shaping the patterns of freshwater lake bacterioplankton biodiversity. PMID:25724952

  3. pH influences the importance of niche-related and neutral processes in lacustrine bacterioplankton assembly.

    PubMed

    Ren, Lijuan; Jeppesen, Erik; He, Dan; Wang, Jianjun; Liboriussen, Lone; Xing, Peng; Wu, Qinglong L

    2015-05-01

    pH is an important factor that shapes the structure of bacterial communities. However, we have very limited information about the patterns and processes by which overall bacterioplankton communities assemble across wide pH gradients in natural freshwater lakes. Here, we used pyrosequencing to analyze the bacterioplankton communities in 25 discrete freshwater lakes in Denmark with pH levels ranging from 3.8 to 8.8. We found that pH was the key factor impacting lacustrine bacterioplankton community assembly. More acidic lakes imposed stronger environmental filtering, which decreased the richness and evenness of bacterioplankton operational taxonomic units (OTUs) and largely shifted community composition. Although environmental filtering was determined to be the most important determinant of bacterioplankton community assembly, the importance of neutral assembly processes must also be considered, notably in acidic lakes, where the species (OTU) diversity was low. We observed that the strong effect of environmental filtering in more acidic lakes was weakened by the enhanced relative importance of neutral community assembly, and bacterioplankton communities tended to be less phylogenetically clustered in more acidic lakes. In summary, we propose that pH is a major environmental determinant in freshwater lakes, regulating the relative importance and interplay between niche-related and neutral processes and shaping the patterns of freshwater lake bacterioplankton biodiversity.

  4. Phylogenetic inference and SSR characterization of tropical woody bamboos tribe Bambuseae (Poaceae: Bambusoideae) based on complete plastid genome sequences.

    PubMed

    Vieira, Leila do Nascimento; Dos Anjos, Karina Goulart; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Greco, Thiago Machado; Pedrosa, Fábio de Oliveira; de Souza, Emanuel Maltempi; Rogalski, Marcelo; de Souza, Robson Francisco; Guerra, Miguel Pedro

    2016-05-01

    The complete plastome sequencing is an efficient option for increasing phylogenetic resolution and evolutionary studies, as well as may greatly facilitate the use of plastid DNA markers in plant population genetic studies. Merostachys and Guadua stand out as the most common and the highest potential utilization bamboos indigenous of Brazil. Here, we sequenced the complete plastome sequences of the Brazilian Guadua chacoensis and Merostachys sp. to perform full plastome phylogeny and characterize the occurrence, type, and distribution of SRRs using 20 Bambuseae species. The determined plastome sequence of Merostachys sp. and G. chacoensis is 136,334 and 135,403 bp in size, respectively, with an identical gene content and typical quadripartite structure consisting of a pair of IRs separated by the LSC and SSC regions. The Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of Paleotropical and Neotropical Bamboos clades. The Neotropical bamboos segregated into three well-supported lineages, Chusqueinae, Guaduinae, and Arthrostylidiinae, with the last two forming a well-supported sister relationship. Paleotropical bamboos segregated into two well-supported lineages, Hickeliinae and Bambusinae + Melocanninae. We identified 141.8 cpSSR in Bambuseae plastomes and an inferior value (38.15) for plastome coding sequences. Among them, we identified 16 polymorphic SSR loci, with number of alleles varying from 3 to 10. These 16 polymorphic cpSSR loci in Bambuseae plastome can be assessed for the intraspecific level of polymorphism, leading to innovative highly sensitive phylogeographic and population genetics studies for this tribe.

  5. Genomic Alteration in Head and Neck Squamous Cell Carcinoma (HNSCC) Cell Lines Inferred from Karyotyping, Molecular Cytogenetics, and Array Comparative Genomic Hybridization.

    PubMed

    Singchat, Worapong; Hitakomate, Ekarat; Rerkarmnuaychoke, Budsaba; Suntronpong, Aorarat; Fu, Beiyuan; Bodhisuwan, Winai; Peyachoknagul, Surin; Yang, Fengtang; Koontongkaew, Sittichai; Srikulnath, Kornsorn

    2016-01-01

    Genomic alteration in head and neck squamous cell carcinoma (HNSCC) was studied in two cell line pairs (HN30-HN31 and HN4-HN12) using conventional C-banding, multiplex fluorescence in situ hybridization (M-FISH), and array comparative genomic hybridization (array CGH). HN30 and HN4 were derived from primary lesions in the pharynx and base of tongue, respectively, and HN31 and HN12 were derived from lymph-node metastatic lesions belonging to the same patients. Gain of chromosome 1, 7, and 11 were shared in almost all cell lines. Hierarchical clustering revealed that HN31 was closely related to HN4, which shared eight chromosome alteration cases. Large C-positive heterochromatins were found in the centromeric region of chromosome 9 in HN31 and HN4, which suggests complex structural amplification of the repetitive sequence. Array CGH revealed amplification of 7p22.3p11.2, 8q11.23q12.1, and 14q32.33 in all cell lines involved with tumorigenesis and inflammation genes. The amplification of 2p21 (SIX3), 11p15.5 (H19), and 11q21q22.3 (MAML2, PGR, TRPC6, and MMP family) regions, and deletion of 9p23 (PTPRD) and 16q23.1 (WWOX) regions were identified in HN31 and HN12. Interestingly, partial loss of PTPRD (9p23) and WWOX (16q23.1) genes was identified in HN31 and HN12, and the level of gene expression tended to be the down-regulation of PTPRD, with no detectable expression of the WWOX gene. This suggests that the scarcity of PTPRD and WWOX genes might have played an important role in progression of HNSCC, and could be considered as a target for cancer therapy or a biomarker in molecular pathology.

  6. Genomic Alteration in Head and Neck Squamous Cell Carcinoma (HNSCC) Cell Lines Inferred from Karyotyping, Molecular Cytogenetics, and Array Comparative Genomic Hybridization.

    PubMed

    Singchat, Worapong; Hitakomate, Ekarat; Rerkarmnuaychoke, Budsaba; Suntronpong, Aorarat; Fu, Beiyuan; Bodhisuwan, Winai; Peyachoknagul, Surin; Yang, Fengtang; Koontongkaew, Sittichai; Srikulnath, Kornsorn

    2016-01-01

    Genomic alteration in head and neck squamous cell carcinoma (HNSCC) was studied in two cell line pairs (HN30-HN31 and HN4-HN12) using conventional C-banding, multiplex fluorescence in situ hybridization (M-FISH), and array comparative genomic hybridization (array CGH). HN30 and HN4 were derived from primary lesions in the pharynx and base of tongue, respectively, and HN31 and HN12 were derived from lymph-node metastatic lesions belonging to the same patients. Gain of chromosome 1, 7, and 11 were shared in almost all cell lines. Hierarchical clustering revealed that HN31 was closely related to HN4, which shared eight chromosome alteration cases. Large C-positive heterochromatins were found in the centromeric region of chromosome 9 in HN31 and HN4, which suggests complex structural amplification of the repetitive sequence. Array CGH revealed amplification of 7p22.3p11.2, 8q11.23q12.1, and 14q32.33 in all cell lines involved with tumorigenesis and inflammation genes. The amplification of 2p21 (SIX3), 11p15.5 (H19), and 11q21q22.3 (MAML2, PGR, TRPC6, and MMP family) regions, and deletion of 9p23 (PTPRD) and 16q23.1 (WWOX) regions were identified in HN31 and HN12. Interestingly, partial loss of PTPRD (9p23) and WWOX (16q23.1) genes was identified in HN31 and HN12, and the level of gene expression tended to be the down-regulation of PTPRD, with no detectable expression of the WWOX gene. This suggests that the scarcity of PTPRD and WWOX genes might have played an important role in progression of HNSCC, and could be considered as a target for cancer therapy or a biomarker in molecular pathology. PMID:27501229

  7. Genomic Alteration in Head and Neck Squamous Cell Carcinoma (HNSCC) Cell Lines Inferred from Karyotyping, Molecular Cytogenetics, and Array Comparative Genomic Hybridization

    PubMed Central

    Rerkarmnuaychoke, Budsaba; Suntronpong, Aorarat; Fu, Beiyuan; Bodhisuwan, Winai; Peyachoknagul, Surin; Yang, Fengtang; Koontongkaew, Sittichai; Srikulnath, Kornsorn

    2016-01-01

    Genomic alteration in head and neck squamous cell carcinoma (HNSCC) was studied in two cell line pairs (HN30-HN31 and HN4-HN12) using conventional C-banding, multiplex fluorescence in situ hybridization (M-FISH), and array comparative genomic hybridization (array CGH). HN30 and HN4 were derived from primary lesions in the pharynx and base of tongue, respectively, and HN31 and HN12 were derived from lymph-node metastatic lesions belonging to the same patients. Gain of chromosome 1, 7, and 11 were shared in almost all cell lines. Hierarchical clustering revealed that HN31 was closely related to HN4, which shared eight chromosome alteration cases. Large C-positive heterochromatins were found in the centromeric region of chromosome 9 in HN31 and HN4, which suggests complex structural amplification of the repetitive sequence. Array CGH revealed amplification of 7p22.3p11.2, 8q11.23q12.1, and 14q32.33 in all cell lines involved with tumorigenesis and inflammation genes. The amplification of 2p21 (SIX3), 11p15.5 (H19), and 11q21q22.3 (MAML2, PGR, TRPC6, and MMP family) regions, and deletion of 9p23 (PTPRD) and 16q23.1 (WWOX) regions were identified in HN31 and HN12. Interestingly, partial loss of PTPRD (9p23) and WWOX (16q23.1) genes was identified in HN31 and HN12, and the level of gene expression tended to be the down-regulation of PTPRD, with no detectable expression of the WWOX gene. This suggests that the scarcity of PTPRD and WWOX genes might have played an important role in progression of HNSCC, and could be considered as a target for cancer therapy or a biomarker in molecular pathology. PMID:27501229

  8. Demographic inferences using short-read genomic data in an approximate Bayesian computation framework: in silico evaluation of power, biases and proof of concept in Atlantic walrus.

    PubMed

    Shafer, Aaron B A; Gattepaille, Lucie M; Stewart, Robert E A; Wolf, Jochen B W

    2015-01-01

    Approximate Bayesian computation (ABC) is a powerful tool for model-based inference of demographic histories from large genetic data sets. For most organisms, its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modelled with either a population mutation parameter (θ) or a fixed site (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50 000 loci, a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1000 loci for migration and split time in simple population divergence models. In more complex models, posterior distributions were wide and almost reverted to the uninformative prior even with 50 000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult, and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor-allele-frequency filters - usual practice for GBS data - negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a high Arctic bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach. PMID:25482153

  9. First all-in-one diagnostic tool for DNA intelligence: genome-wide inference of biogeographic ancestry, appearance, relatedness, and sex with the Identitas v1 Forensic Chip.

    PubMed

    Keating, Brendan; Bansal, Aruna T; Walsh, Susan; Millman, Jonathan; Newman, Jonathan; Kidd, Kenneth; Budowle, Bruce; Eisenberg, Arthur; Donfack, Joseph; Gasparini, Paolo; Budimlija, Zoran; Henders, Anjali K; Chandrupatla, Hareesh; Duffy, David L; Gordon, Scott D; Hysi, Pirro; Liu, Fan; Medland, Sarah E; Rubin, Laurence; Martin, Nicholas G; Spector, Timothy D; Kayser, Manfred

    2013-05-01

    When a forensic DNA sample cannot be associated directly with a previously genotyped reference sample by standard short tandem repeat profiling, the investigation required for identifying perpetrators, victims, or missing persons can be both costly and time consuming. Here, we describe the outcome of a collaborative study using the Identitas Version 1 (v1) Forensic Chip, the first commercially available all-in-one tool dedicated to the concept of developing intelligence leads based on DNA. The chip allows parallel interrogation of 201,173 genome-wide autosomal, X-chromosomal, Y-chromosomal, and mitochondrial single nucleotide polymorphisms for inference of biogeographic ancestry, appearance, relatedness, and sex. The first assessment of the chip's performance was carried out on 3,196 blinded DNA samples of varying quantities and qualities, covering a wide range of biogeographic origin and eye/hair coloration as well as variation in relatedness and sex. Overall, 95 % of the samples (N = 3,034) passed quality checks with an overall genotype call rate >90 % on variable numbers of available recorded trait information. Predictions of sex, direct match, and first to third degree relatedness were highly accurate. Chip-based predictions of biparental continental ancestry were on average ~94 % correct (further support provided by separately inferred patrilineal and matrilineal ancestry). Predictions of eye color were 85 % correct for brown and 70 % correct for blue eyes, and predictions of hair color were 72 % for brown, 63 % for blond, 58 % for black, and 48 % for red hair. From the 5 % of samples (N = 162) with <90 % call rate, 56 % yielded correct continental ancestry predictions while 7 % yielded sufficient genotypes to allow hair and eye color prediction. Our results demonstrate that the Identitas v1 Forensic Chip holds great promise for a wide range of applications including criminal investigations, missing person investigations, and for national security

  10. Demographic inferences using short-read genomic data in an approximate Bayesian computation framework: in silico evaluation of power, biases and proof of concept in Atlantic walrus.

    PubMed

    Shafer, Aaron B A; Gattepaille, Lucie M; Stewart, Robert E A; Wolf, Jochen B W

    2015-01-01

    Approximate Bayesian computation (ABC) is a powerful tool for model-based inference of demographic histories from large genetic data sets. For most organisms, its implementation has been hampered by the lack of sufficient genetic data. Genotyping-by-sequencing (GBS) provides cheap genome-scale data to fill this gap, but its potential has not fully been exploited. Here, we explored power, precision and biases of a coalescent-based ABC approach where GBS data were modelled with either a population mutation parameter (θ) or a fixed site (FS) approach, allowing single or several segregating sites per locus. With simulated data ranging from 500 to 50 000 loci, a variety of demographic models could be reliably inferred across a range of timescales and migration scenarios. Posterior estimates were informative with 1000 loci for migration and split time in simple population divergence models. In more complex models, posterior distributions were wide and almost reverted to the uninformative prior even with 50 000 loci. ABC parameter estimates, however, were generally more accurate than an alternative composite-likelihood method. Bottleneck scenarios proved particularly difficult, and only recent bottlenecks without recovery could be reliably detected and dated. Notably, minor-allele-frequency filters - usual practice for GBS data - negatively affected nearly all estimates. With this in mind, we used a combination of FS and θ approaches on empirical GBS data generated from the Atlantic walrus (Odobenus rosmarus rosmarus), collectively providing support for a population split before the last glacial maximum followed by asymmetrical migration and a high Arctic bottleneck. Overall, this study evaluates the potential and limitations of GBS data in an ABC-coalescence framework and proposes a best-practice approach.

  11. Southeast Asian origins of five Hill Tribe populations and correlation of genetic to linguistic relationships inferred with genome-wide SNP data.

    PubMed

    Listman, J B; Malison, R T; Sanichwankul, K; Ittiwut, C; Mutirangura, A; Gelernter, J

    2011-02-01

    In Thailand, the term Hill Tribe is used to describe populations whose members traditionally practice slash and burn agriculture and reside in the mountains. These tribes are thought to have migrated throughout Asia for up to 5,000 years, including migrations through Southern China and/or Southeast Asia. There have been continuous migrations southward from China into Thailand for approximately the past thousand years and the present geographic range of any given tribe straddles multiple political borders. As none of these populations have autochthonous scripts, written histories have until recently, been externally produced. Northern Asian, Tibetan, and Siberian origins of Hill Tribes have been proposed. All purport endogamy and have nonmutually intelligible languages. To test hypotheses regarding the geographic origins of these populations, relatedness and migrations among them and neighboring populations, and whether their genetic relationships correspond with their linguistic relationships, we analyzed 2,445 genome-wide SNP markers in 118 individuals from five Thai Hill Tribe populations (Akha, Hmong, Karen, Lahu, and Lisu), 90 individuals from majority Thai populations, and 826 individuals from Asian and Oceanean HGDP and HapMap populations using a Bayesian clustering method. Considering these results within the context of results ofrecent large-scale studies of Asian geographic genetic variation allows us to infer a shared Southeast Asian origin of these five Hill Tribe populations as well ancestry components that distinguish among them seen in successive levels of clustering. In addition, the inferred level of shared ancestry among the Hill Tribes corresponds well to relationships among their languages. PMID:20979205

  12. Southeast Asian origins of five Hill Tribe populations and correlation of genetic to linguistic relationships inferred with genome-wide SNP data.

    PubMed

    Listman, J B; Malison, R T; Sanichwankul, K; Ittiwut, C; Mutirangura, A; Gelernter, J

    2011-02-01

    In Thailand, the term Hill Tribe is used to describe populations whose members traditionally practice slash and burn agriculture and reside in the mountains. These tribes are thought to have migrated throughout Asia for up to 5,000 years, including migrations through Southern China and/or Southeast Asia. There have been continuous migrations southward from China into Thailand for approximately the past thousand years and the present geographic range of any given tribe straddles multiple political borders. As none of these populations have autochthonous scripts, written histories have until recently, been externally produced. Northern Asian, Tibetan, and Siberian origins of Hill Tribes have been proposed. All purport endogamy and have nonmutually intelligible languages. To test hypotheses regarding the geographic origins of these populations, relatedness and migrations among them and neighboring populations, and whether their genetic relationships correspond with their linguistic relationships, we analyzed 2,445 genome-wide SNP markers in 118 individuals from five Thai Hill Tribe populations (Akha, Hmong, Karen, Lahu, and Lisu), 90 individuals from majority Thai populations, and 826 individuals from Asian and Oceanean HGDP and HapMap populations using a Bayesian clustering method. Considering these results within the context of results ofrecent large-scale studies of Asian geographic genetic variation allows us to infer a shared Southeast Asian origin of these five Hill Tribe populations as well ancestry components that distinguish among them seen in successive levels of clustering. In addition, the inferred level of shared ancestry among the Hill Tribes corresponds well to relationships among their languages.

  13. Phylogenetic composition of bacterioplankton assemblages from the Arctic Ocean.

    PubMed

    Bano, Nasreen; Hollibaugh, James T

    2002-02-01

    We analyzed the phylogenetic composition of bacterioplankton assemblages in 11 Arctic Ocean samples collected over three seasons (winter-spring 1995, summer 1996, and summer-fall 1997) by sequencing cloned fragments of 16S rRNA genes. The sequencing effort was directed by denaturing gradient gel electrophoresis (DGGE) screening of samples and the clone libraries. Sequences of 88 clones fell into seven major lineages of the domain Bacteria: alpha(36%)-, gamma(32%)-, delta(14%)-, and epsilon(1%)-Proteobacteria; Cytophaga-Flexibacter-Bacteroides spp. (9%); Verrucomicrobium spp. (6%); and green nonsulfur bacteria (2%). A total of 34% of the cloned sequences (excluding clones in the SAR11 and Roseobacter groups) had sequence similarities that were <94% compared to previously reported sequences, indicating the presence of novel sequences. DGGE fingerprints of the selected samples showed that most of the bands were common to all samples in all three seasons. However, additional bands representing sequences related to Cytophaga and Polaribacter species were found in samples collected during the summer and fall. Of the clones in a library generated from one sample collected in spring of 1995, 50% were the same and were most closely affiliated (99% similarity) with Alteromonas macleodii, while 50% of the clones in another sample were most closely affiliated (90 to 96% similarity) with Oceanospirillum sp. The majority of the cloned sequences were most closely related to uncultured, environmental sequences. Prominent among these were members of the SAR11 group. Differences between mixed-layer and halocline samples were apparent in DGGE fingerprints and clone libraries. Sequences related to alpha-Proteobacteria (dominated by SAR11) were abundant (52%) in samples from the mixed layer, while sequences related to gamma-proteobacteria were more abundant (44%) in halocline samples. Two bands corresponding to sequences related to SAR307 (common in deep water) and the high-G+C gram

  14. The evolutionary history of Plasmodium vivax as inferred from mitochondrial genomes: parasite genetic diversity in the Americas.

    PubMed

    Taylor, Jesse E; Pacheco, M Andreína; Bacon, David J; Beg, Mohammad A; Machado, Ricardo Luiz; Fairhurst, Rick M; Herrera, Socrates; Kim, Jung-Yeon; Menard, Didier; Póvoa, Marinete Marins; Villegas, Leopoldo; Mulyanto; Snounou, Georges; Cui, Liwang; Zeyrek, Fadile Yildiz; Escalante, Ananias A

    2013-09-01

    Plasmodium vivax is the most prevalent human malaria parasite in the Americas. Previous studies have contrasted the genetic diversity of parasite populations in the Americas with those in Asia and Oceania, concluding that New World populations exhibit low genetic diversity consistent with a recent introduction. Here we used an expanded sample of complete mitochondrial genome sequences to investigate the diversity of P. vivax in the Americas as well as in other continental populations. We show that the diversity of P. vivax in the Americas is comparable to that in Asia and Oceania, and we identify several divergent clades circulating in South America that may have resulted from independent introductions. In particular, we show that several haplotypes sampled in Venezuela and northeastern Brazil belong to a clade that diverged from the other P. vivax lineages at least 30,000 years ago, albeit not necessarily in the Americas. We propose that, unlike in Asia where human migration increases local genetic diversity, the combined effects of the geographical structure and the low incidence of vivax malaria in the Americas has resulted in patterns of low local but high regional genetic diversity. This could explain previous views that P. vivax in the Americas has low genetic diversity because these were based on studies carried out in limited areas. Further elucidation of the complex geographical pattern of P. vivax variation will be important both for diversity assessments of genes encoding candidate vaccine antigens and in the formulation of control and surveillance measures aimed at malaria elimination.

  15. BACTERIOPLANKTON DYNAMICS IN PENSACOLA BAY, FL, USA: ROLE OF PHYTOPLANKTON AND DETRIAL CARBON SOURCES

    EPA Science Inventory

    Bacterioplankton Dynamics in Pensacola Bay, FL, USA: Role of Phytoplankton and Detrital Carbon Sources (Abstract). To be presented at the16th Biennial Conference of the Estuarine Research Foundation, ERF 2001: An Estuarine Odyssey, 4-8 November 2001, St. Pete Beach, FL. 1 p. (ER...

  16. In situ interactions between photosynthetic picoeukaryotes and bacterioplankton in the Atlantic Ocean: evidence for mixotrophy.

    PubMed

    Hartmann, Manuela; Zubkov, Mikhail V; Scanlan, Dave J; Lepère, Cécile

    2013-12-01

    Heterotrophic bacterioplankton, cyanobacteria and phototrophic picoeukaryotes (< 5 μm in size) numerically dominate planktonic oceanic communities. While feeding on bacterioplankton is often attributed to aplastidic protists, recent evidence suggests that phototrophic picoeukaryotes could be important bacterivores. Here, we present direct visual evidence from the surface mixed layer of the Atlantic Ocean that bacterioplankton are internalized by phototrophic picoeukaryotes. In situ interactions of phototrophic picoeukaryotes and bacterioplankton (specifically Prochlorococcus cyanobacteria and the SAR11 clade) were investigated using a combination of flow cytometric cell sorting and dual tyramide signal amplification fluorescence in situ hybridization. Using this method, we observed plastidic Prymnesiophyceae and Chrysophyceae cells containing Prochlorococcus, and to a lesser extent SAR11 cells. These microscopic observations of in situ microbial trophic interactions demonstrate the frequency and likely selectivity of phototrophic picoeukaryote bacterivory in the surface mixed layer of both the North and South Atlantic subtropical gyres and adjacent equatorial region, broadening our views on the ecological role of the smallest oceanic plastidic protists. PMID:24249292

  17. Interactions between hydrology and water chemistry shape bacterioplankton biogeography across boreal freshwater networks.

    PubMed

    Niño-García, Juan Pablo; Ruiz-González, Clara; Del Giorgio, Paul A

    2016-07-01

    Disentangling the mechanisms shaping bacterioplankton communities across freshwater ecosystems requires considering a hydrologic dimension that can influence both dispersal and local sorting, but how the environment and hydrology interact to shape the biogeography of freshwater bacterioplankton over large spatial scales remains unexplored. Using Illumina sequencing of the 16S ribosomal RNA gene, we investigate the large-scale spatial patterns of bacterioplankton across 386 freshwater systems from seven distinct regions in boreal Québec. We show that both hydrology and local water chemistry (mostly pH) interact to shape a sequential structuring of communities from highly diverse assemblages in headwater streams toward larger rivers and lakes dominated by fewer taxa. Increases in water residence time along the hydrologic continuum were accompanied by major losses of bacterial richness and by an increased differentiation of communities driven by local conditions (pH and other related variables). This suggests that hydrology and network position modulate the relative role of environmental sorting and mass effects on community assembly by determining both the time frame for bacterial growth and the composition of the immigrant pool. The apparent low dispersal limitation (that is, the lack of influence of geographic distance on the spatial patterns observed at the taxonomic resolution used) suggests that these boreal bacterioplankton communities derive from a shared bacterial pool that enters the networks through the smallest streams, largely dominated by mass effects, and that is increasingly subjected to local sorting of species during transit along the hydrologic continuum.

  18. INFLUENCE OF LIGHT ON BACTERIOPLANKTON PRODUCTION AND RESPIRATION IN A SUBTROPICAL CORAL REEF

    EPA Science Inventory

    The influence of sunlight on bacterioplankton production (14C-leucine (Leu) and 3H-thymidine (TdR) incorporation; changes in cell abundances) and O2 consumption was investigated in a shallow subtropical coral reef located near Key Largo, Florida. Quartz (light) and opaque (dark) ...

  19. Bacterioplankton: a sink for carbon in a coastal marine plankton community

    SciTech Connect

    Ducklow, H.W.; Purdie, D.A.; Williams, P.J.LeB.; Davis, J.M.

    1986-05-16

    Recent determinations of high production rates (up to 30% of primary production in surface waters) implicate free-living marine bacterioplankton as a link in a microbial loop that supplements phytoplankton as food for herbivores. An enclosed water column of 300 cubic meters was used to test the microbial loop hypothesis by following the fate of carbon-14-labeled bacterioplankton for over 50 days. Only 2% of the label initially fixed from carbon-14-labeled glucose by bacteria was present in larger organisms after 13 days, at which time about 20% of the total label added remained in the particulate fraction. Most of the label appeared to pass directly from particles smaller than 1 micrometer (heterotrophic bacterioplankton and some bacteriovores) to respired labeled carbon dioxide or to regenerated dissolved organic carbon-14. Secondary (and, by implication, primary) production by organisms smaller than 1 micrometer may not be an important food source in marine food chains. Bacterioplankton can be a sink for carbon in planktonic food webs and may serve principally as agents of nutrient regeneration rather than as food.

  20. In situ interactions between photosynthetic picoeukaryotes and bacterioplankton in the Atlantic Ocean: evidence for mixotrophy.

    PubMed

    Hartmann, Manuela; Zubkov, Mikhail V; Scanlan, Dave J; Lepère, Cécile

    2013-12-01

    Heterotrophic bacterioplankton, cyanobacteria and phototrophic picoeukaryotes (< 5 μm in size) numerically dominate planktonic oceanic communities. While feeding on bacterioplankton is often attributed to aplastidic protists, recent evidence suggests that phototrophic picoeukaryotes could be important bacterivores. Here, we present direct visual evidence from the surface mixed layer of the Atlantic Ocean that bacterioplankton are internalized by phototrophic picoeukaryotes. In situ interactions of phototrophic picoeukaryotes and bacterioplankton (specifically Prochlorococcus cyanobacteria and the SAR11 clade) were investigated using a combination of flow cytometric cell sorting and dual tyramide signal amplification fluorescence in situ hybridization. Using this method, we observed plastidic Prymnesiophyceae and Chrysophyceae cells containing Prochlorococcus, and to a lesser extent SAR11 cells. These microscopic observations of in situ microbial trophic interactions demonstrate the frequency and likely selectivity of phototrophic picoeukaryote bacterivory in the surface mixed layer of both the North and South Atlantic subtropical gyres and adjacent equatorial region, broadening our views on the ecological role of the smallest oceanic plastidic protists.

  1. Patterns of marine bacterioplankton biodiversity in the surface waters of the Scotia Arc, Southern Ocean.

    PubMed

    Jamieson, Rachel E; Rogers, Alex D; Billett, David S M; Smale, Dan A; Pearce, David A

    2012-05-01

    Spatial patchiness in marine surface bacterioplankton populations was investigated in the Southern Ocean, where the Antarctic Circumpolar Current meets the islands of the Scotia Arc and is subjected to terrestrial input, upwelling of nutrients and seasonal phytoplankton blooms. Total bacterioplankton population density, group-specific taxonomic distribution and six of eight dominant members of the bacterioplankton community were found to be consistent across 18 nearshore sites at eight locations around the Scotia Arc. Results from seven independent 16S rRNA gene clone libraries (1223 sequences in total) and fluorescent in situ hybridization suggested that microbial assemblages were predominantly homogeneous between Scotia Arc sites, where the Alphaproteobacteria, Gammaproteobacteria and the Cytophaga-Flavobacterium-Bacteroidetes cluster were the dominant bacterial groups. Of the 1223 useable sequences generated, 1087 (89%) shared ≥ 97% similarity with marine microorganisms and 331 (27%) matched published sequences previously detected in permanently cold Arctic and Antarctic marine environments. Taken together, results suggest that the dominant bacterioplankton groups are consistent between locations, but significant differences may be detected across the rare biodiversity. PMID:22273466

  2. Interactions between hydrology and water chemistry shape bacterioplankton biogeography across boreal freshwater networks.

    PubMed

    Niño-García, Juan Pablo; Ruiz-González, Clara; Del Giorgio, Paul A

    2016-07-01

    Disentangling the mechanisms shaping bacterioplankton communities across freshwater ecosystems requires considering a hydrologic dimension that can influence both dispersal and local sorting, but how the environment and hydrology interact to shape the biogeography of freshwater bacterioplankton over large spatial scales remains unexplored. Using Illumina sequencing of the 16S ribosomal RNA gene, we investigate the large-scale spatial patterns of bacterioplankton across 386 freshwater systems from seven distinct regions in boreal Québec. We show that both hydrology and local water chemistry (mostly pH) interact to shape a sequential structuring of communities from highly diverse assemblages in headwater streams toward larger rivers and lakes dominated by fewer taxa. Increases in water residence time along the hydrologic continuum were accompanied by major losses of bacterial richness and by an increased differentiation of communities driven by local conditions (pH and other related variables). This suggests that hydrology and network position modulate the relative role of environmental sorting and mass effects on community assembly by determining both the time frame for bacterial growth and the composition of the immigrant pool. The apparent low dispersal limitation (that is, the lack of influence of geographic distance on the spatial patterns observed at the taxonomic resolution used) suggests that these boreal bacterioplankton communities derive from a shared bacterial pool that enters the networks through the smallest streams, largely dominated by mass effects, and that is increasingly subjected to local sorting of species during transit along the hydrologic continuum. PMID:26849312

  3. Effects of nutrients on specific growth rate of bacterioplankton in oligotrophic lake water cultures

    SciTech Connect

    Coveney, M.F.; Wetzel, R.G. )

    1992-01-01

    The effects of organic and inorganic nutrient additions on the specific growth rates of bacterioplankton in oligotrophic lake water cultures were investigated. Lake water was first passed through 0.8-{mu}m-pore-size filters (prescreening) to remove bacterivores and to minimize confounding effects of algae. Specific growth rates were calculated from changes in both bacterial cell numbers and biovolumes over 36 h. Gross specific growth rates in unmanipulated control samples were estimated through separate measurements of grazing losses by use of penicillin. The addition of mixed organic substrates alone to prescreened water did not significantly increase bacterioplankton specific growth rates. The addition of inorganic phosphorus alone significantly increased one or both specific growth rates in three of four experiments, and one experiment showed a secondary stimulation by organic substrates. The stimulatory effects of phosphorus addition were greatest concurrently with the highest alkaline phosphatase activity in the lake water. Because bacteria have been shown to dominate inorganic phosphorus uptake in other P-deficient systems, the demonstration that phosphorus, rather than organic carbon, can limit bacterioplankton growth suggests direct competition between phytoplankton and bacterioplankton for inorganic phosphorus.

  4. An association network analysis among microeukaryotes and bacterioplankton reveals algal bloom dynamics.

    PubMed

    Tan, Shangjin; Zhou, Jin; Zhu, Xiaoshan; Yu, Shichen; Zhan, Wugen; Wang, Bo; Cai, Zhonghua

    2015-02-01

    Algal blooms are a worldwide phenomenon and the biological interactions that underlie their regulation are only just beginning to be understood. It is established that algal microorganisms associate with many other ubiquitous, oceanic organisms, but the interactions that lead to the dynamics of bloom formation are currently unknown. To address this gap, we used network approaches to investigate the association patterns among microeukaryotes and bacterioplankton in response to a natural Scrippsiella trochoidea bloom. This is the first study to apply network approaches to bloom dynamics. To this end, terminal restriction fragment (T-RF) length polymorphism analysis showed dramatic changes in community compositions of microeukaryotes and bacterioplankton over the blooming period. A variance ratio test revealed significant positive overall associations both within and between microeukaryotic and bacterioplankton communities. An association network generated from significant correlations between T-RFs revealed that S. trochoidea had few connections to other microeukaryotes and bacterioplankton and was placed on the edge. This lack of connectivity allowed for the S. trochoidea sub-network to break off from the overall network. These results allowed us to propose a conceptual model for explaining how changes in microbial associations regulate the dynamics of an algal bloom. In addition, key T-RFs were screened by principal components analysis, correlation coefficients, and network analysis. Dominant T-RFs were then identified through 18S and 16S rRNA gene clone libraries. Results showed that microeukaryotes clustered predominantly with Dinophyceae and Perkinsea while the majority of bacterioplankton identified were Alphaproteobacteria, Gammaproteobacteria, and Bacteroidetes. The ecologi-cal roles of both were discussed in the context of these findings. PMID:26986263

  5. Coral and macroalgal exudates vary in neutral sugar composition and differentially enrich reef bacterioplankton lineages

    PubMed Central

    Nelson, Craig E; Goldberg, Stuart J; Wegley Kelly, Linda; Haas, Andreas F; Smith, Jennifer E; Rohwer, Forest; Carlson, Craig A

    2013-01-01

    Increasing algal cover on tropical reefs worldwide may be maintained through feedbacks whereby algae outcompete coral by altering microbial activity. We hypothesized that algae and coral release compositionally distinct exudates that differentially alter bacterioplankton growth and community structure. We collected exudates from the dominant hermatypic coral holobiont Porites spp. and three dominant macroalgae (one each Ochrophyta, Rhodophyta and Chlorophyta) from reefs of Mo'orea, French Polynesia. We characterized exudates by measuring dissolved organic carbon (DOC) and fractional dissolved combined neutral sugars (DCNSs) and subsequently tracked bacterioplankton responses to each exudate over 48 h, assessing cellular growth, DOC/DCNS utilization and changes in taxonomic composition (via 16S rRNA amplicon pyrosequencing). Fleshy macroalgal exudates were enriched in the DCNS components fucose (Ochrophyta) and galactose (Rhodophyta); coral and calcareous algal exudates were enriched in total DCNS but in the same component proportions as ambient seawater. Rates of bacterioplankton growth and DOC utilization were significantly higher in algal exudate treatments than in coral exudate and control incubations with each community selectively removing different DCNS components. Coral exudates engendered the smallest shift in overall bacterioplankton community structure, maintained high diversity and enriched taxa from Alphaproteobacteria lineages containing cultured representatives with relatively few virulence factors (VFs) (Hyphomonadaceae and Erythrobacteraceae). In contrast, macroalgal exudates selected for less diverse communities heavily enriched in copiotrophic Gammaproteobacteria lineages containing cultured pathogens with increased VFs (Vibrionaceae and Pseudoalteromonadaceae). Our results demonstrate that algal exudates are enriched in DCNS components, foster rapid growth of bacterioplankton and select for bacterial populations with more potential VFs than

  6. Validity of Eucaryote Inhibitors for Assessing Production and Grazing Mortality of Marine Bacterioplankton

    PubMed Central

    Taylor, Gordon T.; Pace, Michael L.

    1987-01-01

    Application of eucaryote inhibitors to the estimation of production and grazing mortality of bacterioplankton was evaluated. Exposure to a range of concentrations of thiram, cycloheximide, and neutral red (0.4 to 210, 36 to 1,777, 4 to 346 μM, respectively) was 98 to 100% effective at inhibiting growth of a chrysomonad in culture. Exposure to colchicine and griseofulvin (50 to 1,000 μM for both) yielded only 24 to 94 and 53 to 79% inhibition, respectively. Exposures to thiram, neutral red, and griseofulvin were 90 to 100% effective at inhibiting growth in culture of a ciliate, Cyclidium sp., and the responses to colchicine and cycloheximide were variable (64 to 100 and 0 to 100% inhibition, respectively). Thiram and neutral red inhibited field populations of nanozooplankton more effectively than cycloheximide and colchicine. Direct effects of eucaryote inhibitors on growing cultures of bacterioplankton varied with parameters measured and duration of exposure. After 3-day exposures, specific growth rates and “instantaneous” heterotrophic potential ([14C]glucose uptake) were not consistently affected, but biosynthetic activity (RNA and DNA syntheses) was depressed. The degree of inhibition of isolates and field populations of phytoplankton depended upon type of inhibitor and phytoplankton species. In field experiments, it was possible to calculate rates of bacterioplankton production and grazing mortality for only 16 of 29 inhibitor experiments and for 4 of 10 size fractionation experiments. Bacterioplankton production and mortality estimates varied greatly with the eucaryote inhibitor used, and those derived from inhibition techniques were substantially different from those derived from fractionation techniques. The poor performances of both techniques are attributed to the following: (i) effects of inhibitors on phytoplankton, (ii) indirect effects of the inhibitors on bacterioplankton, and (iii) insufficient separation of grazers from prey by filtration

  7. Bacterioplankton responses to iron enrichment during the SAGE experiment

    NASA Astrophysics Data System (ADS)

    Kuparinen, J.; Hall, J.; Ellwood, M.; Safi, K.; Peloquin, J.; Katz, D.

    2011-03-01

    We studied the microbial food web in the upper 100 m of the water column in iron-limited sub-Antarctic HNLC waters south-east of New Zealand in the SAGE experiment in 2004, with focus on bacterioplankton. Samples were collected daily from inside and outside the iron enriched patch. Short term enrichment experiments were conducted on board in 4 L polycarbonate bottles with water outside the iron enriched patch to study single and combined effects of micronutrient additions on microbial food web. Low bacterial growth was recorded in the study area with community turnover times of 50 h or more during the study period. Measurements of bacterial standing stocks and production rates in the study show minor responses to the large scale iron enrichment, with increase in rates and stocks after the first enrichment and at the end of the study period after the third iron enrichment when solar radiation increased and wind mixing decreased. The average daily bacterial production rates were 31.5 and 33.7 mgCm -2 d -1 for the OUT and IN stations, respectively; thus overall there was not a significant difference between the control and the iron-enriched patch. In the bottle experiments bacterial thymidine incorporation showed responses to single iron and silicic acid enrichments and a major growth response to the combined iron and sucrose enrichments. Phytoplankton chlorophyll- a showed clear stimulation by single additions of iron and silicic acid and silicic acid enhanced the iron impact. Cobalt additions had no effect on bacteria growth and a negative effect on phytoplankton growth. Low bacterial in situ growth rates and the enrichment experiments suggest that bacteria are co-limited by iron and carbon, and that bacterial iron uptake is dependent on carbon supply by the food web. With the high iron quota (μmol Fe mol C -1) bacteria may scavenge considerable amounts of the excess iron, and thus influence the relative importance of the microbial food web as a carbon sink.

  8. Rapid turnover of dissolved DMS and DMSP by defined bacterioplankton communities in the stratified euphotic zone of the North Sea

    NASA Astrophysics Data System (ADS)

    Zubkov, Mikhail V.; Fuchs, Bernhard M.; Archer, Stephen D.; Kiene, Ronald P.; Amann, Rudolf; Burkill, Peter H.

    Bacterioplankton-driven turnover of the algal osmolyte, dimethylsulphoniopropionate (DMSP), and its degradation product, dimethylsulphide (DMS) the major natural source of atmospheric sulphur, were studied during a Lagrangian SF 6-tracer experiment in the North Sea (60°N, 3°E). The water mass sampled within the euphotic zone was characterised by a surface mixed layer (from 0 m to 13-30 m) and a subsurface layer (from 13-30 m to 45-58 m) separated by a 2°C thermocline spanning 2 m. The fluxes of dissolved DMSP (DMSPd) and DMS were determined using radioactive tracer techniques. Rates of the simultaneous incorporation of 14C-leucine and 3H-thymidine were measured to estimate bacterioplankton production. Flow cytometry was employed to discriminate subpopulations and to determine the numbers and biomass of bacterioplankton by staining for nucleic acids and proteins. Bacterioplankton subpopulations were separated by flow cytometric sorting and their composition determined using 16S ribosomal gene cloning/sequencing and fluorescence in situ hybridisation with designed group-specific oligonucleotide probes. A subpopulation, dominated by bacteria related to Roseobacter-( α-proteobacteria), constituted 26-33% of total bacterioplankton numbers and 45-48% of biomass in both surface and subsurface layers. The other abundant prokaryotes were a group within the SAR86 cluster of γ-proteobacteria and bacteria from the Cytophaga-Flavobacterium—cluster. Bacterial consumption of DMSPd was greater in the subsurface layer (41 nM d -1) than in the surface layer (20 nM d -1). Bacterioplankton tightly controlled the DMSPd pool, particularly in the subsurface layer, with a turnover time of 2 h, whereas the turnover time of DMSPd in the surface layer was 10 h. Consumed DMSP satisfied the majority of sulphur demands of bacterioplankton, even though bacterioplankton assimilated only about 2.5% and 6.0% of consumed DMSPd sulphur in the surface and subsurface layers, respectively

  9. Inferring horizontal gene transfer.

    PubMed

    Ravenhall, Matt; Škunca, Nives; Lassalle, Florent; Dessimoz, Christophe

    2015-05-01

    Horizontal or Lateral Gene Transfer (HGT or LGT) is the transmission of portions of genomic DNA between organisms through a process decoupled from vertical inheritance. In the presence of HGT events, different fragments of the genome are the result of different evolutionary histories. This can therefore complicate the investigations of evolutionary relatedness of lineages and species. Also, as HGT can bring into genomes radically different genotypes from distant lineages, or even new genes bearing new functions, it is a major source of phenotypic innovation and a mechanism of niche adaptation. For example, of particular relevance to human health is the lateral transfer of antibiotic resistance and pathogenicity determinants, leading to the emergence of pathogenic lineages. Computational identification of HGT events relies upon the investigation of sequence composition or evolutionary history of genes. Sequence composition-based ("parametric") methods search for deviations from the genomic average, whereas evolutionary history-based ("phylogenetic") approaches identify genes whose evolutionary history significantly differs from that of the host species. The evaluation and benchmarking of HGT inference methods typically rely upon simulated genomes, for which the true history is known. On real data, different methods tend to infer different HGT events, and as a result it can be difficult to ascertain all but simple and clear-cut HGT events. PMID:26020646

  10. Inferring Horizontal Gene Transfer

    PubMed Central

    Lassalle, Florent; Dessimoz, Christophe

    2015-01-01

    Horizontal or Lateral Gene Transfer (HGT or LGT) is the transmission of portions of genomic DNA between organisms through a process decoupled from vertical inheritance. In the presence of HGT events, different fragments of the genome are the result of different evolutionary histories. This can therefore complicate the investigations of evolutionary relatedness of lineages and species. Also, as HGT can bring into genomes radically different genotypes from distant lineages, or even new genes bearing new functions, it is a major source of phenotypic innovation and a mechanism of niche adaptation. For example, of particular relevance to human health is the lateral transfer of antibiotic resistance and pathogenicity determinants, leading to the emergence of pathogenic lineages [1]. Computational identification of HGT events relies upon the investigation of sequence composition or evolutionary history of genes. Sequence composition-based ("parametric") methods search for deviations from the genomic average, whereas evolutionary history-based ("phylogenetic") approaches identify genes whose evolutionary history significantly differs from that of the host species. The evaluation and benchmarking of HGT inference methods typically rely upon simulated genomes, for which the true history is known. On real data, different methods tend to infer different HGT events, and as a result it can be difficult to ascertain all but simple and clear-cut HGT events. PMID:26020646

  11. Magnitude and regulation of bacterioplankton respiratory quotient across freshwater environmental gradients

    PubMed Central

    Berggren, Martin; Lapierre, Jean-François; del Giorgio, Paul A

    2012-01-01

    Bacterioplankton respiration (BR) may represent the largest single sink of organic carbon in the biosphere and constitutes an important driver of atmospheric carbon dioxide (CO2) emissions from freshwaters. Complete understanding of BR is precluded by the fact that most studies need to assume a respiratory quotient (RQ; mole of CO2 produced per mole of O2 consumed) to calculate rates of BR. Many studies have, without clear support, assumed a fixed RQ around 1. Here we present 72 direct measurements of bacterioplankton RQ that we carried out in epilimnetic samples of 52 freshwater sites in Québec (Canada), using O2 and CO2 optic sensors. The RQs tended to converge around 1.2, but showed large variability (s.d.=0.45) and significant correlations with major gradients of ecosystem-level, substrate-level and bacterial community-level characteristics. Experiments with natural bacterioplankton using different single substrates suggested that RQ is intimately linked to the elemental composition of the respired compounds. RQs were on average low in net autotrophic systems, where bacteria likely were utilizing mainly reduced substrates, whereas we found evidence that the dominance of highly oxidized substrates, for example, organic acids formed by photo-chemical processes, led to high RQ in the more heterotrophic systems. Further, we suggest that BR contributes to a substantially larger share of freshwater CO2 emissions than presently believed based on the assumption that RQ is ∼1. Our study demonstrates that bacterioplankton RQ is not only a practical aspect of BR determination, but also a major ecosystem state variable that provides unique information about aquatic ecosystem functioning. PMID:22094347

  12. Covariance of bacterioplankton composition and environmental variables in a temperate delta system

    USGS Publications Warehouse

    Stepanauskas, R.; Moran, M.A.; Bergamaschi, B.A.; Hollibaugh, J.T.

    2003-01-01

    We examined seasonal and spatial variation in bacterioplankton composition in the Sacramento-San Joaquin River Delta (CA) using terminal restriction fragment length polymorphism (T-RFLP) analysis. Cloned 16S rRNA genes from this system were used for putative identification of taxa dominating the T-RFLP profiles. Both cloning and T-RFLP analysis indicated that Actinobacteria, Verrucomicrobia, Cytophaga-Flavobacterium and Proteobacteria were the most abundant bacterioplankton groups in the Delta. Despite the broad variety of sampled habitats (deep water channels, lakes, marshes, agricultural drains, freshwater and brackish areas), and the spatial and temporal differences in hydrology, temperature and water chemistry among the sampling campaigns, T-RFLP electropherograms from all samples were similar, indicating that the same bacterioplankton phylotypes dominated in the various habitats of the Delta throughout the year. However, principal component analysis (PCA) and partial least-squares regression (PLS) of T-RFLP profiles revealed consistent grouping of samples on a seasonal, but not a spatial, basis. ??-Proteobacteria related to Ralstonia, Actinobacteria related to Microthrix, and ??-Proteobacteria identical to the environmental Clone LD12 had the highest relative abundance in summer/fall T-RFLP profiles and were associated with low river flow, high pH, and a number of optical and chemical characteristics of dissolved organic carbon (DOC) indicative of an increased proportion of phytoplankton-produced organic material as opposed to allochthonous, terrestrially derived organic material. On the other hand, Geobacter-related ??-Proteobacteria showed a relative increase in abundance in T-RFLP analysis during winter/spring, and probably were washed out from watershed soils or sediment. Various phylotypes associated with the same phylogenetic division, based on tentative identification of T-RFLP fragments, exhibited diverse seasonal patterns, suggesting that ecological

  13. Impact of solar radiation on bacterioplankton in Laguna Vilama, a hypersaline Andean lake (4650 m)

    NASA Astrophysics Data System (ADS)

    FaríAs, MaríA. Eugenia; FernáNdez-Zenoff, Verónica; Flores, Regina; OrdóñEz, Omar; EstéVez, Cristina

    2009-06-01

    Laguna Vilama is a hypersaline Lake located at 4660 m altitude in the northwest of Argentina high up in the Andean Puna. The impact of ultraviolet (UV) radiation on bacterioplankton was studied by collecting samples at different times of the day. Molecular analysis (DGGE) showed that the bacterioplankton community is characterized by Gamma-proteobacteria (Halomonas sp., Marinobacter sp.), Alpha-proteobacteria (Roseobacter sp.), HGC (Agrococcus jenensis and an uncultured bacterium), and CFB (uncultured Bacteroidetes). During the day, minor modifications in bacterial diversity such as intensification of Bacteroidetes' signal and an emergence of Gamma-proteobacteria (Marinobacter flavimaris) were observed after solar exposure. DNA damage, measured as an accumulation of Cyclobutane Pyrimidine Dimers (CPDs), in bacterioplankton and naked DNA increased from 100 CPDs MB-1 at 1200 local time (LT) to 300 CPDs MB-1 at 1600 LT, and from 80 CPDs MB-1 at 1200 LT to 640 CPDs MB-1 at 1600 LT, respectively. In addition, pure cultures of Pseudomonas sp. V1 and Brachybacterium sp. V5, two bacteria previously isolated from this environment, were exposed simultaneously with the community, and viability of both strains diminished after solar exposure. No CPD accumulation was observed in either of the exposed cultures, but an increase in mutagenesis was detected in V5. Of both strains only Brachybacterium sp. V5 showed CPD accumulation in naked DNA. These results suggest that the bacterioplankton community is well adapted to this highly solar irradiated environment showing little accumulation of CPDs and few changes in the community composition. They also demonstrate that these microorganisms contain efficient mechanisms against UV damage.

  14. Characterization of lysogens in bacterioplankton assemblages of the southern California borderland.

    PubMed

    Hewson, Ian; Fuhrman, Jed A

    2007-05-01

    Viruses cause significant mortality of marine microorganisms; however, their role in shaping the composition of microbial assemblages has not been fully elucidated. Because viruses may form lysogenic relationships with their hosts, temperate viruses may influence bacterial assemblage structures through direct lysis of hosts when induced by environmental stimuli or by homoimmunity (i.e., immunity to closely related viruses). We investigated the components of bacterioplankton assemblages that bore prophage using the lysogenic induction agent mitomycin C. Seawater was collected at two locations (the San Pedro Ocean Time Series Station and in the Santa Barbara Channel) in the Southern California Borderland and amended with mitomycin C. After 24-h incubation, the community structure of bacterioplankton was compared with unamended controls using automated rRNA intergenic spacer analysis. The addition of mitomycin C to seawater had effects on the community structure of bacterioplankton, stimulating detectable overall diversity and richness of fingerprints and causing the assemblages within incubations to become different to control assemblages. Most negatively impacted operational taxonomic units (OTU) in mitomycin C-amended incubations individually comprised a large fraction of total amplified DNA in initial seawater (5.3-23.3% of amplified DNA fluorescence) fingerprints, and data suggest that these include organisms putatively classified as members of the gamma-Proteobacteria, SAR11 cluster, and Synechococcus groups. The stimulation of assemblage richness by induction of lysogens, and the reduction in the contribution to total DNA of common OTU (and concomitant increase in rare OTU), suggests that temperate phage have the potential to strongly influence the diversity of bacterioplankton assemblages. Because lysogenic OTU may also be resistant to closely related lytic (i.e., free-living) viruses, the impact of lytic virioplankton on assemblages may only be pronounced

  15. Tracking differential incorporation of dissolved organic carbon types among diverse lineages of Sargasso Sea bacterioplankton.

    PubMed

    Nelson, Craig E; Carlson, Craig A

    2012-06-01

    Bacterioplankton are the primary trophic conduit for dissolved organic carbon (DOC) and linking community structure with DOC utilization is central to understanding global carbon cycling. We coupled stable isotope probing (SIP) with 16S rRNA pyrosequencing in dark seawater culture experiments on euphotic and mesopelagic communities from the Sargasso Sea. Parallel cultures were amended with equimolar quantities of four DO(13) C substrates to simultaneously evaluate community utilization and population-specific incorporation. Of the substrates tested - two cyanobacterial products (exudates or lysates from a culture of Synechococcus) and two defined monosaccharides (glucose or gluconic acid) - the cyanobacterial exudates were incorporated by the greatest diversity of oligotrophic bacterioplankton populations in surface waters, including taxa from > 10 major subclades within the Flavobacteria, Actinobacteria, Verrucomicrobia and Proteobacteria (including SAR11). In contrast, the monosaccharide glucose was not incorporated by any taxa belonging to extant oligotrophic oceanic clades. Conversely, proteobacterial copiotrophs, which were rare in the ambient water (< 0.1% of sequences), grew rapidly on all DOC amendments at both depths, but with different substrate preferences among lineages. We present a new analytical framework for using SIP to detect DOC incorporation across diverse oligotrophic bacterioplankton and discuss implications for the ecology of bacterial-DOC interactions among populations of diverging trophic strategies.

  16. Non-random assembly of bacterioplankton communities in the subtropical north pacific ocean.

    PubMed

    Eiler, Alexander; Hayakawa, Darin H; Rappé, Michael S

    2011-01-01

    The exploration of bacterial diversity in the global ocean has revealed new taxa and previously unrecognized metabolic potential; however, our understanding of what regulates this diversity is limited. Using terminal restriction fragment length polymorphism (T-RFLP) data from bacterial small-subunit ribosomal RNA genes we show that, independent of depth and time, a large fraction of bacterioplankton co-occurrence patterns are non-random in the oligotrophic North Pacific subtropical gyre (NPSG). Pair-wise correlations of all identified operational taxonomic units (OTUs) revealed a high degree of significance, with 6.6% of the pair-wise co-occurrences being negatively correlated and 20.7% of them being positive. The most abundant OTUs, putatively identified as Prochlorococcus, SAR11, and SAR116 bacteria, were among the most correlated OTUs. As expected, bacterial community composition lacked statistically significant patterns of seasonality in the mostly stratified water column except in a few depth horizons of the sunlit surface waters, with higher frequency variations in community structure apparently related to populations associated with the deep chlorophyll maximum. Communities were structured vertically into epipelagic, mesopelagic, and bathypelagic populations. Permutation-based statistical analyses of T-RFLP data and their corresponding metadata revealed a broad range of putative environmental drivers controlling bacterioplankton community composition in the NPSG, including concentrations of inorganic nutrients and phytoplankton pigments. Together, our results suggest that deterministic forces such as environmental filtering and interactions among taxa determine bacterioplankton community patterns, and consequently affect ecosystem functions in the NPSG.

  17. Tracking differential incorporation of dissolved organic carbon types among diverse lineages of Sargasso Sea bacterioplankton.

    PubMed

    Nelson, Craig E; Carlson, Craig A

    2012-06-01

    Bacterioplankton are the primary trophic conduit for dissolved organic carbon (DOC) and linking community structure with DOC utilization is central to understanding global carbon cycling. We coupled stable isotope probing (SIP) with 16S rRNA pyrosequencing in dark seawater culture experiments on euphotic and mesopelagic communities from the Sargasso Sea. Parallel cultures were amended with equimolar quantities of four DO(13) C substrates to simultaneously evaluate community utilization and population-specific incorporation. Of the substrates tested - two cyanobacterial products (exudates or lysates from a culture of Synechococcus) and two defined monosaccharides (glucose or gluconic acid) - the cyanobacterial exudates were incorporated by the greatest diversity of oligotrophic bacterioplankton populations in surface waters, including taxa from > 10 major subclades within the Flavobacteria, Actinobacteria, Verrucomicrobia and Proteobacteria (including SAR11). In contrast, the monosaccharide glucose was not incorporated by any taxa belonging to extant oligotrophic oceanic clades. Conversely, proteobacterial copiotrophs, which were rare in the ambient water (< 0.1% of sequences), grew rapidly on all DOC amendments at both depths, but with different substrate preferences among lineages. We present a new analytical framework for using SIP to detect DOC incorporation across diverse oligotrophic bacterioplankton and discuss implications for the ecology of bacterial-DOC interactions among populations of diverging trophic strategies. PMID:22507662

  18. Understanding diversity patterns in bacterioplankton communities from a sub-Antarctic peatland.

    PubMed

    Quiroga, María Victoria; Valverde, Angel; Mataloni, Gabriela; Cowan, Don

    2015-06-01

    Bacterioplankton communities inhabiting peatlands have the potential to influence local ecosystem functions. However, most microbial ecology research in such wetlands has been done in ecosystems (mostly peat soils) of the Northern Hemisphere, and very little is known of the factors that drive bacterial community assembly in other regions of the world. In this study, we used high-throughput sequencing to analyse the structure of the bacterial communities in five pools located in a sub-Antarctic peat bog (Tierra del Fuego, Argentina), and tested for relationships between bacterial communities and environmental conditions. Bacterioplankton communities in peat bog pools were diverse and dominated by members of the Proteobacteria, Actinobacteria, Bacteroidetes and Verrucomicrobia. Community structure was largely explained by differences in hydrological connectivity, pH and nutrient status (ombrotrophic versus minerotrophic pools). Bacterioplankton communities in ombrotrophic pools showed phylogenetic clustering, suggesting a dominant role of deterministic processes in shaping these assemblages. These correlations between habitat characteristics and bacterial diversity patterns provide new insights into the factors regulating microbial populations in peatland ecosystems.

  19. Seasonal assemblages and short-lived blooms in coastal north-west Atlantic Ocean bacterioplankton.

    PubMed

    El-Swais, Heba; Dunn, Katherine A; Bielawski, Joseph P; Li, William K W; Walsh, David A

    2015-10-01

    Temperate oceans are inhabited by diverse and temporally dynamic bacterioplankton communities. However, the role of the environment, resources and phytoplankton dynamics in shaping marine bacterioplankton communities at different time scales remains poorly constrained. Here, we combined time series observations (time scales of weeks to years) with molecular analysis of formalin-fixed samples from a coastal inlet of the north-west Atlantic Ocean to show that a combination of temperature, nitrate, small phytoplankton and Synechococcus abundances are best predictors for annual bacterioplankton community variability, explaining 38% of the variation. Using Bayesian mixed modelling, we identified assemblages of co-occurring bacteria associated with different seasonal periods, including the spring bloom (e.g. Polaribacter, Ulvibacter, Alteromonadales and ARCTIC96B-16) and the autumn bloom (e.g. OM42, OM25, OM38 and Arctic96A-1 clades of Alphaproteobacteria, and SAR86, OM60 and SAR92 clades of Gammaproteobacteria). Community variability over spring bloom development was best explained by silicate (32%)--an indication of rapid succession of bacterial taxa in response to diatom biomass--while nanophytoplankton as well as picophytoplankton abundance explained community variability (16-27%) over the transition into and out of the autumn bloom. Moreover, the seasonal structure was punctuated with short-lived blooms of rare bacteria including the KSA-1 clade of Sphingobacteria related to aromatic hydrocarbon-degrading bacteria.

  20. In situ substrate preferences of abundant bacterioplankton populations in a prealpine freshwater lake

    PubMed Central

    Salcher, Michaela M; Posch, Thomas; Pernthaler, Jakob

    2013-01-01

    The substrate partitioning of sympatric populations of freshwater bacterioplankton was studied via microautoradiography and fluorescence in situ hybridization. Fourteen radiolabeled tracers were used to assess microbial acquisition spectra of low-molecular-weight (LMW) organic compounds. The most abundant group, ac1 Actinobacteria, were highly active in leucine, thymidine and glucose assimilation, whereas Alphaproteobacteria from the LD12 lineage (the freshwater sister clade of SAR11) only weakly incorporated these tracers, but exhibited a distinct preference for glutamine and glutamate. Different Bacteroidetes showed contrasting uptake patterns: Flavobacteriales did not incorporate significant amounts of any LMW compound, and Cyclobacteriaceae were clearly specialized on leucine, glucose and arginine. Betaproteobacteria represented the most active and versatile bacterioplankton fraction and >90% of them could be assigned to eight species- to genus-like populations with contrasting substrate specialization. Limnohabitans sp. were the most abundant and active Betaproteobacteria, incorporating almost all tracers. While three closely related betaproteobacterial populations substantially differed in their uptake spectra, two more distantly related lineages had very similar preferences, and one population did not incorporate any tracer. The dominant phototrophic microorganism, the filamentous cyanobacterium Planktothrix rubescens, assimilated several substrates, whereas other (pico)cyanobacteria had no heterotrophic activity. The variable extent of specialization by the studied bacterial taxa on subsets of LMW compounds contrasts theoretical considerations about non-selective microbial substrate assimilation at oligotrophic conditions. This physiological niche separation might be one explanation for the coexistence of freshwater bacterioplankton species in a seemingly uniform environment. PMID:23235289

  1. Bacterioplankton community shifts associated with epipelagic and mesopelagic waters in the Southern Ocean

    PubMed Central

    Yu, Zheng; Yang, Jun; Liu, Lemian; Zhang, Wenjing; Amalfitano, Stefano

    2015-01-01

    The Southern Ocean is among the least explored marine environments on Earth, and still little is known about regional and vertical variability in the diversity of Antarctic marine prokaryotes. In this study, the bacterioplankton community in both epipelagic and mesopelagic waters was assessed at two adjacent stations by high-throughput sequencing and quantitative PCR. Water temperature was significantly higher in the superficial photic zone, while higher salinity and dissolved oxygen were recorded in the deeper water layers. The highest abundance of the bacterioplankton was found at a depth of 75 m, corresponding to the deep chlorophyll maximum layer. Both Alphaproteobacteria and Gammaproteobacteria were the most abundant taxa throughout the water column, while more sequences affiliated to Cyanobacteria and unclassified bacteria were identified from surface and the deepest waters, respectively. Temperature was the most significant environmental variable affecting the bacterial community structure. The bacterial community composition displayed significant differences at the epipelagic layers between two stations, whereas those in the mesopelagic waters were more similar to each other. Our results indicated that the epipelagic bacterioplankton might be dominated by short-term environmental variable conditions, whereas the mesopelagic communities appeared to be structured by longer water-mass residence time and relative stable environmental factors. PMID:26256889

  2. Identification of polyamine-responsive bacterioplankton taxa in South Atlantic Bight.

    PubMed

    Lu, Xinxin; Sun, Shulei; Hollibaugh, James T; Mou, Xiaozhen

    2015-12-01

    Putrescine and spermidine are short-chained aliphatic polyamines (PAs) that are ubiquitously distributed in seawater. These compounds may be important sources of dissolved organic carbon and nitrogen for marine bacterioplankton. Here, we used pyrotag sequencing to quantify the response of bacterioplankton to putrescine and spermidine amendments in microcosms established using surface waters collected at various stations in the South Atlantic Bight in October 2011. Our analysis showed that PA-responsive bacterioplankton consisted of bacterial taxa that are typically dominant in marine systems. Rhodobacteraceae (Alphaproteobacteria) was the taxon most responsive to PA additions at the nearshore site. Gammaproteobacteria of the families Piscirickettsiaceae; Vibrionaceae; and Vibrionaceae and Pseudoalteromonadaceae, were the dominant PA-responsive taxa in samples from the river-influenced coastal station, offshore station and open ocean station, respectively. The spatial variability of PA-responsive taxa may be attributed to differences in composition of the initial bacterial community and variations of in situ physiochemical conditions among sites. Our results also provided the first empirical evidence that Gammaproteobacteria might play an important role in PA transformation in marine systems.

  3. Temporal variability in the diversity and composition of stream bacterioplankton communities.

    PubMed

    Portillo, Maria C; Anderson, Suzanne P; Fierer, Noah

    2012-09-01

    Bacterioplankton in freshwater streams play a critical role in stream nutrient cycling. Despite their ecological importance, the temporal variability in the structure of stream bacterioplankton communities remains understudied. We investigated the composition and temporal variability of stream bacterial communities and the influence of physicochemical parameters on these communities. We used barcoded pyrosequencing to survey bacterial communities in 107 streamwater samples collected from four locations in the Colorado Rocky Mountains from September 2008 to November 2009. The four sampled locations harboured distinct communities yet, at each sampling location, there was pronounced temporal variability in both community composition and alpha diversity levels. These temporal shifts in bacterioplankton community structure were not seasonal; rather, their diversity and composition appeared to be driven by intermittent changes in various streamwater biogeochemical conditions. Bacterial communities varied independently of time, as indicated by the observation that communities in samples collected close together in time were no more similar than those collected months apart. The temporal turnover in community composition was higher than observed in most previously studied microbial, plant or animal communities, highlighting the importance of stochastic processes and disturbance events in structuring these communities over time. Detailed temporal sampling is important if the objective is to monitor microbial community dynamics in pulsed ecosystems like streams.

  4. Bacterioplankton community shifts associated with epipelagic and mesopelagic waters in the Southern Ocean.

    PubMed

    Yu, Zheng; Yang, Jun; Liu, Lemian; Zhang, Wenjing; Amalfitano, Stefano

    2015-01-01

    The Southern Ocean is among the least explored marine environments on Earth, and still little is known about regional and vertical variability in the diversity of Antarctic marine prokaryotes. In this study, the bacterioplankton community in both epipelagic and mesopelagic waters was assessed at two adjacent stations by high-throughput sequencing and quantitative PCR. Water temperature was significantly higher in the superficial photic zone, while higher salinity and dissolved oxygen were recorded in the deeper water layers. The highest abundance of the bacterioplankton was found at a depth of 75 m, corresponding to the deep chlorophyll maximum layer. Both Alphaproteobacteria and Gammaproteobacteria were the most abundant taxa throughout the water column, while more sequences affiliated to Cyanobacteria and unclassified bacteria were identified from surface and the deepest waters, respectively. Temperature was the most significant environmental variable affecting the bacterial community structure. The bacterial community composition displayed significant differences at the epipelagic layers between two stations, whereas those in the mesopelagic waters were more similar to each other. Our results indicated that the epipelagic bacterioplankton might be dominated by short-term environmental variable conditions, whereas the mesopelagic communities appeared to be structured by longer water-mass residence time and relative stable environmental factors.

  5. Metabolic diversity of heterotrophic bacterioplankton over winter and spring in the coastal Arctic Ocean.

    PubMed

    Sala, Maria Montserrat; Terrado, Ramon; Lovejoy, Connie; Unrein, Fernando; Pedrós-Alió, Carlos

    2008-04-01

    Metabolic diversity of heterotrophic bacterioplankton was tracked from early winter through spring with Biolog Ecoplates under the seasonally ice covered arctic shelf in the Canadian Arctic (Franklin Bay, Beaufort Sea). Samples were taken every 6 days from December 2003 to May 2004 at the surface, the halocline where a temperature inversion occurs, and at 200 m, close to the bottom. Despite the low nutrient levels and low chlorophyll a, suggesting oligotrophy in the winter surface waters, the number of substrates used (NSU) was greater than in spring, when chlorophyll a concentrations increased. Denaturing gradient gel electrophorisis analysis also indicated that the winter and spring bacterial communities were phylogenetically distinct, with several new bands appearing in spring. In spring, the bacterial community would have access to the freshly produced organic carbon from the early phytoplankton bloom and the growth of rapidly growing specialist phenotypes would be favoured. In contrast, in winter bacterioplankton consumed more complex organic matter originated during the previous year's phytoplankton production. At the other depths we tested the NSU was similar to that for the winter surface, with no seasonal pattern. Instead, bacterioplankton metabolism seemed to be influenced by resuspension, advection, and sedimentation events that contributed organic matter that enhanced bacterial metabolism. PMID:18218033

  6. Non-Random Assembly of Bacterioplankton Communities in the Subtropical North Pacific Ocean

    PubMed Central

    Eiler, Alexander; Hayakawa, Darin H.; Rappé, Michael S.

    2011-01-01

    The exploration of bacterial diversity in the global ocean has revealed new taxa and previously unrecognized metabolic potential; however, our understanding of what regulates this diversity is limited. Using terminal restriction fragment length polymorphism (T-RFLP) data from bacterial small-subunit ribosomal RNA genes we show that, independent of depth and time, a large fraction of bacterioplankton co-occurrence patterns are non-random in the oligotrophic North Pacific subtropical gyre (NPSG). Pair-wise correlations of all identified operational taxonomic units (OTUs) revealed a high degree of significance, with 6.6% of the pair-wise co-occurrences being negatively correlated and 20.7% of them being positive. The most abundant OTUs, putatively identified as Prochlorococcus, SAR11, and SAR116 bacteria, were among the most correlated OTUs. As expected, bacterial community composition lacked statistically significant patterns of seasonality in the mostly stratified water column except in a few depth horizons of the sunlit surface waters, with higher frequency variations in community structure apparently related to populations associated with the deep chlorophyll maximum. Communities were structured vertically into epipelagic, mesopelagic, and bathypelagic populations. Permutation-based statistical analyses of T-RFLP data and their corresponding metadata revealed a broad range of putative environmental drivers controlling bacterioplankton community composition in the NPSG, including concentrations of inorganic nutrients and phytoplankton pigments. Together, our results suggest that deterministic forces such as environmental filtering and interactions among taxa determine bacterioplankton community patterns, and consequently affect ecosystem functions in the NPSG. PMID:21747815

  7. Decrease of NH4+-N by bacterioplankton accelerated the removal of cyanobacterial blooms in aerated aquatic ecosystem.

    PubMed

    Yang, Xi; Xie, Ping; Ma, Zhimei; Wang, Qing; Fan, Huihui; Shen, Hong

    2013-11-01

    We used aerated systems to assess the influence of the bacterioplankton community on cyanobacterial blooms in algae/post-bloom of Lake Taihu, China. Bacterioplankton community diversity was evaluated by polymerase chain reaction-denaturing gradient gel electrophoresis (PCR-DGGE) fingerprinting. Chemical analysis and nitrogen dynamic changes illustrated that NH4+-N was nitrified to NO2--N and NO3--N by bacterioplankton. Finally, NH4+-N was exhausted and NO3--N was denitrified to NO2--N, while the accumulation of NO2--N indicated that bacterioplankton with completely aerobic denitrification ability were lacking in the water samples collected from Lake Taihu. We suggested that adding completely aerobic denitrification bacteria (to denitrify NO2--N to N2) would improve the water quality. PCR-DGGE and sequencing results showed that more than1/3 of the bacterial species were associated with the removal of nitrogen, and Acidovorax temperans was the dominant one. PCR-DGGE, variation of nitrogen, removal efficiencies of chlorophyll-a and canonical correspondence analysis indicated that the bacterioplanktonsignificantly influenced the physiological and biochemical changes of cyanobacteria. Additionally, the unweighted pair-group method with arithmetic means revealed there was no obvious harm to the microecosystem from aeration. The present study demonstrated that bacterioplankton can play crucial roles in aerated ecosystems, which could control the impact of cyanobacterial blooms in eutrophicated fresh water systems.

  8. Effects of temperature and nutrients on changes in genetic diversity of bacterioplankton communities in a semi-closed bay, South Korea.

    PubMed

    Kim, Hyun Jung; Jung, Seung Won; Lim, Dhong-Il; Jang, Min-Chul; Lee, Taek-Kyun; Shin, Kyoungsoon; Ki, Jang-Seu

    2016-05-15

    Bacterioplankton communities in a semi-closed bay (Jangmok Bay, South Korea) were analysed using a 16S rDNA multiplex 454 pyrosequencing approach. Diversity and operational taxonomic units of bacterioplankton communities in the Jangmok Bay are highest in cold water seasons and lowest in warm water ones. During cold seasons, α-proteobacteria respond rapidly to pulses of the concentration of inorganic nutrients, while γ-proteobacteria during warm water seasons are the most active type of bacterioplankton resent in the prevailing conditions, which include high dissolved organic carbon, chemical oxygen demand and primary production. Cyanobacteria, a minor group constituting 4.58% of the total bacterioplankton, are more abundant at low temperature. Flavobacteria are more abundant in nutrient-rich conditions and the abundance of this group also demonstrated a delayed decline following summer phytoplankton blooms. The pronounced seasonal oscillations in phosphorus concentration and temperature exert strong selection pressure on bacterioplankton communities. PMID:27001714

  9. Abundance and productivity of bacterioplankton in relation to seasonal upwelling in the northwest Indian Ocean

    NASA Astrophysics Data System (ADS)

    Wiebinga, Cas J.; Veldhuis, Marcel J. W.; De Baar, Hein J. W.

    1997-03-01

    The role of bacterioplankton in the Somali Current, the Gulf of Aden and the Red Sea was studied during the SW- (May-August 1992) and NE-monsoon (January-February 1993). The diversity in physical and biological characteristics of the regions and seasons is reflected in a broad range of both phyto- and bacterioplankton production. During the SW-monsoon, the Somali current showed highest bacterial production (up to 849 mgC m -2 day -1) in regions with enrichment of the surface waters by upwelling of cold, nutrient-rich, deep water. In contrast, the Gulf of Aden and the Red Sea were most productive during the NE-monsoon (average 225 mgC m -2 day -1). Depth profiles of the upper 300 m in general showed a subsurface maximum in bacterial abundance and production at 20-70 m depth. Heterotrophic activity and primary production were closely correlated, indicating the dependence of bacterioplankton on local phytoplankton-derived organic carbon and their ability to adapt quickly to changes in the environment. The bacterial carbon demand in the upper 300 m of the water column was largely supplied by phytoplankton production in the euphotic zone. Bacterial production was 18 ± 7% (average ± S.D.) of primary production. Assuming an assimilation efficiency of 50% for marine bacteria, they consumed up to half of the carbon produced by the phytoplankton. Cycling of carbon within the euphotic zone appears to be achieved by intense grazing by (micro)zooplankton and subsequent remineralization.

  10. Validity of eucaryote inhibitors for assessing production and grazing mortality of marine bacterioplankton. [Cyclidium sp

    SciTech Connect

    Taylor, G.T.; Pace, M.L.

    1987-01-01

    Application of eucaryote inhibitors to the estimation of production and grazing mortality of bacterioplankton was evaluated. Exposure to a range of concentrations of thiram, cycloheximide, and neutral red (0.4 to 210, 36 to 1777, 4 to 346 ..mu..M, respectively) was 98 to 100% effective at inhibiting growth of a chrysomonad in culture. Exposure to colchicine and griseofulvin (50 to 1000 ..mu..M for both) yielded only 24 to 94 and 53 to 79% inhibition, respectively. Exposures to thiram, neutral red, and griseofulvin were 90 to 100% effective at inhibiting the growth in culture of a ciliate, Cyclidium sp., and the responses to colchicine and cycloheximide were variable (64 to 100 and 0 to 100% inhibition, respectively). Thiram and neutral red inhibited field populations of nanozooplankton more effectively than cycloheximide and colchicine. Direct effects of eucaryote inhibitors on growing cultures of bacterioplankton varied with parameters measured and duration of exposure. After 3-day exposures, specific growth rates and instantaneous heterotrophic potential ((/sup 14/C)glucose uptake) were not consistently affected, but biosynthetic activity (RNA and DNA syntheses) was depressed. The degree of inhibition of isolates and field populations of phytoplankton depended upon type of inhibitor and phytoplankton species. In field experiments, it was possible to calculate rates of bacterioplankton production and grazing mortality for only 16 of 29 inhibitor experiments and for 4 of 10 size fractionation experiments. Because of the inconsistent results obtained in this investigation, the authors strongly recommend exercising caution in the application of inhibitor techniques to ecological problems, especially in phototrophically dominated systems.

  11. Spatial and temporal dynamics of phytoplankton and bacterioplankton biomass in Sanya Bay, northern South China Sea.

    PubMed

    Zhou, Weihua; Li, Tao; Cai, Chuanghua; Huang, Liangmin; Wang, Hankui; Xu, Jirong; Dong, Junde; Zhang, Si

    2009-01-01

    The composition of phytoplankton and the dynamics of phytoplankton and bacterioplankton biomass (PB and BB, respectively) of Sanya Bay, South China Sea, were determined. A total of 168 species (67 genera) phytoplankton were identified, including Bacillariophyta (diatom, 128 species), Pyrrophyta (35 species), Cyanophyta (3 species), and Chrysophyta (2 species). Annual average abundance of phytoplankton was 1.2 x 10(7) cells/m3, with the highest abundance in autumn, and the lowest in summer. Annual average diversity index (H') and evenness (J) values were 3.96 and 0.70, respectively. Average chlorophyll-a was 2.5 mg/m3, and the average PB was 124 mg C/m3, with the highest value in autumn. Surface PB was higher than the bottom, except for summer. Annual mean bacterioplankton abundance and BB were 6.9 x 10(11) cells/m3 and 13.8 mg C/m3, respectively. The highest BB was found in summer, followed by winter, spring, and autumn. Surface BB was higher than bottom all year round. The spatial distribution patterns of PB and BB were very similar with the highest biomass in the estuary, and decreased seaward, primarily due to the terrestrial input from the Sanya River and influx of oceanic water. The main factor influencing PB and BB was dissolved inorganic nitrogen (DIN). Other factors such as temperature, which is above 22 degrees C throughout the year, had a negligible impact. The correlation between BB and PB was significant (P < 0.01). The annual average ratio of BB/PB was 0.12 (0.06-0.15). Phytoplankton primary production was one of the most important factors in controlling the distribution of bacterioplankton.

  12. Comparative genomics - a perspective.

    PubMed

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-03-27

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allow for global views on genome evolution and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology.

  13. Comparative genomics - A perspective

    PubMed Central

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-01-01

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allow for global views on genome evolution and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology. PMID:17597925

  14. Substrate-controlled succession of marine bacterioplankton populations induced by a phytoplankton bloom.

    PubMed

    Teeling, Hanno; Fuchs, Bernhard M; Becher, Dörte; Klockow, Christine; Gardebrecht, Antje; Bennke, Christin M; Kassabgy, Mariette; Huang, Sixing; Mann, Alexander J; Waldmann, Jost; Weber, Marc; Klindworth, Anna; Otto, Andreas; Lange, Jana; Bernhardt, Jörg; Reinsch, Christine; Hecker, Michael; Peplies, Jörg; Bockelmann, Frank D; Callies, Ulrich; Gerdts, Gunnar; Wichels, Antje; Wiltshire, Karen H; Glöckner, Frank Oliver; Schweder, Thomas; Amann, Rudolf

    2012-05-01

    Phytoplankton blooms characterize temperate ocean margin zones in spring. We investigated the bacterioplankton response to a diatom bloom in the North Sea and observed a dynamic succession of populations at genus-level resolution. Taxonomically distinct expressions of carbohydrate-active enzymes (transporters; in particular, TonB-dependent transporters) and phosphate acquisition strategies were found, indicating that distinct populations of Bacteroidetes, Gammaproteobacteria, and Alphaproteobacteria are specialized for successive decomposition of algal-derived organic matter. Our results suggest that algal substrate availability provided a series of ecological niches in which specialized populations could bloom. This reveals how planktonic species, despite their seemingly homogeneous habitat, can evade extinction by direct competition. PMID:22556258

  15. Predicted Protein Subcellular Localization in Dominant Surface Ocean Bacterioplankton

    PubMed Central

    2012-01-01

    Bacteria consume dissolved organic matter (DOM) through hydrolysis, transport and intracellular metabolism, and these activities occur in distinct subcellular localizations. Bacterial protein subcellular localizations for several major marine bacterial groups were predicted using genomic, metagenomic and metatranscriptomic data sets following modification of MetaP software for use with partial gene sequences. The most distinct pattern of subcellular localization was found for Bacteroidetes, whose genomes were substantially enriched with outer membrane and extracellular proteins but depleted of inner membrane proteins compared with five other taxa (SAR11, Roseobacter, Synechococcus, Prochlorococcus, oligotrophic marine Gammaproteobacteria). When subcellular localization patterns were compared between genes and transcripts, three taxa had expression biased toward proteins localized to cell locations outside of the cytosol (SAR11, Roseobacter, and Synechococcus), as expected based on the importance of carbon and nutrient acquisition in an oligotrophic ocean, but two taxa did not (oligotrophic marine Gammaproteobacteria and Bacteroidetes). Diel variations in the fraction and putative gene functions of transcripts encoding inner membrane and periplasmic proteins compared to cytoplasmic proteins suggest a close coupling of photosynthetic extracellular release and bacterial consumption, providing insights into interactions between phytoplankton, bacteria, and DOM. PMID:22773648

  16. Exploitation of a high genomic mutation rate in Solenopsis invicta virus 1 to infer demographic information about its host, Solenopsis invicta

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The RNA-dependent RNA polymerase (RdRp) region of Solenopsis invicta virus 1 (SINV-1) was sequenced from 47 infected colonies of S. invicta, S. richteri, S. geminata, and S. invicta/ richteri hybrids collected from across the USA, northern Argentina, and northern Taiwan in an attempt to infer demogr...

  17. A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers

    PubMed Central

    2012-01-01

    Background Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. Results To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Conclusions Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants. PMID:23102090

  18. Palaeohexaploid ancestry for Caryophyllales inferred from extensive gene-based physical and genetic mapping of the sugar beet genome (Beta vulgaris).

    PubMed

    Dohm, Juliane C; Lange, Cornelia; Holtgräwe, Daniela; Sörensen, Thomas Rosleff; Borchardt, Dietrich; Schulz, Britta; Lehrach, Hans; Weisshaar, Bernd; Himmelbauer, Heinz

    2012-05-01

    Sugar beet (Beta vulgaris) is an important crop plant that accounts for 30% of the world's sugar production annually. The genus Beta is a distant relative of currently sequenced taxa within the core eudicotyledons; the genomic characterization of sugar beet is essential to make its genome accessible to molecular dissection. Here, we present comprehensive genomic information in genetic and physical maps that cover all nine chromosomes. Based on this information we identified the proposed ancestral linkage groups of rosids and asterids within the sugar beet genome. We generated an extended genetic map that comprises 1127 single nucleotide polymorphism markers prepared from expressed sequence tags and bacterial artificial chromosome (BAC) end sequences. To construct a genome-wide physical map, we hybridized gene-derived oligomer probes against two BAC libraries with 9.5-fold cumulative coverage of the 758 Mbp genome. More than 2500 probes and clones were integrated both in genetic maps and the physical data. The final physical map encompasses 535 chromosomally anchored contigs that contains 8361 probes and 22 815 BAC clones. By using the gene order established with the physical map, we detected regions of synteny between sugar beet (order Caryophyllales) and rosid species that involves 1400-2700 genes in the sequenced genomes of Arabidopsis, poplar, grapevine, and cacao. The data suggest that Caryophyllales share the palaeohexaploid ancestor proposed for rosids and asterids. Taken together, we here provide extensive molecular resources for sugar beet and enable future high-resolution trait mapping, gene identification, and cross-referencing to regions sequenced in other plant species.

  19. Snowmelt-driven changes in dissolved organic matter and bacterioplankton communities in the Heilongjiang watershed of China.

    PubMed

    Qiu, Linlin; Cui, Hongyang; Wu, Junqiu; Wang, Baijie; Zhao, Yue; Li, Jiming; Jia, Liming; Wei, Zimin

    2016-06-15

    Bacterioplankton plays a significant role in the circulation of materials and ecosystem function in the biosphere. Dissolved organic matter (DOM) from dead plant material and surface soil leaches into water bodies when snow melts. In our study, water samples from nine sampling sites along the Heilongjiang watershed were collected in February and June 2014 during which period snowmelt occurred. The goal of this study was to characterize changes in DOM and bacterioplankton community composition (BCC) associated with snowmelt, the effects of DOM, environmental and geographical factors on the distribution of BCC and interactions of aquatic bacterioplankton populations with different sources of DOM in the Heilongjiang watershed. BCC was measured by denaturing gradient gel electrophoresis (DGGE). DOM was measured by excitation-emission matrix (EEM) fluorescence spectroscopy. Bacterioplankton exhibited a distinct seasonal change in community composition due to snowmelt at all sampling points except for EG. Redundancy analysis (RDA) indicated that BCC was more closely related to DOM (Components 1 and 4, dissolved organic carbon, biochemical oxygen demand and chlorophyll a) and environmental factors (water temperature and nitrate nitrogen) than geographical factors. Furthermore, DOM had a greater impact on BCC than environmental factors (29.80 vs. 15.90% of the variation). Overall, spring snowmelt played an important role in altering the quality and quantity of DOM and BCC in the Heilongjiang watershed.

  20. The structure and stability of the bacterioplankton community in Antarctic freshwater lakes, subject to extremely rapid environmental change.

    PubMed

    Pearce, David A

    2005-06-01

    In this study, variation in the bacterioplankton community structure of three Antarctic lakes of different nutrient status, was determined in relation to physical and chemical gradients at depth and at time intervals, across the seasonal transition from winter ice-cover to the summer ice-free period. The three lakes studied were: Moss Lake (low nutrient, with typical nutrient concentrations of 80 microg l(-1) nitrate and 10 microg l(-1) dissolved reactive phosphate), Sombre Lake (low nutrient, but becoming progressively enriched, with typical nutrient concentrations of 185 microg l(-1) nitrate and 7 microg l(-1) dissolved reactive phosphate) and Heywood Lake (enriched, with typical nutrient concentrations of 1180 microg l(-1) nitrate and 124 microg l(-1) dissolved reactive phosphate). Bacterioplankton community structure was determined using a combination of PCR amplification of 16S rRNA gene fragments and denaturing gradient gel electrophoresis (DGGE). Results indicated marked changes in this bacterioplankton community structure, which were particularly associated with the transition period. However, significant changes also occurred during the period of holomixis. Comparison of the results from lakes of different nutrient status suggest that increased levels of nutrient input, and in the timing and duration of ice cover will lead to marked changes in the structure and stability of the bacterioplankton community at existing levels of environmental change.

  1. Response of marine bacterioplankton pH homeostasis gene expression to elevated CO2

    NASA Astrophysics Data System (ADS)

    Bunse, Carina; Lundin, Daniel; Karlsson, Christofer M. G.; Akram, Neelam; Vila-Costa, Maria; Palovaara, Joakim; Svensson, Lovisa; Holmfeldt, Karin; González, José M.; Calvo, Eva; Pelejero, Carles; Marrasé, Cèlia; Dopson, Mark; Gasol, Josep M.; Pinhassi, Jarone

    2016-05-01

    Human-induced ocean acidification impacts marine life. Marine bacteria are major drivers of biogeochemical nutrient cycles and energy fluxes; hence, understanding their performance under projected climate change scenarios is crucial for assessing ecosystem functioning. Whereas genetic and physiological responses of phytoplankton to ocean acidification are being disentangled, corresponding functional responses of bacterioplankton to pH reduction from elevated CO2 are essentially unknown. Here we show, from metatranscriptome analyses of a phytoplankton bloom mesocosm experiment, that marine bacteria responded to lowered pH by enhancing the expression of genes encoding proton pumps, such as respiration complexes, proteorhodopsin and membrane transporters. Moreover, taxonomic transcript analysis showed that distinct bacterial groups expressed different pH homeostasis genes in response to elevated CO2. These responses were substantial for numerous pH homeostasis genes under low-chlorophyll conditions (chlorophyll a <2.5 μg l-1) however, the changes in gene expression under high-chlorophyll conditions (chlorophyll a >20 μg l-1) were low. Given that proton expulsion through pH homeostasis mechanisms is energetically costly, these findings suggest that bacterioplankton adaptation to ocean acidification could have long-term effects on the economy of ocean ecosystems.

  2. Impact of warming on phyto-bacterioplankton coupling and bacterial community composition in experimental mesocosms.

    PubMed

    von Scheibner, Markus; Dörge, Petra; Biermann, Antje; Sommer, Ulrich; Hoppe, Hans-Georg; Jürgens, Klaus

    2014-03-01

    Global warming is assumed to alter the trophic interactions and carbon flow patterns of aquatic food webs. The impact of temperature on phyto-bacterioplankton coupling and bacterial community composition (BCC) was the focus of the present study, in which an indoor mesocosm experiment with natural plankton communities from the western Baltic Sea was conducted. A 6 °C increase in water temperature resulted, as predicted, in tighter coupling between the diatom-dominated phytoplankton and heterotrophic bacteria, accompanied by a strong increase in carbon flow into bacterioplankton during the phytoplankton bloom phase. Suppressed bacterial development at cold in situ temperatures probably reflected lowered bacterial production and grazing by protists, as the latter were less affected by low temperatures. BCC was strongly influenced by the phytoplankton bloom stage and to a lesser extent by temperature. Under both temperature regimes, Gammaproteobacteria clearly dominated during the phytoplankton peak, with Glaciecola sp. as the single most abundant taxon. However, warming induced the appearance of additional bacterial taxa belonging to Betaproteobacteria and Bacteroidetes. Our results show that warming during an early phytoplankton bloom causes a shift towards a more heterotrophic system, with the appearance of new bacterial taxa suggesting a potential for utilization of a broader substrate spectrum.

  3. Response of rare, common and abundant bacterioplankton to anthropogenic perturbations in a Mediterranean coastal site.

    PubMed

    Baltar, Federico; Palovaara, Joakim; Vila-Costa, Maria; Salazar, Guillem; Calvo, Eva; Pelejero, Carles; Marrasé, Cèlia; Gasol, Josep M; Pinhassi, Jarone

    2015-06-01

    Bacterioplankton communities are made up of a small set of abundant taxa and a large number of low-abundant organisms (i.e. 'rare biosphere'). Despite the critical role played by bacteria in marine ecosystems, it remains unknown how this large diversity of organisms are affected by human-induced perturbations, or what controls the responsiveness of rare compared to abundant bacteria. We studied the response of a Mediterranean bacterioplankton community to two anthropogenic perturbations (i.e. nutrient enrichment and/or acidification) in two mesocosm experiments (in winter and summer). Nutrient enrichment increased the relative abundance of some operational taxonomic units (OTUs), e.g. Polaribacter, Tenacibaculum, Rhodobacteraceae and caused a relative decrease in others (e.g. Croceibacter). Interestingly, a synergistic effect of acidification and nutrient enrichment was observed on specific OTUs (e.g. SAR86). We analyzed the OTUs that became abundant at the end of the experiments and whether they belonged to the rare (<0.1% of relative abundance), the common (0.1-1.0% of relative abundance) or the abundant (>1% relative abundance) fractions. Most of the abundant OTUs at the end of the experiments were abundant, or at least common, in the original community of both experiments, suggesting that ecosystem alterations do not necessarily call for rare members to grow. PMID:26032602

  4. Phytoplankton, bacterioplankton and virioplankton structure and function across the southern Great Barrier Reef shelf

    NASA Astrophysics Data System (ADS)

    Alongi, Daniel M.; Patten, Nicole L.; McKinnon, David; Köstner, Nicole; Bourne, David G.; Brinkman, Richard

    2015-02-01

    Bacterioplankton and phytoplankton dynamics, pelagic respiration, virioplankton abundance, and the diversity of pelagic diazotrophs and other bacteria were examined in relation to water-column nutrients and vertical mixing across the southern Great Barrier Reef (GBR) shelf where sharp inshore to offshore gradients in water chemistry and hydrology prevail. A principal component analysis (PCA) revealed station groups clustered geographically, suggesting across-shelf differences in plankton function and structure driven by changes in mixing intensity, sediment resuspension, and the relative contributions of terrestrial, reef and oceanic nutrients. At most stations and sampling periods, microbial abundance and activities peaked both inshore and at channels between outer shelf reefs of the Pompey Reef complex. PCA also revealed that virioplankton numbers and biomass correlated with bacterioplankton numbers and production, and that bacterial growth and respiration correlated with net primary production, suggesting close virus-bacteria-phytoplankton interactions; all plankton groups correlated with particulate C, N, and P. Strong vertical mixing facilitates tight coupling of pelagic and benthic shelf processes as, on average, 37% and 56% of N and P demands of phytoplankton are derived from benthic nutrient regeneration and resuspension. These across-shelf planktonic trends mirror those of the benthic microbial community.

  5. Influence of macrophyte decomposition on growth rate and community structure of Okefenokee Swamp bacterioplankton

    SciTech Connect

    Murray, R.E.; Hodson, R.E.

    1986-02-01

    Dissolved substances released during decomposition of the white water lily (Nymphaea odorata) can alter the growth rate of Okefenokee Swamp bacterioplankton. In microcosm experiments dissolved compounds released bacterioplankton, followed by a period of intense bacterial growth. Rates of (/sup 3/H)thymidine incorporation and turnover of dissolved D-glucose were depressed by over 85%, 3 h after the addition of Nymphaea leachates to microcosms containing Okefenokee Swamp water. Bacterial activity subsequently recovered; after 20 h (/sup 3/H)thymidine incorporation in leachate-treated microcosms was 10-fold greater than that in control microcosms. The recovery of activity was due to a shift in the composition of the bacterial population toward resistance to the inhibitory compounds present in Nymphaea leachates. Inhibitory compounds released during the decomposition of aquatic macrophytes thus act as selective agents which alter the community structure of the bacterial population with respect to leachate resistance. Soluble compounds derived from macrophyte decomposition influence the rate of bacterial secondary production and the availability of microbial biomass to microconsumers.

  6. Evidence of bacterioplankton community adaptation in response to long-term mariculture disturbance

    PubMed Central

    Xiong, Jinbo; Chen, Heping; Hu, Changju; Ye, Xiansen; Kong, Dingjiang; Zhang, Demin

    2015-01-01

    Understanding the underlying mechanisms that shape the temporal dynamics of a microbial community has important implications for predicting the trajectory of an ecosystem’s response to anthropogenic disturbances. Here, we evaluated the seasonal dynamics of bacterioplankton community composition (BCC) following more than three decades of mariculture disturbance in Xiangshan Bay. Clear seasonal succession and site (fish farm and control site) separation of the BCC were observed, which were primarily shaped by temperature, dissolved oxygen and sampling time. However, the sensitive bacterial families consistently changed in relative abundance in response to mariculture disturbance, regardless of the season. Temporal changes in the BCC followed the time-decay for similarity relationship at both sites. Notably, mariculture disturbance significantly (P < 0.001) flattened the temporal turnover but intensified bacterial species-to-species interactions. The decrease in bacterial temporal turnover under long-term mariculture disturbance was coupled with a consistent increase in the percentage of deterministic processes that constrained bacterial assembly based on a null model analysis. The results demonstrate that the BCC is sensitive to mariculture disturbance; however, a bacterioplankton community could adapt to a long-term disturbance via attenuating temporal turnover and intensifying species-species interactions. These findings expand our current understanding of microbial assembly in response to long-term anthropogenic disturbances. PMID:26471739

  7. Identification of Associations between Bacterioplankton and Photosynthetic Picoeukaryotes in Coastal Waters.

    PubMed

    Farnelid, Hanna M; Turk-Kubo, Kendra A; Zehr, Jonathan P

    2016-01-01

    Photosynthetic picoeukaryotes are significant contributors to marine primary productivity. Associations between marine bacterioplankton and picoeukaryotes frequently occur and can have large biogeochemical impacts. We used flow cytometry to sort cells from seawater to identify non-eukaryotic phylotypes that are associated with photosynthetic picoeukaryotes. Samples were collected at the Santa Cruz wharf on Monterey Bay, CA, USA during summer and fall, 2014. The phylogeny of associated microbes was assessed through 16S rRNA gene amplicon clone and Illumina MiSeq libraries. The most frequently detected bacterioplankton phyla within the photosynthetic picoeukaryote sorts were Proteobacteria (Alphaproteobacteria and Gammaproteobacteria) and Bacteroidetes. Intriguingly, the presence of free-living bacterial genera in the photosynthetic picoeukaryote sorts could suggest that some of the photosynthetic picoeukaryotes were mixotrophs. However, the occurrence of bacterial sequences, which were not prevalent in the corresponding bulk seawater samples, indicates that there was also a selection for specific OTUs in association with photosynthetic picoeukaryotes suggesting specific functional associations. The results show that diverse bacterial phylotypes are found in association with photosynthetic picoeukaryotes. Taxonomic identification of these associations is a prerequisite for further characterizing and to elucidate their metabolic pathways and ecological functions.

  8. Relation between presence-absence of a visible nucleoid and metabolic activity in bacterioplankton cells

    SciTech Connect

    Choi, Joon, W.; Sherr, E.B.; Sherr, B.F.

    1996-09-01

    We investigated the report of Zweifel and Hagstroem that only a portion of marine bacteria contain nucleoids--the DNA-containing regions of procaryotic cells-- and that such bacteria correspond to the active or viable fraction of bacterioplankton. In Oregon coastal waters, 21-64% of bacteria had visible nucleoids; number of nucleoid-visible (NV) bacteria were greater than numbers of metabolically active bacteria, based on cells with active electron transport systems (ETS) and intact cell membranes. During log growth of a marine isolate, proportions of NV and ETS-active cells approached 100%. In stationary growth phase, the fraction of ETS-active cells decreased rapidly, while that of NV cells remained high for 7 d. When starved cells of the isolate were resupplied with nutrient (50 mg liter{sup -1} peptone), total cell number did not increase during the initial 6 h, but the proportion of NV cells increased from 27 to 100%, and that of ETS-active cells from 6 to 75%. In an analogous experiment with a bacterioplankton assemblage, a similar trend was observed: the number of NV cells double during the initial 6 h prior to an increase in total cell counts. These results show that some bacteria without visible nucleoids are capable of becoming NV cells, and thus have DNa in a nucleoid region not detectable with the method used here. 18 refs., 4 figs., 1 tab.

  9. Spatial-Temporal Changes of Bacterioplankton Community along an Exhorheic River

    PubMed Central

    Ma, Lili; Mao, Guannan; Liu, Jie; Gao, Guanghai; Zou, Changliang; Bartlam, Mark G.; Wang, Yingying

    2016-01-01

    To date, few aquatic microbial ecology studies have discussed the variability of the microbial community in exorheic river ecosystems on both the spatial and seasonal scales. In this study, we examined the spatio-temporal variation of bacterioplankton community composition in an anthropogenically influenced exorheic river, the Haihe River in Tianjin, China, using pyrosequencing analysis of 16S rRNA genes. It was verified by one-way ANOVA that the spatial variability of the bacterioplankton community composition over the whole river was stronger than the seasonal variation. Salinity was a major factor leading to spatial differentiation of the microbial community structure into riverine and estuarial parts. A high temperature influence on the seasonal bacterial community variation was only apparent within certain kinds of environments (e.g., the riverine part). Bacterial community richness and diversity both exhibited significant spatial changes, and their seasonal variations were completely different in the two environments studied here. Furthermore, riverine bacterial community assemblages were subdivided into urban and rural groups due to changes in the nutritional state of the river. In addition, the nutrient-loving group including Limnohabitans, Hydrogenophaga, and Polynucleobacter were abundant in the urbanized Haihe River, indicating the environmental factors in these anthropogenic waterbodies heavily influence the core freshwater community composition. PMID:26973627

  10. Recruitment of Members from the Rare Biosphere of Marine Bacterioplankton Communities after an Environmental Disturbance

    PubMed Central

    Sjöstedt, Johanna; Koch-Schmidt, Per; Pontarp, Mikael; Canbäck, Björn; Tunlid, Anders; Lundberg, Per; Hagström, Åke

    2012-01-01

    A bacterial community may be resistant to environmental disturbances if some of its species show metabolic flexibility and physiological tolerance to the changing conditions. Alternatively, disturbances can change the composition of the community and thereby potentially affect ecosystem processes. The impact of disturbance on the composition of bacterioplankton communities was examined in continuous seawater cultures. Bacterial assemblages from geographically closely connected areas, the Baltic Sea (salinity 7 and high dissolved organic carbon [DOC]) and Skagerrak (salinity 28 and low DOC), were exposed to gradual opposing changes in salinity and DOC over a 3-week period such that the Baltic community was exposed to Skagerrak salinity and DOC and vice versa. Denaturing gradient gel electrophoresis and clone libraries of PCR-amplified 16S rRNA genes showed that the composition of the transplanted communities differed significantly from those held at constant salinity. Despite this, the growth yields (number of cells ml−1) were similar, which suggests similar levels of substrate utilization. Deep 454 pyrosequencing of 16S rRNA genes showed that the composition of the disturbed communities had changed due to the recruitment of phylotypes present in the rare biosphere of the original community. The study shows that members of the rare biosphere can become abundant in a bacterioplankton community after disturbance and that those bacteria can have important roles in maintaining ecosystem processes. PMID:22194288

  11. Identification of Associations between Bacterioplankton and Photosynthetic Picoeukaryotes in Coastal Waters.

    PubMed

    Farnelid, Hanna M; Turk-Kubo, Kendra A; Zehr, Jonathan P

    2016-01-01

    Photosynthetic picoeukaryotes are significant contributors to marine primary productivity. Associations between marine bacterioplankton and picoeukaryotes frequently occur and can have large biogeochemical impacts. We used flow cytometry to sort cells from seawater to identify non-eukaryotic phylotypes that are associated with photosynthetic picoeukaryotes. Samples were collected at the Santa Cruz wharf on Monterey Bay, CA, USA during summer and fall, 2014. The phylogeny of associated microbes was assessed through 16S rRNA gene amplicon clone and Illumina MiSeq libraries. The most frequently detected bacterioplankton phyla within the photosynthetic picoeukaryote sorts were Proteobacteria (Alphaproteobacteria and Gammaproteobacteria) and Bacteroidetes. Intriguingly, the presence of free-living bacterial genera in the photosynthetic picoeukaryote sorts could suggest that some of the photosynthetic picoeukaryotes were mixotrophs. However, the occurrence of bacterial sequences, which were not prevalent in the corresponding bulk seawater samples, indicates that there was also a selection for specific OTUs in association with photosynthetic picoeukaryotes suggesting specific functional associations. The results show that diverse bacterial phylotypes are found in association with photosynthetic picoeukaryotes. Taxonomic identification of these associations is a prerequisite for further characterizing and to elucidate their metabolic pathways and ecological functions. PMID:27148165

  12. Short-Term Dynamics of North Sea Bacterioplankton-Dissolved Organic Matter Coherence on Molecular Level

    PubMed Central

    Lucas, Judith; Koester, Irina; Wichels, Antje; Niggemann, Jutta; Dittmar, Thorsten; Callies, Ulrich; Wiltshire, Karen H.; Gerdts, Gunnar

    2016-01-01

    Remineralization and transformation of dissolved organic matter (DOM) by marine microbes shape the DOM composition and thus, have large impact on global carbon and nutrient cycling. However, information on bacterioplankton-DOM interactions on a molecular level is limited. We examined the variation of bacterial community composition (BCC) at Helgoland Roads (North Sea) in relation to variation of molecular DOM composition and various environmental parameters on short-time scales. Surface water samples were taken daily over a period of 20 days. Bacterial community and molecular DOM composition were assessed via 16S rRNA gene tag sequencing and ultrahigh resolution Fourier-transform ion cyclotron resonance mass spectrometry (FT-ICR-MS), respectively. Environmental conditions were driven by a coastal water influx during the first half of the sampling period and the onset of a summer phytoplankton bloom toward the end of the sampling period. These phenomena led to a distinct grouping of bacterial communities and DOM composition which was particularly influenced by total dissolved nitrogen (TDN) concentration, temperature, and salinity, as revealed by distance-based linear regression analyses. Bacterioplankton-DOM interaction was demonstrated in strong correlations between specific bacterial taxa and particular DOM molecules, thus, suggesting potential specialization on particular substrates. We propose that a combination of high resolution techniques, as used in this study, may provide substantial information on substrate generalists and specialists and thus, contribute to prediction of BCC variation. PMID:27014241

  13. Identification of Associations between Bacterioplankton and Photosynthetic Picoeukaryotes in Coastal Waters

    PubMed Central

    Farnelid, Hanna M.; Turk-Kubo, Kendra A.; Zehr, Jonathan P.

    2016-01-01

    Photosynthetic picoeukaryotes are significant contributors to marine primary productivity. Associations between marine bacterioplankton and picoeukaryotes frequently occur and can have large biogeochemical impacts. We used flow cytometry to sort cells from seawater to identify non-eukaryotic phylotypes that are associated with photosynthetic picoeukaryotes. Samples were collected at the Santa Cruz wharf on Monterey Bay, CA, USA during summer and fall, 2014. The phylogeny of associated microbes was assessed through 16S rRNA gene amplicon clone and Illumina MiSeq libraries. The most frequently detected bacterioplankton phyla within the photosynthetic picoeukaryote sorts were Proteobacteria (Alphaproteobacteria and Gammaproteobacteria) and Bacteroidetes. Intriguingly, the presence of free-living bacterial genera in the photosynthetic picoeukaryote sorts could suggest that some of the photosynthetic picoeukaryotes were mixotrophs. However, the occurrence of bacterial sequences, which were not prevalent in the corresponding bulk seawater samples, indicates that there was also a selection for specific OTUs in association with photosynthetic picoeukaryotes suggesting specific functional associations. The results show that diverse bacterial phylotypes are found in association with photosynthetic picoeukaryotes. Taxonomic identification of these associations is a prerequisite for further characterizing and to elucidate their metabolic pathways and ecological functions. PMID:27148165

  14. Impact of photochemical processing of DOC on the bacterioplankton respiratory quotient in aquatic ecosystems

    NASA Astrophysics Data System (ADS)

    Allesson, Lina; Ström, Lena; Berggren, Martin

    2016-07-01

    Many studies assume a respiratory quotient (RQ = molar ratio of CO2 produced to O2 consumed) close to 1 when calculating bacterioplankton respiration. However, evidence suggests that RQ depends on the chemical composition of the respired substrate pool that may be altered by photochemical production of oxygen-rich substrates, resulting in elevated RQs. Here we conducted a novel study of the impact of photochemical processing of dissolved organic carbon (DOC) on RQ. We monitored the bacterial RQ in bioassays of both ultraviolet light irradiated and nonirradiated humic lake water, using optic gas-pressure sensors. In the experimentally irradiated samples the average RQ value was significantly higher (3.4-3.5 [±0.4 standard error (SE)]) than that in the dark controls (1.3 [±0.1 SE]). Our results show that the RQ is systematically higher than 1 when the bacterial metabolism in large part is based on photoproducts. By assuming an RQ of 1, bacterioplankton respiration in freshwater ecosystems may be greatly underestimated.

  15. Influence of Macrophyte Decomposition on Growth Rate and Community Structure of Okefenokee Swamp Bacterioplankton

    PubMed Central

    Murray, Robert E.; Hodson, Robert E.

    1986-01-01

    Dissolved substances released during decomposition of the white water lily (Nymphaea odorata) can alter the growth rate of Okefenokee Swamp bacterioplankton. In microcosm experiments dissolved compounds released from senescent Nymphaea leaves caused a transient reduction in the abundance and activity of water column bacterioplankton, followed by a period of intense bacterial growth. Rates of [3H]thymidine incorporation and turnover of dissolved d-glucose were depressed by over 85%, 3 h after the addition of Nymphaea leachates to microcosms containing Okefenokee Swamp water. Bacterial activity subsequently recovered; after 20 h [3H]thymidine incorporation in leachate-treated microcosms was 10-fold greater than that in control microcosms. The recovery of activity was due to a shift in the composition of the bacterial population toward resistance to the inhibitory compounds present in Nymphaea leachates. Inhibitory compounds released during the decomposition of aquatic macrophytes thus act as selective agents which alter the community structure of the bacterial population with respect to leachate resistance. Soluble compounds derived from macrophyte decomposition influence the rate of bacterial secondary production and the availability of microbial biomass to microconsumers. Images PMID:16346986

  16. Bacterioplankton and bacteriobenthos of the Amur estuary and the adjacent areas in the Summer of 2006

    NASA Astrophysics Data System (ADS)

    Karetnikova, E. A.; Garetova, L. A.

    2009-06-01

    This work presents the results of the study of the plankton and benthic microbiocenoses of the Amur estuary. It is shown that the distribution of the total abundance and indicator groups of the bacteriobenthos are characterized by stronger heterogeneity compared with the bacterioplankton and that it depends on the Amur river drainage and the bottom type. The river drainage helps by increasing the overall bacterioplankton abundance in the near-mouth part of the estuary. Microorganisms utilizing low concentrations of OM play a major role in the processes of the OM utilization in the water and bottom sediments. Saprophytic bacteria play a significant role in the OM utilization only in the water at certain sampling sites in the Tatarskiy Strait and Sakhalin Bay and in the bottom sediments taken in the mouth part of the estuary. Some parts of the estuary subject to organic contamination are found according to the microbiological characteristics. It is shown that the fluctuation of the salinity leads to a change of the share of bacteria with different food demands in the microbial community.

  17. Purple Sulfur Bacteria Control the Growth of Aerobic Heterotrophic Bacterioplankton in a Meromictic Salt Lake

    PubMed Central

    Overmann, J.; Beatty, J. T.; Hall, K. J.

    1996-01-01

    In meromictic Mahoney Lake, British Columbia, Canada, the heterotrophic bacterial production in the mixolimnion exceeded concomitant primary production by a factor of 7. Bacterial growth rates were correlated neither to primary production nor to the amount of chlorophyll a. Both results indicate an uncoupling of bacteria and phytoplankton. In the chemocline of the lake, an extremely dense population of the purple sulfur bacterium Amoebobacter purpureus is present year round. We investigated whether anoxygenic phototrophs are significant for the growth of aerobic bacterioplankton in the overlaying water. Bacterial growth rates in the mixolimnion were limited by inorganic phosphorus or nitrogen most of the time, and the biomass of heterotrophic bacteria did not increase until, in autumn, 86% of the cells of A. purpureus appeared in the mixolimnion because of their reduced buoyant density. The increase in heterotrophic bacterial biomass, soluble phosphorus concentrations below the detection limit, and an extraordinarily high activity of alkaline phosphatase in the mixolimnion indicate a rapid liberation of organically bound phosphorus from A. purpureus cells accompanied by a simultaneous incorporation into heterotrophic bacterioplankton. High concentrations of allochthonously derived dissolved organic carbon (mean, 60 mg of C(middot)liter(sup-1)) were measured in the lake water. In Mahoney Lake, liberation of phosphorus from upwelling purple sulfur bacteria and degradation of allochthonous dissolved organic carbon as an additional carbon source render heterotrophic bacterial production largely independent of the photosynthesis of phytoplankton. A recycling of inorganic nutrients via phototrophic bacteria also appears to be relevant in other lakes with anoxic bottom waters. PMID:16535399

  18. Physiology and phylogeny of the candidate phylum "Atribacteria" (formerly OP9/JS1) inferred from single-cell genomics and metagenomics

    NASA Astrophysics Data System (ADS)

    Dodsworth, J. A.; Murugapiran, S.; Blainey, P. C.; Nobu, M.; Rinke, C.; Schwientek, P.; Gies, E.; Webster, G.; Kille, P.; Weightman, A.; Liu, W. T.; Hallam, S.; Tsiamis, G.; Swingley, W.; Ross, C.; Tringe, S. G.; Chain, P. S.; Scholz, M. B.; Lo, C. C.; Raymond, J.; Quake, S. R.; Woyke, T.; Hedlund, B. P.

    2014-12-01

    Single-cell sequencing and metagenomics have extended the genomics revolution to yet-uncultivated microorganisms and provided insights into the coding potential of this so-called "microbial dark matter", including microbes belonging candidate phyla with no cultivated representatives. As more datasets emerge, comparison of individual genomes from different lineages and habitats can provide insight into the phylogeny, conserved features, and potential metabolic diversity of candidate phyla. The candidate bacterial phylum OP9 was originally found in Obsidian Pool, Yellowstone National Park, and it has since been detected in geothermal springs, petroleum reservoirs, and engineered thermal environments worldwide. JS1, another uncultivated bacterial lineage affiliated with OP9, is often abundant in marine sediments associated with methane hydrates, hydrocarbon seeps, and on continental margins and shelves, and is found in other non-thermal marine and subsurface environments. The phylogenetic relationship between OP9, JS1, and other Bacteria has not been fully resolved, and to date no axenic cultures from these lineages have been reported. Recently, 31 single amplified genomes (SAGs) from six distinct OP9 and JS1 lineages have been obtained using flow cytometric and microfluidic techniques. These SAGs were used to inform metagenome binning techniques that identified OP9/JS1 sequences in several metagenomes, extending genomic coverage in three of the OP9 and JS1 lineages. Phylogenomic analyses of these SAG and metagenome bin datasets suggest that OP9 and JS1 constitute a single, deeply branching phylum, for which the name "Atribacteria" has recently been proposed. Overall, members of the "Atribacteria" are predicted to be heterotrophic anaerobes without the capacity for respiration, with some lineages potentially specializing in secondary fermentation of organic acids. A set of signature "Atribacteria" genes was tentatively identified, including components of a bacterial

  19. Temporal Patterns in Bacterioplankton Community Composition in Three Reservoirs of Similar Trophic Status in Shenzhen, China

    PubMed Central

    Li, Jiancheng; Chen, Cheng; Lu, Jun; Lei, Anping; Hu, Zhangli

    2016-01-01

    The bacterioplankton community composition’s (BCC) spatial and temporal variation patterns in three reservoirs (Shiyan, Xikeng, and LuoTian Reservoir) of similar trophic status in Bao’an District, Shenzhen (China), were investigated using PCR amplification of the 16S rDNA gene and the denaturing gradient gel electrophoresis (DGGE) techniques. Water samples were collected monthly in each reservoir during 12 consecutive months. Distinct differences were detected in band number, pattern, and density of DGGE at different sampling sites and time points. Analysis of the DGGE fingerprints showed that changes in the bacterial community structure mainly varied with seasons, and the patterns of change indicated that seasonal forces might have a more significant impact on the BCC than eutrophic status in the reservoirs, despite the similar Shannon-Weiner index among the three reservoirs. The sequences obtained from excised bands were affiliated with Cyanobacteria, Firmicutes, Bacteriodetes, Acidobacteria, Actinobacteria, Planctomycetes, and Proteobacteria. PMID:27322295

  20. Marine bacterioplankton diversity and community composition in an antarctic coastal environment.

    PubMed

    Lo Giudice, Angelina; Caruso, Consolazione; Mangano, Santina; Bruni, Vivia; De Domenico, Maria; Michaud, Luigi

    2012-01-01

    The bacterial community inhabiting the water column at Terra Nova Bay (Ross Sea, Antarctica) was examined by the fluorescent in situ hybridization (FISH) technique and the genotypic and phenotypic characterization of 606 bacterial isolates. Overall, the FISH analysis revealed a bacterioplankton composition that was typical of Antarctic marine environments with the Cytophaga/Flavobacter (CF) group of Bacteroidetes that was equally dominant with the Actinobacteria and Gammaproteobacteria. As sampling was performed during the decay of sea-ice, it is plausible to assume the origin of Bacteroidetes from the sea-ice compartment where they probably thrive in high concentration of DOM which is efficiently remineralized to inorganic nutrients. This finding was supported by the isolation of Gelidibacter, Polaribacter, and Psychroflexus members (generally well represented in Antarctic sea-ice) which showed the ability to hydrolyze macromolecules, probably through the production of extracellular enzymes. A consistently pronounced abundance of the Gammaproteobacteria (67.8%) was also detected within the cultivable fraction. Altogether, the genera Psychromonas and Pseudoalteromonas accounted for 65.4% of total isolates and were ubiquitous, thus suggesting that they may play a key role within the analyzed bacterioplankton community. In particular, Pseudoalteromonas isolates possessed nitrate reductase and were able to hydrolyze substrates for protease, esterase, and β-galactosidase, thus indicating their involvement in the carbon and nitrogen cycling. Finally, the obtained results highlight the ability of the Actinobacteria to survive and proliferate in the Terra Nova Bay seawater as they generally showed a wide range of salt tolerance and appeared to be particularly competitive with strictly marine bacteria by better utilizing supplied carbon sources. PMID:21748267

  1. Transient changes in bacterioplankton communities induced by the submarine volcanic eruption of El Hierro (Canary Islands).

    PubMed

    Ferrera, Isabel; Arístegui, Javier; González, José M; Montero, María F; Fraile-Nuez, Eugenio; Gasol, Josep M

    2015-01-01

    The submarine volcanic eruption occurring near El Hierro (Canary Islands) in October 2011 provided a unique opportunity to determine the effects of such events on the microbial populations of the surrounding waters. The birth of a new underwater volcano produced a large plume of vent material detectable from space that led to abrupt changes in the physical-chemical properties of the water column. We combined flow cytometry and 454-pyrosequencing of 16S rRNA gene amplicons (V1-V3 regions for Bacteria and V3-V5 for Archaea) to monitor the area around the volcano through the eruptive and post-eruptive phases (November 2011 to April 2012). Flow cytometric analyses revealed higher abundance and relative activity (expressed as a percentage of high-nucleic acid content cells) of heterotrophic prokaryotes during the eruptive process as compared to post-eruptive stages. Changes observed in populations detectable by flow cytometry were more evident at depths closer to the volcano (~70-200 m), coinciding also with oxygen depletion. Alpha-diversity analyses revealed that species richness (Chao1 index) decreased during the eruptive phase; however, no dramatic changes in community composition were observed. The most abundant taxa during the eruptive phase were similar to those in the post-eruptive stages and to those typically prevalent in oceanic bacterioplankton communities (i.e. the alphaproteobacterial SAR11 group, the Flavobacteriia class of the Bacteroidetes and certain groups of Gammaproteobacteria). Yet, although at low abundance, we also detected the presence of taxa not typically found in bacterioplankton communities such as the Epsilonproteobacteria and members of the candidate division ZB3, particularly during the eruptive stage. These groups are often associated with deep-sea hydrothermal vents or sulfur-rich springs. Both cytometric and sequence analyses showed that once the eruption ceased, evidences of the volcano-induced changes were no longer observed. PMID

  2. Respiratory Succession and Community Succession of Bacterioplankton in Seasonally Anoxic Estuarine Waters▿

    PubMed Central

    Crump, Byron C.; Peranteau, Cherie; Beckingham, Barbara; Cornwell, Jeffrey C.

    2007-01-01

    Anoxia occurs in bottom waters of stratified estuaries when respiratory consumption of oxygen, primarily by bacteria, outpaces atmospheric and photosynthetic reoxygenation. Once water becomes anoxic, bacterioplankton must change their metabolism to some form of anaerobic respiration. Analysis of redox chemistry in water samples spanning the oxycline of Chesapeake Bay during the summer of 2004 suggested that there was a succession of respiratory metabolism following the loss of oxygen. Bacterial community doubling time, calculated from bacterial abundance (direct counts) and production (anaerobic leucine incorporation), ranged from 0.36 to 0.75 day and was always much shorter than estimates of the time that the bottom water was anoxic (18 to 44 days), indicating that there was adequate time for bacterial community composition to shift in response to changing redox conditions. However, community composition (as determined by PCR-denaturing gradient gel electrophoresis analysis of 16S rRNA genes) in anoxic waters was very similar to that in surface waters in June when nitrate respiration was apparent in the water column and only partially shifted away from the composition of the surface community after nitrate was depleted. Anoxic water communities did not change dramatically until August, when sulfate respiration appeared to dominate. Surface water populations that remained dominant in anoxic waters were Synechococcus sp., Gammaproteobacteria in the SAR86 clade, and Alphaproteobacteria relatives of Pelagibacter ubique, including a putative estuarine-specific Pelagibacter cluster. Populations that developed in anoxic water were most similar (<92% similarity) to uncultivated Firmicutes, uncultivated Bacteroidetes, Gammaproteobacteria in the genus Thioalcalovibrio, and the uncultivated SAR406 cluster. These results indicate that typical estuarine bacterioplankton switch to anaerobic metabolism under anoxic conditions but are ultimately replaced by different organisms

  3. Transient Changes in Bacterioplankton Communities Induced by the Submarine Volcanic Eruption of El Hierro (Canary Islands)

    PubMed Central

    Ferrera, Isabel; Arístegui, Javier; González, José M.; Montero, María F.; Fraile-Nuez, Eugenio; Gasol, Josep M.

    2015-01-01

    The submarine volcanic eruption occurring near El Hierro (Canary Islands) in October 2011 provided a unique opportunity to determine the effects of such events on the microbial populations of the surrounding waters. The birth of a new underwater volcano produced a large plume of vent material detectable from space that led to abrupt changes in the physical-chemical properties of the water column. We combined flow cytometry and 454-pyrosequencing of 16S rRNA gene amplicons (V1–V3 regions for Bacteria and V3–V5 for Archaea) to monitor the area around the volcano through the eruptive and post-eruptive phases (November 2011 to April 2012). Flow cytometric analyses revealed higher abundance and relative activity (expressed as a percentage of high-nucleic acid content cells) of heterotrophic prokaryotes during the eruptive process as compared to post-eruptive stages. Changes observed in populations detectable by flow cytometry were more evident at depths closer to the volcano (~70–200 m), coinciding also with oxygen depletion. Alpha-diversity analyses revealed that species richness (Chao1 index) decreased during the eruptive phase; however, no dramatic changes in community composition were observed. The most abundant taxa during the eruptive phase were similar to those in the post-eruptive stages and to those typically prevalent in oceanic bacterioplankton communities (i.e. the alphaproteobacterial SAR11 group, the Flavobacteriia class of the Bacteroidetes and certain groups of Gammaproteobacteria). Yet, although at low abundance, we also detected the presence of taxa not typically found in bacterioplankton communities such as the Epsilonproteobacteria and members of the candidate division ZB3, particularly during the eruptive stage. These groups are often associated with deep-sea hydrothermal vents or sulfur-rich springs. Both cytometric and sequence analyses showed that once the eruption ceased, evidences of the volcano-induced changes were no longer observed

  4. Quantification of Carbon and Phosphorus Co-Limitation in Bacterioplankton: New Insights on an Old Topic

    PubMed Central

    Dorado-García, Irene; Medina-Sánchez, Juan Manuel; Herrera, Guillermo; Cabrerizo, Marco J.; Carrillo, Presentación

    2014-01-01

    Because the nature of the main resource that limits bacterioplankton (e.g. organic carbon [C] or phosphorus [P]) has biogeochemical implications concerning organic C accumulation in freshwater ecosystems, empirical knowledge is needed concerning how bacteria respond to these two resources, available alone or together. We performed field experiments of resource manipulation (2×2 factorial design, with the addition of C, P, or both combined) in two Mediterranean freshwater ecosystems with contrasting trophic states (oligotrophy vs. eutrophy) and trophic natures (autotrophy vs. heterotrophy, measured as gross primary production:respiration ratio). Overall, the two resources synergistically co-limited bacterioplankton, i.e. the magnitude of the response of bacterial production and abundance to the two resources combined was higher than the additive response in both ecosystems. However, bacteria also responded positively to single P and C additions in the eutrophic ecosystem, but not to single C in the oligotrophic one, consistent with the value of the ratio between bacterial C demand and algal C supply. Accordingly, the trophic nature rather than the trophic state of the ecosystems proves to be a key feature determining the expected types of resource co-limitation of bacteria, as summarized in a proposed theoretical framework. The actual types of co-limitation shifted over time and partially deviated (a lesser degree of synergism) from the theoretical expectations, particularly in the eutrophic ecosystem. These deviations may be explained by extrinsic ecological forces to physiological limitations of bacteria, such as predation, whose role in our experiments is supported by the relationship between the dynamics of bacteria and bacterivores tested by SEMs (structural equation models). Our study, in line with the increasingly recognized role of freshwater ecosystems in the global C cycle, suggests that further attention should be focussed on the biotic interactions that

  5. Effect of protistan grazing on the frequency of dividing cells in bacterioplankton assemblages.

    PubMed

    Sherr, B F; Sherr, E B; McDaniel, J

    1992-08-01

    Grazing by phagotrophic flagellates and ciliates is a major source of mortality for bacterioplankton in both marine and freshwater systems. Recent studies have demonstrated a positive relationship between clearance rate and prey size for bacterivorous protists. We tested the idea that, by selectively grazing the larger (more actively growing or dividing) cells in a bacterial assemblage, protists control bacterial standing stock abundances by directly cropping bacterial production. Samples of estuarine water were passed through 0.8-mum-pore-size filters (bacteria only) or 20-mum-mesh screens (bacteria and bacterivorous protists) and placed in dialysis tubing suspended in 7 liters of unfiltered water. Changes in total bacterial biovolume per milliliter (bacterial biomass), frequency of dividing cells (FDC), and average per cell biovolume were followed over a period of 24 h. In three experiments, the FDC increased more rapidly and attained higher values in water passed through 0.8-mum-pore-size filters (average, 5.1 to 8.9%; maximum, 15.5%) compared with FDC values in water passed through 20-mum-mesh screens (average, 2.7 to 5.3%; maximum, 6.7%). Increases in bacterial biomass per milliliter lagged behind increases in FDC by about 4 to 6 h. Grazed bacterial assemblages were characterized by lower total biomasses and smaller average cell sizes compared with those of cells in nongrazed assemblages. We conclude that bacterivorous protists control bacterial standing stock abundances partly by preferentially removing dividing cells. Selective grazing of the more actively growing cells may also explain, in part, the ability of slow-growing cells to persist in bacterioplankton assemblages.

  6. Quantification of carbon and phosphorus co-limitation in bacterioplankton: new insights on an old topic.

    PubMed

    Dorado-García, Irene; Medina-Sánchez, Juan Manuel; Herrera, Guillermo; Cabrerizo, Marco J; Carrillo, Presentación

    2014-01-01

    Because the nature of the main resource that limits bacterioplankton (e.g. organic carbon [C] or phosphorus [P]) has biogeochemical implications concerning organic C accumulation in freshwater ecosystems, empirical knowledge is needed concerning how bacteria respond to these two resources, available alone or together. We performed field experiments of resource manipulation (2×2 factorial design, with the addition of C, P, or both combined) in two Mediterranean freshwater ecosystems with contrasting trophic states (oligotrophy vs. eutrophy) and trophic natures (autotrophy vs. heterotrophy, measured as gross primary production:respiration ratio). Overall, the two resources synergistically co-limited bacterioplankton, i.e. the magnitude of the response of bacterial production and abundance to the two resources combined was higher than the additive response in both ecosystems. However, bacteria also responded positively to single P and C additions in the eutrophic ecosystem, but not to single C in the oligotrophic one, consistent with the value of the ratio between bacterial C demand and algal C supply. Accordingly, the trophic nature rather than the trophic state of the ecosystems proves to be a key feature determining the expected types of resource co-limitation of bacteria, as summarized in a proposed theoretical framework. The actual types of co-limitation shifted over time and partially deviated (a lesser degree of synergism) from the theoretical expectations, particularly in the eutrophic ecosystem. These deviations may be explained by extrinsic ecological forces to physiological limitations of bacteria, such as predation, whose role in our experiments is supported by the relationship between the dynamics of bacteria and bacterivores tested by SEMs (structural equation models). Our study, in line with the increasingly recognized role of freshwater ecosystems in the global C cycle, suggests that further attention should be focussed on the biotic interactions that

  7. An Independent Genome Duplication Inferred from Hox Paralogs in the American Paddlefish—A Representative Basal Ray-Finned Fish and Important Comparative Reference

    PubMed Central

    Crow, Karen D.; Smith, Christopher D.; Cheng, Jan-Fang; Wagner, Günter P.; Amemiya, Chris T.

    2012-01-01

    Vertebrates have experienced two rounds of whole-genome duplication (WGD) in the stem lineages of deep nodes within the group and a subsequent duplication event in the stem lineage of the teleosts—a highly diverse group of ray-finned fishes. Here, we present the first full Hox gene sequences for any member of the Acipenseriformes, the American paddlefish, and confirm that an independent WGD occurred in the paddlefish lineage, approximately 42 Ma based on sequences spanning the entire HoxA cluster and eight genes on the HoxD gene cluster. These clusters comprise different HOX loci and maintain conserved synteny relative to bichir, zebrafish, stickleback, and pufferfish, as well as human, mouse, and chick. We also provide a gene genealogy for the duplicated fzd8 gene in paddlefish and present evidence for the first Hox14 gene in any ray-finned fish. Taken together, these data demonstrate that the American paddlefish has an independently duplicated genome. Substitution patterns of the “alpha” paralogs on both the HoxA and HoxD gene clusters suggest transcriptional inactivation consistent with functional diploidization. Further, there are similarities in the pattern of sequence divergence among duplicated Hox genes in paddlefish and teleost lineages, even though they occurred independently approximately 200 Myr apart. We highlight implications on comparative analyses in the study of the “fin-limb transition” as well as gene and genome duplication in bony fishes, which includes all ray-finned fishes as well as the lobe-finned fishes and tetrapod vertebrates. PMID:22851613

  8. An independent genome duplication inferred from Hox paralogs in the American paddlefish--a representative basal ray-finned fish and important comparative reference.

    PubMed

    Crow, Karen D; Smith, Christopher D; Cheng, Jan-Fang; Wagner, Günter P; Amemiya, Chris T

    2012-01-01

    Vertebrates have experienced two rounds of whole-genome duplication (WGD) in the stem lineages of deep nodes within the group and a subsequent duplication event in the stem lineage of the teleosts-a highly diverse group of ray-finned fishes. Here, we present the first full Hox gene sequences for any member of the Acipenseriformes, the American paddlefish, and confirm that an independent WGD occurred in the paddlefish lineage, approximately 42 Ma based on sequences spanning the entire HoxA cluster and eight genes on the HoxD gene cluster. These clusters comprise different HOX loci and maintain conserved synteny relative to bichir, zebrafish, stickleback, and pufferfish, as well as human, mouse, and chick. We also provide a gene genealogy for the duplicated fzd8 gene in paddlefish and present evidence for the first Hox14 gene in any ray-finned fish. Taken together, these data demonstrate that the American paddlefish has an independently duplicated genome. Substitution patterns of the "alpha" paralogs on both the HoxA and HoxD gene clusters suggest transcriptional inactivation consistent with functional diploidization. Further, there are similarities in the pattern of sequence divergence among duplicated Hox genes in paddlefish and teleost lineages, even though they occurred independently approximately 200 Myr apart. We highlight implications on comparative analyses in the study of the "fin-limb transition" as well as gene and genome duplication in bony fishes, which includes all ray-finned fishes as well as the lobe-finned fishes and tetrapod vertebrates.

  9. Differences in inferred genome-wide signals of positive selection during the evolution of Trypanosoma cruzi and Leishmania spp. lineages: A result of disparities in host and tissue infection ranges?

    PubMed

    Flores-López, Carlos A; Machado, Carlos A

    2015-07-01

    Trypanosoma cruzi and Leishmania spp. are kinetoplastids responsible for Chagas disease and Leishmaniasis, neglected tropical diseases for which there are no effective methods of control. These two human pathogens differ widely in the range of mammal species they can infect, their cell/tissue tropism and cell invasion mechanisms. Whether such major biological differences have had any impact on genome-wide patterns of genetic diversification in both pathogens has not been explored. The recent genome sequencing projects of medically important species of Leishmania and T. cruzi lineages provide unique resources for performing comparative evolutionary analyses to address that question. We show that inferred genome-wide signals of positive selection are higher in T. cruzi proteins than in Leishmania spp. proteins. We report significant differences in the fraction of protein-coding genes showing evidence of positive selection in the two groups of parasites, and also report that the intensity of positive selection and the proportion of sites under selection are higher in T. cruzi than in Leishmania spp. The pattern is unlikely to be the result of confounding factors like differences in GC content, average gene length or differences in reproductive mode between the two taxa. We propose that the greater versatility of T. cruzi in its host range, cell tropism and cell invasion mechanisms may explain the observed differences between the two groups of parasites. Genes showing evidence of positive selection within each taxonomic group may be under diversifying selection to evade the immune system and thus, depending on their functions, could represent viable candidates for the development of drugs or vaccines for these neglected human diseases.

  10. The phylogenetic relationships of insectivores with special reference to the lesser hedgehog tenrec as inferred from the complete sequence of their mitochondrial genome.

    PubMed

    Nikaido, Masato; Cao, Ying; Okada, Norihiro; Hasegawa, Masami

    2003-02-01

    The complete mitochondrial genome of a lesser hedgehog tenrec Echinops telfairi was determined in this study. It is an endemic African insectivore that is found specifically in Madagascar. The tenrec's back is covered with hedgehog-like spines. Unlike other spiny mammals, such as spiny mice, spiny rats, spiny dormice and porcupines, lesser hedgehog tenrecs look amazingly like true hedgehogs (Erinaceidae). However, they are distinguished morphologically from hedgehogs by the absence of a jugal bone. We determined the complete sequence of the mitochondrial genome of a lesser hedgehog tenrec and analyzed the results phylogenetically to determine the relationships between the tenrec and other insectivores (moles, shrews and hedgehogs), as well as the relationships between the tenrec and endemic African mammals, classified as Afrotheria, that have recently been shown by molecular analysis to be close relatives of the tenrec. Our data confirmed the afrotherian status of the tenrec, and no direct relation was recovered between the tenrec and the hedgehog. Comparing our data with those of others, we found that within-species variations in the mitochondrial DNA of lesser hedgehog tenrecs appear to be the largest recognized to date among mammals, apart from orangutans, which might be interesting from the view point of evolutionary history of tenrecs on Madagascar.

  11. Comparison of inferred relatedness based on multilocus variable-number tandem-repeat analysis and whole genome sequencing of Vibrio cholerae O1.

    PubMed

    Rashid, Mahamud-Ur; Almeida, Mathieu; Azman, Andrew S; Lindsay, Brianna R; Sack, David A; Colwell, Rita R; Huq, Anwar; Morris, J Glenn; Alam, Munirul; Stine, O Colin

    2016-06-01

    Vibrio cholerae causes cholera, a severe diarrheal disease. Understanding the local genetic diversity and transmission of V. cholerae will improve our ability to control cholera. Vibrio cholerae isolates clustered in genetically related groups (clonal complexes, CC) by multilocus variable tandem-repeat analysis (MLVA) were compared by whole genome sequencing (WGS). Isolates in CC1 had been isolated from two geographical locations. Isolates in a second genetically distinct group, CC2, were isolated only at one location. Using WGS, CC1 isolates from both locations revealed, on average, 43.8 nucleotide differences, while those strains comprising CC2 averaged 19.7 differences. Strains from both MLVA-CCs had an average difference of 106.6. Thus, isolates comprising CC1 were more closely related (P < 10(-6)) to each other than to isolates in CC2. Within a MLVA-CC, after removing all paralogs, alternative alleles were found in all possible combinations on separate chromosomes indicative of recombination within the core genome. Including recombination did not affect the distinctiveness of the MLVA-CCs when measured by WGS. We found that WGS generally reflected the same genetic relatedness of isolates as MLVA, indicating that isolates from the same MLVA-CC shared a more recent common ancestor than isolates from the same location that clustered in a distinct MLVA-CC. PMID:27190166

  12. Damage to DNA in bacterioplankton: a model of damage by ultraviolet radiation and its repair as influenced by vertical mixing.

    PubMed

    Huot, Y; Jeffrey, W H; Davis, R F; Cullen, J J

    2000-07-01

    A model of UV-induced DNA damage in oceanic bacterioplankton was developed and tested against previously published and novel measurements of cyclobutane pyrimidine dimers (CPD) in surface layers of the ocean. The model describes the effects of solar irradiance, wind-forced mixing of bacterioplankton and optical properties of the water on net DNA damage in the water column. The biological part includes the induction of CPD by UV radiation and repair of this damage through photoreactivation and excision. The modeled damage is compared with measured variability of CPD in the ocean: diel variation in natural bacterioplankton communities at the surface and in vertical profiles under different wind conditions (net damage as influenced by repair and mixing); in situ incubation of natural assemblages of bacterioplankton (damage and repair, no mixing); and in situ incubation of DNA solutions (no repair, no mixing). The model predictions are generally consistent with the measurements, showing similar patterns with depth, time and wind speed. A sensitivity analysis assesses the effect on net DNA damage of varying ozone thickness, colored dissolved organic matter concentration, chlorophyll concentration, wind speed and mixed layer depth. Ozone thickness and mixed layer depth are the most important factors affecting net DNA damage in the mixed layer. From the model, the total amplification factor (TAF; a relative measure of the increase of damage associated with a decrease in ozone thickness) for net DNA damage in the euphotic zone is 1.7, as compared with 2.1-2.2 for irradiance weighted for damage to DNA at the surface.

  13. Seasonal dynamics of bacterioplankton community structure in a eutrophic lake as determined by 5S rRNA analysis.

    PubMed

    Höfle, M G; Haas, H; Dominik, K

    1999-07-01

    Community structure of bacterioplankton was studied during the major growth season for phytoplankton (April to October) in the epilimnion of a temperate eutrophic lake (Lake Plusssee, northern Germany) by using comparative 5S rRNA analysis. Estimates of the relative abundances of single taxonomic groups were made on the basis of the amounts of single 5S rRNA bands obtained after high-resolution electrophoresis of RNA directly from the bacterioplankton. Full-sequence analysis of single environmental 5S rRNAs enabled the identification of single taxonomic groups of bacteria. Comparison of partial 5S rRNA sequences allowed the detection of changes of single taxa over time. Overall, the whole bacterioplankton community showed two to eight abundant (>4% of the total 5S rRNA) taxa. A distinctive seasonal succession was observed in the taxonomic structure of this pelagic community. A rather-stable community structure, with seven to eight different taxonomic units, was observed beginning in April during the spring phytoplankton bloom. A strong reduction in this diversity occurred at the beginning of the clear-water phase (early May), when only two to four abundant taxa were observed, with one taxon dominating (up to 72% of the total 5S rRNA). The community structure during summer stagnation (June and July) was characterized by frequent changes of different dominating taxa. During late summer, a dinoflagellate bloom (Ceratium hirudinella) occurred, with Comamonas acidovorans (beta-subclass of the class Proteobacteria) becoming the dominant bacterial species (average abundance of 43% of the total 5S rRNA). Finally, the seasonal dynamics of the community structure of bacterioplankton were compared with the abundances of other major groups of the aquatic food web, such as phyto- and zooplankton, revealing that strong grazing pressure by zooplankton can reduce microbial diversity substantially in pelagic environments.

  14. Response of bacterioplankton community structure to an artificial gradient of pCO2 in the Arctic Ocean

    NASA Astrophysics Data System (ADS)

    Zhang, R.; Xia, X.; Lau, S. C. K.; Motegi, C.; Weinbauer, M. G.; Jiao, N.

    2013-06-01

    In order to test the influences of ocean acidification on the ocean pelagic ecosystem, so far the largest CO2 manipulation mesocosm study (European Project on Ocean Acidification, EPOCA) was performed in Kings Bay (Kongsfjorden), Spitsbergen. During a 30 day incubation, bacterial diversity was investigated using DNA fingerprinting and clone library analysis of bacterioplankton samples. Terminal restriction fragment length polymorphism (T-RFLP) analysis of the PCR amplicons of the 16S rRNA genes revealed that general bacterial diversity, taxonomic richness and community structure were influenced by the variation of productivity during the time of incubation, but not the degree of ocean acidification. A BIOENV analysis suggested a complex control of bacterial community structure by various biological and chemical environmental parameters. The maximum apparent diversity of bacterioplankton (i.e., the number of T-RFs) in high and low pCO2 treatments differed significantly. A negative relationship between the relative abundance of Bacteroidetes and pCO2 levels was observed for samples at the end of the experiment by the combination of T-RFLP and clone library analysis. Our study suggests that ocean acidification affects the development of bacterial assemblages and potentially impacts the ecological function of the bacterioplankton in the marine ecosystem.

  15. Proteomic-based stable isotope probing reveals taxonomically Distinct Patterns in Amino Acid Assimilation by Coastal Marine Bacterioplankton

    DOE PAGES

    Bryson, Samuel; Li, Zhou; Pett-Ridge, Jennifer; Robert L. Hettich; Mayali, Xavier; Pan, Chongle; Mueller, Ryan S.

    2016-04-26

    Heterotrophic marine bacterioplankton are a critical component of the carbon cycle, processing nearly a quarter of annual global primary production, yet defining how substrate utilization preferences and resource partitioning structure these microbial communities remains a challenge. In this study, we utilized proteomics-based stable isotope probing (proteomic SIP) to characterize the assimilation of amino acids by coastal marine bacterioplankton populations. We incubated microcosms of seawater collected from Newport, OR and Monterey Bay, CA with 1 M 13C-amino acids for 15 and 32 hours. Subsequent analysis of 13C incorporation into protein biomass quantified the frequency and extent of isotope enrichment for identifiedmore » proteins. Using these metrics we tested whether amino acid assimilation patterns were different for specific bacterioplankton populations. Proteins associated with Rhodobacterales and Alteromonadales tended to have a significantly high number of tandem mass spectra from 13C-enriched peptides, while Flavobacteriales and SAR11 proteins generally had significantly low numbers of 13C-enriched spectra. Rhodobacterales proteins associated with amino acid transport and metabolism had an increased frequency of 13C-enriched spectra at time-point 2, while Alteromonadales ribosomal proteins were 13C- enriched across time-points. Overall, proteomic SIP facilitated quantitative comparisons of dissolved free amino acids assimilation by specific taxa, both between sympatric populations and between protein functional groups within discrete populations, allowing an unprecedented examination of population-level metabolic responses to resource acquisition in complex microbial communities.« less

  16. Phylogenetic shifts of bacterioplankton community composition along the Pearl Estuary: the potential impact of hypoxia and nutrients

    PubMed Central

    Liu, Jiwen; Fu, Bingbing; Yang, Hongmei; Zhao, Meixun; He, Biyan; Zhang, Xiao-Hua

    2015-01-01

    The significance of salinity in shaping bacterial communities dwelling in estuarine areas has been well documented. However, the influences of other environmental factors such as dissolved oxygen and nutrients in determining distribution patterns of both individual taxa and bacterial communities inhabited local estuarine regions remain elusive. Here, bacterioplankton community structures of surface and bottom waters from eight sites along the Pearl Estuary were characterized with 16S rRNA gene pyrosequencing. The results showed significant differences of bacterioplankton community between freshwater and saltwater sites, and further between surface and bottom waters of saltwater sites. Synechococcus dominated the surface water of saltwater sites while Oceanospirillales, SAR11 and SAR406 were prevalent in the bottom water. Betaproteobacteria was abundant in freshwater sites, with no significant difference between water layers. Occurrence of phylogenetic shifts in taxa affiliated to the same clade was also detected. Dissolved oxygen explained most of the bacterial community variation in the redundancy analysis targeting only freshwater sites, whereas nutrients and salinity explained most of the variation across all samples in the Pearl Estuary. Methylophilales (mainly PE2 clade) was positively correlated to dissolved oxygen, whereas Rhodocyclales (mainly R.12up clade) was negatively correlated. Moreover, high nutrient inputs to the freshwater area of the Pearl Estuary have shifted the bacterial communities toward copiotrophic groups, such as Sphingomonadales. The present study demonstrated that the overall nutrients and freshwater hypoxia play important roles in determining bacterioplankton compositions and provided insights into the potential ecological roles of specific taxa in estuarine environments. PMID:25713564

  17. Modeling the impact of riverine DON removal by marine bacterioplankton on primary production in the Arctic Ocean

    NASA Astrophysics Data System (ADS)

    Le Fouest, V.; Manizza, M.; Tremblay, B.; Babin, M.

    2014-12-01

    The planktonic and biogeochemical dynamics of the Arctic shelves exhibit a strong variability in response to Arctic warming. In this study, in order to elucidate on the processes regulating the production of phytoplankton (PP) and bacterioplankton (BP) and their interactions, we employ a biogeochemical model coupled to a pan-Arctic ocean-sea ice model (MITgcm) to explicitly simulate and quantify the contribution of usable dissolved organic nitrogen (DON) drained by the major circum-Arctic rivers on PP and BP in a scenario of melting sea ice (1998-2011). Model simulations suggest that on average between 1998 and 2011, the removal of usable RDON by bacterioplankton is responsible of a ~26% increase of the annual BP for the whole Arctic Ocean. With respect to total PP, the model simulates an increase of ~8% on an annual basis and of ~18% in summer. Recycled ammonium is responsible for the PP increase. The recycling of RDON by bacterioplankton promotes higher BP and PP but there is no significant temporal trend in the BP : PP ratio within the ice-free shelves over the 1998-2011 period. This suggests no significant evolution in the balance between autotrophy and heterotrophy in the last decade with a constant annual flux of RDON into the coastal ocean although changes in RDON supply and further reduction in sea ice cover could potentially alter this delicate balance.

  18. Modelling the impact of riverine DON removal by marine bacterioplankton on primary production in the Arctic Ocean

    NASA Astrophysics Data System (ADS)

    Le Fouest, V.; Manizza, M.; Tremblay, B.; Babin, M.

    2015-06-01

    The planktonic and biogeochemical dynamics of the Arctic shelves exhibit a strong variability in response to Arctic warming. In this study, we employ a biogeochemical model coupled to a pan-Arctic ocean-sea ice model (MITgcm) to elucidate the processes regulating the primary production (PP) of phytoplankton, bacterioplankton (BP), and their interactions. The model explicitly simulates and quantifies the contribution of usable dissolved organic nitrogen (DON) drained by the major circum-Arctic rivers to PP and BP in a scenario of melting sea ice (1998-2011). Model simulations suggest that, on average between 1998 and 2011, the removal of usable riverine dissolved organic nitrogen (RDON) by bacterioplankton is responsible for a ~ 26% increase in the annual BP for the whole Arctic Ocean. With respect to total PP, the model simulates an increase of ~ 8% on an annual basis and of ~ 18% in summer. Recycled ammonium is responsible for the PP increase. The recycling of RDON by bacterioplankton promotes higher BP and PP, but there is no significant temporal trend in the BP : PP ratio within the ice-free shelves over the 1998-2011 period. This suggests no significant evolution in the balance between autotrophy and heterotrophy in the last decade, with a constant annual flux of RDON into the coastal ocean, although changes in RDON supply and further reduction in sea-ice cover could potentially alter this delicate balance.

  19. Bacterioplankton in antarctic ocean waters during late austral winter: abundance, frequency of dividing cells, and estimates of production.

    PubMed

    Hanson, R B; Shafer, D; Ryan, T; Pope, D H; Lowery, H K

    1983-05-01

    Bacterioplankton productivity in Antarctic waters of the eastern South Pacific Ocean and Drake Passage was estimated by direct counts and frequency of dividing cells (FDC). Total bacterioplankton assemblages were enumerated by epifluorescent microscopy. The experimentally determined relationship between in situ FDC and the potential instantaneous growth rate constant (mu) is best described by the regression equation ln mu = 0.081 FDC - 3.73. In the eastern South Pacific Ocean, bacterioplankton abundance (2 x 10 to 3.5 x 10 cells per ml) and FDC (11%) were highest at the Polar Front (Antarctic Convergence). North of the Subantarctic Front, abundance and FDC were between 1 x 10 to 2 x 10 cells per ml and 3 to 5%, respectively, and were vertically homogeneous to a depth of 600 m. In Drake Passage, abundance (10 x 10 cells per ml) and FDC (16%) were highest in waters south of the Polar Front and near the sea ice. Subantarctic waters in Drake Passage contained 4 x 10 cells per ml with 4 to 5% FDC. Instantaneous growth rate constants ranged between 0.029 and 0.088 h. Using estimates of potential mu and measured standing stocks, we estimated productivity to range from 0.62 mug of C per liter . day in the eastern South Pacific Ocean to 17.1 mug of C per liter . day in the Drake Passage near the sea ice. PMID:16346297

  20. The bioinvasion of Guam: inferring geographic origin, pace, pattern and process of an invasive lizard (Carlia) in the Pacific using multi-locus genomic data

    USGS Publications Warehouse

    Austin, C.C.; Rittmeyer, E.N.; Oliver, L.A.; Andermann, J.O.; Zug, G.R.; Rodda, G.H.; Jackson, N.D.

    2011-01-01

    Invasive species often have dramatic negative effects that lead to the deterioration and loss of biodiversity frequently coupled with the burden of expensive biocontrol programs and subversion of socioeconomic stability. The fauna and flora of oceanic islands are particularly susceptible to invasive species and the increase of global movements of humans and their products since WW II has caused numerous anthropogenic translocations and increased the ills of human-mediated invasions. We use a multi-locus genomic dataset to identify geographic origin, pace, pattern and historical process of an invasive scincid lizard (Carlia) that has been inadvertently introduced to Guam, the Northern Marianas, and Palau. This lizard is of major importance as its introduction is thought to have assisted in the establishment of the invasive brown treesnake (Boiga irregularis) on Guam by providing a food resource. Our findings demonstrate multiple waves of introductions that appear to be concordant with movements of Allied and Imperial Japanese forces in the Pacific during World War II.

  1. Genetic Architecture of Flowering Time in Maize As Inferred From Quantitative Trait Loci Meta-analysis and Synteny Conservation With the Rice Genome

    PubMed Central

    Chardon, Fabien; Virlon, Bérangère; Moreau, Laurence; Falque, Matthieu; Joets, Johann; Decousset, Laurent; Murigneux, Alain; Charcosset, Alain

    2004-01-01

    Genetic architecture of flowering time in maize was addressed by synthesizing a total of 313 quantitative trait loci (QTL) available for this trait. These were analyzed first with an overview statistic that highlighted regions of key importance and then with a meta-analysis method that yielded a synthetic genetic model with 62 consensus QTL. Six of these displayed a major effect. Meta-analysis led in this case to a twofold increase in the precision in QTL position estimation, when compared to the most precise initial QTL position within the corresponding region. The 62 consensus QTL were compared first to the positions of the few flowering-time candidate genes that have been mapped in maize. We then projected rice candidate genes onto the maize genome using a synteny conservation approach based on comparative mapping between the maize genetic map and japonica rice physical map. This yielded 19 associations between maize QTL and genes involved in flowering time in rice and in Arabidopsis. Results suggest that the combination of meta-analysis within a species of interest and synteny-based projections from a related model plant can be an efficient strategy for identifying new candidate genes for trait variation. PMID:15611184

  2. Genetic architecture of flowering time in maize as inferred from quantitative trait loci meta-analysis and synteny conservation with the rice genome.

    PubMed

    Chardon, Fabien; Virlon, Bérangère; Moreau, Laurence; Falque, Matthieu; Joets, Johann; Decousset, Laurent; Murigneux, Alain; Charcosset, Alain

    2004-12-01

    Genetic architecture of flowering time in maize was addressed by synthesizing a total of 313 quantitative trait loci (QTL) available for this trait. These were analyzed first with an overview statistic that highlighted regions of key importance and then with a meta-analysis method that yielded a synthetic genetic model with 62 consensus QTL. Six of these displayed a major effect. Meta-analysis led in this case to a twofold increase in the precision in QTL position estimation, when compared to the most precise initial QTL position within the corresponding region. The 62 consensus QTL were compared first to the positions of the few flowering-time candidate genes that have been mapped in maize. We then projected rice candidate genes onto the maize genome using a synteny conservation approach based on comparative mapping between the maize genetic map and japonica rice physical map. This yielded 19 associations between maize QTL and genes involved in flowering time in rice and in Arabidopsis. Results suggest that the combination of meta-analysis within a species of interest and synteny-based projections from a related model plant can be an efficient strategy for identifying new candidate genes for trait variation.

  3. Model-based verification of hypotheses on the origin of modern Japanese revisited by Bayesian inference based on genome-wide SNP data.

    PubMed

    Nakagome, Shigeki; Sato, Takehiro; Ishida, Hajime; Hanihara, Tsunehiko; Yamaguchi, Tetsutaro; Kimura, Ryosuke; Mano, Shuhei; Oota, Hiroki

    2015-06-01

    Various hypotheses for the peopling of the Japanese archipelago have been proposed, which can be classified into three models: transformation, replacement, and hybridization. In recent years, one of the hybridization models ("dual-structure model") has been widely accepted. According to this model, Neolithic hunter-gatherers known as Jomon, who are assumed to have originated in southeast Asia and lived in the Japanese archipelago greater than 10,000 years ago, admixed with an agricultural people known as Yayoi, whom were migrants from the East Asian continent 2,000-3,000 years ago. Meanwhile, some anthropologists propose that rather, morphological differences between the Jomon and Yayoi people can be explained by microevolution following the lifestyle change. To resolve this controversy, we compared three demographic models by approximate Bayesian computation using genome-wide single nucleotide polymorphism (gwSNP) data from the Ainu people who are thought to be direct descendants of indigenous Jomon. If we assume Chinese people sampled in Beijing from HapMap have the same ancestry as Yayoi, then the hybridization model is predicted to be between 29 and 63 times more likely than the replacement and transformation models, respectively. Furthermore, our data provide strong support for a model in which the Jomon lineages had population structure diversified in local areas before the admixture event. Initial divergence between the Jomon and Yayoi ancestries was dated to late Pleistocene, followed by the divergence of Jomon lineages at early Holocene. These results suggest gwSNP data provides a detailed picture of the complex hybridization model for Japanese population history.

  4. Genome sequence of Silicibacter pomeroyi reveals adaptations to the marine environment.

    PubMed

    Moran, Mary Ann; Buchan, Alison; González, José M; Heidelberg, John F; Whitman, William B; Kiene, Ronald P; Henriksen, James R; King, Gary M; Belas, Robert; Fuqua, Clay; Brinkac, Lauren; Lewis, Matt; Johri, Shivani; Weaver, Bruce; Pai, Grace; Eisen, Jonathan A; Rahe, Elisha; Sheldon, Wade M; Ye, Wenying; Miller, Todd R; Carlton, Jane; Rasko, David A; Paulsen, Ian T; Ren, Qinghu; Daugherty, Sean C; Deboy, Robert T; Dodson, Robert J; Durkin, A Scott; Madupu, Ramana; Nelson, William C; Sullivan, Steven A; Rosovitz, M J; Haft, Daniel H; Selengut, Jeremy; Ward, Naomi

    2004-12-16

    Since the recognition of prokaryotes as essential components of the oceanic food web, bacterioplankton have been acknowledged as catalysts of most major biogeochemical processes in the sea. Studying heterotrophic bacterioplankton has been challenging, however, as most major clades have never been cultured or have only been grown to low densities in sea water. Here we describe the genome sequence of Silicibacter pomeroyi, a member of the marine Roseobacter clade (Fig. 1), the relatives of which comprise approximately 10-20% of coastal and oceanic mixed-layer bacterioplankton. This first genome sequence from any major heterotrophic clade consists of a chromosome (4,109,442 base pairs) and megaplasmid (491,611 base pairs). Genome analysis indicates that this organism relies upon a lithoheterotrophic strategy that uses inorganic compounds (carbon monoxide and sulphide) to supplement heterotrophy. Silicibacter pomeroyi also has genes advantageous for associations with plankton and suspended particles, including genes for uptake of algal-derived compounds, use of metabolites from reducing microzones, rapid growth and cell-density-dependent regulation. This bacterium has a physiology distinct from that of marine oligotrophs, adding a new strategy to the recognized repertoire for coping with a nutrient-poor ocean.

  5. Environmental rather than spatial factors structure bacterioplankton communities in shallow lakes along a > 6000 km latitudinal gradient in South America.

    PubMed

    Souffreau, Caroline; Van der Gucht, Katleen; van Gremberghe, Ineke; Kosten, Sarian; Lacerot, Gissell; Lobão, Lúcia Meirelles; de Moraes Huszar, Vera Lúcia; Roland, Fabio; Jeppesen, Erik; Vyverman, Wim; De Meester, Luc

    2015-07-01

    Metacommunity studies on lake bacterioplankton indicate the importance of environmental factors in structuring communities. Yet most of these studies cover relatively small spatial scales. We assessed the relative importance of environmental and spatial factors in shaping bacterioplankton communities across a > 6000 km latitudinal range, studying 48 shallow lowland lakes in the tropical, tropicali (isothermal subzone of the tropics) and tundra climate regions of South America using denaturing gradient gel electrophoresis. Bacterioplankton community composition (BCC) differed significantly across regions. Although a large fraction of the variation in BCC remained unexplained, the results supported a consistent significant contribution of local environmental variables and to a lesser extent spatial variables, irrespective of spatial scale. Upon correction for space, mainly biotic environmental factors significantly explained the variation in BCC. The abundance of pelagic cladocerans remained particularly significant, suggesting grazer effects on bacterioplankton communities in the studied lakes. These results confirm that bacterioplankton communities are predominantly structured by environmental factors, even over a large-scale latitudinal gradient (6026 km), and stress the importance of including biotic variables in studies that aim to understand patterns in BCC.

  6. Submerged macrophytes shape the abundance and diversity of bacterial denitrifiers in bacterioplankton and epiphyton in the Shallow Fresh Lake Taihu, China.

    PubMed

    Fan, Zhou; Han, Rui-Ming; Ma, Jie; Wang, Guo-Xiang

    2016-07-01

    nirK and nirS genes are important functional genes involved in the denitrification pathway. Recent studies about these two denitrifying genes are focusing on sediment and wastewater microbe. In this study, we conducted a comparative analysis of the abundance and diversity of denitrifiers in the epiphyton of submerged macrophytes Potamogeton malaianus and Ceratophyllum demersum as well as in bacterioplankton in the shallow fresh lake Taihu, China. Results showed that nirK and nirS genes had significant different niches in epiphyton and bacterioplankton. Bacterioplankton showed greater abundance of nirK gene in terms of copy numbers and lower abundance of nirS gene. Significant difference in the abundance of nirK and nirS genes also existed between the epiphyton from different submerged macrophytes. Similar community diversity yet different community abundance was observed between epiphytic bacteria and bacterioplankton. No apparent seasonal variation was found either in epiphytic bacteria or bacterioplankton; however, environmental parameters seemed to have direct relevancy with nirK and nirS genes. Our study suggested that submerged macrophytes have greater influence than seasonal parameters in shaping the presence and abundance of bacterial denitrifiers. Further investigation needs to focus on the potential contact and relative contribution between denitrifiers and environmental factors. PMID:27048324

  7. Inhibitory Effect of Solar Radiation on Thymidine and Leucine Incorporation by Freshwater and Marine Bacterioplankton

    PubMed Central

    Sommaruga, R.; Obernosterer, I.; Herndl, G. J.; Psenner, R.

    1997-01-01

    We studied the effect of solar radiation on the incorporation of [(sup3)H]thymidine ([(sup3)H]TdR) and [(sup14)C]leucine ([(sup14)C]Leu) by bacterioplankton in a high mountain lake and the northern Adriatic Sea. After short-term exposure (3 to 4 h) of natural bacterial assemblages to sunlight just beneath the surface, the rates of incorporation of [(sup3)H]TdR and [(sup14)C]Leu were reduced at both sites by up to (symbl)70% compared to those for the dark control. Within the solar UV radiation (290 to 400 nm), the inhibition was caused exclusively by UV-A radiation (320 to 400 nm). However, photosynthetically active radiation (PAR) (400 to 700 nm) contributed almost equally to this effect. Experiments with samples from the high mountain lake showed that at a depth of 2.5 m, the inhibition was caused almost exclusively by UV-A radiation. At a depth of 8.5 m, where chlorophyll a concentrations were higher than those in the upper water column, the rates of incorporation of [(sup3)H]TdR were higher in those samples exposed to full sunlight or to UV-A plus PAR than in the dark control. In laboratory experiments with artificial UV light, the incorporation of [(sup3)H]TdR and [(sup14)C]Leu by mixed bacterial lake cultures was also inhibited mainly by UV-A. In contrast, in the presence of the green alga Chlamydomonas geitleri at a chlorophyll a concentration of 2.5 (mu)g liter(sup-1), inhibition by UV radiation was significantly reduced. These results suggest that there may be complex interactions among UV radiation, heterotrophic bacteria, and phytoplankton and their release of extracellular organic carbon. Our findings indicate that the wavelengths which caused the strongest inhibition of TdR and Leu incorporation by bacterioplankton in the water column were in the UV-A range. However, it may be premature to extrapolate this effect to estimates of bacterial production before more precise information on how solar radiation affects the transport of TdR and Leu into the cell

  8. Deep Learning for Population Genetic Inference

    PubMed Central

    Sheehan, Sara; Song, Yun S.

    2016-01-01

    Given genomic variation data from multiple individuals, computing the likelihood of complex population genetic models is often infeasible. To circumvent this problem, we introduce a novel likelihood-free inference framework by applying deep learning, a powerful modern technique in machine learning. Deep learning makes use of multilayer neural networks to learn a feature-based function from the input (e.g., hundreds of correlated summary statistics of data) to the output (e.g., population genetic parameters of interest). We demonstrate that deep learning can be effectively employed for population genetic inference and learning informative features of data. As a concrete application, we focus on the challenging problem of jointly inferring natural selection and demography (in the form of a population size change history). Our method is able to separate the global nature of demography from the local nature of selection, without sequential steps for these two factors. Studying demography and selection jointly is motivated by Drosophila, where pervasive selection confounds demographic analysis. We apply our method to 197 African Drosophila melanogaster genomes from Zambia to infer both their overall demography, and regions of their genome under selection. We find many regions of the genome that have experienced hard sweeps, and fewer under selection on standing variation (soft sweep) or balancing selection. Interestingly, we find that soft sweeps and balancing selection occur more frequently closer to the centromere of each chromosome. In addition, our demographic inference suggests that previously estimated bottlenecks for African Drosophila melanogaster are too extreme. PMID:27018908

  9. Deep Learning for Population Genetic Inference.

    PubMed

    Sheehan, Sara; Song, Yun S

    2016-03-01

    Given genomic variation data from multiple individuals, computing the likelihood of complex population genetic models is often infeasible. To circumvent this problem, we introduce a novel likelihood-free inference framework by applying deep learning, a powerful modern technique in machine learning. Deep learning makes use of multilayer neural networks to learn a feature-based function from the input (e.g., hundreds of correlated summary statistics of data) to the output (e.g., population genetic parameters of interest). We demonstrate that deep learning can be effectively employed for population genetic inference and learning informative features of data. As a concrete application, we focus on the challenging problem of jointly inferring natural selection and demography (in the form of a population size change history). Our method is able to separate the global nature of demography from the local nature of selection, without sequential steps for these two factors. Studying demography and selection jointly is motivated by Drosophila, where pervasive selection confounds demographic analysis. We apply our method to 197 African Drosophila melanogaster genomes from Zambia to infer both their overall demography, and regions of their genome under selection. We find many regions of the genome that have experienced hard sweeps, and fewer under selection on standing variation (soft sweep) or balancing selection. Interestingly, we find that soft sweeps and balancing selection occur more frequently closer to the centromere of each chromosome. In addition, our demographic inference suggests that previously estimated bottlenecks for African Drosophila melanogaster are too extreme. PMID:27018908

  10. Deep Learning for Population Genetic Inference.

    PubMed

    Sheehan, Sara; Song, Yun S

    2016-03-01

    Given genomic variation data from multiple individuals, computing the likelihood of complex population genetic models is often infeasible. To circumvent this problem, we introduce a novel likelihood-free inference framework by applying deep learning, a powerful modern technique in machine learning. Deep learning makes use of multilayer neural networks to learn a feature-based function from the input (e.g., hundreds of correlated summary statistics of data) to the output (e.g., population genetic parameters of interest). We demonstrate that deep learning can be effectively employed for population genetic inference and learning informative features of data. As a concrete application, we focus on the challenging problem of jointly inferring natural selection and demography (in the form of a population size change history). Our method is able to separate the global nature of demography from the local nature of selection, without sequential steps for these two factors. Studying demography and selection jointly is motivated by Drosophila, where pervasive selection confounds demographic analysis. We apply our method to 197 African Drosophila melanogaster genomes from Zambia to infer both their overall demography, and regions of their genome under selection. We find many regions of the genome that have experienced hard sweeps, and fewer under selection on standing variation (soft sweep) or balancing selection. Interestingly, we find that soft sweeps and balancing selection occur more frequently closer to the centromere of each chromosome. In addition, our demographic inference suggests that previously estimated bottlenecks for African Drosophila melanogaster are too extreme.

  11. Jellyfish-associated bacterial communities and bacterioplankton in Indonesian Marine lakes.

    PubMed

    Cleary, Daniel F R; Becking, Leontine E; Polónia, Ana R M; Freitas, Rossana M; Gomes, Newton C M

    2016-05-01

    In the present study, we compared communities of bacteria in two jellyfish species (the 'golden' jellyfish Mastigias cf.papua and the box jellyfish Tripedalia cf.cystophora) and water in three marine lakes located in the Berau region of northeastern Borneo, Indonesia. Jellyfish-associated bacterial communities were compositionally distinct and less diverse than bacterioplankton communities. Alphaproteobacteria, Gammaproteobacteria, Synechococcophycidae and Flavobacteriia were the most abundant classes in water. Jellyfish-associated bacterial communities were dominated by OTUs assigned to the Gammaproteobacteria (family Endozoicimonaceae), Mollicutes, Spirochaetes and Alphaproteobacteria (orders Kiloniellales and Rhodobacterales). Mollicutes were mainly restricted to Mastigias whereas Spirochaetes and the order Kiloniellales were most abundant in Tripedalia hosts. The most abundant OTU overall in jellyfish hosts was assigned to the family Endozoicimonaceae and was highly similar to organisms in Genbank obtained from various hosts including an octocoral, bivalve and fish species. Other abundant OTUs included an OTU assigned to the order Entomoplasmatales and mainly found in Mastigias hosts and OTUs assigned to the Spirochaetes and order Kiloniellales and mainly found in Tripedalia hosts. The low sequence similarity of the Entomoplasmatales OTU to sequences in Genbank suggests that it may be a novel lineage inhabiting Mastigias and possibly restricted to marine lakes. PMID:27004797

  12. Away from darkness: a review on the effects of solar radiation on heterotrophic bacterioplankton activity

    PubMed Central

    Ruiz-González, Clara; Simó, Rafel; Sommaruga, Ruben; Gasol, Josep M.

    2013-01-01

    Heterotrophic bacterioplankton are main consumers of dissolved organic matter (OM) in aquatic ecosystems, including the sunlit upper layers of the ocean and freshwater bodies. Their well-known sensitivity to ultraviolet radiation (UVR), together with some recently discovered mechanisms bacteria have evolved to benefit from photosynthetically available radiation (PAR), suggest that natural sunlight plays a relevant, yet difficult to predict role in modulating bacterial biogeochemical functions in aquatic ecosystems. Three decades of experimental work assessing the effects of sunlight on natural bacterial heterotrophic activity reveal responses ranging from high stimulation to total inhibition. In this review, we compile the existing studies on the topic and discuss the potential causes underlying these contrasting results, with special emphasis on the largely overlooked influences of the community composition and the previous light exposure conditions, as well as the different temporal and spatial scales at which exposure to solar radiation fluctuates. These intricate sunlight-bacteria interactions have implications for our understanding of carbon fluxes in aquatic systems, yet further research is necessary before we can accurately evaluate or predict the consequences of increasing surface UVR levels associated with global change. PMID:23734148

  13. The hidden seasonality of the rare biosphere in coastal marine bacterioplankton.

    PubMed

    Alonso-Sáez, Laura; Díaz-Pérez, Laura; Morán, Xosé Anxelu G

    2015-10-01

    Rare microbial taxa are increasingly recognized to play key ecological roles, but knowledge of their spatio-temporal dynamics is lacking. In a time-series study in coastal waters, we detected 83 bacterial lineages with significant seasonality, including environmentally relevant taxa where little ecological information was available. For example, Verrucomicrobia had recurrent maxima in summer, while the Flavobacteria NS4, NS5 and NS2b clades had contrasting seasonal niches. Among the seasonal taxa, only 4 were abundant and persistent, 20 cycled between rare and abundant and, remarkably, most of them (59) were always rare (contributing < 1% of total reads). We thus demonstrate that seasonal patterns in marine bacterioplankton are largely driven by lineages that never sustain abundant populations. A fewer number of rare taxa (20) also produced episodic 'blooms', and these events were highly synchronized, mostly occurring on a single month. The recurrent seasonal growth and loss of rare bacteria opens new perspectives on the temporal dynamics of the rare biosphere, hitherto mainly characterized by dormancy and episodes of 'boom and bust', as envisioned by the seed-bank hypothesis. The predictable patterns of seasonal reoccurrence are relevant for understanding the ecology of rare bacteria, which may include key players for the functioning of marine ecosystems.

  14. Inorganic Phosphorus Stimulation of Bacterioplankton Production in a Meso-Eutrophic Lake †

    PubMed Central

    Toolan, Tara; Wehr, John D.; Findlay, Stuart

    1991-01-01

    Experiments were conducted to determine whether production of heterotrophic bacterioplankton in a small meso-eutrophic lake was influenced by the dissolved inorganic phosphorus (DIP) supply. DIP may indirectly limit bacterial production by limiting phytoplankton, which in turn may limit the carbon available to bacteria. Direct DIP limitation of bacteria occurs where the availability of DIP for bacteria is insufficient to maintain growth. This work examined direct DIP limitation of bacteria by removing phytoplankton and incubating flasks with or without added P in the dark. Bacterial production was measured via the rate of incorporation of [3H]thymidine ([3H]TdR) into DNA. Bacterial abundance was followed with epifluorescent direct counts. Rates of [3H]TdR incorporation were significantly greater in flasks with added DIP, and changes in cell abundances generally paralleled increases in [3H]TdR incorporation. Even very small additions of P (0.05 μM) were sufficient to stimulate production. DIP addition to whole lakewater also stimulated [3H]TdR incorporation relative to that in zero-addition controls, but there was not a concurrent increase in bacterial cell numbers. The stimulation of [3H]TdR incorporation after DIP addition to whole lakewater was significantly less than the stimulation due to DIP addition to 1-μm-pore-size-filtered lakewater. In this study, addition of DIP caused as much as an eightfold stimulation of [3H]TdR incorporation. PMID:16348528

  15. Large-scale distribution and production of bacterioplankton in the Adriatic Sea

    NASA Astrophysics Data System (ADS)

    Gallina, Alessandra A.; Celussi, Mauro; Del Negro, Paola

    2011-08-01

    Two oceanographic cruises encompassing the whole Adriatic Sea were carried out during February and October 2008. Selected stations were sampled at several depths to determine total prokaryotes and picocyanobacteria abundance using epifluorescence microscopy, and to estimate prokaryotic carbon production by 3H-leucine incorporation. Biological data were related to physical parameters including temperature, salinity and fluorescence, and an attempt to associate bacterial dynamics to water mass characteristics was performed. In both seasons prokaryotic distribution and production showed a decreasing latitudinal gradient likely dependent on riverine inputs highlighted by a strong negative correlation with salinity ( P < 0.001). A vertical gradient with higher cell numbers at the surface and lower values at the bottom layer was also always detected. In the southern basin in February, however, picocyanobacteria were retrieved also in deep waters, probably linked to higher nutrient loads carried by the Levantine Intermediate Waters and/or the deep water ventilation known to occur in this area. From an oceanographic point of view, we sampled within four different water types, but no relationship between these water types and bacterioplankton abundances was found. The present work contributes to the acquisition of a more holistic overview of prokaryotic distribution and production in the Adriatic Sea, both on a spatial and temporal scale.

  16. The native bacterioplankton community in the central baltic sea is influenced by freshwater bacterial species.

    PubMed

    Riemann, L; Leitet, C; Pommier, T; Simu, K; Holmfeldt, K; Larsson, U; Hagström, A

    2008-01-01

    The Baltic Sea is one of the largest brackish environments on Earth. Despite extensive knowledge about food web interactions and pelagic ecosystem functioning, information about the bacterial community composition in the Baltic Sea is scarce. We hypothesized that due to the eutrophic low-salinity environment and the long water residence time (>5 years), the bacterioplankton community from the Baltic proper shows a native "brackish" composition influenced by both freshwater and marine phylotypes. The bacterial community composition in surface water (3-m depth) was examined at a single station throughout a full year. Denaturing gradient gel electrophoresis (DGGE) showed that the community composition changed over the year. Further, it indicated that at the four extensive samplings (16S rRNA gene clone libraries and bacterial isolates from low- and high-nutrient agar plates and seawater cultures), different bacterial assemblages associated with different environmental conditions were present. Overall, the sequencing of 26 DGGE bands, 160 clones, 209 plate isolates, and 9 dilution culture isolates showed that the bacterial assemblage in surface waters of the central Baltic Sea was dominated by Bacteroidetes but exhibited a pronounced influence of typical freshwater phylogenetic groups within Actinobacteria, Verrucomicrobia, and Betaproteobacteria and a lack of typical marine taxa. This first comprehensive analysis of bacterial community composition in the central Baltic Sea points to the existence of an autochthonous estuarine community uniquely adapted to the environmental conditions prevailing in this brackish environment.

  17. Use of phytoplankton-derived dissolved organic carbon by different types of bacterioplankton.

    PubMed

    Sarmento, Hugo; Gasol, Josep M

    2012-09-01

    Phytoplankton and heterotrophic prokaryotes are major components of the microbial food web and interact continuously: heterotrophic prokaryotes utilize the dissolved organic carbon derived from phytoplankton exudation or cell lysis (DOCp), and mineralization by heterotrophic prokaryotes provides inorganic nutrients for phytoplankton. For this reason, these communities are expected to be closely linked, although the study of the interactions between them is still a major challenge. Recent studies have presented interactions between phytoplankton and heterotrophic prokaryotes based on coexistence or covariation throughout a time-series. However, a real quantification of the carbon flow within these networks (defined as the interaction strength, IS) has not been achieved yet. This is critical to understand the selectivity degree of bacteria responding to specific algal DOCp. Here we used microautoradiography to quantify the preferences of the major heterotrophic prokaryote phylogenetic groups on DOC derived from several representative phytoplankton species, and expressed these preferences as an IS value. The distribution of the ISs was not random but rather skewed towards weak interactions, in a similar way as the distributions described for stable complex non-microbial ecosystems, indicating that there are some cases of high specificity on the use of specific algal DOCp by some bacterial groups, but weak interactions are more common and may be relevant as well. The variety of IS patterns observed supports the view that the vast range of different resources (different types of organic molecules) available in the sea selects and maintains the high levels of diversity described for marine bacterioplankton.

  18. Organic substrate quality as the link between bacterioplankton carbon demand and growth efficiency in a temperate salt-marsh estuary.

    PubMed

    Apple, Jude K; del Giorgio, P A

    2007-12-01

    Bacterioplankton communities play a key role in aquatic carbon cycling, specifically with respect to the magnitude of organic carbon processed and partitioning of this carbon into biomass and respiratory losses. Studies of bacterioplankton carbon demand (BCD) and growth efficiency (BGE) frequently report higher values in more productive systems, suggesting these aspects of carbon metabolism may be positively coupled. However, the existence of such a relationship in natural aquatic systems has yet to be identified. Using a comprehensive 2-year study of bacterioplankton carbon metabolism in a temperate estuary, we investigated BCD and BGE and explored factors that may modulate their magnitude and coherence, including nutrient concentrations, dissolved nutrient uptake and source and quality of dissolved organic carbon (DOC). During the course of our study, BCD ranged from 0.4 to 15.9 microg l(-1) h(-1), with an overall mean of 3.8 microg l(-1) h(-1). Mean BGE was similar to that reported for other estuarine systems (0.32) and of comparable range (that is, 0.06-0.68). Initial analyses identified a negative correlation between BCD and BGE, yet removal of the effect of temperature revealed an underlying positive coupling that was also correlated with long-term DOC lability. Whereas BCD was weakly related to ambient DOC concentrations, neither BCD nor BGE showed any relationship with ambient nutrient concentrations or nutrient uptake stoichiometries. We conclude that in this carbon-rich estuary, organic matter source and quality play an important role in regulating the magnitude of carbon metabolism and may be more important than nutrient availability alone in the regulation of BGE.

  19. Free-Living and Particle-Associated Bacterioplankton in Large Rivers of the Mississippi River Basin Demonstrate Biogeographic Patterns.

    PubMed

    Jackson, Colin R; Millar, Justin J; Payne, Jason T; Ochs, Clifford A

    2014-12-01

    The different drainage basins of large rivers such as the Mississippi River represent interesting systems in which to study patterns in freshwater microbial biogeography. Spatial variability in bacterioplankton communities in six major rivers (the Upper Mississippi, Missouri, Illinois, Ohio, Tennessee, and Arkansas) of the Mississippi River Basin was characterized using Ion Torrent 16S rRNA amplicon sequencing. When all systems were combined, particle-associated (>3 μm) bacterial assemblages were found to be different from free-living bacterioplankton in terms of overall community structure, partly because of differences in the proportional abundance of sequences affiliated with major bacterial lineages (Alphaproteobacteria, Cyanobacteria, and Planctomycetes). Both particle-associated and free-living communities ordinated by river system, a pattern that was apparent even after rare sequences or those affiliated with Cyanobacteria were removed from the analyses. Ordination of samples by river system correlated with environmental characteristics of each river, such as nutrient status and turbidity. Communities in the Upper Mississippi and the Missouri and in the Ohio and the Tennessee, pairs of rivers that join each other, contained similar taxa in terms of presence-absence data but differed in the proportional abundance of major lineages. The most common sequence types detected in particle-associated communities were picocyanobacteria in the Synechococcus/Prochlorococcus/Cyanobium (Syn/Pro) clade, while free-living communities also contained a high proportion of LD12 (SAR11/Pelagibacter)-like Alphaproteobacteria. This research shows that while different tributaries of large river systems such as the Mississippi River harbor distinct bacterioplankton communities, there is also microhabitat variation such as that between free-living and particle-associated assemblages.

  20. Bacterioplankton Biogeography of the Atlantic Ocean: A Case Study of the Distance-Decay Relationship

    PubMed Central

    Milici, Mathias; Tomasch, Jürgen; Wos-Oxley, Melissa L.; Decelle, Johan; Jáuregui, Ruy; Wang, Hui; Deng, Zhi-Luo; Plumeier, Iris; Giebel, Helge-Ansgar; Badewien, Thomas H.; Wurst, Mascha; Pieper, Dietmar H.; Simon, Meinhard; Wagner-Döbler, Irene

    2016-01-01

    In order to determine the influence of geographical distance, depth, and Longhurstian province on bacterial community composition and compare it with the composition of photosynthetic micro-eukaryote communities, 382 samples from a depth-resolved latitudinal transect (51°S–47°N) from the epipelagic zone of the Atlantic ocean were analyzed by Illumina amplicon sequencing. In the upper 100 m of the ocean, community similarity decreased toward the equator for 6000 km, but subsequently increased again, reaching similarity values of 40–60% for samples that were separated by ~12,000 km, resulting in a U-shaped distance-decay curve. We conclude that adaptation to local conditions can override the linear distance-decay relationship in the upper epipelagial of the Atlantic Ocean which is apparently not restrained by barriers to dispersal, since the same taxa were shared between the most distant communities. The six Longhurstian provinces covered by the transect were comprised of distinct microbial communities; ~30% of variation in community composition could be explained by province. Bacterial communities belonging to the deeper layer of the epipelagic zone (140–200 m) lacked a distance-decay relationship altogether and showed little provincialism. Interestingly, those biogeographical patterns were consistently found for bacteria from three different size fractions of the plankton with different taxonomic composition, indicating conserved underlying mechanisms. Analysis of the chloroplast 16S rRNA gene sequences revealed that phytoplankton composition was strongly correlated with both free-living and particle associated bacterial community composition (R between 0.51 and 0.62, p < 0.002). The data show that biogeographical patterns commonly found in macroecology do not hold for marine bacterioplankton, most likely because dispersal and evolution occur at drastically different rates in bacteria. PMID:27199923

  1. A metaproteomic assessment of winter and summer bacterioplankton from Antarctic Peninsula coastal surface waters.

    PubMed

    Williams, Timothy J; Long, Emilie; Evans, Flavia; Demaere, Mathew Z; Lauro, Federico M; Raftery, Mark J; Ducklow, Hugh; Grzymski, Joseph J; Murray, Alison E; Cavicchioli, Ricardo

    2012-10-01

    A metaproteomic survey of surface coastal waters near Palmer Station on the Antarctic Peninsula, West Antarctica, was performed, revealing marked differences in the functional capacity of summer and winter communities of bacterioplankton. Proteins from Flavobacteria were more abundant in the summer metaproteome, whereas winter was characterized by proteins from ammonia-oxidizing Marine Group I Crenarchaeota. Proteins prevalent in both seasons were from SAR11 and Rhodobacterales clades of Alphaproteobacteria, as well as many lineages of Gammaproteobacteria. The metaproteome data were used to elucidate the main metabolic and energy generation pathways and transport processes occurring at the microbial level in each season. In summer, autotrophic carbon assimilation appears to be driven by oxygenic photoautotrophy, consistent with high light availability and intensity. In contrast, during the dark polar winter, the metaproteome supported the occurrence of chemolithoautotrophy via the 3-hydroxypropionate/4-hydroxybutyrate cycle and the reverse tricarboxylic acid cycle of ammonia-oxidizing archaea and nitrite-oxidizing bacteria, respectively. Proteins involved in nitrification were also detected in the metaproteome. Taurine appears to be an important source of carbon and nitrogen for heterotrophs (especially SAR11), with transporters and enzymes for taurine uptake and degradation abundant in the metaproteome. Divergent heterotrophic strategies for Alphaproteobacteria and Flavobacteria were indicated by the metaproteome data, with Alphaproteobacteria capturing (by high-affinity transport) and processing labile solutes, and Flavobacteria expressing outer membrane receptors for particle adhesion to facilitate the exploitation of non-labile substrates. TonB-dependent receptors from Gammaproteobacteria and Flavobacteria (particularly in summer) were abundant, indicating that scavenging of substrates was likely an important strategy for these clades of Southern Ocean bacteria

  2. Depth profiles of bacterioplankton assemblages and their activities in the Ross Sea

    NASA Astrophysics Data System (ADS)

    Celussi, Mauro; Cataletto, Bruno; Fonda Umani, Serena; Del Negro, Paola

    2009-12-01

    The identification of bacterial community structure has led, since the beginning of the 1990s, to the idea that bacterioplankton populations are stratified in the water column and that diverse lineages with mostly unknown phenotypes dominate marine microbial communities. The diversity of depth-related assemblages is also reflected in their patterns of activities, as bacteria affiliated to different groups can express different activities in a given ecosystem. We analysed bacterial assemblages (DGGE fingerprinting) and their activities (prokaryotic carbon production, protease, phosphatase, chitinase, beta-glucosidase and lipase activities) in two areas in the Ross Sea, differing mainly in their productivity regime: two stations are located in the Terra Nova Bay polynya area (highly productive during summer) and two close to Cape Adare (low phytoplankton biomass and activity). At every station a pronounced stratification of bacterial assemblages was identified, highlighting epipelagic communities differing substantially from the mesopelagic and the bathypelagic communities. Multivariate analysis suggested that pressure and indirectly light-affected variables (i.e. oxygen and fluorescence) had a great effect on the bacterial communities outcompeting the possible influences of temperature and dissolved organic carbon concentration. Generally activities decreased with depth even though a signal of the Circumpolar Deep Water (CDW) at one of the northern stations corresponded to an increase in some of the degradative activities, generating some 'hot spots' in the profile. We also found that similar assemblages express similar metabolic requirements reflected in analogous patterns of activity (similar degradative potential and leucine uptake rate). Furthermore, the presence of eukaryotic chloroplasts' 16S rDNA in deep samples highlighted how in some cases the dense surface-water formation (in this case High Salinity Shelf Water—HSSW) and downwelling can affect, at least

  3. Bacterioplankton Biogeography of the Atlantic Ocean: A Case Study of the Distance-Decay Relationship.

    PubMed

    Milici, Mathias; Tomasch, Jürgen; Wos-Oxley, Melissa L; Decelle, Johan; Jáuregui, Ruy; Wang, Hui; Deng, Zhi-Luo; Plumeier, Iris; Giebel, Helge-Ansgar; Badewien, Thomas H; Wurst, Mascha; Pieper, Dietmar H; Simon, Meinhard; Wagner-Döbler, Irene

    2016-01-01

    In order to determine the influence of geographical distance, depth, and Longhurstian province on bacterial community composition and compare it with the composition of photosynthetic micro-eukaryote communities, 382 samples from a depth-resolved latitudinal transect (51°S-47°N) from the epipelagic zone of the Atlantic ocean were analyzed by Illumina amplicon sequencing. In the upper 100 m of the ocean, community similarity decreased toward the equator for 6000 km, but subsequently increased again, reaching similarity values of 40-60% for samples that were separated by ~12,000 km, resulting in a U-shaped distance-decay curve. We conclude that adaptation to local conditions can override the linear distance-decay relationship in the upper epipelagial of the Atlantic Ocean which is apparently not restrained by barriers to dispersal, since the same taxa were shared between the most distant communities. The six Longhurstian provinces covered by the transect were comprised of distinct microbial communities; ~30% of variation in community composition could be explained by province. Bacterial communities belonging to the deeper layer of the epipelagic zone (140-200 m) lacked a distance-decay relationship altogether and showed little provincialism. Interestingly, those biogeographical patterns were consistently found for bacteria from three different size fractions of the plankton with different taxonomic composition, indicating conserved underlying mechanisms. Analysis of the chloroplast 16S rRNA gene sequences revealed that phytoplankton composition was strongly correlated with both free-living and particle associated bacterial community composition (R between 0.51 and 0.62, p < 0.002). The data show that biogeographical patterns commonly found in macroecology do not hold for marine bacterioplankton, most likely because dispersal and evolution occur at drastically different rates in bacteria.

  4. Temporal and Vertical Distributions of Bacterioplankton at the Gray's Reef National Marine Sanctuary

    PubMed Central

    Lu, Xinxin; Sun, Shulei; Zhang, Yu-Qin; Hollibaugh, James T.

    2014-01-01

    Large spatial scales and long-term shifts of bacterial community composition (BCC) in the open ocean can often be reliably predicted based on the dynamics of physical-chemical variables. The power of abiotic factors in shaping BCC on shorter time scales in shallow estuarine mixing zones is less clear. We examined the diurnal variation in BCC at different water depths in the spring and fall of 2011 at a station in the Gray's Reef National Marine Sanctuary (GRNMS). This site is located in the transition zone between the estuarine plume and continental shelf waters of the South Atlantic Bight. A total of 234,516 pyrotag sequences of bacterial 16S rRNA genes were recovered; they were taxonomically affiliated with >200 families of 23 bacterial phyla. Nonmetric multidimensional scaling analysis revealed significant differences in BCC between spring and fall samples, likely due to seasonality in the concentrations of dissolved organic carbon and nitrate plus nitrite. Within each diurnal sampling, BCC differed significantly by depth only in the spring and differed significantly between day and night only in the fall. The former variation largely tracked changes in light availability, while the latter was most correlated with concentrations of polyamines and chlorophyll a. Our results suggest that at the GRNMS, a coastal mixing zone, diurnal variation in BCC is attributable to the mixing of local and imported bacterioplankton rather than to bacterial growth in response to environmental changes. Our results also indicate that, like members of the Roseobacter clade, SAR11 bacteria may play an important role in processing dissolved organic material in coastal oceans. PMID:25416764

  5. A metaproteomic assessment of winter and summer bacterioplankton from Antarctic Peninsula coastal surface waters

    PubMed Central

    Williams, Timothy J; Long, Emilie; Evans, Flavia; DeMaere, Mathew Z; Lauro, Federico M; Raftery, Mark J; Ducklow, Hugh; Grzymski, Joseph J; Murray, Alison E; Cavicchioli, Ricardo

    2012-01-01

    A metaproteomic survey of surface coastal waters near Palmer Station on the Antarctic Peninsula, West Antarctica, was performed, revealing marked differences in the functional capacity of summer and winter communities of bacterioplankton. Proteins from Flavobacteria were more abundant in the summer metaproteome, whereas winter was characterized by proteins from ammonia-oxidizing Marine Group I Crenarchaeota. Proteins prevalent in both seasons were from SAR11 and Rhodobacterales clades of Alphaproteobacteria, as well as many lineages of Gammaproteobacteria. The metaproteome data were used to elucidate the main metabolic and energy generation pathways and transport processes occurring at the microbial level in each season. In summer, autotrophic carbon assimilation appears to be driven by oxygenic photoautotrophy, consistent with high light availability and intensity. In contrast, during the dark polar winter, the metaproteome supported the occurrence of chemolithoautotrophy via the 3-hydroxypropionate/4-hydroxybutyrate cycle and the reverse tricarboxylic acid cycle of ammonia-oxidizing archaea and nitrite-oxidizing bacteria, respectively. Proteins involved in nitrification were also detected in the metaproteome. Taurine appears to be an important source of carbon and nitrogen for heterotrophs (especially SAR11), with transporters and enzymes for taurine uptake and degradation abundant in the metaproteome. Divergent heterotrophic strategies for Alphaproteobacteria and Flavobacteria were indicated by the metaproteome data, with Alphaproteobacteria capturing (by high-affinity transport) and processing labile solutes, and Flavobacteria expressing outer membrane receptors for particle adhesion to facilitate the exploitation of non-labile substrates. TonB-dependent receptors from Gammaproteobacteria and Flavobacteria (particularly in summer) were abundant, indicating that scavenging of substrates was likely an important strategy for these clades of Southern Ocean bacteria

  6. Genomic insights into bacterial DMSP transformations.

    PubMed

    Moran, Mary Ann; Reisch, Chris R; Kiene, Ronald P; Whitman, William B

    2012-01-01

    Genomic and functional genomic methods applied to both model organisms and natural communities have rapidly advanced understanding of bacterial dimethylsulfoniopropionate (DMSP) degradation in the ocean. The genes for the two main pathways in bacterial degradation, routing DMSP to distinctly different biogeochemical fates, have recently been identified. The genes dmdA, -B, -C, and -D mediate the demethylation of DMSP and facilitate retention of carbon and sulfur in the marine microbial food web. The genes dddD, -L, -P, -Q, -W, and -Y mediate the cleavage of DMSP to dimethylsulfide (DMS), with important consequences for ocean-atmosphere sulfur flux. In ocean metagenomes, sufficient copies of these genes are present for approximately 60% of surface ocean bacterial cells to directly participate in DMSP degradation. The factors that regulate these two competing pathways remain elusive, but gene transcription analyses of natural bacterioplankton communities are making headway in unraveling the intricacies of bacterial DMSP processing in the ocean.

  7. Genomic Insights into Bacterial DMSP Transformations

    NASA Astrophysics Data System (ADS)

    Moran, Mary Ann; Reisch, Chris R.; Kiene, Ronald P.; Whitman, William B.

    2012-01-01

    Genomic and functional genomic methods applied to both model organisms and natural communities have rapidly advanced understanding of bacterial dimethylsulfoniopropionate (DMSP) degradation in the ocean. The genes for the two main pathways in bacterial degradation, routing DMSP to distinctly different biogeochemical fates, have recently been identified. The genes dmdA, -B, -C, and -D mediate the demethylation of DMSP and facilitate retention of carbon and sulfur in the marine microbial food web. The genes dddD, -L, -P, -Q, -W, and -Y mediate the cleavage of DMSP to dimethylsulfide (DMS), with important consequences for ocean-atmosphere sulfur flux. In ocean metagenomes, sufficient copies of these genes are present for ˜60% of surface ocean bacterial cells to directly participate in DMSP degradation. The factors that regulate these two competing pathways remain elusive, but gene transcription analyses of natural bacterioplankton communities are making headway in unraveling the intricacies of bacterial DMSP processing in the ocean.

  8. Bacterioplankton features and its relations with doc characteristics and other limnological variables in Paraná river floodplain environments (PR/MS-Brazil)

    PubMed Central

    Teixeira, Mariana Carolina; Santana, Natália Fernanda; de Azevedo, Júlio César Rodrigues; Pagioro, Thomaz Aurélio

    2011-01-01

    Since the introduction of the Microbial Loop concept, many studies aimed to explain the role of bacterioplankton and dissolved organic carbon (DOC) in aquatic ecosystems. Paraná River floodplain system is a very complex environment where these subjects were little explored. The aim of this work was to characterize bacterial community in terms of density, biomass and biovolume in some water bodies of this floodplain and to verify its temporal variation and its relation with some limnological variables, including some indicators of DOC quality, obtained through Ultraviolet-visible (UV-VIS) and fluorescence spectroscopic analysis. Bacterial density, biomass and biovolume are similar to those from other freshwater environments and both density and biomass were higher in the period with less rain. The limnological and spectroscopic features that showed any relation with bacterioplankton were the concentrations of N-NH4 and P-PO4, water transparency, and some indicators of DOC quality and origin. The analysis of these relations showed a possible competition between bacterioplankton and phytoplankton for inorganic nutrients and that the DOC used by bacterioplankton is labile and probably from aquatic macrophytes. PMID:24031705

  9. Community differentiation and population enrichment of Sargasso Sea bacterioplankton in the euphotic zone of a mesoscale mode-water eddy.

    PubMed

    Nelson, Craig E; Carlson, Craig A; Ewart, Courtney S; Halewood, Elisa R

    2014-03-01

    Eddies are mesoscale oceanographic features (∼ 200 km diameter) that can cause transient blooms of phytoplankton by shifting density isoclines in relation to light and nutrient resources. To better understand how bacterioplankton respond to eddies, we examined depth-resolved distributions of bacterial populations across an anticyclonic mode-water eddy in the Sargasso Sea. Previous work on this eddy has documented elevated phytoplankton productivity and diatom abundance within the eddy centre with coincident bacterial productivity and biomass maxima. We illustrate bacterial community shifts within the eddy centre, differentiating populations uplifted along isopycnals from those enriched or depleted at horizons of enhanced bacterial and primary productivity. Phylotypes belonging to the Roseobacter, OCS116 and marine Actinobacteria clades were enriched in the eddy core and were highly correlated with pigment-based indicators of diatom abundance, supporting developing hypotheses that members of these clades associate with phytoplankton blooms. Typical mesopelagic clades (SAR202, SAR324, SAR406 and SAR11 IIb) were uplifted within the eddy centre, increasing bacterial diversity in the lower euphotic zone. Typical surface oligotrophic clades (SAR116, OM75, Prochlorococcus and SAR11 Ia) were relatively depleted in the eddy centre. The biogeochemical context of a bloom-inducing eddy provides insight into the ecology of the diverse uncultured bacterioplankton dominating the oligotrophic oceans.

  10. [Bacterioplankton index of biotic integrity (BP-IBI): an approach for assessing river ecosystem health in Dianchi watershed].

    PubMed

    Huang, Yi; Shu, Zhong-Ya

    2013-08-01

    The index of biotic integrity (IBI) has been widely applied to the health assessment of river ecosystems. However, the currently available IBI methods are lack of decomposer-based assessment. Based on the T-RFLP result of bacterioplankton, we developed the bacterioplankton index of biotic integrity (BP-IBI) after the screening of major environmental factors and candidate metrics to assess the health of the inflow rivers in Dianchi Watershed. The evaluation result indicated that the eco-health conditions of 11 reference sites were either level I (8 sites) or level II (3 sites), while the 27 damaged sites were level I (4 sites), level II (14 sites), level III (7 sites), and level IV (2 sites), and there was no level V site. Compared with the other IBI methods and the integrated pollution index, BP-IBI showed better effect in reflecting the influence of the key environmental factors, the land use types and the upstream water types in river ecosystems. Therefore, BP-IBI is a good method to characterize the health status of river ecosystems.

  11. Response of bacterioplankton community structure to an artificial gradient of pCO2 in the Arctic Ocean

    NASA Astrophysics Data System (ADS)

    Zhang, R.; Xia, X.; Lau, S. C. K.; Motegi, C.; Weinbauer, M. G.; Jiao, N.

    2012-08-01

    The influences of ocean acidification on bacterial diversity were investigated using DNA fingerprinting and clone library analysis of bacterioplankton samples collected from the largest CO2 manipulation mesocosm study that had been performed thus far. Terminal restriction fragment length polymorphism analysis of the PCR amplicons of the 16S rRNA genes revealed that bacterial diversity, species richness and community structure varied with the time of incubation but not the degree of ocean acidification. The phylogenetic composition of the major bacterial assemblage after a 30-day incubation under various pCO2 concentrations did not show clear effects of pCO2 levels. However, the maximum apparent diversity and species richness which occurred during incubation differed in the high and low pCO2 treatments, in which different bacterial community structure harbored. In addition, total alkalinity was one of the contributing factors for the temporal variations in bacterial community structure observed during incubation. A negative relationship between the relative abundance of Bacteroidetes and pCO2 levels was observed for samples at the end of the experiment. Our study suggested that ocean acidification affected the development of bacterial assemblages and potentially impacts the ecological function of the bacterioplankton in the marine ecosystem.

  12. Community differentiation and population enrichment of Sargasso Sea bacterioplankton in the euphotic zone of a mesoscale mode-water eddy.

    PubMed

    Nelson, Craig E; Carlson, Craig A; Ewart, Courtney S; Halewood, Elisa R

    2014-03-01

    Eddies are mesoscale oceanographic features (∼ 200 km diameter) that can cause transient blooms of phytoplankton by shifting density isoclines in relation to light and nutrient resources. To better understand how bacterioplankton respond to eddies, we examined depth-resolved distributions of bacterial populations across an anticyclonic mode-water eddy in the Sargasso Sea. Previous work on this eddy has documented elevated phytoplankton productivity and diatom abundance within the eddy centre with coincident bacterial productivity and biomass maxima. We illustrate bacterial community shifts within the eddy centre, differentiating populations uplifted along isopycnals from those enriched or depleted at horizons of enhanced bacterial and primary productivity. Phylotypes belonging to the Roseobacter, OCS116 and marine Actinobacteria clades were enriched in the eddy core and were highly correlated with pigment-based indicators of diatom abundance, supporting developing hypotheses that members of these clades associate with phytoplankton blooms. Typical mesopelagic clades (SAR202, SAR324, SAR406 and SAR11 IIb) were uplifted within the eddy centre, increasing bacterial diversity in the lower euphotic zone. Typical surface oligotrophic clades (SAR116, OM75, Prochlorococcus and SAR11 Ia) were relatively depleted in the eddy centre. The biogeochemical context of a bloom-inducing eddy provides insight into the ecology of the diverse uncultured bacterioplankton dominating the oligotrophic oceans. PMID:24589288

  13. Near-Bottom Hypoxia Impacts Dynamics of Bacterioplankton Assemblage throughout Water Column of the Gulf of Finland (Baltic Sea).

    PubMed

    Laas, Peeter; Šatova, Elina; Lips, Inga; Lips, Urmas; Simm, Jaak; Kisand, Veljo; Metsis, Madis

    2016-01-01

    Over the past century the spread of hypoxia in the Baltic Sea has been drastic, reaching its 'arm' into the easternmost sub-basin, the Gulf of Finland. The hydrographic and climatological properties of the gulf offer a broad suite of discrete niches for microbial communities. The current study explores spatiotemporal dynamics of bacterioplankton community in the Gulf of Finland using massively parallel sequencing of 16S rRNA fragments obtained by amplifying community DNA from spring to autumn period. The presence of redoxcline and drastic seasonal changes make spatiotemporal dynamics of bacterioplankton community composition (BCC) and abundances in such estuary remarkably complex. To the best of our knowledge, this is the first study that analyses spatiotemporal dynamics of BCC in relation to phytoplankton bloom throughout the water column (and redoxcline), not only at the surface layer. We conclude that capability to survive (or benefit from) shifts between oxic and hypoxic conditions is vital adaptation for bacteria to thrive in such environments. Our results contribute to the understanding of emerging patterns in BCCs that occupy hydrographically similar estuaries dispersed all over the world, and we suggest the presence of a global redox- and salinity-driven metacommunity. These results have important implications for understanding long-term ecological and biogeochemical impacts of hypoxia expansion in the Baltic Sea (and similar ecosystems), as well as global biogeography of bacteria specialized inhabiting similar ecosystems. PMID:27213812

  14. Near-Bottom Hypoxia Impacts Dynamics of Bacterioplankton Assemblage throughout Water Column of the Gulf of Finland (Baltic Sea)

    PubMed Central

    Laas, Peeter; Šatova, Elina; Lips, Inga; Lips, Urmas; Simm, Jaak; Kisand, Veljo; Metsis, Madis

    2016-01-01

    Over the past century the spread of hypoxia in the Baltic Sea has been drastic, reaching its ‘arm’ into the easternmost sub-basin, the Gulf of Finland. The hydrographic and climatological properties of the gulf offer a broad suite of discrete niches for microbial communities. The current study explores spatiotemporal dynamics of bacterioplankton community in the Gulf of Finland using massively parallel sequencing of 16S rRNA fragments obtained by amplifying community DNA from spring to autumn period. The presence of redoxcline and drastic seasonal changes make spatiotemporal dynamics of bacterioplankton community composition (BCC) and abundances in such estuary remarkably complex. To the best of our knowledge, this is the first study that analyses spatiotemporal dynamics of BCC in relation to phytoplankton bloom throughout the water column (and redoxcline), not only at the surface layer. We conclude that capability to survive (or benefit from) shifts between oxic and hypoxic conditions is vital adaptation for bacteria to thrive in such environments. Our results contribute to the understanding of emerging patterns in BCCs that occupy hydrographically similar estuaries dispersed all over the world, and we suggest the presence of a global redox- and salinity-driven metacommunity. These results have important implications for understanding long-term ecological and biogeochemical impacts of hypoxia expansion in the Baltic Sea (and similar ecosystems), as well as global biogeography of bacteria specialized inhabiting similar ecosystems. PMID:27213812

  15. Molecular analyses of the diversity in marine bacterioplankton assemblages along the coastline of the northeastern Gulf of Mexico.

    PubMed

    Olapade, Ola A

    2010-10-01

    Bacterial community diversity in marine bacterioplankton assemblages were examined in 3 coastal locations along the northeastern Gulf of Mexico (GOM) using 16S rRNA gene libraries and fluorescence in situ hybridization approaches. The majority of the sequences (30%-60%) were similar to the 16S rRNA gene sequences of unknown bacteria; however, the operational taxonomic units from members of the Cyanobacteria, Proteobacteria, and Bacteroidetes were also present at the 3 GOM sites. Overall, sequence diversity was more similar between the Gulf sites of Carrabelle and Ochlockonee than between either of the Gulf sites and Apalachicola Bay. Fluorescence in situ hybridization analyses revealed the quantitative predominance of members of the Alphaproteobacteria subclass and the Cytophaga-Flavobacterium cluster within the bacterioplankton assemblages. In general, the study further reveals the presence of many bacterial taxa that have been previously found to be dominant in coastal marine environments. Differences observed in the representation of the various bacterial phylogenetic groups among the GOM coastal sites could be partly attributed to dynamic variations in several site-specific conditions, including intermittent tidal events, nutrient availability, and anthropogenic influences.

  16. Consequences of increased terrestrial dissolved organic matter and temperature on bacterioplankton community composition during a Baltic Sea mesocosm experiment.

    PubMed

    Lindh, Markus V; Lefébure, Robert; Degerman, Rickard; Lundin, Daniel; Andersson, Agneta; Pinhassi, Jarone

    2015-06-01

    Predicted increases in runoff of terrestrial dissolved organic matter (DOM) and sea surface temperatures implicate substantial changes in energy fluxes of coastal marine ecosystems. Despite marine bacteria being critical drivers of marine carbon cycling, knowledge of compositional responses within bacterioplankton communities to such disturbances is strongly limited. Using 16S rRNA gene pyrosequencing, we examined bacterioplankton population dynamics in Baltic Sea mesocosms with treatments combining terrestrial DOM enrichment and increased temperature. Among the 200 most abundant taxa, 62 % either increased or decreased in relative abundance under changed environmental conditions. For example, SAR11 and SAR86 populations proliferated in combined increased terrestrial DOM/temperature mesocosms, while the hgcI and CL500-29 clades (Actinobacteria) decreased in the same mesocosms. Bacteroidetes increased in both control mesocosms and in the combined increased terrestrial DOM/temperature mesocosms. These results indicate considerable and differential responses among distinct bacterial populations to combined climate change effects, emphasizing the potential of such effects to induce shifts in ecosystem function and carbon cycling in the future Baltic Sea. PMID:26022323

  17. Bacterioplankton communities of Crater Lake, OR: Dynamic changes with euphotic zone food web structure and stable deep water populations

    USGS Publications Warehouse

    Urbach, E.; Vergin, K.L.; Larson, G.L.; Giovannoni, S.J.

    2007-01-01

    The distribution of bacterial and archaeal species in Crater Lake plankton varies dramatically over depth and with time, as assessed by hybridization of group-specific oligonucleotides to RNA extracted from lakewater. Nonmetric, multidimensional scaling (MDS) analysis of relative bacterial phylotype densities revealed complex relationships among assemblages sampled from depth profiles in July, August and September of 1997 through 1999. CL500-11 green nonsulfur bacteria (Phylum Chloroflexi) and marine Group I crenarchaeota are consistently dominant groups in the oxygenated deep waters at 300 and 500 m. Other phylotypes found in the deep waters are similar to surface and mid-depth populations and vary with time. Euphotic zone assemblages are dominated either by ??-proteobacteria or CL120-10 verrucomicrobia, and ACK4 actinomycetes. MDS analyses of euphotic zone populations in relation to environmental variables and phytoplankton and zooplankton population structures reveal apparent links between Daphnia pulicaria zooplankton population densities and microbial community structure. These patterns may reflect food web interactions that link kokanee salmon population densities to community structure of the bacterioplankton, via fish predation on Daphnia with cascading consequences to Daphnia bacterivory and predation on bacterivorous protists. These results demonstrate a stable bottom-water microbial community. They also extend previous observations of food web-driven changes in euphotic zone bacterioplankton community structure to an oligotrophic setting. ?? 2007 Springer Science+Business Media B.V.

  18. Inferring genetic networks from microarray data.

    SciTech Connect

    May, Elebeoba Eni; Davidson, George S.; Martin, Shawn Bryan; Werner-Washburne, Margaret C.; Faulon, Jean-Loup Michel

    2004-06-01

    In theory, it should be possible to infer realistic genetic networks from time series microarray data. In practice, however, network discovery has proved problematic. The three major challenges are: (1) inferring the network; (2) estimating the stability of the inferred network; and (3) making the network visually accessible to the user. Here we describe a method, tested on publicly available time series microarray data, which addresses these concerns. The inference of genetic networks from genome-wide experimental data is an important biological problem which has received much attention. Approaches to this problem have typically included application of clustering algorithms [6]; the use of Boolean networks [12, 1, 10]; the use of Bayesian networks [8, 11]; and the use of continuous models [21, 14, 19]. Overviews of the problem and general approaches to network inference can be found in [4, 3]. Our approach to network inference is similar to earlier methods in that we use both clustering and Boolean network inference. However, we have attempted to extend the process to better serve the end-user, the biologist. In particular, we have incorporated a system to assess the reliability of our network, and we have developed tools which allow interactive visualization of the proposed network.

  19. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes

  20. Bacterioplankton community structure in a maritime antarctic oligotrophic lake during a period of holomixis, as determined by denaturing gradient gel electrophoresis (DGGE) and fluorescence in situ hybridization (FISH).

    PubMed

    Pearce, D A

    2003-07-01

    The bacterioplankton community structure in Moss Lake, a maritime Antarctic oligotrophic lake, was determined with vertical depth in the water column, during the ice-free period on Signy Island in the South Orkney Islands. Bacterioplankton community structure was determined using a combination of direct counting of 4',6-diamidino-2-phenylindole (DAPI) stained cells, PCR amplification of 16S rRNA gene fragments, denaturing gradient gel electrophoresis (DGGE) and in situ hybridization with group-specific, fluorescently labeled oligonucleotide probes. Using PCR amplification of 16S rRNA gene fragments and DGGE, the bacterioplankton community composition was shown to be constant with vertical depth in the water column. Specific bacterioplankton species identified through cloning and sequencing the DGGE products obtained were Flavobacterium xinjiangensis (a Flavobacterium), Leptothrix discophora (a beta-Proteobacterium), and a number of uncultured groups: two beta-Proteobacteria, an unclassified Proteobacterium, three sequences from Actinobacteria, and a Cyanobacterium. Fluorescence in situ hybridization (FISH), however, demonstrated that there were minor but significant fluctuations in different groups of bacteria with vertical depth in the water column. It showed that the beta-Proteobacteria accounted for between 26.4 and 71.5%, the alpha-Proteobacteria 2.3-10.6%, the gamma-Proteobacteria 0-29.6%, and the Cytophaga-Flavobacterium group 1.8-23.5% of cells hybridizing to a universal probe. This study reports the first description of the community structure of an oligotrophic Antarctic freshwater lake as determined by PCR-dependent and PCR-independent molecular techniques. It also suggests that the bacterioplankton community of Moss Lake contains classes of bacteria known to be important in freshwater systems elsewhere in the world. PMID:12739078

  1. Temporal variation in the specific growth rate of bacterioplankton in the River Cauvery and its four down stream tributaries in Karnataka State, India.

    PubMed

    Tondoti Sathyanarayana Rao, Harsha; Yamakanamardi, Sadanand Mallappa; Mallaiah, Mahadeveswamy

    2009-07-01

    The temporal variation in the Specific Growth Rate (SGR) of natural population of heterotrophic bacterioplankton of the river Cauvery and its four down stream tributaries in Karnataka State was monitored over a period of two years from February 2000 to January 2002. The SGR was calculated by taking into account only the abundance of bacterioplankton at the beginning (0 h) and at the end (48 h) incubation period, at room or river temperature. The mean SGR was less and significantly different in the surface waters of river Kapila, Shimsha, Suvarnavathy and Arkavathy. But it was more and significantly different in river Cauvery when compared to other tributaries. This suggests that the river Cauvery was more favorable habitat for SGR of bacterioplankton than the other four watercourses studied. Investigation of interrelationship between SGR and other bacterial variables showed presence of only one correlation with direct counts of particle bound bacteria in river Arkavathy. Further, the relationship between SGR of bacterioplankton and other environmental variables showed the presence of six correlations in river Shimsha, five in river Suvarnavathy, three in river Cauvery, and two each in river Kapila and river Arkavathy. Negative SGR were recorded on thirteen occasions in river Cauvery followed by eleven in river Shimsha, nine in river Suvarnavathy, seven in river Arkavathy and five in river Kapila, out of fifty SGR determinations. This negative SGR were a result of decrease in the observed bacterial cell counts after 48 h incubation from that of 0 h count. The probable reason for such negative growth rate and dependency of SGR of bacterioplankton and environmental variables has been discussed. PMID:18600464

  2. Maximum likelihood inference of reticulate evolutionary histories.

    PubMed

    Yu, Yun; Dong, Jianrong; Liu, Kevin J; Nakhleh, Luay

    2014-11-18

    Hybridization plays an important role in the evolution of certain groups of organisms, adaptation to their environments, and diversification of their genomes. The evolutionary histories of such groups are reticulate, and methods for reconstructing them are still in their infancy and have limited applicability. We present a maximum likelihood method for inferring reticulate evolutionary histories while accounting simultaneously for incomplete lineage sorting. Additionally, we propose methods for assessing confidence in the amount of reticulation and the topology of the inferred evolutionary history. Our method obtains accurate estimates of reticulate evolutionary histories on simulated datasets. Furthermore, our method provides support for a hypothesis of a reticulate evolutionary history inferred from a set of house mouse (Mus musculus) genomes. As evidence of hybridization in eukaryotic groups accumulates, it is essential to have methods that infer reticulate evolutionary histories. The work we present here allows for such inference and provides a significant step toward putting phylogenetic networks on par with phylogenetic trees as a model of capturing evolutionary relationships. PMID:25368173

  3. Significance of Viral Lysis and Flagellate Grazing as Factors Controlling Bacterioplankton Production in a Eutrophic Lake

    PubMed Central

    Weinbauer, Markus G.; Höfle, Manfred G.

    1998-01-01

    The effects of viral lysis and heterotrophic nanoflagellate (HNF) grazing on bacterial mortality were estimated in a eutrophic lake (Lake Plußsee in northern Germany) which was separated by a steep temperature and oxygen gradient into a warm and oxic epilimnion and a cold and anoxic hypolimnion. Two transmission electron microscopy-based methods (whole-cell examination and thin sections) were used to determine the frequency of visibly infected cells, and a model was used to estimate bacterial mortality due to viral lysis. Examination of thin sections also showed that between 20.2 and 29.2% (average, 26.1%) of the bacterial cells were empty (ghosts) and thus could not contribute to viral production. The most important finding was that the mechanism for regulating bacterial production shifted with depth from grazing control in the epilimnion to control due to viral lysis in the hypolimnion. We estimated that in the epilimnion viral lysis accounted on average for 8.4 to 41.8% of the summed mortality (calculated by determining the sum of the mortalities due to lysis and grazing), compared to 51.3 to 91.0% of the summed mortality in the metalimninon and 88.5 to 94.2% of the summed mortality in the hypolimnion. Estimates of summed mortality values indicated that bacterial production was controlled completely or almost completely in the epilimnion (summed mortality, 66.6 to 128.5%) and the hypolimnion (summed mortality, 43.4 to 103.3%), whereas in the metalimnion viral lysis and HNF grazing were not sufficient to control bacterial production (summed mortality, 22.4 to 56.7%). The estimated contribution of organic matter released by viral lysis of cells into the pool of dissolved organic matter (DOM) was low; however, since cell lysis products are very likely labile compared to the bulk DOM, they might stimulate bacterial production. The high mortality of bacterioplankton due to viral lysis in anoxic water indicates that a significant portion of bacterial production in

  4. Gene-network inference by message passing

    NASA Astrophysics Data System (ADS)

    Braunstein, A.; Pagnani, A.; Weigt, M.; Zecchina, R.

    2008-01-01

    The inference of gene-regulatory processes from gene-expression data belongs to the major challenges of computational systems biology. Here we address the problem from a statistical-physics perspective and develop a message-passing algorithm which is able to infer sparse, directed and combinatorial regulatory mechanisms. Using the replica technique, the algorithmic performance can be characterized analytically for artificially generated data. The algorithm is applied to genome-wide expression data of baker's yeast under various environmental conditions. We find clear cases of combinatorial control, and enrichment in common functional annotations of regulated genes and their regulators.

  5. Inference in `poor` languages

    SciTech Connect

    Petrov, S.

    1996-10-01

    Languages with a solvable implication problem but without complete and consistent systems of inference rules (`poor` languages) are considered. The problem of existence of finite complete and consistent inference rule system for a ``poor`` language is stated independently of the language or rules syntax. Several properties of the problem arc proved. An application of results to the language of join dependencies is given.

  6. Impacts of combined overfishing and oil spills on the plankton trophodynamics of the West Florida shelf over the last half century of 1965-2011: A two-dimensional simulation analysis of biotic state transitions, from a zooplankton- to a bacterioplankton-modulated ecosystem.

    NASA Astrophysics Data System (ADS)

    Walsh, J. J.; Lenes, J. M.; Darrow, B.; Parks, A.; Weisberg, R. H.

    2016-03-01

    Over 50 years of multiple anthropogenic perturbations, Florida zooplankton stocks of the northeastern Gulf of Mexico declined ten-fold, with increments of mainly dominant toxic dinoflagellate harmful algal blooms (HABs), rather than diatoms, and a shift in loci of nutrient remineralization and oxygen depletion by bacterioplankton, from the sea floor to near surface waters. Yet, lytic bacterial biomass and associated ammonification only increased at most five-fold over the same time period, with consequently little indication of new, expanded "dead zones" of diatom-induced hypoxia. After bacterial lysis of intact cells of these increased HABs, the remaining residues of zooplankton biomass decrements evidently instead exited the water column as malign aerosolized HAB asthma triggers, correlated by co-traveling mercury aerosols, within wind-borne sea sprays. To unravel the causal mechanisms of these inferred decadal food web transitions, a 36-state variable plankton model of algal, bacterial, protozoan, and copepod component communities replicated daily time series of each plankton group's representatives on the West Florida shelf (WFS) during 1965-2011. At the lower phytoplankton trophic levels, 52% of the ungrazed HAB increments, between 1965-1967 and 2001-2002 before recent oil spills, remained in the water column to kill fishes and fuel bacterioplankton. But, another 48% of the WFS primary production then left the ocean's surface as a harbinger of increased public health hazards during continuing sea spray exports of salts, HAB toxins, and Hg poisons. Following the Deepwater Horizon petroleum releases in 2010, little additional change of element partition among the altered importance of WFS food web components of the trophic pyramid then pertained between 2001-2002 and 2010-2011, despite when anomalous upwelled nutrient supplies instead favored retrograde benign, oil-tolerant diatoms over the HABs during 2010. Indeed, by 2011 HABs were back, with biomass

  7. Interannual variability of phyto-bacterioplankton biomass and production in coastal and offshore waters of the Baltic Sea.

    PubMed

    Legrand, Catherine; Fridolfsson, Emil; Bertos-Fortis, Mireia; Lindehoff, Elin; Larsson, Per; Pinhassi, Jarone; Andersson, Agneta

    2015-06-01

    The microbial part of the pelagic food web is seldom characterized in models despite its major contribution to biogeochemical cycles. In the Baltic Sea, spatial and temporal high frequency sampling over three years revealed changes in heterotrophic bacteria and phytoplankton coupling (biomass and production) related to hydrographic properties of the ecosystem. Phyto- and bacterioplankton were bottom-up driven in both coastal and offshore areas. Cold winter temperature was essential for phytoplankton to conform to the successional sequence in temperate waters. In terms of annual carbon production, the loss of the spring bloom (diatoms and dinoflagellates) after mild winters tended not to be compensated for by other taxa, not even summer cyanobacteria. These results improve our ability to project Baltic Sea ecosystem response to short- and long-term environmental changes.

  8. Genome content of uncultivated marine Roseobacters in the surface ocean.

    PubMed

    Luo, Haiwei; Löytynoja, Ari; Moran, Mary Ann

    2012-01-01

    Understanding of the ecological roles and evolutionary histories of marine bacterial taxa can be complicated by mismatches in genome content between wild populations and their better-studied cultured relatives. We used computed patterns of non-synonymous (amino acid-altering) nucleotide diversity in marine metagenomic data to provide high-confidence identification of DNA fragments from uncultivated members of the Roseobacter clade, an abundant taxon of heterotrophic marine bacterioplankton in the world's oceans. Differences in gene stoichiometry in the Global Ocean Survey metagenomic data set compared with 39 sequenced isolates indicated that natural Roseobacter populations differ systematically in several genomic attributes from their cultured representatives, including fewer genes for signal transduction and cell surface modifications but more genes for Sec-like protein secretion systems, anaplerotic CO(2) incorporation, and phosphorus and sulfate uptake. Several of these trends match well with characteristics previously identified as distinguishing r- versus K-selected ecological strategies in bacteria, suggesting that the r-strategist model assigned to cultured roseobacters may be less applicable to their free-living oceanic counterparts. The metagenomic Roseobacter DNA fragments revealed several traits with evolutionary histories suggestive of horizontal gene transfer from other marine bacterioplankton taxa or viruses, including pyrophosphatases and glycosylation proteins.

  9. CGAT: computational genomics analysis toolkit.

    PubMed

    Sims, David; Ilott, Nicholas E; Sansom, Stephen N; Sudbery, Ian M; Johnson, Jethro S; Fawcett, Katherine A; Berlanga-Taylor, Antonio J; Luna-Valero, Sebastian; Ponting, Chris P; Heger, Andreas

    2014-05-01

    Computational genomics seeks to draw biological inferences from genomic datasets, often by integrating and contextualizing next-generation sequencing data. CGAT provides an extensive suite of tools designed to assist in the analysis of genome scale data from a range of standard file formats. The toolkit enables filtering, comparison, conversion, summarization and annotation of genomic intervals, gene sets and sequences. The tools can both be run from the Unix command line and installed into visual workflow builders, such as Galaxy.

  10. Contrasting patterns of free-living bacterioplankton diversity in macrophyte-dominated versus phytoplankton blooming regimes in Dianchi Lake, a shallow lake in China

    NASA Astrophysics Data System (ADS)

    Wang, Yujing; Li, Huabing; Xing, Peng; Wu, Qinglong

    2016-04-01

    Freshwater shallow lakes typically exhibit two alternative stable states under certain nutrient loadings: macrophyte-dominated and phytoplankton-dominated water regimes. An ecosystem regime shift from macrophytes to phytoplankton blooming typically reduces the number of species of invertebrates and fishes and results in the homogenization of communities in freshwater lakes. We investigated how microbial biodiversity has responded to a shift of the ecosystem regime in Dianchi Lake, which was previously fully covered with submerged macrophytes but currently harbors both ecological states. We observed marked divergence in the diversity and community composition of bacterioplankton between the two regimes. Although species richness, estimated as the number of operational taxonomic units and phylogenetic diversity (PD), was higher in the phytoplankton dominated ecosystem after this shift, the dissimilarity of bacterioplankton community across space decreased. This decrease in beta diversity was accompanied by loss of planktonic bacteria unique to the macrophyte-dominated ecosystem. Mantel tests between bacterioplankton community distances and Euclidian distance of environmental parameters indicated that this reduced bacterial community diff erentiation primarily reflected the loss of environmental niches, particularly in the macrophyte regime. The loss of this small-scale heterogeneity in bacterial communities should be considered when assessing long-term biodiversity changes in response to ecosystem regime conversions in freshwater lakes.

  11. Bromodeoxyuridine (BrdU) Labeling and Subsequent Fluorescence Activated Cell Sorting for Culture-independent Identification of Dissolved Organic Carbon-degrading Bacterioplankton

    PubMed Central

    Lu, Xinxin; Moran, Mary Ann; Mou, Xiaozhen

    2011-01-01

    Microbes are major agents mediating the degradation of numerous dissolved organic carbon (DOC) substrates in aquatic environments. However, identification of bacterial taxa that transform specific pools of DOC in nature poses a technical challenge. Here we describe an approach that couples bromodeoxyuridine (BrdU) incorporation, fluorescence activated cell sorting (FACS), and 16S rRNA gene-based molecular analysis that allows culture-independent identification of bacterioplankton capable of degrading a specific DOC compound in aquatic environments. Triplicate bacterioplankton microcosms are set up to receive both BrdU and a model DOC compound (DOC amendments), or only BrdU (no-addition control). BrdU substitutes the positions of thymidine in newly synthesized bacterial DNA and BrdU-labeled DNA can be readily immunodetected 1,2. Through a 24-hr incubation, bacterioplankton that are able to use the added DOC compound are expected to be selectively activated, and therefore have higher levels of BrdU incorporation (HI cells) than non-responsive cells in the DOC amendments and cells in no-addition controls (low BrdU incorporation cells, LI cells). After fluorescence immunodetection, HI cells are distinguished and physically separated from the LI cells by fluorescence activated cell sorting (FACS) 3. Sorted DOC-responsive cells (HI cells) are extracted for DNA and taxonomically identified through subsequent 16S rRNA gene-based analyses including PCR, clone library construction and sequencing. PMID:21931294

  12. Effects of decreased resource availability, protozoan grazing and viral impact on a structure of bacterioplankton assemblage in a canyon-shaped reservoir.

    PubMed

    Hornák, Karel; Masín, Michal; Jezbera, Jan; Bettarel, Yvan; Nedoma, Jirí; Sime-Ngando, Télesphore; Simek, Karel

    2005-05-01

    We conducted a transplant experiment to elucidate the effects of different levels of grazing pressure, nutrient availability, especially phosphorus, and the impact of viruses on the changes in the structure of bacterioplankton assemblage in a meso-eutrophic reservoir. A sample taken from the nutrient-rich inflow part of the reservoir was size-fractionated and incubated in dialysis bags in both inflow and dam area. The structure of bacterial assemblage was examined by fluorescence in situ hybridization using oligonucleotide probes with different levels of specificity. In terms of the relative proportions of different bacterial groups, we found very few significant changes in the bacterioplankton composition after transplanting the treatments to the nutrient-poor dam area. However, we observed marked shifts in morphology and biomass towards the development of filaments, flocs and "vibrio-like" morphotypes of selected probe-defined groups of bacteria induced by increased grazing pressure. Despite the very high abundances of viruses in all the treatments, their effects on bacterioplankton were rather negligible.

  13. [Genomic structure of the autotetraploid oat species Avena macrostachya inferred from comparative analysis of the ITS1 and ITS2 sequences: on the oat karyotype evolution during the early stages of the Avena species divergence].

    PubMed

    Rodionov, A V; Tiupa, N B; Kim, E S; Machs, E M; Loskutov, I G

    2005-05-01

    To examine the genomic structure of Avena macrostachya, internal transcribed spacers, ITS1 and ITS2, as well as nuclear 5.8S tRNA genes from three oat species with AsAs karyotype (A. wiestii, A. hirtula, and A. atlantica), and those from A. longiglumis (AlAl), A. canariensis (AcAc), A. ventricosa (CvCv), A. pilosa, and A. clauda (CpCp) were sequenced. All species of the genus Avena examined represented a monophyletic group (bootstrap index = 98), within which two branches, i.e., species with A- and C-genomes, were distinguished (bootstrap indices = 100). The subject of our study, A. macrostachya, albeit belonging to the phylogenetic branch of C-genome oat species (karyotype with submetacentic and subacrocentric chromosomes), has preserved an isobrachyal karyotype, (i.e., that containing metacentric chromosomes), probably typical of the common Avena ancestor. It was suggested to classify the A. macrostachya genome as a specific form of C-genome, Cm-genome. Among the species from other genera studied, Arrhenatherum elatius was found to be the closest to Avena in ITS1 and ITS structure. Phylogenetic relationships between Avena and Helictotrichon remain intriguingly uncertain. The HPR389153 sequence from H. pratense genome was closest to the ITS1 sequences specific to the Avena A-genomes (p-distance = 0.0237), while the differences of this sequence from the ITS1 of A. macrostachya reached 0.1221. On the other hand, HAD389117 from H. adsurgens was close to the ITS1 specific to Avena C-genomes (p-distance = 0.0189), while its differences from the A-genome specific ITS1 sequences reached 0.1221. It seems likely that the appearance of highly polyploid (2n = 12-21x) species of H. pratense and H. adsurgens could be associated with interspecific hybridization involving Mediterranean oat species carrying A- and C-genomes. A hypothesis on the pathways of Avena chromosomes evolution during the early stages the oat species divergence is proposed.

  14. Trace Metal Acquisition by Marine Heterotrophic Bacterioplankton with Contrasting Trophic Strategies

    PubMed Central

    Barbeau, Katherine A.

    2016-01-01

    Heterotrophic bacteria in the SAR11 and Roseobacter lineages shape the marine carbon, nitrogen, phosphorous, and sulfur cycles, yet they do so having adopted divergent ecological strategies. Currently, it is unknown whether these globally significant groups partition into specific niches with respect to micronutrients (e.g., trace metals) and how that may affect marine trace metal cycling. Here, we used comparative genomics to identify diverse iron, cobalt, nickel, copper, and zinc uptake capabilities in SAR11 and Roseobacter genomes and uncover surprising unevenness within and between lineages. The strongest predictors for the extent of the metal uptake gene content are the total number of transporters per genome, genome size, total metal transporters, and GC content, but numerous exceptions exist in both groups. Taken together, our results suggest that SAR11 have strongly minimized their trace metal uptake versatility, with high-affinity zinc uptake being a unique exception. The larger Roseobacter genomes have greater trace metal uptake versatility on average, but they also appear to have greater plasticity, resulting in phylogenetically similar genomes having largely different capabilities. Ultimately, phylogeny is predictive of the diversity and extent of 20 to 33% of all metal uptake systems, suggesting that specialization in metal utilization mostly occurred independently from overall lineage diversification in both SAR11 and Roseobacter. We interpret these results as reflecting relatively recent trace metal niche partitioning in both lineages, suggesting that concentrations and chemical forms of metals in the marine environment are important factors shaping the gene content of marine heterotrophic Alphaproteobacteria of the SAR11 and Roseobacter lineages. PMID:26729720

  15. Trace Metal Acquisition by Marine Heterotrophic Bacterioplankton with Contrasting Trophic Strategies.

    PubMed

    Hogle, Shane L; Thrash, J Cameron; Dupont, Chris L; Barbeau, Katherine A

    2016-03-01

    Heterotrophic bacteria in the SAR11 and Roseobacter lineages shape the marine carbon, nitrogen, phosphorous, and sulfur cycles, yet they do so having adopted divergent ecological strategies. Currently, it is unknown whether these globally significant groups partition into specific niches with respect to micronutrients (e.g., trace metals) and how that may affect marine trace metal cycling. Here, we used comparative genomics to identify diverse iron, cobalt, nickel, copper, and zinc uptake capabilities in SAR11 and Roseobacter genomes and uncover surprising unevenness within and between lineages. The strongest predictors for the extent of the metal uptake gene content are the total number of transporters per genome, genome size, total metal transporters, and GC content, but numerous exceptions exist in both groups. Taken together, our results suggest that SAR11 have strongly minimized their trace metal uptake versatility, with high-affinity zinc uptake being a unique exception. The larger Roseobacter genomes have greater trace metal uptake versatility on average, but they also appear to have greater plasticity, resulting in phylogenetically similar genomes having largely different capabilities. Ultimately, phylogeny is predictive of the diversity and extent of 20 to 33% of all metal uptake systems, suggesting that specialization in metal utilization mostly occurred independently from overall lineage diversification in both SAR11 and Roseobacter. We interpret these results as reflecting relatively recent trace metal niche partitioning in both lineages, suggesting that concentrations and chemical forms of metals in the marine environment are important factors shaping the gene content of marine heterotrophic Alphaproteobacteria of the SAR11 and Roseobacter lineages. PMID:26729720

  16. The Bayes Inference Engine

    SciTech Connect

    Hanson, K.M.; Cunningham, G.S.

    1996-04-01

    The authors are developing a computer application, called the Bayes Inference Engine, to provide the means to make inferences about models of physical reality within a Bayesian framework. The construction of complex nonlinear models is achieved by a fully object-oriented design. The models are represented by a data-flow diagram that may be manipulated by the analyst through a graphical programming environment. Maximum a posteriori solutions are achieved using a general, gradient-based optimization algorithm. The application incorporates a new technique of estimating and visualizing the uncertainties in specific aspects of the model.

  17. Biogeography and environmental genomics of the Roseobacter-affiliated pelagic CHAB-I-5 lineage.

    PubMed

    Billerbeck, Sara; Wemheuer, Bernd; Voget, Sonja; Poehlein, Anja; Giebel, Helge-Ansgar; Brinkhoff, Thorsten; Gram, Lone; Jeffrey, Wade H; Daniel, Rolf; Simon, Meinhard

    2016-01-01

    The identification and functional characterization of microbial communities remains a prevailing topic in microbial oceanography as information on environmentally relevant pelagic prokaryotes is still limited. The Roseobacter group, an abundant lineage of marine Alphaproteobacteria, can constitute large proportions of the bacterioplankton. Roseobacters also occur associated with eukaryotic organisms and possess streamlined as well as larger genomes from 2.2 to >5 Mpb. Here, we show that one pelagic cluster of this group, CHAB-I-5, occurs globally from tropical to polar regions and accounts for up to 22% of the active North Sea bacterioplankton in the summer. The first sequenced genome of a CHAB-I-5 organism comprises 3.6 Mbp and exhibits features of an oligotrophic lifestyle. In a metatranscriptome of North Sea surface waters, 98% of the encoded genes were present, and genes encoding various ABC transporters, glutamate synthase and CO oxidation were particularly upregulated. Phylogenetic gene content analyses of 41 genomes of the Roseobacter group revealed a unique cluster of pelagic organisms distinct from other lineages of this group, highlighting the adaptation to life in nutrient-depleted environments. PMID:27572966

  18. Diurnal variation in bacterioplankton composition and DNA damage in the microbial community from an Andean oligotrophic lake.

    PubMed

    Fernández-Zenoff, María V; Estévez, María C; Farías, María E

    2014-01-01

    Laguna Azul is an oligotrophic lake situated at 4,560 m above sea level and subject to a high level of solar radiation. Bacterioplankton community composition (BCC) was analysed by denaturing gradient gel electrophoresis and the impact of solar ultraviolet radiation was assessed by measuring cyclobutane pyrimidine dimers (CPD). Furthermore, pure cultures of Acinetobacter johnsonii A2 and Rhodococcus sp. A5 were exposed simultaneously and CPD accumulation was studied. Gel analyses generated a total of 7 sequences belonging to Alpha-proteobacteria (1 band), Beta-proteobacteria (1 band), Bacteroidetes (2 bands), Actinobacteria (1 band), and Firmicutes (1 band). DGGE profiles showed minimal changes in BCC and no CPD was detected even though a high level of damage was found in biodosimeters. A. johnsonii A2 showed low level of DNA damage while Rhodococcus sp. A5 exhibited high resistance since no CPD were detected under natural UV-B exposure, suggesting that the bacterial community is well adapted to this highly solar irradiated environment. PMID:25576421

  19. Betaproteobacteria Limnohabitans strains increase fecundity in the crustacean Daphnia magna: symbiotic relationship between major bacterioplankton and zooplankton in freshwater ecosystem.

    PubMed

    Peerakietkhajorn, Saranya; Kato, Yasuhiko; Kasalický, Vojtěch; Matsuura, Tomoaki; Watanabe, Hajime

    2016-09-01

    How symbioses between bacteria and aquatic animals influence food webs in freshwater ecosystems is a fundamental question in ecology. We investigated symbiosis between a crustacean zooplankton Daphnia magna and its dominant bacterial symbiont Limnohabitans, an abundant and globally distributed freshwater Betaproteobacteria. Aposymbiotic juvenile Daphnia were prepared and exposed to any of four Limnohabitans sp. - Limnohabitans strains DM1, 2KL-3, 2KL-7 and Limnohabitans planktonicus strain II-D5, all previously found in D. magna digestive tract or culture. Re-infected Daphnia were cultured until they produced the first clutch of juveniles. Limnohabitans strain DM1 and L. planktonicus strain II-D5 successfully re-infected Daphnia through single exposure at the first instar juvenile stage. In contrast to aposymbiotic Daphnia that produced non-viable juveniles, re-infected Daphnia produced viable juveniles and increased fecundity to levels of that of symbiotic Daphnia. Re-infected Daphnia did not increase their number of eggs nor growth rates. Limnohabitans strains 2KL-7 and 2KL-3 could not recover fecundity even in multiple exposures during culture. This study shows the functional evidence demonstrating that a single bacterium Limnohabitans regulates fecundity of the consumer Daphnia through symbiosis. Our results indicated that symbiotic relationship between major bacterioplankton and zooplankton is important for maintaining the population of zooplankton in freshwater ecosystems.

  20. Diurnal variation in bacterioplankton composition and DNA damage in the microbial community from an Andean oligotrophic lake.

    PubMed

    Fernández-Zenoff, María V; Estévez, María C; Farías, María E

    2014-01-01

    Laguna Azul is an oligotrophic lake situated at 4,560 m above sea level and subject to a high level of solar radiation. Bacterioplankton community composition (BCC) was analysed by denaturing gradient gel electrophoresis and the impact of solar ultraviolet radiation was assessed by measuring cyclobutane pyrimidine dimers (CPD). Furthermore, pure cultures of Acinetobacter johnsonii A2 and Rhodococcus sp. A5 were exposed simultaneously and CPD accumulation was studied. Gel analyses generated a total of 7 sequences belonging to Alpha-proteobacteria (1 band), Beta-proteobacteria (1 band), Bacteroidetes (2 bands), Actinobacteria (1 band), and Firmicutes (1 band). DGGE profiles showed minimal changes in BCC and no CPD was detected even though a high level of damage was found in biodosimeters. A. johnsonii A2 showed low level of DNA damage while Rhodococcus sp. A5 exhibited high resistance since no CPD were detected under natural UV-B exposure, suggesting that the bacterial community is well adapted to this highly solar irradiated environment.

  1. Adaptation of an abundant Roseobacter RCA organism to pelagic systems revealed by genomic and transcriptomic analyses.

    PubMed

    Voget, Sonja; Wemheuer, Bernd; Brinkhoff, Thorsten; Vollmers, John; Dietrich, Sascha; Giebel, Helge-Ansgar; Beardsley, Christine; Sardemann, Carla; Bakenhus, Insa; Billerbeck, Sara; Daniel, Rolf; Simon, Meinhard

    2015-02-01

    The RCA (Roseobacter clade affiliated) cluster, with an internal 16S rRNA gene sequence similarity of >98%, is the largest cluster of the marine Roseobacter clade and most abundant in temperate to (sub)polar oceans, constituting up to 35% of total bacterioplankton. The genome analysis of the first described species of the RCA cluster, Planktomarina temperata RCA23, revealed that this phylogenetic lineage is deeply branching within the Roseobacter clade. It shares not >65.7% of homologous genes with any other organism of this clade. The genome is the smallest of all closed genomes of the Roseobacter clade, exhibits various features of genome streamlining and encompasses genes for aerobic anoxygenic photosynthesis (AAP) and CO oxidation. In order to assess the biogeochemical significance of the RCA cluster we investigated a phytoplankton spring bloom in the North Sea. This cluster constituted 5.1% of the total, but 10-31% (mean 18.5%) of the active bacterioplankton. A metatranscriptomic analysis showed that the genome of P. temperata RCA23 was transcribed to 94% in the bloom with some variations during day and night. The genome of P. temperata RCA23 was also retrieved to 84% from metagenomic data sets from a Norwegian fjord and to 82% from stations of the Global Ocean Sampling expedition in the northwestern Atlantic. In this region, up to 6.5% of the total reads mapped on the genome of P. temperata RCA23. This abundant taxon appears to be a major player in ocean biogeochemistry. PMID:25083934

  2. Adaptation of an abundant Roseobacter RCA organism to pelagic systems revealed by genomic and transcriptomic analyses

    PubMed Central

    Voget, Sonja; Wemheuer, Bernd; Brinkhoff, Thorsten; Vollmers, John; Dietrich, Sascha; Giebel, Helge-Ansgar; Beardsley, Christine; Sardemann, Carla; Bakenhus, Insa; Billerbeck, Sara; Daniel, Rolf; Simon, Meinhard

    2015-01-01

    The RCA (Roseobacter clade affiliated) cluster, with an internal 16S rRNA gene sequence similarity of >98%, is the largest cluster of the marine Roseobacter clade and most abundant in temperate to (sub)polar oceans, constituting up to 35% of total bacterioplankton. The genome analysis of the first described species of the RCA cluster, Planktomarina temperata RCA23, revealed that this phylogenetic lineage is deeply branching within the Roseobacter clade. It shares not >65.7% of homologous genes with any other organism of this clade. The genome is the smallest of all closed genomes of the Roseobacter clade, exhibits various features of genome streamlining and encompasses genes for aerobic anoxygenic photosynthesis (AAP) and CO oxidation. In order to assess the biogeochemical significance of the RCA cluster we investigated a phytoplankton spring bloom in the North Sea. This cluster constituted 5.1% of the total, but 10–31% (mean 18.5%) of the active bacterioplankton. A metatranscriptomic analysis showed that the genome of P. temperata RCA23 was transcribed to 94% in the bloom with some variations during day and night. The genome of P. temperata RCA23 was also retrieved to 84% from metagenomic data sets from a Norwegian fjord and to 82% from stations of the Global Ocean Sampling expedition in the northwestern Atlantic. In this region, up to 6.5% of the total reads mapped on the genome of P. temperata RCA23. This abundant taxon appears to be a major player in ocean biogeochemistry. PMID:25083934

  3. Towards Context Sensitive Information Inference.

    ERIC Educational Resources Information Center

    Song, D.; Bruza, P. D.

    2003-01-01

    Discusses information inference from a psychologistic stance and proposes an information inference mechanism that makes inferences via computations of information flow through an approximation of a conceptual space. Highlights include cognitive economics of information processing; context sensitivity; and query models for information retrieval.…

  4. Multimodel inference and adaptive management

    USGS Publications Warehouse

    Rehme, S.E.; Powell, L.A.; Allen, C.R.

    2011-01-01

    Ecology is an inherently complex science coping with correlated variables, nonlinear interactions and multiple scales of pattern and process, making it difficult for experiments to result in clear, strong inference. Natural resource managers, policy makers, and stakeholders rely on science to provide timely and accurate management recommendations. However, the time necessary to untangle the complexities of interactions within ecosystems is often far greater than the time available to make management decisions. One method of coping with this problem is multimodel inference. Multimodel inference assesses uncertainty by calculating likelihoods among multiple competing hypotheses, but multimodel inference results are often equivocal. Despite this, there may be pressure for ecologists to provide management recommendations regardless of the strength of their study’s inference. We reviewed papers in the Journal of Wildlife Management (JWM) and the journal Conservation Biology (CB) to quantify the prevalence of multimodel inference approaches, the resulting inference (weak versus strong), and how authors dealt with the uncertainty. Thirty-eight percent and 14%, respectively, of articles in the JWM and CB used multimodel inference approaches. Strong inference was rarely observed, with only 7% of JWM and 20% of CB articles resulting in strong inference. We found the majority of weak inference papers in both journals (59%) gave specific management recommendations. Model selection uncertainty was ignored in most recommendations for management. We suggest that adaptive management is an ideal method to resolve uncertainty when research results in weak inference.

  5. Regulon inference without arbitrary thresholds: three levels of sensitivity

    SciTech Connect

    Dubchak, Pavel Novichkov, Elena Stavrovskaya, Dmitry Rodionov, Andrey Mironov, Inna; Rodionov, Dmitry; Mironov, Andrey; Dubchak, Inna; Novichkov, P.S.

    2010-11-15

    Reconstruction of transcriptional regulatory networks is one of the major challenges facing the bioinformatics community in view of constantly growing number of complete genomes. The comparative genomics approach has been successfully used for the analysis of the transcriptional regulation of many metabolic systems in various bacteria taxa. The key step in this approach is given a position weight matrix, find an optimal threshold for the search of potential binding sites in genomes. In our previous work we proposed an approach for automatic selection of TFBS score threshold coupled with inference of regulon content. In this study we developed two modifications of this approach providing two additional levels of sensitivity.

  6. Visual Inference Programming

    NASA Technical Reports Server (NTRS)

    Wheeler, Kevin; Timucin, Dogan; Rabbette, Maura; Curry, Charles; Allan, Mark; Lvov, Nikolay; Clanton, Sam; Pilewskie, Peter

    2002-01-01

    The goal of visual inference programming is to develop a software framework data analysis and to provide machine learning algorithms for inter-active data exploration and visualization. The topics include: 1) Intelligent Data Understanding (IDU) framework; 2) Challenge problems; 3) What's new here; 4) Framework features; 5) Wiring diagram; 6) Generated script; 7) Results of script; 8) Initial algorithms; 9) Independent Component Analysis for instrument diagnosis; 10) Output sensory mapping virtual joystick; 11) Output sensory mapping typing; 12) Closed-loop feedback mu-rhythm control; 13) Closed-loop training; 14) Data sources; and 15) Algorithms. This paper is in viewgraph form.

  7. Graphical inference for Infovis.

    PubMed

    Wickham, Hadley; Cook, Dianne; Hofmann, Heike; Buja, Andreas

    2010-01-01

    How do we know if what we see is really there? When visualizing data, how do we avoid falling into the trap of apophenia where we see patterns in random noise? Traditionally, infovis has been concerned with discovering new relationships, and statistics with preventing spurious relationships from being reported. We pull these opposing poles closer with two new techniques for rigorous statistical inference of visual discoveries. The "Rorschach" helps the analyst calibrate their understanding of uncertainty and "line-up" provides a protocol for assessing the significance of visual discoveries, protecting against the discovery of spurious structure.

  8. Strong indirect effects of a submersed aquatic macrophyte, Vallisneria americana, on bacterioplankton densities in a mesotrophic lake.

    PubMed

    Huss, A A; Wehr, J D

    2004-05-01

    Phytoplankton and allochthonous matter are important sources of dissolved organic carbon (DOC) for planktonic bacteria in aquatic ecosystems. But in small temperate lakes, aquatic macrophytes may also be an important source of DOC, as well as a source or sink for inorganic nutrients. We conducted micro- and mesocosm studies to investigate the possible effects of an actively growing macrophyte, Vallisneria americana, on bacterial growth and water chemistry in mesotrophic Calder Lake. A first microcosm (1 L) study conducted under high ambient NH4+ levels (NH4+ > or = 10 microM) demonstrated that macrophytes had a positive effect on bacterial densities through release of DOC and P. A second microcosm experiment, conducted under NH4+-depleted conditions (NH4+ < 10 microM), examined inter- active effects of macrophytes and their sediments on bacterial growth and water chemistry. Non-rooted macrophytes had negative effects on bacterial numbers, while rooted macrophytes had no significant effects, despite significant increases in DOC and P. A 70-L mesocosm experiment manipulated macrophytes, as well as N and P supply under surplus NH4-+conditions (NH4+ > or = 10 gmicro), and measured effects on bacterial growth, Chl a concentrations, and water chemistry. Bacterial growth and Chl a concentrations declined with macrophyte additions, while bacterial densities increased with P addition (with or without N). Results suggest that the submersed macrophyte Vallisneria exerts a strong but indirect effect on bacteria by modifying nutrient conditions and/or suppressing phytoplankton. Effects of living macrophytes differed with ambient nutrient conditions: under NH4+-surplus conditions, submersed macrophytes stimulated bacterioplankton through release of DOC or P, but in NH4-+depleted conditions, the influence of Vallisneria was negative or neutral. Effects of living macrophytes on planktonic bacteria were apparently mediated by the macrophytes use and/or release of nutrients, as well

  9. Functional Responses of Bacterioplankton Diversity and Metabolism to Experimental Bottom-Up and Top-Down Forcings.

    PubMed

    Pradeep Ram, A S; Chaibi-Slouma, S; Keshri, J; Colombet, J; Sime-Ngando, T

    2016-08-01

    We conducted an experimental approach using microcosms to simultaneously examine the functional response of natural freshwater bacterial assemblages to the impact of resources (nutrients) and top-down factors (viruses and grazers) on bacterial physiological state and their community structure. Addition of organic and inorganic nutrients led to the proliferation of high nucleic acid content bacterial cells accompanied by high bacterial growth efficiency (considered as proxy of bacterial carbon metabolism) estimates, suggesting that this subgroup represented the most active fraction of bacterial community and had a high capacity to incorporate carbon into its biomass. However, their rapid growth induced the pressure of viral lytic infection which led to their lysis toward the end of the experiment. In microcosms with flagellates plus viruses, and with viruses alone, the selective removal of metabolically active high nucleic acid cells through viral lysis benefitted the less active low nucleic acid content cells, perhaps via the use of lysis products for its growth and survival. Changes in bacterial physiological state in microcosms were reflected in their community structure which was examined using 16S ribosomal RNA (rRNA) gene sequencing by Illumina's Miseq platform. Chao estimator and Shannon diversity index values suggested that bacterial species richness was highest in the presence of both the top-down factors, indicating a tighter control of bacterioplankton dominants within a relatively stable bacterial community. The increase in bacterial metabolism with nutrient addition followed by subsequent lysis of bacterial dominants indicate that both resources and top-down factors work in concert for the sustenance of stable bacterial communities. PMID:27179523

  10. Functional Responses of Bacterioplankton Diversity and Metabolism to Experimental Bottom-Up and Top-Down Forcings.

    PubMed

    Pradeep Ram, A S; Chaibi-Slouma, S; Keshri, J; Colombet, J; Sime-Ngando, T

    2016-08-01

    We conducted an experimental approach using microcosms to simultaneously examine the functional response of natural freshwater bacterial assemblages to the impact of resources (nutrients) and top-down factors (viruses and grazers) on bacterial physiological state and their community structure. Addition of organic and inorganic nutrients led to the proliferation of high nucleic acid content bacterial cells accompanied by high bacterial growth efficiency (considered as proxy of bacterial carbon metabolism) estimates, suggesting that this subgroup represented the most active fraction of bacterial community and had a high capacity to incorporate carbon into its biomass. However, their rapid growth induced the pressure of viral lytic infection which led to their lysis toward the end of the experiment. In microcosms with flagellates plus viruses, and with viruses alone, the selective removal of metabolically active high nucleic acid cells through viral lysis benefitted the less active low nucleic acid content cells, perhaps via the use of lysis products for its growth and survival. Changes in bacterial physiological state in microcosms were reflected in their community structure which was examined using 16S ribosomal RNA (rRNA) gene sequencing by Illumina's Miseq platform. Chao estimator and Shannon diversity index values suggested that bacterial species richness was highest in the presence of both the top-down factors, indicating a tighter control of bacterioplankton dominants within a relatively stable bacterial community. The increase in bacterial metabolism with nutrient addition followed by subsequent lysis of bacterial dominants indicate that both resources and top-down factors work in concert for the sustenance of stable bacterial communities.

  11. Circular inferences in schizophrenia.

    PubMed

    Jardri, Renaud; Denève, Sophie

    2013-11-01

    A considerable number of recent experimental and computational studies suggest that subtle impairments of excitatory to inhibitory balance or regulation are involved in many neurological and psychiatric conditions. The current paper aims to relate, specifically and quantitatively, excitatory to inhibitory imbalance with psychotic symptoms in schizophrenia. Considering that the brain constructs hierarchical causal models of the external world, we show that the failure to maintain the excitatory to inhibitory balance results in hallucinations as well as in the formation and subsequent consolidation of delusional beliefs. Indeed, the consequence of excitatory to inhibitory imbalance in a hierarchical neural network is equated to a pathological form of causal inference called 'circular belief propagation'. In circular belief propagation, bottom-up sensory information and top-down predictions are reverberated, i.e. prior beliefs are misinterpreted as sensory observations and vice versa. As a result, these predictions are counted multiple times. Circular inference explains the emergence of erroneous percepts, the patient's overconfidence when facing probabilistic choices, the learning of 'unshakable' causal relationships between unrelated events and a paradoxical immunity to perceptual illusions, which are all known to be associated with schizophrenia. PMID:24065721

  12. Moment inference from tomograms

    USGS Publications Warehouse

    Day-Lewis, F. D.; Chen, Y.; Singha, K.

    2007-01-01

    Time-lapse geophysical tomography can provide valuable qualitative insights into hydrologic transport phenomena associated with aquifer dynamics, tracer experiments, and engineered remediation. Increasingly, tomograms are used to infer the spatial and/or temporal moments of solute plumes; these moments provide quantitative information about transport processes (e.g., advection, dispersion, and rate-limited mass transfer) and controlling parameters (e.g., permeability, dispersivity, and rate coefficients). The reliability of moments calculated from tomograms is, however, poorly understood because classic approaches to image appraisal (e.g., the model resolution matrix) are not directly applicable to moment inference. Here, we present a semi-analytical approach to construct a moment resolution matrix based on (1) the classic model resolution matrix and (2) image reconstruction from orthogonal moments. Numerical results for radar and electrical-resistivity imaging of solute plumes demonstrate that moment values calculated from tomograms depend strongly on plume location within the tomogram, survey geometry, regularization criteria, and measurement error. Copyright 2007 by the American Geophysical Union.

  13. Circular inferences in schizophrenia.

    PubMed

    Jardri, Renaud; Denève, Sophie

    2013-11-01

    A considerable number of recent experimental and computational studies suggest that subtle impairments of excitatory to inhibitory balance or regulation are involved in many neurological and psychiatric conditions. The current paper aims to relate, specifically and quantitatively, excitatory to inhibitory imbalance with psychotic symptoms in schizophrenia. Considering that the brain constructs hierarchical causal models of the external world, we show that the failure to maintain the excitatory to inhibitory balance results in hallucinations as well as in the formation and subsequent consolidation of delusional beliefs. Indeed, the consequence of excitatory to inhibitory imbalance in a hierarchical neural network is equated to a pathological form of causal inference called 'circular belief propagation'. In circular belief propagation, bottom-up sensory information and top-down predictions are reverberated, i.e. prior beliefs are misinterpreted as sensory observations and vice versa. As a result, these predictions are counted multiple times. Circular inference explains the emergence of erroneous percepts, the patient's overconfidence when facing probabilistic choices, the learning of 'unshakable' causal relationships between unrelated events and a paradoxical immunity to perceptual illusions, which are all known to be associated with schizophrenia.

  14. Expansion of biological pathways based on evolutionary inference

    PubMed Central

    Li, Yang; Calvo, Sarah E.; Gutman, Roee

    2014-01-01

    Summary Availability of diverse genomes makes it possible to predict gene function based on shared evolutionary history. This approach can be challenging, however, for pathways whose components do not exhibit a shared history, but rather, consist of distinct “evolutionary modules.” We introduce a computational algorithm, CLIME (clustering by inferred models of evolution), which inputs a eukaryotic species tree, homology matrix, and pathway (gene set) of interest. CLIME partitions the gene set into disjoint evolutionary modules, simultaneously learning the number of modules and a tree-based evolutionary history that defines each module. CLIME then expands each module by scanning the genome for new components that likely arose under the inferred evolutionary model. Application of CLIME to ∼1000 annotated human pathways, organelles and proteomes of yeast, red algae, and malaria, reveals unanticipated evolutionary modularity and novel, co-evolving components. CLIME is freely available and should become increasingly powerful with the growing wealth of eukaryotic genomes. PMID:24995987

  15. Transplant experiments uncover Baltic Sea basin-specific responses in bacterioplankton community composition and metabolic activities.

    PubMed

    Lindh, Markus V; Figueroa, Daniela; Sjöstedt, Johanna; Baltar, Federico; Lundin, Daniel; Andersson, Agneta; Legrand, Catherine; Pinhassi, Jarone

    2015-01-01

    Anthropogenically induced changes in precipitation are projected to generate increased river runoff to semi-enclosed seas, increasing loads of terrestrial dissolved organic matter and decreasing salinity. To determine how bacterial community structure and functioning adjust to such changes, we designed microcosm transplant experiments with Baltic Proper (salinity 7.2) and Bothnian Sea (salinity 3.6) water. Baltic Proper bacteria generally reached higher abundances than Bothnian Sea bacteria in both Baltic Proper and Bothnian Sea water, indicating higher adaptability. Moreover, Baltic Proper bacteria growing in Bothnian Sea water consistently showed highest bacterial production and beta-glucosidase activity. These metabolic responses were accompanied by basin-specific changes in bacterial community structure. For example, Baltic Proper Pseudomonas and Limnobacter populations increased markedly in relative abundance in Bothnian Sea water, indicating a replacement effect. In contrast, Roseobacter and Rheinheimera populations were stable or increased in abundance when challenged by either of the waters, indicating an adjustment effect. Transplants to Bothnian Sea water triggered the initial emergence of particular Burkholderiaceae populations, and transplants to Baltic Proper water triggered Alteromonadaceae populations. Notably, in the subsequent re-transplant experiment, a priming effect resulted in further increases to dominance of these populations. Correlated changes in community composition and metabolic activity were observed only in the transplant experiment and only at relatively high phylogenetic resolution. This suggested an importance of successional progression for interpreting relationships between bacterial community composition and functioning. We infer that priming effects on bacterial community structure by natural episodic events or climate change induced forcing could translate into long-term changes in bacterial ecosystem process rates. PMID

  16. Response of bacterioplankton activity in an Arctic fjord system to elevated pCO2: results from a mesocosm perturbation study

    NASA Astrophysics Data System (ADS)

    Piontek, J.; Borchard, C.; Sperling, M.; Schulz, K. G.; Riebesell, U.; Engel, A.

    2013-01-01

    The effect of elevated seawater carbon dioxide (CO2) on the activity of a natural bacterioplankton community in an Arctic fjord system was investigated by a mesocosm perturbation study in the frame of the European Project on Ocean Acidification (EPOCA). A pCO2 range of 175-1085 μatm was set up in nine mesocosms deployed in the Kongsfjorden (Svalbard). The activity of natural extracellular enzyme assemblages increased in response to acidification. Rates of β-glucosidase and leucine-aminopeptidase increased along the gradient of mesocosm pCO2. A decrease in seawater pH of 0.5 units almost doubled rates of both enzymes. Heterotrophic bacterial activity was closely coupled to phytoplankton productivity in this experiment. The bacterioplankton community responded to rising chlorophyll a concentrations after a lag phase of only a few days with increasing protein production and extracellular enzyme activity. Time-integrated primary production and bacterial protein production were positively correlated, strongly suggesting that higher amounts of phytoplankton-derived organic matter were assimilated by heterotrophic bacteria at increased primary production. Primary production increased under high pCO2 in this study, and it can be suggested that the efficient heterotrophic carbon utilisation had the potential to counteract the enhanced autotrophic CO2 fixation. However, our results also show that beneficial pCO2-related effects on bacterial activity can be mitigated by the top-down control of bacterial abundances in natural microbial communities.

  17. Stimulation of viral infection of bacterioplankton during a mesoscale iron fertilization experiment in the Southern Ocean

    NASA Astrophysics Data System (ADS)

    Weinbauer, M. G.; Arrieta, J.-M.; Herndl, G. J.

    2003-04-01

    A mesoscale iron fertilization in the Southern Ocean (Eisenex ) induced a phytoplankton bloom within three weeks observation as well as in an increased bacterial abundance and production. Viral abundance and viral production were stimulated as well. A virus-dilution approach was used to estimate the frequency of infected cells (FIC) and the frequency of lysogenic cells (FLC), i.e. cells with a dormant viral genome. While the FLC did not vary strongly within the iron-enriched patch and did not differ from waters outside the patch, FIC increased significantly within the iron fertilized patch. This suggests that induction of the lytic cycle in lysogenic cells was not significant. Rather, the stimulated bacterial production and abundance within the patch resulted in higher and more successful encounters between viruses and hosts and thus in higher FIC values. Consequently, the iron fertilization enhanced the influence of viral infection in the microbial food web. According to the current model, this should result a stimulation of bacterial production, since lysed bacterial cells cannot be consumed up by protists and transferred to higher trophic level; lysis products can be taken up by bacteria and thus organic carbon spins within this viral loop. Viral infection is a significant and previously overlooked factor in the carbon flow during iron fertilization experiments.

  18. Transcriptional response of bathypelagic marine bacterioplankton to the Deepwater Horizon oil spill.

    PubMed

    Rivers, Adam R; Sharma, Shalabh; Tringe, Susannah G; Martin, Jeffrey; Joye, Samantha B; Moran, Mary Ann

    2013-12-01

    The Deepwater Horizon blowout released a massive amount of oil and gas into the deep ocean between April and July 2010, stimulating microbial blooms of petroleum-degrading bacteria. To understand the metabolic response of marine microorganisms, we sequenced ≈ 66 million community transcripts that revealed the identity of metabolically active microbes and their roles in petroleum consumption. Reads were assigned to reference genes from ≈ 2700 bacterial and archaeal taxa, but most assignments (39%) were to just six genomes representing predominantly methane- and petroleum-degrading Gammaproteobacteria. Specific pathways for the degradation of alkanes, aromatic compounds and methane emerged from the metatranscriptomes, with some transcripts assigned to methane monooxygenases representing highly divergent homologs that may degrade either methane or short alkanes. The microbial community in the plume was less taxonomically and functionally diverse than the unexposed community below the plume; this was due primarily to decreased species evenness resulting from Gammaproteobacteria blooms. Surprisingly, a number of taxa (related to SAR11, Nitrosopumilus and Bacteroides, among others) contributed equal numbers of transcripts per liter in both the unexposed and plume samples, suggesting that some groups were unaffected by the petroleum inputs and blooms of degrader taxa, and may be important for re-establishing the pre-spill microbial community structure.

  19. Transcriptional response of bathypelagic marine bacterioplankton to the Deepwater Horizon oil spill

    PubMed Central

    Rivers, Adam R; Sharma, Shalabh; Tringe, Susannah G; Martin, Jeffrey; Joye, Samantha B; Moran, Mary Ann

    2013-01-01

    The Deepwater Horizon blowout released a massive amount of oil and gas into the deep ocean between April and July 2010, stimulating microbial blooms of petroleum-degrading bacteria. To understand the metabolic response of marine microorganisms, we sequenced ∼66 million community transcripts that revealed the identity of metabolically active microbes and their roles in petroleum consumption. Reads were assigned to reference genes from ∼2700 bacterial and archaeal taxa, but most assignments (39%) were to just six genomes representing predominantly methane- and petroleum-degrading Gammaproteobacteria. Specific pathways for the degradation of alkanes, aromatic compounds and methane emerged from the metatranscriptomes, with some transcripts assigned to methane monooxygenases representing highly divergent homologs that may degrade either methane or short alkanes. The microbial community in the plume was less taxonomically and functionally diverse than the unexposed community below the plume; this was due primarily to decreased species evenness resulting from Gammaproteobacteria blooms. Surprisingly, a number of taxa (related to SAR11, Nitrosopumilus and Bacteroides, among others) contributed equal numbers of transcripts per liter in both the unexposed and plume samples, suggesting that some groups were unaffected by the petroleum inputs and blooms of degrader taxa, and may be important for re-establishing the pre-spill microbial community structure. PMID:23902988

  20. Bayesian inference in geomagnetism

    NASA Technical Reports Server (NTRS)

    Backus, George E.

    1988-01-01

    The inverse problem in empirical geomagnetic modeling is investigated, with critical examination of recently published studies. Particular attention is given to the use of Bayesian inference (BI) to select the damping parameter lambda in the uniqueness portion of the inverse problem. The mathematical bases of BI and stochastic inversion are explored, with consideration of bound-softening problems and resolution in linear Gaussian BI. The problem of estimating the radial magnetic field B(r) at the earth core-mantle boundary from surface and satellite measurements is then analyzed in detail, with specific attention to the selection of lambda in the studies of Gubbins (1983) and Gubbins and Bloxham (1985). It is argued that the selection method is inappropriate and leads to lambda values much larger than those that would result if a reasonable bound on the heat flow at the CMB were assumed.

  1. BIE: Bayesian Inference Engine

    NASA Astrophysics Data System (ADS)

    Weinberg, Martin D.

    2013-12-01

    The Bayesian Inference Engine (BIE) is an object-oriented library of tools written in C++ designed explicitly to enable Bayesian update and model comparison for astronomical problems. To facilitate "what if" exploration, BIE provides a command line interface (written with Bison and Flex) to run input scripts. The output of the code is a simulation of the Bayesian posterior distribution from which summary statistics e.g. by taking moments, or determine confidence intervals and so forth, can be determined. All of these quantities are fundamentally integrals and the Markov Chain approach produces variates heta distributed according to P( heta|D) so moments are trivially obtained by summing of the ensemble of variates.

  2. Bayes factors and multimodel inference

    USGS Publications Warehouse

    Link, W.A.; Barker, R.J.; Thomson, David L.; Cooch, Evan G.; Conroy, Michael J.

    2009-01-01

    Multimodel inference has two main themes: model selection, and model averaging. Model averaging is a means of making inference conditional on a model set, rather than on a selected model, allowing formal recognition of the uncertainty associated with model choice. The Bayesian paradigm provides a natural framework for model averaging, and provides a context for evaluation of the commonly used AIC weights. We review Bayesian multimodel inference, noting the importance of Bayes factors. Noting the sensitivity of Bayes factors to the choice of priors on parameters, we define and propose nonpreferential priors as offering a reasonable standard for objective multimodel inference.

  3. GENIES: gene network inference engine based on supervised analysis.

    PubMed

    Kotera, Masaaki; Yamanishi, Yoshihiro; Moriya, Yuki; Kanehisa, Minoru; Goto, Susumu

    2012-07-01

    Gene network inference engine based on supervised analysis (GENIES) is a web server to predict unknown part of gene network from various types of genome-wide data in the framework of supervised network inference. The originality of GENIES lies in the construction of a predictive model using partially known network information and in the integration of heterogeneous data with kernel methods. The GENIES server accepts any 'profiles' of genes or proteins (e.g. gene expression profiles, protein subcellular localization profiles and phylogenetic profiles) or pre-calculated gene-gene similarity matrices (or 'kernels') in the tab-delimited file format. As a training data set to learn a predictive model, the users can choose either known molecular network information in the KEGG PATHWAY database or their own gene network data. The user can also select an algorithm of supervised network inference, choose various parameters in the method, and control the weights of heterogeneous data integration. The server provides the list of newly predicted gene pairs, maps the predicted gene pairs onto the associated pathway diagrams in KEGG PATHWAY and indicates candidate genes for missing enzymes in organism-specific metabolic pathways. GENIES (http://www.genome.jp/tools/genies/) is publicly available as one of the genome analysis tools in GenomeNet.

  4. Effect of elevated CO2 on the dynamics of particle attached and free living bacterioplankton communities in an Arctic fjord

    NASA Astrophysics Data System (ADS)

    Sperling, M.; Piontek, J.; Gerdts, G.; Wichels, A.; Schunck, H.; Roy, A.-S.; La Roche, J.; Gilbert, J.; Bittner, L.; Romac, S.; Riebesell, U.; Engel, A.

    2012-08-01

    The increase in atmospheric carbon dioxide (CO2) results in acidification of the oceans, expected to lead to the fastest drop in ocean pH in the last 300 million years, if anthropogenic emissions are continued at present rate. Due to higher solubility of gases in cold waters and increased exposure to the atmosphere by decreasing ice cover, the Arctic Ocean will be among the areas most strongly affected by ocean acidification. Yet, the response of the plankton community of high latitudes to ocean acidification has not been studied so far. This work is part of the Arctic campaign of the European Project on Ocean Acidification (EPOCA) in 2010, employing 9 in situ mesocosms of about 45 000 l each to simulate ocean acidification in Kongsfjorden, Svalbard (78°56.2' N 11°53.6' E). In the present study, we investigated effects of elevated CO2 on the composition and richness of particle attached (PA; >3 μm) and free living (FL; <3 μm >0.2 μm) bacterial communities by Automated Ribosomal Intergenic Spacer Analysis (ARISA) in 6 of the mesocosms and the surrounding fjord, ranging from 185 to 1050 initial μatm pCO2. ARISA was able to resolve about 20-30 bacterial band-classes per sample and allowed for a detailed investigation of the explicit richness. Both, the PA and the FL bacterioplankton community exhibited a strong temporal development, which was driven mainly by temperature and phytoplankton development. In response to the breakdown of a picophytoplankton bloom (phase 3 of the experiment), number of ARISA-band classes in the PA-community were reduced at low and medium CO2 (∼180-600 μatm) by about 25%, while it was more or less stable at high CO2 (∼ 650-800 μatm). We hypothesise that enhanced viral lysis and enhanced availability of organic substrates at high CO2 resulted in a more diverse PA-bacterial community in the post-bloom phase. Despite lower cell numbers and extracellular enzyme activities in the post-bloom phase, bacterial protein production was

  5. Learning to Observe "and" Infer

    ERIC Educational Resources Information Center

    Hanuscin, Deborah L.; Park Rogers, Meredith A.

    2008-01-01

    Researchers describe the need for students to have multiple opportunities and social interaction to learn about the differences between observation and inference and their role in developing scientific explanations (Harlen 2001; Simpson 2000). Helping children develop their skills of observation and inference in science while emphasizing the…

  6. Feature Inference Learning and Eyetracking

    ERIC Educational Resources Information Center

    Rehder, Bob; Colner, Robert M.; Hoffman, Aaron B.

    2009-01-01

    Besides traditional supervised classification learning, people can learn categories by inferring the missing features of category members. It has been proposed that feature inference learning promotes learning a category's internal structure (e.g., its typical features and interfeature correlations) whereas classification promotes the learning of…

  7. Improving Inferences from Multiple Methods.

    ERIC Educational Resources Information Center

    Shotland, R. Lance; Mark, Melvin M.

    1987-01-01

    Multiple evaluation methods (MEMs) can cause an inferential challenge, although there are strategies to strengthen inferences. Practical and theoretical issues involved in the use by social scientists of MEMs, three potential problems in drawing inferences from MEMs, and short- and long-term strategies for alleviating these problems are outlined.…

  8. Causal Inference in Retrospective Studies.

    ERIC Educational Resources Information Center

    Holland, Paul W.; Rubin, Donald B.

    1988-01-01

    The problem of drawing causal inferences from retrospective case-controlled studies is considered. A model for causal inference in prospective studies is applied to retrospective studies. Limitations of case-controlled studies are formulated concerning relevant parameters that can be estimated in such studies. A coffee-drinking/myocardial…

  9. Causal Inference and Developmental Psychology

    ERIC Educational Resources Information Center

    Foster, E. Michael

    2010-01-01

    Causal inference is of central importance to developmental psychology. Many key questions in the field revolve around improving the lives of children and their families. These include identifying risk factors that if manipulated in some way would foster child development. Such a task inherently involves causal inference: One wants to know whether…

  10. Methods for causal inference from gene perturbation experiments and validation

    PubMed Central

    Meinshausen, Nicolai; Hauser, Alain; Mooij, Joris M.; Peters, Jonas; Versteeg, Philip; Bühlmann, Peter

    2016-01-01

    Inferring causal effects from observational and interventional data is a highly desirable but ambitious goal. Many of the computational and statistical methods are plagued by fundamental identifiability issues, instability, and unreliable performance, especially for large-scale systems with many measured variables. We present software and provide some validation of a recently developed methodology based on an invariance principle, called invariant causal prediction (ICP). The ICP method quantifies confidence probabilities for inferring causal structures and thus leads to more reliable and confirmatory statements for causal relations and predictions of external intervention effects. We validate the ICP method and some other procedures using large-scale genome-wide gene perturbation experiments in Saccharomyces cerevisiae. The results suggest that prediction and prioritization of future experimental interventions, such as gene deletions, can be improved by using our statistical inference techniques. PMID:27382150

  11. Inferring Indel Parameters using a Simulation-based Approach

    PubMed Central

    Levy Karin, Eli; Rabin, Avigayel; Ashkenazy, Haim; Shkedy, Dafna; Avram, Oren; Cartwright, Reed A.; Pupko, Tal

    2015-01-01

    In this study, we present a novel methodology to infer indel parameters from multiple sequence alignments (MSAs) based on simulations. Our algorithm searches for the set of evolutionary parameters describing indel dynamics which best fits a given input MSA. In each step of the search, we use parametric bootstraps and the Mahalanobis distance to estimate how well a proposed set of parameters fits input data. Using simulations, we demonstrate that our methodology can accurately infer the indel parameters for a large variety of plausible settings. Moreover, using our methodology, we show that indel parameters substantially vary between three genomic data sets: Mammals, bacteria, and retroviruses. Finally, we demonstrate how our methodology can be used to simulate MSAs based on indel parameters inferred from real data sets. PMID:26537226

  12. Methods for causal inference from gene perturbation experiments and validation.

    PubMed

    Meinshausen, Nicolai; Hauser, Alain; Mooij, Joris M; Peters, Jonas; Versteeg, Philip; Bühlmann, Peter

    2016-07-01

    Inferring causal effects from observational and interventional data is a highly desirable but ambitious goal. Many of the computational and statistical methods are plagued by fundamental identifiability issues, instability, and unreliable performance, especially for large-scale systems with many measured variables. We present software and provide some validation of a recently developed methodology based on an invariance principle, called invariant causal prediction (ICP). The ICP method quantifies confidence probabilities for inferring causal structures and thus leads to more reliable and confirmatory statements for causal relations and predictions of external intervention effects. We validate the ICP method and some other procedures using large-scale genome-wide gene perturbation experiments in Saccharomyces cerevisiae The results suggest that prediction and prioritization of future experimental interventions, such as gene deletions, can be improved by using our statistical inference techniques. PMID:27382150

  13. Chloroplast Phylogenomic Inference of Green Algae Relationships

    PubMed Central

    Sun, Linhua; Fang, Ling; Zhang, Zhenhua; Chang, Xin; Penny, David; Zhong, Bojian

    2016-01-01

    The green algal phylum Chlorophyta has six diverse classes, but the phylogenetic relationship of the classes within Chlorophyta remains uncertain. In order to better understand the ancient Chlorophyta evolution, we have applied a site pattern sorting method to study compositional heterogeneity and the model fit in the green algal chloroplast genomic data. We show that the fastest-evolving sites are significantly correlated with among-site compositional heterogeneity, and these sites have a much poorer fit to the evolutionary model. Our phylogenomic analyses suggest that the class Chlorophyceae is a monophyletic group, and the classes Ulvophyceae, Trebouxiophyceae and Prasinophyceae are non-monophyletic groups. Our proposed phylogenetic tree of Chlorophyta will offer new insights to investigate ancient green algae evolution, and our analytical framework will provide a useful approach for evaluating and mitigating the potential errors of phylogenomic inferences. PMID:26846729

  14. Response of bacterioplankton activity in an Arctic fjord system to elevated pCO2: results from a mesocosm perturbation study

    NASA Astrophysics Data System (ADS)

    Piontek, J.; Borchard, C.; Sperling, M.; Schulz, K. G.; Riebesell, U.; Engel, A.

    2012-08-01

    The effect of elevated seawater carbon dioxide (CO2) on the activity of a natural bacterioplankton community in an Arctic fjord system was investigated by a mesocosm perturbation study in the frame of the European Project on Ocean Acidification (EPOCA). A pCO2 range of 175-1085 μatm was set up in nine mesocosms deployed in the Kongsfjorden (Svalbard). The bacterioplankton communities responded to rising chlorophyll a concentrations after a lag phase of only a few days with increasing protein production and extracellular enzyme activity and revealed a close coupling of heterotrophic bacterial activity to phytoplankton productivity in this experiment. The natural extracellular enzyme assemblages showed increased activity in response to moderate acidification. A decrease in seawater pH of 0.5 units roughly doubled rates of β-glucosidase and leucine-aminopeptidase. Activities of extracellular enzymes in the mesocosms were directly related to both seawater pH and primary production. Also primary production and bacterial protein production in the mesocosms at different pCO2 were positively correlated. Therefore, it can be suggested that the efficient heterotrophic carbon utilization in this Arctic microbial food web had the potential to counteract increased phytoplankton production that was achieved under elevated pCO2 in this study. However, our results also show that the transfer of beneficial pCO2-related effects on the cellular bacterial metabolism to the scale of community activity and organic matter degradation can be mitigated by the top-down control of bacterial abundances in natural microbial communities.

  15. The Pattern of Change in the Abundances of Specific Bacterioplankton Groups Is Consistent across Different Nutrient-Enriched Habitats in Crete

    PubMed Central

    Fodelianakis, Stilianos; Papageorgiou, Nafsika; Pitta, Paraskevi; Kasapidis, Panagiotis; Karakassis, Ioannis

    2014-01-01

    A common source of disturbance for coastal aquatic habitats is nutrient enrichment through anthropogenic activities. Although the water column bacterioplankton communities in these environments have been characterized in some cases, changes in α-diversity and/or the abundances of specific taxonomic groups across enriched habitats remain unclear. Here, we investigated the bacterial community changes at three different nutrient-enriched and adjacent undisturbed habitats along the north coast of Crete, Greece: a fish farm, a closed bay within a town with low water renewal rates, and a city port where the level of nutrient enrichment and the trophic status of the habitat were different. Even though changes in α-diversity were different at each site, we observed across the sites a common change pattern accounting for most of the community variation for five of the most abundant bacterial groups: a decrease in the abundance of the Pelagibacteraceae and SAR86 and an increase in the abundance of the Alteromonadaceae, Rhodobacteraceae, and Cryomorphaceae in the impacted sites. The abundances of the groups that increased and decreased in the impacted sites were significantly correlated (positively and negatively, respectively) with the total heterotrophic bacterial counts and the concentrations of dissolved organic carbon and/or dissolved nitrogen and chlorophyll α, indicating that the common change pattern was associated with nutrient enrichment. Our results provide an in situ indication concerning the association of specific bacterioplankton groups with nutrient enrichment. These groups could potentially be used as indicators for nutrient enrichment if the pattern is confirmed over a broader spatial and temporal scale by future studies. PMID:24747897

  16. Pathway network inference from gene expression data

    PubMed Central

    2014-01-01

    Background The development of high-throughput omics technologies enabled genome-wide measurements of the activity of cellular elements and provides the analytical resources for the progress of the Systems Biology discipline. Analysis and interpretation of gene expression data has evolved from the gene to the pathway and interaction level, i.e. from the detection of differentially expressed genes, to the establishment of gene interaction networks and the identification of enriched functional categories. Still, the understanding of biological systems requires a further level of analysis that addresses the characterization of the interaction between functional modules. Results We present a novel computational methodology to study the functional interconnections among the molecular elements of a biological system. The PANA approach uses high-throughput genomics measurements and a functional annotation scheme to extract an activity profile from each functional block -or pathway- followed by machine-learning methods to infer the relationships between these functional profiles. The result is a global, interconnected network of pathways that represents the functional cross-talk within the molecular system. We have applied this approach to describe the functional transcriptional connections during the yeast cell cycle and to identify pathways that change their connectivity in a disease condition using an Alzheimer example. Conclusions PANA is a useful tool to deepen in our understanding of the functional interdependences that operate within complex biological systems. We show the approach is algorithmically consistent and the inferred network is well supported by the available functional data. The method allows the dissection of the molecular basis of the functional connections and we describe the different regulatory mechanisms that explain the network's topology obtained for the yeast cell cycle data. PMID:25032889

  17. Metabolic potential of a single cell belonging to one of the most abundant lineages in freshwater bacterioplankton

    PubMed Central

    Garcia, Sarahi L; McMahon, Katherine D; Martinez-Garcia, Manuel; Srivastava, Abhishek; Sczyrba, Alexander; Stepanauskas, Ramunas; Grossart, Hans-Peter; Woyke, Tanja; Warnecke, Falk

    2013-01-01

    Actinobacteria within the acI lineage are often numerically dominating in freshwater ecosystems, where they can account for >50% of total bacteria in the surface water. However, they remain uncultured to date. We thus set out to use single-cell genomics to gain insights into their genetic make-up, with the aim of learning about their physiology and ecological niche. A representative from the highly abundant acI-B1 group was selected for shotgun genomic sequencing. We obtained a draft genomic sequence in 75 larger contigs (sum=1.16 Mb), with an unusually low genomic G+C mol% (∼42%). Actinobacteria core gene analysis suggests an almost complete genome recovery. We found that the acI-B1 cell had a small genome, with a rather low percentage of genes having no predicted functions (∼15%) as compared with other cultured and genome-sequenced microbial species. Our metabolic reconstruction hints at a facultative aerobe microorganism with many transporters and enzymes for pentoses utilization (for example, xylose). We also found an actinorhodopsin gene that may contribute to energy conservation under unfavorable conditions. This project reveals the metabolic potential of a member of the global abundant freshwater Actinobacteria. PMID:22810059

  18. Bayesian Inference: with ecological applications

    USGS Publications Warehouse

    Link, William A.; Barker, Richard J.

    2010-01-01

    This text provides a mathematically rigorous yet accessible and engaging introduction to Bayesian inference with relevant examples that will be of interest to biologists working in the fields of ecology, wildlife management and environmental studies as well as students in advanced undergraduate statistics.. This text opens the door to Bayesian inference, taking advantage of modern computational efficiencies and easily accessible software to evaluate complex hierarchical models.

  19. Active inference, communication and hermeneutics.

    PubMed

    Friston, Karl J; Frith, Christopher D

    2015-07-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others--during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions--both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then--in principle--they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa.

  20. Active inference, communication and hermeneutics☆

    PubMed Central

    Friston, Karl J.; Frith, Christopher D.

    2015-01-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others – during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions – both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then – in principle – they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa. PMID:25957007

  1. Active inference, communication and hermeneutics.

    PubMed

    Friston, Karl J; Frith, Christopher D

    2015-07-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others--during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions--both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then--in principle--they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa. PMID:25957007

  2. Advances in computer simulation of genome evolution: toward more realistic evolutionary genomics analysis by approximate bayesian computation.

    PubMed

    Arenas, Miguel

    2015-04-01

    NGS technologies present a fast and cheap generation of genomic data. Nevertheless, ancestral genome inference is not so straightforward due to complex evolutionary processes acting on this material such as inversions, translocations, and other genome rearrangements that, in addition to their implicit complexity, can co-occur and confound ancestral inferences. Recently, models of genome evolution that accommodate such complex genomic events are emerging. This letter explores these novel evolutionary models and proposes their incorporation into robust statistical approaches based on computer simulations, such as approximate Bayesian computation, that may produce a more realistic evolutionary analysis of genomic data. Advantages and pitfalls in using these analytical methods are discussed. Potential applications of these ancestral genomic inferences are also pointed out.

  3. Optimal inference with suboptimal models: Addiction and active Bayesian inference

    PubMed Central

    Schwartenbeck, Philipp; FitzGerald, Thomas H.B.; Mathys, Christoph; Dolan, Ray; Wurst, Friedrich; Kronbichler, Martin; Friston, Karl

    2015-01-01

    When casting behaviour as active (Bayesian) inference, optimal inference is defined with respect to an agent’s beliefs – based on its generative model of the world. This contrasts with normative accounts of choice behaviour, in which optimal actions are considered in relation to the true structure of the environment – as opposed to the agent’s beliefs about worldly states (or the task). This distinction shifts an understanding of suboptimal or pathological behaviour away from aberrant inference as such, to understanding the prior beliefs of a subject that cause them to behave less ‘optimally’ than our prior beliefs suggest they should behave. Put simply, suboptimal or pathological behaviour does not speak against understanding behaviour in terms of (Bayes optimal) inference, but rather calls for a more refined understanding of the subject’s generative model upon which their (optimal) Bayesian inference is based. Here, we discuss this fundamental distinction and its implications for understanding optimality, bounded rationality and pathological (choice) behaviour. We illustrate our argument using addictive choice behaviour in a recently described ‘limited offer’ task. Our simulations of pathological choices and addictive behaviour also generate some clear hypotheses, which we hope to pursue in ongoing empirical work. PMID:25561321

  4. From the Beauty of Genomic Landscapes to the Strength of Transcriptional Mechanisms.

    PubMed

    Natoli, Gioacchino

    2016-03-24

    Genomic analyses are commonly used to infer trends and broad rules underlying transcriptional control. The innovative approach by Tong et al. to interrogate genomic datasets allows extracting mechanistic information on the specific regulation of individual genes.

  5. Array Comparative Genomic Hybridizations: Assessing the ability to recapture evolutionary relationships using an in silico approach

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Comparative Genomic Hybridization (CGH) with DNA microarrays has many biological applications including surveys of copy number changes in tumorigenesis, species detection and identification, and functional genomics studies among related organisms. Array CGH has also been used to infer phylogenetic r...

  6. From the Beauty of Genomic Landscapes to the Strength of Transcriptional Mechanisms.

    PubMed

    Natoli, Gioacchino

    2016-03-24

    Genomic analyses are commonly used to infer trends and broad rules underlying transcriptional control. The innovative approach by Tong et al. to interrogate genomic datasets allows extracting mechanistic information on the specific regulation of individual genes. PMID:27015303

  7. The small genome of an abundant coastal ocean methylotroph.

    PubMed

    Giovannoni, Stephen J; Hayakawa, Darin H; Tripp, H James; Stingl, Ulrich; Givan, Scott A; Cho, Jang-Cheon; Oh, Hyun-Myung; Kitner, Joshua B; Vergin, Kevin L; Rappé, Michael S

    2008-07-01

    OM43 is a clade of uncultured beta-proteobacteria that is commonly found in environmental nucleic acid sequences from productive coastal ocean ecosystems, and some freshwater environments, but is rarely detected in ocean gyres. Ecological studies associate OM43 with phytoplankton blooms, and evolutionary relationships indicate that they might be methylotrophs. Here we report on the genome sequence and metabolic properties of the first axenic isolate of the OM43 clade, strain HTCC2181, which was obtained using new procedures for culturing cells in natural seawater. We found that this strain is an obligate methylotroph that cannot oxidize methane but can use the oxidized C1 compounds methanol and formaldehyde as sources of carbon and energy. Its complete genome is 1304 428 bp in length, the smallest yet reported for a free-living cell. The HTCC2181 genome includes genes for xanthorhodopsin and retinal biosynthesis, an auxiliary system for producing transmembrane electrochemical potentials from light. The discovery that HTCC2181 is an extremely simple specialist in C1 metabolism suggests an unanticipated, important role for oxidized C1 compounds as substrates for bacterioplankton productivity in coastal ecosystems. PMID:18393994

  8. Decoding Plant and Animal Genome Plasticity from Differential Paleo-Evolutionary Patterns and Processes

    PubMed Central

    Murat, Florent; de Peer, Yves Van; Salse, Jérôme

    2012-01-01

    Continuing advances in genome sequencing technologies and computational methods for comparative genomics currently allow inferring the evolutionary history of entire plant and animal genomes. Based on the comparison of the plant and animal genome paleohistory, major differences are unveiled in 1) evolutionary mechanisms (i.e., polyploidization versus diploidization processes), 2) genome conservation (i.e., coding versus noncoding sequence maintenance), and 3) modern genome architecture (i.e., genome organization including repeats expansion versus contraction phenomena). This article discusses how extant animal and plant genomes are the result of inherently different rates and modes of genome evolution resulting in relatively stable animal and much more dynamic and plastic plant genomes. PMID:22833223

  9. Children's Category-Based Inferences Affect Classification

    ERIC Educational Resources Information Center

    Ross, Brian H.; Gelman, Susan A.; Rosengren, Karl S.

    2005-01-01

    Children learn many new categories and make inferences about these categories. Much work has examined how children make inferences on the basis of category knowledge. However, inferences may also affect what is learned about a category. Four experiments examine whether category-based inferences during category learning influence category knowledge…

  10. Causal inference from observational data.

    PubMed

    Listl, Stefan; Jürges, Hendrik; Watt, Richard G

    2016-10-01

    Randomized controlled trials have long been considered the 'gold standard' for causal inference in clinical research. In the absence of randomized experiments, identification of reliable intervention points to improve oral health is often perceived as a challenge. But other fields of science, such as social science, have always been challenged by ethical constraints to conducting randomized controlled trials. Methods have been established to make causal inference using observational data, and these methods are becoming increasingly relevant in clinical medicine, health policy and public health research. This study provides an overview of state-of-the-art methods specifically designed for causal inference in observational data, including difference-in-differences (DiD) analyses, instrumental variables (IV), regression discontinuity designs (RDD) and fixed-effects panel data analysis. The described methods may be particularly useful in dental research, not least because of the increasing availability of routinely collected administrative data and electronic health records ('big data'). PMID:27111146

  11. We infer light in space.

    PubMed

    Schirillo, James A

    2013-10-01

    In studies of lightness and color constancy, the terms lightness and brightness refer to the qualia corresponding to perceived surface reflectance and perceived luminance, respectively. However, what has rarely been considered is the fact that the volume of space containing surfaces appears neither empty, void, nor black, but filled with light. Helmholtz (1866/1962) came closest to describing this phenomenon when discussing inferred illumination, but previous theoretical treatments have fallen short by restricting their considerations to the surfaces of objects. The present work is among the first to explore how we infer the light present in empty space. It concludes with several research examples supporting the theory that humans can infer the differential levels and chromaticities of illumination in three-dimensional space. PMID:23435628

  12. Inferring Diversity: Life After Shannon

    NASA Astrophysics Data System (ADS)

    Giffin, Adom

    The diversity of a community that cannot be fully counted must be inferred. The two preeminent inference methods are the MaxEnt method, which uses information in the form of constraints and Bayes' rule which uses information in the form of data. It has been shown that these two methods are special cases of the method of Maximum (relative) Entropy (ME). We demonstrate how this method can be used as a measure of diversity that not only reproduces the features of Shannon's index but exceeds them by allowing more types of information to be included in the inference. A specific example is solved in detail. Additionally, the entropy that is found is the same form as the thermodynamic entropy.

  13. Beyond the bounds of orthology: functional inference from metagenomic context.

    PubMed

    Vey, Gregory; Moreno-Hagelsieb, Gabriel

    2010-07-01

    The effectiveness of the computational inference of function by genomic context is bounded by the diversity of known microbial genomes. Although metagenomes offer access to previously inaccessible organisms, their fragmentary nature prevents the conventional establishment of orthologous relationships required for reliably predicting functional interactions. We introduce a protocol for the prediction of functional interactions using data sources without information about orthologous relationships. To illustrate this process, we use the Sargasso Sea metagenome to construct a functional interaction network for the Escherichia coli K12 genome. We identify two reliability metrics, target intergenic distance and source interaction count, and apply them to selectively filter the predictions retained to construct the network of functional interactions. The resulting network contains 2297 nodes with 10 072 edges with a positive predictive value of 0.80. The metagenome yielded 8423 functional interactions beyond those found using only the genomic orthologs as a data source. This amounted to a 134% increase in the total number of functional interactions that are predicted by combining the metagenome and the genomic orthologs versus the genomic orthologs alone. In the absence of detectable orthologous relationships it remains feasible to derive a reliable set of predicted functional interactions. This offers a strategy for harnessing other metagenomes and homologs in general. Because metagenomes allow access to previously unreachable microorganisms, this will result in expanding the universe of known functional interactions thus furthering our understanding of functional organization. PMID:20419183

  14. Beyond the bounds of orthology: functional inference from metagenomic context.

    PubMed

    Vey, Gregory; Moreno-Hagelsieb, Gabriel

    2010-07-01

    The effectiveness of the computational inference of function by genomic context is bounded by the diversity of known microbial genomes. Although metagenomes offer access to previously inaccessible organisms, their fragmentary nature prevents the conventional establishment of orthologous relationships required for reliably predicting functional interactions. We introduce a protocol for the prediction of functional interactions using data sources without information about orthologous relationships. To illustrate this process, we use the Sargasso Sea metagenome to construct a functional interaction network for the Escherichia coli K12 genome. We identify two reliability metrics, target intergenic distance and source interaction count, and apply them to selectively filter the predictions retained to construct the network of functional interactions. The resulting network contains 2297 nodes with 10 072 edges with a positive predictive value of 0.80. The metagenome yielded 8423 functional interactions beyond those found using only the genomic orthologs as a data source. This amounted to a 134% increase in the total number of functional interactions that are predicted by combining the metagenome and the genomic orthologs versus the genomic orthologs alone. In the absence of detectable orthologous relationships it remains feasible to derive a reliable set of predicted functional interactions. This offers a strategy for harnessing other metagenomes and homologs in general. Because metagenomes allow access to previously unreachable microorganisms, this will result in expanding the universe of known functional interactions thus furthering our understanding of functional organization.

  15. Perception, illusions and Bayesian inference.

    PubMed

    Nour, Matthew M; Nour, Joseph M

    2015-01-01

    Descriptive psychopathology makes a distinction between veridical perception and illusory perception. In both cases a perception is tied to a sensory stimulus, but in illusions the perception is of a false object. This article re-examines this distinction in light of new work in theoretical and computational neurobiology, which views all perception as a form of Bayesian statistical inference that combines sensory signals with prior expectations. Bayesian perceptual inference can solve the 'inverse optics' problem of veridical perception and provides a biologically plausible account of a number of illusory phenomena, suggesting that veridical and illusory perceptions are generated by precisely the same inferential mechanisms.

  16. Inferring biotic interactions from proxies.

    PubMed

    Morales-Castilla, Ignacio; Matias, Miguel G; Gravel, Dominique; Araújo, Miguel B

    2015-06-01

    Inferring biotic interactions from functional, phylogenetic and geographical proxies remains one great challenge in ecology. We propose a conceptual framework to infer the backbone of biotic interaction networks within regional species pools. First, interacting groups are identified to order links and remove forbidden interactions between species. Second, additional links are removed by examination of the geographical context in which species co-occur. Third, hypotheses are proposed to establish interaction probabilities between species. We illustrate the framework using published food-webs in terrestrial and marine systems. We conclude that preliminary descriptions of the web of life can be made by careful integration of data with theory.

  17. The Causal Meaning of Genomic Predictors and How It Affects Construction and Comparison of Genome-Enabled Selection Models

    PubMed Central

    Valente, Bruno D.; Morota, Gota; Peñagaricano, Francisco; Gianola, Daniel; Weigel, Kent; Rosa, Guilherme J. M.

    2015-01-01

    The term “effect” in additive genetic effect suggests a causal meaning. However, inferences of such quantities for selection purposes are typically viewed and conducted as a prediction task. Predictive ability as tested by cross-validation is currently the most acceptable criterion for comparing models and evaluating new methodologies. Nevertheless, it does not directly indicate if predictors reflect causal effects. Such evaluations would require causal inference methods that are not typical in genomic prediction for selection. This suggests that the usual approach to infer genetic effects contradicts the label of the quantity inferred. Here we investigate if genomic predictors for selection should be treated as standard predictors or if they must reflect a causal effect to be useful, requiring causal inference methods. Conducting the analysis as a prediction or as a causal inference task affects, for example, how covariates of the regression model are chosen, which may heavily affect the magnitude of genomic predictors and therefore selection decisions. We demonstrate that selection requires learning causal genetic effects. However, genomic predictors from some models might capture noncausal signal, providing good predictive ability but poorly representing true genetic effects. Simulated examples are used to show that aiming for predictive ability may lead to poor modeling decisions, while causal inference approaches may guide the construction of regression models that better infer the target genetic effect even when they underperform in cross-validation tests. In conclusion, genomic selection models should be constructed to aim primarily for identifiability of causal genetic effects, not for predictive ability. PMID:25908318

  18. The Causal Meaning of Genomic Predictors and How It Affects Construction and Comparison of Genome-Enabled Selection Models.

    PubMed

    Valente, Bruno D; Morota, Gota; Peñagaricano, Francisco; Gianola, Daniel; Weigel, Kent; Rosa, Guilherme J M

    2015-06-01

    The term "effect" in additive genetic effect suggests a causal meaning. However, inferences of such quantities for selection purposes are typically viewed and conducted as a prediction task. Predictive ability as tested by cross-validation is currently the most acceptable criterion for comparing models and evaluating new methodologies. Nevertheless, it does not directly indicate if predictors reflect causal effects. Such evaluations would require causal inference methods that are not typical in genomic prediction for selection. This suggests that the usual approach to infer genetic effects contradicts the label of the quantity inferred. Here we investigate if genomic predictors for selection should be treated as standard predictors or if they must reflect a causal effect to be useful, requiring causal inference methods. Conducting the analysis as a prediction or as a causal inference task affects, for example, how covariates of the regression model are chosen, which may heavily affect the magnitude of genomic predictors and therefore selection decisions. We demonstrate that selection requires learning causal genetic effects. However, genomic predictors from some models might capture noncausal signal, providing good predictive ability but poorly representing true genetic effects. Simulated examples are used to show that aiming for predictive ability may lead to poor modeling decisions, while causal inference approaches may guide the construction of regression models that better infer the target genetic effect even when they underperform in cross-validation tests. In conclusion, genomic selection models should be constructed to aim primarily for identifiability of causal genetic effects, not for predictive ability.

  19. Biochemical composition of pico-, nano- and micro-particulate organic matter and bacterioplankton biomass in the oligotrophic Cretan Sea (NE Mediterranean)

    NASA Astrophysics Data System (ADS)

    Danovaro, Roberto; Dell'Anno, Antonio; Pusceddu, Antonio; Daniela Marrale; Della Croce, Norberto; Fabiano, Mauro; Tselepides, Anastasios

    2000-08-01

    The biochemical composition of different particle size classes (pico-, nano- and micro-particulate matter) and the bacterioplankton biomass were studied over an annual cycle in the Cretan Sea (South Aegean Sea, NE Mediterranean; from 40 to 1540 m depth) to investigate the origin, composition and fate of the suspended particles and to quantify bacterioplankton contribution to organic carbon pools. The oligotrophy of this system was indicated by the extremely low particulate lipid, protein and carbohydrate concentrations (4-15 times lower than in more productive systems). The biopolymeric carbon (BPC as the sum of lipid, protein and carbohydrate carbon) accounted for 80-100% of POC, suggesting the autochthonous origin of the particles. The most evident characteristic of this oligotrophic environment was the dominance of the pico-particles through all seasons, accounting for 43-45% of total carbohydrates, proteins and lipids. The proximate composition of the organic particles revealed the dominance of carbohydrates in all size-classes and highest values of the protein to carbohydrate ratio in the pico-particulate fraction. The relative proportion of the pico-, nano- and micro-particulate carbohydrates, proteins and lipids varied seasonally. The increase in the average particle size from February to September 95, probably as a result of aggregation, appeared to be related to the ‘thermal stability’ of the water column. The analysis of the vertical distribution of the three size classes revealed an increase in the pico fraction and a decrease in the larger components with increasing depth suggesting that nano- and micro-particles were being degraded and fragmented in the deeper water layers. Bacterial densities ranged from 1.1 to 8.8 x 10 8 cells l -1. Bacterial biomass accounted on average for more than 56% (up to 74%) of BPC and was by far, the most important living component. Bacterial-N accounted for a large proportion (>90%) of the protein nitrogen pool

  20. Perceptual Inference and Autistic Traits

    ERIC Educational Resources Information Center

    Skewes, Joshua C; Jegindø, Else-Marie; Gebauer, Line

    2015-01-01

    Autistic people are better at perceiving details. Major theories explain this in terms of bottom-up sensory mechanisms or in terms of top-down cognitive biases. Recently, it has become possible to link these theories within a common framework. This framework assumes that perception is implicit neural inference, combining sensory evidence with…

  1. Science Shorts: Observation versus Inference

    ERIC Educational Resources Information Center

    Leager, Craig R.

    2008-01-01

    When you observe something, how do you know for sure what you are seeing, feeling, smelling, or hearing? Asking students to think critically about their encounters with the natural world will help to strengthen their understanding and application of the science-process skills of observation and inference. In the following lesson, students make…

  2. Sample Size and Correlational Inference

    ERIC Educational Resources Information Center

    Anderson, Richard B.; Doherty, Michael E.; Friedrich, Jeff C.

    2008-01-01

    In 4 studies, the authors examined the hypothesis that the structure of the informational environment makes small samples more informative than large ones for drawing inferences about population correlations. The specific purpose of the studies was to test predictions arising from the signal detection simulations of R. B. Anderson, M. E. Doherty,…

  3. Improving Explanatory Inferences from Assessments

    ERIC Educational Resources Information Center

    Diakow, Ronli Phyllis

    2013-01-01

    This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…

  4. Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent

    PubMed Central

    Wen, Dingqiao; Yu, Yun; Nakhleh, Luay

    2016-01-01

    The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation. PMID:27144273

  5. Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent.

    PubMed

    Wen, Dingqiao; Yu, Yun; Nakhleh, Luay

    2016-05-01

    The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation. PMID:27144273

  6. Permanent draft genomes of the Rhodopirellula maiorica strain SM1.

    PubMed

    Richter, Michael; Richter-Heitmann, Tim; Klindworth, Anna; Wegner, Carl-Eric; Frank, Carsten S; Harder, Jens; Glöckner, Frank Oliver

    2014-02-01

    The genome of Rhodopirellula maiorica strain SM1 was sequenced as a permanent draft to complement the full genome sequence of the type strain Rhodopirellula baltica SH1(T). This isolate is part of a larger study to infer the biogeography of Rhodopirellula species in European marine waters, as well as to amend the genus description of R. baltica. This genomics resource article is the fifth of a series of five publications reporting in total eight new permanent daft genomes of Rhodopirellula species.

  7. How to infer gene networks from expression profiles, revisited.

    PubMed

    Penfold, Christopher A; Wild, David L

    2011-12-01

    Inferring the topology of a gene-regulatory network (GRN) from genome-scale time-series measurements of transcriptional change has proved useful for disentangling complex biological processes. To address the challenges associated with this inference, a number of competing approaches have previously been used, including examples from information theory, Bayesian and dynamic Bayesian networks (DBNs), and ordinary differential equation (ODE) or stochastic differential equation. The performance of these competing approaches have previously been assessed using a variety of in silico and in vivo datasets. Here, we revisit this work by assessing the performance of more recent network inference algorithms, including a novel non-parametric learning approach based upon nonlinear dynamical systems. For larger GRNs, containing hundreds of genes, these non-parametric approaches more accurately infer network structures than do traditional approaches, but at significant computational cost. For smaller systems, DBNs are competitive with the non-parametric approaches with respect to computational time and accuracy, and both of these approaches appear to be more accurate than Granger causality-based methods and those using simple ODEs models.

  8. Inference of directional selection and mutation parameters assuming equilibrium.

    PubMed

    Vogl, Claus; Bergman, Juraj

    2015-12-01

    In a classical study, Wright (1931) proposed a model for the evolution of a biallelic locus under the influence of mutation, directional selection and drift. He derived the equilibrium distribution of the allelic proportion conditional on the scaled mutation rate, the mutation bias and the scaled strength of directional selection. The equilibrium distribution can be used for inference of these parameters with genome-wide datasets of "site frequency spectra" (SFS). Assuming that the scaled mutation rate is low, Wright's model can be approximated by a boundary-mutation model, where mutations are introduced into the population exclusively from sites fixed for the preferred or unpreferred allelic states. With the boundary-mutation model, inference can be partitioned: (i) the shape of the SFS distribution within the polymorphic region is determined by random drift and directional selection, but not by the mutation parameters, such that inference of the selection parameter relies exclusively on the polymorphic sites in the SFS; (ii) the mutation parameters can be inferred from the amount of polymorphic and monomorphic preferred and unpreferred alleles, conditional on the selection parameter. Herein, we derive maximum likelihood estimators for the mutation and selection parameters in equilibrium and apply the method to simulated SFS data as well as empirical data from a Madagascar population of Drosophila simulans.

  9. Ecological Genomics of the Uncultivated Marine Roseobacter Lineage CHAB-I-5

    PubMed Central

    Zhang, Yao; Sun, Ying; Jiao, Nianzhi; Stepanauskas, Ramunas

    2016-01-01

    Members of the marine Roseobacter clade are major participants in global carbon and sulfur cycles. While roseobacters are well represented in cultures, several abundant pelagic lineages, including SAG-O19, DC5-80-3, and NAC11-7, remain largely uncultivated and show evidence of genome streamlining. Here, we analyzed the partial genomes of three single cells affiliated with CHAB-I-5, another abundant but exclusively uncultivated Roseobacter lineage. Members of this lineage encode several metabolic potentials that are absent in streamlined genomes. Examples are quorum sensing and type VI secretion systems, which enable them to effectively interact with host and other bacteria. Further analysis of the CHAB-I-5 single-cell amplified genomes (SAGs) predicted that this lineage comprises members with relatively large genomes (4.1 to 4.4 Mbp) and a high fraction of noncoding DNA (10 to 12%), which is similar to what is observed in many cultured, nonstreamlined Roseobacter lineages. The four uncultured lineages, while exhibiting highly variable geographic distributions, together represent >60% of the global pelagic roseobacters. They are consistently enriched in genes encoding the capabilities of light harvesting, oxidation of “energy-rich” reduced sulfur compounds and methylated amines, uptake and catabolism of various carbohydrates and osmolytes, and consumption of abundant exudates from phytoplankton. These traits may define the global prevalence of the four lineages among marine bacterioplankton. PMID:26826224

  10. Selection Maintains Low Genomic GC Content in Marine SAR11 Lineages.

    PubMed

    Luo, Haiwei; Thompson, Luke R; Stingl, Ulrich; Hughes, Austin L

    2015-10-01

    The genomic G+C content of ocean bacteria varies from below 30% to over 60%. This broad range of base composition is likely shaped by distinct mutational processes, recombination, effective population size, and selection driven by environmental factors. A number of studies have hypothesized that depletion of G/C in genomes of marine bacterioplankton cells is an adaptation to the nitrogen-poor pelagic oceans, but they failed to disentangle environmental factors from mutational biases and population history. Here, we reconstructed the evolutionary changes of bases at synonymous sites in genomes of two marine SAR11 populations and a freshwater counterpart with its evolutionary origin rooted in the marine lineage. Although they all have similar genome sizes, DNA repair gene repertoire, and base compositions, there is a stronger bias toward A/T changes, a reduced frequency of nitrogenous amino acids, and an exclusive occurrence of polyamine, opine, and taurine transport systems in the ocean populations, consistent with a greater nitrogen stress in surface oceans compared with freshwater lakes. Furthermore, the ratio of nonsynoymous to synonymous nucleotide diversity is not statistically distinguishable among these populations, suggesting that population history has a limited effect. Taken together, the ecological transition of SAR11 from ocean to freshwater habitats makes nitrogen more available to these organisms, and thus relaxation of purifying selection drove a genome-wide reduction in the frequency of G/C to A/T changes in the freshwater population.

  11. Inferring human stem cell behaviour from epigenetic drift.

    PubMed

    Shibata, D

    2009-01-01

    Although conventional experimental manipulations are impractical, it may be possible to infer human stem cell fates by 'reading' histories recorded within their genomes. Genomes are almost perfect copies of copies, and ancestries may be surreptitiously recorded by replication errors that inevitably accumulate. The greater the number of divisions, the greater the number of replication errors ('a molecular clock hypothesis'). Mutations rarely occur during a lifetime, but DNA methylation patterns are also copied after DNA replication and measurably drift with ageing at certain CpG sites in mitotic tissues, such as the colon. Such passenger methylation pattern variation may effectively function as 'epigenetic' somatic cell mitotic clocks. Replication errors can only accumulate in long-lived stem cell lineages, so methylation pattern drift largely records stem cell behaviour. How methylation patterns may encode stem cell ancestries is illustrated with two types of small reproductive units--colon crypt niches with continuous genealogies, and hair follicles with punctuated genealogies. Potentially, the genealogy of any human cell may be inferred by 'reading' its genome. PMID:19031430

  12. Enhanced viral production and virus-mediated mortality of bacterioplankton in a natural iron-fertilized bloom event above the Kerguelen Plateau

    NASA Astrophysics Data System (ADS)

    Malits, A.; Christaki, U.; Obernosterer, I.; Weinbauer, M. G.

    2014-12-01

    Above the Kerguelen Plateau in the Southern Ocean natural iron fertilization sustains a large phytoplankton bloom over 3 months during austral summer. During the KEOPS1 project (KErguelen Ocean and Plateau compared Study1) we sampled this phytoplankton bloom during its declining phase along with the surrounding high-nutrient-low-chlorophyll (HNLC) waters to study the effect of natural iron fertilization on the role of viruses in the microbial food web. Bacterial and viral abundances were 1.7 and 2.1 times, respectively, higher within the bloom than in HNLC waters. Viral production and virus-mediated mortality of bacterioplankton were 4.1 and 4.9 times, respectively, higher in the bloom, while the fraction of infected cells (FIC) and the fraction of lysogenic cells (FLC) showed no significant differences between environments. The present study suggests viruses to be more important for bacterial mortality within the bloom and dominate over grazing of heterotrophic nanoflagellates (HNFs) during the late bloom phase. As a consequence, at least at a late bloom stage, viral lysis shunts part of the photosynthetically fixed carbon in iron-fertilized regions into the dissolved organic matter (DOM) pool with potentially less particulate organic carbon transferred to larger members of the food web or exported.

  13. Enhanced viral production and virus-mediated mortality of bacterioplankton in a natural iron-fertilized bloom event above the Kerguelen Plateau

    NASA Astrophysics Data System (ADS)

    Malits, A.; Christaki, U.; Obernosterer, I.; Weinbauer, M. G.

    2014-07-01

    Above the Kerguelen Plateau in the Southern Ocean natural iron fertilization sustains a large phytoplankton bloom over three months during austral summer. During the KEOPS1 project (KErguelen Ocean and Plateau compared Study1) we sampled this phytoplankton bloom during its declining phase along with the surrounding HNLC waters to study the effect of natural iron fertilization on the role of viruses in the microbial food web. Bacterial and viral abundances were 1.7 and 2.1 times, respectively, higher within the bloom than in HNLC waters. Viral production and virus-mediated mortality of bacterioplankton was 4.1 and 4.9 times, respectively, higher in the bloom, while the fraction of infected cells (FIC) and the fraction of lysogenic cells (FLC) showed no significant differences between environments. The present study suggests viruses to be more important for bacterial mortality within the bloom and dominate over protozoan grazing during the late bloom phase. As a consequence, at least at a late bloom stage, viral lysis shunts part of the photosynthetically fixed carbon in iron-fertilized regions into the dissolved organic matter (DOM) pool with potentially less particulate organic carbon transfered to larger members of the food web or exported.

  14. Diversity and activity of marine bacterioplankton during a diatom bloom in the North Sea assessed by total RNA and pyrotag sequencing.

    PubMed

    Klindworth, Anna; Mann, Alexander J; Huang, Sixing; Wichels, Antje; Quast, Christian; Waldmann, Jost; Teeling, Hanno; Glöckner, Frank Oliver

    2014-12-01

    A recent investigation of bacterioplankton communities in the German Bight towards the end of a diatom-dominated spring phytoplankton bloom revealed pronounced successions of distinct bacterial clades. A combination of metagenomics and metaproteomics indicated that these clades had distinct substrate spectra and consumed different algal substrates. In this study we re-analyzed samples from the initial study by total community RNA (metatranscriptomics) and 16S rRNA gene amplicon sequencing. This complementary approach provided new insights into the community composition and expressed genes as well as the assessment of metabolic activity levels of distinct clades. Flavobacteria (genera Ulvibacter, Formosa, and Polaribacter), Alphaproteobacteria (SAR11 clade and Rhodobacteraceae) and Gammaproteobacteria (genus Reinekea and SAR92 clade) were the most abundant taxa. Mapping of the metatranscriptome data on assembled and taxonomically classified metagenome data of the same samples substantiated that Formosa and Polaribacter acted as major algal polymer degraders, whereas Rhodobacteraceae and Reinekea spp. exhibited less specialized substrate spectra. In addition, we found that members of the Rhodobacteraceae and SAR92 clade showed high metabolic activity levels, which suggests that these clades played a more important role during the bloom event as indicated by their in situ abundances.

  15. Bacterioplankton dynamics along the gradient from highly eutrophic Pearl River Estuary to oligotrophic northern South China Sea in wet season: implication for anthropogenic inputs.

    PubMed

    Zhou, Weihua; Long, Aimin; Jiang, Tao; Chen, Shaoyong; Huang, Liangmin; Huang, Hui; Cai, Chuanghua; Yan, Yan

    2011-04-01

    Bacterioplankton abundance (BA) and biomass (BB) from the eutrophic Pearl River Estuary (PRE) to the oligotrophic northern South China Sea (NSCS) were studied in the wet season. BA was significantly higher (p < 0.05) in PRE (12.51 ± 3.52 x 10⁸ cells L⁻¹), than in the continental shelf neritic province (CSNP, 4.95 ± 2.21 x 10⁸ cells L⁻¹) and in the deep oceanic province (OP, 3.16 ± 1.56 x 10⁸ cells L⁻¹). Nutrient-replete PRE waters (DIN > 100 μM and PO₄ > 1 μM) resulted in high chl a and BB, whereas nutrient-depleted offshore waters (DIN < 5 μM and PO₄ < 0.5 μM) had low biomass. Temperature (> 26 °C) was not the controlling factor of BA. BB was significantly correlated with chl a biomass both in PRE and NSCS. The bacteria to phytoplankton biomass (BB/PB) ratio increased clearly along the gradient from near-shore PRE (0.15) to offshore CSNP (0.93) and deep OP (2.75), indicating the important role of small cells in the open ocean compared to estuarine and coastal zones.

  16. Homoeologs: What Are They and How Do We Infer Them?

    PubMed

    Glover, Natasha M; Redestig, Henning; Dessimoz, Christophe

    2016-07-01

    The evolutionary history of nearly all flowering plants includes a polyploidization event. Homologous genes resulting from allopolyploidy are commonly referred to as 'homoeologs', although this term has not always been used precisely or consistently in the literature. With several allopolyploid genome sequencing projects under way, there is a pressing need for computational methods for homoeology inference. Here we review the definition of homoeology in historical and modern contexts and propose a precise and testable definition highlighting the connection between homoeologs and orthologs. In the second part, we survey experimental and computational methods of homoeolog inference, considering the strengths and limitations of each approach. Establishing a precise and evolutionarily meaningful definition of homoeology is essential for understanding the evolutionary consequences of polyploidization.

  17. Relevance of different prior knowledge sources for inferring gene interaction networks.

    PubMed

    Olsen, Catharina; Bontempi, Gianluca; Emmert-Streib, Frank; Quackenbush, John; Haibe-Kains, Benjamin

    2014-01-01

    When inferring networks from high-throughput genomic data, one of the main challenges is the subsequent validation of these networks. In the best case scenario, the true network is partially known from previous research results published in structured databases or research articles. Traditionally, inferred networks are validated against these known interactions. Whenever the recovery rate is gauged to be high enough, subsequent high scoring but unknown inferred interactions are deemed good candidates for further experimental validation. Therefore such validation framework strongly depends on the quantity and quality of published interactions and presents serious pitfalls: (1) availability of these known interactions for the studied problem might be sparse; (2) quantitatively comparing different inference algorithms is not trivial; and (3) the use of these known interactions for validation prevents their integration in the inference procedure. The latter is particularly relevant as it has recently been showed that integration of priors during network inference significantly improves the quality of inferred networks. To overcome these problems when validating inferred networks, we recently proposed a data-driven validation framework based on single gene knock-down experiments. Using this framework, we were able to demonstrate the benefits of integrating prior knowledge and expression data. In this paper we used this framework to assess the quality of different sources of prior knowledge on their own and in combination with different genomic data sets in colorectal cancer. We observed that most prior sources lead to significant F-scores. Furthermore, their integration with genomic data leads to a significant increase in F-scores, especially for priors extracted from full text PubMed articles, known co-expression modules and genetic interactions. Lastly, we observed that the results are consistent for three different data sets: experimental knock-down data and two

  18. A genomic perspective on hybridization and speciation

    PubMed Central

    Payseur, Bret A.; Rieseberg, Loren H.

    2016-01-01

    Hybridization among diverging lineages is common in nature. Genomic data provide a special opportunity to characterize the history of hybridization and the genetic basis of speciation. We review existing methods and empirical studies to identify recent advances in the genomics of hybridization, as well as issues that need to be addressed. Notable progress has been made in the development of methods for detecting hybridization and inferring individual ancestries. However, few approaches reconstruct the magnitude and timing of gene flow, estimate the fitness of hybrids or incorporate knowledge of recombination rate. Empirical studies indicate that the genomic consequences of hybridization are complex, including a highly heterogeneous landscape of differentiation. Inferred characteristics of hybridization differ substantially among species groups. Loci showing unusual patterns – which may contribute to reproductive barriers – are usually scattered throughout the genome, with potential enrichment in sex chromosomes and regions of reduced recombination. We caution against the growing trend of interpreting genomic variation in summary statistics across genomes as evidence of differential gene flow. We argue that converting genomic patterns into useful inferences about hybridization will ultimately require models and methods that directly incorporate key ingredients of speciation, including the dynamic nature of gene flow, selection acting in hybrid populations and recombination rate variation. PMID:26836441

  19. Network Plasticity as Bayesian Inference

    PubMed Central

    Legenstein, Robert; Maass, Wolfgang

    2015-01-01

    General results from statistical learning theory suggest to understand not only brain computations, but also brain plasticity as probabilistic inference. But a model for that has been missing. We propose that inherently stochastic features of synaptic plasticity and spine motility enable cortical networks of neurons to carry out probabilistic inference by sampling from a posterior distribution of network configurations. This model provides a viable alternative to existing models that propose convergence of parameters to maximum likelihood values. It explains how priors on weight distributions and connection probabilities can be merged optimally with learned experience, how cortical networks can generalize learned information so well to novel experiences, and how they can compensate continuously for unforeseen disturbances of the network. The resulting new theory of network plasticity explains from a functional perspective a number of experimental data on stochastic aspects of synaptic plasticity that previously appeared to be quite puzzling. PMID:26545099

  20. Bayesian inference on proportional elections.

    PubMed

    Brunello, Gabriel Hideki Vatanabe; Nakano, Eduardo Yoshio

    2015-01-01

    Polls for majoritarian voting systems usually show estimates of the percentage of votes for each candidate. However, proportional vote systems do not necessarily guarantee the candidate with the most percentage of votes will be elected. Thus, traditional methods used in majoritarian elections cannot be applied on proportional elections. In this context, the purpose of this paper was to perform a Bayesian inference on proportional elections considering the Brazilian system of seats distribution. More specifically, a methodology to answer the probability that a given party will have representation on the chamber of deputies was developed. Inferences were made on a Bayesian scenario using the Monte Carlo simulation technique, and the developed methodology was applied on data from the Brazilian elections for Members of the Legislative Assembly and Federal Chamber of Deputies in 2010. A performance rate was also presented to evaluate the efficiency of the methodology. Calculations and simulations were carried out using the free R statistical software.

  1. Bayesian Inference on Proportional Elections

    PubMed Central

    Brunello, Gabriel Hideki Vatanabe; Nakano, Eduardo Yoshio

    2015-01-01

    Polls for majoritarian voting systems usually show estimates of the percentage of votes for each candidate. However, proportional vote systems do not necessarily guarantee the candidate with the most percentage of votes will be elected. Thus, traditional methods used in majoritarian elections cannot be applied on proportional elections. In this context, the purpose of this paper was to perform a Bayesian inference on proportional elections considering the Brazilian system of seats distribution. More specifically, a methodology to answer the probability that a given party will have representation on the chamber of deputies was developed. Inferences were made on a Bayesian scenario using the Monte Carlo simulation technique, and the developed methodology was applied on data from the Brazilian elections for Members of the Legislative Assembly and Federal Chamber of Deputies in 2010. A performance rate was also presented to evaluate the efficiency of the methodology. Calculations and simulations were carried out using the free R statistical software. PMID:25786259

  2. Statistical learning and selective inference

    PubMed Central

    Taylor, Jonathan; Tibshirani, Robert J.

    2015-01-01

    We describe the problem of “selective inference.” This addresses the following challenge: Having mined a set of data to find potential associations, how do we properly assess the strength of these associations? The fact that we have “cherry-picked”—searched for the strongest associations—means that we must set a higher bar for declaring significant the associations that we see. This challenge becomes more important in the era of big data and complex statistical modeling. The cherry tree (dataset) can be very large and the tools for cherry picking (statistical learning methods) are now very sophisticated. We describe some recent new developments in selective inference and illustrate their use in forward stepwise regression, the lasso, and principal components analysis. PMID:26100887

  3. Causal inference based on counterfactuals

    PubMed Central

    Höfler, M

    2005-01-01

    Background The counterfactual or potential outcome model has become increasingly standard for causal inference in epidemiological and medical studies. Discussion This paper provides an overview on the counterfactual and related approaches. A variety of conceptual as well as practical issues when estimating causal effects are reviewed. These include causal interactions, imperfect experiments, adjustment for confounding, time-varying exposures, competing risks and the probability of causation. It is argued that the counterfactual model of causal effects captures the main aspects of causality in health sciences and relates to many statistical procedures. Summary Counterfactuals are the basis of causal inference in medicine and epidemiology. Nevertheless, the estimation of counterfactual differences pose several difficulties, primarily in observational studies. These problems, however, reflect fundamental barriers only when learning from observations, and this does not invalidate the counterfactual concept. PMID:16159397

  4. Cortical circuits for perceptual inference.

    PubMed

    Friston, Karl; Kiebel, Stefan

    2009-10-01

    This paper assumes that cortical circuits have evolved to enable inference about the causes of sensory input received by the brain. This provides a principled specification of what neural circuits have to achieve. Here, we attempt to address how the brain makes inferences by casting inference as an optimisation problem. We look at how the ensuing recognition dynamics could be supported by directed connections and message-passing among neuronal populations, given our knowledge of intrinsic and extrinsic neuronal connections. We assume that the brain models the world as a dynamic system, which imposes causal structure on the sensorium. Perception is equated with the optimisation or inversion of this internal model, to explain sensory input. Given a model of how sensory data are generated, we use a generic variational approach to model inversion to furnish equations that prescribe recognition; i.e., the dynamics of neuronal activity that represents the causes of sensory input. Here, we focus on a model whose hierarchical and dynamical structure enables simulated brains to recognise and predict sequences of sensory states. We first review these models and their inversion under a variational free-energy formulation. We then show that the brain has the necessary infrastructure to implement this inversion and present stimulations using synthetic birds that generate and recognise birdsongs.

  5. Cortical circuits for perceptual inference.

    PubMed

    Friston, Karl; Kiebel, Stefan

    2009-10-01

    This paper assumes that cortical circuits have evolved to enable inference about the causes of sensory input received by the brain. This provides a principled specification of what neural circuits have to achieve. Here, we attempt to address how the brain makes inferences by casting inference as an optimisation problem. We look at how the ensuing recognition dynamics could be supported by directed connections and message-passing among neuronal populations, given our knowledge of intrinsic and extrinsic neuronal connections. We assume that the brain models the world as a dynamic system, which imposes causal structure on the sensorium. Perception is equated with the optimisation or inversion of this internal model, to explain sensory input. Given a model of how sensory data are generated, we use a generic variational approach to model inversion to furnish equations that prescribe recognition; i.e., the dynamics of neuronal activity that represents the causes of sensory input. Here, we focus on a model whose hierarchical and dynamical structure enables simulated brains to recognise and predict sequences of sensory states. We first review these models and their inversion under a variational free-energy formulation. We then show that the brain has the necessary infrastructure to implement this inversion and present stimulations using synthetic birds that generate and recognise birdsongs. PMID:19635656

  6. Comparative analysis of chromosome counts infers three paleopolyploidies in the mollusca.

    PubMed

    Hallinan, Nathaniel M; Lindberg, David R

    2011-01-01

    The study of paleopolyploidies requires the comparison of multiple whole genome sequences. If the branches of a phylogeny on which a whole-genome duplication (WGD) occurred could be identified before genome sequencing, taxa could be selected that provided a better assessment of that genome duplication. Here, we describe a likelihood model in which the number of chromosomes in a genome evolves according to a Markov process with one rate of chromosome duplication and loss that is proportional to the number of chromosomes in the genome and another stochastic rate at which every chromosome in the genome could duplicate in a single event. We compare the maximum likelihoods of a model in which the genome duplication rate varies to one in which it is fixed at zero using the Akaike information criterion, to determine if a model with WGDs is a good fit for the data. Once it has been determined that the data does fit the WGD model, we infer the phylogenetic position of paleopolyploidies by calculating the posterior probability that a WGD occurred on each branch of the taxon tree. Here, we apply this model to a molluscan tree represented by 124 taxa and infer three putative WGD events. In the Gastropoda, we identify a single branch within the Hypsogastropoda and one of two branches at the base of the Stylommatophora. We also identify one or two branches near the base of the Cephalopoda.

  7. Homology-Independent Metrics for Comparative Genomics

    PubMed Central

    Coutinho, Tarcisio José Domingos; Franco, Glória Regina; Lobo, Francisco Pereira

    2015-01-01

    A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference. PMID:26029354

  8. Category Representation for Classification and Feature Inference

    ERIC Educational Resources Information Center

    Johansen, Mark K.; Kruschke, John K.

    2005-01-01

    This research's purpose was to contrast the representations resulting from learning of the same categories by either classifying instances or inferring instance features. Prior inference learning research, particularly T. Yamauchi and A. B. Markman (1998), has suggested that feature inference learning fosters prototype representation, whereas…

  9. Genomic reconstruction of the transcriptional regulatory network in Bacillus subtilis.

    PubMed

    Leyn, Semen A; Kazanov, Marat D; Sernova, Natalia V; Ermakova, Ekaterina O; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-06-01

    The adaptation of microorganisms to their environment is controlled by complex transcriptional regulatory networks (TRNs), which are still only partially understood even for model species. Genome scale annotation of regulatory features of genes and TRN reconstruction are challenging tasks of microbial genomics. We used the knowledge-driven comparative-genomics approach implemented in the RegPredict Web server to infer TRN in the model Gram-positive bacterium Bacillus subtilis and 10 related Bacillales species. For transcription factor (TF) regulons, we combined the available information from the DBTBS database and the literature with bioinformatics tools, allowing inference of TF binding sites (TFBSs), comparative analysis of the genomic context of predicted TFBSs, functional assignment of target genes, and effector prediction. For RNA regulons, we used known RNA regulatory motifs collected in the Rfam database to scan genomes and analyze the genomic context of new RNA sites. The inferred TRN in B. subtilis comprises regulons for 129 TFs and 24 regulatory RNA families. First, we analyzed 66 TF regulons with previously known TFBSs in B. subtilis and projected them to other Bacillales genomes, resulting in refinement of TFBS motifs and identification of novel regulon members. Second, we inferred motifs and described regulons for 28 experimentally studied TFs with previously unknown TFBSs. Third, we discovered novel motifs and reconstructed regulons for 36 previously uncharacterized TFs. The inferred collection of regulons is available in the RegPrecise database (http://regprecise.lbl.gov/) and can be used in genetic experiments, metabolic modeling, and evolutionary analysis.

  10. On the importance of being structured: instantaneous coalescence rates and human evolution--lessons for ancestral population size inference?

    PubMed

    Mazet, O; Rodríguez, W; Grusea, S; Boitard, S; Chikhi, L

    2016-04-01

    Most species are structured and influenced by processes that either increased or reduced gene flow between populations. However, most population genetic inference methods assume panmixia and reconstruct a history characterized by population size changes. This is potentially problematic as population structure can generate spurious signals of population size change through time. Moreover, when the model assumed for demographic inference is misspecified, genomic data will likely increase the precision of misleading if not meaningless parameters. For instance, if data were generated under an n-island model (characterized by the number of islands and migrants exchanged) inference based on a model of population size change would produce precise estimates of a bottleneck that would be meaningless. In addition, archaeological or climatic events around the bottleneck's timing might provide a reasonable but potentially misleading scenario. In a context of model uncertainty (panmixia versus structure) genomic data may thus not necessarily lead to improved statistical inference. We consider two haploid genomes and develop a theory that explains why any demographic model with structure will necessarily be interpreted as a series of changes in population size by inference methods ignoring structure. We formalize a parameter, the inverse instantaneous coalescence rate, and show that it is equivalent to a population size only in panmictic models, and is mostly misleading for structured models. We argue that this issue affects all population genetics methods ignoring population structure which may thus infer population size changes that never took place. We apply our approach to human genomic data. PMID:26647653

  11. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-08-01

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. PMID:26224704

  12. sick: The Spectroscopic Inference Crank

    NASA Astrophysics Data System (ADS)

    Casey, Andrew R.

    2016-03-01

    There exists an inordinate amount of spectral data in both public and private astronomical archives that remain severely under-utilized. The lack of reliable open-source tools for analyzing large volumes of spectra contributes to this situation, which is poised to worsen as large surveys successively release orders of magnitude more spectra. In this article I introduce sick, the spectroscopic inference crank, a flexible and fast Bayesian tool for inferring astrophysical parameters from spectra. sick is agnostic to the wavelength coverage, resolving power, or general data format, allowing any user to easily construct a generative model for their data, regardless of its source. sick can be used to provide a nearest-neighbor estimate of model parameters, a numerically optimized point estimate, or full Markov Chain Monte Carlo sampling of the posterior probability distributions. This generality empowers any astronomer to capitalize on the plethora of published synthetic and observed spectra, and make precise inferences for a host of astrophysical (and nuisance) quantities. Model intensities can be reliably approximated from existing grids of synthetic or observed spectra using linear multi-dimensional interpolation, or a Cannon-based model. Additional phenomena that transform the data (e.g., redshift, rotational broadening, continuum, spectral resolution) are incorporated as free parameters and can be marginalized away. Outlier pixels (e.g., cosmic rays or poorly modeled regimes) can be treated with a Gaussian mixture model, and a noise model is included to account for systematically underestimated variance. Combining these phenomena into a scalar-justified, quantitative model permits precise inferences with credible uncertainties on noisy data. I describe the common model features, the implementation details, and the default behavior, which is balanced to be suitable for most astronomical applications. Using a forward model on low-resolution, high signal

  13. Universum Inference and Corpus Homogeneity

    NASA Astrophysics Data System (ADS)

    Vogel, Carl; Lynch, Gerard; Janssen, Jerom

    Universum Inference is re-interpreted for assessment of corpus homogeneity in computational stylometry. Recent stylometric research quantifies strength of characterization within dramatic works by assessing the homogeneity of corpora associated with dramatic personas. A methodological advance is suggested to mitigate the potential for the assessment of homogeneity to be achieved by chance. Baseline comparison analysis is constructed for contributions to debates by nonfictional participants: the corpus analyzed consists of transcripts of US Presidential and Vice-Presidential debates from the 2000 election cycle. The corpus is also analyzed in translation to Italian, Spanish and Portuguese. Adding randomized categories makes assessments of homogeneity more conservative.

  14. Inside the Pan-genome - Methods and Software Overview

    PubMed Central

    Guimarães, Luis Carlos; Florczak-Wyspianska, Jolanta; de Jesus, Leandro Benevides; Viana, Marcus Vinícius Canário; Silva, Artur; Ramos, Rommel Thiago Jucá; Soares, Siomar de Castro; Soares, Siomar de Castro

    2015-01-01

    The number of genomes that have been deposited in databases has increased exponentially after the advent of Next-Generation Sequencing (NGS), which produces high-throughput sequence data; this circumstance has demanded the development of new bioinformatics software and the creation of new areas, such as comparative genomics. In comparative genomics, the genetic content of an organism is compared against other organisms, which helps in the prediction of gene function and coding region sequences, identification of evolutionary events and determination of phylogenetic relationships. However, expanding comparative genomics to a large number of related bacteria, we can infer their lifestyles, gene repertoires and minimal genome size. In this context, a powerful approach called Pan-genome has been initiated and developed. This approach involves the genomic comparison of different strains of the same species, or even genus. Its main goal is to establish the total number of non-redundant genes that are present in a determined dataset. Pan-genome consists of three parts: core genome; accessory or dispensable genome; and species-specific or strain-specific genes. Furthermore, pan-genome is considered to be “open” as long as new genes are added significantly to the total repertoire for each new additional genome and “closed” when the newly added genomes cannot be inferred to significantly increase the total repertoire of the genes. To perform all of the required calculations, a substantial amount of software has been developed, based on orthologous and paralogous gene identification. PMID:27006628

  15. Accurate Inference of Local Phased Ancestry of Modern Admixed Populations

    PubMed Central

    Ma, Yamin; Zhao, Jian; Wong, Jian-Syuan; Ma, Li; Li, Wenzhi; Fu, Guoxing; Xu, Wei; Zhang, Kui; Kittles, Rick A.; Li, Yun; Song, Qing

    2014-01-01

    Population stratification is a growing concern in genetic-association studies. Averaged ancestry at the genome level (global ancestry) is insufficient for detecting the population substructures and correcting population stratifications in association studies. Local and phase stratification are needed for human genetic studies, but current technologies cannot be applied on the entire genome data due to various technical caveats. Here we developed a novel approach (aMAP, ancestry of Modern Admixed Populations) for inferring local phased ancestry. It took about 3 seconds on a desktop computer to finish a local ancestry analysis for each human genome with 1.4-million SNPs. This method also exhibits the scalability to larger datasets with respect to the number of SNPs, the number of samples, and the size of reference panels. It can detect the lack of the proxy of reference panels. The accuracy was 99.4%. The aMAP software has a capacity for analyzing 6-way admixed individuals. As the biomedical community continues to expand its efforts to increase the representation of diverse populations, and as the number of large whole-genome sequence datasets continues to grow rapidly, there is an increasing demand on rapid and accurate local ancestry analysis in genetics, pharmacogenomics, population genetics, and clinical diagnosis. PMID:25052506

  16. The deepest divergences in land plants inferred from phylogenomic evidence

    PubMed Central

    Qiu, Yin-Long; Li, Libo; Wang, Bin; Chen, Zhiduan; Knoop, Volker; Groth-Malonek, Milena; Dombrovska, Olena; Lee, Jungho; Kent, Livija; Rest, Joshua; Estabrook, George F.; Hendry, Tory A.; Taylor, David W.; Testa, Christopher M.; Ambros, Mathew; Crandall-Stotler, Barbara; Duff, R. Joel; Stech, Michael; Frey, Wolfgang; Quandt, Dietmar; Davis, Charles C.

    2006-01-01

    Phylogenetic relationships among the four major lineages of land plants (liverworts, mosses, hornworts, and vascular plants) remain vigorously contested; their resolution is essential to our understanding of the origin and early evolution of land plants. We analyzed three different complementary data sets: a multigene supermatrix, a genomic structural character matrix, and a chloroplast genome sequence matrix, using maximum likelihood, maximum parsimony, and compatibility methods. Analyses of all three data sets strongly supported liverworts as the sister to all other land plants, and analyses of the multigene and chloroplast genome matrices provided moderate to strong support for hornworts as the sister to vascular plants. These results highlight the important roles of liverworts and hornworts in two major events of plant evolution: the water-to-land transition and the change from a haploid gametophyte generation-dominant life cycle in bryophytes to a diploid sporophyte generation-dominant life cycle in vascular plants. This study also demonstrates the importance of using a multifaceted approach to resolve difficult nodes in the tree of life. In particular, it is shown here that densely sampled taxon trees built with multiple genes provide an indispensable test of taxon-sparse trees inferred from genome sequences. PMID:17030812

  17. Accurate inference of local phased ancestry of modern admixed populations.

    PubMed

    Ma, Yamin; Zhao, Jian; Wong, Jian-Syuan; Ma, Li; Li, Wenzhi; Fu, Guoxing; Xu, Wei; Zhang, Kui; Kittles, Rick A; Li, Yun; Song, Qing

    2014-01-01

    Population stratification is a growing concern in genetic-association studies. Averaged ancestry at the genome level (global ancestry) is insufficient for detecting the population substructures and correcting population stratifications in association studies. Local and phase stratification are needed for human genetic studies, but current technologies cannot be ap