sequence diversity observed: Topics by Science.gov

Sample records for sequence diversity observed

Endophyte Microbiome Diversity in Micropropagated Atriplex canescens and Atriplex torreyi var griffithsii

PubMed Central

Lucero, Mary E.; Unc, Adrian; Cooke, Peter; Dowd, Scot; Sun, Shulei

2011-01-01

Microbial diversity associated with micropropagated Atriplex species was assessed using microscopy, isolate culturing, and sequencing. Light, electron, and confocal microscopy revealed microbial cells in aseptically regenerated leaves and roots. Clone libraries and tag-encoded FLX amplicon pyrosequencing (TEFAP) analysis amplified sequences from callus homologous to diverse fungal and bacterial taxa. Culturing isolated some seed borne endophyte taxa which could be readily propagated apart from the host. Microbial cells were observed within biofilm-like residues associated with plant cell surfaces and intercellular spaces. Various universal primers amplified both plant and microbial sequences, with different primers revealing different patterns of fungal diversity. Bacterial and fungal TEFAP followed by alignment with sequences from curated databases revealed 7 bacterial and 17 ascomycete taxa in A. canescens, and 5 bacterial taxa in A. torreyi. Additional diversity was observed among isolates and clone libraries. Micropropagated Atriplex retains a complex, intimately associated microbiome which includes diverse strains well poised to interact in manners that influence host physiology. Microbiome analysis was facilitated by high throughput sequencing methods, but primer biases continue to limit recovery of diverse sequences from even moderately complex communities. PMID:21437280
Investigation of the bottleneck leading to the domestication of maize

PubMed Central

Eyre-Walker, Adam; Gaut, Rebecca L.; Hilton, Holly; Feldman, Dawn L.; Gaut, Brandon S.

1998-01-01

Maize (Zea mays ssp. mays) is genetically diverse, yet it is also morphologically distinct from its wild relatives. These two observations are somewhat contradictory: the first observation is consistent with a large historical population size for maize, but the latter observation is consistent with strong, diversity-limiting selection during maize domestication. In this study, we sampled sequence diversity, coupled with simulations of the coalescent process, to study the dynamics of a population bottleneck during the domestication of maize. To do this, we determined the DNA sequence of a 1,400-bp region of the Adh1 locus from 19 individuals representing maize, its presumed progenitor (Z. mays ssp. parviglumis), and a more distant relative (Zea luxurians). The sequence data were used to guide coalescent simulations of population bottlenecks associated with domestication. Our study confirms high genetic diversity in maize—maize contains 75% of the variation found in its progenitor and is more diverse than its wild relative, Z. luxurians—but it also suggests that sequence diversity in maize can be explained by a bottleneck of short duration and very small size. For example, the breadth of genetic diversity in maize is consistent with a founding population of only 20 individuals when the domestication event is 10 generations in length. PMID:9539756
Characterizing novel endogenous retroviruses from genetic variation inferred from short sequence reads

PubMed Central

Mourier, Tobias; Mollerup, Sarah; Vinner, Lasse; Hansen, Thomas Arn; Kjartansdóttir, Kristín Rós; Guldberg Frøslev, Tobias; Snogdal Boutrup, Torsten; Nielsen, Lars Peter; Willerslev, Eske; Hansen, Anders J.

2015-01-01

From Illumina sequencing of DNA from brain and liver tissue from the lion, Panthera leo, and tumor samples from the pike-perch, Sander lucioperca, we obtained two assembled sequence contigs with similarity to known retroviruses. Phylogenetic analyses suggest that the pike-perch retrovirus belongs to the epsilonretroviruses, and the lion retrovirus to the gammaretroviruses. To determine if these novel retroviral sequences originate from an endogenous retrovirus or from a recently integrated exogenous retrovirus, we assessed the genetic diversity of the parental sequences from which the short Illumina reads are derived. First, we showed by simulations that we can robustly infer the level of genetic diversity from short sequence reads. Second, we find that the measures of nucleotide diversity inferred from our retroviral sequences significantly exceed the level observed from Human Immunodeficiency Virus infections, prompting us to conclude that the novel retroviruses are both of endogenous origin. Through further simulations, we rule out the possibility that the observed elevated levels of nucleotide diversity are the result of co-infection with two closely related exogenous retroviruses. PMID:26493184
Diversity of virus-host systems in hypersaline Lake Retba, Senegal.

PubMed

Sime-Ngando, Télesphore; Lucas, Soizick; Robin, Agnès; Tucker, Kimberly Pause; Colombet, Jonathan; Bettarel, Yvan; Desmond, Elie; Gribaldo, Simonetta; Forterre, Patrick; Breitbart, Mya; Prangishvili, David

2011-08-01

Remarkable morphological diversity of virus-like particles was observed by transmission electron microscopy in a hypersaline water sample from Lake Retba, Senegal. The majority of particles morphologically resembled hyperthermophilic archaeal DNA viruses isolated from extreme geothermal environments. Some hypersaline viral morphotypes have not been previously observed in nature, and less than 1% of observed particles had a head-and-tail morphology, which is typical for bacterial DNA viruses. Culture-independent analysis of the microbial diversity in the sample suggested the dominance of extremely halophilic archaea. Few of the 16S sequences corresponded to known archeal genera (Haloquadratum, Halorubrum and Natronomonas), whereas the majority represented novel archaeal clades. Three sequences corresponded to a new basal lineage of the haloarchaea. Bacteria belonged to four major phyla, consistent with the known diversity in saline environments. Metagenomic sequencing of DNA from the purified virus-like particles revealed very few similarities to the NCBI non-redundant database at either the nucleotide or amino acid level. Some of the identifiable virus sequences were most similar to previously described haloarchaeal viruses, but no sequence similarities were found to archaeal viruses from extreme geothermal environments. A large proportion of the sequences had similarity to previously sequenced viral metagenomes from solar salterns. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
RTS,S/AS01 malaria vaccine mismatch observed among Plasmodium falciparum isolates from southern and central Africa and globally.

PubMed

Pringle, Julia C; Carpi, Giovanna; Almagro-Garcia, Jacob; Zhu, Sha Joe; Kobayashi, Tamaki; Mulenga, Modest; Bobanga, Thierry; Chaponda, Mike; Moss, William J; Norris, Douglas E

2018-04-26

The RTS,S/AS01 malaria vaccine encompasses the central repeats and C-terminal of Plasmodium falciparum circumsporozoite protein (PfCSP). Although no Phase II clinical trial studies observed evidence of strain-specific immunity, recent studies show a decrease in vaccine efficacy against non-vaccine strain parasites. In light of goals to reduce malaria morbidity, anticipating the effectiveness of RTS,S/AS01 is critical to planning widespread vaccine introduction. We deep sequenced C-terminal Pfcsp from 77 individuals living along the international border in Luapula Province, Zambia and Haut-Katanga Province, the Democratic Republic of the Congo (DRC) and compared translated amino acid haplotypes to the 3D7 vaccine strain. Only 5.2% of the 193 PfCSP sequences from the Zambia-DRC border region matched 3D7 at all 84 amino acids. To further contextualize the genetic diversity sampled in this study with global PfCSP diversity, we analyzed an additional 3,809 Pfcsp sequences from the Pf3k database and constructed a haplotype network representing 15 countries from Africa and Asia. The diversity observed in our samples was similar to the diversity observed in the global haplotype network. These observations underscore the need for additional research assessing genetic diversity in P. falciparum and the impact of PfCSP diversity on RTS,S/AS01 efficacy.
Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

PubMed

Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip

2018-05-06

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Comparison of the Diversity of Basidiomycetes from Dead Wood of the Manchurian fir (Abies holophylla) as Evaluated by Fruiting Body Collection, Mycelial Isolation, and 454 Sequencing.

PubMed

Jang, Yeongseon; Jang, Seokyoon; Min, Mihee; Hong, Joo-Hyun; Lee, Hanbyul; Lee, Hwanhwi; Lim, Young Woon; Kim, Jae-Jin

2015-10-01

In this study, three different methods (fruiting body collection, mycelial isolation, and 454 sequencing) were implemented to determine the diversity of wood-inhabiting basidiomycetes from dead Manchurian fir (Abies holophylla). The three methods recovered similar species richness (26 species from fruiting bodies, 32 species from mycelia, and 32 species from 454 sequencing), but Fisher's alpha, Shannon-Wiener, Simpson's diversity indices of fungal communities indicated fruiting body collection and mycelial isolation displayed higher diversity compared with 454 sequencing. In total, 75 wood-inhabiting basidiomycetes were detected. The most frequently observed species were Heterobasidion orientale (fruiting body collection), Bjerkandera adusta (mycelial isolation), and Trichaptum fusco-violaceum (454 sequencing). Only two species, Hymenochaete yasudae and Hypochnicium karstenii, were detected by all three methods. This result indicated that Manchurian fir harbors a diverse basidiomycetous fungal community and for complete estimation of fungal diversity, multiple methods should be used. Further studies are required to understand their ecology in the context of forest ecosystems.
Comparison of a High-Resolution Melting Assay to Next-Generation Sequencing for Analysis of HIV Diversity

PubMed Central

Cousins, Matthew M.; Ou, San-San; Wawer, Maria J.; Munshaw, Supriya; Swan, David; Magaret, Craig A.; Mullis, Caroline E.; Serwadda, David; Porcella, Stephen F.; Gray, Ronald H.; Quinn, Thomas C.; Donnell, Deborah; Eshleman, Susan H.

2012-01-01

Next-generation sequencing (NGS) has recently been used for analysis of HIV diversity, but this method is labor-intensive, costly, and requires complex protocols for data analysis. We compared diversity measures obtained using NGS data to those obtained using a diversity assay based on high-resolution melting (HRM) of DNA duplexes. The HRM diversity assay provides a single numeric score that reflects the level of diversity in the region analyzed. HIV gag and env from individuals in Rakai, Uganda, were analyzed in a previous study using NGS (n = 220 samples from 110 individuals). Three sequence-based diversity measures were calculated from the NGS sequence data (percent diversity, percent complexity, and Shannon entropy). The amplicon pools used for NGS were analyzed with the HRM diversity assay. HRM scores were significantly associated with sequence-based measures of HIV diversity for both gag and env (P < 0.001 for all measures). The level of diversity measured by the HRM diversity assay and NGS increased over time in both regions analyzed (P < 0.001 for all measures except for percent complexity in gag), and similar amounts of diversification were observed with both methods (P < 0.001 for all measures except for percent complexity in gag). Diversity measures obtained using the HRM diversity assay were significantly associated with those from NGS, and similar increases in diversity over time were detected by both methods. The HRM diversity assay is faster and less expensive than NGS, facilitating rapid analysis of large studies of HIV diversity and evolution. PMID:22785188
Rapid evolution of the env gene leader sequence in cats naturally infected with feline immunodeficiency virus

PubMed Central

Hughes, Joseph; Biek, Roman; Litster, Annette; Willett, Brian J.; Hosie, Margaret J.

2015-01-01

Analysing the evolution of feline immunodeficiency virus (FIV) at the intra-host level is important in order to address whether the diversity and composition of viral quasispecies affect disease progression. We examined the intra-host diversity and the evolutionary rates of the entire env and structural fragments of the env sequences obtained from sequential blood samples in 43 naturally infected domestic cats that displayed different clinical outcomes. We observed in the majority of cats that FIV env showed very low levels of intra-host diversity. We estimated that env evolved at a rate of 1.16×10−3 substitutions per site per year and demonstrated that recombinant sequences evolved faster than non-recombinant sequences. It was evident that the V3–V5 fragment of FIV env displayed higher evolutionary rates in healthy cats than in those with terminal illness. Our study provided the first evidence that the leader sequence of env, rather than the V3–V5 sequence, had the highest intra-host diversity and the highest evolutionary rate of all env fragments, consistent with this region being under a strong selective pressure for genetic variation. Overall, FIV env displayed relatively low intra-host diversity and evolved slowly in naturally infected cats. The maximum evolutionary rate was observed in the leader sequence of env. Although genetic stability is not necessarily a prerequisite for clinical stability, the higher genetic stability of FIV compared with human immunodeficiency virus might explain why many naturally infected cats do not progress rapidly to AIDS. PMID:25535323
Epstein-Barr Virus Latent Membrane Protein 1 Genetic Variability in Peripheral Blood B Cells and Oropharyngeal Fluids

PubMed Central

Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.

2014-01-01

ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed. PMID:24429365
Epstein-Barr virus latent membrane protein 1 genetic variability in peripheral blood B cells and oropharyngeal fluids.

PubMed

Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine

2014-04-01

We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed.
Ancient diversity and geographical sub-structuring in African buffalo Theileria parva populations revealed through metagenetic analysis of antigen-encoding loci.

PubMed

Hemmink, Johanneke D; Sitt, Tatjana; Pelle, Roger; de Klerk-Lorist, Lin-Mari; Shiels, Brian; Toye, Philip G; Morrison, W Ivan; Weir, William

2018-03-01

An infection and treatment protocol involving infection with a mixture of three parasite isolates and simultaneous treatment with oxytetracycline is currently used to vaccinate cattle against Theileria parva. While vaccination results in high levels of protection in some regions, little or no protection is observed in areas where animals are challenged predominantly by parasites of buffalo origin. A previous study involving sequencing of two antigen-encoding genes from a series of parasite isolates indicated that this is associated with greater antigenic diversity in buffalo-derived T. parva. The current study set out to extend these analyses by applying high-throughput sequencing to ex vivo samples from naturally infected buffalo to determine the extent of diversity in a set of antigen-encoding genes. Samples from two populations of buffalo, one in Kenya and the other in South Africa, were examined to investigate the effect of geographical distance on the nature of sequence diversity. The results revealed a number of significant findings. First, there was a variable degree of nucleotide sequence diversity in all gene segments examined, with the percentage of polymorphic nucleotides ranging from 10% to 69%. Second, large numbers of allelic variants of each gene were found in individual animals, indicating multiple infection events. Third, despite the observed diversity in nucleotide sequences, several of the gene products had highly conserved amino acid sequences, and thus represent potential candidates for vaccine development. Fourth, although compelling evidence for population differentiation between the Kenyan and South African T. parva parasites was identified, analysis of molecular variance for each gene revealed that the majority of the underlying nucleotide sequence polymorphism was common to both areas, indicating that much of this aspect of genetic variation in the parasite population arose prior to geographic separation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
The genetic diversity of merozoite surface antigen 1 (MSA-1) among Babesia bovis detected from cattle populations in Thailand, Brazil and Ghana.

PubMed

Nagano, Daisuke; Sivakumar, Thillaiampalam; De De Macedo, Alane Caine Costa; Inpankaew, Tawin; Alhassan, Andy; Igarashi, Ikuo; Yokoyama, Naoaki

2013-11-01

In the present study, we screened blood DNA samples obtained from cattle bred in Brazil (n=164) and Ghana (n=80) for Babesia bovis using a diagnostic PCR assay and found prevalences of 14.6% and 46.3%, respectively. Subsequently, the genetic diversity of B. bovis in Thailand, Brazil and Ghana was analyzed, based on the DNA sequence of merozoite surface antigen-1 (MSA-1). In Thailand, MSA-1 sequences were relatively conserved and found in a single clade of the phylogram, while Brazilian MSA-1 sequences showed high genetic diversity and were dispersed across three different clades. In contrast, the sequences from Ghanaian samples were detected in two different clades, one of which contained only a single Ghanaian sequence. The identities among the MSA-1 sequences from Thailand, Brazil and Ghana were 99.0-100%, 57.5-99.4% and 60.3-100%, respectively, while the similarities among the deduced MSA-1 amino acid sequences within the respective countries were 98.4-100%, 59.4-99.7% and 58.7-100%, respectively. These observations suggested that the genetic diversity of B. bovis based on MSA-1 sequences was higher in Brazil and Ghana than in Thailand. The current data highlight the importance of conducting extensive studies on the genetic diversity of B. bovis before designing immune control strategies in each surveyed country.
Analysis of genetic diversity and population structure of oil palm (Elaeis guineensis) from China and Malaysia based on species-specific simple sequence repeat markers.

PubMed

Zhou, L X; Xiao, Y; Xia, W; Yang, Y D

2015-12-08

Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.
From Environmental Sequences to Morphology: Observation and Characterisation of a Paulinellid Testate Amoeba (Micropyxidiella edaphonis gen. nov. sp. nov. Euglyphida, Paulinellidae) from Soil using Fluorescent in situ Hybridization.

PubMed

Tarnawski, Sonia-Estelle; Lara, Enrique

2015-05-01

High microbial diversity is revealed by environmental DNA surveys. However, nothing is known about the morphology and function of these potentially new organisms. In the course of an environmental soil diversity study, we found for the first time environmental sequences that reveal the presence of Paulinellidae (a mostly marine and marginally freshwater family of euglyphid testate amoebae) in samples of forest litter from different geographic origins. The new sequences form a basal, robust clade in the family. We used fluorescent in situ hybridization (FISH) to detect the organisms from which these sequences derived. We isolated the cells and documented them with light and scanning electron microscopy. Based on these observations, we described these organisms as Micropyxidiella edaphonis gen. nov. sp. nov. The organisms were very small testate amoebae (generally less than 10μm) with an irregular proteinaceous test. This suggests an unknown diversity in testate amoebae, and calls for extending this type of investigations to other protist groups which are known only as environmental DNA sequences. Copyright © 2015 Elsevier GmbH. All rights reserved.
Scaling laws describe memories of host-pathogen riposte in the HIV population.

PubMed

Barton, John P; Kardar, Mehran; Chakraborty, Arup K

2015-02-17

The enormous genetic diversity and mutability of HIV has prevented effective control of this virus by natural immune responses or vaccination. Evolution of the circulating HIV population has thus occurred in response to diverse, ultimately ineffective, immune selection pressures that randomly change from host to host. We show that the interplay between the diversity of human immune responses and the ways that HIV mutates to evade them results in distinct sets of sequences defined by similar collectively coupled mutations. Scaling laws that relate these sets of sequences resemble those observed in linguistics and other branches of inquiry, and dynamics reminiscent of neural networks are observed. Like neural networks that store memories of past stimulation, the circulating HIV population stores memories of host-pathogen combat won by the virus. We describe an exactly solvable model that captures the main qualitative features of the sets of sequences and a simple mechanistic model for the origin of the observed scaling laws. Our results define collective mutational pathways used by HIV to evade human immune responses, which could guide vaccine design.
Diversity of Functionally Permissive Sequences in the Receptor-Binding Site of Influenza Hemagglutinin.

PubMed

Wu, Nicholas C; Xie, Jia; Zheng, Tianqing; Nycholat, Corwin M; Grande, Geramie; Paulson, James C; Lerner, Richard A; Wilson, Ian A

2017-06-14

Influenza A virus hemagglutinin (HA) initiates viral entry by engaging host receptor sialylated glycans via its receptor-binding site (RBS). The amino acid sequence of the RBS naturally varies across avian and human influenza virus subtypes and is also evolvable. However, functional sequence diversity in the RBS has not been fully explored. Here, we performed a large-scale mutational analysis of the RBS of A/WSN/33 (H1N1) and A/Hong Kong/1/1968 (H3N2) HAs. Many replication-competent mutants not yet observed in nature were identified, including some that could escape from an RBS-targeted broadly neutralizing antibody. This functional sequence diversity is made possible by pervasive epistasis in the RBS 220-loop and can be buffered by avidity in viral receptor binding. Overall, our study reveals that the HA RBS can accommodate a much greater range of sequence diversity than previously thought, which has significant implications for the complex evolutionary interrelationships between receptor specificity and immune escape. Copyright © 2017 Elsevier Inc. All rights reserved.
Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing.

PubMed

Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan

2017-01-01

PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica).

PubMed

He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang

2015-01-01

Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.
Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil)

PubMed Central

Miller, Webb; Hayes, Vanessa M.; Ratan, Aakrosh; Petersen, Desiree C.; Wittekindt, Nicola E.; Miller, Jason; Walenz, Brian; Knight, James; Qi, Ji; Zhao, Fangqing; Wang, Qingyu; Bedoya-Reina, Oscar C.; Katiyar, Neerja; Tomsho, Lynn P.; Kasson, Lindsay McClellan; Hardie, Rae-Anne; Woodbridge, Paula; Tindall, Elizabeth A.; Bertelsen, Mads Frost; Dixon, Dale; Pyecroft, Stephen; Helgen, Kristofer M.; Lesk, Arthur M.; Pringle, Thomas H.; Patterson, Nick; Zhang, Yu; Kreiss, Alexandre; Woods, Gregory M.; Jones, Menna E.; Schuster, Stephan C.

2011-01-01

The Tasmanian devil (Sarcophilus harrisii) is threatened with extinction because of a contagious cancer known as Devil Facial Tumor Disease. The inability to mount an immune response and to reject these tumors might be caused by a lack of genetic diversity within a dwindling population. Here we report a whole-genome analysis of two animals originating from extreme northwest and southeast Tasmania, the maximal geographic spread, together with the genome from a tumor taken from one of them. A 3.3-Gb de novo assembly of the sequence data from two complementary next-generation sequencing platforms was used to identify 1 million polymorphic genomic positions, roughly one-quarter of the number observed between two genetically distant human genomes. Analysis of 14 complete mitochondrial genomes from current and museum specimens, as well as mitochondrial and nuclear SNP markers in 175 animals, suggests that the observed low genetic diversity in today's population preceded the Devil Facial Tumor Disease disease outbreak by at least 100 y. Using a genetically characterized breeding stock based on the genome sequence will enable preservation of the extant genetic diversity in future Tasmanian devil populations. PMID:21709235

Whole-genome sequencing and analyses identify high genetic heterogeneity, diversity and endemicity of rotavirus genotype P[6] strains circulating in Africa.

PubMed

Nyaga, Martin M; Tan, Yi; Seheri, Mapaseka L; Halpin, Rebecca A; Akopov, Asmik; Stucker, Karla M; Fedorova, Nadia B; Shrivastava, Susmita; Duncan Steele, A; Mwenda, Jason M; Pickett, Brett E; Das, Suman R; Jeffrey Mphahlele, M

2018-05-18

Rotavirus A (RVA) exhibits a wide genotype diversity globally. Little is known about the genetic composition of genotype P[6] from Africa. This study investigated possible evolutionary mechanisms leading to genetic diversity of genotype P[6] VP4 sequences. Phylogenetic analyses on 167 P[6] VP4 full-length sequences were conducted, which included six porcine-origin sequences. Of the 167 sequences, 57 were newly acquired through whole genome sequencing as part of this study. The other 110 sequences were all publicly-available global P[6] VP4 full-length sequences downloaded from GenBank. The strength of association between the phenotypic features and the phylogeny was also determined. A number of reassortment and mixed infections of RVA genotype P[6] strains were observed in this study. Phylogenetic analyses demostrated the extensive genetic diversity that exists among human P[6] strains, porcine-like strains, their concomitant clades/subclades and estimated that P[6] VP4 gene has a higher substitution rate with the mean of 1.05E-3 substitutions/site/year. Further, the phylogenetic analyses indicated that genotype P[6] strains were endemic in Africa, characterised by an extensive genetic diversity and long-time local evolution of the viruses. This was also supported by phylogeographic clustering and G-genotype clustering of the P[6] strains when Bayesian Tip-association Significance testing (BaTS) was applied, clearly supporting that the viruses evolved locally in Africa instead of spatial mixing among different regions. Overall, the results demonstrated that multiple mechanisms such as reassortment events, various mutations and possibly interspecies transmission account for the enormous diversity of genotype P[6] strains in Africa. These findings highlight the need for continued global surveillance of rotavirus diversity. Copyright © 2018 Elsevier B.V. All rights reserved.
Early Epstein-Barr Virus Genomic Diversity and Convergence toward the B95.8 Genome in Primary Infection.

PubMed

Weiss, Eric R; Lamers, Susanna L; Henderson, Jennifer L; Melnikov, Alexandre; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Nusbaum, Chad; Luzuriaga, Katherine

2018-01-15

Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length EBV genome sequences obtained from matched oral wash and B cell fractions from a cohort of 10 AIM patients. Both intra- and interpatient diversity were observed across the length of the entire viral genome. Diversity was most pronounced in viral genes required for establishing latent infection and persistence, with appreciable levels of diversity also detected in structural genes, including envelope glycoproteins. Interestingly, intrapatient diversity declined significantly over time ( P < 0.01), and this was particularly evident on comparison of viral genomes sequenced from B cell fractions in early primary infection and convalescence ( P < 0.001). B cell-associated viral genomes were observed to converge, becoming nearly identical to the B95.8 reference genome over time (Spearman rank-order correlation test; r = -0.5589, P = 0.0264). The reduction in diversity was most marked in the EBV latency genes. In summary, our data suggest independent convergence of diverse viral genome sequences toward a reference-like strain within a relatively short period following primary EBV infection. IMPORTANCE Identification of viral proteins with low variability and high immunogenicity is important for the development of a protective vaccine. Knowledge of genome diversity within circulating viral populations is a key step in this process, as is the expansion of intrahost genomic variation during infection. We report full-length EBV genomes sequenced from the blood and oral wash of 10 individuals early in primary infection and during convalescence. Our data demonstrate considerable diversity within the pool of circulating EBV strains, as well as within individual patients. Overall viral diversity decreased from early to persistent infection, particularly in latently infected B cells, which serve as the viral reservoir. Reduction in B cell-associated viral genome diversity coincided with a convergence toward a reference-like EBV genotype. Greater convergence positively correlated with time after infection, suggesting that the reference-like genome is the result of selection. Copyright © 2018 American Society for Microbiology.
Genetic diversity of Babesia bovis in virulent and attenuated strains.

PubMed

Mazuz, M L; Molad, T; Fish, L; Leibovitz, B; Wolkomirsky, R; Fleiderovitz, L; Shkap, V

2012-03-01

The aim of this study was to compare the genetic diversity of the single copy Bv80 gene sequences of Babesia bovis in populations of attenuated and virulent parasites. PCR/ RT-PCR followed by cloning and sequence analyses of 4 attenuated and 4 virulent strains were performed. Multiple fragments in the range of 420 to 744 bp were amplified by PCR or RT-PCR. Cloning of the PCR fragments and sequence analyses revealed the presence of mixed subpopulations in either virulent or attenuated parasites with a total of 19 variants with 12 different sequences that differed in number and type of tandem repeats. High levels of intra- and inter-strain diversity of the Bv80 gene, with the presence of mixed populations of parasites were found in both the virulent field isolates and the attenuated vaccine strains. In addition, during the attenuation process, sequence analyses showed changes in the pattern of the parasite subpopulations. Despite high polymorphism found by sequence analyses, the patterns observed and the number of repeats, order, or motifs found could not discriminate between virulent field isolates and attenuated vaccine strains of the parasite.
Diversity of phytases in the rumen.

PubMed

Nakashima, Brenda A; McAllister, Tim A; Sharma, Ranjana; Selinger, L Brent

2007-01-01

Examples of a new class of phytase related to protein tyrosine phosphatases (PTP) were recently isolated from several anaerobic bacteria from the rumen of cattle. In this study, the diversity of PTP-like phytase gene sequences in the rumen was surveyed by using the polymerase chain reaction (PCR). Two sets of degenerate primers were used to amplify sequences from rumen fluid total community DNA and genomic DNA from nine bacterial isolates. Four novel PTP-like phytase sequences were retrieved from rumen fluid, whereas all nine of the anaerobic bacterial isolates investigated in this work contained PTP-like phytase sequences. One isolate, Selenomonas lacticifex, contained two distinct PTP-like phytase sequences, suggesting that multiple phytate hydrolyzing enzymes are present in this bacterium. The degenerate primer and PCR conditions described here, as well as novel sequences obtained in this study, will provide a valuable resource for future studies on this new class of phytase. The observed diversity of microbial phytases in the rumen may account for the ability of ruminants to derive a significant proportion of their phosphorus requirements from phytate.
[Study on Microbial Diversity of Peri-implantitis Subgingival by High-throughput Sequencing].

PubMed

Li, Zhi-jie; Wang, Shao-guo; Li, Yue-hong; Tu, Dong-xiang; Liu, Shi-yun; Nie, Hong-bing; Li, Zhi-qiang; Zhang, Ju-mei

2015-07-01

To study microbial diversity of peri-implantitis subgingival with high-throughput sequencing, and investigate microbiological etiology of peri-implantitis. Subgingival plaques were sampled from the patients with peri-implantitis (D group) and non-peri-implantitis subjects (N group). The microbiological diversity of the subgingival plaques was detected by sequencing V4 region of 16S rRNA with Illumina Miseq platform. The diversity of the community structure was analyzed using Mothur software. A total of 156 507 gene sequences were detected in nine samples and 4 402 operational taxonomic units (OTUs) were found. Selenomonas, Pseudomonas, and Fusobacterium were dominant bacteria in D group, while Fusobacterium, Veillonella and Streptococcus were dominant bacteria in N group. Differences between peri-implantitis and non-peri-implantitis bacterial communities were observed at all phylogenetic levels by LEfSe, which was also found in PcoA test. The occurrence of peri-implantitis is not only related to periodontitis pathogenic microbe, but also related with the changes of oral microbial community structure. Treponema, Herbaspirillum, Butyricimonas and Phaeobacte may be closely related to the occurrence and development of peri-implantitis.
Coastal bacterioplankton community diversity along a latitudinal gradient in Latin America by means of V6 tag pyrosequencing.

PubMed

Thompson, Fabiano L; Bruce, Thiago; Gonzalez, Alessandra; Cardoso, Alexander; Clementino, Maysa; Costagliola, Marcela; Hozbor, Constanza; Otero, Ernesto; Piccini, Claudia; Peressutti, Silvia; Schmieder, Robert; Edwards, Robert; Smith, Mathew; Takiyama, Luis Roberto; Vieira, Ricardo; Paranhos, Rodolfo; Artigas, Luis Felipe

2011-02-01

The bacterioplankton diversity of coastal waters along a latitudinal gradient between Puerto Rico and Argentina was analyzed using a total of 134,197 high-quality sequences from the V6 hypervariable region of the small-subunit ribosomal RNA gene (16S rRNA) (mean length of 60 nt). Most of the OTUs were identified into Proteobacteria, Bacteriodetes, Cyanobacteria, and Actinobacteria, corresponding to approx. 80% of the total number of sequences. The number of OTUs corresponding to species varied between 937 and 1946 in the seven locations. Proteobacteria appeared at high frequency in the seven locations. An enrichment of Cyanobacteria was observed in Puerto Rico, whereas an enrichment of Bacteroidetes was detected in the Argentinian shelf and Uruguayan coastal lagoons. The highest number of sequences of Actinobacteria and Acidobacteria were obtained in the Amazon estuary mouth. The rarefaction curves and Good coverage estimator for species diversity suggested a significant coverage, with values ranging between 92 and 97% for Good coverage. Conserved taxa corresponded to aprox. 52% of all sequences. This study suggests that human-contaminated environments may influence bacterioplankton diversity.
The Diversity Present in 5140 Human Mitochondrial Genomes

PubMed Central

Pereira, Luísa; Freitas, Fernando; Fernandes, Verónica; Pereira, Joana B.; Costa, Marta D.; Costa, Stephanie; Máximo, Valdemar; Macaulay, Vincent; Rocha, Ricardo; Samuels, David C.

2009-01-01

We analyzed the current status (as of the end of August 2008) of human mitochondrial genomes deposited in GenBank, amounting to 5140 complete or coding-region sequences, in order to present an overall picture of the diversity present in the mitochondrial DNA of the global human population. To perform this task, we developed mtDNA-GeneSyn, a computer tool that identifies and exhaustedly classifies the diversity present in large genetic data sets. The diversity observed in the 5140 human mitochondrial genomes was compared with all possible transitions and transversions from the standard human mitochondrial reference genome. This comparison showed that tRNA and rRNA secondary structures have a large effect in limiting the diversity of the human mitochondrial sequences, whereas for the protein-coding genes there is a bias toward less variation at the second codon positions. The analysis of the observed amino acid variations showed a tolerance of variations that convert between the amino acids V, I, A, M, and T. This defines a group of amino acids with similar chemical properties that can interconvert by a single transition. PMID:19426953
Frequency and genetic characterization of V(DD)J recombinants in the human peripheral blood antibody repertoire.

PubMed

Briney, Bryan S; Willis, Jordan R; Hicar, Mark D; Thomas, James W; Crowe, James E

2012-09-01

Antibody heavy-chain recombination that results in the incorporation of multiple diversity (D) genes, although uncommon, contributes substantially to the diversity of the human antibody repertoire. Such recombination allows the generation of heavy chain complementarity determining region 3 (HCDR3) regions of extreme length and enables junctional regions that, because of the nucleotide bias of N-addition regions, are difficult to produce through normal V(D)J recombination. Although this non-classical recombination process has been observed infrequently, comprehensive analysis of the frequency and genetic characteristics of such events in the human peripheral blood antibody repertoire has not been possible because of the rarity of such recombinants and the limitations of traditional sequencing technologies. Here, through the use of high-throughput sequencing of the normal human peripheral blood antibody repertoire, we analysed the frequency and genetic characteristics of V(DD)J recombinants. We found that these recombinations were present in approximately 1 in 800 circulating B cells, and that the frequency was severely reduced in memory cell subsets. We also found that V(DD)J recombination can occur across the spectrum of diversity genes, indicating that virtually all recombination signal sequences that flank diversity genes are amenable to V(DD)J recombination. Finally, we observed a repertoire bias in the diversity gene repertoire at the upstream (5') position, and discovered that this bias was primarily attributable to the order of diversity genes in the genomic locus. © 2012 The Authors. Immunology © 2012 Blackwell Publishing Ltd.
Analysis of Facultative Lithotroph Distribution and Diversity on Volcanic Deposits by Use of the Large Subunit of Ribulose 1,5-Bisphosphate Carboxylase/Oxygenase†

PubMed Central

Nanba, K.; King, G. M.; Dunfield, K.

2004-01-01

A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass. PMID:15066819
Analysis of facultative lithotroph distribution and diversity on volcanic deposits by use of the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase.

PubMed

Nanba, K; King, G M; Dunfield, K

2004-04-01

A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass.
Expansion of the Preimmune Antibody Repertoire by Junctional Diversity in Bos taurus

PubMed Central

Liljavirta, Jenni; Niku, Mikael; Pessa-Morikawa, Tiina; Ekman, Anna; Iivanainen, Antti

2014-01-01

Cattle have a limited range of immunoglobulin genes which are further diversified by antigen independent somatic hypermutation in fetuses. Junctional diversity generated during somatic recombination contributes to antibody diversity but its relative significance has not been comprehensively studied. We have investigated the importance of terminal deoxynucleotidyl transferase (TdT) -mediated junctional diversity to the bovine immunoglobulin repertoire. We also searched for new bovine heavy chain diversity (IGHD) genes as the information of the germline sequences is essential to define the junctional boundaries between gene segments. New heavy chain variable genes (IGHV) were explored to address the gene usage in the fetal recombinations. Our bioinformatics search revealed five new IGHD genes, which included the longest IGHD reported so far, 154 bp. By genomic sequencing we found 26 new IGHV sequences that represent potentially new IGHV genes or allelic variants. Sequence analysis of immunoglobulin heavy chain cDNA libraries of fetal bone marrow, ileum and spleen showed 0 to 36 nontemplated N-nucleotide additions between variable, diversity and joining genes. A maximum of 8 N nucleotides were also identified in the light chains. The junctional base profile was biased towards A and T nucleotide additions (64% in heavy chain VD, 52% in heavy chain DJ and 61% in light chain VJ junctions) in contrast to the high G/C content which is usually observed in mice. Sequence analysis also revealed extensive exonuclease activity, providing additional diversity. B-lymphocyte specific TdT expression was detected in bovine fetal bone marrow by reverse transcription-qPCR and immunofluorescence. These results suggest that TdT-mediated junctional diversity and exonuclease activity contribute significantly to the size of the cattle preimmune antibody repertoire already in the fetal period. PMID:24926997
Unveiling fungal zooflagellates as members of freshwater picoeukaryotes: evidence from a molecular diversity study in a deep meromictic lake.

PubMed

Lefèvre, Emilie; Bardot, Corinne; Noël, Christophe; Carrias, Jean-François; Viscogliosi, Eric; Amblard, Christian; Sime-Ngando, Télesphore

2007-01-01

This study presents an original 18S rRNA PCR survey of the freshwater picoeukaryote community, and was designed to detect unidentified heterotrophic picoflagellates (size range 0.6-5 microm) which are prevalent throughout the year within the heterotrophic flagellate assemblage in Lake Pavin. Four clone libraries were constructed from samples collected in two contrasting zones in the lake. Computerized statistic tools have suggested that sequence retrieval was representative of the in situ picoplankton diversity. The two sampling zones exhibited similar diversity patterns but shared only about 5% of the operational taxonomic units (OTUs). Phylogenetic analysis clustered our sequences into three taxonomic groups: Alveolates (30% of OTUs), Fungi (23%) and Cercozoa (19%). Fungi thus substantially contributed to the detected diversity, as was additionally supported by direct microscopic observations of fungal zoospores and sporangia. A large fraction of the sequences belonged to parasites, including Alveolate sequences affiliated to the genus Perkinsus known as zooparasites, and chytrids that include host-specific parasitic fungi of various freshwater phytoplankton species, primarily diatoms. Phylogenetic analysis revealed five novel clades that probably include typical freshwater environmental sequences. Overall, from the unsuspected fungal diversity unveiled, we think that fungal zooflagellates have been misidentified as phagotrophic nanoflagellates in previous studies. This is in agreement with a recent experimental demonstration that zoospore-producing fungi and parasitic activity may play an important role in aquatic food webs.
Evaluation of haplotype diversity of Achatina fulica (Lissachatina) [Bowdich] from Indian sub-continent by means of 16S rDNA sequence and its phylogenetic relationships with other global populations.

PubMed

Ayyagari, Vijaya Sai; Sreerama, Krupanidhi

2017-08-01

Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.
Community phylogenetic analysis of moderately thermophilic cyanobacterial mats from China, the Philippines and Thailand.

PubMed

Hongmei, Jing; Aitchison, Jonathan C; Lacap, Donnabella C; Peerapornpisal, Yuwadee; Sompong, Udomluk; Pointing, Stephen B

2005-08-01

Most community molecular studies of thermophilic cyanobacterial mats to date have focused on Synechococcus occurring at temperatures of approximately 50-65 degrees C. These reveal that molecular diversity exceeds that indicated by morphology, and that phylogeographic lineages exist. The moderately thermophilic and generally filamentous cyanobacterial mat communities occurring at lower temperatures have not previously been investigated at the community molecular level. Here we report community diversity in mats of 42-53 degrees C recovered from previously unstudied geothermal locations. Separation of 16S rRNA gene-defined genotypes from community DNA was achieved by DGGE. Genotypic diversity was greater than morphotype diversity in all mats sampled, although genotypes generally corresponded to observed morphotypes. Thirty-six sequences were recovered from DGGE bands. Phylogenetic analyses revealed these to form novel thermophilic lineages distinct from their mesophilic counterparts, within Calothrix, Cyanothece, Fischerella, Phormidium, Pleurocapsa, Oscillatoria and Synechococcus. Where filamentous cyanobacterial sequences belonging to the same genus were recovered from the same site, these were generally closely affiliated. Location-specific sequences were observed for some genotypes recovered from geochemically similar yet spatially separated sites, thus providing evidence for phylogeographic lineages that evolve in isolation. Other genotypes were more closely affiliated to geographically remote counterparts from similar habitats suggesting that adaptation to certain niches is also important.
Function and diversity of P0 proteins among cotton leafroll dwarf virus isolates.

PubMed

Cascardo, Renan S; Arantes, Ighor L G; Silva, Tatiane F; Sachetto-Martins, Gilberto; Vaslin, Maité F S; Corrêa, Régis L

2015-08-12

The RNA silencing pathway is an important anti-viral defense mechanism in plants. As a counter defense, some members of the viral family Luteoviridae are able to evade host immunity by encoding the P0 RNA silencing suppressor protein. Here we explored the functional diversity of P0 proteins among eight cotton leafroll dwarf virus (CLRDV) isolates, a virus associated with a worldwide cotton disease known as cotton blue disease (CBD). CLRDV-infected cotton plants of different varieties were collected from five growing fields in Brazil and their P0 sequences compared to three previously obtained isolates. P0's silencing suppression activities were scored based on transient expression experiments in Nicotiana benthamiana leaves. High sequence diversity was observed among CLRDV P0 proteins, indicating that some isolates found in cotton varieties formerly resistant to CLRDV should be regarded as new genotypes within the species. All tested proteins were able to suppress local and systemic silencing, but with significantly variable degrees. All P0 proteins were able to mediate the decay of ARGONAUTE proteins, a key component of the RNA silencing machinery. The sequence diversity observed in CLRDV P0s is also reflected in their silencing suppression capabilities. However, the strength of local and systemic silencing suppression was not correlated for some proteins.
Chemical-biogeographic survey of secondary metabolism in soil.

PubMed

Charlop-Powers, Zachary; Owen, Jeremy G; Reddy, Boojala Vijay B; Ternei, Melinda A; Brady, Sean F

2014-03-11

In this study, we compare biosynthetic gene richness and diversity of 96 soil microbiomes from diverse environments found throughout the southwestern and northeastern regions of the United States. The 454-pyroseqencing of nonribosomal peptide adenylation (AD) and polyketide ketosynthase (KS) domain fragments amplified from these microbiomes provide a means to evaluate the variation of secondary metabolite biosynthetic diversity in different soil environments. Through soil composition and AD- and KS-amplicon richness analysis, we identify soil types with elevated biosynthetic potential. In general, arid soils show the richest observed biosynthetic diversity, whereas brackish sediments and pine forest soils show the least. By mapping individual environmental amplicon sequences to sequences derived from functionally characterized biosynthetic gene clusters, we identified conserved soil type-specific secondary metabolome enrichment patterns despite significant sample-to-sample sequence variation. These data are used to create chemical biogeographic distribution maps for biomedically valuable families of natural products in the environment that should prove useful for directing the discovery of bioactive natural products in the future.
Population genomics of intrapatient HIV-1 evolution

PubMed Central

Zanini, Fabio; Brodin, Johanna; Thebo, Lina; Lanz, Christa; Bratt, Göran; Albert, Jan; Neher, Richard A

2015-01-01

Many microbial populations rapidly adapt to changing environments with multiple variants competing for survival. To quantify such complex evolutionary dynamics in vivo, time resolved and genome wide data including rare variants are essential. We performed whole-genome deep sequencing of HIV-1 populations in 9 untreated patients, with 6-12 longitudinal samples per patient spanning 5-8 years of infection. The data can be accessed and explored via an interactive web application. We show that patterns of minor diversity are reproducible between patients and mirror global HIV-1 diversity, suggesting a universal landscape of fitness costs that control diversity. Reversions towards the ancestral HIV-1 sequence are observed throughout infection and account for almost one third of all sequence changes. Reversion rates depend strongly on conservation. Frequent recombination limits linkage disequilibrium to about 100bp in most of the genome, but strong hitch-hiking due to short range linkage limits diversity. DOI: http://dx.doi.org/10.7554/eLife.11282.001 PMID:26652000
Compound haplotypes at Xp11.23 and human population growth in Eurasia.

PubMed

Alonso, S; Armour, J A L

2004-09-01

To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.
HIV-1 envelope sequence-based diversity measures for identifying recent infections

PubMed Central

Kafando, Alexis; Fournier, Eric; Serhir, Bouchra; Martineau, Christine; Doualla-Bell, Florence; Sangaré, Mohamed Ndongo; Sylla, Mohamed; Chamberland, Annie; El-Far, Mohamed; Charest, Hugues

2017-01-01

Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections) env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS) using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC) of the receiver operating characteristic (ROC). Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001). Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806), gp120 C2_3 (AUC = 0.805) and gp120 V3 (AUC = 0.812). Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency. PMID:29284009
HIV-1 envelope sequence-based diversity measures for identifying recent infections.

PubMed

Kafando, Alexis; Fournier, Eric; Serhir, Bouchra; Martineau, Christine; Doualla-Bell, Florence; Sangaré, Mohamed Ndongo; Sylla, Mohamed; Chamberland, Annie; El-Far, Mohamed; Charest, Hugues; Tremblay, Cécile L

2017-01-01

Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections) env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS) using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC) of the receiver operating characteristic (ROC). Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001). Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806), gp120 C2_3 (AUC = 0.805) and gp120 V3 (AUC = 0.812). Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency.

HLA DNA Sequence Variation among Human Populations: Molecular Signatures of Demographic and Selective Events

PubMed Central

Buhler, Stéphane; Sanchez-Mazas, Alicia

2011-01-01

Molecular differences between HLA alleles vary up to 57 nucleotides within the peptide binding coding region of human Major Histocompatibility Complex (MHC) genes, but it is still unclear whether this variation results from a stochastic process or from selective constraints related to functional differences among HLA molecules. Although HLA alleles are generally treated as equidistant molecular units in population genetic studies, DNA sequence diversity among populations is also crucial to interpret the observed HLA polymorphism. In this study, we used a large dataset of 2,062 DNA sequences defined for the different HLA alleles to analyze nucleotide diversity of seven HLA genes in 23,500 individuals of about 200 populations spread worldwide. We first analyzed the HLA molecular structure and diversity of these populations in relation to geographic variation and we further investigated possible departures from selective neutrality through Tajima's tests and mismatch distributions. All results were compared to those obtained by classical approaches applied to HLA allele frequencies. Our study shows that the global patterns of HLA nucleotide diversity among populations are significantly correlated to geography, although in some specific cases the molecular information reveals unexpected genetic relationships. At all loci except HLA-DPB1, populations have accumulated a high proportion of very divergent alleles, suggesting an advantage of heterozygotes expressing molecularly distant HLA molecules (asymmetric overdominant selection model). However, both different intensities of selection and unequal levels of gene conversion may explain the heterogeneous mismatch distributions observed among the loci. Also, distinctive patterns of sequence divergence observed at the HLA-DPB1 locus suggest current neutrality but old selective pressures on this gene. We conclude that HLA DNA sequences advantageously complement HLA allele frequencies as a source of data used to explore the genetic history of human populations, and that their analysis allows a more thorough investigation of human MHC molecular evolution. PMID:21408106
New nitrogen-fixing microorganisms detected in oligotrophic oceans by amplification of nitrogenase (nifH) genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zehr, J.P.; Mellon, M.T.; Zani, S.

1998-09-01

Oligotrophic oceanic waters of the central ocean gyres typically have extremely low dissolved fixed inorganic nitrogen concentrations, but few nitrogen-fixing microorganisms from the oceanic environment have been cultivated. Nitrogenase gene (nifH) sequences amplified directly from oceanic waters showed that the open ocean contains more diverse diazotrophic microbial populations and more diverse habitats for nitrogen fixers than previously observed by classical microbiological techniques. Nitrogenase genes derived from unicellular and filamentous cyanobacteria, as well as from the {alpha} and {gamma} subdivisions of the class Proteobacteria, were found in both the Atlantic and Pacific oceans. nifH sequences that cluster phylogenetically with sequences frommore » sulfate reducers or clostridia were found associated with planktonic crustaceans. Nitrogenase sequence types obtained from invertebrates represented phylotypes distinct from the phylotypes detected in the picoplankton size fraction. The results indicate that there are in the oceanic environment several distinct potentially nitrogen-fixing microbial assemblages that include representatives of diverse phylotypes.« less
Fungal communities from the calcareous deep-sea sediments in the Southwest India Ridge revealed by Illumina sequencing technology.

PubMed

Zhang, Likui; Kang, Manyu; Huang, Yangchao; Yang, Lixiang

2016-05-01

The diversity and ecological significance of bacteria and archaea in deep-sea environments have been thoroughly investigated, but eukaryotic microorganisms in these areas, such as fungi, are poorly understood. To elucidate fungal diversity in calcareous deep-sea sediments in the Southwest India Ridge (SWIR), the internal transcribed spacer (ITS) regions of rRNA genes from two sediment metagenomic DNA samples were amplified and sequenced using the Illumina sequencing platform. The results revealed that 58-63 % and 36-42 % of the ITS sequences (97 % similarity) belonged to Basidiomycota and Ascomycota, respectively. These findings suggest that Basidiomycota and Ascomycota are the predominant fungal phyla in the two samples. We also found that Agaricomycetes, Leotiomycetes, and Pezizomycetes were the major fungal classes in the two samples. At the species level, Thelephoraceae sp. and Phialocephala fortinii were major fungal species in the two samples. Despite the low relative abundance, unidentified fungal sequences were also observed in the two samples. Furthermore, we found that there were slight differences in fungal diversity between the two sediment samples, although both were collected from the SWIR. Thus, our results demonstrate that calcareous deep-sea sediments in the SWIR harbor diverse fungi, which augment the fungal groups in deep-sea sediments. This is the first report of fungal communities in calcareous deep-sea sediments in the SWIR revealed by Illumina sequencing.
Genetic characterization of guava (psidium guajava l.) Germplasm in the United States using microsatellite markers

USDA-ARS?s Scientific Manuscript database

Genetic diversity of thirty five Psidium guajava accessions maintained at the USDA, National Plants Germplasm System, Hilo, HI, was characterized using 20 simple sequence repeat (SSR) markers. Diversity analysis detected a total of 178 alleles ranging from four to 16. The observed mean heterozygosit...
Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing

PubMed Central

Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan

2017-01-01

PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored. PMID:28632759
Cyanobacterial Diversity in Microbial Mats from the Hypersaline Lagoon System of Araruama, Brazil: An In-depth Polyphasic Study.

PubMed

Ramos, Vitor M C; Castelo-Branco, Raquel; Leão, Pedro N; Martins, Joana; Carvalhal-Gomes, Sinda; Sobrinho da Silva, Frederico; Mendonça Filho, João G; Vasconcelos, Vitor M

2017-01-01

Microbial mats are complex, micro-scale ecosystems that can be found in a wide range of environments. In the top layer of photosynthetic mats from hypersaline environments, a large diversity of cyanobacteria typically predominates. With the aim of strengthening the knowledge on the cyanobacterial diversity present in the coastal lagoon system of Araruama (state of Rio de Janeiro, Brazil), we have characterized three mat samples by means of a polyphasic approach. We have used morphological and molecular data obtained by culture-dependent and -independent methods. Moreover, we have compared different classification methodologies and discussed the outcomes, challenges, and pitfalls of these methods. Overall, we show that Araruama's lagoons harbor a high cyanobacterial diversity. Thirty-six unique morphospecies could be differentiated, which increases by more than 15% the number of morphospecies and genera already reported for the entire Araruama system. Morphology-based data were compared with the 16S rRNA gene phylogeny derived from isolate sequences and environmental sequences obtained by PCR-DGGE and pyrosequencing. Most of the 48 phylotypes could be associated with the observed morphospecies at the order level. More than one third of the sequences demonstrated to be closely affiliated (best BLAST hit results of ≥99%) with cyanobacteria from ecologically similar habitats. Some sequences had no close relatives in the public databases, including one from an isolate, being placed as "loner" sequences within different orders. This hints at hidden cyanobacterial diversity in the mats of the Araruama system, while reinforcing the relevance of using complementary approaches to study cyanobacterial diversity.
Cyanobacterial Diversity in Microbial Mats from the Hypersaline Lagoon System of Araruama, Brazil: An In-depth Polyphasic Study

PubMed Central

Ramos, Vitor M. C.; Castelo-Branco, Raquel; Leão, Pedro N.; Martins, Joana; Carvalhal-Gomes, Sinda; Sobrinho da Silva, Frederico; Mendonça Filho, João G.; Vasconcelos, Vitor M.

2017-01-01

Microbial mats are complex, micro-scale ecosystems that can be found in a wide range of environments. In the top layer of photosynthetic mats from hypersaline environments, a large diversity of cyanobacteria typically predominates. With the aim of strengthening the knowledge on the cyanobacterial diversity present in the coastal lagoon system of Araruama (state of Rio de Janeiro, Brazil), we have characterized three mat samples by means of a polyphasic approach. We have used morphological and molecular data obtained by culture-dependent and -independent methods. Moreover, we have compared different classification methodologies and discussed the outcomes, challenges, and pitfalls of these methods. Overall, we show that Araruama's lagoons harbor a high cyanobacterial diversity. Thirty-six unique morphospecies could be differentiated, which increases by more than 15% the number of morphospecies and genera already reported for the entire Araruama system. Morphology-based data were compared with the 16S rRNA gene phylogeny derived from isolate sequences and environmental sequences obtained by PCR-DGGE and pyrosequencing. Most of the 48 phylotypes could be associated with the observed morphospecies at the order level. More than one third of the sequences demonstrated to be closely affiliated (best BLAST hit results of ≥99%) with cyanobacteria from ecologically similar habitats. Some sequences had no close relatives in the public databases, including one from an isolate, being placed as “loner” sequences within different orders. This hints at hidden cyanobacterial diversity in the mats of the Araruama system, while reinforcing the relevance of using complementary approaches to study cyanobacterial diversity. PMID:28713360
Genomic distribution and estimation of nucleotide diversity in natural populations: perspectives from the collared flycatcher (Ficedula albicollis) genome.

PubMed

Dutoit, Ludovic; Burri, Reto; Nater, Alexander; Mugal, Carina F; Ellegren, Hans

2017-07-01

Properly estimating genetic diversity in populations of nonmodel species requires a basic understanding of how diversity is distributed across the genome and among individuals. To this end, we analysed whole-genome resequencing data from 20 collared flycatchers (genome size ≈1.1 Gb; 10.13 million single nucleotide polymorphisms detected). Genomewide nucleotide diversity was almost identical among individuals (mean = 0.00394, range = 0.00384-0.00401), but diversity levels varied extensively across the genome (95% confidence interval for 200-kb windows = 0.0013-0.0053). Diversity was related to selective constraint such that in comparison with intergenic DNA, diversity at fourfold degenerate sites was reduced to 85%, 3' UTRs to 82%, 5' UTRs to 70% and nondegenerate sites to 12%. There was a strong positive correlation between diversity and chromosome size, probably driven by a higher density of targets for selection on smaller chromosomes increasing the diversity-reducing effect of linked selection. Simulations exploring the ability of sequence data from a small number of genetic markers to capture the observed diversity clearly demonstrated that diversity estimation from finite sampling of such data is bound to be associated with large confidence intervals. Nevertheless, we show that precision in diversity estimation in large outbred population benefits from increasing the number of loci rather than the number of individuals. Simulations mimicking RAD sequencing showed that this approach gives accurate estimates of genomewide diversity. Based on the patterns of observed diversity and the performed simulations, we provide broad recommendations for how genetic diversity should be estimated in natural populations. © 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
HIV populations are large and accumulate high genetic diversity in a nonlinear fashion.

PubMed

Maldarelli, Frank; Kearney, Mary; Palmer, Sarah; Stephens, Robert; Mican, JoAnn; Polis, Michael A; Davey, Richard T; Kovacs, Joseph; Shao, Wei; Rock-Kress, Diane; Metcalf, Julia A; Rehm, Catherine; Greer, Sarah E; Lucey, Daniel L; Danley, Kristen; Alter, Harvey; Mellors, John W; Coffin, John M

2013-09-01

HIV infection is characterized by rapid and error-prone viral replication resulting in genetically diverse virus populations. The rate of accumulation of diversity and the mechanisms involved are under intense study to provide useful information to understand immune evasion and the development of drug resistance. To characterize the development of viral diversity after infection, we carried out an in-depth analysis of single genome sequences of HIV pro-pol to assess diversity and divergence and to estimate replicating population sizes in a group of treatment-naive HIV-infected individuals sampled at single (n = 22) or multiple, longitudinal (n = 11) time points. Analysis of single genome sequences revealed nonlinear accumulation of sequence diversity during the course of infection. Diversity accumulated in recently infected individuals at rates 30-fold higher than in patients with chronic infection. Accumulation of synonymous changes accounted for most of the diversity during chronic infection. Accumulation of diversity resulted in population shifts, but the rates of change were low relative to estimated replication cycle times, consistent with relatively large population sizes. Analysis of changes in allele frequencies revealed effective population sizes that are substantially higher than previous estimates of approximately 1,000 infectious particles/infected individual. Taken together, these observations indicate that HIV populations are large, diverse, and slow to change in chronic infection and that the emergence of new mutations, including drug resistance mutations, is governed by both selection forces and drift.
Sequence Variability and Geographic Distribution of Lassa Virus, Sierra Leone

PubMed Central

Stockelman, Michael G.; Moses, Lina M.; Park, Matthew; Stenger, David A.; Ansumana, Rashid; Bausch, Daniel G.; Lin, Baochuan

2015-01-01

Lassa virus (LASV) is endemic to parts of West Africa and causes highly fatal hemorrhagic fever. The multimammate rat (Mastomys natalensis) is the only known reservoir of LASV. Most human infections result from zoonotic transmission. The very diverse LASV genome has 4 major lineages associated with different geographic locations. We used reverse transcription PCR and resequencing microarrays to detect LASV in 41 of 214 samples from rodents captured at 8 locations in Sierra Leone. Phylogenetic analysis of partial sequences of nucleoprotein (NP), glycoprotein precursor (GPC), and polymerase (L) genes showed 5 separate clades within lineage IV of LASV in this country. The sequence diversity was higher than previously observed; mean diversity was 7.01% for nucleoprotein gene at the nucleotide level. These results may have major implications for designing diagnostic tests and therapeutic agents for LASV infections in Sierra Leone. PMID:25811712
Strong latitudinal and vertical biogeography of Synechococcus diversity in the equatorial Pacific Ocean

NASA Astrophysics Data System (ADS)

Martiny, A.; Kent, A. G.; Mouginot, C.; Baer, S. E.; Lomas, M. W.

2016-02-01

Extensive genetic diversity has been observed within Synechococcus including the presence of multiple major clades. However, the biogeography and underlying environmental drivers of these clades remain elusive. Here, we developed a new high-throughput sequencing assay using rpoC1 as marker combined with Illumina sequencing. Using this, we identified the genetic diversity of Synechococcus from 200 samples in an eastern Pacific Ocean transect between 19˚N and 3˚S. We used a placement method to identify the phylogenetic affiliation of each sequence and detected extensive diversity including multiple previously undescribed clades. We observed clear biogeographical domains, with Clade 2 dominant in the northern part of the transect, Clade CRD peaking at the equator, and Clade 1 dominant deeper in the water column throughout the transect. This biogeography, along with physical and nutrient data, suggests that Clade 2 represents a high temperature, low macronutrient ecotype, CRD a high temperature but low iron ecotype, and at least part of Clade 1 a low-light ecotype. The shift between Clade 2 and CRD occurred at 7˚N, whereas the concentration of macronutrients was low down to 4˚N, before increasing. This biogeography indicates that Synechococcus cells experience iron stress up to 7˚N despite low concentrations of phosphate and nitrate. The overall biogeography closely matched the distribution of Prochlorococcus diversity in this region, suggesting a parallel evolution of ecotypes in these two major lineages of marine Cyanobacteria.
Combined Use of 16S Ribosomal DNA and 16S rRNA To Study the Bacterial Community of Polychlorinated Biphenyl-Polluted Soil

PubMed Central

Nogales, Balbina; Moore, Edward R. B.; Llobet-Brossa, Enrique; Rossello-Mora, Ramon; Amann, Rudolf; Timmis, Kenneth N.

2001-01-01

The bacterial diversity assessed from clone libraries prepared from rRNA (two libraries) and ribosomal DNA (rDNA) (one library) from polychlorinated biphenyl (PCB)-polluted soil has been analyzed. A good correspondence of the community composition found in the two types of library was observed. Nearly 29% of the cloned sequences in the rDNA library were identical to sequences in the rRNA libraries. More than 60% of the total cloned sequence types analyzed were grouped in phylogenetic groups (a clone group with sequence similarity higher than 97% [98% for Burkholderia and Pseudomonas-type clones]) represented in both types of libraries. Some of those phylogenetic groups, mostly represented by a single (or pair) of cloned sequence type(s), were observed in only one of the types of library. An important difference between the libraries was the lack of clones representative of the Actinobacteria in the rDNA library. The PCB-polluted soil exhibited a high bacterial diversity which included representatives of two novel lineages. The apparent abundance of bacteria affiliated to the beta-subclass of the Proteobacteria, and to the genus Burkholderia in particular, was confirmed by fluorescence in situ hybridization analysis. The possible influence on apparent diversity of low template concentrations was assessed by dilution of the RNA template prior to amplification by reverse transcription-PCR. Although differences in the composition of the two rRNA libraries obtained from high and low RNA concentrations were observed, the main components of the bacterial community were represented in both libraries, and therefore their detection was not compromised by the lower concentrations of template used in this study. PMID:11282645
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
Highly Diverse Endophytic and Soil Fusarium oxysporum Populations Associated with Field-Grown Tomato Plants

PubMed Central

Demers, Jill E.; Gugino, Beth K.

2014-01-01

The diversity and genetic differentiation of populations of Fusarium oxysporum associated with tomato fields, both endophytes obtained from tomato plants and isolates obtained from soil surrounding the sampled plants, were investigated. A total of 609 isolates of F. oxysporum were obtained, 295 isolates from a total of 32 asymptomatic tomato plants in two fields and 314 isolates from eight soil cores sampled from the area surrounding the plants. Included in this total were 112 isolates from the stems of all 32 plants, a niche that has not been previously included in F. oxysporum population genetics studies. Isolates were characterized using the DNA sequence of the translation elongation factor 1α gene. A diverse population of 26 sequence types was found, although two sequence types represented nearly two-thirds of the isolates studied. The sequence types were placed in different phylogenetic clades within F. oxysporum, and endophytic isolates were not monophyletic. Multiple sequence types were found in all plants, with an average of 4.2 per plant. The population compositions differed between the two fields but not between soil samples within each field. A certain degree of differentiation was observed between populations associated with different tomato cultivars, suggesting that the host genotype may affect the composition of plant-associated F. oxysporum populations. No clear patterns of genetic differentiation were observed between endophyte populations and soil populations, suggesting a lack of specialization of endophytic isolates. PMID:25304514
Sequence analysis of the msp4 gene of Anaplasma ovis strains

USGS Publications Warehouse

de la Fuente, J.; Atkinson, M.W.; Naranjo, V.; Fernandez de Mera, I. G.; Mangold, A.J.; Keating, K.A.; Kocan, K.M.

2007-01-01

Anaplasma ovis (Rickettsiales: Anaplasmataceae) is a tick-borne pathogen of sheep, goats and wild ruminants. The genetic diversity of A. ovis strains has not been well characterized due to the lack of sequence information. In this study, we evaluated bighorn sheep (Ovis canadensis) and mule deer (Odocoileus hemionus) from Montana for infection with A. ovis by serology and sequence analysis of the msp4 gene. Antibodies to Anaplasma spp. were detected in 37% and 39% of bighorn sheep and mule deer analyzed, respectively. Four new msp4 genotypes were identified. The A. ovis msp4 sequences identified herein were analyzed together with sequences reported previously for the characterization of the genetic diversity of A. ovis strains in comparison with other Anaplasma spp. The results of these studies demonstrated that although A. ovis msp4 genotypes may vary among geographic regions and between sheep and deer hosts, the variation observed was less than the variation observed between A. marginale and A. phagocytophilum strains. The results reported herein further confirm that A. ovis infection occurs in natural wild ruminant populations in Western United States and that bighorn sheep and mule deer may serve as wildlife reservoirs of A. ovis. ?? 2006.
Investigating the diversity of the 18S SSU rRNA hyper-variable region of Theileria in cattle and Cape buffalo (Syncerus caffer) from southern Africa using a next generation sequencing approach.

PubMed

Mans, Ben J; Pienaar, Ronel; Ratabane, John; Pule, Boitumelo; Latif, Abdalla A

2016-07-01

Molecular classification and systematics of the Theileria is based on the analysis of the 18S rRNA gene. Reverse line blot or conventional sequencing approaches have disadvantages in the study of 18S rRNA diversity and a next-generation 454 sequencing approach was investigated. The 18S rRNA gene was amplified using RLB primers coupled to 96 unique sequence identifiers (MIDs). Theileria positive samples from African buffalo (672) and cattle (480) from southern Africa were combined in batches of 96 and sequenced using the GS Junior 454 sequencer to produce 825711 informative sequences. Sequences were extracted based on MIDs and analysed to identify Theileria genotypes. Genotypes observed in buffalo and cattle were confirmed in the current study, while no new genotypes were discovered. Genotypes showed specific geographic distributions, most probably linked with vector distributions. Host specificity of buffalo and cattle specific genotypes were confirmed and prevalence data as well as relative parasitemia trends indicate preference for different hosts. Mixed infections are common with African buffalo carrying more genotypes compared to cattle. Associative or exclusion co-infection profiles were observed between genotypes that may have implications for speciation and systematics: specifically that more Theileria species may exist in cattle and buffalo than currently recognized. Analysis of primers used for Theileria parva diagnostics indicate that no new genotypes will be amplified by the current primer sets confirming their specificity. T. parva SNP variants that occur in the 18S rRNA hypervariable region were confirmed. A next generation sequencing approach is useful in obtaining comprehensive knowledge regarding 18S rRNA diversity and prevalence for the Theileria, allowing for the assessment of systematics and diagnostic assays based on the 18S gene. Copyright © 2016 Elsevier GmbH. All rights reserved.
Relative profile analysis of molecular markers for identification and genetic discrimination of loaches (Pisces, Nemacheilidae).

PubMed

Patil, Tejas Suresh; Tamboli, Asif Shabodin; Patil, Swapnil Mahadeo; Bhosale, Amrut Ravindra; Govindwar, Sanjay Prabhu; Muley, Dipak Vishwanathrao

2016-01-01

Genus Nemacheilus, Nemachilichthys and Schistura belong to the family Nemacheilidae of the order Cypriniformes. The present investigation was undertaken to observe genetic diversity, phylogenetic relationship and to develop a molecular-based tool for taxonomic identification. For this purpose, four different types of molecular markers were utilized in which 29 random amplified polymorphic DNA (RAPD), 25 inter-simple sequence repeat (ISSR) markers, and 10 amplified fragment length polymorphism (AFLP) marker sets were screened and mitochondrial COI gene was sequenced. This study added COI barcodes for the identification of Nemacheilus anguilla, Nemachilichthys rueppelli and Schistura denisoni. RAPD showed higher polymorphism (100%) than the ISSR (93.75-100%) and AFLP (93.86-98.96%). The polymorphic information content (PIC), heterozygosity, multiplex ratio, and gene diversity was observed highest for AFLP primers, whereas the major allele frequency was observed higher for RAPD (0.5556) and lowest for AFLP (0.1667). The COI region of all individuals was successfully amplified and sequenced, which gave a 100% species resolution. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Microbial community structure in a full-scale anaerobic treatment plant during start-up and first year of operation revealed by high-throughput 16S rRNA gene amplicon sequencing.

PubMed

Fykse, Else Marie; Aarskaug, Tone; Madslien, Elisabeth H; Dybwad, Marius

2016-12-01

High-throughput amplicon sequencing of six biomass samples from a full-scale anaerobic reactor at a Norwegian wood and pulp factory using Biothane Biobed Expanded Granular Sludge Bed (EGSB) technology during start-up and first year of operation was performed. A total of 106,166 16S rRNA gene sequences (V3-V5 region) were obtained. The number of operational taxonomic units (OTUs) ranged from 595 to 2472, and a total of 38 different phyla and 143 families were observed. The predominant phyla were Bacteroidetes, Chloroflexi, Firmicutes, Proteobacteria, and Spirochaetes. A more diverse microbial community was observed in the inoculum biomass coming from an Upflow Anaerobic Sludge Blanket (USAB) reactor, reflecting an adaptation of the inoculum diversity to the specific conditions of the new reactor. In addition, no taxa classified as obligate pathogens were identified and potentially opportunistic pathogens were absent or observed in low abundances. No Legionella bacteria were identified by traditional culture-based and molecular methods. Copyright © 2016 Elsevier Ltd. All rights reserved.
A Comparison of Anammox Bacterial Abundance and Community Structures in Three Different Emerged Plants-Related Sediments.

PubMed

Chu, Jinyu; Zhang, Jinping; Zhou, Xiaohong; Liu, Biao; Li, Yimin

2015-09-01

Quantitative polymerase chain reaction (qPCR) assays and 16S rRNA gene clone libraries were used to document the abundance, diversity and community structure of anaerobic ammonia-oxidising (anammox) bacteria in the rhizosphere and non-rhizosphere sediments of three emergent macrophyte species (Iris pseudacorus, Thalia dealbata and Typha orientalis). The qPCR results confirmed the existence of anammox bacteria (AMX) with observed log number of gene copies per dry gram sediment ranging from 5.00 to 6.78. AMX was more abundant in T. orientalis-associated sediments than in the other two plant species. The I. pseudacorus- and T. orientalis-associated sediments had higher Shannon diversity values, indicating higher AMX diversity in these sediments. Based on the 16S rRNA gene, Candidatus 'Brocadia', Candidatus 'Kuenenia', Candidatus 'Jettenia' and new clusters were observed with the predominant Candidatus 'Kuenenia' cluster. The I. pseudacorus-associated sediments contained all the sequences of the C. 'Jettenia' cluster. Sequences obtained from T. orientalis-associated sediments contributed more than 90 % sequences in the new cluster, whereas none was found from I. pseudacorus. The new cluster was distantly related to known sequences; thus, this cluster was grouped outside the known clusters, indicating that the new cluster may be a new Planctomycetales genus. Further studies should be undertaken to confirm this finding.
Characterizing partial AZFc deletions of the Y chromosome with amplicon-specific sequence markers

PubMed Central

Navarro-Costa, Paulo; Pereira, Luísa; Alves, Cíntia; Gusmão, Leonor; Proença, Carmen; Marques-Vidal, Pedro; Rocha, Tiago; Correia, Sónia C; Jorge, Sónia; Neves, António; Soares, Ana P; Nunes, Joaquim; Calhaz-Jorge, Carlos; Amorim, António; Plancha, Carlos E; Gonçalves, João

2007-01-01

Background The AZFc region of the human Y chromosome is a highly recombinogenic locus containing multi-copy male fertility genes located in repeated DNA blocks (amplicons). These AZFc gene families exhibit slight sequence variations between copies which are considered to have functional relevance. Yet, partial AZFc deletions yield phenotypes ranging from normospermia to azoospermia, thwarting definite conclusions on their real impact on fertility. Results The amplicon content of partial AZFc deletion products was characterized with novel amplicon-specific sequence markers. Data indicate that partial AZFc deletions are a male infertility risk [odds ratio: 5.6 (95% CI: 1.6–30.1)] and although high diversity of partial deletion products and sequence conversion profiles were recorded, the AZFc marker profiles detected in fertile men were also observed in infertile men. Additionally, the assessment of rearrangement recurrence by Y-lineage analysis indicated that while partial AZFc deletions occurred in highly diverse samples, haplotype diversity was minimal in fertile men sharing identical marker profiles. Conclusion Although partial AZFc deletion products are highly heterogeneous in terms of amplicon content, this plasticity is not sufficient to account for the observed phenotypical variance. The lack of causative association between the deletion of specific gene copies and infertility suggests that AZFc gene content might be part of a multifactorial network, with Y-lineage evolution emerging as a possible phenotype modulator. PMID:17903263

A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing.

PubMed

Huszar, Tunde I; Jobling, Mark A; Wetton, Jon H

2018-04-12

Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega's prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes

PubMed Central

Shen, Yue; Stracquadanio, Giovanni; Wang, Yun; Yang, Kun; Mitchell, Leslie A.; Xue, Yaxin; Cai, Yizhi; Chen, Tai; Dymond, Jessica S.; Kang, Kang; Gong, Jianhui; Zeng, Xiaofan; Zhang, Yongfen; Li, Yingrui; Feng, Qiang; Xu, Xun; Wang, Jun; Wang, Jian; Yang, Huanming; Boeke, Jef D.; Bader, Joel S.

2016-01-01

Synthetic chromosome rearrangement and modification by loxP-mediated evolution (SCRaMbLE) generates combinatorial genomic diversity through rearrangements at designed recombinase sites. We applied SCRaMbLE to yeast synthetic chromosome arm synIXR (43 recombinase sites) and then used a computational pipeline to infer or unscramble the sequence of recombinations that created the observed genomes. Deep sequencing of 64 synIXR SCRaMbLE strains revealed 156 deletions, 89 inversions, 94 duplications, and 55 additional complex rearrangements; several duplications are consistent with a double rolling circle mechanism. Every SCRaMbLE strain was unique, validating the capability of SCRaMbLE to explore a diverse space of genomes. Rearrangements occurred exclusively at designed loxPsym sites, with no significant evidence for ectopic rearrangements or mutations involving synthetic regions, the 99% nonsynthetic nuclear genome, or the mitochondrial genome. Deletion frequencies identified genes required for viability or fast growth. Replacement of 3′ UTR by non-UTR sequence had surprisingly little effect on fitness. SCRaMbLE generates genome diversity in designated regions, reveals fitness constraints, and should scale to simultaneous evolution of multiple synthetic chromosomes. PMID:26566658
The Relevance of HLA Sequencing in Population Genetics Studies

PubMed Central

Sanchez-Mazas, Alicia

2014-01-01

Next generation sequencing (NGS) is currently being adapted by different biotechnological platforms to the standard typing method for HLA polymorphism, the huge diversity of which makes this initiative particularly challenging. Boosting the molecular characterization of the HLA genes through efficient, rapid, and low-cost technologies is expected to amplify the success of tissue transplantation by enabling us to find donor-recipient matching for rare phenotypes. But the application of NGS technologies to the molecular mapping of the MHC region also anticipates essential changes in population genetic studies. Huge amounts of HLA sequence data will be available in the next years for different populations, with the potential to change our understanding of HLA variation in humans. In this review, we first explain how HLA sequencing allows a better assessment of the HLA diversity in human populations, taking also into account the methodological difficulties it introduces at the statistical level; secondly, we show how analyzing HLA sequence variation may improve our comprehension of population genetic relationships by facilitating the identification of demographic events that marked human evolution; finally, we discuss the interest of both HLA and genome-wide sequencing and genotyping in detecting functionally significant SNPs in the MHC region, the latter having also contributed to the makeup of the HLA molecular diversity observed today. PMID:25126587
The relevance of HLA sequencing in population genetics studies.

PubMed

Sanchez-Mazas, Alicia; Meyer, Diogo

2014-01-01

Next generation sequencing (NGS) is currently being adapted by different biotechnological platforms to the standard typing method for HLA polymorphism, the huge diversity of which makes this initiative particularly challenging. Boosting the molecular characterization of the HLA genes through efficient, rapid, and low-cost technologies is expected to amplify the success of tissue transplantation by enabling us to find donor-recipient matching for rare phenotypes. But the application of NGS technologies to the molecular mapping of the MHC region also anticipates essential changes in population genetic studies. Huge amounts of HLA sequence data will be available in the next years for different populations, with the potential to change our understanding of HLA variation in humans. In this review, we first explain how HLA sequencing allows a better assessment of the HLA diversity in human populations, taking also into account the methodological difficulties it introduces at the statistical level; secondly, we show how analyzing HLA sequence variation may improve our comprehension of population genetic relationships by facilitating the identification of demographic events that marked human evolution; finally, we discuss the interest of both HLA and genome-wide sequencing and genotyping in detecting functionally significant SNPs in the MHC region, the latter having also contributed to the makeup of the HLA molecular diversity observed today.
Alterations of microbiota in urine from women with interstitial cystitis

PubMed Central

2012-01-01

Background Interstitial Cystitis (IC) is a chronic inflammatory condition of the bladder with unknown etiology. The aim of this study was to characterize the microbial community present in the urine from IC female patients by 454 high throughput sequencing of the 16S variable regions V1V2 and V6. The taxonomical composition, richness and diversity of the IC microbiota were determined and compared to the microbial profile of asymptomatic healthy female (HF) urine. Results The composition and distribution of bacterial sequences differed between the urine microbiota of IC patients and HFs. Reduced sequence richness and diversity were found in IC patient urine, and a significant difference in the community structure of IC urine in relation to HF urine was observed. More than 90% of the IC sequence reads were identified as belonging to the bacterial genus Lactobacillus, a marked increase compared to 60% in HF urine. Conclusion The 16S rDNA sequence data demonstrates a shift in the composition of the bacterial community in IC urine. The reduced microbial diversity and richness is accompanied by a higher abundance of the bacterial genus Lactobacillus, compared to HF urine. This study demonstrates that high throughput sequencing analysis of urine microbiota in IC patients is a powerful tool towards a better understanding of this enigmatic disease. PMID:22974186
Motility and Flagellar Glycosylation in Clostridium difficile▿ †

PubMed Central

Twine, Susan M.; Reid, Christopher W.; Aubry, Annie; McMullin, David R.; Fulton, Kelly M.; Austin, John; Logan, Susan M.

2009-01-01

In this study, intact flagellin proteins were purified from strains of Clostridium difficile and analyzed using quadrupole time of flight and linear ion trap mass spectrometers. Top-down studies showed the flagellin proteins to have a mass greater than that predicted from the corresponding gene sequence. These top-down studies revealed marker ions characteristic of glycan modifications. Additionally, diversity in the observed masses of glycan modifications was seen between strains. Electron transfer dissociation mass spectrometry was used to demonstrate that the glycan was attached to the flagellin protein backbone in O linkage via a HexNAc residue in all strains examined. Bioinformatic analysis of C. difficile genomes revealed diversity with respect to glycan biosynthesis gene content within the flagellar biosynthesis locus, likely reflected by the observed flagellar glycan diversity. In C. difficile strain 630, insertional inactivation of a glycosyltransferase gene (CD0240) present in all sequenced genomes resulted in an inability to produce flagellar filaments at the cell surface and only minor amounts of unmodified flagellin protein. PMID:19749038
Microbial colonization in diverse surface soil types in Surtsey and diversity analysis of its subsurface microbiota

NASA Astrophysics Data System (ADS)

Marteinsson, V.; Klonowski, A.; Reynisson, E.; Vannier, P.; Sigurdsson, B. D.; Ólafsson, M.

2015-02-01

Colonization of life on Surtsey has been observed systematically since the formation of the island 50 years ago. Although the first colonisers were prokaryotes, such as bacteria and blue-green algae, most studies have been focused on the settlement of plants and animals but less on microbial succession. To explore microbial colonization in diverse soils and the influence of associated vegetation and birds on numbers of environmental bacteria, we collected 45 samples from different soil types on the surface of the island. Total viable bacterial counts were performed with the plate count method at 22, 30 and 37 °C for all soil samples, and the amount of organic matter and nitrogen (N) was measured. Selected samples were also tested for coliforms, faecal coliforms and aerobic and anaerobic bacteria. The subsurface biosphere was investigated by collecting liquid subsurface samples from a 181 m borehole with a special sampler. Diversity analysis of uncultivated biota in samples was performed by 16S rRNA gene sequences analysis and cultivation. Correlation was observed between nutrient deficits and the number of microorganisms in surface soil samples. The lowest number of bacteria (1 × 104-1 × 105 cells g-1) was detected in almost pure pumice but the count was significantly higher (1 × 106-1 × 109 cells g-1) in vegetated soil or pumice with bird droppings. The number of faecal bacteria correlated also to the total number of bacteria and type of soil. Bacteria belonging to Enterobacteriaceae were only detected in vegetated samples and samples containing bird droppings. The human pathogens Salmonella, Campylobacter and Listeria were not in any sample. Both thermophilic bacteria and archaea 16S rDNA sequences were found in the subsurface samples collected at 145 and 172 m depth at 80 and 54 °C, respectively, but no growth was observed in enrichments. The microbiota sequences generally showed low affiliation to any known 16S rRNA gene sequences.
Microbial colonisation in diverse surface soil types in Surtsey and diversity analysis of its subsurface microbiota

NASA Astrophysics Data System (ADS)

Marteinsson, V.; Klonowski, A.; Reynisson, E.; Vannier, P.; Sigurdsson, B. D.; Ólafsson, M.

2014-09-01

Colonisation of life on Surtsey has been observed systematically since the formation of the island 50 years ago. Although the first colonisers were prokaryotes, such as bacteria and blue-green algae, most studies have been focusing on settlement of plants and animals but less on microbial succession. To explore microbial colonization in diverse soils and the influence of associate vegetation and birds on numbers of environmental bacteria, we collected 45 samples from different soils types on the surface of the island. Total viable bacterial counts were performed with plate count at 22, 30 and 37 °C for all soils samples and the amount of organic matter and nitrogen (N) was measured. Selected samples were also tested for coliforms, faecal coliforms aerobic and anaerobic bacteria. The deep subsurface biosphere was investigated by collecting liquid subsurface samples from a 182 m borehole with a special sampler. Diversity analysis of uncultivated biota in samples was performed by 16S rRNA gene sequences analysis and cultivation. Correlation was observed between N deficits and the number of microorganisms in surface soils samples. The lowest number of bacteria (1 × 104-1 × 105 g-1) was detected in almost pure pumice but the count was significant higher (1 × 106-1 × 109 g-1) in vegetated soil or pumice with bird droppings. The number of faecal bacteria correlated also to the total number of bacteria and type of soil. Bacteria belonging to Enterobacteriaceae were only detected in vegetated and samples containing bird droppings. The human pathogens Salmonella, Campylobacter and Listeria were not in any sample. Both thermophilic bacteria and archaea 16S rDNA sequences were found in the subsurface samples collected at 145 m and 172 m depth at 80 °C and 54 °C, respectively, but no growth was observed in enrichments. The microbiota sequences generally showed low affiliation to any known 16S rRNA gene sequences.
Antibiotics reduce genetic diversity of core species in the honeybee gut microbiome.

PubMed

Raymann, Kasie; Bobay, Louis-Marie; Moran, Nancy A

2018-04-01

The gut microbiome plays a key role in animal health, and perturbing it can have detrimental effects. One major source of perturbation to microbiomes, in humans and human-associated animals, is exposure to antibiotics. Most studies of how antibiotics affect the microbiome have used amplicon sequencing of highly conserved 16S rRNA sequences, as in a recent study showing that antibiotic treatment severely alters the species-level composition of the honeybee gut microbiome. But because the standard 16S rRNA-based methods cannot resolve closely related strains, strain-level changes could not be evaluated. To address this gap, we used amplicon sequencing of protein-coding genes to assess effects of antibiotics on fine-scale genetic diversity of the honeybee gut microbiota. We followed the population dynamics of alleles within two dominant core species of the bee gut community, Gilliamella apicola and Snodgrassella alvi, following antibiotic perturbation. Whereas we observed a large reduction in genetic diversity in G. apicola, S. alvi diversity was mostly unaffected. The reduction in G. apicola diversity accompanied an increase in the frequency of several alleles, suggesting resistance to antibiotic treatment. We find that antibiotic perturbation can cause major shifts in diversity and that the extent of these shifts can vary substantially across species. Thus, antibiotics impact not only species composition, but also allelic diversity within species, potentially affecting hosts if variants with particular functions are reduced or eliminated. Overall, we show that amplicon sequencing of protein-coding genes, without clustering into operational taxonomic units, provides an accurate picture of the fine-scale dynamics of microbial communities over time. © 2017 John Wiley & Sons Ltd.
Genetic diversity and antigenicity variation of Babesia bovis merozoite surface antigen-1 (MSA-1) in Thailand.

PubMed

Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Takemae, Hitoshi; Simking, Pacharathon; Jittapalapong, Sathaporn; Igarashi, Ikuo; Yokoyama, Naoaki

2016-07-01

Babesia bovis, an intraerythrocytic protozoan parasite, causes severe clinical disease in cattle worldwide. The genetic diversity of parasite antigens often results in different immune profiles in infected animals, hindering efforts to develop immune control methodologies against the B. bovis infection. In this study, we analyzed the genetic diversity of the merozoite surface antigen-1 (msa-1) gene using 162 B. bovis-positive blood DNA samples sourced from cattle populations reared in different geographical regions of Thailand. The identity scores shared among 93 msa-1 gene sequences isolated by PCR amplification were 43.5-100%, and the similarity values among the translated amino acid sequences were 42.8-100%. Of 23 total clades detected in our phylogenetic analysis, Thai msa-1 gene sequences occurred in 18 clades; seven among them were composed of sequences exclusively from Thailand. To investigate differential antigenicity of isolated MSA-1 proteins, we expressed and purified eight recombinant MSA-1 (rMSA-1) proteins, including an rMSA-1 from B. bovis Texas (T2Bo) strain and seven rMSA-1 proteins based on the Thai msa-1 sequences. When these antigens were analyzed in a western blot assay, anti-T2Bo cattle serum strongly reacted with the rMSA-1 from T2Bo, as well as with three other rMSA-1 proteins that shared 54.9-68.4% sequence similarity with T2Bo MSA-1. In contrast, no or weak reactivity was observed for the remaining rMSA-1 proteins, which shared low sequence similarity (35.0-39.7%) with T2Bo MSA-1. While demonstrating the high genetic diversity of the B. bovis msa-1 gene in Thailand, the present findings suggest that the genetic diversity results in antigenicity variations among the MSA-1 antigens of B. bovis in Thailand. Copyright © 2016 Elsevier B.V. All rights reserved.
Fine-Scale Bacterial Beta Diversity within a Complex Ecosystem (Zodletone Spring, OK, USA): The Role of the Rare Biosphere

PubMed Central

Youssef, Noha H.; Couger, M. B.; Elshahed, Mostafa S.

2010-01-01

Background The adaptation of pyrosequencing technologies for use in culture-independent diversity surveys allowed for deeper sampling of ecosystems of interest. One extremely well suited area of interest for pyrosequencing-based diversity surveys that has received surprisingly little attention so far, is examining fine scale (e.g. micrometer to millimeter) beta diversity in complex microbial ecosystems. Methodology/Principal Findings We examined the patterns of fine scale Beta diversity in four adjacent sediment samples (1mm apart) from the source of an anaerobic sulfide and sulfur rich spring (Zodletone spring) in southwestern Oklahoma, USA. Using pyrosequencing, a total of 292,130 16S rRNA gene sequences were obtained. The beta diversity patterns within the four datasets were examined using various qualitative and quantitative similarity indices. Low levels of Beta diversity (high similarity indices) were observed between the four samples at the phylum-level. However, at a putative species (OTU0.03) level, higher levels of beta diversity (lower similarity indices) were observed. Further examination of beta diversity patterns within dominant and rare members of the community indicated that at the putative species level, beta diversity is much higher within rare members of the community. Finally, sub-classification of rare members of Zodletone spring community based on patterns of novelty and uniqueness, and further examination of fine scale beta diversity of each of these subgroups indicated that members of the community that are unique, but non novel showed the highest beta diversity within these subgroups of the rare biosphere. Conclusions/Significance The results demonstrate the occurrence of high inter-sample diversity within seemingly identical samples from a complex habitat. We reason that such unexpected diversity should be taken into consideration when exploring gamma diversity of various ecosystems, as well as planning for sequencing-intensive metagenomic surveys of highly complex ecosystems. PMID:20865128
High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology

PubMed Central

Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M

2007-01-01

Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Microbial eukaryotic diversity and distribution in a river plume and cyclonic eddy-influenced ecosystem in the South China Sea

PubMed Central

Wu, Wenxue; Wang, Lei; Liao, Yu; Huang, Bangqin

2015-01-01

To evaluate microbial eukaryotic diversity and distribution in mesoscale processes, we investigated 18S rDNA diversity in a river plume and cyclonic eddy-influenced ecosystem in the southwestern South China Sea (SCS). Restriction fragment length polymorphism analysis was carried out using multiple primer sets. Relative to a wide range of previous similar studies, we observed a significantly higher proportion of sequences of pigmented taxa. Among the photosynthetic groups, Haptophyta accounted for 27.7% of the sequenced clones, which belonged primarily to Prymnesiophyceae. Unexpectedly, five operational taxonomic units of Cryptophyta were closely related to freshwater species. The Chlorophyta mostly fell within the Prasinophyceae, which was comprised of six clades, including Clade III, which is detected in the SCS for the first time in this study. Among the photosynthetic stramenopiles, Chrysophyceae was the most diverse taxon, which included seven clades. The majority of 18S rDNA sequences affiliated with the Dictyochophyceae, Eustigmatophyceae, and Pelagophyceae were closely related to those of pure cultures. The results of redundancy analysis and the permutation Mantel test based on unweighted UniFrac distances, conducted for spatial analyses of the Haptophyta subclades suggested that the Mekong River plume and cyclonic eddy play important roles in regulating microbial eukaryotic diversity and distribution in the southwestern SCS. PMID:26268071
Genomic diversity of the human intestinal parasite Entamoeba histolytica

PubMed Central

2012-01-01

Background Entamoeba histolytica is a significant cause of disease worldwide. However, little is known about the genetic diversity of the parasite. We re-sequenced the genomes of ten laboratory cultured lines of the eukaryotic pathogen Entamoeba histolytica in order to develop a picture of genetic diversity across the genome. Results The extreme nucleotide composition bias and repetitiveness of the E. histolytica genome provide a challenge for short-read mapping, yet we were able to define putative single nucleotide polymorphisms in a large portion of the genome. The results suggest a rather low level of single nucleotide diversity, although genes and gene families with putative roles in virulence are among the more polymorphic genes. We did observe large differences in coverage depth among genes, indicating differences in gene copy number between genomes. We found evidence indicating that recombination has occurred in the history of the sequenced genomes, suggesting that E. histolytica may reproduce sexually. Conclusions E. histolytica displays a relatively low level of nucleotide diversity across its genome. However, large differences in gene family content and gene copy number are seen among the sequenced genomes. The pattern of polymorphism indicates that E. histolytica reproduces sexually, or has done so in the past, which has previously been suggested but not proven. PMID:22630046
Coral-the world's most diverse symbiotic ecosystem.

PubMed

Blackall, Linda L; Wilson, Bryan; van Oppen, Madeleine J H

2015-11-01

Zooxanthellate corals (i.e. those harbouring Symbiodinium) are the main builders of the world's shallow-water marine coral reefs. They represent intimate diverse symbioses between coral animals, single-celled photosynthetic dinoflagellates (Symbiodinium spp.), other microscopic eukaryotes, prokaryotes and viruses. Crabs and other crustaceans, worms, sponges, bivalves and hydrozoans, fishes, sea urchins, octopuses and sea stars are itinerant members of these 'rainforests of the sea'. This review focuses on the biodiversity of scleractinian coral animals and their best studied microscopic epi- and endosymbionts. In relation to coral-associated species diversity, Symbiodinium internal transcribed spacer region sequence types tally 10(2) -10(3) or up to ~15 different operational taxonomic units (OTUs, or putative species at the 97% sequence identity level; this cut-off was chosen based on intragenomic sequence diversity observed in monoclonal cultures) and prokaryotes (mostly bacterial) total 10(2) -10(4) OTUs. We analysed all publically accessible 16S rRNA gene sequence data and found Gammaproteobacteria were extremely abundant, followed by Alphaproteobacteria. Notably, Archaea were poorly represented and 'unassigned OTUs' were abundant in data generated by high-throughput DNA sequencing studies of corals. We outline and compare model systems that could be used in future studies of the coral holobiont. In our future directions, we recommend a global coral sampling effort including substantial attention being paid to method of coral tissue acquisition, which compartments (mucus, tissue, skeleton) to explore, broadening the holobiont members considered and linking biodiversity with functional investigations. © 2015 John Wiley & Sons Ltd.
Archaeal and bacterial diversity in two hot springs from geothermal regions in Bulgaria as demostrated by 16S rRNA and GH-57 genes.

PubMed

Stefanova, Katerina; Tomova, Iva; Tomova, Anna; Radchenkova, Nadja; Atanassov, Ivan; Kambourova, Margarita

2015-12-01

Archaeal and bacterial diversity in two Bulgarian hot springs, geographically separated with different tectonic origin and different temperature of water was investigated exploring two genes, 16S rRNA and GH-57. Archaeal diversity was significantly higher in the hotter spring Levunovo (LV) (82°C); on the contrary, bacterial diversity was higher in the spring Vetren Dol (VD) (68°C). The analyzed clones from LV library were referred to twenty eight different sequence types belonging to five archaeal groups from Crenarchaeota and Euryarchaeota. A domination of two groups was observed, Candidate Thaumarchaeota and Methanosarcinales. The majority of the clones from VD were referred to HWCG (Hot Water Crenarchaeotic Group). The formation of a group of thermophiles in the order Methanosarcinales was suggested. Phylogenetic analysis revealed high numbers of novel sequences, more than one third of archaeal and half of the bacterial phylotypes displayed similarity lower than 97% with known ones. The retrieved GH-57 gene sequences showed a complex phylogenic distribution. The main part of the retrieved homologous GH-57 sequences affiliated with bacterial phyla Bacteroidetes, Deltaproteobacteria, Candidate Saccharibacteria and affiliation of almost half of the analyzed sequences is not fully resolved. GH-57 gene analysis allows an increased resolution of the biodiversity assessment and in depth analysis of specific taxonomic groups. [Int Microbiol 18(4):217-223 (2015)]. Copyright© by the Spanish Society for Microbiology and Institute for Catalan Studies.
Genetic diversity and natural selection of Plasmodium knowlesi merozoite surface protein 1 paralog gene in Malaysia.

PubMed

Ahmed, Md Atique; Fauzi, Muh; Han, Eun-Taek

2018-03-14

Human infections due to the monkey malaria parasite Plasmodium knowlesi is on the rise in most Southeast Asian countries specifically Malaysia. The C-terminal 19 kDa domain of PvMSP1P is a potential vaccine candidate, however, no study has been conducted in the orthologous gene of P. knowlesi. This study investigates level of polymorphisms, haplotypes and natural selection of full-length pkmsp1p in clinical samples from Malaysia. A total of 36 full-length pkmsp1p sequences along with the reference H-strain and 40 C-terminal pkmsp1p sequences from clinical isolates of Malaysia were downloaded from published genomes. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 and MEGA 5.0 software. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (F ST ) and population structure of parasite was determined using Arlequin v3.5 and STRUCTURE v2.3.4 software. Comparison of 36 full-length pkmsp1p sequences along with the H-strain identified 339 SNPs (175 non-synonymous and 164 synonymous substitutions). The nucleotide diversity across the full-length gene was low compared to its ortholog pvmsp1p. The nucleotide diversity was higher toward the N-terminal domains (pkmsp1p-83 and 30) compared to the C-terminal domains (pkmsp1p-38, 33 and 19). Phylogenetic analysis of full-length genes identified 2 distinct clusters of P. knowlesi from Malaysian Borneo. The 40 pkmsp1p-19 sequences showed low polymorphisms with 16 polymorphisms leading to 18 haplotypes. In total there were 10 synonymous and 6 non-synonymous substitutions and 12 cysteine residues were intact within the two EGF domains. Evidence of strong purifying selection was observed within the full-length sequences as well in all the domains. Shared haplotypes of 40 pkmsp1p-19 were identified within Malaysian Borneo haplotypes. This study is the first to report on the genetic diversity and natural selection of pkmsp1p. A low level of genetic diversity and strong evidence of negative selection was detected and observed in all the domains of pkmsp1p of P. knowlesi indicating functional constrains. Shared haplotypes were identified within pkmsp1p-19 highlighting further evaluation using larger number of clinical samples from Malaysia.
Investigating Salmonella Eko from Various Sources in Nigeria by Whole Genome Sequencing to Identify the Source of Human Infections

PubMed Central

Leekitcharoenphon, Pimlapas; Raufu, Ibrahim; Nielsen, Mette T.; Rosenqvist Lund, Birthe S.; Ameh, James A.; Ambali, Abdul G.; Sørensen, Gitte; Le Hello, Simon; Aarestrup, Frank M.; Hendriksen, Rene S.

2016-01-01

Twenty-six Salmonella enterica serovar Eko isolated from various sources in Nigeria were investigated by whole genome sequencing to identify the source of human infections. Diversity among the isolates was observed and camel and cattle were identified as the primary reservoirs and the most likely source of the human infections. PMID:27228329
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

PubMed Central

Van Nostrand, Joy D.; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong

2017-01-01

Illumina’s MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1–3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility. PMID:28453559
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wen, Chongqing; Wu, Liyou; Qin, Yujia

Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered,more » the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.« less

Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

DOE PAGES

Wen, Chongqing; Wu, Liyou; Qin, Yujia; ...

2017-04-28

Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered,more » the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.« less
Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform.

PubMed

Wen, Chongqing; Wu, Liyou; Qin, Yujia; Van Nostrand, Joy D; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong

2017-01-01

Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.
Learning to Observe in a Geomorphological Context

ERIC Educational Resources Information Center

Martinez, Patricia; Bannan-Ritland, Brenda; Peters, Erin E.; Baek, John

2011-01-01

This three-lesson sequence, addressing the topic of slow geomorphological change caused by water movement, integrates a Web-based system called Goinquire into a series of activities aimed to help upper-elementary, diverse students improve their observation skills and content knowledge in geomorphology. During the inquiry-based lessons, students…
Archaeon and archaeal virus diversity classification via sequence entropy and fractal dimension

NASA Astrophysics Data System (ADS)

Tremberger, George, Jr.; Gallardo, Victor; Espinoza, Carola; Holden, Todd; Gadura, N.; Cheung, E.; Schneider, P.; Lieberman, D.; Cheung, T.

2010-09-01

Archaea are important potential candidates in astrobiology as their metabolism includes solar, inorganic and organic energy sources. Archaeal viruses would also be expected to be present in a sustainable archaeal exobiological community. Genetic sequence Shannon entropy and fractal dimension can be used to establish a two-dimensional measure for classification and phylogenetic study of these organisms. A sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. Archaeal 16S and 23S ribosomal RNA sequences were studied. Outliers in the 16S rRNA fractal dimension and entropy plot were found to be halophilic archaea. Positive correlation (R-square ~ 0.75, N = 18) was observed between fractal dimension and entropy across the studied species. The 16S ribosomal RNA sequence entropy correlates with the 23S ribosomal RNA sequence entropy across species with R-square 0.93, N = 18. Entropy values correspond positively with branch lengths of a published phylogeny. The studied archaeal virus sequences have high fractal dimensions of 2.02 or more. A comparison of selected extremophile sequences with archaeal sequences from the Humboldt Marine Ecosystem database (Wood-Hull Oceanography Institute, MIT) suggests the presence of continuous sequence expression as inferred from distributions of entropy and fractal dimension, consistent with the diversity expected in an exobiological archaeal community.
SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes.

PubMed

Shen, Yue; Stracquadanio, Giovanni; Wang, Yun; Yang, Kun; Mitchell, Leslie A; Xue, Yaxin; Cai, Yizhi; Chen, Tai; Dymond, Jessica S; Kang, Kang; Gong, Jianhui; Zeng, Xiaofan; Zhang, Yongfen; Li, Yingrui; Feng, Qiang; Xu, Xun; Wang, Jun; Wang, Jian; Yang, Huanming; Boeke, Jef D; Bader, Joel S

2016-01-01

Synthetic chromosome rearrangement and modification by loxP-mediated evolution (SCRaMbLE) generates combinatorial genomic diversity through rearrangements at designed recombinase sites. We applied SCRaMbLE to yeast synthetic chromosome arm synIXR (43 recombinase sites) and then used a computational pipeline to infer or unscramble the sequence of recombinations that created the observed genomes. Deep sequencing of 64 synIXR SCRaMbLE strains revealed 156 deletions, 89 inversions, 94 duplications, and 55 additional complex rearrangements; several duplications are consistent with a double rolling circle mechanism. Every SCRaMbLE strain was unique, validating the capability of SCRaMbLE to explore a diverse space of genomes. Rearrangements occurred exclusively at designed loxPsym sites, with no significant evidence for ectopic rearrangements or mutations involving synthetic regions, the 99% nonsynthetic nuclear genome, or the mitochondrial genome. Deletion frequencies identified genes required for viability or fast growth. Replacement of 3' UTR by non-UTR sequence had surprisingly little effect on fitness. SCRaMbLE generates genome diversity in designated regions, reveals fitness constraints, and should scale to simultaneous evolution of multiple synthetic chromosomes. © 2016 Shen et al.; Published by Cold Spring Harbor Laboratory Press.
Complete genomic sequences of Propionibacterium freudenreichii phages from Swiss cheese reveal greater diversity than Cutibacterium (formerly Propionibacterium) acnes phages.

PubMed

Cheng, Lucy; Marinelli, Laura J; Grosset, Noël; Fitz-Gibbon, Sorel T; Bowman, Charles A; Dang, Brian Q; Russell, Daniel A; Jacobs-Sera, Deborah; Shi, Baochen; Pellegrini, Matteo; Miller, Jeff F; Gautier, Michel; Hatfull, Graham F; Modlin, Robert L

2018-03-01

A remarkable exception to the large genetic diversity often observed for bacteriophages infecting a specific bacterial host was found for the Cutibacterium acnes (formerly Propionibacterium acnes) phages, which are highly homogeneous. Phages infecting the related species, which is also a member of the Propionibacteriaceae family, Propionibacterium freudenreichii, a bacterium used in production of Swiss-type cheeses, have also been described and are common contaminants of the cheese manufacturing process. However, little is known about their genetic composition and diversity. We obtained seven independently isolated bacteriophages that infect P. freudenreichii from Swiss-type cheese samples, and determined their complete genome sequences. These data revealed that all seven phage isolates are of similar genomic length and GC% content, but their genomes are highly diverse, including genes encoding the capsid, tape measure, and tail proteins. In contrast to C. acnes phages, all P. freudenreichii phage genomes encode a putative integrase protein, suggesting they are capable of lysogenic growth. This is supported by the finding of related prophages in some P. freudenreichii strains. The seven phages could further be distinguished as belonging to two distinct genomic types, or 'clusters', based on nucleotide sequences, and host range analyses conducted on a collection of P. freudenreichii strains show a higher degree of host specificity than is observed for the C. acnes phages. Overall, our data demonstrate P. freudenreichii bacteriophages are distinct from C. acnes phages, as evidenced by their higher genetic diversity, potential for lysogenic growth, and more restricted host ranges. This suggests substantial differences in the evolution of these related species from the Propionibacteriaceae family and their phages, which is potentially related to their distinct environmental niches.
Determining the Diversity and Species Abundance Patterns in Arctic Soils using Rational Methods for Exploring Microbial Diversity

NASA Astrophysics Data System (ADS)

Ovreas, L.; Quince, C.; Sloan, W.; Lanzen, A.; Davenport, R.; Green, J.; Coulson, S.; Curtis, T.

2012-12-01

Arctic microbial soil communities are intrinsically interesting and poorly characterised. We have inferred the diversity and species abundance distribution of 6 Arctic soils: new and mature soil at the foot of a receding glacier, Arctic Semi Desert, the foot of bird cliffs and soil underlying Arctic Tundra Heath: all near Ny-Ålesund, Spitsbergen. Diversity, distribution and sample sizes were estimated using the rational method of Quince et al., (Isme Journal 2 2008:997-1006) to determine the most plausible underlying species abundance distribution. A log-normal species abundance curve was found to give a slightly better fit than an inverse Gaussian curve if, and only if, sequencing error was removed. The median estimates of diversity of operational taxonomic units (at the 3% level) were 3600-5600 (lognormal assumed) and 2825-4100 (inverse Gaussian assumed). The nature and origins of species abundance distributions are poorly understood but may yet be grasped by observing and analysing such distributions in the microbial world. The sample size required to observe the distribution (by sequencing 90% of the taxa) varied between ~ 106 and ~105 for the lognormal and inverse Gaussian respectively. We infer that between 5 and 50 GB of sequencing would be required to capture 90% or the metagenome. Though a principle components analysis clearly divided the sites into three groups there was a high (20-45%) degree of overlap in between locations irrespective of geographical proximity. Interestingly, the nearest relatives of the most abundant taxa at a number of most sites were of alpine or polar origin. Samples plotted on first two principal components together with arbitrary discriminatory OTUs
The major histocompatibility complex of tassel-eared squirrels. II. Genetic diversity associated with Abert squirrels.

PubMed

Wettstein, P J; States, J S

1986-01-01

The extent of polymorphism and the rate of divergence of class I and class II sequences mapping to the mammalian major histocompatibility complex (MHC) have been the subject of experimentation and speculation. To provide further insight into the evolution of the MHC we have initiated the analysis of two geographically isolated subspecies of tassel-eared squirrels. In the preceding communication we described the number and polymorphism of TSLA class I and class II sequences in Kaibab squirrels (S. aberti kaibabensis), which live north of the Grand Canyon. In this report we present a parallel analysis of Abert squirrels (S. aberti aberti), which live south of the Grand Canyon in northern Arizona. Genomic DNA from 12 Abert squirrels was digested with restriction enzymes, electrophoresed, blotted, and hybridized with DR alpha, DR beta, DQ alpha, DQ beta, and HLA-B7 probes. The results of these hybridizations were remarkably similar to those obtained in Kaibab squirrels. The majority of class I and class II bands were identical in size and number, suggesting that Abert and Kaibab squirrels have not significantly diverged in the TSLA complex despite their geographical separation. Relative polymorphism of class II sequences was similar to that observed with Kaibab squirrels: beta sequences exhibited higher polymorphism than alpha sequences. As in Kaibab squirrels, a number of alpha and beta sequences were apparently carried on the same fragments. In comparison to class II beta sequences, there was limited polymorphism in class I sequences, although a diverse number of class I genotypes were observed. Attempts to identify segregating TSLA haplotypes were futile in that the only families of sequences with concordant distributions were DQ alpha and DQ beta. These observations and those obtained with Kaibab squirrels suggest that the present-day TSLA haplotypes of both subspecies are derived from a limited number of common, progenitor haplotypes through repeated intra-TSLA recombination.
Prospective identification of parasitic sequences in phage display screens

PubMed Central

Matochko, Wadim L.; Cory Li, S.; Tang, Sindy K.Y.; Derda, Ratmir

2014-01-01

Phage display empowered the development of proteins with new function and ligands for clinically relevant targets. In this report, we use next-generation sequencing to analyze phage-displayed libraries and uncover a strong bias induced by amplification preferences of phage in bacteria. This bias favors fast-growing sequences that collectively constitute <0.01% of the available diversity. Specifically, a library of 109 random 7-mer peptides (Ph.D.-7) includes a few thousand sequences that grow quickly (the ‘parasites’), which are the sequences that are typically identified in phage display screens published to date. A similar collapse was observed in other libraries. Using Illumina and Ion Torrent sequencing and multiple biological replicates of amplification of Ph.D.-7 library, we identified a focused population of 770 ‘parasites’. In all, 197 sequences from this population have been identified in literature reports that used Ph.D.-7 library. Many of these enriched sequences have confirmed function (e.g. target binding capacity). The bias in the literature, thus, can be viewed as a selection with two different selection pressures: (i) target-binding selection, and (ii) amplification-induced selection. Enrichment of parasitic sequences could be minimized if amplification bias is removed. Here, we demonstrate that emulsion amplification in libraries of ∼106 diverse clones prevents the biased selection of parasitic clones. PMID:24217917
Deconvoluting simulated metagenomes: the performance of hard- and soft- clustering algorithms applied to metagenomic chromosome conformation capture (3C)

PubMed Central

DeMaere, Matthew Z.

2016-01-01

Background Chromosome conformation capture, coupled with high throughput DNA sequencing in protocols like Hi-C and 3C-seq, has been proposed as a viable means of generating data to resolve the genomes of microorganisms living in naturally occuring environments. Metagenomic Hi-C and 3C-seq datasets have begun to emerge, but the feasibility of resolving genomes when closely related organisms (strain-level diversity) are present in the sample has not yet been systematically characterised. Methods We developed a computational simulation pipeline for metagenomic 3C and Hi-C sequencing to evaluate the accuracy of genomic reconstructions at, above, and below an operationally defined species boundary. We simulated datasets and measured accuracy over a wide range of parameters. Five clustering algorithms were evaluated (2 hard, 3 soft) using an adaptation of the extended B-cubed validation measure. Results When all genomes in a sample are below 95% sequence identity, all of the tested clustering algorithms performed well. When sequence data contains genomes above 95% identity (our operational definition of strain-level diversity), a naive soft-clustering extension of the Louvain method achieves the highest performance. Discussion Previously, only hard-clustering algorithms have been applied to metagenomic 3C and Hi-C data, yet none of these perform well when strain-level diversity exists in a metagenomic sample. Our simple extension of the Louvain method performed the best in these scenarios, however, accuracy remained well below the levels observed for samples without strain-level diversity. Strain resolution is also highly dependent on the amount of available 3C sequence data, suggesting that depth of sequencing must be carefully considered during experimental design. Finally, there appears to be great scope to improve the accuracy of strain resolution through further algorithm development. PMID:27843713
Selection and Trans-Species Polymorphism of Major Histocompatibility Complex Class II Genes in the Order Crocodylia

PubMed Central

Jaratlerdsiri, Weerachai; Isberg, Sally R.; Higgins, Damien P.; Miles, Lee G.; Gongora, Jaime

2014-01-01

Major Histocompatibility Complex (MHC) class II genes encode for molecules that aid in the presentation of antigens to helper T cells. MHC characterisation within and between major vertebrate taxa has shed light on the evolutionary mechanisms shaping the diversity within this genomic region, though little characterisation has been performed within the Order Crocodylia. Here we investigate the extent and effect of selective pressures and trans-species polymorphism on MHC class II α and β evolution among 20 extant species of Crocodylia. Selection detection analyses showed that diversifying selection influenced MHC class II β diversity, whilst diversity within MHC class II α is the result of strong purifying selection. Comparison of translated sequences between species revealed the presence of twelve trans-species polymorphisms, some of which appear to be specific to the genera Crocodylus and Caiman. Phylogenetic reconstruction clustered MHC class II α sequences into two major clades representing the families Crocodilidae and Alligatoridae. However, no further subdivision within these clades was evident and, based on the observation that most MHC class II α sequences shared the same trans-species polymorphisms, it is possible that they correspond to the same gene lineage across species. In contrast, phylogenetic analyses of MHC class II β sequences showed a mixture of subclades containing sequences from Crocodilidae and/or Alligatoridae, illustrating orthologous relationships among those genes. Interestingly, two of the subclades containing sequences from both Crocodilidae and Alligatoridae shared specific trans-species polymorphisms, suggesting that they may belong to ancient lineages pre-dating the divergence of these two families from the common ancestor 85–90 million years ago. The results presented herein provide an immunogenetic resource that may be used to further assess MHC diversity and functionality in Crocodylia. PMID:24503938
Archaeal β diversity patterns under the seafloor along geochemical gradients

NASA Astrophysics Data System (ADS)

Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya

2014-09-01

Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.
Physical mapping of repetitive DNA suggests 2n reduction in Amazon turtles Podocnemis (Testudines: Podocnemididae)

PubMed Central

Cavalcante, Manoella Gemaque; Bastos, Carlos Eduardo Matos Carvalho; Nagamachi, Cleusa Yoshiko; Pieczarka, Julio Cesar; Vicari, Marcelo Ricardo; Noronha, Renata Coelho Rodrigues

2018-01-01

Cytogenetic studies show that there is great karyotypic diversity in order Testudines (2n = 26–68), and that this may be mainly attributed to the presence/absence of microchromosomes. Members of the Podocnemididae family have the smallest diploid numbers of this order (2n = 26–28), which may be a derived condition of the group. Diverse studies suggest that repetitive-DNA-rich sites generally act as hotspots for double-strand breaks and chromosomal reorganization. In this context, we used fluorescent in situ hybridization (FISH) to map telomeric sequences (TTAGGG)n, 45S rDNA, and the genes encoding histones H1 and H3 in two species of genus Podocnemis. We also observed conservation of the 45S rDNA and H1 histone sequences (probable case of conserved synteny), but multiple conserved and non-conserved clusters of H3 genes, which colocalized with the interstitial telomeric sequences in the Podocnemis genome. Our results suggest that fusions have occurred between macro and microchromosomes or between microchromosomes, leading to the observed reduction in diploid number in the family Podocnemididae. PMID:29813087
Physical mapping of repetitive DNA suggests 2n reduction in Amazon turtles Podocnemis (Testudines: Podocnemididae).

PubMed

Cavalcante, Manoella Gemaque; Bastos, Carlos Eduardo Matos Carvalho; Nagamachi, Cleusa Yoshiko; Pieczarka, Julio Cesar; Vicari, Marcelo Ricardo; Noronha, Renata Coelho Rodrigues

2018-01-01

Cytogenetic studies show that there is great karyotypic diversity in order Testudines (2n = 26-68), and that this may be mainly attributed to the presence/absence of microchromosomes. Members of the Podocnemididae family have the smallest diploid numbers of this order (2n = 26-28), which may be a derived condition of the group. Diverse studies suggest that repetitive-DNA-rich sites generally act as hotspots for double-strand breaks and chromosomal reorganization. In this context, we used fluorescent in situ hybridization (FISH) to map telomeric sequences (TTAGGG)n, 45S rDNA, and the genes encoding histones H1 and H3 in two species of genus Podocnemis. We also observed conservation of the 45S rDNA and H1 histone sequences (probable case of conserved synteny), but multiple conserved and non-conserved clusters of H3 genes, which colocalized with the interstitial telomeric sequences in the Podocnemis genome. Our results suggest that fusions have occurred between macro and microchromosomes or between microchromosomes, leading to the observed reduction in diploid number in the family Podocnemididae.
Beyond Bacteria: A Study of the Enteric Microbial Consortium in Extremely Low Birth Weight Infants

PubMed Central

Cotton, Charles Michael; Goldberg, Ronald N.; Wynn, James L.; Jackson, Robert B.; Seed, Patrick C.

2011-01-01

Extremely low birth weight (ELBW) infants have high morbidity and mortality, frequently due to invasive infections from bacteria, fungi, and viruses. The microbial communities present in the gastrointestinal tracts of preterm infants may serve as a reservoir for invasive organisms and remain poorly characterized. We used deep pyrosequencing to examine the gut-associated microbiome of 11 ELBW infants in the first postnatal month, with a first time determination of the eukaryote microbiota such as fungi and nematodes, including bacteria and viruses that have not been previously described. Among the fungi observed, Candida sp. and Clavispora sp. dominated the sequences, but a range of environmental molds were also observed. Surprisingly, seventy-one percent of the infant fecal samples tested contained ribosomal sequences corresponding to the parasitic organism Trichinella. Ribosomal DNA sequences for the roundworm symbiont Xenorhabdus accompanied these sequences in the infant with the greatest proportion of Trichinella sequences. When examining ribosomal DNA sequences in aggregate, Enterobacteriales, Pseudomonas, Staphylococcus, and Enterococcus were the most abundant bacterial taxa in a low diversity bacterial community (mean Shannon-Weaver Index of 1.02±0.69), with relatively little change within individual infants through time. To supplement the ribosomal sequence data, shotgun sequencing was performed on DNA from multiple displacement amplification (MDA) of total fecal genomic DNA from two infants. In addition to the organisms mentioned previously, the metagenome also revealed sequences for gram positive and gram negative bacteriophages, as well as human adenovirus C. Together, these data reveal surprising eukaryotic and viral microbial diversity in ELBW enteric microbiota dominated bytypes of bacteria known to cause invasive disease in these infants. PMID:22174751
Nitrous Oxide Reductase (nosZ) Gene Fragments Differ between Native and Cultivated Michigan Soils

PubMed Central

Stres, Blaž; Mahne, Ivan; Avguštin, Gorazd; Tiedje, James M.

2004-01-01

The effect of standard agricultural management on the genetic heterogeneity of nitrous oxide reductase (nosZ) fragments from denitrifying prokaryotes in native and cultivated soil was explored. Thirty-six soil cores were composited from each of the two soil management conditions. nosZ gene fragments were amplified from triplicate samples, and PCR products were cloned and screened by restriction fragment length polymorphism (RFLP). The total nosZ RFLP profiles increased in similarity with soil sample size until triplicate 3-g samples produced visually identical RFLP profiles for each treatment. Large differences in total nosZ profiles were observed between the native and cultivated soils. The fragments representing major groups of clones encountered at least twice and four randomly selected clones with unique RFLP patterns were sequenced to verify nosZ identity. The sequence diversity of nosZ clones from the cultivated field was higher, and only eight patterns were found in clone libraries from both soils among the 182 distinct nosZ RFLP patterns identified from the two soils. A group of clones that comprised 32% of all clones dominated the gene library of native soil, whereas many minor groups were observed in the gene library of cultivated soil. The 95% confidence intervals of the Chao1 nonparametric richness estimator for nosZ RFLP data did not overlap, indicating that the levels of species richness are significantly different in the two soils, the cultivated soil having higher diversity. Phylogenetic analysis of deduced amino acid sequences grouped the majority of nosZ clones into an interleaved Michigan soil cluster whose cultured members are α-Proteobacteria. Only four nosZ sequences from cultivated soil and one from the native soil were related to sequences found in γ-Proteobacteria. Sequences from the native field formed a distinct, closely related cluster (Dmean = 0.16) containing 91.6% of the native clones. Clones from the cultivated field were more distantly related to each other (Dmean = 0.26), and 65% were found outside of the cluster from the native soil, further indicating a difference in the two communities. Overall, there appears to be a relationship between use and richness, diversity, and the phylogenetic position of nosZ sequences, indicating that agricultural use of soil caused a shift to a more diverse denitrifying community. PMID:14711656
Microbial diversity and composition of the sediment in the drinking water reservoir Saidenbach (Saxonia, Germany).

PubMed

Röske, Kerstin; Sachse, René; Scheerer, Carola; Röske, Isolde

2012-02-01

Sediments contain a huge number and diversity of microorganisms that are important for the flux of material and are pivotal to all major biogeochemical cycles. Sediments of reservoirs are affected by a wide spectrum of allochthous and autochthonous influences providing versatile environments along the flow of water within the reservoir. Here we report on the microbial diversity in sediments of the mesotrophic drinking water reservoir Saidenbach, Germany, featuring a pronounced longitudinal gradient in sediment composition in the reservoir system. Three sampling sites were selected along the gradient, and the microbial communities in two sediment depths were characterized using catalysed reporter deposition fluorescence in situ hybridization (CARD-FISH) and a bar-coded pyrosequencing approach. Multivariate statistic was used to reveal relationships between sequence diversity and the environmental conditions. The microbial communities were tremendously diverse with a Shannon index of diversity (H') ranging from 6.7 to 7.1. 18,986 sequences could be classified into 37 phyla including candidate divisions, but the full extent of genetic diversity was not captured. While CARD-FISH gave an overview about the community composition, more detailed information was gained by pyrosequencing. Bacteria were more abundant than Archaea. The dominating phylum in all samples was Proteobacteria, especially Betaproteobacteria and Deltaproteobacteria. Furthermore, sequences of Bacteroidetes, Verrucomicrobia, Acidobacteria, Chlorobi, Nitrospira, Spirochaetes, Gammaproteobacteria, Alphaproteobacteria, Chloroflexi, and Gemmatimonadetes were found. The site ammonium concentration, water content and organic matter content revealed to be strongest environmental predictors explaining the observed significant differences in the community composition between sampling sites. Copyright © 2011 Elsevier GmbH. All rights reserved.
Wolbachia association with the tsetse fly, Glossina fuscipes fuscipes, reveals high levels of genetic diversity and complex evolutionary dynamics

PubMed Central

2013-01-01

Background Wolbachia pipientis, a diverse group of α-proteobacteria, can alter arthropod host reproduction and confer a reproductive advantage to Wolbachia-infected females (cytoplasmic incompatibility (CI)). This advantage can alter host population genetics because Wolbachia-infected females produce more offspring with their own mitochondrial DNA (mtDNA) haplotypes than uninfected females. Thus, these host haplotypes become common or fixed (selective sweep). Although simulations suggest that for a CI-mediated sweep to occur, there must be a transient phase with repeated initial infections of multiple individual hosts by different Wolbachia strains, this has not been observed empirically. Wolbachia has been found in the tsetse fly, Glossina fuscipes fuscipes, but it is not limited to a single host haplotype, suggesting that CI did not impact its population structure. However, host population genetic differentiation could have been generated if multiple Wolbachia strains interacted in some populations. Here, we investigated Wolbachia genetic variation in G. f. fuscipes populations of known host genetic composition in Uganda. We tested for the presence of multiple Wolbachia strains using Multi-Locus Sequence Typing (MLST) and for an association between geographic region and host mtDNA haplotype using Wolbachia DNA sequence from a variable locus, groEL (heat shock protein 60). Results MLST demonstrated that some G. f. fuscipes carry Wolbachia strains from two lineages. GroEL revealed high levels of sequence diversity within and between individuals (Haplotype diversity = 0.945). We found Wolbachia associated with 26 host mtDNA haplotypes, an unprecedented result. We observed a geographical association of one Wolbachia lineage with southern host mtDNA haplotypes, but it was non-significant (p = 0.16). Though most Wolbachia-infected host haplotypes were those found in the contact region between host mtDNA groups, this association was non-significant (p = 0.17). Conclusions High Wolbachia sequence diversity and the association of Wolbachia with multiple host haplotypes suggest that different Wolbachia strains infected G. f. fuscipes multiple times independently. We suggest that these observations reflect a transient phase in Wolbachia evolution that is influenced by the long gestation and low reproductive output of tsetse. Although G. f. fuscipes is superinfected with Wolbachia, our data does not support that bidirectional CI has influenced host genetic diversity in Uganda. PMID:23384159
Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform

PubMed Central

Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

2015-01-01

Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy. PMID:25860802
Microbial eukaryotic diversity and distribution in a river plume and cyclonic eddy-influenced ecosystem in the South China Sea.

PubMed

Wu, Wenxue; Wang, Lei; Liao, Yu; Huang, Bangqin

2015-10-01

To evaluate microbial eukaryotic diversity and distribution in mesoscale processes, we investigated 18S rDNA diversity in a river plume and cyclonic eddy-influenced ecosystem in the southwestern South China Sea (SCS). Restriction fragment length polymorphism analysis was carried out using multiple primer sets. Relative to a wide range of previous similar studies, we observed a significantly higher proportion of sequences of pigmented taxa. Among the photosynthetic groups, Haptophyta accounted for 27.7% of the sequenced clones, which belonged primarily to Prymnesiophyceae. Unexpectedly, five operational taxonomic units of Cryptophyta were closely related to freshwater species. The Chlorophyta mostly fell within the Prasinophyceae, which was comprised of six clades, including Clade III, which is detected in the SCS for the first time in this study. Among the photosynthetic stramenopiles, Chrysophyceae was the most diverse taxon, which included seven clades. The majority of 18S rDNA sequences affiliated with the Dictyochophyceae, Eustigmatophyceae, and Pelagophyceae were closely related to those of pure cultures. The results of redundancy analysis and the permutation Mantel test based on unweighted UniFrac distances, conducted for spatial analyses of the Haptophyta subclades suggested that the Mekong River plume and cyclonic eddy play important roles in regulating microbial eukaryotic diversity and distribution in the southwestern SCS. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

Physicochemical control of bacterial and protist community composition and diversity in Antarctic sea ice.

PubMed

Torstensson, Anders; Dinasquet, Julie; Chierici, Melissa; Fransson, Agneta; Riemann, Lasse; Wulff, Angela

2015-10-01

Due to climate change, sea ice experiences changes in terms of extent and physical properties. In order to understand how sea ice microbial communities are affected by changes in physicochemical properties of the ice, we used 454-sequencing of 16S and 18S rRNA genes to examine environmental control of microbial diversity and composition in Antarctic sea ice. We observed a high diversity and richness of bacteria, which were strongly negatively correlated with temperature and positively with brine salinity. We suggest that bacterial diversity in sea ice is mainly controlled by physicochemical properties of the ice, such as temperature and salinity, and that sea ice bacterial communities are sensitive to seasonal and environmental changes. For the first time in Antarctic interior sea ice, we observed a strong eukaryotic dominance of the dinoflagellate phylotype SL163A10, comprising 63% of the total sequences. This phylotype is known to be kleptoplastic and could be a significant primary producer in sea ice. We conclude that mixotrophic flagellates may play a greater role in the sea ice microbial ecosystem than previously believed, and not only during the polar night but also during summer when potential food sources are abundant. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.

PubMed

Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A

2006-08-01

The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.
[Sequence-based typing of enviromental Legionella pneumophila isolates in Guangzhou].

PubMed

Zhang, Ying; Qu, Pinghua; Zhang, Jian; Chen, Shouyi

2011-03-01

To characterize the genes of Legionella pneumophila isolated from different water source in Guangzhou from 2006 to 2009. To genotype the strains by using sequence-based typing (SBT) scheme. In total 44 L. pneumophila strains were identified by SBT with 7 diversifying genes of flaA, asd, mip, pilE, mompS, proA and neuA. Analysis of the amplicons sequence was taken in the European Working Group for Legionella Infections (EWGLI) international SBT database to obtain the allelic profiles and sequence types (STs). Serogroups were typed by latex agglutination test. Data from SBT revealed a high diversity among the strains and ST01 accounts for 30% (13/ 44). Fifteen new STs were discovered from 20 STs and 2 of them were newly assigned (ST887 and ST888) by EWGLI. SBT Phylogenetic tree was generated by SplitsTree and BURST programs. High diversity and specificity were observed of the L. pneumophila strains in Guangzhou. SBT is useful for L. pneumophila genomic study and epidemiological surveillance.
Diversity in the 18S SSU rRNA V4 hyper-variable region of Theileria spp. in Cape buffalo (Syncerus caffer) and cattle from southern Africa.

PubMed

Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T

2011-05-01

Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.
Prunus persica crop management as step toward AMF diversity conservation for the sustainable soil management

NASA Astrophysics Data System (ADS)

Alguacil, M. M.; Torrecillas, E.; Lozano, Z.; Garcia-Orenes, F.; Roldan, A.

2012-04-01

We investigated the diversity of arbuscular mycorrhizal fungi (AMF) in roots of Prunus persica under two fertilization treatments (CF: consisted of application of chicken manure (1400 kg.ha-1), urea (140 kg.ha-1), complex fertilizer 12-12-17/2 (280 kg.ha-1), and potassium sulfate (40 kg.ha-1) and IF: consisted of application of urea (140 kg.ha-1), complex fertilizer 12-12-17/2 (400 kg.ha-1) and potassium sulfate (70 kg.ha-1)) combined with integrated pest management (IM) or chemical pest management (CM), in a tropical agroecosystem in the north of Venezuela. Our goal was to ascertain how different fertilizers/pest management can modify the AMF diversity colonizing P. persica roots as an important step towards sustainable soil use and therefore protection of biodiversity. The AM fungal small-subunit (SSU) rRNA genes were subjected to PCR, cloning, sequencing and phylogenetic analyses. Twenty-one different phylotypes were identified, which were grouped in five families: Glomeraceae, Paraglomeraceae, Acaulosporaceae, Gigasporaceae and Archaeosporaceae. Sixteen of these sequence groups belonged to the genus Glomus, two to Paraglomus, one to Acaulospora, one to Scutellospora and one to Archaeospora. A different distribution of the AMF phylotypes as consequence of the difference between treatments was observed. Thus, the AMF communities of tree roots in the (IF+CM) treatment had the lowest diversity (H'=1.78) with the lowest total number of AMF sequence types (9). The trees from both (CF+IM) and (IF+IM) treatments had similar AMF diversity (H'?2.00); while the treatment (CF+CM) yielded the highest number of different AMF sequence types (17) and showed the highest diversity index (H'=2.69). In conclusion, the crop management including combination of organic and inorganic fertilization and chemical pest control appears to be the most suitable strategy with respect to reactivate the AMF diversity in the roots of this crop and thus, the agricultural and environmental sustainability in the agroecosystem.
Characterizing the genetic diversity of the monkey malaria parasite Plasmodium cynomolgi

PubMed Central

Sutton, Patrick L.; Luo, Zunping; Divis, Paul C. S.; Friedrich, Volney K.; Conway, David J.; Singh, Balbir; Barnwell, John W.; Carlton, Jane M.; Sullivan, Steven A.

2016-01-01

Plasmodium cynomolgi is a malaria parasite that typically infects Asian macaque monkeys, and humans on rare occasions. P. cynomolgi serves as a model system for the human malaria parasite Plasmodium vivax, with which it shares such important biological characteristics as formation of a dormant liver stage and a preference to invade reticulocytes. While genomes of three P. cynomolgi strains have been sequenced, genetic diversity of P. cynomolgi has not been widely investigated. To address this we developed the first panel of P. cynomolgi microsatellite markers to genotype eleven P. cynomolgi laboratory strains and 18 field isolates from Sarawak, Malaysian Borneo. We found diverse genotypes among most of the laboratory strains, though two nominally different strains were found to be genetically identical, We also investigated sequence polymorphism in two erythrocyte invasion gene families, the reticulocyte binding protein and Duffy binding protein genes, in these strains. We also observed copy number variation in rbp genes. PMID:26980604
Global biogeographic sampling of bacterial secondary metabolism

PubMed Central

Charlop-Powers, Zachary; Owen, Jeremy G; Reddy, Boojala Vijay B; Ternei, Melinda A; Guimarães, Denise O; de Frias, Ulysses A; Pupo, Monica T; Seepe, Prudy; Feng, Zhiyang; Brady, Sean F

2015-01-01

Recent bacterial (meta)genome sequencing efforts suggest the existence of an enormous untapped reservoir of natural-product-encoding biosynthetic gene clusters in the environment. Here we use the pyro-sequencing of PCR amplicons derived from both nonribosomal peptide adenylation domains and polyketide ketosynthase domains to compare biosynthetic diversity in soil microbiomes from around the globe. We see large differences in domain populations from all except the most proximal and biome-similar samples, suggesting that most microbiomes will encode largely distinct collections of bacterial secondary metabolites. Our data indicate a correlation between two factors, geographic distance and biome-type, and the biosynthetic diversity found in soil environments. By assigning reads to known gene clusters we identify hotspots of biomedically relevant biosynthetic diversity. These observations not only provide new insights into the natural world, they also provide a road map for guiding future natural products discovery efforts. DOI: http://dx.doi.org/10.7554/eLife.05048.001 PMID:25599565
Phylogenetic analysis reveals conservation and diversification of micro RNA166 genes among diverse plant species.

PubMed

Barik, Suvakanta; SarkarDas, Shabari; Singh, Archita; Gautam, Vibhav; Kumar, Pramod; Majee, Manoj; Sarkar, Ananda K

2014-01-01

Similar to the majority of the microRNAs, mature miR166s are derived from multiple members of MIR166 genes (precursors) and regulate various aspects of plant development by negatively regulating their target genes (Class III HD-ZIP). The evolutionary conservation or functional diversification of miRNA166 family members remains elusive. Here, we show the phylogenetic relationships among MIR166 precursor and mature sequences from three diverse model plant species. Despite strong conservation, some mature miR166 sequences, such as ppt-miR166m, have undergone sequence variation. Critical sequence variation in ppt-miR166m has led to functional diversification, as it targets non-HD-ZIPIII gene transcript (s). MIR166 precursor sequences have diverged in a lineage specific manner, and both precursors and mature osa-miR166i/j are highly conserved. Interestingly, polycistronic MIR166s were present in Physcomitrella and Oryza but not in Arabidopsis. The nature of cis-regulatory motifs on the upstream promoter sequences of MIR166 genes indicates their possible contribution to the functional variation observed among miR166 species. Copyright © 2013 Elsevier Inc. All rights reserved.
Global phylogenetic analysis of Escherichia coli and plasmids carrying the mcr-1 gene indicates bacterial diversity but plasmid restriction.

PubMed

Matamoros, Sébastien; van Hattem, Jarne M; Arcilla, Maris S; Willemse, Niels; Melles, Damian C; Penders, John; Vinh, Trung Nguyen; Thi Hoa, Ngo; de Jong, Menno D; Schultsz, Constance

2017-11-10

To understand the dynamics behind the worldwide spread of the mcr-1 gene, we determined the population structure of Escherichia coli and of mobile genetic elements (MGEs) carrying the mcr-1 gene. After a systematic review of the literature we included 65 E. coli whole genome sequences (WGS), adding 6 recently sequenced travel related isolates, and 312 MLST profiles. We included 219 MGEs described in 7 Enterobacteriaceae species isolated from human, animal and environmental samples. Despite a high overall diversity, 2 lineages were observed in the E. coli population that may function as reservoirs of the mcr-1 gene, the largest of which was linked to ST10, a sequence type known for its ubiquity in human faecal samples and in food samples. No genotypic clustering by geographical origin or isolation source was observed. Amongst a total of 13 plasmid incompatibility types, the IncI2, IncX4 and IncHI2 plasmids accounted for more than 90% of MGEs carrying the mcr-1 gene. We observed significant geographical clustering with regional spread of IncHI2 plasmids in Europe and IncI2 in Asia. These findings point towards promiscuous spread of the mcr-1 gene by efficient horizontal gene transfer dominated by a limited number of plasmid incompatibility types.
Statistical inference of the generation probability of T-cell receptors from sequence repertoires.

PubMed

Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G

2012-10-02

Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.

PubMed

Yooseph, Shibu; Sutton, Granger; Rusch, Douglas B; Halpern, Aaron L; Williamson, Shannon J; Remington, Karin; Eisen, Jonathan A; Heidelberg, Karla B; Manning, Gerard; Li, Weizhong; Jaroszewski, Lukasz; Cieplak, Piotr; Miller, Christopher S; Li, Huiying; Mashiyama, Susan T; Joachimiak, Marcin P; van Belle, Christopher; Chandonia, John-Marc; Soergel, David A; Zhai, Yufeng; Natarajan, Kannan; Lee, Shaun; Raphael, Benjamin J; Bafna, Vineet; Friedman, Robert; Brenner, Steven E; Godzik, Adam; Eisenberg, David; Dixon, Jack E; Taylor, Susan S; Strausberg, Robert L; Frazier, Marvin; Venter, J Craig

2007-03-01

Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS) sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans) from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature.
Comparative genomics reveals high biological diversity and specific adaptations in the industrially and medically important fungal genus Aspergillus.

PubMed

de Vries, Ronald P; Riley, Robert; Wiebenga, Ad; Aguilar-Osorio, Guillermo; Amillis, Sotiris; Uchima, Cristiane Akemi; Anderluh, Gregor; Asadollahi, Mojtaba; Askin, Marion; Barry, Kerrie; Battaglia, Evy; Bayram, Özgür; Benocci, Tiziano; Braus-Stromeyer, Susanna A; Caldana, Camila; Cánovas, David; Cerqueira, Gustavo C; Chen, Fusheng; Chen, Wanping; Choi, Cindy; Clum, Alicia; Dos Santos, Renato Augusto Corrêa; Damásio, André Ricardo de Lima; Diallinas, George; Emri, Tamás; Fekete, Erzsébet; Flipphi, Michel; Freyberg, Susanne; Gallo, Antonia; Gournas, Christos; Habgood, Rob; Hainaut, Matthieu; Harispe, María Laura; Henrissat, Bernard; Hildén, Kristiina S; Hope, Ryan; Hossain, Abeer; Karabika, Eugenia; Karaffa, Levente; Karányi, Zsolt; Kraševec, Nada; Kuo, Alan; Kusch, Harald; LaButti, Kurt; Lagendijk, Ellen L; Lapidus, Alla; Levasseur, Anthony; Lindquist, Erika; Lipzen, Anna; Logrieco, Antonio F; MacCabe, Andrew; Mäkelä, Miia R; Malavazi, Iran; Melin, Petter; Meyer, Vera; Mielnichuk, Natalia; Miskei, Márton; Molnár, Ákos P; Mulé, Giuseppina; Ngan, Chew Yee; Orejas, Margarita; Orosz, Erzsébet; Ouedraogo, Jean Paul; Overkamp, Karin M; Park, Hee-Soo; Perrone, Giancarlo; Piumi, Francois; Punt, Peter J; Ram, Arthur F J; Ramón, Ana; Rauscher, Stefan; Record, Eric; Riaño-Pachón, Diego Mauricio; Robert, Vincent; Röhrig, Julian; Ruller, Roberto; Salamov, Asaf; Salih, Nadhira S; Samson, Rob A; Sándor, Erzsébet; Sanguinetti, Manuel; Schütze, Tabea; Sepčić, Kristina; Shelest, Ekaterina; Sherlock, Gavin; Sophianopoulou, Vicky; Squina, Fabio M; Sun, Hui; Susca, Antonia; Todd, Richard B; Tsang, Adrian; Unkles, Shiela E; van de Wiele, Nathalie; van Rossen-Uffink, Diana; Oliveira, Juliana Velasco de Castro; Vesth, Tammi C; Visser, Jaap; Yu, Jae-Hyuk; Zhou, Miaomiao; Andersen, Mikael R; Archer, David B; Baker, Scott E; Benoit, Isabelle; Brakhage, Axel A; Braus, Gerhard H; Fischer, Reinhard; Frisvad, Jens C; Goldman, Gustavo H; Houbraken, Jos; Oakley, Berl; Pócsi, István; Scazzocchio, Claudio; Seiboth, Bernhard; vanKuyk, Patricia A; Wortman, Jennifer; Dyer, Paul S; Grigoriev, Igor V

2017-02-14

The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eight aspergilli have already been explored to investigate aspects of fungal biology, raising questions about evolution and specialization within this genus. We have generated genome sequences for ten novel, highly diverse Aspergillus species and compared these in detail to sister and more distant genera. Comparative studies of key aspects of fungal biology, including primary and secondary metabolism, stress response, biomass degradation, and signal transduction, revealed both conservation and diversity among the species. Observed genomic differences were validated with experimental studies. This revealed several highlights, such as the potential for sex in asexual species, organic acid production genes being a key feature of black aspergilli, alternative approaches for degrading plant biomass, and indications for the genetic basis of stress response. A genome-wide phylogenetic analysis demonstrated in detail the relationship of the newly genome sequenced species with other aspergilli. Many aspects of biological differences between fungal species cannot be explained by current knowledge obtained from genome sequences. The comparative genomics and experimental study, presented here, allows for the first time a genus-wide view of the biological diversity of the aspergilli and in many, but not all, cases linked genome differences to phenotype. Insights gained could be exploited for biotechnological and medical applications of fungi.
Diversity of Pico- to Mesoplankton along the 2000 km Salinity Gradient of the Baltic Sea

PubMed Central

Hu, Yue O. O.; Karlson, Bengt; Charvet, Sophie; Andersson, Anders F.

2016-01-01

Microbial plankton form the productive base of both marine and freshwater ecosystems and are key drivers of global biogeochemical cycles of carbon and nutrients. Plankton diversity is immense with representations from all major phyla within the three domains of life. So far, plankton monitoring has mainly been based on microscopic identification, which has limited sensitivity and reproducibility, not least because of the numerical majority of plankton being unidentifiable under the light microscope. High-throughput sequencing of taxonomic marker genes offers a means to identify taxa inaccessible by traditional methods; thus, recent studies have unveiled an extensive previously unknown diversity of plankton. Here, we conducted ultra-deep Illumina sequencing (average 105 sequences/sample) of rRNA gene amplicons of surface water eukaryotic and bacterial plankton communities sampled in summer along a 2000 km transect following the salinity gradient of the Baltic Sea. Community composition was strongly correlated with salinity for both bacterial and eukaryotic plankton assemblages, highlighting the importance of salinity for structuring the biodiversity within this ecosystem. In contrast, no clear trends in alpha-diversity for bacterial or eukaryotic communities could be detected along the transect. The distribution of major planktonic taxa followed expected patterns as observed in monitoring programs, but groups novel to the Baltic Sea were also identified, such as relatives to the coccolithophore Emiliana huxleyi detected in the northern Baltic Sea. This study provides the first ultra-deep sequencing-based survey on eukaryotic and bacterial plankton biogeography in the Baltic Sea. PMID:27242706
Within-Host Variations of Human Papillomavirus Reveal APOBEC Signature Mutagenesis in the Viral Genome.

PubMed

Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao

2018-06-15

Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Deep-branching Novel Lineages and High Diversity of Haptophytes in the Skagerrak (Norway) Uncovered by 454 Pyrosequencing

PubMed Central

Egge, Elianne S; Eikrem, Wenche; Edvardsen, Bente

2015-01-01

Microalgae in the division Haptophyta may be difficult to identify to species by microscopy because they are small and fragile. Here, we used high-throughput sequencing to explore the diversity of haptophytes in outer Oslofjorden, Skagerrak, and supplemented this with electron microscopy. Nano- and picoplanktonic subsurface samples were collected monthly for 2 yr, and the haptophytes were targeted by amplification of RNA/cDNA with Haptophyta-specific 18S ribosomal DNA V4 primers. Pyrosequencing revealed higher species richness of haptophytes than previously observed in the Skagerrak by microscopy. From ca. 400,000 reads we obtained 156 haptophyte operational taxonomic units (OTUs) after rigorous filtering and 99.5% clustering. The majority (84%) of the OTUs matched environmental sequences not linked to a morphological species, most of which were affiliated with the order Prymnesiales. Phylogenetic analyses including Oslofjorden OTUs and available cultured and environmental haptophyte sequences showed that several of the OTUs matched sequences forming deep-branching lineages, potentially representing novel haptophyte classes. Pyrosequencing also retrieved cultured species not previously reported by microscopy in the Skagerrak. Electron microscopy revealed species not yet genetically characterised and some potentially novel taxa. This study contributes to linking genotype to phenotype within this ubiquitous and ecologically important protist group, and reveals great, unknown diversity. PMID:25099994
Genetic diversity of HIV-1 non-B strains in Sicily: evidence of intersubtype recombinants by sequence analysis of gag, pol, and env genes.

PubMed

Tramuto, Fabio; Bonura, Filippa; Perna, Anna Maria; Mancuso, Salvatrice; Firenze, Alberto; Romano, Nino; Vitale, Francesco

2007-09-01

The molecular epidemiology of HIV-1 strains in Sicily (Italy) was phylogenetically investigated by the analysis of HIV-1 gag, pol, and env gene sequences from 11 HIV-1 non-B strains from 408 HIV-1-seropositive patients observed from September 2001 to August 2006. Sequences suggestive of recombination were further investigated by bootscanning analysis of various fragments. Overall, we identified several second-generation recombinant (SGRs) strains, which contained genetic material of CRF02_AG in at least one gene. Notably, three individuals were found to be infected with subsubtype A3, and one of them showed genetic recombination with subsubtype A4. The current study emphasizes the genetic analysis of gag, pol, and env genes as a powerful tool to trace the spread of complex HIV-1 recombinant forms, and highlight the genetic diversity of HIV-1 non-B strains in Italy.
Anchoring genome sequence to chromosomes of the central bearded dragon (Pogona vitticeps) enables reconstruction of ancestral squamate macrochromosomes and identifies sequence content of the Z chromosome.

PubMed

Deakin, Janine E; Edwards, Melanie J; Patel, Hardip; O'Meally, Denis; Lian, Jinmin; Stenhouse, Rachael; Ryan, Sam; Livernois, Alexandra M; Azad, Bhumika; Holleley, Clare E; Li, Qiye; Georges, Arthur

2016-06-10

Squamates (lizards and snakes) are a speciose lineage of reptiles displaying considerable karyotypic diversity, particularly among lizards. Understanding the evolution of this diversity requires comparison of genome organisation between species. Although the genomes of several squamate species have now been sequenced, only the green anole lizard has any sequence anchored to chromosomes. There is only limited gene mapping data available for five other squamates. This makes it difficult to reconstruct the events that have led to extant squamate karyotypic diversity. The purpose of this study was to anchor the recently sequenced central bearded dragon (Pogona vitticeps) genome to chromosomes to trace the evolution of squamate chromosomes. Assigning sequence to sex chromosomes was of particular interest for identifying candidate sex determining genes. By using two different approaches to map conserved blocks of genes, we were able to anchor approximately 42 % of the dragon genome sequence to chromosomes. We constructed detailed comparative maps between dragon, anole and chicken genomes, and where possible, made broader comparisons across Squamata using cytogenetic mapping information for five other species. We show that squamate macrochromosomes are relatively well conserved between species, supporting findings from previous molecular cytogenetic studies. Macrochromosome diversity between members of the Toxicofera clade has been generated by intrachromosomal, and a small number of interchromosomal, rearrangements. We reconstructed the ancestral squamate macrochromosomes by drawing upon comparative cytogenetic mapping data from seven squamate species and propose the events leading to the arrangements observed in representative species. In addition, we assigned over 8 Mbp of sequence containing 219 genes to the Z chromosome, providing a list of genes to begin testing as candidate sex determining genes. Anchoring of the dragon genome has provided substantial insight into the evolution of squamate genomes, enabling us to reconstruct ancestral macrochromosome arrangements at key positions in the squamate phylogeny, demonstrating that fusions between macrochromosomes or fusions of macrochromosomes and microchromosomes, have played an important role during the evolution of squamate genomes. Assigning sequence to the sex chromosomes has identified NR5A1 as a promising candidate sex determining gene in the dragon.
Diversity of Secondary Structure in Catalytic Peptides with β-Turn-Biased Sequences

PubMed Central

2016-01-01

X-ray crystallography has been applied to the structural analysis of a series of tetrapeptides that were previously assessed for catalytic activity in an atroposelective bromination reaction. Common to the series is a central Pro-Xaa sequence, where Pro is either l- or d-proline, which was chosen to favor nucleation of canonical β-turn secondary structures. Crystallographic analysis of 35 different peptide sequences revealed a range of conformational states. The observed differences appear not only in cases where the Pro-Xaa loop-region is altered, but also when seemingly subtle alterations to the flanking residues are introduced. In many instances, distinct conformers of the same sequence were observed, either as symmetry-independent molecules within the same unit cell or as polymorphs. Computational studies using DFT provided additional insight into the analysis of solid-state structural features. Select X-ray crystal structures were compared to the corresponding solution structures derived from measured proton chemical shifts, 3J-values, and 1H–1H-NOESY contacts. These findings imply that the conformational space available to simple peptide-based catalysts is more diverse than precedent might suggest. The direct observation of multiple ground state conformations for peptides of this family, as well as the dynamic processes associated with conformational equilibria, underscore not only the challenge of designing peptide-based catalysts, but also the difficulty in predicting their accessible transition states. These findings implicate the advantages of low-barrier interconversions between conformations of peptide-based catalysts for multistep, enantioselective reactions. PMID:28029251
A genomic scale map of genetic diversity in Trypanosoma cruzi

PubMed Central

2012-01-01

Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the population, providing an essential resource for future studies on the development of new drugs and diagnostics, for Chagas Disease. These data is available through the TcSNP database (http://snps.tcruzi.org). PMID:23270511
Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia.

PubMed

De Silva, Jeremy Ryan; Lau, Yee Ling; Fong, Mun Yik

2017-01-03

The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infection in South East Asia. Its merozoite surface protein-3 (MSP3) is a protein that belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated the potential of P. falciparum MSP3 as a potential vaccine candidate. However, to date no detailed studies have been carried out on P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity, and haplotypes groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia. Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes. From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups. A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.

Partial bisulfite conversion for unique template sequencing

PubMed Central

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

2018-01-01

Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
Sequence diversity patterns suggesting balancing selection in partially sex-linked genes of the plant Silene latifolia are not generated by demographic history or gene flow.

PubMed

Guirao-Rico, Sara; Sánchez-Gracia, Alejandro; Charlesworth, Deborah

2017-03-01

DNA sequence diversity in genes in the partially sex-linked pseudoautosomal region (PAR) of the sex chromosomes of the plant Silene latifolia is higher than expected from within-species diversity of other genes. This could be the footprint of sexually antagonistic (SA) alleles that are maintained by balancing selection in a PAR gene (or genes) and affect polymorphism in linked genome regions. SA selection is predicted to occur during sex chromosome evolution, but it is important to test whether the unexpectedly high sequence polymorphism could be explained without it, purely by the combined effects of partial linkage with the sex-determining region and the population's demographic history, including possible introgression from Silene dioica. To test this, we applied approximate Bayesian computation-based model choice to autosomal sequence diversity data, to find the most plausible scenario for the recent history of S. latifolia and then to estimate the posterior density of the most relevant parameters. We then used these densities to simulate variation to be expected at PAR genes. We conclude that an excess of variants at high frequencies at PAR genes should arise in S. latifolia populations only for genes with strong associations with fully sex-linked genes, which requires closer linkage with the fully sex-linked region than that estimated for the PAR genes where apparent deviations from neutrality were observed. These results support the need to invoke selection to explain the S. latifolia PAR gene diversity, and encourage further work to test the possibility of balancing selection due to sexual antagonism. © 2016 John Wiley & Sons Ltd.
Molecular diversity of α-gliadin expressed genes in genetically contrasted spelt (Triticum aestivum ssp. spelta) accessions and comparison with bread wheat (T. aestivum ssp. aestivum) and related diploid Triticum and Aegilops species.

PubMed

Dubois, Benjamin; Bertin, Pierre; Mingeot, Dominique

2016-01-01

The gluten proteins of cereals such as bread wheat ( Triticum aestivum ssp. aestivum ) and spelt ( T. aestivum ssp. spelta ) are responsible for celiac disease (CD). The α-gliadins constitute the most immunogenic class of gluten proteins as they include four main T-cell stimulatory epitopes that affect CD patients. Spelt has been less studied than bread wheat and could constitute a source of valuable diversity. The objective of this work was to study the genetic diversity of spelt α-gliadin transcripts and to compare it with those of bread wheat. Genotyping data from 85 spelt accessions obtained with 19 simple sequence repeat (SSR) markers were used to select 11 contrasted accessions, from which 446 full open reading frame α-gliadin genes were cloned and sequenced, which revealed a high allelic diversity. High variations among the accessions were highlighted, in terms of the proportion of α-gliadin sequences from each of the three genomes (A, B and D), and their composition in the four T-cell stimulatory epitopes. An accession from Tajikistan stood out, having a particularly high proportion of α-gliadins from the B genome and a low immunogenic content. Even if no clear separation between spelt and bread wheat sequences was shown, spelt α-gliadins displayed specific features concerning e.g. the frequencies of some amino acid substitutions. Given this observation and the variations in toxicity revealed in the spelt accessions in this study, the high genetic diversity held in spelt germplasm collections could be a valuable resource in the development of safer varieties for CD patients.
Phylodynamic analysis and molecular diversity of the avian infectious bronchitis virus of chickens in Brazil.

PubMed

Fraga, Aline Padilha de; Gräf, Tiago; Pereira, Cleiton Schneider; Ikuta, Nilo; Fonseca, André Salvador Kazantzi; Lunge, Vagner Ricardo

2018-07-01

Avian infectious bronchitis virus (IBV) is the etiological agent of a highly contagious disease, which results in severe economic losses to the poultry industry. The spike protein (S1 subunit) is responsible for the molecular diversity of the virus and many sero/genotypes are described around the world. Recently a new standardized classification of the IBV molecular diversity was conducted, based on phylogenetic analysis of the S1 gene sequences sampled worldwide. Brazil is one of the biggest poultry producers in the world and the present study aimed to review the molecular diversity and reconstruct the evolutionary history of IBV in the country. All IBV S1 gene sequences, with local and year of collection information available on GenBank, were retrieved. Phylogenetic analyses were carried out based on a maximum likelihood method for the classification of genotypes occurring in Brazil, according to the new classification. Bayesian phylogenetic analyses were performed with the Brazilian clade and related international sequences to determine the evolutionary history of IBV in Brazil. A total of 143 Brazilian sequences were classified as GI-11 and 46 as GI-1 (Mass). Within the GI-11 clade, we have identified a potential recombinant strain circulating in Brazil. Phylodynamic analysis demonstrated that IBV GI-11 lineage was introduced in Brazil in the 1950s (1951, 1917-1975 95% HPD) and population dynamics was mostly constant throughout the time. Despite the national vaccination protocols, our results show the widespread dissemination and maintenance of the IBV GI-11 lineage in Brazil and highlight the importance of continuous surveillance to evaluate the impact of currently used vaccine strains on the observed viral diversity of the country. Copyright © 2018 Elsevier B.V. All rights reserved.
Structural diversity of domain superfamilies in the CATH database.

PubMed

Reeves, Gabrielle A; Dallman, Timothy J; Redfern, Oliver C; Akpor, Adrian; Orengo, Christine A

2006-07-14

The CATH database of domain structures has been used to explore the structural variation of homologous domains in 294 well populated domain structure superfamilies, each containing at least three sequence diverse relatives. Our analyses confirm some previously detected trends relating sequence divergence to structural variation but for a much larger dataset and in some superfamilies the new data reveal exceptional structural variation. Use of a new algorithm (2DSEC) to analyse variability in secondary structure compositions across a superfamily sheds new light on how structures evolve. 2DSEC detects inserted secondary structures that embellish the core of conserved secondary structures found throughout the superfamily. Analysis showed that for 56% of highly populated superfamilies (>9 sequence diverse relatives), there are twofold or more increases in the numbers of secondary structures in some relatives. In some families fivefold increases occur, sometimes modifying the fold of the domain. Manual inspection of secondary structure insertions or embellishments in 48 particularly variable superfamilies revealed that although these insertions were usually discontiguous in the sequence they were often co-located in 3D resulting in a larger structural motif that often modified the geometry of the active site or the surface conformation promoting diverse domain partnerships and protein interactions. These observations, supported by automatic analysis of all well populated CATH families, suggest that accretion of small secondary structure insertions may provide a simple mechanism for evolving new functions in diverse relatives. Some layered domain architectures (e.g. mainly-beta and alpha-beta sandwiches) that recur highly in the genomes more frequently exploit these types of embellishments to modify function. In these architectures, aggregation occurs most often at the edges, top or bottom of the beta-sheets. Information on structural variability across domain superfamilies has been made available through the CATH Dictionary of Homologous Structures (DHS).
Dynamics of soil diazotrophic community structure, diversity, and functioning during the cropping period of cotton (Gossypium hirsutum).

PubMed

Rai, Sandhya; Singh, Dileep Kumar; Annapurna, Kannepalli

2015-01-01

The soil sampled at different growth stages along the cropping period of cotton were analyzed using various molecular tools: restriction fragment length polymorphism (RFLP), terminal restriction length polymorphism (T-RFLP), and cloning-sequencing. The cluster analysis of the diazotrophic community structure of early sampled soil (0, 15, and 30 days) was found to be more closely related to each other than the later sampled one. Phylogenetic and diversity analysis of sequences obtained from the first (0 Day; C0) and last soil sample (180 day; C180) confirmed the data. The phylogenetic analysis revealed that C0 was having more unique sequences than C180 (presence of γ-Proteobacteria exclusively in C0). A relatively higher richness of diazotrophic community sequences was observed in C0 (S(ACE) : 30.76; S(Chao1) : 20.94) than C180 (S(ACE) : 18.00; S(Chao1) : 18.00) while the evenness component of Shannon diversity index increased from C0 (0.97) to C180 (1.15). The impact of routine agricultural activities was more evident based on diazotrophic activity (measured by acetylene reduction assay) than its structure and diversity. The nitrogenase activity of C0 (1264.85 ± 35.7 ηmol of ethylene production g(-1) dry soil h(-1) ) was statistically higher when compared to all other values (p < 0.05). There was no correlation found between diazotrophic community structure/diversity and N2 fixation rates. Thus, considerable functional redundancy of nifH was concluded to be existing at the experimental site. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Diversity of the small subunit ribosomal RNA gene of the arbuscular mycorrhizal fungi colonizing Clintonia borealis from a mixed-wood boreal forest.

PubMed

DeBellis, Tonia; Widden, Paul

2006-11-01

Arbuscular mycorrhizal fungi (AMF) communities in Clintonia borealis roots from a boreal mixed forests in northwestern Québec were investigated. Roots were sampled from 100 m2 plots whose overstory was dominated by either trembling aspen (Populus tremuloides Michx.), white birch (Betula papyrifera Marsh.), or mixed white spruce (Picea glauca (Moench) Voss) and balsam fir (Abies balsamea (L.) Mill.). Part of the 18S ribosomal gene of the AMF was amplified and the resulting PCR products were cloned. Restriction analysis of the 576 resulting clones yielded 92 different restriction patterns which were then sequenced. Fifty-two sequences closely matched other Glomus sequences from Genbank. Phylogenetic analysis revealed 10 different AMF sequence types, most of which clustered with other uncultured AM sequences from plant roots from various field sites. Compared with other AMF communities from comparable studies, richness and diversity were higher than observed in an arable field, but lower than seen in a tropical forest and a temperate wetland. The AMF communities from Clintonia roots under the different canopy types did not differ significantly and the dominant sequence type, which clustered with AM sequences from a variety of environments and hosts at distant geographical locations, represented 66.9% of all the clones analyzed.
Comparing Ecological and Genetic Diversity Within the Marine Diatom Genus Pseudo-nitzschia: A Multiregional Synthesis

NASA Astrophysics Data System (ADS)

Hubbard, K.; Bruzek, S.

2016-02-01

The globally distributed marine diatom genus Pseudo-nitzschia consists of approximately 40 species, more than half of which occur in US coastal waters. Here, sensitive genetic tools targeting a variable portion of the internal transcribed spacer 1 (ITS1) region of the rRNA gene were used to assess Pseudo-nitzschia spp. diversity in more than 600 environmental DNA samples collected from US Atlantic, Pacific, and Gulf of Mexico waters. Community-based approaches employed genus-specific primers for environmental DNA fingerprinting and targeted sequencing. For the Gulf of Mexico samples especially, a nested PCR approach (with or without degenerate primers) improved resolution of species diversity. To date, more than 40 unique ITS1 amplicon sizes have been repeatedly observed in ITS1 fingerprints. Targeted sequencing of environmental DNA as well as single chains isolated from live samples indicate that many of these represent novel and known inter- and intra-specific Pseudo-nitzschia diversity. A few species (e.g., P. pungens, P. cuspidata) occur across all three regions, whereas other species and intraspecific variants occurred at local to regional spatial scales only. Generally, species frequently co-occur in complex assemblages, and transitions in Pseudo-nitzschia community composition occur seasonally, prior to bloom initiation, and across (cross-shelf, latitudinal, and vertical) environmental gradients. These observations highlight the dynamic nature of diatom community composition in the marine environment and the importance of classifying diversity at relevant ecological and/or taxonomic scales.
Comparative RNA sequencing reveals substantial genetic variation in endangered primates

PubMed Central

Perry, George H.; Melsted, Páll; Marioni, John C.; Wang, Ying; Bainer, Russell; Pickrell, Joseph K.; Michelini, Katelyn; Zehr, Sarah; Yoder, Anne D.; Stephens, Matthew; Pritchard, Jonathan K.; Gilad, Yoav

2012-01-01

Comparative genomic studies in primates have yielded important insights into the evolutionary forces that shape genetic diversity and revealed the likely genetic basis for certain species-specific adaptations. To date, however, these studies have focused on only a small number of species. For the majority of nonhuman primates, including some of the most critically endangered, genome-level data are not yet available. In this study, we have taken the first steps toward addressing this gap by sequencing RNA from the livers of multiple individuals from each of 16 mammalian species, including humans and 11 nonhuman primates. Of the nonhuman primate species, five are lemurs and two are lorisoids, for which little or no genomic data were previously available. To analyze these data, we developed a method for de novo assembly and alignment of orthologous gene sequences across species. We assembled an average of 5721 gene sequences per species and characterized diversity and divergence of both gene sequences and gene expression levels. We identified patterns of variation that are consistent with the action of positive or directional selection, including an 18-fold enrichment of peroxisomal genes among genes whose regulation likely evolved under directional selection in the ancestral primate lineage. Importantly, we found no relationship between genetic diversity and endangered status, with the two most endangered species in our study, the black and white ruffed lemur and the Coquerel's sifaka, having the highest genetic diversity among all primates. Our observations imply that many endangered lemur populations still harbor considerable genetic variation. Timely efforts to conserve these species alongside their habitats have, therefore, strong potential to achieve long-term success. PMID:22207615
Genetic diversity analysis of Gossypium arboreum germplasm accessions using genotyping-by-sequencing.

PubMed

Li, Ruijuan; Erpelding, John E

2016-10-01

The diploid cotton species Gossypium arboreum possesses many favorable agronomic traits such as drought tolerance and disease resistance, which can be utilized in the development of improved upland cotton cultivars. The USDA National Plant Germplasm System maintains more than 1600 G. arboreum accessions. Little information is available on the genetic diversity of the collection thereby limiting the utilization of this cotton species. The genetic diversity and population structure of the G. arboreum germplasm collection were assessed by genotyping-by-sequencing of 375 accessions. Using genome-wide single nucleotide polymorphism sequence data, two major clusters were inferred with 302 accessions in Cluster 1, 64 accessions in Cluster 2, and nine accessions unassigned due to their nearly equal membership to each cluster. These two clusters were further evaluated independently resulting in the identification of two sub-clusters for the 302 Cluster 1 accessions and three sub-clusters for the 64 Cluster 2 accessions. Low to moderate genetic diversity between clusters and sub-clusters were observed indicating a narrow genetic base. Cluster 2 accessions were more genetically diverse and the majority of the accessions in this cluster were landraces. In contrast, Cluster 1 is composed of varieties or breeding lines more recently added to the collection. The majority of the accessions had kinship values ranging from 0.6 to 0.8. Eight pairs of accessions were identified as potential redundancies due to their high kinship relatedness. The genetic diversity and genotype data from this study are essential to enhance germplasm utilization to identify genetically diverse accessions for the detection of quantitative trait loci associated with important traits that would benefit upland cotton improvement.
Genetic diversity of the Plasmodium falciparum apical membrane antigen I gene in parasite population from the China-Myanmar border area.

PubMed

Zhu, Xiaotong; Zhao, Zhenjun; Feng, Yonghui; Li, Peipei; Liu, Fei; Liu, Jun; Yang, Zhaoqing; Yan, Guiyun; Fan, Qi; Cao, Yaming; Cui, Liwang

2016-04-01

To investigate the genetic diversity of the Plasmodium falciparum apical membrane antigen 1 (PfAMA1) gene in Southeast Asia, we determined PfAMA1 sequences from 135 field isolates collected from the China-Myanmar border area and compared them with 956 publically available PfAMA1 sequences from seven global P. falciparum populations. This analysis revealed high genetic diversity of PfAMA1 in global P. falciparum populations with a total of 229 haplotypes identified. The genetic diversity of PfAMA1 gene from the China-Myanmar border is not evenly distributed in the different domains of this gene. Sequence diversity in PfAMA1 from the China-Myanmar border is lower than that observed in Thai, African and Oceanian populations, but higher than that in the South American population. This appeared to correlate well with the levels of endemicity of different malaria-endemic regions, where hyperendemic regions favor genetic cross of the parasite isolates and generation of higher genetic diversity. Neutrality tests show significant departure from neutrality in the entire ectodomain and Domain I of PfAMA1 in the China-Myanmar border parasite population. We found evidence supporting a substantial continent-wise genetic structure among P. falciparum populations, with the highest genetic differentiation detected between the China-Myanmar border and the South American populations. Whereas no alleles were unique to a specific region, there were considerable geographical differences in major alleles and their frequencies, highlighting further necessity to include more PfAMA1 alleles in vaccine designs. Copyright © 2016 Elsevier B.V. All rights reserved.
Post-main-sequence planetary system evolution.

PubMed

Veras, Dimitri

2016-02-01

The fates of planetary systems provide unassailable insights into their formation and represent rich cross-disciplinary dynamical laboratories. Mounting observations of post-main-sequence planetary systems necessitate a complementary level of theoretical scrutiny. Here, I review the diverse dynamical processes which affect planets, asteroids, comets and pebbles as their parent stars evolve into giant branch, white dwarf and neutron stars. This reference provides a foundation for the interpretation and modelling of currently known systems and upcoming discoveries.
Diversity of Microbial Carbohydrate-Active enZYmes (CAZYmes) Associated with Freshwater and Soil Samples from Caatinga Biome.

PubMed

Andrade, Ana Camila; Fróes, Adriana; Lopes, Fabyano Álvares Cardoso; Thompson, Fabiano L; Krüger, Ricardo Henrique; Dinsdale, Elizabeth; Bruce, Thiago

2017-07-01

Semi-arid and arid areas occupy about 33% of terrestrial ecosystems. However, little information is available about microbial diversity in the semi-arid Caatinga, which represents a unique biome that extends to about 11% of the Brazilian territory and is home to extraordinary diversity and high endemism level of species. In this study, we characterized the diversity of microbial genes associated with biomass conversion (carbohydrate-active enzymes, or so-called CAZYmes) in soil and freshwater of the Caatinga. Our results showed distinct CAZYme profiles in the soil and freshwater samples. Glycoside hydrolases and glycosyltransferases were the most abundant CAZYme families, with glycoside hydrolases more dominant in soil (∼44%) and glycosyltransferases more abundant in freshwater (∼50%). The abundances of individual glycoside hydrolase, glycosyltransferase, and carbohydrate-binding module subfamilies varied widely between soil and water samples. A predominance of glycoside hydrolases was observed in soil, and a higher contribution of enzymes involved in carbohydrate biosynthesis was observed in freshwater. The main taxa associated with the CAZYme sequences were Planctomycetia (relative abundance in soil, 29%) and Alphaproteobacteria (relative abundance in freshwater, 27%). Approximately 5-7% of CAZYme sequences showed low similarity with sequences deposited in non-redundant databases, suggesting putative homologues. Our findings represent a first attempt to describe specific microbial CAZYme profiles for environmental samples. Characterizing these enzyme groups associated with the conversion of carbohydrates in nature will improve our understanding of the significant roles of enzymes in the carbon cycle. We identified a CAZYme signature that can be used to discriminate between soil and freshwater samples, and this signature may be related to the microbial species adapted to the habitat. The data show the potential ecological roles of the CAZYme repertoire and associated biotechnological applications.
Genetic diversity and differentiation in reef-building Millepora species, as revealed by cross-species amplification of fifteen novel microsatellite loci.

PubMed

Dubé, Caroline E; Planes, Serge; Zhou, Yuxiang; Berteaux-Lecellier, Véronique; Boissin, Emilie

2017-01-01

Quantifying the genetic diversity in natural populations is crucial to address ecological and evolutionary questions. Despite recent advances in whole-genome sequencing, microsatellite markers have remained one of the most powerful tools for a myriad of population genetic approaches. Here, we used the 454 sequencing technique to develop microsatellite loci in the fire coral Millepora platyphylla , an important reef-builder of Indo-Pacific reefs . We tested the cross-species amplification of these loci in five other species of the genus Millepora and analysed its success in correlation with the genetic distances between species using mitochondrial 16S sequences. We succeeded in discovering fifteen microsatellite loci in our target species M. platyphylla, among which twelve were polymorphic with 2-13 alleles and a mean observed heterozygosity of 0.411. Cross-species amplification in the five other Millepora species revealed a high probability of amplification success (71%) and polymorphism (59%) of the loci. Our results show no evidence of decreased heterozygosity with increasing genetic distance. However, only one locus enabled measures of genetic diversity in the Caribbean species M. complanata due to high proportions of null alleles for most of the microsatellites. This result indicates that our novel markers may only be useful for the Indo-Pacific species of Millepora. Measures of genetic diversity revealed significant linkage disequilibrium, moderate levels of observed heterozygosity (0.323-0.496) and heterozygote deficiencies for the Indo-Pacific species. The accessibility to new polymorphic microsatellite markers for hydrozoan Millepora species creates new opportunities for future research on processes driving the complexity of their colonisation success on many Indo-Pacific reefs.
The biological features and genetic diversity of novel fish rhabdovirus isolates in China.

PubMed

Fu, Xiaozhe; Lin, Qiang; Liang, Hongru; Liu, Lihui; Huang, Zhibin; Li, Ningqiu; Su, Jianguo

2017-09-01

The Rhabdoviridae is a diverse family of negative-sense single-stranded RNA viruses which infects mammals, birds, reptiles, fish, insects and plants. Herein, we reported the isolation and characterization of 6 novel viruses from diseased fish collected from China including SCRV-QY, SCRV-SS, SCRV-GM, CmRV-FS, MsRV-SS, OmbRV-JM. The typical clinical symptom of diseased fish was hemorrhaging. Efficient propagation of these isolates in a Chinese perch brain cell line was determined by means of observation of cytopathic effect, RT-PCR and electron microscopy. Sequence alignment and phylogenetic analysis of the complete G protein sequences revealed that these isolates were clustered into one monophyletic lineage belonging to the species Siniperca chuatsi rhabdovirus.
Active Site Characterization of Proteases Sequences from Different Species of Aspergillus.

PubMed

Morya, V K; Yadav, Virendra K; Yadav, Sangeeta; Yadav, Dinesh

2016-09-01

A total of 129 proteases sequences comprising 43 serine proteases, 36 aspartic proteases, 24 cysteine protease, 21 metalloproteases, and 05 neutral proteases from different Aspergillus species were analyzed for the catalytically active site residues using MEROPS database and various bioinformatics tools. Different proteases have predominance of variable active site residues. In case of 24 cysteine proteases of Aspergilli, the predominant active site residues observed were Gln193, Cys199, His364, Asn384 while for 43 serine proteases, the active site residues namely Asp164, His193, Asn284, Ser349 and Asp325, His357, Asn454, Ser519 were frequently observed. The analysis of 21 metalloproteases of Aspergilli revealed Glu298 and Glu388, Tyr476 as predominant active site residues. In general, Aspergilli species-specific active site residues were observed for different types of protease sequences analyzed. The phylogenetic analysis of these 129 proteases sequences revealed 14 different clans representing different types of proteases with diverse active site residues.
Evaluation of genetic diversity amongst Descurainia sophia L. genotypes by inter-simple sequence repeat (ISSR) marker.

PubMed

Saki, Sahar; Bagheri, Hedayat; Deljou, Ali; Zeinalabedini, Mehrshad

2016-01-01

Descurainia sophia is a valuable medicinal plant in family of Brassicaceae. To determine the range of diversity amongst D. sophia in Iran, 32 naturally distributed plants belonging to six natural populations of the Iranian plateau were investigated by inter-simple sequence repeat (ISSR) markers. The average percentage of polymorphism produced by 12 ISSR primers was 86 %. The PIC values for primers ranged from 0.22 to 0.40 and Rp values ranged between 6.5 and 19.9. The relative genetic diversity of the populations was not high (Gst =0.32). However, the value of gene flow revealed by the ISSR marker was high (Nm = 1.03). UPGMA clustering method based on Jaccard similarity coefficient grouped the genotypes into two major clusters. Graph results from Neighbor-Net Network generated after a 1000 bootstrap test using Jaccard coefficient, and STRUCTURE analysis confirmed the UPGMA clustering. The first three PCAs represented 57.31 % of the total variation. The high levels of genetic diversity were observed within populations, which is useful in breeding and conservation programs. ISSR is found to be an eligible marker to study genetic diversity of D. sophia.
New Arsenate Reductase Gene (arrA) PCR Primers for Diversity Assessment and Quantification in Environmental Samples

PubMed Central

Sorensen, Darwin L.; Dupont, R. Ryan

2016-01-01

ABSTRACT The extent of arsenic contamination in drinking water and its potential threat to human health have resulted in considerable research interest in the microbial species responsible for arsenic reduction. The arsenate reductase gene (arrA), an important component of the microbial arsenate reduction system, has been widely used as a biomarker to study arsenate-reducing microorganisms. A new primer pair was designed and evaluated for quantitative PCR (qPCR) and high-throughput sequencing of the arrA gene, because currently available PCR primers are not suitable for these applications. The primers were evaluated in silico and empirically tested for amplification of arrA genes in clones and for amplification and high-throughput sequencing of arrA genes from soil and groundwater samples. In silico, this primer pair matched (≥90% DNA identity) 86% of arrA gene sequences from GenBank. Empirical evaluation showed successful amplification of arrA gene clones of diverse phylogenetic groups, as well as amplification and high-throughput sequencing of independent soil and groundwater samples without preenrichment, suggesting that these primers are highly specific and can amplify a broad diversity of arrA genes. The arrA gene diversity from soil and groundwater samples from the Cache Valley Basin (CVB) in Utah was greater than anticipated. We observed a significant correlation between arrA gene abundance, quantified through qPCR, and reduced arsenic (AsIII) concentrations in the groundwater samples. Furthermore, we demonstrated that these primers can be useful for studying the diversity of arsenate-reducing microbial communities and the ways in which their relative abundance in groundwater may be associated with different groundwater quality parameters. IMPORTANCE Arsenic is a major drinking water contaminant that threatens the health of millions of people worldwide. The extent of arsenic contamination and its potential threat to human health have resulted in considerable interest in the study of microbial species responsible for the reduction of arsenic, i.e., the conversion of AsV to AsIII. In this study, we developed a new primer pair to evaluate the diversity and abundance of arsenate-reducing microorganisms in soil and groundwater samples from the CVB in Utah. We observed significant arrA gene diversity in the CVB soil and groundwater samples, and arrA gene abundance was significantly correlated with the reduced arsenic (AsIII) concentrations in the groundwater samples. We think that these primers are useful for studying the ecology of arsenate-reducing microorganisms in different environments. PMID:27913413
Assessment of species diversity and distribution of an ancient diatom lineage using a DNA metabarcoding approach.

PubMed

Nanjappa, Deepak; Audic, Stephane; Romac, Sarah; Kooistra, Wiebe H C F; Zingone, Adriana

2014-01-01

Continuous efforts to estimate actual diversity and to trace the species distribution and ranges in the natural environments have gone in equal pace with advancements of the technologies in the study of microbial species diversity from microscopic observations to DNA-based barcoding. DNA metabarcoding based on Next Generation Sequencing (NGS) constitutes the latest advancement in these efforts. Here we use NGS data from different sites to investigate the geographic range of six species of the diatom family Leptocylindraceae and to identify possible new taxa within the family. We analysed the V4 and V9 regions of the nuclear-encoded SSU rDNA gene region in the NGS database of the European ERA-Biodiversa project BioMarKs, collected in plankton and sediments at six coastal sites in European coastal waters, as well as environmental sequences from the NCBI database. All species known in the family Leptocylindraceae were detected in both datasets, but the much larger Illumina V9 dataset showed a higher species coverage at the various sites than the 454 V4 dataset. Sequences identical or similar to the references of Leptocylindrus aporus, L. convexus, L. danicus/hargravesii and Tenuicylindrus belgicus were found in the Mediterranean Sea, North Atlantic Ocean and Black Sea as well as at locations outside Europe. Instead, sequences identical or close to that of L. minimus were found in the North Atlantic Ocean and the Black Sea but not in the Mediterranean Sea, while sequences belonging to a yet undescribed taxon were encountered only in Oslo Fjord and Baffin Bay. Identification of Leptocylindraceae species in NGS datasets has expanded our knowledge of the species biogeographic distribution and of the overall diversity of this diatom family. Individual species appear to be widespread, but not all of them are found everywhere. Despite the sequencing depth allowed by NGS and the wide geographic area covered by this study, the diversity of this ancient diatom family appears to be low, at least at the level of the marker used in this study.
Assessment of Species Diversity and Distribution of an Ancient Diatom Lineage Using a DNA Metabarcoding Approach

PubMed Central

Nanjappa, Deepak; Audic, Stephane; Romac, Sarah; Kooistra, Wiebe H. C. F.; Zingone, Adriana

2014-01-01

Background Continuous efforts to estimate actual diversity and to trace the species distribution and ranges in the natural environments have gone in equal pace with advancements of the technologies in the study of microbial species diversity from microscopic observations to DNA-based barcoding. DNA metabarcoding based on Next Generation Sequencing (NGS) constitutes the latest advancement in these efforts. Here we use NGS data from different sites to investigate the geographic range of six species of the diatom family Leptocylindraceae and to identify possible new taxa within the family. Methodology/Principal Findings We analysed the V4 and V9 regions of the nuclear-encoded SSU rDNA gene region in the NGS database of the European ERA-Biodiversa project BioMarKs, collected in plankton and sediments at six coastal sites in European coastal waters, as well as environmental sequences from the NCBI database. All species known in the family Leptocylindraceae were detected in both datasets, but the much larger Illumina V9 dataset showed a higher species coverage at the various sites than the 454 V4 dataset. Sequences identical or similar to the references of Leptocylindrus aporus, L. convexus, L. danicus/hargravesii and Tenuicylindrus belgicus were found in the Mediterranean Sea, North Atlantic Ocean and Black Sea as well as at locations outside Europe. Instead, sequences identical or close to that of L. minimus were found in the North Atlantic Ocean and the Black Sea but not in the Mediterranean Sea, while sequences belonging to a yet undescribed taxon were encountered only in Oslo Fjord and Baffin Bay. Conclusions/Significance Identification of Leptocylindraceae species in NGS datasets has expanded our knowledge of the species biogeographic distribution and of the overall diversity of this diatom family. Individual species appear to be widespread, but not all of them are found everywhere. Despite the sequencing depth allowed by NGS and the wide geographic area covered by this study, the diversity of this ancient diatom family appears to be low, at least at the level of the marker used in this study. PMID:25133638

Using High-Throughput Sequencing to Leverage Surveillance of Genetic Diversity and Oseltamivir Resistance: A Pilot Study during the 2009 Influenza A(H1N1) Pandemic

PubMed Central

Téllez-Sosa, Juan; Rodríguez, Mario Henry; Gómez-Barreto, Rosa E.; Valdovinos-Torres, Humberto; Hidalgo, Ana Cecilia; Cruz-Hervert, Pablo; Luna, René Santos; Carrillo-Valenzo, Erik; Ramos, Celso; García-García, Lourdes; Martínez-Barnetche, Jesús

2013-01-01

Background Influenza viruses display a high mutation rate and complex evolutionary patterns. Next-generation sequencing (NGS) has been widely used for qualitative and semi-quantitative assessment of genetic diversity in complex biological samples. The “deep sequencing” approach, enabled by the enormous throughput of current NGS platforms, allows the identification of rare genetic viral variants in targeted genetic regions, but is usually limited to a small number of samples. Methodology and Principal Findings We designed a proof-of-principle study to test whether redistributing sequencing throughput from a high depth-small sample number towards a low depth-large sample number approach is feasible and contributes to influenza epidemiological surveillance. Using 454-Roche sequencing, we sequenced at a rather low depth, a 307 bp amplicon of the neuraminidase gene of the Influenza A(H1N1) pandemic (A(H1N1)pdm) virus from cDNA amplicons pooled in 48 barcoded libraries obtained from nasal swab samples of infected patients (n = 299) taken from May to November, 2009 pandemic period in Mexico. This approach revealed that during the transition from the first (May-July) to second wave (September-November) of the pandemic, the initial genetic variants were replaced by the N248D mutation in the NA gene, and enabled the establishment of temporal and geographic associations with genetic diversity and the identification of mutations associated with oseltamivir resistance. Conclusions NGS sequencing of a short amplicon from the NA gene at low sequencing depth allowed genetic screening of a large number of samples, providing insights to viral genetic diversity dynamics and the identification of genetic variants associated with oseltamivir resistance. Further research is needed to explain the observed replacement of the genetic variants seen during the second wave. As sequencing throughput rises and library multiplexing and automation improves, we foresee that the approach presented here can be scaled up for global genetic surveillance of influenza and other infectious diseases. PMID:23843978
The transmission dynamics and diversity of human metapneumovirus in Peru.

PubMed

Pollett, Simon; Trovão, Nidia S; Tan, Yi; Eden, John-Sebastian; Halpin, Rebecca A; Bera, Jayati; Das, Suman R; Wentworth, David; Ocaña, Victor; Mendocilla, Silvia M; Álvarez, Carlos; Calisto, Maria E; Garcia, Josefina; Halsey, Eric; Ampuero, Julia S; Nelson, Martha I; Leguia, Mariana

2017-12-29

The transmission dynamics of human metapneumovirus (HMPV) in tropical countries remain unclear. Further understanding of the genetic diversity of the virus could aid in HMPV vaccine design and improve our understanding of respiratory virus transmission dynamics in low- and middle-income countries. We examined the evolution of HMPV in Peru through phylogenetic analysis of 61 full genome HMPV sequences collected in three ecologically diverse regions of Peru (Lima, Piura, and Iquitos) during 2008-2012, comprising the largest data set of HMPV whole genomes sequenced from any tropical country to date. We revealed extensive genetic diversity generated by frequent viral introductions, with little evidence of local persistence. While considerable viral traffic between non-Peruvian countries and Peru was observed, HMPV epidemics in Peruvian locales were more frequently epidemiologically linked with other sites within Peru. We showed that Iquitos experienced greater HMPV traffic than the similar sized city of Piura by both Bayesian and maximum likelihood methods. There is extensive HMPV genetic diversity even within smaller and relatively less connected cities of Peru and this virus is spatially fluid. Greater diversity of HMPV in Iquitos compared to Piura may relate to higher volumes of human movement, including air traffic to this location. © 2017 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
Formyltetrahydrofolate Synthetase Gene Diversity in the Guts of Higher Termites with Different Diets and Lifestyles ▿ †

PubMed Central

Ottesen, Elizabeth A.; Leadbetter, Jared R.

2011-01-01

In this study, we examine gene diversity for formyl-tetrahydrofolate synthetase (FTHFS), a key enzyme in homoacetogenesis, recovered from the gut microbiota of six species of higher termites. The “higher” termites (family Termitidae), which represent the majority of extant termite species and genera, engage in a broader diversity of feeding and nesting styles than the “lower” termites. Previous studies of termite gut homoacetogenesis have focused on wood-feeding lower termites, from which the preponderance of FTHFS sequences recovered were related to those from acetogenic treponemes. While sequences belonging to this group were present in the guts of all six higher termites examined, treponeme-like FTHFS sequences represented the majority of recovered sequences in only two species (a wood-feeding Nasutitermes sp. and a palm-feeding Microcerotermes sp.). The remaining four termite species analyzed (a Gnathamitermes sp. and two Amitermes spp. that were recovered from subterranean nests with indeterminate feeding strategies and a litter-feeding Rhynchotermes sp.) yielded novel FTHFS clades not observed in lower termites. These termites yielded two distinct clusters of probable purinolytic Firmicutes and a large group of potential homoacetogens related to sequences previously recovered from the guts of omnivorous cockroaches. These findings suggest that the gut environments of different higher termite species may select for different groups of homoacetogens, with some species hosting treponeme-dominated homoacetogen populations similar to those of wood-feeding, lower termites while others host Firmicutes-dominated communities more similar to those of omnivorous cockroaches. PMID:21441328
Flooding greatly affects the diversity of arbuscular mycorrhizal fungi communities in the roots of wetland plants.

PubMed

Wang, Yutao; Huang, Yelin; Qiu, Qiu; Xin, Guorong; Yang, Zhongyi; Shi, Suhua

2011-01-01

The communities of arbuscular mycorrhizal fungi (AMF) colonizing the roots of three mangrove species were characterized along a tidal gradient in a mangrove swamp. A fragment, designated SSU-ITS-LSU, including part of the small subunit (SSU), the entire internal transcribed spacer (ITS) and part of the large subunit (LSU) of rDNA from samples of AMF-colonized roots was amplified, cloned and sequenced using AMF-specific primers. Similar levels of AMF diversity to those observed in terrestrial ecosystems were detected in the roots, indicating that the communities of AMF in wetland ecosystems are not necessarily low in diversity. In total, 761 Glomeromycota sequences were obtained, which grouped, according to phylogenetic analysis using the SSU-ITS-LSU fragment, into 23 phylotypes, 22 of which belonged to Glomeraceae and one to Acaulosporaceae. The results indicate that flooding plays an important role in AMF diversity, and its effects appear to depend on the degree (duration) of flooding. Both host species and tide level affected community structure of AMF, indicating the presence of habitat and host species preferences.
Flooding Greatly Affects the Diversity of Arbuscular Mycorrhizal Fungi Communities in the Roots of Wetland Plants

PubMed Central

Wang, Yutao; Huang, Yelin; Qiu, Qiu; Xin, Guorong; Yang, Zhongyi; Shi, Suhua

2011-01-01

The communities of arbuscular mycorrhizal fungi (AMF) colonizing the roots of three mangrove species were characterized along a tidal gradient in a mangrove swamp. A fragment, designated SSU-ITS-LSU, including part of the small subunit (SSU), the entire internal transcribed spacer (ITS) and part of the large subunit (LSU) of rDNA from samples of AMF-colonized roots was amplified, cloned and sequenced using AMF-specific primers. Similar levels of AMF diversity to those observed in terrestrial ecosystems were detected in the roots, indicating that the communities of AMF in wetland ecosystems are not necessarily low in diversity. In total, 761 Glomeromycota sequences were obtained, which grouped, according to phylogenetic analysis using the SSU-ITS-LSU fragment, into 23 phylotypes, 22 of which belonged to Glomeraceae and one to Acaulosporaceae. The results indicate that flooding plays an important role in AMF diversity, and its effects appear to depend on the degree (duration) of flooding. Both host species and tide level affected community structure of AMF, indicating the presence of habitat and host species preferences. PMID:21931734
Bacterial diversity at different stages of the composting process

PubMed Central

2010-01-01

Background Composting is an aerobic microbiological process that is facilitated by bacteria and fungi. Composting is also a method to produce fertilizer or soil conditioner. Tightened EU legislation now requires treatment of the continuously growing quantities of organic municipal waste before final disposal. However, some full-scale composting plants experience difficulties with the efficiency of biowaste degradation and with the emission of noxious odours. In this study we examine the bacterial species richness and community structure of an optimally working pilot-scale compost plant, as well as a full-scale composting plant experiencing typical problems. Bacterial species composition was determined by isolating total DNA followed by amplifying and sequencing the gene encoding the 16S ribosomal RNA. Results Over 1500 almost full-length 16S rRNA gene sequences were analysed and of these, over 500 were present only as singletons. Most of the sequences observed in either one or both of the composting processes studied here were similar to the bacterial species reported earlier in composts, including bacteria from the phyla Actinobacteria, Bacteroidetes, Firmicutes, Proteobacteria and Deinococcus-Thermus. In addition, a number of previously undetected bacterial phylotypes were observed. Statistical calculations estimated a total bacterial diversity of over 2000 different phylotypes in the studied composts. Conclusions Interestingly, locally enriched or evolved bacterial variants of familiar compost species were observed in both composts. A detailed comparison of the bacterial diversity revealed a large difference in composts at the species and strain level from the different composting plants. However, at the genus level, the difference was much smaller and illustrated a delay of the composting process in the full-scale, sub-optimally performing plants. PMID:20350306
Molecular evidence of hybridization in sympatric populations of the Enantia jethys complex (Lepidoptera: Pieridae).

PubMed

Jasso-Martínez, Jovana M; Machkour-M'Rabet, Salima; Vila, Roger; Rodríguez-Arnaiz, Rosario; Castañeda-Sortibrán, América Nitxin

2018-01-01

Hybridization events are frequently demonstrated in natural butterfly populations. One interesting butterfly complex species is the Enantia jethys complex that has been studied for over a century; many debates exist regarding the species composition of this complex. Currently, three species that live sympatrically in the Gulf slope of Mexico (Enantia jethys, E. mazai, and E. albania) are recognized in this complex (based on morphological and molecular studies). Where these species live in sympatry, some cases of interspecific mating have been observed, suggesting hybridization events. Considering this, we employed a multilocus approach (analyses of mitochondrial and nuclear sequences: COI, RpS5, and Wg; and nuclear dominant markers: inter-simple sequence repeat (ISSRs) to study hybridization in sympatric populations from Veracruz, Mexico. Genetic diversity parameters were determined for all molecular markers, and species identification was assessed by different methods such as analyses of molecular variance (AMOVA), clustering, principal coordinate analysis (PCoA), gene flow, and PhiPT parameters. ISSR molecular markers were used for a more profound study of hybridization process. Although species of the Enantia jethys complex have a low dispersal capacity, we observed high genetic diversity, probably reflecting a high density of individuals locally. ISSR markers provided evidence of a contemporary hybridization process, detecting a high number of hybrids (from 17% to 53%) with significant differences in genetic diversity. Furthermore, a directional pattern of hybridization was observed from E. albania to other species. Phylogenetic study through DNA sequencing confirmed the existence of three clades corresponding to the three species previously recognized by morphological and molecular studies. This study underlines the importance of assessing hybridization in evolutionary studies, by tracing the lineage separation process that leads to the origin of new species. Our research demonstrates that hybridization processes have a high occurrence in natural populations.
Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data.

PubMed

Liu, Yang; Chiaromonte, Francesca; Ross, Howard; Malhotra, Raunaq; Elleder, Daniel; Poss, Mary

2015-06-30

Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy. Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.
Leveraging genome-wide datasets to quantify the functional role of the anti-Shine-Dalgarno sequence in regulating translation efficiency.

PubMed

Hockenberry, Adam J; Pah, Adam R; Jewett, Michael C; Amaral, Luís A N

2017-01-01

Studies dating back to the 1970s established that sequence complementarity between the anti-Shine-Dalgarno (aSD) sequence on prokaryotic ribosomes and the 5' untranslated region of mRNAs helps to facilitate translation initiation. The optimal location of aSD sequence binding relative to the start codon, the full extents of the aSD sequence and the functional form of the relationship between aSD sequence complementarity and translation efficiency have not been fully resolved. Here, we investigate these relationships by leveraging the sequence diversity of endogenous genes and recently available genome-wide estimates of translation efficiency. We show that-after accounting for predicted mRNA structure-aSD sequence complementarity increases the translation of endogenous mRNAs by roughly 50%. Further, we observe that this relationship is nonlinear, with translation efficiency maximized for mRNAs with intermediate levels of aSD sequence complementarity. The mechanistic insights that we observe are highly robust: we find nearly identical results in multiple datasets spanning three distantly related bacteria. Further, we verify our main conclusions by re-analysing a controlled experimental dataset. © 2017 The Authors.
Approach to determine the diversity of Legionella species by nested PCR-DGGE in aquatic environments.

PubMed

Huang, Wen-Chien; Tsai, Hsin-Chi; Tao, Chi-Wei; Chen, Jung-Sheng; Shih, Yi-Jia; Kao, Po-Min; Huang, Tung-Yi; Hsu, Bing-Mu

2017-01-01

In this study, we describe a nested PCR-DGGE strategy to detect Legionella communities from river water samples. The nearly full-length 16S rRNA gene was amplified using bacterial primer in the first step. After, the amplicons were employed as DNA templates in the second PCR using Legionella specific primer. The third round of gene amplification was conducted to gain PCR fragments apposite for DGGE analysis. Then the total numbers of amplified genes were observed in DGGE bands of products gained with primers specific for the diversity of Legionella species. The DGGE patterns are thus potential for a high-throughput preliminary determination of aquatic environmental Legionella species before sequencing. Comparative DNA sequence analysis of excised DGGE unique band patterns showed the identity of the Legionella community members, including a reference profile with two pathogenic species of Legionella strains. In addition, only members of Legionella pneumophila and uncultured Legionella sp. were detected. Development of three step nested PCR-DGGE tactic is seen as a useful method for studying the diversity of Legionella community. The method is rapid and provided sequence information for phylogenetic analysis.
Approach to determine the diversity of Legionella species by nested PCR-DGGE in aquatic environments

PubMed Central

Huang, Wen-Chien; Tsai, Hsin-Chi; Tao, Chi-Wei; Chen, Jung-Sheng; Shih, Yi-Jia; Kao, Po-Min; Huang, Tung-Yi; Hsu, Bing-Mu

2017-01-01

In this study, we describe a nested PCR-DGGE strategy to detect Legionella communities from river water samples. The nearly full-length 16S rRNA gene was amplified using bacterial primer in the first step. After, the amplicons were employed as DNA templates in the second PCR using Legionella specific primer. The third round of gene amplification was conducted to gain PCR fragments apposite for DGGE analysis. Then the total numbers of amplified genes were observed in DGGE bands of products gained with primers specific for the diversity of Legionella species. The DGGE patterns are thus potential for a high-throughput preliminary determination of aquatic environmental Legionella species before sequencing. Comparative DNA sequence analysis of excised DGGE unique band patterns showed the identity of the Legionella community members, including a reference profile with two pathogenic species of Legionella strains. In addition, only members of Legionella pneumophila and uncultured Legionella sp. were detected. Development of three step nested PCR-DGGE tactic is seen as a useful method for studying the diversity of Legionella community. The method is rapid and provided sequence information for phylogenetic analysis. PMID:28166249
Bacterial diversity along a 2600 km river continuum

PubMed Central

Savio, Domenico; Sinclair, Lucas; Ijaz, Umer Z.; Parajka, Juraj; Reischer, Georg H.; Stadler, Philipp; Blaschke, Alfred P.; Blöschl, Günter; Mach, Robert L.; Kirschner, Alexander K. T.; Farnleitner, Andreas H.

2015-01-01

Summary The bacterioplankton diversity in large rivers has thus far been under‐sampled despite the importance of streams and rivers as components of continental landscapes. Here, we present a comprehensive dataset detailing the bacterioplankton diversity along the midstream of the Danube River and its tributaries. Using 16S rRNA‐gene amplicon sequencing, our analysis revealed that bacterial richness and evenness gradually declined downriver in both the free‐living and particle‐associated bacterial communities. These shifts were also supported by beta diversity analysis, where the effects of tributaries were negligible in regards to the overall variation. In addition, the river was largely dominated by bacteria that are commonly observed in freshwaters. Dominated by the acI lineage, the freshwater SAR11 (LD12) and the P olynucleobacter group, typical freshwater taxa increased in proportion downriver and were accompanied by a decrease in soil and groundwater‐affiliated bacteria. Based on views of the meta‐community and River Continuum Concept, we interpret the observed taxonomic patterns and accompanying changes in alpha and beta diversity with the intention of laying the foundation for a unified concept for river bacterioplankton diversity. PMID:25922985
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

PubMed Central

Huang, Yongjie; Mrázek, Jan

2014-01-01

Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
A Preliminary Study of Viral Metagenomics of French Bat Species in Contact with Humans: Identification of New Mammalian Viruses

PubMed Central

Dacheux, Laurent; Cervantes-Gonzalez, Minerva; Guigon, Ghislaine; Thiberge, Jean-Michel; Vandenbogaert, Mathias; Maufrais, Corinne

2014-01-01

The prediction of viral zoonosis epidemics has become a major public health issue. A profound understanding of the viral population in key animal species acting as reservoirs represents an important step towards this goal. Bats harbor diverse viruses, some of which are of particular interest because they cause severe human diseases. However, little is known about the diversity of the global population of viruses found in bats (virome). We determined the viral diversity of five different French insectivorous bat species (nine specimens in total) in close contact with humans. Sequence-independent amplification, high-throughput sequencing with Illumina technology and a dedicated bioinformatics analysis pipeline were used on pooled tissues (brain, liver and lungs). Comparisons of the sequences of contigs and unassembled reads provided a global taxonomic distribution of virus-related sequences for each sample, highlighting differences both within and between bat species. Many viral families were present in these viromes, including viruses known to infect bacteria, plants/fungi, insects or vertebrates, the most relevant being those infecting mammals (Retroviridae, Herpesviridae, Bunyaviridae, Poxviridae, Flaviviridae, Reoviridae, Bornaviridae, Picobirnaviridae). In particular, we detected several new mammalian viruses, including rotaviruses, gammaretroviruses, bornaviruses and bunyaviruses with the identification of the first bat nairovirus. These observations demonstrate that bats naturally harbor viruses from many different families, most of which infect mammals. They may therefore constitute a major reservoir of viral diversity that should be analyzed carefully, to determine the role played by bats in the spread of zoonotic viral infections. PMID:24489870
Body Site Is a More Determinant Factor than Human Population Diversity in the Healthy Skin Microbiome

PubMed Central

Perez Perez, Guillermo I.; Gao, Zhan; Jourdain, Roland; Ramirez, Julia; Gany, Francesca; Clavaud, Cecile; Demaude, Julien

2016-01-01

We studied skin microbiota present in three skin sites (forearm, axilla, scalp) in men from six ethnic groups living in New York City. Methods. Samples were obtained at baseline and after four days following use of neutral soap and stopping regular hygiene products, including shampoos and deodorants. DNA was extracted using the MoBio Power Lyzer kit and 16S rRNA gene sequences determined on the IIlumina MiSeq platform, using QIIME for analysis. Results. Our analysis confirmed skin swabbing as a useful method for sampling different areas of the skin because DNA concentrations and number of sequences obtained across subject libraries were similar. We confirmed that skin location was the main factor determining the composition of bacterial communities. Alpha diversity, expressed as number of species observed, was greater in arm than on scalp or axilla in all studied groups. We observed an unexpected increase in α-diversity on arm, with similar tendency on scalp, in the South Asian group after subjects stopped using their regular shampoos and deodorants. Significant differences at phylum and genus levels were observed between subjects of the different ethnic origins at all skin sites. Conclusions. We conclude that ethnicity and particular soap and shampoo practices are secondary factors compared to the ecological zone of the human body in determining cutaneous microbiota composition. PMID:27088867
Mutation signatures of carcinogen exposure: genome-wide detection and new opportunities for cancer prevention

PubMed Central

2014-01-01

Exposure to environmental mutagens is an important cause of human cancer, and measures to reduce mutagenic and carcinogenic exposures have been highly successful at controlling cancer. Until recently, it has been possible to connect the chemical characteristics of mutagens to actual mutations observed in human tumors only indirectly. Now, next-generation sequencing technology enables us to observe in detail the DNA-sequence-level effects of well-known mutagens, such as ultraviolet radiation and tobacco smoke, as well as endogenous mutagenic processes, such as those involving activated DNA cytidine deaminases (APOBECs). We can also observe the effects of less well-known but potent mutagens, including those recently found to be present in some herbal remedies. Crucially, we can now tease apart the superimposed effects of several mutational exposures and processes and determine which ones occurred during the development of individual tumors. Here, we review advances in detecting these mutation signatures and discuss the implications for surveillance and prevention of cancer. The number of sequenced tumors from diverse cancer types and multiple geographic regions is growing explosively, and the genomes of these tumors will bear the signatures of even more diverse mutagenic exposures. Thus, we envision development of wide-ranging compendia of mutation signatures from tumors and a concerted effort to experimentally elucidate the signatures of a large number of mutagens. This information will be used to link signatures observed in tumors to the exposures responsible for them, which will offer unprecedented opportunities for prevention. PMID:25031618
Post-main-sequence planetary system evolution

PubMed Central

Veras, Dimitri

2016-01-01

The fates of planetary systems provide unassailable insights into their formation and represent rich cross-disciplinary dynamical laboratories. Mounting observations of post-main-sequence planetary systems necessitate a complementary level of theoretical scrutiny. Here, I review the diverse dynamical processes which affect planets, asteroids, comets and pebbles as their parent stars evolve into giant branch, white dwarf and neutron stars. This reference provides a foundation for the interpretation and modelling of currently known systems and upcoming discoveries. PMID:26998326
Lactobacillus buchneri genotyping on the basis of clustered regularly interspaced short palindromic repeat (CRISPR) locus diversity.

PubMed

Briner, Alexandra E; Barrangou, Rodolphe

2014-02-01

Clustered regularly interspaced short palindromic repeats (CRISPR) in combination with associated sequences (cas) constitute the CRISPR-Cas immune system, which uptakes DNA from invasive genetic elements as novel "spacers" that provide a genetic record of immunization events. We investigated the potential of CRISPR-based genotyping of Lactobacillus buchneri, a species relevant for commercial silage, bioethanol, and vegetable fermentations. Upon investigating the occurrence and diversity of CRISPR-Cas systems in Lactobacillus buchneri genomes, we observed a ubiquitous occurrence of CRISPR arrays containing a 36-nucleotide (nt) type II-A CRISPR locus adjacent to four cas genes, including the universal cas1 and cas2 genes and the type II signature gene cas9. Comparative analysis of CRISPR spacer content in 26 L. buchneri pickle fermentation isolates associated with spoilage revealed 10 unique locus genotypes that contained between 9 and 29 variable spacers. We observed a set of conserved spacers at the ancestral end, reflecting a common origin, as well as leader-end polymorphisms, reflecting recent divergence. Some of these spacers showed perfect identity with phage sequences, and many spacers showed homology to Lactobacillus plasmid sequences. Following a comparative analysis of sequences immediately flanking protospacers that matched CRISPR spacers, we identified a novel putative protospacer-adjacent motif (PAM), 5'-AAAA-3'. Overall, these findings suggest that type II-A CRISPR-Cas systems are valuable for genotyping of L. buchneri.
Modelling the Dust Around Vega-Like Stars

NASA Technical Reports Server (NTRS)

Sylvester, Roger J.; Skinner, C. J.; Barlow, M. J.

1996-01-01

Models are presented of four Vega-like stars: main-sequence stars with infrared emission from circumstellar dust. The dusty environments of the four stars are rather diverse, as shown by their spectral energy distributions. Good fits to the observations were obtained for all four stars.
Comparative microbial diversity analyses of modern marine thrombolitic mats by barcoded pyrosequencing.

PubMed

Mobberley, Jennifer M; Ortega, Maya C; Foster, Jamie S

2012-01-01

Thrombolites are unlaminated carbonate structures that form as a result of the metabolic interactions of complex microbial mat communities. Thrombolites have a long geological history; however, little is known regarding the microbes associated with modern structures. In this study, we use a barcoded 16S rRNA gene-pyrosequencing approach coupled with morphological analysis to assess the bacterial, cyanobacterial and archaeal diversity associated with actively forming thrombolites found in Highborne Cay, Bahamas. Analyses revealed four distinct microbial mat communities referred to as black, beige, pink and button mats on the surfaces of the thrombolites. At a coarse phylogenetic resolution, the domain bacterial sequence libraries from the four mats were similar, with Proteobacteria and Cyanobacteria being the most abundant. At the finer resolution of the rRNA gene sequences, significant differences in community structure were observed, with dramatically different cyanobacterial communities. Of the four mat types, the button mats contained the highest diversity of Cyanobacteria, and were dominated by two sequence clusters with high similarity to the genus Dichothrix, an organism associated with the deposition of carbonate. Archaeal diversity was low, but varied in all mat types, and the archaeal community was predominately composed of members of the Thaumarchaeota and Euryarchaeota. The morphological and genetic data support the hypothesis that the four mat types are distinctive thrombolitic mat communities. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.

Characterisation of culture-independent and -dependent microbial communities in a high-temperature offshore chalk petroleum reservoir.

PubMed

Kaster, Krista M; Bonaunet, Kristin; Berland, Harald; Kjeilen-Eilertsen, Grethe; Brakstad, Odd Gunnar

2009-11-01

Recent studies have indicated that oil reservoirs harbour diverse microbial communities. Culture-dependent and culture-independent methods were used to evaluate the microbial diversity in produced water samples of the Ekofisk oil field, a high temperature, and fractured chalk reservoir in the North Sea. DGGE analyses of 16S rRNA gene fragments were used to assess the microbial diversity of both archaeal and bacterial communities in produced water samples and enrichment cultures from 4 different wells (B-08, X-08, X-18 and X-25). Low diversity communities were found when 16S rDNA libraries of bacterial and archaeal assemblages were generated from total community DNA obtained from produced water samples and enrichment cultures. Sequence analysis of the clones indicated close matches to microbes associated with high-temperature oil reservoirs or other similar environments. Sequences were found to be similar to members of the genera Thermotoga, Caminicella, Thermoanaerobacter, Archaeoglobus, Thermococcus, and Methanobulbus. Enrichment cultures obtained from the produced water samples were dominated by sheathed rods. Sequence analyses of the cultures indicated predominance of the genera Petrotoga, Arcobacter, Archaeoglobus and Thermococcus. The communities of both produced water and enrichment cultures appeared to be dominated by thermophilic fermenters capable of reducing sulphur compounds. These results suggest that the biochemical processes in the Ekofisk chalk reservoir are similar to those observed in high-temperature sandstone reservoirs.
Microbial Communities on Seafloor Basalts at Dorado Outcrop Reflect Level of Alteration and Highlight Global Lithic Clades

PubMed Central

Lee, Michael D.; Walworth, Nathan G.; Sylvan, Jason B.; Edwards, Katrina J.; Orcutt, Beth N.

2015-01-01

Areas of exposed basalt along mid-ocean ridges and at seafloor outcrops serve as conduits of fluid flux into and out of a subsurface ocean, and microbe–mineral interactions can influence alteration reactions at the rock–water interface. Located on the eastern flank of the East Pacific Rise, Dorado Outcrop is a site of low-temperature (<20°C) hydrothermal venting and represents a new end-member in the current survey of seafloor basalt biomes. Consistent with prior studies, a survey of 16S rRNA gene sequence diversity using universal primers targeting the V4 hypervariable region revealed much greater richness and diversity on the seafloor rocks than in surrounding seawater. Overall, Gamma-, Alpha-, and Deltaproteobacteria, and Thaumarchaeota dominated the sequenced communities, together making up over half of the observed diversity, though bacterial sequences were more abundant than archaeal in all samples. The most abundant bacterial reads were closely related to the obligate chemolithoautotrophic, sulfur-oxidizing Thioprofundum lithotrophicum, suggesting carbon and sulfur cycling as dominant metabolic pathways in this system. Representatives of Thaumarchaeota were detected in relatively high abundance on the basalts in comparison to bottom water, possibly indicating ammonia oxidation. In comparison to other sequence datasets from globally distributed seafloor basalts, this study reveals many overlapping and cosmopolitan phylogenetic groups and also suggests that substrate age correlates with community structure. PMID:26779122
Novel, diverse RNA viruses from Mediterranean isolates of the phytopathogenic fungus, Rosellinia necatrix: insights into evolutionary biology of fungal viruses.

PubMed

Arjona-Lopez, Juan Manuel; Telengech, Paul; Jamal, Atif; Hisano, Sakae; Kondo, Hideki; Yelin, Mery Dafny; Arjona-Girona, Isabel; Kanematsu, Satoko; Lopez-Herrera, Carlos José; Suzuki, Nobuhiro

2018-04-01

To reveal mycovirus diversity, we conducted a search of as-yet-unexplored Mediterranean isolates of the phytopathogenic ascomycete Rosellinia necatrix for virus infections. Of seventy-nine, eleven fungal isolates tested RNA virus-positive, with many showing coinfections, indicating a virus incidence of 14%, which is slightly lower than that (approximately 20%) previously reported for extensive surveys of over 1000 Japanese R. necatrix isolates. All viral sequences were fully or partially characterized by Sanger and next-generation sequencing. These sequences appear to represent isolates of various new species spanning at least 6 established or previously proposed families such as Partiti-, Hypo-, Megabirna-, Yado-kari-, Fusagra- and Fusarividae, as well as a newly proposed family, Megatotiviridae. This observation greatly expands the diversity of R. necatrix viruses, because no hypo-, fusagra- or megatotiviruses were previously reported from R. necatrix. The sequence analyses showed a rare horizontal gene transfer event of the 2A-like protease domain between a dsRNA (phlegivirus) and a positive-sense, single-stranded RNA virus (hypovirus). Moreover, many of the newly detected viruses showed the closest relation to viruses reported from fungi other than R. necatrix, such as Fusarium spp., which are sympatric to R. necatrix. These combined results imply horizontal virus transfer between these soil-inhabitant fungi. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.
Genetic and antigenic diversity of Theileria parva in cattle in Eastern and Southern zones of Tanzania. A study to support control of East Coast fever.

PubMed

Elisa, Mwega; Hasan, Salih Dia; Moses, Njahira; Elpidius, Rukambile; Skilton, Robert; Gwakisa, Paul

2015-04-01

This study investigated the genetic and antigenic diversity of Theileria parva in cattle from the Eastern and Southern zones of Tanzania. Thirty-nine (62%) positive samples were genotyped using 14 mini- and microsatellite markers with coverage of all four T. parva chromosomes. Wright's F index (F(ST) = 0 × 094) indicated a high level of panmixis. Linkage equilibrium was observed in the two zones studied, suggesting existence of a panmyctic population. In addition, sequence analysis of CD8+ T-cell target antigen genes Tp1 revealed a single protein sequence in all samples analysed, which is also present in the T. parva Muguga strain, which is a component of the FAO1 vaccine. All Tp2 epitope sequences were identical to those in the T. parva Muguga strain, except for one variant of a Tp2 epitope, which is found in T. parva Kiambu 5 strain, also a component the FAO1 vaccine. Neighbour joining tree of the nucleotide sequences of Tp2 showed clustering according to geographical origin. Our results show low genetic and antigenic diversity of T. parva within the populations analysed. This has very important implications for the development of sustainable control measures for T. parva in Eastern and Southern zones of Tanzania, where East Coast fever is endemic.
Construction of nested genetic core collections to optimize the exploitation of natural diversity in Vitis vinifera L. subsp. sativa

PubMed Central

Le Cunff, Loïc; Fournier-Level, Alexandre; Laucou, Valérie; Vezzulli, Silvia; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Boursiquot, Jean-Michel; This, Patrice

2008-01-01

Background The first high quality draft of the grape genome sequence has just been published. This is a critical step in accessing all the genes of this species and increases the chances of exploiting the natural genetic diversity through association genetics. However, our basic knowledge of the extent of allelic variation within the species is still not sufficient. Towards this goal, we constructed nested genetic core collections (G-cores) to capture the simple sequence repeat (SSR) diversity of the grape cultivated compartment (Vitis vinifera L. subsp. sativa) from the world's largest germplasm collection (Domaine de Vassal, INRA Hérault, France), containing 2262 unique genotypes. Results Sub-samples of 12, 24, 48 and 92 varieties of V. vinifera L. were selected based on their genotypes for 20 SSR markers using the M-strategy. They represent respectively 58%, 73%, 83% and 100% of total SSR diversity. The capture of allelic diversity was analyzed by sequencing three genes scattered throughout the genome on 233 individuals: 41 single nucleotide polymorphisms (SNPs) were identified using the G-92 core (one SNP for every 49 nucleotides) while only 25 were observed using a larger sample of 141 individuals selected on the basis of 50 morphological traits, thus demonstrating the reliability of the approach. Conclusion The G-12 and G-24 core-collections displayed respectively 78% and 88% of the SNPs respectively, and are therefore of great interest for SNP discovery studies. Furthermore, the nested genetic core collections satisfactorily reflected the geographic and the genetic diversity of grape, which are also of great interest for the study of gene evolution in this species. PMID:18384667
Cultivation Versus Molecular Analysis of Banana (Musa sp.) Shoot-Tip Tissue Reveals Enormous Diversity of Normally Uncultivable Endophytic Bacteria.

PubMed

Thomas, Pious; Sekhar, Aparna Chandra

2017-05-01

The interior of plants constitutes a unique environment for microorganisms with various organisms inhabiting as endophytes. Unlike subterranean plant parts, aboveground parts are relatively less explored for endophytic microbial diversity. We employed a combination of cultivation and molecular approaches to study the endophytic bacterial diversity in banana shoot-tips. Cultivable bacteria from 20 sucker shoot-tips of cv. Grand Naine included 37 strains under 16 genera and three phyla (Proteobacteria, Actinobacteria, Firmicutes). 16S rRNA gene-ribotyping approach on 799f and 1492r PCR-amplicons to avoid plant organelle sequences was ineffective showing limited bacterial diversity. 16S rRNA metagene profiling targeting the V3-V4 hypervariable region after filtering out the chloroplast (74.2 %), mitochondrial (22.9 %), and unknown sequences (1.1 %) revealed enormous bacterial diversity. Proteobacteria formed the predominant phylum (64 %) succeeded by Firmicutes (12.1 %), Actinobacteria (9.5 %), Bacteroidetes (6.4 %), Planctomycetes, Cyanobacteria, and minor shares (<1 %) of 14 phyla including several candidate phyla besides the domain Euryarchaeota (0.2 %). Microbiome analysis of single shoot-tips through 16S rRNA V3 region profiling showed similar taxonomic richness and diversity and was less affected by plant sequence interferences. DNA extraction kit ominously influenced the phylogenetic diversity. The study has revealed vast diversity of normally uncultivable endophytic bacteria prevailing in banana shoot-tips (20 phyla, 46 classes) with about 2.6 % of the deciphered 269 genera and 1.5 % of the 656 observed species from the same source of shoot-tips attained through cultivation. The predominant genera included several agriculturally important bacteria. The study reveals an immense ecosystem of endophytic bacteria in banana shoot tissues endorsing the earlier documentation of intracellular "Cytobacts" and "Peribacts" with possible roles in plant holobiome and hologenome.
Microbial diversity and component variation in Xiaguan Tuo Tea during pile fermentation

PubMed Central

Li, Min; Yang, Xinrui; Gui, Xin; Chen, Guofeng; Chu, Jiuyun; He, Xingwang; Wang, Weitao; Han, Feng

2018-01-01

Xiaguan Tuo Tea is largely consumed by the Chinese, but there is little research into the microbial diversity and component changes during the fermentation of this tea. In this study, we first used fluorescence in situ hybridization (FISH), next-generation sequencing (NGS) and chemical analysis methods to determine the microbial abundance and diversity and the chemical composition during fermentation. The FISH results showed that the total number of microorganisms ranges from 2.3×102 to 4.0×108 cells per gram of sample during fermentation and is mainly dominated by fungi. In the early fermentation stages, molds are dominant (0.6×102~2.8×106 cells/g, 0~35 d). However, in the late stages of fermentation, yeasts are dominant (3.6×104~9.6×106 cells/g, 35~56 d). The bacteria have little effect during the fermentation of tea (102~103 cells/g, <1% of fungus values). Of these fungi, A. niger (Aspergillus niger) and B. adeninivorans (Blastobotrys adeninivorans) are identified as the two most common strains, based on Next-generation Sequencing (NGS) analysis. Peak diversity in tea was observed at day 35 of fermentation (Shannon–Weaver index: 1.195857), and lower diversity was observed on days 6 and 56 of fermentation (Shannon–Weaver index 0.860589 and 1.119106, respectively). During the microbial fermentation, compared to the unfermented tea, the tea polyphenol content decreased by 54%, and the caffeine content increased by 59%. Theanine and free amino acid contents were reduced during fermentation by 81.1 and 92.85%, respectively. PMID:29462204
Microbial diversity and component variation in Xiaguan Tuo Tea during pile fermentation.

PubMed

Li, Haizhou; Li, Min; Yang, Xinrui; Gui, Xin; Chen, Guofeng; Chu, Jiuyun; He, Xingwang; Wang, Weitao; Han, Feng; Li, Ping

2018-01-01

Xiaguan Tuo Tea is largely consumed by the Chinese, but there is little research into the microbial diversity and component changes during the fermentation of this tea. In this study, we first used fluorescence in situ hybridization (FISH), next-generation sequencing (NGS) and chemical analysis methods to determine the microbial abundance and diversity and the chemical composition during fermentation. The FISH results showed that the total number of microorganisms ranges from 2.3×102 to 4.0×108 cells per gram of sample during fermentation and is mainly dominated by fungi. In the early fermentation stages, molds are dominant (0.6×102~2.8×106 cells/g, 0~35 d). However, in the late stages of fermentation, yeasts are dominant (3.6×104~9.6×106 cells/g, 35~56 d). The bacteria have little effect during the fermentation of tea (102~103 cells/g, <1% of fungus values). Of these fungi, A. niger (Aspergillus niger) and B. adeninivorans (Blastobotrys adeninivorans) are identified as the two most common strains, based on Next-generation Sequencing (NGS) analysis. Peak diversity in tea was observed at day 35 of fermentation (Shannon-Weaver index: 1.195857), and lower diversity was observed on days 6 and 56 of fermentation (Shannon-Weaver index 0.860589 and 1.119106, respectively). During the microbial fermentation, compared to the unfermented tea, the tea polyphenol content decreased by 54%, and the caffeine content increased by 59%. Theanine and free amino acid contents were reduced during fermentation by 81.1 and 92.85%, respectively.
Interfaces of Malignant and Immunologic Clonal Dynamics in Ovarian Cancer.

PubMed

Zhang, Allen W; McPherson, Andrew; Milne, Katy; Kroeger, David R; Hamilton, Phineas T; Miranda, Alex; Funnell, Tyler; Little, Nicole; de Souza, Camila P E; Laan, Sonya; LeDoux, Stacey; Cochrane, Dawn R; Lim, Jamie L P; Yang, Winnie; Roth, Andrew; Smith, Maia A; Ho, Julie; Tse, Kane; Zeng, Thomas; Shlafman, Inna; Mayo, Michael R; Moore, Richard; Failmezger, Henrik; Heindl, Andreas; Wang, Yi Kan; Bashashati, Ali; Grewal, Diljot S; Brown, Scott D; Lai, Daniel; Wan, Adrian N C; Nielsen, Cydney B; Huebner, Curtis; Tessier-Cloutier, Basile; Anglesio, Michael S; Bouchard-Côté, Alexandre; Yuan, Yinyin; Wasserman, Wyeth W; Gilks, C Blake; Karnezis, Anthony N; Aparicio, Samuel; McAlpine, Jessica N; Huntsman, David G; Holt, Robert A; Nelson, Brad H; Shah, Sohrab P

2018-05-07

High-grade serous ovarian cancer (HGSC) exhibits extensive malignant clonal diversity with widespread but non-random patterns of disease dissemination. We investigated whether local immune microenvironment factors shape tumor progression properties at the interface of tumor-infiltrating lymphocytes (TILs) and cancer cells. Through multi-region study of 212 samples from 38 patients with whole-genome sequencing, immunohistochemistry, histologic image analysis, gene expression profiling, and T and B cell receptor sequencing, we identified three immunologic subtypes across samples and extensive within-patient diversity. Epithelial CD8+ TILs negatively associated with malignant diversity, reflecting immunological pruning of tumor clones inferred by neoantigen depletion, HLA I loss of heterozygosity, and spatial tracking between T cell and tumor clones. In addition, combinatorial prognostic effects of mutational processes and immune properties were observed, illuminating how specific genomic aberration types associate with immune response and impact survival. We conclude that within-patient spatial immune microenvironment variation shapes intraperitoneal malignant spread, provoking new evolutionary perspectives on HGSC clonal dispersion. Copyright © 2018 Elsevier Inc. All rights reserved.
Genetic Diversity of HIV-1 in Tunisia.

PubMed

El Moussi, Awatef; Thomson, Michael M; Delgado, Elena; Cuevas, María Teresa; Nasr, Majda; Abid, Salma; Ben Hadj Kacem, Mohamed Ali; Benaissa Tiouiri, Hanene; Letaief, Amel; Chakroun, Mohamed; Ben Jemaa, Mounir; Hamdouni, Hayet; Tej Dellagi, Rafla; Kheireddine, Khaled; Boutiba, Ilhem; Pérez-Álvarez, Lucía; Slim, Amine

2017-01-01

In this study, the genetic diversity of HIV-1 in Tunisia was analyzed. For this, 193 samples were collected in different regions of Tunisia between 2012 and 2015. A protease and reverse transcriptase fragment were amplified and sequenced. Phylogenetic analyses were performed through maximum likelihood and recombination was analyzed by bootscanning. Six HIV-1 subtypes (B, A1, G, D, C, and F2), 5 circulating recombinant forms (CRF02_AG, CRF25_cpx, CRF43_02G, CRF06_cpx, and CRF19_cpx), and 11 unique recombinant forms were identified. Subtype B (46.4%) and CRF02_AG (39.4%) were the predominant genetic forms. A group of 44 CRF02_AG sequences formed a distinct Tunisian cluster, which also included four viruses from western Europe. Nine viruses were closely related to isolates collected in other African or in European countries. In conclusion, a high HIV-1 genetic diversity is observed in Tunisia and the local spread of CRF02_AG is first documented in this country.
Partial bisulfite conversion for unique template sequencing.

PubMed

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

2018-01-25

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Necessary Sequencing Depth and Clustering Method to Obtain Relatively Stable Diversity Patterns in Studying Fish Gut Microbiota.

PubMed

Xiao, Fanshu; Yu, Yuhe; Li, Jinjin; Juneau, Philippe; Yan, Qingyun

2018-05-25

The 16S rRNA gene is one of the most commonly used molecular markers for estimating bacterial diversity during the past decades. However, there is no consistency about the sequencing depth (from thousand to millions of sequences per sample), and the clustering methods used to generate OTUs may also be different among studies. These inconsistent premises make effective comparisons among studies difficult or unreliable. This study aims to examine the necessary sequencing depth and clustering method that would be needed to ensure a stable diversity patterns for studying fish gut microbiota. A total number of 42 samples dataset of Siniperca chuatsi (carnivorous fish) gut microbiota were used to test how the sequencing depth and clustering may affect the alpha and beta diversity patterns of fish intestinal microbiota. Interestingly, we found that the sequencing depth (resampling 1000-11,000 per sample) and the clustering methods (UPARSE and UCLUST) did not bias the estimates of the diversity patterns during the fish development from larva to adult. Although we should acknowledge that a suitable sequencing depth may differ case by case, our finding indicates that a shallow sequencing such as 1000 sequences per sample may be also enough to reflect the general diversity patterns of fish gut microbiota. However, we have shown in the present study that strict pre-processing of the original sequences is required to ensure reliable results. This study provides evidences to help making a strong scientific choice of the sequencing depth and clustering method for future studies on fish gut microbiota patterns, but at the same time reducing as much as possible the costs related to the analysis.
Comparative genomic sequence analysis of novel Helicoverpa armigera nucleopolyhedrovirus (NPV) isolated from Kenya and three other previously sequenced Helicoverpa spp. NPVs.

PubMed

Ogembo, Javier Gordon; Caoili, Barbara L; Shikata, Masamitsu; Chaeychomsri, Sudawan; Kobayashi, Michihiro; Ikeda, Motoko

2009-10-01

A newly cloned Helicoverpa armigera nucleopolyhedrovirus (HearNPV) from Kenya, HearNPV-NNg1, has a higher insecticidal activity than HearNPV-G4, which also exhibits lower insecticidal activity than HearNPV-C1. In the search for genes and/or nucleotide sequences that might be involved in the observed virulence differences among Helicoverpa spp. NPVs, the entire genome of NNg1 was sequenced and compared with previously sequenced genomes of G4, C1 and Helicoverpa zea single-nucleocapsid NPV (Hz). The NNg1 genome was 132,425 bp in length, with a total of 143 putative open reading frames (ORFs), and shared high levels of overall amino acid and nucleotide sequence identities with G4, C1 and Hz. Three NNg1 ORFs, ORF5, ORF100 and ORF124, which were shared with C1, were absent in G4 and Hz, while NNg1 and C1 were missing a homologue of G4/Hz ORF5. Another three ORFs, ORF60 (bro-b), ORF119 and ORF120, and one direct repeat sequence (dr) were unique to NNg1. Relative to the overall nucleotide sequence identity, lower sequence identities were observed between NNg1 hrs and the homologous hrs in the other three Helicoverpa spp. NPVs, despite containing the same number of hrs located at essentially the same positions on the genomes. Differences were also observed between NNg1 and each of the other three Helicoverpa spp. NPVs in the diversity of bro genes encoded on the genomes. These results indicate several putative genes and nucleotide sequences that may be responsible for the virulence differences observed among Helicoverpa spp., yet the specific genes and/or nucleotide sequences responsible have not been identified.
Molecular Epidemiology of Oyster-Related Human Noroviruses and Their Global Genetic Diversity and Temporal-Geographical Distribution from 1983 to 2014

PubMed Central

Yu, Yongxin; Cai, Hui; Hu, Linghao; Lei, Rongwei; Pan, Yingjie; Yan, Shuling

2015-01-01

Noroviruses (NoVs) are a leading cause of epidemic and sporadic cases of acute gastroenteritis worldwide. Oysters are well recognized as the main vectors of environmentally transmitted NoVs, and disease outbreaks linked to oyster consumption have been commonly observed. Here, to quantify the genetic diversity, temporal distribution, and circulation of oyster-related NoVs on a global scale, 1,077 oyster-related NoV sequences deposited from 1983 to 2014 were downloaded from both NCBI GenBank and the NoroNet outbreak database and were then screened for quality control. A total of 665 sequences with reliable information were obtained and were subsequently subjected to genotyping and phylogenetic analyses. The results indicated that the majority of oyster-related NoV sequences were obtained from coastal countries and regions and that the numbers of sequences in these regions were unevenly distributed. Moreover, >80% of human NoV genotypes were detected in oyster samples or oyster-related outbreaks. A higher proportion of genogroup I (GI) (34%) was observed for oyster-related sequences than for non-oyster-related outbreaks, where GII strains dominated with an overwhelming majority of >90%, indicating that the prevalences of GI and GII are different in humans and oysters. In addition, a related convergence of the circulation trend was found between oyster-related NoV sequences and human pandemic outbreaks. This suggests that oysters not only act as a vector of NoV through environmental transmission but also serve as an important reservoir of human NoVs. These results highlight the importance of oysters in the persistence and transmission of human NoVs in the environment and have important implications for the surveillance of human NoVs in oyster samples. PMID:26319869
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.

PubMed

MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M

2015-09-14

The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Survey of corticioid fungi in North American pinaceous forests reveals hyperdiversity, underpopulated sequence databases, and species that are potentially ectomycorrhizal.

PubMed

Rosenthal, Lisa M; Larsson, Karl-Henrik; Branco, Sara; Chung, Judy A; Glassman, Sydney I; Liao, Hui-Ling; Peay, Kabir G; Smith, Dylan P; Talbot, Jennifer M; Taylor, John W; Vellinga, Else C; Vilgalys, Rytas; Bruns, Thomas D

2017-01-01

The corticioid fungi are commonly encountered, highly diverse, ecologically important, and understudied. We collected specimens in 60 pine and spruce forests across North America to survey corticioid fungal frequency and distribution and to compile an internal transcribed spacer (ITS) database for the group. Sanger sequences from the ITS region of vouchered specimens were compared with sequences on GenBank and UNITE, and with high-throughput sequence data from soil and roots taken at the same sites. Out of 425 high-quality Sanger sequences from vouchered specimens, we recovered 223 distinct operational taxonomic units (OTUs), the majority of which could not be assigned to species by matching to the BLAST database. Corticioid fungi were found to be hyperdiverse, as supported by the observations that nearly two-thirds of our OTUs were represented by single collections and species estimator curves showed steep slopes with no plateaus. We estimate that 14.8-24.7% of our voucher-based OTUs are likely to be ectomycorrhizal (EM). Corticioid fungi recovered from the soil formed a different community assemblage, with EM taxa accounting for 40.5-58.6% of OTUs. We compared basidioma sequences with EM root tips from our data, GenBank, or UNITE, and with this approach, we reiterate existing speculations that Trechispora stellulata is EM. We found that corticioid fungi have a significant distance-decay pattern, adding to the literature supporting fungi as having geographically structured communities. This study provides a first view of the diversity of this important group across North American pine forests, but much of the biology and taxonomy of these diverse, important, and widespread fungi remains unknown.
Open-Source Sequence Clustering Methods Improve the State Of the Art.

PubMed

Kopylova, Evguenia; Navas-Molina, Jose A; Mercier, Céline; Xu, Zhenjiang Zech; Mahé, Frédéric; He, Yan; Zhou, Hong-Wei; Rognes, Torbjørn; Caporaso, J Gregory; Knight, Rob

2016-01-01

Sequence clustering is a common early step in amplicon-based microbial community analysis, when raw sequencing reads are clustered into operational taxonomic units (OTUs) to reduce the run time of subsequent analysis steps. Here, we evaluated the performance of recently released state-of-the-art open-source clustering software products, namely, OTUCLUST, Swarm, SUMACLUST, and SortMeRNA, against current principal options (UCLUST and USEARCH) in QIIME, hierarchical clustering methods in mothur, and USEARCH's most recent clustering algorithm, UPARSE. All the latest open-source tools showed promising results, reporting up to 60% fewer spurious OTUs than UCLUST, indicating that the underlying clustering algorithm can vastly reduce the number of these derived OTUs. Furthermore, we observed that stringent quality filtering, such as is done in UPARSE, can cause a significant underestimation of species abundance and diversity, leading to incorrect biological results. Swarm, SUMACLUST, and SortMeRNA have been included in the QIIME 1.9.0 release. IMPORTANCE Massive collections of next-generation sequencing data call for fast, accurate, and easily accessible bioinformatics algorithms to perform sequence clustering. A comprehensive benchmark is presented, including open-source tools and the popular USEARCH suite. Simulated, mock, and environmental communities were used to analyze sensitivity, selectivity, species diversity (alpha and beta), and taxonomic composition. The results demonstrate that recent clustering algorithms can significantly improve accuracy and preserve estimated diversity without the application of aggressive filtering. Moreover, these tools are all open source, apply multiple levels of multithreading, and scale to the demands of modern next-generation sequencing data, which is essential for the analysis of massive multidisciplinary studies such as the Earth Microbiome Project (EMP) (J. A. Gilbert, J. K. Jansson, and R. Knight, BMC Biol 12:69, 2014, http://dx.doi.org/10.1186/s12915-014-0069-1).
Extensive variation at MHC DRB in the New Zealand sea lion (Phocarctos hookeri) provides evidence for balancing selection

PubMed Central

Osborne, A J; Zavodna, M; Chilvers, B L; Robertson, B C; Negro, S S; Kennedy, M A; Gemmell, N J

2013-01-01

Marine mammals are often reported to possess reduced variation of major histocompatibility complex (MHC) genes compared with their terrestrial counterparts. We evaluated diversity at two MHC class II B genes, DQB and DRB, in the New Zealand sea lion (Phocarctos hookeri, NZSL) a species that has suffered high mortality owing to bacterial epizootics, using Sanger sequencing and haplotype reconstruction, together with next-generation sequencing. Despite this species' prolonged history of small population size and highly restricted distribution, we demonstrate extensive diversity at MHC DRB with 26 alleles, whereas MHC DQB is dimorphic. We identify four DRB codons, predicted to be involved in antigen binding, that are evolving under adaptive evolution. Our data suggest diversity at DRB may be maintained by balancing selection, consistent with the role of this locus as an antigen-binding region and the species' recent history of mass mortality during a series of bacterial epizootics. Phylogenetic analyses of DQB and DRB sequences from pinnipeds and other carnivores revealed significant allelic diversity, but little phylogenetic depth or structure among pinniped alleles; thus, we could neither confirm nor refute the possibility of trans-species polymorphism in this group. The phylogenetic pattern observed however, suggests some significant evolutionary constraint on these loci in the recent past, with the pattern consistent with that expected following an epizootic event. These data may help further elucidate some of the genetic factors underlying the unusually high susceptibility to bacterial infection of the threatened NZSL, and help us to better understand the extent and pattern of MHC diversity in pinnipeds. PMID:23572124
Diversity and stratification of archaea in a hypersaline microbial mat.

PubMed

Robertson, Charles E; Spear, John R; Harris, J Kirk; Pace, Norman R

2009-04-01

The Guerrero Negro (GN) hypersaline microbial mats have become one focus for biogeochemical studies of stratified ecosystems. The GN mats are found beneath several of a series of ponds of increasing salinity that make up a solar saltern fed from Pacific Ocean water pumped from the Laguna Ojo de Liebre near GN, Baja California Sur, Mexico. Molecular surveys of the laminated photosynthetic microbial mat below the fourth pond in the series identified an enormous diversity of bacteria in the mat, but archaea have received little attention. To determine the bulk contribution of archaeal phylotypes to the pond 4 study site, we determined the phylogenetic distribution of archaeal rRNA gene sequences in PCR libraries based on nominally universal primers. The ratios of bacterial/archaeal/eukaryotic rRNA genes, 90%/9%/1%, suggest that the archaeal contribution to the metabolic activities of the mat may be significant. To explore the distribution of archaea in the mat, sequences derived using archaeon-specific PCR primers were surveyed in 10 strata of the 6-cm-thick mat. The diversity of archaea overall was substantial albeit less than the diversity observed previously for bacteria. Archaeal diversity, mainly euryarchaeotes, was highest in the uppermost 2 to 3 mm of the mat and decreased rapidly with depth, where crenarchaeotes dominated. Only 3% of the sequences were specifically related to known organisms including methanogens. While some mat archaeal clades corresponded with known chemical gradients, others did not, which is likely explained by heretofore-unrecognized gradients. Some clades did not segregate by depth in the mat, indicating broad metabolic repertoires, undersampling, or both.
High bacterial diversity of biological soil crusts in water tracks over permafrost in the high arctic polar desert.

PubMed

Steven, Blaire; Lionard, Marie; Kuske, Cheryl R; Vincent, Warwick F

2013-01-01

In this study we report the bacterial diversity of biological soil crusts (biocrusts) inhabiting polar desert soils at the northern land limit of the Arctic polar region (83° 05 N). Employing pyrosequencing of bacterial 16S rRNA genes this study demonstrated that these biocrusts harbor diverse bacterial communities, often as diverse as temperate latitude communities. The effect of wetting pulses on the composition of communities was also determined by collecting samples from soils outside and inside of permafrost water tracks, hill slope flow paths that drain permafrost-affected soils. The intermittent flow regime in the water tracks was correlated with altered relative abundance of phylum level taxonomic bins in the bacterial communities, but the alterations varied between individual sampling sites. Bacteria related to the Cyanobacteria and Acidobacteria demonstrated shifts in relative abundance based on their location either inside or outside of the water tracks. Among cyanobacterial sequences, the proportion of sequences belonging to the family Oscillatoriales consistently increased in relative abundance in the samples from inside the water tracks compared to those outside. Acidobacteria showed responses to wetting pulses in the water tracks, increasing in abundance at one site and decreasing at the other two sites. Subdivision 4 acidobacterial sequences tended to follow the trends in the total Acidobacteria relative abundance, suggesting these organisms were largely responsible for the changes observed in the Acidobacteria. Taken together, these data suggest that the bacterial communities of these high latitude polar biocrusts are diverse but do not show a consensus response to intermittent flow in water tracks over high Arctic permafrost.

Reproducibility and quantitation of amplicon sequencing-based detection

PubMed Central

Zhou, Jizhong; Wu, Liyou; Deng, Ye; Zhi, Xiaoyang; Jiang, Yi-Huei; Tu, Qichao; Xie, Jianping; Van Nostrand, Joy D; He, Zhili; Yang, Yunfeng

2011-01-01

To determine the reproducibility and quantitation of the amplicon sequencing-based detection approach for analyzing microbial community structure, a total of 24 microbial communities from a long-term global change experimental site were examined. Genomic DNA obtained from each community was used to amplify 16S rRNA genes with two or three barcode tags as technical replicates in the presence of a small quantity (0.1% wt/wt) of genomic DNA from Shewanella oneidensis MR-1 as the control. The technical reproducibility of the amplicon sequencing-based detection approach is quite low, with an average operational taxonomic unit (OTU) overlap of 17.2%±2.3% between two technical replicates, and 8.2%±2.3% among three technical replicates, which is most likely due to problems associated with random sampling processes. Such variations in technical replicates could have substantial effects on estimating β-diversity but less on α-diversity. A high variation was also observed in the control across different samples (for example, 66.7-fold for the forward primer), suggesting that the amplicon sequencing-based detection approach could not be quantitative. In addition, various strategies were examined to improve the comparability of amplicon sequencing data, such as increasing biological replicates, and removing singleton sequences and less-representative OTUs across biological replicates. Finally, as expected, various statistical analyses with preprocessed experimental data revealed clear differences in the composition and structure of microbial communities between warming and non-warming, or between clipping and non-clipping. Taken together, these results suggest that amplicon sequencing-based detection is useful in analyzing microbial community structure even though it is not reproducible and quantitative. However, great caution should be taken in experimental design and data interpretation when the amplicon sequencing-based detection approach is used for quantitative analysis of the β-diversity of microbial communities. PMID:21346791
A diverse family of serine proteinase genes expressed in cotton boll weevil (Anthonomus grandis): implications for the design of pest-resistant transgenic cotton plants.

PubMed

Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F

2004-09-01

Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
Prevalence of the F-type lectin domain.

PubMed

Bishnoi, Ritika; Khatri, Indu; Subramanian, Srikrishna; Ramya, T N C

2015-08-01

F-type lectins are fucolectins with characteristic fucose and calcium-binding sequence motifs and a unique lectin fold (the "F-type" fold). F-type lectins are phylogenetically widespread with selective distribution. Several eukaryotic F-type lectins have been biochemically and structurally characterized, and the F-type lectin domain (FLD) has also been studied in the bacterial proteins, Streptococcus mitis lectinolysin and Streptococcus pneumoniae SP2159. However, there is little knowledge about the extent of occurrence of FLDs and their domain organization, especially, in bacteria. We have now mined the extensive genomic sequence information available in the public databases with sensitive sequence search techniques in order to exhaustively survey prokaryotic and eukaryotic FLDs. We report 437 FLD sequence clusters (clustered at 80% sequence identity) from eukaryotic, eubacterial and viral proteins. Domain architectures are diverse but mostly conserved in closely related organisms, and domain organizations of bacterial FLD-containing proteins are very different from their eukaryotic counterparts, suggesting unique specialization of FLDs to suit different requirements. Several atypical phylogenetic associations hint at lateral transfer. Among eukaryotes, we observe an expansion of FLDs in terms of occurrence and domain organization diversity in the taxa Mollusca, Hemichordata and Branchiostomi, perhaps coinciding with greater emphasis on innate immune strategies in these organisms. The naturally occurring FLDs with diverse domain organizations that we have identified here will be useful for future studies aimed at creating designer molecular platforms for directing desired biological activities to fucosylated glycoconjugates in target niches. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Deep sequencing of amplified Prasinovirus and host green algal genes from an Indian Ocean transect reveals interacting trophic dependencies and new genotypes.

PubMed

Clerissi, Camille; Desdevises, Yves; Romac, Sarah; Audic, Stéphane; de Vargas, Colomban; Acinas, Silvia G; Casotti, Raffaella; Poulain, Julie; Wincker, Patrick; Hingamp, Pascal; Ogata, Hiroyuki; Grimsley, Nigel

2015-12-01

High-throughput sequencing of Prasinovirus DNA polymerase and host green algal (Mamiellophyceae) ribosomal RNA genes was used to analyse the diversity and distribution of these taxa over a ∼10 000 km latitudinal section of the Indian Ocean. New viral and host groups were identified among the different trophic conditions observed, and highlighted that although unknown prasinoviruses are diverse, the cosmopolitan algal genera Bathycoccus, Micromonas and Ostreococcus represent a large proportion of the host diversity. While Prasinovirus communities were correlated to both the geography and the environment, host communities were not, perhaps because the genetic marker used lacked sufficient resolution. Nevertheless, analysis of single environmental variables showed that eutrophic conditions strongly influence the distributions of both hosts and viruses. Moreover, these communities were not correlated, in their composition or specific richness. These observations could result from antagonistic dynamics, such as that illustrated in a prey-predator model, and/or because hosts might be under a complex set of selective pressures. Both of these reasons must be considered to interpret environmental surveys of viruses and hosts, because covariation does not always imply interaction. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Microbial community analysis of the hypersaline water of the Dead Sea using high-throughput amplicon sequencing.

PubMed

Jacob, Jacob H; Hussein, Emad I; Shakhatreh, Muhamad Ali K; Cornelison, Christopher T

2017-10-01

Amplicon sequencing using next-generation technology (bTEFAP ® ) has been utilized in describing the diversity of Dead Sea microbiota. The investigated area is a well-known salt lake in the western part of Jordan found in the lowest geographical location in the world (more than 420 m below sea level) and characterized by extreme salinity (approximately, 34%) in addition to other extreme conditions (low pH, unique ionic composition different from sea water). DNA was extracted from Dead Sea water. A total of 314,310 small subunit RNA (SSU rRNA) sequences were parsed, and 288,452 sequences were then clustered. For alpha diversity analysis, sample was rarefied to 3,000 sequences. The Shannon-Wiener index curve plot reached a plateau at approximately 3,000 sequences indicating that sequencing depth was sufficient to capture the full scope of microbial diversity. Archaea was found to be dominating the sequences (52%), whereas Bacteria constitute 45% of the sequences. Altogether, prokaryotic sequences (which constitute 97% of all sequences) were found to predominate. The findings expand on previous studies by using high-throughput amplicon sequencing to describe the microbial community in an environment which in recent years has been shown to hide some interesting diversity. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
The Tara Oceans voyage reveals global diversity and distribution patterns of marine planktonic ciliates

PubMed Central

Gimmler, Anna; Korn, Ralf; de Vargas, Colomban; Audic, Stéphane; Stoeck, Thorsten

2016-01-01

Illumina reads of the SSU-rDNA-V9 region obtained from the circumglobal Tara Oceans expedition allow the investigation of protistan plankton diversity patterns on a global scale. We analyzed 6,137,350 V9-amplicons from ocean surface waters and the deep chlorophyll maximum, which were taxonomically assigned to the phylum Ciliophora. For open ocean samples global planktonic ciliate diversity is relatively low (ca. 1,300 observed and predicted ciliate OTUs). We found that 17% of all detected ciliate OTUs occurred in all oceanic regions under study. On average, local ciliate OTU richness represented 27% of the global ciliate OTU richness, indicating that a large proportion of ciliates is widely distributed. Yet, more than half of these OTUs shared <90% sequence similarity with reference sequences of described ciliates. While alpha-diversity measures (richness and exp(Shannon H)) are hardly affected by contemporary environmental conditions, species (OTU) turnover and community similarity (β-diversity) across taxonomic groups showed strong correlation to environmental parameters. Logistic regression models predicted significant correlations between the occurrence of specific ciliate genera and individual nutrients, the oceanic carbonate system and temperature. Planktonic ciliates displayed distinct vertical distributions relative to chlorophyll a. In contrast, the Tara Oceans dataset did not reveal any evidence that latitude is structuring ciliate communities. PMID:27633177
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing

Treesearch

Alana Alexander; Debbie Steel; Beth Slikas; Kendra Hoekzema; Colm Carraher; Matthew Parks; Richard Cronn; C. Scott Baker

2012-01-01

Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20...
Universal Sequence Replication, Reversible Polymerization and Early Functional Biopolymers: A Model for the Initiation of Prebiotic Sequence Evolution

PubMed Central

Walker, Sara Imari; Grover, Martha A.; Hud, Nicholas V.

2012-01-01

Many models for the origin of life have focused on understanding how evolution can drive the refinement of a preexisting enzyme, such as the evolution of efficient replicase activity. Here we present a model for what was, arguably, an even earlier stage of chemical evolution, when polymer sequence diversity was generated and sustained before, and during, the onset of functional selection. The model includes regular environmental cycles (e.g. hydration-dehydration cycles) that drive polymers between times of replication and functional activity, which coincide with times of different monomer and polymer diffusivity. Template-directed replication of informational polymers, which takes place during the dehydration stage of each cycle, is considered to be sequence-independent. New sequences are generated by spontaneous polymer formation, and all sequences compete for a finite monomer resource that is recycled via reversible polymerization. Kinetic Monte Carlo simulations demonstrate that this proposed prebiotic scenario provides a robust mechanism for the exploration of sequence space. Introduction of a polymer sequence with monomer synthetase activity illustrates that functional sequences can become established in a preexisting pool of otherwise non-functional sequences. Functional selection does not dominate system dynamics and sequence diversity remains high, permitting the emergence and spread of more than one functional sequence. It is also observed that polymers spontaneously form clusters in simulations where polymers diffuse more slowly than monomers, a feature that is reminiscent of a previous proposal that the earliest stages of life could have been defined by the collective evolution of a system-wide cooperation of polymer aggregates. Overall, the results presented demonstrate the merits of considering plausible prebiotic polymer chemistries and environments that would have allowed for the rapid turnover of monomer resources and for regularly varying monomer/polymer diffusivities. PMID:22493682
Novel lineages of Prochlorococcus and Synechococcus in the global oceans.

PubMed

Huang, Sijun; Wilhelm, Steven W; Harvey, H Rodger; Taylor, Karen; Jiao, Nianzhi; Chen, Feng

2012-02-01

Picocyanobacteria represented by Prochlorococcus and Synechococcus have an important role in oceanic carbon fixation and nutrient cycling. In this study, we compared the community composition of picocyanobacteria from diverse marine ecosystems ranging from estuary to open oceans, tropical to polar oceans and surface to deep water, based on the sequences of 16S-23S rRNA internal transcribed spacer (ITS). A total of 1339 ITS sequences recovered from 20 samples unveiled diverse and several previously unknown clades of Prochlorococcus and Synechococcus. Six high-light (HL)-adapted Prochlorococcus clades were identified, among which clade HLVI had not been described previously. Prochlorococcus clades HLIII, HLIV and HLV, detected in the Equatorial Pacific samples, could be related to the HNLC clades recently found in the high-nutrient, low-chlorophyll (HNLC), iron-depleted tropical oceans. At least four novel Synechococcus clades (out of six clades in total) in subcluster 5.3 were found in subtropical open oceans and the South China Sea. A niche partitioning with depth was observed in the Synechococcus subcluster 5.3. Members of Synechococcus subcluster 5.2 were dominant in the high-latitude waters (northern Bering Sea and Chukchi Sea), suggesting a possible cold-adaptation of some marine Synechococcus in this subcluster. A distinct shift of the picocyanobacterial community was observed from the Bering Sea to the Chukchi Sea, which reflected the change of water temperature. Our study demonstrates that oceanic systems contain a large pool of diverse picocyanobacteria, and further suggest that new genotypes or ecotypes of picocyanobacteria will continue to emerge, as microbial consortia are explored with advanced sequencing technology.
Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.

PubMed

Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin

2015-06-01

Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.
Exploring fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing

NASA Astrophysics Data System (ADS)

Zhang, Xiao-Yong; Wang, Guang-Hua; Xu, Xin-Ya; Nong, Xu-Hua; Wang, Jie; Amin, Muhammad; Qi, Shu-Hua

2016-10-01

The present study investigated the fungal diversity in four different deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing of the nuclear ribosomal internal transcribed spacer-1 (ITS1). A total of 40,297 fungal ITS1 sequences clustered into 420 operational taxonomic units (OTUs) with 97% sequence similarity and 170 taxa were recovered from these sediments. Most ITS1 sequences (78%) belonged to the phylum Ascomycota, followed by Basidiomycota (17.3%), Zygomycota (1.5%) and Chytridiomycota (0.8%), and a small proportion (2.4%) belonged to unassigned fungal phyla. Compared with previous studies on fungal diversity of sediments from deep-sea environments by culture-dependent approach and clone library analysis, the present result suggested that Illumina sequencing had been dramatically accelerating the discovery of fungal community of deep-sea sediments. Furthermore, our results revealed that Sordariomycetes was the most diverse and abundant fungal class in this study, challenging the traditional view that the diversity of Sordariomycetes phylotypes was low in the deep-sea environments. In addition, more than 12 taxa accounted for 21.5% sequences were found to be rarely reported as deep-sea fungi, suggesting the deep-sea sediments from Okinawa Trough harbored a plethora of different fungal communities compared with other deep-sea environments. To our knowledge, this study is the first exploration of the fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing.
Substrates of Peltigera Lichens as a Potential Source of Cyanobionts.

PubMed

Zúñiga, Catalina; Leiva, Diego; Carú, Margarita; Orlando, Julieta

2017-10-01

Photobiont availability is one of the main factors determining the success of the lichenization process. Although multiple sources of photobionts have been proposed, there is no substantial evidence confirming that the substrates on which lichens grow are one of them. In this work, we obtained cyanobacterial 16S ribosomal RNA gene sequences from the substrates underlying 186 terricolous Peltigera cyanolichens from localities in Southern Chile and maritime Antarctica and compared them with the sequences of the cyanobionts of these lichens, in order to determine if cyanobacteria potentially available for lichenization were present in the substrates. A phylogenetic analysis of the sequences showed that Nostoc phylotypes dominated the cyanobacterial communities of the substrates in all sites. Among them, an overlap was observed between the phylotypes of the lichen cyanobionts and those of the cyanobacteria present in their substrates, suggesting that they could be a possible source of lichen photobionts. Also, in most cases, higher Nostoc diversity was observed in the lichens than in the substrates from each site. A better understanding of cyanobacterial diversity in lichen substrates and their relatives in the lichens would bring insights into mycobiont selection and the distribution patterns of lichens, providing a background for hypothesis testing and theory development for future studies of the lichenization process.
High Diversity of Myocyanophage in Various Aquatic Environments Revealed by High-Throughput Sequencing of Major Capsid Protein Gene With a New Set of Primers.

PubMed

Hou, Weiguo; Wang, Shang; Briggs, Brandon R; Li, Gaoyuan; Xie, Wei; Dong, Hailiang

2018-01-01

Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
High Diversity of Myocyanophage in Various Aquatic Environments Revealed by High-Throughput Sequencing of Major Capsid Protein Gene With a New Set of Primers

PubMed Central

Hou, Weiguo; Wang, Shang; Briggs, Brandon R.; Li, Gaoyuan; Xie, Wei; Dong, Hailiang

2018-01-01

Myocyanophages, a group of viruses infecting cyanobacteria, are abundant and play important roles in elemental cycling. Here we investigated the particle-associated viral communities retained on 0.2 μm filters and in sediment samples (representing ancient cyanophage communities) from four ocean and three lake locations, using high-throughput sequencing and a newly designed primer pair targeting a gene fragment (∼145-bp in length) encoding the cyanophage gp23 major capsid protein (MCP). Diverse viral communities were detected in all samples. The fragments of 142-, 145-, and 148-bp in length were most abundant in the amplicons, and most sequences (>92%) belonged to cyanophages. Additionally, different sequencing depths resulted in different diversity estimates of the viral community. Operational taxonomic units obtained from deep sequencing of the MCP gene covered the majority of those obtained from shallow sequencing, suggesting that deep sequencing exhibited a more complete picture of cyanophage community than shallow sequencing. Our results also revealed a wide geographic distribution of marine myocyanophages, i.e., higher dissimilarities of the myocyanophage communities corresponded with the larger distances between the sampling sites. Collectively, this study suggests that the newly designed primer pair can be effectively used to study the community and diversity of myocyanophage from different environments, and the high-throughput sequencing represents a good method to understand viral diversity.
Evolution of the arginase fold and functional diversity

PubMed Central

Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.

2009-01-01

The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Information-Theoretic Uncertainty of SCFG-Modeled Folding Space of The Non-coding RNA

PubMed Central

Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I.; Malmberg, Russell L.

2012-01-01

RNA secondary structure ensembles define probability distributions for alternative equilibrium secondary structures of an RNA sequence. Shannon’s Entropy is a measure for the amount of diversity present in any ensemble. In this work, Shannon’s entropy of the SCFG ensemble on an RNA sequence is derived and implemented in polynomial time for both structurally ambiguous and unambiguous grammars. Micro RNA sequences generally have low folding entropy, as previously discovered. Surprisingly, signs of significantly high folding entropy were observed in certain ncRNA families. More effective models coupled with targeted randomization tests can lead to a better insight into folding features of these families. PMID:23160142
High protists diversity in the plankton of sulfurous lakes and lagoons examined by 18s rRNA gene sequence analyses.

PubMed

Triadó-Margarit, Xavier; Casamayor, Emilio O

2015-12-01

Diversity of small protists was studied in sulfidic and anoxic (euxinic) stratified karstic lakes and coastal lagoons by 18S rRNA gene analyses. We hypothesized a major sulfide effect, reducing protist diversity and richness with only a few specialized populations adapted to deal with low-redox conditions and high-sulfide concentrations. However, genetic fingerprinting suggested similar ecological diversity in anoxic and sulfurous than in upper oxygen rich water compartments with specific populations inhabiting euxinic waters. Many of them agreed with genera previously identified by microscopic observations, but also new and unexpected groups were detected. Most of the sequences matched a rich assemblage of Ciliophora (i.e., Coleps, Prorodon, Plagiopyla, Strombidium, Metopus, Vorticella and Caenomorpha, among others) and algae (mainly Cryptomonadales). Unidentified Cercozoa, Fungi, Stramenopiles and Discoba were recurrently found. The lack of GenBank counterparts was higher in deep hypolimnetic waters and appeared differentially allocated in the different taxa, being higher within Discoba and lower in Cryptophyceae. A larger number of populations than expected were specifically detected in the deep sulfurous waters, with unknown ecological interactions and metabolic capabilities. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
The Epigenomic Landscape of Prokaryotes

DOE PAGES

Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.; ...

2016-02-12

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities ofmore » 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.« less
The Epigenomic Landscape of Prokaryotes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities ofmore » 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.« less
Genetic Diversity and Molecular Evolution of Chinese Waxy Maize Germplasm

PubMed Central

Zheng, Hongjian; Wang, Hui; Yang, Hua; Wu, Jinhong; Shi, Biao; Cai, Run; Xu, Yunbi; Wu, Aizhong; Luo, Lijun

2013-01-01

Waxy maize (Zea mays L. var. certaina Kulesh), with many excellent characters in terms of starch composition and economic value, has grown in China for a long history and its production has increased dramatically in recent decades. However, the evolution and origin of waxy maize still remains unclear. We studied the genetic diversity of Chinese waxy maize including typical landraces and inbred lines by SSR analysis and the results showed a wide genetic diversity in the Chinese waxy maize germplasm. We analyzed the origin and evolution of waxy maize by sequencing 108 samples, and downloading 52 sequences from GenBank for the waxy locus in a number of accessions from genus Zea. A sharp reduction of nucleotide diversity and significant neutrality tests (Tajima’s D and Fu and Li’s F*) were observed at the waxy locus in Chinese waxy maize but not in nonglutinous maize. Phylogenetic analysis indicated that Chinese waxy maize originated from the cultivated flint maize and most of the modern waxy maize inbred lines showed a distinct independent origin and evolution process compared with the germplasm from Southwest China. The results indicated that an agronomic trait can be quickly improved to meet production demand by selection. PMID:23818949

Mitochondrial DNA markers reveal high genetic diversity but low genetic differentiation in the black fly Simulium tani Takaoka & Davies along an elevational gradient in Malaysia.

PubMed

Low, Van Lun; Adler, Peter H; Takaoka, Hiroyuki; Ya'cob, Zubaidah; Lim, Phaik Eem; Tan, Tiong Kai; Lim, Yvonne A L; Chen, Chee Dhang; Norma-Rashid, Yusoff; Sofian-Azirun, Mohd

2014-01-01

The population genetic structure of Simulium tani was inferred from mitochondria-encoded sequences of cytochrome c oxidase subunits I (COI) and II (COII) along an elevational gradient in Cameron Highlands, Malaysia. A statistical parsimony network of 71 individuals revealed 71 haplotypes in the COI gene and 43 haplotypes in the COII gene; the concatenated sequences of the COI and COII genes revealed 71 haplotypes. High levels of genetic diversity but low levels of genetic differentiation were observed among populations of S. tani at five elevations. The degree of genetic diversity, however, was not in accordance with an altitudinal gradient, and a Mantel test indicated that elevation did not have a limiting effect on gene flow. No ancestral haplotype of S. tani was found among the populations. Pupae with unique structural characters at the highest elevation showed a tendency to form their own haplotype cluster, as revealed by the COII gene. Tajima's D, Fu's Fs, and mismatch distribution tests revealed population expansion of S. tani in Cameron Highlands. A strong correlation was found between nucleotide diversity and the levels of dissolved oxygen in the streams where S. tani was collected.
Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement

DOE Office of Scientific and Technical Information (OSTI.GOV)

Le Coq, Johanne; Ghosh, Partho

2012-06-19

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein,more » TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd ({approx}16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10{sup 20} potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.« less
Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement

PubMed Central

Le Coq, Johanne; Ghosh, Partho

2011-01-01

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 1020 potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation. PMID:21873231
Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement.

PubMed

Le Coq, Johanne; Ghosh, Partho

2011-08-30

Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd (∼16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10(20) potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.
Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population

PubMed Central

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C. Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B.; Nauck, Markus; Kaminski, Wolfgang E.

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its “a” determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the “a” determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of “a” determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated. PMID:28472040
Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population.

PubMed

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-Suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B; Nauck, Markus; Kaminski, Wolfgang E

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its "a" determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the "a" determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of "a" determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated.
Actinobacterial Diversity in Volcanic Caves and Associated Geomicrobiological Interactions

PubMed Central

Riquelme, Cristina; Marshall Hathaway, Jennifer J.; Enes Dapkevicius, Maria de L. N.; Miller, Ana Z.; Kooser, Ara; Northup, Diana E.; Jurado, Valme; Fernandez, Octavio; Saiz-Jimenez, Cesareo; Cheeptham, Naowarat

2015-01-01

Volcanic caves are filled with colorful microbial mats on the walls and ceilings. These volcanic caves are found worldwide, and studies are finding vast bacteria diversity within these caves. One group of bacteria that can be abundant in volcanic caves, as well as other caves, is Actinobacteria. As Actinobacteria are valued for their ability to produce a variety of secondary metabolites, rare and novel Actinobacteria are being sought in underexplored environments. The abundance of novel Actinobacteria in volcanic caves makes this environment an excellent location to study these bacteria. Scanning electron microscopy (SEM) from several volcanic caves worldwide revealed diversity in the morphologies present. Spores, coccoid, and filamentous cells, many with hair-like or knobby extensions, were some of the microbial structures observed within the microbial mat samples. In addition, the SEM study pointed out that these features figure prominently in both constructive and destructive mineral processes. To further investigate this diversity, we conducted both Sanger sequencing and 454 pyrosequencing of the Actinobacteria in volcanic caves from four locations, two islands in the Azores, Portugal, and Hawai'i and New Mexico, USA. This comparison represents one of the largest sequencing efforts of Actinobacteria in volcanic caves to date. The diversity was shown to be dominated by Actinomycetales, but also included several newly described orders, such as Euzebyales, and Gaiellales. Sixty-two percent of the clones from the four locations shared less than 97% similarity to known sequences, and nearly 71% of the clones were singletons, supporting the commonly held belief that volcanic caves are an untapped resource for novel and rare Actinobacteria. The amplicon libraries depicted a wider view of the microbial diversity in Azorean volcanic caves revealing three additional orders, Rubrobacterales, Solirubrobacterales, and Coriobacteriales. Studies of microbial ecology in volcanic caves are still very limited. To rectify this deficiency, the results from our study help fill in the gaps in our knowledge of actinobacterial diversity and their potential roles in the volcanic cave ecosystems. PMID:26696966
High-Resolution Microbial Community Succession of Microbially Induced Concrete Corrosion in Working Sanitary Manholes

PubMed Central

Ling, Alison L.; Robertson, Charles E.; Harris, J. Kirk; Frank, Daniel N.; Kotter, Cassandra V.; Stevens, Mark J.; Pace, Norman R.; Hernandez, Mark T.

2015-01-01

Microbially-induced concrete corrosion in headspaces threatens wastewater infrastructure worldwide. Models for predicting corrosion rates in sewer pipe networks rely largely on information from culture-based investigations. In this study, the succession of microbes associated with corroding concrete was characterized over a one-year monitoring campaign using rRNA sequence-based phylogenetic methods. New concrete specimens were exposed in two highly corrosive manholes (high concentrations of hydrogen sulfide and carbon dioxide gas) on the Colorado Front Range for up to a year. Community succession on corroding surfaces was assessed using Illumina MiSeq sequencing of 16S bacterial rRNA amplicons and Sanger sequencing of 16S universal rRNA clones. Microbial communities associated with corrosion fronts presented distinct succession patterns which converged to markedly low α-diversity levels (< 10 taxa) in conjunction with decreasing pH. The microbial community succession pattern observed in this study agreed with culture-based models that implicate acidophilic sulfur-oxidizer Acidithiobacillus spp. in advanced communities, with two notable exceptions. Early communities exposed to alkaline surface pH presented relatively high α-diversity, including heterotrophic, nitrogen-fixing, and sulfur-oxidizing genera, and one community exposed to neutral surface pH presented a diverse transition community comprised of less than 20% sulfur-oxidizers. PMID:25748024
High-resolution microbial community succession of microbially induced concrete corrosion in working sanitary manholes.

PubMed

Ling, Alison L; Robertson, Charles E; Harris, J Kirk; Frank, Daniel N; Kotter, Cassandra V; Stevens, Mark J; Pace, Norman R; Hernandez, Mark T

2015-01-01

Microbially-induced concrete corrosion in headspaces threatens wastewater infrastructure worldwide. Models for predicting corrosion rates in sewer pipe networks rely largely on information from culture-based investigations. In this study, the succession of microbes associated with corroding concrete was characterized over a one-year monitoring campaign using rRNA sequence-based phylogenetic methods. New concrete specimens were exposed in two highly corrosive manholes (high concentrations of hydrogen sulfide and carbon dioxide gas) on the Colorado Front Range for up to a year. Community succession on corroding surfaces was assessed using Illumina MiSeq sequencing of 16S bacterial rRNA amplicons and Sanger sequencing of 16S universal rRNA clones. Microbial communities associated with corrosion fronts presented distinct succession patterns which converged to markedly low α-diversity levels (< 10 taxa) in conjunction with decreasing pH. The microbial community succession pattern observed in this study agreed with culture-based models that implicate acidophilic sulfur-oxidizer Acidithiobacillus spp. in advanced communities, with two notable exceptions. Early communities exposed to alkaline surface pH presented relatively high α-diversity, including heterotrophic, nitrogen-fixing, and sulfur-oxidizing genera, and one community exposed to neutral surface pH presented a diverse transition community comprised of less than 20% sulfur-oxidizers.
Population Diversity and Dynamics of Streptococcus mitis, Streptococcus oralis, and Streptococcus infantis in the Upper Respiratory Tracts of Adults, Determined by a Nonculture Strategy▿

PubMed Central

Bek-Thomsen, Malene; Tettelin, Hervé; Hance, Ioana; Nelson, Karen E.; Kilian, Mogens

2008-01-01

We reinvestigated the clonal diversity and dynamics of Streptococcus mitis and two other abundant members of the commensal microbiota of the upper respiratory tract, Streptococcus oralis and Streptococcus infantis, to obtain information about the origin of frequently emerging clones in this habitat. A culture-independent method was used, based on cloning and sequencing of PCR amplicons of the housekeeping gene gdh, which shows remarkable, yet species-specific, genetic polymorphism. Samples were collected from all potential ecological niches in the oral cavity and pharynx of two adults on two occasions separated by 2 years. Based on analysis of close to 10,000 sequences, significant diversity was observed in populations of all three species. Fluctuations in the relative proportions of individual clones and species were observed over time. While a few clones dominated, the proportions of most clones were very small. The results show that the frequent turnover of S. mitis, S. oralis, and S. infantis clones observed by cultivation can be explained by fluctuations in the relative proportions of clones, most of which are below the level of detection by the traditional culture technique, possibly combined with loss and acquisition from contacts. These findings provide a platform for understanding the mechanisms that govern the balance within the complex microbiota at mucosal sites and between the microbiota and the mucosal immune system of the host. PMID:18316382
Burkholderia pseudomallei sequencing identifies genomic clades with distinct recombination, accessory, and epigenetic profiles

PubMed Central

Nandi, Tannistha; Holden, Matthew T.G.; Didelot, Xavier; Mehershahi, Kurosh; Boddey, Justin A.; Beacham, Ifor; Peak, Ian; Harting, John; Baybayan, Primo; Guo, Yan; Wang, Susana; How, Lee Chee; Sim, Bernice; Essex-Lopresti, Angela; Sarkar-Tyson, Mitali; Nelson, Michelle; Smither, Sophie; Ong, Catherine; Aw, Lay Tin; Hoon, Chua Hui; Michell, Stephen; Studholme, David J.; Titball, Richard; Chen, Swaine L.; Parkhill, Julian

2015-01-01

Burkholderia pseudomallei (Bp) is the causative agent of the infectious disease melioidosis. To investigate population diversity, recombination, and horizontal gene transfer in closely related Bp isolates, we performed whole-genome sequencing (WGS) on 106 clinical, animal, and environmental strains from a restricted Asian locale. Whole-genome phylogenies resolved multiple genomic clades of Bp, largely congruent with multilocus sequence typing (MLST). We discovered widespread recombination in the Bp core genome, involving hundreds of regions associated with multiple haplotypes. Highly recombinant regions exhibited functional enrichments that may contribute to virulence. We observed clade-specific patterns of recombination and accessory gene exchange, and provide evidence that this is likely due to ongoing recombination between clade members. Reciprocally, interclade exchanges were rarely observed, suggesting mechanisms restricting gene flow between clades. Interrogation of accessory elements revealed that each clade harbored a distinct complement of restriction-modification (RM) systems, predicted to cause clade-specific patterns of DNA methylation. Using methylome sequencing, we confirmed that representative strains from separate clades indeed exhibit distinct methylation profiles. Finally, using an E. coli system, we demonstrate that Bp RM systems can inhibit uptake of non-self DNA. Our data suggest that RM systems borne on mobile elements, besides preventing foreign DNA invasion, may also contribute to limiting exchanges of genetic material between individuals of the same species. Genomic clades may thus represent functional units of genetic isolation in Bp, modulating intraspecies genetic diversity. PMID:25236617
Lactobacillus buchneri Genotyping on the Basis of Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) Locus Diversity

PubMed Central

Briner, Alexandra E.

2014-01-01

Clustered regularly interspaced short palindromic repeats (CRISPR) in combination with associated sequences (cas) constitute the CRISPR-Cas immune system, which uptakes DNA from invasive genetic elements as novel “spacers” that provide a genetic record of immunization events. We investigated the potential of CRISPR-based genotyping of Lactobacillus buchneri, a species relevant for commercial silage, bioethanol, and vegetable fermentations. Upon investigating the occurrence and diversity of CRISPR-Cas systems in Lactobacillus buchneri genomes, we observed a ubiquitous occurrence of CRISPR arrays containing a 36-nucleotide (nt) type II-A CRISPR locus adjacent to four cas genes, including the universal cas1 and cas2 genes and the type II signature gene cas9. Comparative analysis of CRISPR spacer content in 26 L. buchneri pickle fermentation isolates associated with spoilage revealed 10 unique locus genotypes that contained between 9 and 29 variable spacers. We observed a set of conserved spacers at the ancestral end, reflecting a common origin, as well as leader-end polymorphisms, reflecting recent divergence. Some of these spacers showed perfect identity with phage sequences, and many spacers showed homology to Lactobacillus plasmid sequences. Following a comparative analysis of sequences immediately flanking protospacers that matched CRISPR spacers, we identified a novel putative protospacer-adjacent motif (PAM), 5′-AAAA-3′. Overall, these findings suggest that type II-A CRISPR-Cas systems are valuable for genotyping of L. buchneri. PMID:24271175
Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.

PubMed

Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P

1997-11-01

A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.
High genetic diversity of Vibrio cholerae in the European lake Neusiedler See is associated with intensive recombination in the reed habitat and the long-distance transfer of strains.

PubMed

Pretzer, Carina; Druzhinina, Irina S; Amaro, Carmen; Benediktsdóttir, Eva; Hedenström, Ingela; Hervio-Heath, Dominique; Huhulescu, Steliana; Schets, Franciska M; Farnleitner, Andreas H; Kirschner, Alexander K T

2017-01-01

Coastal marine Vibrio cholerae populations usually exhibit high genetic diversity. To assess the genetic diversity of abundant V. cholerae non-O1/non-O139 populations in the Central European lake Neusiedler See, we performed a phylogenetic analysis based on recA, toxR, gyrB and pyrH loci sequenced for 472 strains. The strains were isolated from three ecologically different habitats in a lake that is a hot-spot of migrating birds and an important bathing water. We also analyzed 76 environmental and human V. cholerae non-O1/non-O139 isolates from Austria and other European countries and added sequences of seven genome-sequenced strains. Phylogenetic analysis showed that the lake supports a unique endemic diversity of V. cholerae that is particularly rich in the reed stand. Phylogenetic trees revealed that many V. cholerae isolates from European countries were genetically related to the strains present in the lake belonging to statistically supported monophyletic clades. We hypothesize that the observed phenomena can be explained by the high degree of genetic recombination that is particularly intensive in the reed stand, acting along with the long distance transfer of strains most probably via birds and/or humans. Thus, the Neusiedler See may serve as a bioreactor for the appearance of new strains with new (pathogenic) properties. © 2016 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Genetic diversity of Pinus nigra Arn. populations in Southern Spain and Northern Morocco revealed by inter-simple sequence repeat profiles.

PubMed

Rubio-Moraga, Angela; Candel-Perez, David; Lucas-Borja, Manuel E; Tiscar, Pedro A; Viñegla, Benjamin; Linares, Juan C; Gómez-Gómez, Lourdes; Ahrazem, Oussama

2012-01-01

Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA) and Nei's genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst) was 0.233. Cuenca showed the highest Nei's genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups-Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco-while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR) method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.
Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles †

PubMed Central

Rubio-Moraga, Angela; Candel-Perez, David; Lucas-Borja, Manuel E.; Tiscar, Pedro A.; Viñegla, Benjamin; Linares, Juan C.; Gómez-Gómez, Lourdes; Ahrazem, Oussama

2012-01-01

Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA) and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst) was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups—Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco—while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR) method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra. PMID:22754321
Reading biological processes from nucleotide sequences

NASA Astrophysics Data System (ADS)

Murugan, Anand

Cellular processes have traditionally been investigated by techniques of imaging and biochemical analysis of the molecules involved. The recent rapid progress in our ability to manipulate and read nucleic acid sequences gives us direct access to the genetic information that directs and constrains biological processes. While sequence data is being used widely to investigate genotype-phenotype relationships and population structure, here we use sequencing to understand biophysical mechanisms. We present work on two different systems. First, in chapter 2, we characterize the stochastic genetic editing mechanism that produces diverse T-cell receptors in the human immune system. We do this by inferring statistical distributions of the underlying biochemical events that generate T-cell receptor coding sequences from the statistics of the observed sequences. This inferred model quantitatively describes the potential repertoire of T-cell receptors that can be produced by an individual, providing insight into its potential diversity and the probability of generation of any specific T-cell receptor. Then in chapter 3, we present work on understanding the functioning of regulatory DNA sequences in both prokaryotes and eukaryotes. Here we use experiments that measure the transcriptional activity of large libraries of mutagenized promoters and enhancers and infer models of the sequence-function relationship from this data. For the bacterial promoter, we infer a physically motivated 'thermodynamic' model of the interaction of DNA-binding proteins and RNA polymerase determining the transcription rate of the downstream gene. For the eukaryotic enhancers, we infer heuristic models of the sequence-function relationship and use these models to find synthetic enhancer sequences that optimize inducibility of expression. Both projects demonstrate the utility of sequence information in conjunction with sophisticated statistical inference techniques for dissecting underlying biophysical mechanisms.
Genotype diversity of hepatitis C virus (HCV) in HCV-associated liver disease patients in Indonesia.

PubMed

Utama, Andi; Tania, Navessa Padma; Dhenni, Rama; Gani, Rino Alvani; Hasan, Irsan; Sanityoso, Andri; Lelosutan, Syafruddin A R; Martamala, Ruswhandi; Lesmana, Laurentius Adrianus; Sulaiman, Ali; Tai, Susan

2010-09-01

Hepatitis C virus (HCV) genotype distribution in Indonesia has been reported. However, the identification of HCV genotype was based on 5'-UTR or NS5B sequence. This study was aimed to observe HCV core sequence variation among HCV-associated liver disease patients in Jakarta, and to analyse the HCV genotype diversity based on the core sequence. Sixty-eight chronic hepatitis (CH), 48 liver cirrhosis (LC) and 34 hepatocellular carcinoma (HCC) were included in this study. HCV core variation was analysed by direct sequencing. Alignment of HCV core sequences demonstrated that the core sequence was relatively varied among the genotype. Indeed, 237 bases of the core sequence could classify the HCV subtype; however, 236 bases failed to differentiate several subtypes. Based on 237 bases of the core sequences, the HCV strains were classified into genotypes 1 (subtypes 1a, 1b and 1c), 2 (subtypes 2a, 2e and 2f) and 3 (subtypes 3a and 3k). The HCV 1b (47.3%) was the most prevalent, followed by subtypes 1c (18.7%), 3k (10.7%), 2a (10.0%), 1a (6.7%), 2e (5.3%), 2f (0.7%) and 3a (0.7%). HCV 1b was the most common in all patients, and the prevalence increased with the severity of liver disease (36.8% in CH, 54.2% in LC and 58.8% in HCC). These results were similar to a previous report based on NS5B sequence analysis. Hepatitis C virus core sequence (237 bases) could identify the HCV subtype and the prevalence of HCV subtype based on core sequence was similar to those based on the NS5B region.
Flexible theta sequence compression mediated via phase precessing interneurons

PubMed Central

Chadwick, Angus; van Rossum, Mark CW; Nolan, Matthew F

2016-01-01

Encoding of behavioral episodes as spike sequences during hippocampal theta oscillations provides a neural substrate for computations on events extended across time and space. However, the mechanisms underlying the numerous and diverse experimentally observed properties of theta sequences remain poorly understood. Here we account for theta sequences using a novel model constrained by the septo-hippocampal circuitry. We show that when spontaneously active interneurons integrate spatial signals and theta frequency pacemaker inputs, they generate phase precessing action potentials that can coordinate theta sequences in place cell populations. We reveal novel constraints on sequence generation, predict cellular properties and neural dynamics that characterize sequence compression, identify circuit organization principles for high capacity sequential representation, and show that theta sequences can be used as substrates for association of conditioned stimuli with recent and upcoming events. Our results suggest mechanisms for flexible sequence compression that are suited to associative learning across an animal’s lifespan. DOI: http://dx.doi.org/10.7554/eLife.20349.001 PMID:27929374
High-throughput sequence-based analysis of the bacterial composition of kefir and an associated kefir grain.

PubMed

Dobson, Alleson; O'Sullivan, Orla; Cotter, Paul D; Ross, Paul; Hill, Colin

2011-07-01

Lacticin 3147 is a two-peptide broad spectrum lantibiotic produced by Lactococcus lactis DPC3147 shown to inhibit a number of clinically relevant Gram-positive pathogens. Initially isolated from an Irish kefir grain, lacticin 3147 is one of the most extensively studied lantibiotics to date. In this study, the bacterial diversity of the Irish kefir grain from which L. lactis DPC3147 was originally isolated was for the first time investigated using a high-throughput parallel sequencing strategy. A total of 17 416 unique V4 variable regions of the 16S rRNA gene were analysed from both the kefir starter grain and its derivative kefir-fermented milk. Firmicutes (which includes the lactic acid bacteria) was the dominant phylum accounting for > 92% of sequences. Within the Firmicutes, dramatic differences in abundance were observed when the starter grain and kefir milk fermentate were compared. The kefir grain-associated bacterial community was largely composed of the Lactobacillaceae family while Streptococcaceae (primarily Lactococcus spp.) was the dominant family within the kefir milk fermentate. Sequencing data confirmed previous findings that the microbiota of kefir milk and the starter grain are quite different while at the same time, establishing that the microbial diversity of the starter grain is not uniform with a greater level of diversity associated with the interior kefir starter grain compared with the exterior. © 2011 Teagasc Food Research Centre, Moorepark. FEMS Microbiology Letters © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd.

It's all relative: ranking the diversity of aquatic bacterial communities.

PubMed

Shaw, Allison K; Halpern, Aaron L; Beeson, Karen; Tran, Bao; Venter, J Craig; Martiny, Jennifer B H

2008-09-01

The study of microbial diversity patterns is hampered by the enormous diversity of microbial communities and the lack of resources to sample them exhaustively. For many questions about richness and evenness, however, one only needs to know the relative order of diversity among samples rather than total diversity. We used 16S libraries from the Global Ocean Survey to investigate the ability of 10 diversity statistics (including rarefaction, non-parametric, parametric, curve extrapolation and diversity indices) to assess the relative diversity of six aquatic bacterial communities. Overall, we found that the statistics yielded remarkably similar rankings of the samples for a given sequence similarity cut-off. This correspondence, despite the different underlying assumptions of the statistics, suggests that diversity statistics are a useful tool for ranking samples of microbial diversity. In addition, sequence similarity cut-off influenced the diversity ranking of the samples, demonstrating that diversity statistics can also be used to detect differences in phylogenetic structure among microbial communities. Finally, a subsampling analysis suggests that further sequencing from these particular clone libraries would not have substantially changed the richness rankings of the samples.
Unexpected biodiversity of ciliates in marine samples from below the photic zone.

PubMed

Grattepanche, Jean-David; Santoferrara, Luciana F; McManus, George B; Katz, Laura A

2016-08-01

Marine microbial eukaryotes play critical roles in planktonic food webs and have been described as most diverse in the photic zone where productivity is high. We used high-throughput sequencing (HTS) to analyse the spatial distribution of planktonic ciliate diversity from shallow waters (<30 m depth) to beyond the continental shelf (>800 m depth) along a 163 km transect off the coast of New England, USA. We focus on ciliates in the subclasses Oligotrichia and Choreotrichia (class Spirotrichea), as these taxa are major components of marine food webs. We did not observe the decrease of diversity below the photic zone expected based on productivity and previous analyses. Instead, we saw an increase of diversity with depth. We also observed that the ciliate communities assessed by HTS cluster by depth layer and degree of water column stratification, suggesting that community assembly is driven by environmental factors. Across our samples, abundant OTUs tend to match previously characterized morphospecies while rare OTUs are more often undescribed, consistent with the idea that species in the rare biosphere remain to be characterized by microscopy. Finally, samples taken below the photic zone also reveal the prevalence of two uncharacterized (i.e. lacking sequenced morphospecies) clades - clusters X1 and X2 - that are enriched within the nano-sized fraction (2-10 μm) and are defined by deletions within the region of the SSU-rDNA analysed here. Together, these data reinforce that we still have much to learn about microbial diversity in marine ecosystems, especially in deep-waters that may be a reservoir for rare species and uncharacterized taxa. © 2016 John Wiley & Sons Ltd.
Cheese rind communities provide tractable systems for in situ and in vitro studies of microbial diversity

PubMed Central

Wolfe, Benjamin E.; Button, Julie E.; Santarelli, Marcela; Dutton, Rachel J.

2014-01-01

SUMMARY Tractable microbial communities are needed to bridge the gap between observations of patterns of microbial diversity and mechanisms that can explain these patterns. We developed cheese rinds as model microbial communities by characterizing in situ patterns of diversity and by developing an in vitro system for community reconstruction. Sequencing of 137 different rind communities across 10 countries revealed 24 widely distributed and culturable genera of bacteria and fungi as dominant community members. Reproducible community types formed independent of geographic location of production. Intensive temporal sampling demonstrated that assembly of these communities is highly reproducible. Patterns of community composition and succession observed in situ can be recapitulated in a simple in vitro system. Widespread positive and negative interactions were identified between bacterial and fungal community members. Cheese rind microbial communities represent an experimentally tractable system for defining mechanisms that influence microbial community assembly and function. PMID:25036636
Comparison of mitochondrial DNA control region sequence and microsatellite DNA analyses in estimating population structure and gene flow rates in Atlantic sturgeon Acipenser oxyrinchus

USGS Publications Warehouse

Wirgin, I.; Waldman, J.; Stabile, J.; Lubinski, B.; King, T.

2002-01-01

Atlantic sturgeon Acipenser oxyrinchus is large, long-lived, and anadromous with subspecies distributed along the Atlantic (A. oxyrinchus oxyrinchus) and Gulf of Mexico (A. o. desotoi) coasts of North America. Although it is not certain if extirpation of some population units has occurred, because of anthropogenic influences abundances of all populations are low compared with historical levels. Informed management of A. oxyrinchus demands a detailed knowledge of its population structure, levels of genetic diversity, and likelihood to home to natal rivers. We compared the use of mitochondrial DNA (mtDNA) control region sequence and microsatellite nuclear DNA (nDNA) analyses in identifying the stock structure and homing fidelity of Atlantic and Gulf coast populations of A. oxyrinchus. The approaches were concordant in that they revealed moderate to high levels of genetic diversity and suggested that populations of Atlantic sturgeon are highly structured. At least six genetically distinct management units were detected using the two approaches among the rivers surveyed. Mitochondrial DNA sequences revealed a significant cline in haplotype diversity along the Atlantic coast with monomorphism observed in Canadian populations. High levels of nDNA diversity were also observed among populations along the Atlantic coast, including the two Canadian populations, probably resulting from the more rapid rate of mutational and evolutionary change at microsatellite loci. Estimates of gene flow among populations were similar between both approaches with the exception that because of mtDNA monomorphism in Canadian populations, gene flow estimates between them were unobtainable. Analyses of both genomes provided high resolution and confidence in characterizing the population structure of Atlantic sturgeon. Microsatellite analysis was particularly informative in delineating population structure in rivers that were recently glaciated and may prove diagnostic in rivers that are geographically proximal along the south Atlantic coast of the US.
Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

PubMed

Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R

2001-11-23

Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.
Comprehensive phylogenetic analysis of bacterial reverse transcriptases.

PubMed

Toro, Nicolás; Nisa-Martínez, Rafael

2014-01-01

Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology.
Comprehensive Phylogenetic Analysis of Bacterial Reverse Transcriptases

PubMed Central

Toro, Nicolás; Nisa-Martínez, Rafael

2014-01-01

Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology. PMID:25423096
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila

PubMed Central

Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.

1980-01-01

The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
Microbial biogeography of San Francisco Bay sediments

NASA Astrophysics Data System (ADS)

Lee, J. A.; Francis, C. A.

2014-12-01

The largest estuary on the west coast of North America, San Francisco Bay is an ecosystem of enormous biodiversity, and also enormous human impact. The benthos has experienced dredging, occupation by invasive species, and over a century of sediment input as a result of hydraulic mining. Although the Bay's great cultural and ecological importance has inspired numerous surveys of the benthic macrofauna, to date there has been almost no investigation of the microbial communities on the Bay floor. An understanding of those microbial communities would contribute significantly to our understanding of both the biogeochemical processes (which are driven by the microbiota) and the physical processes (which contribute to microbial distributions) in the Bay. Here, we present the first broad survey of bacterial and archaeal taxa in the sediments of the San Francisco Bay. We conducted 16S rRNA community sequencing of bacteria and archaea in sediment samples taken bimonthly for one year, from five sites spanning the salinity gradient between Suisun and Central Bay, in order to capture the effect of both spatial and temporal environmental variation on microbial diversity. From the same samples we also conducted deep sequencing of a nitrogen-cycling functional gene, nirS, allowing an assessment of evolutionary diversity at a much finer taxonomic scale within an important and widespread functional group of bacteria. We paired these sequencing projects with extensive geochemical metadata as well as information about macrofaunal distribution. Our data reveal a diversity of distinct biogeographical patterns among different taxa: clades ubiquitous across sites; clades that respond to measurable environmental drivers; and clades that show geographical site-specificity. These community datasets allow us to test the hypothesis that salinity is a major driver of both overall microbial community structure and community structure of the denitrifying bacteria specifically; and to assess whether patterns of diversity observed at the broadest of taxonomic scales also apply to patterns observed within a single extremely diverse gene (nirS). In sum, this project provides a first look at the forces driving the migration and selection of microbial communities in San Francisco Bay.
Recombination enhances HIV-1 envelope diversity by facilitating the survival of latent genomic fragments in the plasma virus population

DOE Office of Scientific and Technical Information (OSTI.GOV)

Immonen, Taina T.; Conway, Jessica M.; Romero-Severson, Ethan O.

HIV-1 is subject to immune pressure exerted by the host, giving variants that escape the immune response an advantage. Virus released from activated latent cells competes against variants that have continually evolved and adapted to host immune pressure. Nevertheless, there is increasing evidence that virus displaying a signal of latency survives in patient plasma despite having reduced fitness due to long-term immune memory. We investigated the survival of virus with latent envelope genomic fragments by simulating within-host HIV-1 sequence evolution and the cycling of viral lineages in and out of the latent reservoir. Our model incorporates a detailed mutation processmore » including nucleotide substitution, recombination, latent reservoir dynamics, diversifying selection pressure driven by the immune response, and purifying selection pressure asserted by deleterious mutations. We evaluated the ability of our model to capture sequence evolution in vivo by comparing our simulated sequences to HIV-1 envelope sequence data from 16 HIV-infected untreated patients. Empirical sequence divergence and diversity measures were qualitatively and quantitatively similar to those of our simulated HIV-1 populations, suggesting that our model invokes realistic trends of HIV-1 genetic evolution. Moreover, reconstructed phylogenies of simulated and patient HIV-1 populations showed similar topological structures. Our simulation results suggest that recombination is a key mechanism facilitating the persistence of virus with latent envelope genomic fragments in the productively infected cell population. Recombination increased the survival probability of latent virus forms approximately 13-fold. Prevalence of virus with latent fragments in productively infected cells was observed in only 2% of simulations when we ignored recombination, while the proportion increased to 27% of simulations when we allowed recombination. We also found that the selection pressures exerted by different fitness landscapes influenced the shape of phylogenies, diversity trends, and survival of virus with latent genomic fragments. Furthermore, our model predicts that the persistence of latent genomic fragments from multiple different ancestral origins increases sequence diversity in plasma for reasonable fitness landscapes.« less
Recombination enhances HIV-1 envelope diversity by facilitating the survival of latent genomic fragments in the plasma virus population

DOE PAGES

Immonen, Taina T.; Conway, Jessica M.; Romero-Severson, Ethan O.; ...

2015-12-22

HIV-1 is subject to immune pressure exerted by the host, giving variants that escape the immune response an advantage. Virus released from activated latent cells competes against variants that have continually evolved and adapted to host immune pressure. Nevertheless, there is increasing evidence that virus displaying a signal of latency survives in patient plasma despite having reduced fitness due to long-term immune memory. We investigated the survival of virus with latent envelope genomic fragments by simulating within-host HIV-1 sequence evolution and the cycling of viral lineages in and out of the latent reservoir. Our model incorporates a detailed mutation processmore » including nucleotide substitution, recombination, latent reservoir dynamics, diversifying selection pressure driven by the immune response, and purifying selection pressure asserted by deleterious mutations. We evaluated the ability of our model to capture sequence evolution in vivo by comparing our simulated sequences to HIV-1 envelope sequence data from 16 HIV-infected untreated patients. Empirical sequence divergence and diversity measures were qualitatively and quantitatively similar to those of our simulated HIV-1 populations, suggesting that our model invokes realistic trends of HIV-1 genetic evolution. Moreover, reconstructed phylogenies of simulated and patient HIV-1 populations showed similar topological structures. Our simulation results suggest that recombination is a key mechanism facilitating the persistence of virus with latent envelope genomic fragments in the productively infected cell population. Recombination increased the survival probability of latent virus forms approximately 13-fold. Prevalence of virus with latent fragments in productively infected cells was observed in only 2% of simulations when we ignored recombination, while the proportion increased to 27% of simulations when we allowed recombination. We also found that the selection pressures exerted by different fitness landscapes influenced the shape of phylogenies, diversity trends, and survival of virus with latent genomic fragments. Furthermore, our model predicts that the persistence of latent genomic fragments from multiple different ancestral origins increases sequence diversity in plasma for reasonable fitness landscapes.« less
Variation in Seed Fatty Acid Composition, and Sequence Divergence in the FAD2 Gene Coding Region between Wild and Cultivated Sesame

USDA-ARS?s Scientific Manuscript database

Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examinati...
Improving ITS sequence data for identification of plant pathogenic fungi

Treesearch

R. Henrik Nilsson; Kevin D. Hyde; Julia Pawłowska; Martin Ryberg; Leho Tedersoo; Anders Bjørnsgard Aas; Siti A. Alias; Artur Alves; Cajsa Lisa Anderson; Alexandre Antonelli; A. Elizabeth Arnold; Barbara Bahnmann; Mohammad Bahram; Johan Bengtsson-Palme; Anna Berlin; Sara Branco; Putarak Chomnunti; Asha Dissanayake; Rein Drenkhan; Hanna Friberg; Tobias Guldberg Frøslev; Bettina Halwachs; Martin Hartmann; Beatrice Henricot; Ruvishika Jayawardena; Ari Jumpponen; Håvard Kauserud; Sonja Koskela; Tomasz Kulik; Kare Liimatainen; Björn D. Lindahl; Daniel Lindner; Jian-Kui Liu; Sajeewa Maharachchikumbura; Dimuthu Manamgoda; Svante Martinsson; Maria Alice Neves; Tuula Niskanen; Stephan Nylinder; Olinto Liparini Pereira; Danilo Batista Pinho; Teresita M. Porter; Valentin Queloz; Taavi Riit; Marisol Sánchez-García; Filipe de Sousa; Emil Stefańczyk; Mariusz Tadych; Susumu Takamatsu; Qing Tian; Dhanushka Udayanga; Martin Unterseher; Zheng Wang; Saowanee Wikee; Jiye Yan; Ellen Larsson; Karl-Henrik Larsson; Urmas Kõljalg; Kessy Abarenkov

2014-01-01

Plant pathogenic fungi are a large and diverse assemblage of eukaryotes with substantial impacts on natural ecosystems and human endeavours. These taxa often have complex and poorly understood life cycles, lack observable, discriminatory morphological characters, and may not be amenable to in vitro culturing. As a result, species identification is frequently difficult...
Bacterial microbiome in the nose of healthy cats and in cats with nasal disease

PubMed Central

Tress, Barbara; Suchodolski, Jan S.; Nisar, Tariq; Ravindran, Prajesh; Weber, Karin; Hartmann, Katrin; Schulz, Bianka S.

2017-01-01

Background Traditionally, changes in the microbial population of the nose have been assessed using conventional culture techniques. Sequencing of bacterial 16S rRNA genes demonstrated that the human nose is inhabited by a rich and diverse bacterial microbiome that cannot be detected using culture-based methods. The goal of this study was to describe the nasal microbiome of healthy cats, cats with nasal neoplasia, and cats with feline upper respiratory tract disease (FURTD). Methodology/Principal findings DNA was extracted from nasal swabs of healthy cats (n = 28), cats with nasal neoplasia (n = 16), and cats with FURTD (n = 15), and 16S rRNA genes were sequenced. High species richness was observed in all samples. Rarefaction analysis revealed that healthy cats living indoors had greater species richness (observed species p = 0.042) and Shannon diversity (p = 0.003) compared with healthy cats living outdoors. Higher species richness (observed species p = 0.001) and Shannon diversity (p<0.001) were found in middle-aged cats in comparison to healthy cats in different age groups. Principal coordinate analysis revealed separate clustering based on similarities in bacterial molecular phylogenetic trees of 16S rRNA genes for indoor and outdoor cats. In all groups examined, the most abundant phyla identified were Proteobacteria, Firmicutes, and Bacteroidetes. At the genus level, 375 operational taxonomic units (OTUs) were identified. In healthy cats and cats with FURTD, Moraxella spp. was the most common genus, while it was unclassified Bradyrhizobiaceae in cats with nasal neoplasia. High individual variability was observed. Conclusion This study demonstrates that the nose of cats is inhabited by much more variable and diverse microbial communities than previously shown. Future research in this field might help to develop new diagnostic tools to easily identify nasal microbial changes, relate them to certain disease processes, and help clinicians in the decision process of antibiotic selection for individual patients. PMID:28662139
Genetic variation and population structure in Jamunapari goats using microsatellites, mitochondrial DNA, and milk protein genes.

PubMed

Rout, P K; Thangraj, K; Mandal, A; Roy, R

2012-01-01

Jamunapari, a dairy goat breed of India, has been gradually declining in numbers in its home tract over the years. We have analysed genetic variation and population history in Jamunapari goats based on 17 microsatellite loci, 2 milk protein loci, mitochondrial hypervariable region I (HVRI) sequencing, and three Y-chromosomal gene sequencing. We used the mitochondrial DNA (mtDNA) mismatch distribution, microsatellite data, and bottleneck tests to infer the population history and demography. The mean number of alleles per locus was 9.0 indicating that the allelic variation was high in all the loci and the mean heterozygosity was 0.769 at nuclear loci. Although the population size is smaller than 8,000 individuals, the amount of variability both in terms of allelic richness and gene diversity was high in all the microsatellite loci except ILST 005. The gene diversity and effective number of alleles at milk protein loci were higher than the 10 other Indian goat breeds that they were compared to. Mismatch analysis was carried out and the analysis revealed that the population curve was unimodal indicating the expansion of population. The genetic diversity of Y-chromosome genes was low in the present study. The observed mean M ratio in the population was above the critical significance value (Mc) and close to one indicating that it has maintained a slowly changing population size. The mode-shift test did not detect any distortion of allele frequency and the heterozygosity excess method showed that there was no significant departure from mutation-drift equilibrium detected in the population. However, the effects of genetic bottlenecks were observed in some loci due to decreased heterozygosity and lower level of M ratio. There were two observed genetic subdivisions in the population supporting the observations of farmers in different areas. This base line information on genetic diversity, bottleneck analysis, and mismatch analysis was obtained to assist the conservation decision and management of the breed.
Genetic Variation and Population Structure in Jamunapari Goats Using Microsatellites, Mitochondrial DNA, and Milk Protein Genes

PubMed Central

Rout, P. K.; Thangraj, K.; Mandal, A.; Roy, R.

2012-01-01

Jamunapari, a dairy goat breed of India, has been gradually declining in numbers in its home tract over the years. We have analysed genetic variation and population history in Jamunapari goats based on 17 microsatellite loci, 2 milk protein loci, mitochondrial hypervariable region I (HVRI) sequencing, and three Y-chromosomal gene sequencing. We used the mitochondrial DNA (mtDNA) mismatch distribution, microsatellite data, and bottleneck tests to infer the population history and demography. The mean number of alleles per locus was 9.0 indicating that the allelic variation was high in all the loci and the mean heterozygosity was 0.769 at nuclear loci. Although the population size is smaller than 8,000 individuals, the amount of variability both in terms of allelic richness and gene diversity was high in all the microsatellite loci except ILST 005. The gene diversity and effective number of alleles at milk protein loci were higher than the 10 other Indian goat breeds that they were compared to. Mismatch analysis was carried out and the analysis revealed that the population curve was unimodal indicating the expansion of population. The genetic diversity of Y-chromosome genes was low in the present study. The observed mean M ratio in the population was above the critical significance value (Mc) and close to one indicating that it has maintained a slowly changing population size. The mode-shift test did not detect any distortion of allele frequency and the heterozygosity excess method showed that there was no significant departure from mutation-drift equilibrium detected in the population. However, the effects of genetic bottlenecks were observed in some loci due to decreased heterozygosity and lower level of M ratio. There were two observed genetic subdivisions in the population supporting the observations of farmers in different areas. This base line information on genetic diversity, bottleneck analysis, and mismatch analysis was obtained to assist the conservation decision and management of the breed. PMID:22606053
Molecular characterization of the 17D-204 yellow fever vaccine.

PubMed

Salmona, Maud; Gazaignes, Sandrine; Mercier-Delarue, Severine; Garnier, Fabienne; Korimbocus, Jehanara; Colin de Verdière, Nathalie; LeGoff, Jerome; Roques, Pierre; Simon, François

2015-10-05

The worldwide use of yellow fever (YF) live attenuated vaccines came recently under close scrutiny as rare but serious adverse events have been reported. The population identified at major risk for these safety issues were extreme ages and immunocompromised subjects. Study NCT01426243 conducted by the French National Agency for AIDS research is an ongoing interventional study to evaluate the safety of the vaccine and the specific immune responses in HIV-infected patients following 17D-204 vaccination. As a preliminary study, we characterized the molecular diversity from E gene of the single 17D-204 vaccine batch used in this clinical study. Eight vials of lyophilized 17D-204 vaccine (Stamaril, Sanofi-Pasteur, Lyon, France) of the E5499 batch were reconstituted for viral quantification, cloning and sequencing of C/prM/E region. The average rate of virions per vial was 8.68 ± 0.07 log₁₀ genome equivalents with a low coefficient of variation (0.81%). 246 sequences of the C/prM/E region (29-33 per vials) were generated and analyzed for the eight vials, 25 (10%) being defective and excluded from analyses. 95% of sequences had at least one nucleotide mutation. The mutations were observed on 662 variant sites distributed through all over the 1995 nucleotides sequence and were mainly non-synonymous (66%). Genome variability between vaccine vials was highly homogeneous with a nucleotide distance ranging from 0.29% to 0.41%. Average p-distances observed for each vial were also homogeneous, ranging from 0.15% to 0.31%. This study showed a homogenous YF virus RNA quantity in vaccine vials within a single lot and a low clonal diversity inter and intra vaccine vials. These results are consistent with a recent study showing that the main mechanism of attenuation resulted in the loss of diversity in the YF virus quasi-species. Copyright © 2015 Elsevier Ltd. All rights reserved.
Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions

PubMed Central

Birtel, Julia; Walser, Jean-Claude; Pichon, Samuel; Bürgmann, Helmut; Matthews, Blake

2015-01-01

Methods to estimate microbial diversity have developed rapidly in an effort to understand the distribution and diversity of microorganisms in natural environments. For bacterial communities, the 16S rRNA gene is the phylogenetic marker gene of choice, but most studies select only a specific region of the 16S rRNA to estimate bacterial diversity. Whereas biases derived from from DNA extraction, primer choice and PCR amplification are well documented, we here address how the choice of variable region can influence a wide range of standard ecological metrics, such as species richness, phylogenetic diversity, β-diversity and rank-abundance distributions. We have used Illumina paired-end sequencing to estimate the bacterial diversity of 20 natural lakes across Switzerland derived from three trimmed variable 16S rRNA regions (V3, V4, V5). Species richness, phylogenetic diversity, community composition, β-diversity, and rank-abundance distributions differed significantly between 16S rRNA regions. Overall, patterns of diversity quantified by the V3 and V5 regions were more similar to one another than those assessed by the V4 region. Similar results were obtained when analyzing the datasets with different sequence similarity thresholds used during sequences clustering and when the same analysis was used on a reference dataset of sequences from the Greengenes database. In addition we also measured species richness from the same lake samples using ARISA Fingerprinting, but did not find a strong relationship between species richness estimated by Illumina and ARISA. We conclude that the selection of 16S rRNA region significantly influences the estimation of bacterial diversity and species distributions and that caution is warranted when comparing data from different variable regions as well as when using different sequencing techniques. PMID:25915756
Overlap and diversity in antimicrobial peptide databases: compiling a non-redundant set of sequences.

PubMed

Aguilera-Mendoza, Longendri; Marrero-Ponce, Yovani; Tellez-Ibarra, Roberto; Llorente-Quesada, Monica T; Salgado, Jesús; Barigye, Stephen J; Liu, Jun

2015-08-01

The large variety of antimicrobial peptide (AMP) databases developed to date are characterized by a substantial overlap of data and similarity of sequences. Our goals are to analyze the levels of redundancy for all available AMP databases and use this information to build a new non-redundant sequence database. For this purpose, a new software tool is introduced. A comparative study of 25 AMP databases reveals the overlap and diversity among them and the internal diversity within each database. The overlap analysis shows that only one database (Peptaibol) contains exclusive data, not present in any other, whereas all sequences in the LAMP_Patent database are included in CAMP_Patent. However, the majority of databases have their own set of unique sequences, as well as some overlap with other databases. The complete set of non-duplicate sequences comprises 16 990 cases, which is almost half of the total number of reported peptides. On the other hand, the diversity analysis identifies the most and least diverse databases and proves that all databases exhibit some level of redundancy. Finally, we present a new parallel-free software, named Dover Analyzer, developed to compute the overlap and diversity between any number of databases and compile a set of non-redundant sequences. These results are useful for selecting or building a suitable representative set of AMPs, according to specific needs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

PubMed

Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

2013-08-01

To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.

High Bacterial Diversity of Biological Soil Crusts in Water Tracks over Permafrost in the High Arctic Polar Desert

DOE PAGES

Steven, Blaire; Lionard, Marie; Kuske, Cheryl R.; ...

2013-08-13

In this paper we report the bacterial diversity of biological soil crusts (biocrusts) inhabiting polar desert soils at the northern land limit of the Arctic polar region (83° 05 N). Employing pyrosequencing of bacterial 16S rRNA genes this study demonstrated that these biocrusts harbor diverse bacterial communities, often as diverse as temperate latitude communities. The effect of wetting pulses on the composition of communities was also determined by collecting samples from soils outside and inside of permafrost water tracks, hill slope flow paths that drain permafrost-affected soils. The intermittent flow regime in the water tracks was correlated with altered relativemore » abundance of phylum level taxonomic bins in the bacterial communities, but the alterations varied between individual sampling sites. Bacteria related to the Cyanobacteria and Acidobacteria demonstrated shifts in relative abundance based on their location either inside or outside of the water tracks. Among cyanobacterial sequences, the proportion of sequences belonging to the family Oscillatoriales consistently increased in relative abundance in the samples from inside the water tracks compared to those outside. Acidobacteria showed responses to wetting pulses in the water tracks, increasing in abundance at one site and decreasing at the other two sites. Subdivision 4 acidobacterial sequences tended to follow the trends in the total Acidobacteria relative abundance, suggesting these organisms were largely responsible for the changes observed in the Acidobacteria. Finally, taken together, these data suggest that the bacterial communities of these high latitude polar biocrusts are diverse but do not show a consensus response to intermittent flow in water tracks over high Arctic permafrost.« less
Calibrating snakehead diversity with DNA barcodes: expanding taxonomic coverage to enable identification of potential and established invasive species.

PubMed

Serrao, Natasha R; Steinke, Dirk; Hanner, Robert H

2014-01-01

Detecting and documenting the occurrence of invasive species outside their native range requires tools to support their identification. This can be challenging for taxa with diverse life stages and/or problematic or unresolved morphological taxonomies. DNA barcoding provides a potent method for identifying invasive species, as it allows for species identification at all life stages, including fragmentary remains. It also provides an efficient interim taxonomic framework for quantifying cryptic genetic diversity by parsing barcode sequences into discontinuous haplogroup clusters (typical of reproductively isolated species) and labelling them with unique alphanumeric identifiers. Snakehead fishes are a diverse group of opportunistic predators endemic to Asia and Africa that may potentially pose significant threats as aquatic invasive species. At least three snakehead species (Channa argus, C. maculata, and C. marulius) are thought to have entered North America through the aquarium and live-food fish markets, and have established populations, yet their origins remain unclear. The objectives of this study were to assemble a library of DNA barcode sequences derived from expert identified reference specimens in order to determine the identity and aid invasion pathway analysis of the non-indigenous species found in North America using DNA barcodes. Sequences were obtained from 121 tissue samples representing 25 species and combined with public records from GenBank for a total of 36 putative species, which then partitioned into 49 discrete haplogroups. Multiple divergent clusters were observed within C. gachua, C. marulius, C. punctata and C. striata suggesting the potential presence of cryptic species diversity within these lineages. Our findings demonstrate that DNA barcoding is a valuable tool for species identification in challenging and under-studied taxonomic groups such as snakeheads, and provides a useful framework for inferring invasion pathway analysis.
High Bacterial Diversity of Biological Soil Crusts in Water Tracks over Permafrost in the High Arctic Polar Desert

PubMed Central

Steven, Blaire; Lionard, Marie; Kuske, Cheryl R.; Vincent, Warwick F.

2013-01-01

In this study we report the bacterial diversity of biological soil crusts (biocrusts) inhabiting polar desert soils at the northern land limit of the Arctic polar region (83° 05 N). Employing pyrosequencing of bacterial 16S rRNA genes this study demonstrated that these biocrusts harbor diverse bacterial communities, often as diverse as temperate latitude communities. The effect of wetting pulses on the composition of communities was also determined by collecting samples from soils outside and inside of permafrost water tracks, hill slope flow paths that drain permafrost-affected soils. The intermittent flow regime in the water tracks was correlated with altered relative abundance of phylum level taxonomic bins in the bacterial communities, but the alterations varied between individual sampling sites. Bacteria related to the Cyanobacteria and Acidobacteria demonstrated shifts in relative abundance based on their location either inside or outside of the water tracks. Among cyanobacterial sequences, the proportion of sequences belonging to the family Oscillatoriales consistently increased in relative abundance in the samples from inside the water tracks compared to those outside. Acidobacteria showed responses to wetting pulses in the water tracks, increasing in abundance at one site and decreasing at the other two sites. Subdivision 4 acidobacterial sequences tended to follow the trends in the total Acidobacteria relative abundance, suggesting these organisms were largely responsible for the changes observed in the Acidobacteria. Taken together, these data suggest that the bacterial communities of these high latitude polar biocrusts are diverse but do not show a consensus response to intermittent flow in water tracks over high Arctic permafrost. PMID:23967218
Analysis of genetic diversity using SNP markers in oat

USDA-ARS?s Scientific Manuscript database

A large-scale single nucleotide polymorphism (SNP) discovery was carried out in cultivated oat using Roche 454 sequencing methods. DNA sequences were generated from cDNAs originating from a panel of 20 diverse oat cultivars, and from Diversity Array Technology (DArT) genomic complexity reductions fr...
MHC class II DQB diversity in the Japanese black bear, Ursus thibetanus japonicus

PubMed Central

2012-01-01

Background The major histocompatibility complex (MHC) genes are one of the most important genetic systems in the vertebrate immune response. The diversity of MHC genes may directly influence the survival of individuals against infectious disease. However, there has been no investigation of MHC diversity in the Asiatic black bear (Ursus thibetanus). Here, we analyzed 270-bp nucleotide sequences of the entire exon 2 region of the MHC DQB gene by using 188 samples from the Japanese black bear (Ursus thibetanus japonicus) from 12 local populations. Results Among 185 of 188 samples, we identified 44 MHC variants that encoded 31 different amino acid sequences (allotypes) and one putative pseudogene. The phylogenetic analysis suggests that MHC variants detected from the Japanese black bear are derived from the DQB locus. One of the 31 DQB allotypes, Urth-DQB*01, was found to be common to all local populations. Moreover, this allotype was shared between the black bear on the Asian continent and the Japanese black bear, suggesting that Urth-DQB*01 might have been maintained in the ancestral black bear population for at least 300,000 years. Our findings, from calculating the ratio of non-synonymous to synonymous substitutions, indicate that balancing selection has maintained genetic variation of peptide-binding residues at the DQB locus of the Japanese black bear. From examination of genotype frequencies among local populations, we observed a considerably lower level of observed heterozygosity than expected. Conclusions The low level of observed heterozygosity suggests that genetic drift reduced DQB diversity in the Japanese black bear due to a bottleneck event at the population or species level. The decline of DQB diversity might have been accelerated by the loss of rare variants that have been maintained by negative frequency-dependent selection. Nevertheless, DQB diversity of the black bear appears to be relatively high compared with some other endangered mammalian species. This result suggests that the Japanese black bears may also retain more potential resistance against pathogens than other endangered mammalian species. To prevent further decline of potential resistance against pathogens, a conservation policy for the Japanese black bear should be designed to maintain MHC rare variants in each local population. PMID:23190438
MHC class II DQB diversity in the Japanese black bear, Ursus thibetanus japonicus.

PubMed

Yasukochi, Yoshiki; Kurosaki, Toshifumi; Yoneda, Masaaki; Koike, Hiroko; Satta, Yoko

2012-11-29

The major histocompatibility complex (MHC) genes are one of the most important genetic systems in the vertebrate immune response. The diversity of MHC genes may directly influence the survival of individuals against infectious disease. However, there has been no investigation of MHC diversity in the Asiatic black bear (Ursus thibetanus). Here, we analyzed 270-bp nucleotide sequences of the entire exon 2 region of the MHC DQB gene by using 188 samples from the Japanese black bear (Ursus thibetanus japonicus) from 12 local populations. Among 185 of 188 samples, we identified 44 MHC variants that encoded 31 different amino acid sequences (allotypes) and one putative pseudogene. The phylogenetic analysis suggests that MHC variants detected from the Japanese black bear are derived from the DQB locus. One of the 31 DQB allotypes, Urth-DQB*01, was found to be common to all local populations. Moreover, this allotype was shared between the black bear on the Asian continent and the Japanese black bear, suggesting that Urth-DQB*01 might have been maintained in the ancestral black bear population for at least 300,000 years. Our findings, from calculating the ratio of non-synonymous to synonymous substitutions, indicate that balancing selection has maintained genetic variation of peptide-binding residues at the DQB locus of the Japanese black bear. From examination of genotype frequencies among local populations, we observed a considerably lower level of observed heterozygosity than expected. The low level of observed heterozygosity suggests that genetic drift reduced DQB diversity in the Japanese black bear due to a bottleneck event at the population or species level. The decline of DQB diversity might have been accelerated by the loss of rare variants that have been maintained by negative frequency-dependent selection. Nevertheless, DQB diversity of the black bear appears to be relatively high compared with some other endangered mammalian species. This result suggests that the Japanese black bears may also retain more potential resistance against pathogens than other endangered mammalian species. To prevent further decline of potential resistance against pathogens, a conservation policy for the Japanese black bear should be designed to maintain MHC rare variants in each local population.
Metagenomic analysis of viral diversity in respiratory samples from patients with respiratory tract infections in Kuwait.

PubMed

Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima

2018-03-01

A metagenomic approach based on target independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We have investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. A metagenomic approach using the next-generation sequencer to characterize viruses was used. According to the metagenomic analysis, an average of 145, 019 reads were identified, and 2% of these reads were of viral origin. Also, metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. Also, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12% with the highest genome coverage of 99.2% for respiratory syncytial virus, and the lowest was 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomics sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in using this method to clinical samples such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.
Global ecological pattern of ammonia-oxidizing archaea.

PubMed

Cao, Huiluo; Auguet, Jean-Christophe; Gu, Ji-Dong

2013-01-01

The global distribution of ammonia-oxidizing archaea (AOA), which play a pivotal role in the nitrification process, has been confirmed through numerous ecological studies. Though newly available amoA (ammonia monooxygenase subunit A) gene sequences from new environments are accumulating rapidly in public repositories, a lack of information on the ecological and evolutionary factors shaping community assembly of AOA on the global scale is apparent. We conducted a meta-analysis on uncultured AOA using over ca. 6,200 archaeal amoA gene sequences, so as to reveal their community distribution patterns along a wide spectrum of physicochemical conditions and habitat types. The sequences were dereplicated at 95% identity level resulting in a dataset containing 1,476 archaeal amoA gene sequences from eight habitat types: namely soil, freshwater, freshwater sediment, estuarine sediment, marine water, marine sediment, geothermal system, and symbiosis. The updated comprehensive amoA phylogeny was composed of three major monophyletic clusters (i.e. Nitrosopumilus, Nitrosotalea, Nitrosocaldus) and a non-monophyletic cluster constituted mostly by soil and sediment sequences that we named Nitrososphaera. Diversity measurements indicated that marine and estuarine sediments as well as symbionts might be the largest reservoirs of AOA diversity. Phylogenetic analyses were further carried out using macroevolutionary analyses to explore the diversification pattern and rates of nitrifying archaea. In contrast to other habitats that displayed constant diversification rates, marine planktonic AOA interestingly exhibit a very recent and accelerating diversification rate congruent with the lowest phylogenetic diversity observed in their habitats. This result suggested the existence of AOA communities with different evolutionary history in the different habitats. Based on an up-to-date amoA phylogeny, this analysis provided insights into the possible evolutionary mechanisms and environmental parameters that shape AOA community assembly at global scale.
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons

PubMed Central

Haas, Brian J.; Gevers, Dirk; Earl, Ashlee M.; Feldgarden, Mike; Ward, Doyle V.; Giannoukos, Georgia; Ciulla, Dawn; Tabbaa, Diana; Highlander, Sarah K.; Sodergren, Erica; Methé, Barbara; DeSantis, Todd Z.; Petrosino, Joseph F.; Knight, Rob; Birren, Bruce W.

2011-01-01

Bacterial diversity among environmental samples is commonly assessed with PCR-amplified 16S rRNA gene (16S) sequences. Perceived diversity, however, can be influenced by sample preparation, primer selection, and formation of chimeric 16S amplification products. Chimeras are hybrid products between multiple parent sequences that can be falsely interpreted as novel organisms, thus inflating apparent diversity. We developed a new chimera detection tool called Chimera Slayer (CS). CS detects chimeras with greater sensitivity than previous methods, performs well on short sequences such as those produced by the 454 Life Sciences (Roche) Genome Sequencer, and can scale to large data sets. By benchmarking CS performance against sequences derived from a controlled DNA mixture of known organisms and a simulated chimera set, we provide insights into the factors that affect chimera formation such as sequence abundance, the extent of similarity between 16S genes, and PCR conditions. Chimeras were found to reproducibly form among independent amplifications and contributed to false perceptions of sample diversity and the false identification of novel taxa, with less-abundant species exhibiting chimera rates exceeding 70%. Shotgun metagenomic sequences of our mock community appear to be devoid of 16S chimeras, supporting a role for shotgun metagenomics in validating novel organisms discovered in targeted sequence surveys. PMID:21212162
Evaluation of genetic diversity in Chinese kale (Brassica oleracea L. var. alboglabra Bailey) by using rapid amplified polymorphic DNA and sequence-related amplified polymorphism markers.

PubMed

Zhang, J; Zhang, L G

2014-02-14

Chinese kale is an original Chinese vegetable of the Cruciferae family. To select suitable parents for hybrid breeding, we thoroughly analyzed the genetic diversity of Chinese kale. Random amplified polymorphic DNA (RAPD) and sequence-related amplified polymorphism (SRAP) molecular markers were used to evaluate the genetic diversity across 21 Chinese kale accessions from AVRDC and Guangzhou in China. A total of 104 bands were detected by 11 RAPD primers, of which 66 (63.5%) were polymorphic, and 229 polymorphic bands (68.4%) were observed in 335 bands amplified by 17 SRAP primer combinations. The dendrogram showed the grouping of the 21 accessions into 4 main clusters based on RAPD data, and into 6 clusters based on SRAP and combined data (RAPD + SRAP). The clustering of accessions based on SRAP data was consistent with petal colors. The Mantel test indicated a poor fit for the RAPD and SRAP data (r = 0.16). These results have an important implication for Chinese kale germplasm characterization and improvement.
Different Lactobacillus populations dominate in "Chorizo de León" manufacturing performed in different production plants.

PubMed

Quijada, Narciso M; De Filippis, Francesca; Sanz, José Javier; García-Fernández, María Del Camino; Rodríguez-Lázaro, David; Ercolini, Danilo; Hernández, Marta

2018-04-01

"Chorizo de Léon" is a high-value Spanish dry fermented sausage traditionally manufactured without the use of starter cultures, owing to the activity of a house-specific autochthonous microbiota that naturally contaminates the meat from the environment, the equipment and the raw materials. Lactic acid bacteria (particularly Lactobacillus) and coagulase-negative cocci (mainly Staphylococcus) have been reported as the most important bacterial groups regarding the organoleptic and safety properties of the dry fermented sausages. In this study, samples from raw minced meat to final products were taken from five different producers and the microbial diversity was investigated by high-throughput sequencing of 16S rRNA gene amplicons. The diverse microbial composition observed during the first stages of "Chorizo de Léon" evolved during ripening to a microbiota mainly composed by Lactobacillus in the final product. Oligotyping performed on 16S rRNA gene sequences of Lactobacillus and Staphylococcus populations revealed sub-genus level diversity within the different manufacturers, likely responsible of the characteristic organoleptic properties of the products from different companies. Copyright © 2017 Elsevier Ltd. All rights reserved.
Genetic diversity of the HpyC1I restriction modification system in Helicobacter pylori.

PubMed

Lehours, Philippe; Dupouy, Sandrine; Chaineux, Julien; Ruskoné-Fourmestraux, Agnès; Delchier, Jean-Charles; Morgner, Andrea; Mégraud, Francis; Ménard, Armelle

2007-04-01

Helicobacter pylori is unique because of the unusually high number and diversity of its restriction modification (R-M) systems. HpyC1I R-M was recently characterized and contains an endonuclease which is an isoschizomer of the endonuclease BccI. This R-M is involved in adherence to gastric epithelial cells, a crucial step in bacterial pathogenesis. This observation illustrates the fact that R-M systems have other putative biological functions in addition to protecting the bacterial genome from external DNA. The genomic diversity of HpyC1I R-M was evaluated more precisely on a large collection of H. pylori strains by PCR, susceptibility to BccI digestion and sequencing. The results obtained support the mechanism of gain and loss of this R-M system in the H. pylori genome, and suggest that it is an ancestral system which gradually disappears during H. pylori evolution, following successive steps: (1) inactivation of the endonuclease gene, followed or accompanied by: (2) inactivation of the methyltransferase genes, and then: (3) definitive loss, leaving only short endonuclease remnant sequences.
Phenotypic Heterogeneity of Genomically-Diverse Isolates of Streptococcus mutans

PubMed Central

Palmer, Sara R.; Miller, James H.; Abranches, Jacqueline; Zeng, Lin; Lefebure, Tristan; Richards, Vincent P.; Lemos, José A.; Stanhope, Michael J.; Burne, Robert A.

2013-01-01

High coverage, whole genome shotgun (WGS) sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates, were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat) and exposure to competence stimulating peptide (CSP). Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease. PMID:23613838
Genotyping-by-sequencing (GBS) revealed molecular genetic diversity of Iranian wheat landraces and cultivars

USDA-ARS?s Scientific Manuscript database

Genetic diversity is an essential resource for breeders to improve new cultivars with desirable characteristics. Recently genotyping-by-sequencing (GBS), a next generation sequencing (NGS) based technology that can simplify complex genomes, has been used as a high-throughput and cost-effective molec...
Transcriptome sequencing of diverse peanut (arachis) wild species and the cultivated species reveals a wealth of untapped genetic variability

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies and improved bioinformatics methods have provided opportunities to study sequence variability in complex polyploid transcriptomes. In this study, we used a diverse panel of twenty-two Arachis accessions representing seven Arachis hypogaea market classes, A-, B...
Arbuscular mycorrhizal fungi diversity influenced by different agricultural management practices in a semi-arid Mediterranean agro-ecosystem

NASA Astrophysics Data System (ADS)

de Mar Alguacil, Maria; Torrecillas, Emma; Garcia-Orenes, Fuensanta; Torres, Maria Pilar; Roldan, Antonio

2013-04-01

The arbuscular mycorrhizal fungi (AMF) are a key, integral component of the stability, sustainability and functioning of ecosystems. In this study a field experiment was performed at the El Teularet-Sierra de Enguera Experimental Station (eastern Spain) to assess the influence during a 6-yr period of different agricultural practices on the diversity of arbuscular mycorrhizal fungi (AMF). The management practices included residual herbicide use, ploughing, ploughing + oats, addition of oat straw mulch and a control (land abandonment). Adjacent soil under natural vegetation was used as a reference for local, high-quality soil and as a control for comparison with the agricultural soils under different management practices. The AM fungal small-subunit (SSU) rRNA genes were subjected to PCR, cloning, sequencing and phylogenetic analyses. Thirty-six different phylotypes were identified, which were grouped in four families: Glomeraceae, Paraglomeraceae, Ambisporaceae and Claroideoglomeraceae. The first results showed significant differences in the distribution of the AMF phylotypes as consequence of the difference between agricultural management practices. Thus, the lowest diversity was observed for the plot that was treated with herbicide. The management practices including ploughing and ploughing + oats had similar AMF diversity. Oat straw mulching yielded the highest number of different AMF sequence types and showed the highest diversity index. Thus, this treatment could be more suitable in sustainable soil use and therefore protection of biodiversity.
Association of high-risk sexual behaviour with diversity of the vaginal microbiota and abundance of Lactobacillus

PubMed Central

Wessels, Jocelyn M.; Lajoie, Julie; Vitali, Danielle; Omollo, Kenneth; Kimani, Joshua; Oyugi, Julius; Cheruiyot, Juliana; Kimani, Makubo; Mungai, John N.; Akolo, Maureen; Stearns, Jennifer C.; Surette, Michael G.; Fowke, Keith R.

2017-01-01

Objective To compare the vaginal microbiota of women engaged in high-risk sexual behaviour (sex work) with women who are not engaged in high-risk sexual behaviour. Diverse vaginal microbiota, low in Lactobacillus species, like those in bacterial vaginosis (BV), are associated with increased prevalence of sexually transmitted infections (STIs) and human immunodeficiency virus (HIV) acquisition. Although high-risk sexual behaviour increases risk for STIs, the vaginal microbiota of sex workers is understudied. Methods A retrospective cross-sectional study was conducted comparing vaginal microbiota of women who are not engaged in sex work (non-sex worker controls, NSW, N = 19) and women engaged in sex work (female sex workers, FSW, N = 48), using Illumina sequencing (16S rRNA, V3 region). Results Bacterial richness and diversity were significantly less in controls, than FSW. Controls were more likely to have Lactobacillus as the most abundant genus (58% vs. 17%; P = 0.002) and composition of their vaginal microbiota differed from FSW (PERMANOVA, P = 0.001). Six microbiota clusters were detected, including a high diversity cluster with three sub-clusters, and 55% of women with low Nugent Scores fell within this cluster. High diversity was observed by 16S sequencing in FSW, regardless of Nugent Scores, suggesting that Nugent Score may not be capable of capturing the diversity present in the FSW vaginal microbiota. Conclusions High-risk sexual behaviour is associated with diversity of the vaginal microbiota and lack of Lactobacillus. These factors could contribute to increased risk of STIs and HIV in women engaged in high-risk sexual behaviour. PMID:29095928
Association of high-risk sexual behaviour with diversity of the vaginal microbiota and abundance of Lactobacillus.

PubMed

Wessels, Jocelyn M; Lajoie, Julie; Vitali, Danielle; Omollo, Kenneth; Kimani, Joshua; Oyugi, Julius; Cheruiyot, Juliana; Kimani, Makubo; Mungai, John N; Akolo, Maureen; Stearns, Jennifer C; Surette, Michael G; Fowke, Keith R; Kaushic, Charu

2017-01-01

To compare the vaginal microbiota of women engaged in high-risk sexual behaviour (sex work) with women who are not engaged in high-risk sexual behaviour. Diverse vaginal microbiota, low in Lactobacillus species, like those in bacterial vaginosis (BV), are associated with increased prevalence of sexually transmitted infections (STIs) and human immunodeficiency virus (HIV) acquisition. Although high-risk sexual behaviour increases risk for STIs, the vaginal microbiota of sex workers is understudied. A retrospective cross-sectional study was conducted comparing vaginal microbiota of women who are not engaged in sex work (non-sex worker controls, NSW, N = 19) and women engaged in sex work (female sex workers, FSW, N = 48), using Illumina sequencing (16S rRNA, V3 region). Bacterial richness and diversity were significantly less in controls, than FSW. Controls were more likely to have Lactobacillus as the most abundant genus (58% vs. 17%; P = 0.002) and composition of their vaginal microbiota differed from FSW (PERMANOVA, P = 0.001). Six microbiota clusters were detected, including a high diversity cluster with three sub-clusters, and 55% of women with low Nugent Scores fell within this cluster. High diversity was observed by 16S sequencing in FSW, regardless of Nugent Scores, suggesting that Nugent Score may not be capable of capturing the diversity present in the FSW vaginal microbiota. High-risk sexual behaviour is associated with diversity of the vaginal microbiota and lack of Lactobacillus. These factors could contribute to increased risk of STIs and HIV in women engaged in high-risk sexual behaviour.
A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.

PubMed

Allali, Imane; Arnold, Jason W; Roach, Jeffrey; Cadenas, Maria Belen; Butz, Natasha; Hassan, Hosni M; Koci, Matthew; Ballou, Anne; Mendoza, Mary; Ali, Rizwana; Azcarate-Peril, M Andrea

2017-09-13

Advancements in Next Generation Sequencing (NGS) technologies regarding throughput, read length and accuracy had a major impact on microbiome research by significantly improving 16S rRNA amplicon sequencing. As rapid improvements in sequencing platforms and new data analysis pipelines are introduced, it is essential to evaluate their capabilities in specific applications. The aim of this study was to assess whether the same project-specific biological conclusions regarding microbiome composition could be reached using different sequencing platforms and bioinformatics pipelines. Chicken cecum microbiome was analyzed by 16S rRNA amplicon sequencing using Illumina MiSeq, Ion Torrent PGM, and Roche 454 GS FLX Titanium platforms, with standard and modified protocols for library preparation. We labeled the bioinformatics pipelines included in our analysis QIIME1 and QIIME2 (de novo OTU picking [not to be confused with QIIME version 2 commonly referred to as QIIME2]), QIIME3 and QIIME4 (open reference OTU picking), UPARSE1 and UPARSE2 (each pair differs only in the use of chimera depletion methods), and DADA2 (for Illumina data only). GS FLX+ yielded the longest reads and highest quality scores, while MiSeq generated the largest number of reads after quality filtering. Declines in quality scores were observed starting at bases 150-199 for GS FLX+ and bases 90-99 for MiSeq. Scores were stable for PGM-generated data. Overall microbiome compositional profiles were comparable between platforms; however, average relative abundance of specific taxa varied depending on sequencing platform, library preparation method, and bioinformatics analysis. Specifically, QIIME with de novo OTU picking yielded the highest number of unique species and alpha diversity was reduced with UPARSE and DADA2 compared to QIIME. The three platforms compared in this study were capable of discriminating samples by treatment, despite differences in diversity and abundance, leading to similar biological conclusions. Our results demonstrate that while there were differences in depth of coverage and phylogenetic diversity, all workflows revealed comparable treatment effects on microbial diversity. To increase reproducibility and reliability and to retain consistency between similar studies, it is important to consider the impact on data quality and relative abundance of taxa when selecting NGS platforms and analysis tools for microbiome studies.
Humboldt's spa: microbial diversity is controlled by temperature in geothermal environments

PubMed Central

Sharp, Christine E; Brady, Allyson L; Sharp, Glen H; Grasby, Stephen E; Stott, Matthew B; Dunfield, Peter F

2014-01-01

Over 200 years ago Alexander von Humboldt (1808) observed that plant and animal diversity peaks at tropical latitudes and decreases toward the poles, a trend he attributed to more favorable temperatures in the tropics. Studies to date suggest that this temperature–diversity gradient is weak or nonexistent for Bacteria and Archaea. To test the impacts of temperature as well as pH on bacterial and archaeal diversity, we performed pyrotag sequencing of 16S rRNA genes retrieved from 165 soil, sediment and biomat samples of 36 geothermal areas in Canada and New Zealand, covering a temperature range of 7.5–99 °C and a pH range of 1.8–9.0. This represents the widest ranges of temperature and pH yet examined in a single microbial diversity study. Species richness and diversity indices were strongly correlated to temperature, with R2 values up to 0.62 for neutral–alkaline springs. The distributions were unimodal, with peak diversity at 24 °C and decreasing diversity at higher and lower temperature extremes. There was also a significant pH effect on diversity; however, in contrast to previous studies of soil microbial diversity, pH explained less of the variability (13–20%) than temperature in the geothermal samples. No correlation was observed between diversity values and latitude from the equator, and we therefore infer a direct temperature effect in our data set. These results demonstrate that temperature exerts a strong control on microbial diversity when considered over most of the temperature range within which life is possible. PMID:24430481

Humboldt's spa: microbial diversity is controlled by temperature in geothermal environments.

PubMed

Sharp, Christine E; Brady, Allyson L; Sharp, Glen H; Grasby, Stephen E; Stott, Matthew B; Dunfield, Peter F

2014-06-01

Over 200 years ago Alexander von Humboldt (1808) observed that plant and animal diversity peaks at tropical latitudes and decreases toward the poles, a trend he attributed to more favorable temperatures in the tropics. Studies to date suggest that this temperature-diversity gradient is weak or nonexistent for Bacteria and Archaea. To test the impacts of temperature as well as pH on bacterial and archaeal diversity, we performed pyrotag sequencing of 16S rRNA genes retrieved from 165 soil, sediment and biomat samples of 36 geothermal areas in Canada and New Zealand, covering a temperature range of 7.5-99 °C and a pH range of 1.8-9.0. This represents the widest ranges of temperature and pH yet examined in a single microbial diversity study. Species richness and diversity indices were strongly correlated to temperature, with R(2) values up to 0.62 for neutral-alkaline springs. The distributions were unimodal, with peak diversity at 24 °C and decreasing diversity at higher and lower temperature extremes. There was also a significant pH effect on diversity; however, in contrast to previous studies of soil microbial diversity, pH explained less of the variability (13-20%) than temperature in the geothermal samples. No correlation was observed between diversity values and latitude from the equator, and we therefore infer a direct temperature effect in our data set. These results demonstrate that temperature exerts a strong control on microbial diversity when considered over most of the temperature range within which life is possible.
Twenty-one genome sequences from Pseudomonas species and 19 genome sequences from diverse bacteria isolated from the rhizosphere and endosphere of Populus deltoides.

PubMed

Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn M; Johnson, Courtney M; Martin, Stanton L; Land, Miriam L; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A

2012-11-01

To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.
Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

PubMed

Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

2018-01-09

Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of LDH as a therapeutic drug target.
HIV-1 sequence variation between isolates from mother-infant transmission pairs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wike, C.M.; Daniels, M.R.; Furtado, M.

1991-12-31

To examine the sequence diversity of human immunodeficiency virus type 1 (HIV-1) between known transmission sets, sequences from the V3 and V4-V5 region of the env gene from 4 mother-infant pairs were analyzed. The mean interpatient sequence variation between isolates from linked mother-infant pairs was comparable to the sequence diversity found between isolates from other close contacts. The mean intrapatient variation was significantly less in the infants` isolates then the isolates from both their mothers and other characterized intrapatient sequence sets. In addition, a distinct and characteristic difference in the glycosylation pattern preceding the V3 loop was found between eachmore » linked transmission pair. These findings indicate that selection of specific genotypic variants, which may play a role in some direct transmission sets, and the duration of infection are important factors in the degree of diversity seen between the sequence sets.« less
A polyphasic taxonomic approach in isolated strains of Cyanobacteria from thermal springs of Greece.

PubMed

Bravakos, Panos; Kotoulas, Georgios; Skaraki, Katerina; Pantazidou, Adriani; Economou-Amilli, Athena

2016-05-01

Strains of Cyanobacteria isolated from mats of 9 thermal springs of Greece have been studied for their taxonomic evaluation. A polyphasic taxonomic approach was employed which included: morphological observations by light microscopy and scanning electron microscopy, maximum parsimony, maximum likelihood and Bayesian analysis of 16S rDNA sequences, secondary structural comparisons of 16S-23S rRNA Internal Transcribed Spacer sequences, and finally environmental data. The 17 cyanobacterial isolates formed a diverse group that contained filamentous, coccoid and heterocytous strains. These included representatives of the polyphyletic genera of Synechococcus and Phormidium, and the orders Oscillatoriales, Spirulinales, Chroococcales and Nostocales. After analysis, at least 6 new taxa at the genus level provide new evidence in the taxonomy of Cyanobacteria and highlight the abundant diversity of thermal spring environments with many potential endemic species or ecotypes. Copyright © 2016 Elsevier Inc. All rights reserved.
Development of microsatellite markers using next-generation sequencing for the columnar cactus Echinopsis chiloensis (Cactaceae).

PubMed

Ossa, Carmen G; Larridon, Isabel; Peralta, Gioconda; Asselman, Pieter; Pérez, Fernanda

2016-12-01

The aim of this study was to develop microsatellite markers as a tool to study population structure, genetic diversity and effective population size of Echinopsis chiloensis, an endemic cactus from arid and semiarid regions of Central Chile. We developed 12 polymorphic microsatellite markers for E. chiloensis using next-generation sequencing and tested them in 60 individuals from six sites, covering all the latitudinal range of this species. The number of alleles per locus ranged from 3 to 8, while the observed (Ho) and expected (He) heterozygosity ranged from 0.0 to 0.80 and from 0.10 to 0.76, respectively. We also detected significant differences between sites, with F ST values ranging from 0.05 to 0.29. Microsatellite markers will enable us to estimate genetic diversity and population structure of E. chiloensis in future ecological and phylogeographic studies.
In situ expression of eukaryotic ice-binding proteins in microbial communities of Arctic and Antarctic sea ice.

PubMed

Uhlig, Christiane; Kilpert, Fabian; Frickenhaus, Stephan; Kegel, Jessica U; Krell, Andreas; Mock, Thomas; Valentin, Klaus; Beszteri, Bánk

2015-11-01

Ice-binding proteins (IBPs) have been isolated from various sea-ice organisms. Their characterisation points to a crucial role in protecting the organisms in sub-zero environments. However, their in situ abundance and diversity in natural sea-ice microbial communities is largely unknown. In this study, we analysed the expression and phylogenetic diversity of eukaryotic IBP transcripts from microbial communities of Arctic and Antarctic sea ice. IBP transcripts were found in abundances similar to those of proteins involved in core cellular processes such as photosynthesis. Eighty-nine percent of the IBP transcripts grouped with known IBP sequences from diatoms, haptophytes and crustaceans, but the majority represented novel sequences not previously characterized in cultured organisms. The observed high eukaryotic IBP expression in natural eukaryotic sea ice communities underlines the essential role of IBPs for survival of many microorganisms in communities living under the extreme conditions of polar sea ice.
Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity.

PubMed

Rodriguez-R, Luis M; Gunturu, Santosh; Tiedje, James M; Cole, James R; Konstantinidis, Konstantinos T

2018-01-01

Estimations of microbial community diversity based on metagenomic data sets are affected, often to an unknown degree, by biases derived from insufficient coverage and reference database-dependent estimations of diversity. For instance, the completeness of reference databases cannot be generally estimated since it depends on the extant diversity sampled to date, which, with the exception of a few habitats such as the human gut, remains severely undersampled. Further, estimation of the degree of coverage of a microbial community by a metagenomic data set is prohibitively time-consuming for large data sets, and coverage values may not be directly comparable between data sets obtained with different sequencing technologies. Here, we extend Nonpareil, a database-independent tool for the estimation of coverage in metagenomic data sets, to a high-performance computing implementation that scales up to hundreds of cores and includes, in addition, a k -mer-based estimation as sensitive as the original alignment-based version but about three hundred times as fast. Further, we propose a metric of sequence diversity ( N d ) derived directly from Nonpareil curves that correlates well with alpha diversity assessed by traditional metrics. We use this metric in different experiments demonstrating the correlation with the Shannon index estimated on 16S rRNA gene profiles and show that N d additionally reveals seasonal patterns in marine samples that are not captured by the Shannon index and more precise rankings of the magnitude of diversity of microbial communities in different habitats. Therefore, the new version of Nonpareil, called Nonpareil 3, advances the toolbox for metagenomic analyses of microbiomes. IMPORTANCE Estimation of the coverage provided by a metagenomic data set, i.e., what fraction of the microbial community was sampled by DNA sequencing, represents an essential first step of every culture-independent genomic study that aims to robustly assess the sequence diversity present in a sample. However, estimation of coverage remains elusive because of several technical limitations associated with high computational requirements and limiting statistical approaches to quantify diversity. Here we described Nonpareil 3, a new bioinformatics algorithm that circumvents several of these limitations and thus can facilitate culture-independent studies in clinical or environmental settings, independent of the sequencing platform employed. In addition, we present a new metric of sequence diversity based on rarefied coverage and demonstrate its use in communities from diverse ecosystems.
Ubiquity and Diversity of Heterotrophic Bacterial nasA Genes in Diverse Marine Environments

PubMed Central

Jiang, Xuexia; Dang, Hongyue; Jiao, Nianzhi

2015-01-01

Nitrate uptake by heterotrophic bacteria plays an important role in marine N cycling. However, few studies have investigated the diversity of environmental nitrate assimilating bacteria (NAB). In this study, the diversity and biogeographical distribution of NAB in several global oceans and particularly in the western Pacific marginal seas were investigated using both cultivation and culture-independent molecular approaches. Phylogenetic analyses based on 16S rRNA and nasA (encoding the large subunit of the assimilatory nitrate reductase) gene sequences indicated that the cultivable NAB in South China Sea belonged to the α-Proteobacteria, γ-Proteobacteria and CFB (Cytophaga-Flavobacteria-Bacteroides) bacterial groups. In all the environmental samples of the present study, α-Proteobacteria, γ-Proteobacteria and Bacteroidetes were found to be the dominant nasA-harboring bacteria. Almost all of the α-Proteobacteria OTUs were classified into three Roseobacter-like groups (I to III). Clone library analysis revealed previously underestimated nasA diversity; e.g. the nasA gene sequences affiliated with β-Proteobacteria, ε-Proteobacteria and Lentisphaerae were observed in the field investigation for the first time, to the best of our knowledge. The geographical and vertical distributions of seawater nasA-harboring bacteria indicated that NAB were highly diverse and ubiquitously distributed in the studied marginal seas and world oceans. Niche adaptation and separation and/or limited dispersal might mediate the NAB composition and community structure in different water bodies. In the shallow-water Kueishantao hydrothermal vent environment, chemolithoautotrophic sulfur-oxidizing bacteria were the primary NAB, indicating a unique nitrate-assimilating community in this extreme environment. In the coastal water of the East China Sea, the relative abundance of Alteromonas and Roseobacter-like nasA gene sequences responded closely to algal blooms, indicating that NAB may be active participants contributing to the bloom dynamics. Our statistical results suggested that salinity, temperature and nitrate may be some of the key environmental factors controlling the composition and dynamics of the marine NAB communities. PMID:25647610
Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes

PubMed Central

Waman, Vaishali P.; Kolekar, Pandurang; Ramtirthkar, Mukund R.; Kale, Mohan M.

2016-01-01

Background Dengue is one of the most common arboviral diseases prevalent worldwide and is caused by Dengue viruses (genus Flavivirus, family Flaviviridae). There are four serotypes of Dengue Virus (DENV-1 to DENV-4), each of which is further subdivided into distinct genotypes. DENV-2 is frequently associated with severe dengue infections and epidemics. DENV-2 consists of six genotypes such as Asian/American, Asian I, Asian II, Cosmopolitan, American and sylvatic. Comparative genomic study was carried out to infer population structure of DENV-2 and to analyze the role of evolutionary and spatiotemporal factors in emergence of diversifying lineages. Methods Complete genome sequences of 990 strains of DENV-2 were analyzed using Bayesian-based population genetics and phylogenetic approaches to infer genetically distinct lineages. The role of spatiotemporal factors, genetic recombination and selection pressure in the evolution of DENV-2 is examined using the sequence-based bioinformatics approaches. Results DENV-2 genetic structure is complex and consists of fifteen subpopulations/lineages. The Asian/American genotype is observed to be diversified into seven lineages. The Asian I, Cosmopolitan and sylvatic genotypes were found to be subdivided into two lineages, each. The populations of American and Asian II genotypes were observed to be homogeneous. Significant evidence of episodic positive selection was observed in all the genes, except NS4A. Positive selection operational on a few codons in envelope gene confers antigenic and lineage diversity in the American strains of Asian/American genotype. Selection on codons of non-structural genes was observed to impact diversification of lineages in Asian I, cosmopolitan and sylvatic genotypes. Evidence of intra/inter-genotype recombination was obtained and the uncertainty in classification of recombinant strains was resolved using the population genetics approach. Discussion Complete genome-based analysis revealed that the worldwide population of DENV-2 strains is subdivided into fifteen lineages. The population structure of DENV-2 is spatiotemporal and is shaped by episodic positive selection and recombination. Intra-genotype diversity was observed in four genotypes (Asian/American, Asian I, cosmopolitan and sylvatic). Episodic positive selection on envelope and non-structural genes translates into antigenic diversity and appears to be responsible for emergence of strains/lineages in DENV-2 genotypes. Understanding of the genotype diversity and emerging lineages will be useful to design strategies for epidemiological surveillance and vaccine design. PMID:27635316
Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

PubMed Central

Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

2012-01-01

Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
Investigation of terpene diversification across multiple sequenced plant genomes

PubMed Central

Boutanaev, Alexander M.; Moses, Tessa; Zi, Jiachen; Nelson, David R.; Mugford, Sam T.; Peters, Reuben J.; Osbourn, Anne

2015-01-01

Plants produce an array of specialized metabolites, including chemicals that are important as medicines, flavors, fragrances, pigments and insecticides. The vast majority of this metabolic diversity is untapped. Here we take a systematic approach toward dissecting genetic components of plant specialized metabolism. Focusing on the terpenes, the largest class of plant natural products, we investigate the basis of terpene diversity through analysis of multiple sequenced plant genomes. The primary drivers of terpene diversification are terpenoid synthase (TS) “signature” enzymes (which generate scaffold diversity), and cytochromes P450 (CYPs), which modify and further diversify these scaffolds, so paving the way for further downstream modifications. Our systematic search of sequenced plant genomes for all TS and CYP genes reveals that distinct TS/CYP gene pairs are found together far more commonly than would be expected by chance, and that certain TS/CYP pairings predominate, providing signals for key events that are likely to have shaped terpene diversity. We recover TS/CYP gene pairs for previously characterized terpene metabolic gene clusters and demonstrate new functional pairing of TSs and CYPs within previously uncharacterized clusters. Unexpectedly, we find evidence for different mechanisms of pathway assembly in eudicots and monocots; in the former, microsyntenic blocks of TS/CYP gene pairs duplicate and provide templates for the evolution of new pathways, whereas in the latter, new pathways arise by mixing and matching of individual TS and CYP genes through dynamic genome rearrangements. This is, to our knowledge, the first documented observation of the unique pattern of TS and CYP assembly in eudicots and monocots. PMID:25502595
Genetic diversity of influenza A(H1N1)2009 virus circulating during the season 2010-2011 in Spain.

PubMed

Ledesma, Juan; Pozo, Francisco; Reina, Gabriel; Blasco, Miriam; Rodríguez, Guadalupe; Montes, Milagrosa; López-Miragaya, Isabel; Salvador, Carmen; Reina, Jordi; Ortíz de Lejarazu, Raúl; Egido, Pilar; López Barba, José; Delgado, Concepción; Cuevas, María Teresa; Casas, Inmaculada

2012-01-01

Genetic diversity of influenza A(H1N1)2009 viruses has been reported since the pandemic virus emerged in April 2009. Different genetic clades have been identified and defined based on amino acid substitutions found in the haemagglutinin (HA) protein sequences. In Spain, circulating influenza viruses are monitored each season by the regional laboratories enrolled in the Spanish Influenza Surveillance System (SISS). The analysis of the HA gene sequence helps to detect the genetic diversity and viral evolution. To perform an analysis of the genetic diversity of influenza A(H1N1)2009 viruses circulating in Spain during the season 2010-2011 based on analysis of the HA sequence gene. Phylogenetic analysis based on the HA1 subunit of the haemagglutinin gene was carried out on 220 influenza A(H1N1)2009 viruses circulating during the season 2010-2011. Six different genetic groups were identified among circulating A(H1N1)2009 viruses, five of them were previously reported during season 2010-2011. A new group, characterized by E172K and K308E changes and a proline at position 83, was observed in 12.27% of the Spanish viruses. Co-circulation of six different genetic groups of influenza A(H1N1)2009 viruses was identified in Spain during the season 2010-2011. Nevertheless, at this stage, none of the groups identified to date have resulted in significant antigenic changes according to data collected by World Health Organization Collaborating Centres for influenza surveillance. Copyright © 2011 Elsevier B.V. All rights reserved.
Comparative sequence analysis of domain I of Plasmodium falciparum apical membrane antigen 1 from Saudi Arabia and worldwide isolates.

PubMed

Al-Qahtani, Ahmed A; Abdel-Muhsin, Abdel-Muhsin A; Dajem, Saad M Bin; AlSheikh, Adel Ali H; Bohol, Marie Fe F; Al-Ahdal, Mohammed N; Putaporntip, Chaturong; Jongwutiwes, Somchai

2016-04-01

The apical membrane antigen 1 of Plasmodium falciparum (PfAMA1) plays a crucial role in erythrocyte invasion and is a target of protective antibodies. Although domain I of PfAMA1 has been considered a promising vaccine component, extensive sequence diversity in this domain could compromise an effective vaccine design. To explore the extent of sequence diversity in domain I of PfAMA1, P. falciparum-infected blood samples from Saudi Arabia collected between 2007 and 2009 were analyzed and compared with those from worldwide parasite populations. Forty-six haplotypes and a novel codon change (M190V) were found among Saudi Arabian isolates. The haplotype diversity (0.948±0.004) and nucleotide diversity (0.0191±0.0008) were comparable to those from African hyperendemic countries. Positive selection in domain I of PfAMA1 among Saudi Arabian parasite population was observed because nonsynonymous nucleotide substitutions per nonsynonymous site (dN) significantly exceeded synonymous nucleotide substitutions per synonymous site (dS) and Tajima's D and its related statistics significantly deviated from neutrality in the positive direction. Despite a relatively low prevalence of malaria in Saudi Arabia, a minimum of 17 recombination events occurred in domain I. Genetic differentiation was significant between P. falciparum in Saudi Arabia and parasites from other geographic origins. Several shared or closely related haplotypes were found among parasites from different geographic areas, suggesting that vaccine derived from multiple shared epitopes could be effective across endemic countries. Copyright © 2016 Elsevier B.V. All rights reserved.
Genetic Diversity of Bacterial Communities and Gene Transfer Agents in Northern South China Sea

PubMed Central

Sun, Fu-Lin; Wang, You-Shao; Wu, Mei-Lin; Jiang, Zhao-Yu; Sun, Cui-Ci; Cheng, Hao

2014-01-01

Pyrosequencing of the 16S ribosomal RNA gene (rDNA) amplicons was performed to investigate the unique distribution of bacterial communities in northern South China Sea (nSCS) and evaluate community structure and spatial differences of bacterial diversity. Cyanobacteria, Proteobacteria, Actinobacteria, and Bacteroidetes constitute the majority of bacteria. The taxonomic description of bacterial communities revealed that more Chroococcales, SAR11 clade, Acidimicrobiales, Rhodobacterales, and Flavobacteriales are present in the nSCS waters than other bacterial groups. Rhodobacterales were less abundant in tropical water (nSCS) than in temperate and cold waters. Furthermore, the diversity of Rhodobacterales based on the gene transfer agent (GTA) major capsid gene (g5) was investigated. Four g5 gene clone libraries were constructed from samples representing different regions and yielded diverse sequences. Fourteen g5 clusters could be identified among 197 nSCS clones. These clusters were also related to known g5 sequences derived from genome-sequenced Rhodobacterales. The composition of g5 sequences in surface water varied with the g5 sequences in the sampling sites; this result indicated that the Rhodobacterales population could be highly diverse in nSCS. Phylogenetic tree analysis result indicated distinguishable diversity patterns among tropical (nSCS), temperate, and cold waters, thereby supporting the niche adaptation of specific Rhodobacterales members in unique environments. PMID:25364820
Phylogenetic diversity and biogeography of the Mamiellophyceae lineage of eukaryotic phytoplankton across the oceans.

PubMed

Monier, Adam; Worden, Alexandra Z; Richards, Thomas A

2016-08-01

High-throughput diversity amplicon sequencing of marine microbial samples has revealed that members of the Mamiellophyceae lineage are successful phytoplankton in many oceanic habitats. Indeed, these eukaryotic green algae can dominate the picoplanktonic biomass, however, given the broad expanses of the oceans, their geographical distributions and the phylogenetic diversity of some groups remain poorly characterized. As these algae play a foundational role in marine food webs, it is crucial to assess their global distribution in order to better predict potential changes in abundance and community structure. To this end, we analyzed the V9-18S small subunit rDNA sequences deposited from the Tara Oceans expedition to evaluate the diversity and biogeography of these phytoplankton. Our results show that the phylogenetic composition of Mamiellophyceae communities is in part determined by geographical provenance, and do not appear to be influenced - in the samples recovered - by water depth, at least at the resolution possible with the V9-18S. Phylogenetic classification of Mamiellophyceae sequences revealed that the Dolichomastigales order encompasses more sequence diversity than other orders in this lineage. These results indicate that a large fraction of the Mamiellophyceae diversity has been hitherto overlooked, likely because of a combination of size fraction, sequencing and geographical limitations. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Proteopedia: 3D Visualization and Annotation of Transcription Factor-DNA Readout Modes

ERIC Educational Resources Information Center

Dantas Machado, Ana Carolina; Saleebyan, Skyler B.; Holmes, Bailey T.; Karelina, Maria; Tam, Julia; Kim, Sharon Y.; Kim, Keziah H.; Dror, Iris; Hodis, Eran; Martz, Eric; Compeau, Patricia A.; Rohs, Remo

2012-01-01

3D visualization assists in identifying diverse mechanisms of protein-DNA recognition that can be observed for transcription factors and other DNA binding proteins. We used Proteopedia to illustrate transcription factor-DNA readout modes with a focus on DNA shape, which can be a function of either nucleotide sequence (Hox proteins) or base pairing…
From algae to angiosperms–inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes

PubMed Central

2014-01-01

Background Next-generation sequencing has provided a wealth of plastid genome sequence data from an increasingly diverse set of green plants (Viridiplantae). Although these data have helped resolve the phylogeny of numerous clades (e.g., green algae, angiosperms, and gymnosperms), their utility for inferring relationships across all green plants is uncertain. Viridiplantae originated 700-1500 million years ago and may comprise as many as 500,000 species. This clade represents a major source of photosynthetic carbon and contains an immense diversity of life forms, including some of the smallest and largest eukaryotes. Here we explore the limits and challenges of inferring a comprehensive green plant phylogeny from available complete or nearly complete plastid genome sequence data. Results We assembled protein-coding sequence data for 78 genes from 360 diverse green plant taxa with complete or nearly complete plastid genome sequences available from GenBank. Phylogenetic analyses of the plastid data recovered well-supported backbone relationships and strong support for relationships that were not observed in previous analyses of major subclades within Viridiplantae. However, there also is evidence of systematic error in some analyses. In several instances we obtained strongly supported but conflicting topologies from analyses of nucleotides versus amino acid characters, and the considerable variation in GC content among lineages and within single genomes affected the phylogenetic placement of several taxa. Conclusions Analyses of the plastid sequence data recovered a strongly supported framework of relationships for green plants. This framework includes: i) the placement of Zygnematophyceace as sister to land plants (Embryophyta), ii) a clade of extant gymnosperms (Acrogymnospermae) with cycads + Ginkgo sister to remaining extant gymnosperms and with gnetophytes (Gnetophyta) sister to non-Pinaceae conifers (Gnecup trees), and iii) within the monilophyte clade (Monilophyta), Equisetales + Psilotales are sister to Marattiales + leptosporangiate ferns. Our analyses also highlight the challenges of using plastid genome sequences in deep-level phylogenomic analyses, and we provide suggestions for future analyses that will likely incorporate plastid genome sequence data for thousands of species. We particularly emphasize the importance of exploring the effects of different partitioning and character coding strategies. PMID:24533922
Comparative analysis of the feline immunoglobulin repertoire.

PubMed

Steiniger, Sebastian C J; Glanville, Jacob; Harris, Douglas W; Wilson, Thomas L; Ippolito, Gregory C; Dunham, Steven A

2017-03-01

Next-Generation Sequencing combined with bioinformatics is a powerful tool for analyzing the large number of DNA sequences present in the expressed antibody repertoire and these data sets can be used to advance a number of research areas including antibody discovery and engineering. The accurate measurement of the immune repertoire sequence composition, diversity and abundance is important for understanding the repertoire response in infections, vaccinations and cancer immunology and could also be useful for elucidating novel molecular targets. In this study 4 individual domestic cats (Felis catus) were subjected to antibody repertoire sequencing with total number of sequences generated 1079863 for VH for IgG, 1050824 VH for IgM, 569518 for VK and 450195 for VL. Our analysis suggests that a similar VDJ expression patterns exists across all cats. Similar to the canine repertoire, the feline repertoire is dominated by a single subgroup, namely VH3. The antibody paratope of felines showed similar amino acid variation when compared to human, mouse and canine counterparts. All animals show a similarly skewed VH CDR-H3 profile and, when compared to canine, human and mouse, distinct differences are observed. Our study represents the first attempt to characterize sequence diversity in the expressed feline antibody repertoire and this demonstrates the utility of using NGS to elucidate entire antibody repertoires from individual animals. These data provide significant insight into understanding the feline immune system function. Copyright © 2017 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

PubMed

Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay

2016-01-01

There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both with-in and across the countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggest the dispersion of B.pseudomallei across the south western coastal part of India with limited geographical clustering. However, majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.

New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes

PubMed Central

Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok

2018-01-01

Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas. PMID:29872447
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes.

PubMed

Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok

2018-01-01

Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas.
Diversity of Tn1546 in vanA-positive Enterococcus faecium clinical isolates with VanA, VanB, and VanD phenotypes and susceptibility to vancomycin.

PubMed

Cha, J O; Yoo, J I; Kim, H K; Kim, H S; Yoo, J S; Lee, Y S; Jung, Y H

2013-10-01

To investigate diversity in the vanA cluster in Enterococcus faecium isolates from nontertiary hospitals. We identified 43 vanA-positive Ent. faecium isolates, including two vancomycin-susceptible isolates, from hospitals between 2003 and 2006. Of these isolates, >85% were resistant to ampicillin, erythromycin and ciprofloxacin. The vanA cluster was classified into six types using overlapping PCR, but the prototype transposon Tn1546 was not found. Most vanA-positive vancomycin-resistant Enterococcus (VRE) carried IS1216V and belonged to Type III (58·1%) or Type II (20·9%). vanY, vanZ and IS1216V were observed in the left and right ends of Type III with long-range PCR. IS1216V was also observed within vanS and vanX in the two vancomycin-susceptible isolates and in two vancomycin-resistant isolates. No VRE isolates with VanB and VanD phenotypes contained point mutations in vanS, unlike in previous reports. Sequence types (STs) of all isolates belonged to clonal complex 17, and ST78 was predominant. Insertion sequences, especially IS1216V, cause structural variation in the vanA cluster. We report the first observation of vanY and vanZ at the left end of Tn1546 in clinical isolates. This is the first report of the frequency of vancomycin resistance and diversity of Tn1546 in vanA-positive Ent. faecium isolates from nontertiary hospitals. © 2013 The Society for Applied Microbiology.
The evolution and population structure of Lactobacillus fermentum from different naturally fermented products as determined by multilocus sequence typing (MLST).

PubMed

Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong

2015-05-20

Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/ autonomous regions in China identified 57 sequence types (ST), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I A (S)) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.
Comparing Sanger sequencing and high-throughput metabarcoding for inferring photobiont diversity in lichens.

PubMed

Paul, Fiona; Otte, Jürgen; Schmitt, Imke; Dal Grande, Francesco

2018-06-05

The implementation of HTS (high-throughput sequencing) approaches is rapidly changing our understanding of the lichen symbiosis, by uncovering high bacterial and fungal diversity, which is often host-specific. Recently, HTS methods revealed the presence of multiple photobionts inside a single thallus in several lichen species. This differs from Sanger technology, which typically yields a single, unambiguous algal sequence per individual. Here we compared HTS and Sanger methods for estimating the diversity of green algal symbionts within lichen thalli using 240 lichen individuals belonging to two species of lichen-forming fungi. According to HTS data, Sanger technology consistently yielded the most abundant photobiont sequence in the sample. However, if the second most abundant photobiont exceeded 30% of the total HTS reads in a sample, Sanger sequencing generally failed. Our results suggest that most lichen individuals in the two analyzed species, Lasallia hispanica and L. pustulata, indeed contain a single, predominant green algal photobiont. We conclude that Sanger sequencing is a valid approach to detect the dominant photobionts in lichen individuals and populations. We discuss which research areas in lichen ecology and evolution will continue to benefit from Sanger sequencing, and which areas will profit from HTS approaches to assessing symbiont diversity.
Genetic diversity and temporal variation of the marine Synechococcus community in the subtropical coastal waters of Hong Kong.

PubMed

Jing, Hongmei; Zhang, Rui; Pointing, Stephen B; Liu, Hongbin; Qian, Peiyuan

2009-03-01

The phylogenetic diversity of the marine Synechococcus community in the subtropical coastal waters of Hong Kong, China, was examined through intergenic transcribed spacer clone libraries. All the sequences obtained fell within both marine cluster A (MC-A) and B (MC-B), with MC-A phylotypes dominating throughout the year. Distinct phylogenetic lineages specific to Hong Kong waters were detected from both MC-A and MC-B. The highest Synechococcus community diversity occurred in December, but the highest Synechococcus abundance occurred in August. On the other hand, both the abundance and diversity of Synechococcus showed a minimum in February. The remarkable seasonal variations of Synechococcus diversity observed were likely the result of the changes of hydrographic condition modulated by monsoons. Principal component analysis revealed that the in situ abiotic water characteristics, especially salinity and water turbidity, explained much of the variability of the marine Synechococcus population diversity in Hong Kong coastal waters. In addition, the temporal changes of Synechococcus abundance were largely driven by water temperature.
Genetic diversity in the candidate trees of Madhuca indica J. F. Gmel. (Mahua) revealed by inter-simple sequence repeats (ISSRs).

PubMed

Nimbalkar, S D; Jade, S S; Kauthale, V K; Agale, S; Bahulikar, R A

2018-03-01

Madhuca indica provides livelihood to several tribal people in India, where the flowers are used for extraction of sweet juices having multiple applications. Certain trees have more value as judged by the tribal people mainly based on yield and quality performance of the trees, and these trees were selected for the genetic diversity analyses. Genetic diversity of 48 candidate Mahua trees from Etapalli, Dadagaon, and Jawhar, Maharashtra, India, was assessed using ISSR markers. Fourteen ISSR primers revealed a total of 132 polymorphic bands giving overall 92% polymorphism. Genetic diversity, in terms of expected number of alleles (Ne), the observed number of alleles (Na), Nei's genetic diversity (H), and Shannon's information index ( I ) was 1.921, 1.333, 0.211, and 0.337, respectively, and suggested lower genetic diversity. Region wise analysis revealed higher genetic diversity for site Etapalli ( H = 0.206) and lowest at Dhadgaon ( H = 0.140). Etapalli area possesses higher forest cover than Dhadgaon and Jawhar. Additionally, in Dhadgaon and Jawhar M. indica trees are restricted to field bunds; both reasons might contribute to lower genetic diversity in these regions. The dendrogram and the principal coordinate analyses showed no region-specific clustering. The clustering patterns were supported by AMOVA where higher genetic variance was observed within trees and lower variance among regions. Long-distance dispersal and/or higher human interference might be responsible for low diversity and higher genetic variance within the candidate trees.
Longitudinal and Cross-Sectional Genetic Diversity in the Korean Peninsula Based on the P vivax Merozoite Surface Protein Gene.

PubMed

Kim, Jung-Yeon; Suh, Eun-Jung; Yu, Hyo-Soon; Jung, Hyun-Sik; Park, In-Ho; Choi, Yien-Kyeoug; Choi, Kyoung-Mi; Cho, Shin-Hyeong; Lee, Won-Ja

2011-12-01

Vivax malaria has reemerged and become endemic in Korea. Our study aimed to analyze by both longitudinal and cross-sectional genetic diversity of this malaria based on the P vivax Merozoite Surface Protein (PvMSP) gene parasites recently found in the Korean peninsula. PvMSP-1 gene sequence analysis from P vivax isolates (n = 835) during the 1996-2010 period were longitudinally analyzed and the isolates from the Korean peninsula through South Korea, the demilitarized zone and North Korea collected in 2008-2010 were enrolled in an overall analysis of MSP-1 gene diversity. New recombinant subtypes and severe multiple-cloneinfection rates were observed in recent vivax parasites. Regional variation was also observed in the study sites. This study revealed the great complexity of genetic variation and rapid dissemination of genes in P vivax. It also showed interesting patterns of diversity depending, on the region in the Korean Peninsula. Understanding the parasiteninsula. Under genetic variation may help to analyze trends and assess the extent of endemic malaria in Korea.
D-loop haplotype diversity in Brazilian horse breeds

PubMed Central

Ianella, Patrícia; Albuquerque, Maria do Socorro Maués; Paiva, Samuel Rezende; do Egito, Andréa Alves; Almeida, Leonardo Daniel; Sereno, Fabiana T. P. S.; Carvalho, Luiz Felipe Ramos; Mariante, Arthur da Silva; McManus, Concepta Margaret

2017-01-01

Abstract The first horses were brought to Brazil by the colonizers after 1534. Over the centuries, these animals evolved and adapted to local environmental conditions usually unsuitable for exotic breeds, thereby originating locally adapted Brazilian breeds. The present work represents the first description of maternal genetic diversity in these horse breeds based on D-loop sequences. A D-Loop HSV-I fragment of 252 bp, from 141 horses belonging to ten Brazilian breeds / genetic groups (locally adapted and specialized breeds) were analysed. Thirty-five different haplotypes belonging to 18 haplogroups were identified with 33 polymorphic sites. Haplotype diversity (varying from 0.20 to 0.96) and nucleotide diversity (varying from 0.0039 to 0.0239) was lower for locally adapted than for specialized breeds, with the same pattern observed for FST values. Haplogroups identified in Brazilian breeds are in agreement with previous findings in South American samples. The low variability observed mainly in locally adapted breeds, indicates that, to ensure conservation of these breeds, careful reproductive management is needed. Additional genetic characterization studies are required to support accurate decision-making. PMID:28863209
Geographic Structuring of the Plasmodium falciparum Sarco(endo)plasmic Reticulum Ca2+ ATPase (PfSERCA) Gene Diversity

PubMed Central

Pinto, João; Gribaldo, Simonetta; Legrand, Eric; Niang, Makhtar; Kim, Nimol; Pharath, Lim; Volnay, Béatrice; Ekala, Marie Therese; Bouchier, Christiane; Fandeur, Thierry; Berzosa, Pedro; Benito, Agustin; Ferreira, Isabel Dinis; Ferreira, Cynthia; Vieira, Pedro Paulo; Alecrim, Maria das Graças; Mercereau-Puijalon, Odile; Cravo, Pedro

2010-01-01

Artemisinin, a thapsigargin-like sesquiterpene has been shown to inhibit the Plasmodium falciparum sarco/endoplasmic reticulum calcium-ATPase PfSERCA. To collect baseline pfserca sequence information before field deployment of Artemisinin-based Combination therapies that may select mutant parasites, we conducted a sequence analysis of 100 isolates from multiple sites in Africa, Asia and South America. Coding sequence diversity was large, with 29 mutated codons, including 32 SNPs (average of one SNP/115 bp), of which 19 were novel mutations. Most SNP detected in this study were clustered within a region in the cytosolic head of the protein. The PfSERCA functional domains were very well conserved, with non synonymous mutations located outside the functional domains, except for the S769N mutation associated in French Guiana with elevated IC50 for artemether. The S769N mutation is located close to the hinge of the headpiece, which in other species modulates calcium affinity and in consequence efficacy of inhibitors, possibly linking calcium homeostasis to drug resistance. Genetic diversity was highest in Senegal, Brazil and French Guiana, and few mutations were identified in Asia. Population genetic analysis was conducted for a partial fragment of the gene encompassing nucleotide coordinates 87-2862 (unambiguous sequence available for 96 isolates). This supported a geographic clustering, with a separation between Old and New World samples and one dominant ancestral haplotype. Genetic drift alone cannot explain the observed polymorphism, suggesting that other evolutionary mechanisms are operating. One possible contributor could be the frequency of haemoglobinopathies that are associated with calcium dysregulation in the erythrocyte. PMID:20195531
Genotyping-by-Sequencing Analysis for Determining Population Structure of Finger Millet Germplasm of Diverse Origins.

PubMed

Kumar, Anil; Sharma, Divya; Tiwari, Apoorv; Jaiswal, J P; Singh, N K; Sood, Salej

2016-07-01

Finger millet [ (L.) Gaertn.] is grown mainly by subsistence farmers in arid and semiarid regions of the world. To broaden its genetic base and to boost its production, it is of paramount importance to characterize and genotype the diverse gene pool of this important food and nutritional security crop. However, as a result of nonavailability of the genome sequence of finger millet, the progress could not be made in realizing the molecular basis of unique qualities of the crop. In the present investigation, attempts have been made to characterize the genetically diverse collection of 113 finger millet accessions through whole-genome genotyping-by-sequencing (GBS), which resulted in a genome-wide set of 23,000 single-nucleotide polymorphisms (SNPs) segregating across the entire collection and several thousand SNPs segregating within every accession. A model-based population structure analysis reveals the presence of three subpopulations among the finger millet accessions, which are in parallel with the results of phylogenetic analysis. The observed population structure is consistent with the hypothesis that finger millet was domesticated first in Africa, and from there it was introduced to India some 3000 yr ago. A total of 1128 gene ontology (GO) terms were assigned to SNP-carrying genes for three main categories: biological process, cellular component, and molecular function. Facilitated access to high-throughput genotyping and sequencing technologies are likely to improve the breeding process in developing countries, and as such, this data will be very useful to breeders who are working for the genetic improvement of finger millet. Copyright © 2016 Crop Science Society of America.
The Incidence and Genetic Diversity of Apple Mosaic Virus (ApMV) and Prune Dwarf Virus (PDV) in Prunus Species in Australia

PubMed Central

Constable, Fiona E.; Nancarrow, Narelle; Rodoni, Brendan

2018-01-01

Apple mosaic virus (ApMV) and prune dwarf virus (PDV) are amongst the most common viruses infecting Prunus species worldwide but their incidence and genetic diversity in Australia is not known. In a survey of 127 Prunus tree samples collected from five states in Australia, ApMV and PDV occurred in 4 (3%) and 13 (10%) of the trees respectively. High-throughput sequencing (HTS) of amplicons from partial conserved regions of RNA1, RNA2, and RNA3, encoding the methyltransferase (MT), RNA-dependent RNA polymerase (RdRp), and the coat protein (CP) genes respectively, of ApMV and PDV was used to determine the genetic diversity of the Australian isolates of each virus. Phylogenetic comparison of Australian ApMV and PDV amplicon HTS variants and full length genomes of both viruses with isolates occurring in other countries identified genetic strains of each virus occurring in Australia. A single Australian Prunus infecting ApMV genetic strain was identified as all ApMV isolates sequence variants formed a single phylogenetic group in each of RNA1, RNA2, and RNA3. Two Australian PDV genetic strains were identified based on the combination of observed phylogenetic groups in each of RNA1, RNA2, and RNA3 and one Prunus tree had both strains. The accuracy of amplicon sequence variants phylogenetic analysis based on segments of each virus RNA were confirmed by phylogenetic analysis of full length genome sequences of Australian ApMV and PDV isolates and all published ApMV and PDV genomes from other countries. PMID:29562672
Cultivable Anaerobic Microbiota of Severe Early Childhood Caries▿¶

PubMed Central

Tanner, A. C. R.; Mathney, J. M. J.; Kent, R. L.; Chalmers, N. I.; Hughes, C. V.; Loo, C. Y.; Pradhan, N.; Kanasi, E.; Hwang, J.; Dahlan, M. A.; Papadopolou, E.; Dewhirst, F. E.

2011-01-01

Severe early childhood caries (ECC), while strongly associated with Streptococcus mutans using selective detection (culture, PCR), has also been associated with a widely diverse microbiota using molecular cloning approaches. The aim of this study was to evaluate the microbiota of severe ECC using anaerobic culture. The microbial composition of dental plaque from 42 severe ECC children was compared with that of 40 caries-free children. Bacterial samples were cultured anaerobically on blood and acid (pH 5) agars. Isolates were purified, and partial sequences for the 16S rRNA gene were obtained from 5,608 isolates. Sequence-based analysis of the 16S rRNA isolate libraries from blood and acid agars of severe ECC and caries-free children had >90% population coverage, with greater diversity occurring in the blood isolate library. Isolate sequences were compared with taxon sequences in the Human Oral Microbiome Database (HOMD), and 198 HOMD taxa were identified, including 45 previously uncultivated taxa, 29 extended HOMD taxa, and 45 potential novel groups. The major species associated with severe ECC included Streptococcus mutans, Scardovia wiggsiae, Veillonella parvula, Streptococcus cristatus, and Actinomyces gerensceriae. S. wiggsiae was significantly associated with severe ECC children in the presence and absence of S. mutans detection. We conclude that anaerobic culture detected as wide a diversity of species in ECC as that observed using cloning approaches. Culture coupled with 16S rRNA identification identified over 74 isolates for human oral taxa without previously cultivated representatives. The major caries-associated species were S. mutans and S. wiggsiae, the latter of which is a candidate as a newly recognized caries pathogen. PMID:21289150
Genetic diversity and population structure of Lactobacillus delbrueckii subspecies bulgaricus isolated from naturally fermented dairy foods.

PubMed

Song, Yuqin; Sun, Zhihong; Guo, Chenyi; Wu, Yarong; Liu, Wenjun; Yu, Jie; Menghe, Bilige; Yang, Ruifu; Zhang, Heping

2016-03-04

Lactobacillus delbrueckii subsp. bulgaricus is one of the most widely used starter culture strains in industrial fermented dairy manufacture. It is also common in naturally fermented dairy foods made using traditional methods. The subsp. bulgaricus strains found in naturally fermented foods may be useful for improving current industrial starter cultures; however, little is known regarding its genetic diversity and population structure. Here, a collection of 298 L. delbrueckii strains from naturally fermented products in Mongolia, Russia, and West China was analyzed by multi-locus sequence typing based on eight conserved genes. The 251 confirmed subsp. bulgaricus strains produced 106 unique sequence types, the majority of which were assigned to five clonal complexes (CCs). The geographical distribution of CCs was uneven, with CC1 dominated by Mongolian and Russian isolates, and CC2-CC5 isolates exclusively from Xinjiang, China. Population structure analysis suggested six lineages, L1-L6, with various homologous recombination rates. Although L2-L5 were mainly restricted within specific regions, strains belonging to L1 and L6 were observed in diverse regions, suggesting historical transmission events. These results greatly enhance our knowledge of the population diversity of subsp. bulgaricus strains, and suggest that strains from CC1 and L4 may be useful as starter strains in industrial fermentation.
Fine-scale analysis of 16S rRNA sequences reveals a high level of taxonomic diversity among vaginal Atopobium spp.

PubMed Central

Mendes-Soares, Helena; Krishnan, Vandhana; Settles, Matthew L.; Ravel, Jacques; Brown, Celeste J.; Forney, Larry J.

2015-01-01

Although vaginal microbial communities of some healthy women have high proportions of Atopobium vaginae, the genus Atopobium is more commonly associated with bacterial vaginosis, a syndrome associated with an increased risk of adverse pregnancy outcomes and the transmission of sexually transmitted diseases. Genetic differences within Atopobium species may explain why single species can be associated with both health and disease. We used 16S rRNA gene sequences from previously published studies to explore the taxonomic diversity of the genus Atopobium in vaginal microbial communities of healthy women. Although A. vaginae was the species most commonly found, we also observed three other Atopobium species in the vaginal microbiota, one of which, A. parvulum, was not previously known to reside in the human vagina. Furthermore, we found several potential novel species of the genus Atopobium and multiple phylogenetic clades of A. vaginae. The diversity of Atopobium found in our study, which focused only on samples from healthy women, is greater than previously recognized, suggesting that analysis of samples from women with BV would yield even more diversity. Classification of microbes only to the genus level may thus obfuscate differences that might be important to better understand health or disease. PMID:25778779
Genetic variation of Sargassum horneri populations detected by inter-simple sequence repeats.

PubMed

Ren, J R; Yang, R; He, Y Y; Sun, Q H

2015-01-30

The seaweed Sargassum horneri is an important brown alga in the marine environment, and it is an important raw material in the alginate industry. Unfortunately, the fixed resource that was originally reported is now reduced or disappeared, and increased floating populations have been reported in recent years. We sampled a floating population and 4 fixed cultivated populations of S. horneri along the coast of Zhejiang, China. Inter-simple sequence repeat (ISSR) markers were applied in this research to analyze the genetic variation between floating populations and fixed cultivated populations of S. horneri. In total, 220 loci were amplified with 23 ISSR primers. The percentage of polymorphic loci within each population ranged from 53.64 to 95.45%. The highest diversity was observed in population 3, which was the local species that was suspension cultured in the lab and then fixed cultivated in the Nanji Islands before sampling. The lowest diversity was obtained in the floating population 4. The genetic distances among the 5 S. horneri populations ranged from 0.0819 to 0.2889, and the distance tendency confirmed the genetic diversity. The results suggest that the floating population had the lowest genetic diversity and could not be joined into the cluster branch of the fixed cultivated populations.
Mitochondrial DNA Markers Reveal High Genetic Diversity but Low Genetic Differentiation in the Black Fly Simulium tani Takaoka & Davies along an Elevational Gradient in Malaysia

PubMed Central

Low, Van Lun; Adler, Peter H.; Takaoka, Hiroyuki; Ya’cob, Zubaidah; Lim, Phaik Eem; Tan, Tiong Kai; Lim, Yvonne A. L.; Chen, Chee Dhang; Norma-Rashid, Yusoff; Sofian-Azirun, Mohd

2014-01-01

The population genetic structure of Simulium tani was inferred from mitochondria-encoded sequences of cytochrome c oxidase subunits I (COI) and II (COII) along an elevational gradient in Cameron Highlands, Malaysia. A statistical parsimony network of 71 individuals revealed 71 haplotypes in the COI gene and 43 haplotypes in the COII gene; the concatenated sequences of the COI and COII genes revealed 71 haplotypes. High levels of genetic diversity but low levels of genetic differentiation were observed among populations of S. tani at five elevations. The degree of genetic diversity, however, was not in accordance with an altitudinal gradient, and a Mantel test indicated that elevation did not have a limiting effect on gene flow. No ancestral haplotype of S. tani was found among the populations. Pupae with unique structural characters at the highest elevation showed a tendency to form their own haplotype cluster, as revealed by the COII gene. Tajima’s D, Fu’s Fs, and mismatch distribution tests revealed population expansion of S. tani in Cameron Highlands. A strong correlation was found between nucleotide diversity and the levels of dissolved oxygen in the streams where S. tani was collected. PMID:24941043
Microbial eukaryotic distributions and diversity patterns in a deep-sea methane seep ecosystem.

PubMed

Pasulka, Alexis L; Levin, Lisa A; Steele, Josh A; Case, David H; Landry, Michael R; Orphan, Victoria J

2016-09-01

Although chemosynthetic ecosystems are known to support diverse assemblages of microorganisms, the ecological and environmental factors that structure microbial eukaryotes (heterotrophic protists and fungi) are poorly characterized. In this study, we examined the geographic, geochemical and ecological factors that influence microbial eukaryotic composition and distribution patterns within Hydrate Ridge, a methane seep ecosystem off the coast of Oregon using a combination of high-throughput 18S rRNA tag sequencing, terminal restriction fragment length polymorphism fingerprinting, and cloning and sequencing of full-length 18S rRNA genes. Microbial eukaryotic composition and diversity varied as a function of substrate (carbonate versus sediment), activity (low activity versus active seep sites), sulfide concentration, and region (North versus South Hydrate Ridge). Sulfide concentration was correlated with changes in microbial eukaryotic composition and richness. This work also revealed the influence of oxygen content in the overlying water column and water depth on microbial eukaryotic composition and diversity, and identified distinct patterns from those previously observed for bacteria, archaea and macrofauna in methane seep ecosystems. Characterizing the structure of microbial eukaryotic communities in response to environmental variability is a key step towards understanding if and how microbial eukaryotes influence seep ecosystem structure and function. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
[Observation of genetic diversity in dental plaque of elder people with root caries].

PubMed

Ma, Shan-fen; Liang, Jing-ping; Jiang, Yun-tao; Zhu, Cai-lian

2011-08-01

Bacterial community in dental plaque of elder people was analyzed to learn about the microhabitat composition and diversity. Dental plaque samples were collected from 25 elders. PCR-based denaturing gradient gel electrophoresis (PCR-DGGE) was used to evaluate the microbial diversity by displaying PCR-generated 16SrDNA fragments that migrate at different distances, reflecting the different sequence of fragment. SPSS12.0 software was used to analyze the variance of genotypes between different groups of bacteria. Genotypes of bacteria in dental plaques in the root caries group was significantly more than the other two groups. Crown caries group and caries-free group had no significant difference. The genetic diversity of the dental plaque microflora in the root caries group is significantly higher than coronal caries group and caries-free group.
Diversity of mitochondrial DNA lineages in South Siberia.

PubMed

Derenko, M V; Grzybowski, T; Malyarchuk, B A; Dambueva, I K; Denisova, G A; Czarny, J; Dorzhu, C M; Kakpakov, V T; Miścicka-Sliwka, D; Woźniak, M; Zakharov, I A

2003-09-01

To investigate the origin and evolution of aboriginal populations of South Siberia, a comprehensive mitochondrial DNA (mtDNA) analysis (HVR1 sequencing combined with RFLP typing) of 480 individuals, representing seven Altaic-speaking populations (Altaians, Khakassians, Buryats, Sojots, Tuvinians, Todjins and Tofalars), was performed. Additionally, HVR2 sequence information was obtained for 110 Altaians, providing, in particular, some novel details of the East Asian mtDNA phylogeny. The total sample revealed 81% East Asian (M*, M7, M8, M9, M10, C, D, G, Z, A, B, F, N9a, Y) and 17% West Eurasian (H, U, J, T, I, N1a, X) matrilineal genetic contribution, but with regional differences within South Siberia. The highest influx of West Eurasian mtDNAs was observed in populations from the East Sayan and Altai regions (from 12.5% to 34.5%), whereas in populations from the Baikal region this contribution was markedly lower (less than 10%). The considerable substructure within South Siberian haplogroups B, F, and G, together with the high degree of haplogroup C and D diversity revealed there, allows us to conclude that South Siberians carry the genetic imprint of early-colonization phase of Eurasia. Statistical analyses revealed that South Siberian populations contain high levels of mtDNA diversity and high heterogeneity of mtDNA sequences among populations (Fst = 5.05%) that might be due to geography but not due to language and anthropological features.

A comparative study of AMF diversity in annual and perennial plant species from semiarid gypsum soils.

NASA Astrophysics Data System (ADS)

Alguacil, M. M.; Torrecillas, E.; Roldán, A.; Díaz, G.; Torres, P.

2012-04-01

The arbuscular mycorrhizal fungi (AMF) communities composition regulate plant interactions and determine the structure of plant communities. In this study we analysed the diversity of AMF in the roots of two perennial gypsophyte plant species, Herniaria fruticosa and Senecio auricula, and an annual herbaceous species, Bromus rubens, growing in a gypsum soil from a semiarid area. The objective was to determine whether perennial and annual host plants support different AMF communities in their roots and whether there are AMF species that might be indicators of specific functional plant roles in these ecosystems. The roots were analysed by nested PCR, cloning, sequencing of the ribosomal DNA small subunit region and phylogenetic analysis. Twenty AMF sequence types, belonging to the Glomus group A, Glomus group B, Diversisporaceae, Acaulosporaceae, Archaeosporaceae and Paraglomeraceae, were identified. Both gypsophyte perennial species had differing compositions of the AMF community and higher diversity when compared with the annual species, showing preferential selection by specific AMF sequences types. B. rubens did not show host specificity, sharing the full composition of its AMF community with both perennial plant species. Seasonal variations in the competitiveness of AM fungi could explain the observed differences in AMF community composition, but this is still a working hypothesis that requires the analysis of further data obtained from a higher number of both annual and perennial plant species in order to be fully tested.
Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Johnson, Courtney M.; Martin, Stanton L.; Land, Miriam L.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.

2012-01-01

To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated. PMID:23045501
Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn Marie

To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.
Sequence analysis of a few species of termites (Order: Isoptera) on the basis of partial characterization of COII gene.

PubMed

Sobti, Ranbir Chander; Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh

2009-11-01

The present study was aimed to get the nucleotide sequences of a part of COII mitochondrial gene amplified from individuals of five species of Termites (Isoptera: Termitidae: Macrotermitinae). Four of them belonged to the genus Odontotermes (O. obesus, O. horni, O. bhagwatii and Odontotermes sp.) and one to Microtermes (M. obesi). Partial COII gene fragments were amplified by using specific primers. The sequences so obtained were characterized to calculate the frequencies of each nucleotide bases and a high A + T content was observed. The interspecific pairwise sequence divergence in Odontotermes species ranged from 6.5% to 17.1% across COII fragment. M. obesi sequence diversity ranged from 2.5 with Odontotermes sp. to 19.0% with O. bhagwatii. Phylogenetic trees drawn on the basis of distance neighbour-joining method revealed three main clades clustering all the individuals according to their genera and families.
Culture and the sequence of steps in theory of mind development.

PubMed

Shahaeian, Ameneh; Peterson, Candida C; Slaughter, Virginia; Wellman, Henry M

2011-09-01

To examine cultural contrasts in the ordered sequence of conceptual developments leading to theory of mind (ToM), we compared 135 3- to 6-year-olds (77 Australians; 58 Iranians) on an established 5-step ToM scale (Wellman & Liu, 2004). There was a cross-cultural difference in the sequencing of ToM steps but not in overall rates of ToM mastery. In line with our predictions, the children from Iran conformed to a distinctive sequence previously observed only in children in China. In contrast to the case with children from Australia (and the United States), knowledge access was understood earlier than opinion diversity in children from Iran, consistent with this collectivist culture's emphasis on filial respect, dispute avoidance, and acquiring knowledge. Having a sibling was linked with faster overall ToM progress in Australia only and was not related to scale sequences in either culture.
Genetic diversity of Clostridium perfringens type A isolates from animals, food poisoning outbreaks and sludge

PubMed Central

Johansson, Anders; Aspan, Anna; Bagge, Elisabeth; Båverud, Viveca; Engström, Björn E; Johansson, Karl-Erik

2006-01-01

Background Clostridium perfringens, a serious pathogen, causes enteric diseases in domestic animals and food poisoning in humans. The epidemiological relationship between C. perfringens isolates from the same source has previously been investigated chiefly by pulsed-field gel electrophoresis (PFGE). In this study the genetic diversity of C. perfringens isolated from various animals, from food poisoning outbreaks and from sludge was investigated. Results We used PFGE to examine the genetic diversity of 95 C. perfringens type A isolates from eight different sources. The isolates were also examined for the presence of the beta2 toxin gene (cpb2) and the enterotoxin gene (cpe). The cpb2 gene from the 28 cpb2-positive isolates was also partially sequenced (519 bp, corresponding to positions 188 to 706 in the consensus cpb2 sequence). The results of PFGE revealed a wide genetic diversity among the C. perfringens type A isolates. The genetic relatedness of the isolates ranged from 58 to 100% and 56 distinct PFGE types were identified. Almost all clusters with similar patterns comprised isolates with a known epidemiological correlation. Most of the isolates from pig, horse and sheep carried the cpb2 gene. All isolates originating from food poisoning outbreaks carried the cpe gene and three of these also carried cpb2. Two evolutionary different populations were identified by sequence analysis of the partially sequenced cpb2 genes from our study and cpb2 sequences previously deposited in GenBank. Conclusion As revealed by PFGE, there was a wide genetic diversity among C. perfringens isolates from different sources. Epidemiologically related isolates showed a high genetic similarity, as expected, while isolates with no obvious epidemiological relationship expressed a lesser degree of genetic similarity. The wide diversity revealed by PFGE was not reflected in the 16S rRNA sequences, which had a considerable degree of sequence similarity. Sequence comparison of the partially sequenced cpb2 gene revealed two genetically different populations. This is to our knowledge the first study in which the genetic diversity of C. perfringens isolates both from different animals species, from food poisoning outbreaks and from sludge has been investigated. PMID:16737528
Low DNA Sequence Diversity of the Intergenic Spacer 1 Region in the Human Skin Commensal Fungi Malassezia sympodialis and M. dermatis Isolated from Patients with Malassezia-Associated Skin Diseases and Healthy Subjects.

PubMed

Cho, Otomi; Sugita, Takashi

2016-12-01

As DNA sequences of the intergenic spacer (IGS) region in the rRNA gene show remarkable intraspecies diversity compared with the small subunit, large subunit, and internal transcribed spacer region, the IGS region has been used as an epidemiological tool in studies on Malassezia globosa and M. restricta, which are responsible for the exacerbation of atopic dermatitis (AD) and seborrheic dermatitis (SD). However, the IGS regions of M. sympodialis and M. dermatis obtained from the skin of patients with AD and SD, as well as healthy subjects, lacked sequence diversity. Of the 105 M. sympodialis strains and the 40 M. dermatis strains, the sequences of 103 (98.1 %) and 39 (97.5 %), respectively, were identical. Thus, given the lack of intraspecies diversity in the IGS regions of M. sympodialis and M. dermatis, studies of the diversity of these species should be performed using appropriate genes and not the IGS.
Endophytic bacterial diversity in grapevine (Vitis vinifera L.) leaves described by 16S rRNA gene sequence analysis and length heterogeneity-PCR.

PubMed

Bulgari, Daniela; Casati, Paola; Brusetti, Lorenzo; Quaglino, Fabio; Brasca, Milena; Daffonchio, Daniele; Bianco, Piero Attilio

2009-08-01

Diversity of bacterial endophytes associated with grapevine leaf tissues was analyzed by cultivation and cultivation-independent methods. In order to identify bacterial endophytes directly from metagenome, a protocol for bacteria enrichment and DNA extraction was optimized. Sequence analysis of 16S rRNA gene libraries underscored five diverse Operational Taxonomic Units (OTUs), showing best sequence matches with gamma-Proteobacteria, family Enterobacteriaceae, with a dominance of the genus Pantoea. Bacteria isolation through cultivation revealed the presence of six OTUs, showing best sequence matches with Actinobacteria, genus Curtobacterium, and with Firmicutes genera Bacillus and Enterococcus. Length Heterogeneity-PCR (LH-PCR) electrophoretic peaks from single bacterial clones were used to setup a database representing the bacterial endophytes identified in association with grapevine tissues. Analysis of healthy and phytoplasma-infected grapevine plants showed that LH-PCR could be a useful complementary tool for examining the diversity of bacterial endophytes especially for diversity survey on a large number of samples.
Comparative genomics of citric-acid producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

DOE Office of Scientific and Technical Information (OSTI.GOV)

Andersen, Mikael R.; Salazar, Margarita; Schaap, Peter

2011-06-01

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compels additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regionsmore » have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases and protein transporters.« less
Viral morphogenesis is the dominant source of sequence censorship in M13 combinatorial peptide phage display.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rodi, D. J.; Soares, A. S.; Makowski, L.

Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0{+-}1.6% of the random dodecapeptides and 7.9{+-}2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usagemore » patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a {beta}-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.« less
Molecular epidemiology of oyster-related human noroviruses and their global genetic diversity and temporal-geographical distribution from 1983 to 2014.

PubMed

Yu, Yongxin; Cai, Hui; Hu, Linghao; Lei, Rongwei; Pan, Yingjie; Yan, Shuling; Wang, Yongjie

2015-11-01

Noroviruses (NoVs) are a leading cause of epidemic and sporadic cases of acute gastroenteritis worldwide. Oysters are well recognized as the main vectors of environmentally transmitted NoVs, and disease outbreaks linked to oyster consumption have been commonly observed. Here, to quantify the genetic diversity, temporal distribution, and circulation of oyster-related NoVs on a global scale, 1,077 oyster-related NoV sequences deposited from 1983 to 2014 were downloaded from both NCBI GenBank and the NoroNet outbreak database and were then screened for quality control. A total of 665 sequences with reliable information were obtained and were subsequently subjected to genotyping and phylogenetic analyses. The results indicated that the majority of oyster-related NoV sequences were obtained from coastal countries and regions and that the numbers of sequences in these regions were unevenly distributed. Moreover, >80% of human NoV genotypes were detected in oyster samples or oyster-related outbreaks. A higher proportion of genogroup I (GI) (34%) was observed for oyster-related sequences than for non-oyster-related outbreaks, where GII strains dominated with an overwhelming majority of >90%, indicating that the prevalences of GI and GII are different in humans and oysters. In addition, a related convergence of the circulation trend was found between oyster-related NoV sequences and human pandemic outbreaks. This suggests that oysters not only act as a vector of NoV through environmental transmission but also serve as an important reservoir of human NoVs. These results highlight the importance of oysters in the persistence and transmission of human NoVs in the environment and have important implications for the surveillance of human NoVs in oyster samples. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
TREE2FASTA: a flexible Perl script for batch extraction of FASTA sequences from exploratory phylogenetic trees.

PubMed

Sauvage, Thomas; Plouviez, Sophie; Schmidt, William E; Fredericq, Suzanne

2018-03-05

The body of DNA sequence data lacking taxonomically informative sequence headers is rapidly growing in user and public databases (e.g. sequences lacking identification and contaminants). In the context of systematics studies, sorting such sequence data for taxonomic curation and/or molecular diversity characterization (e.g. crypticism) often requires the building of exploratory phylogenetic trees with reference taxa. The subsequent step of segregating DNA sequences of interest based on observed topological relationships can represent a challenging task, especially for large datasets. We have written TREE2FASTA, a Perl script that enables and expedites the sorting of FASTA-formatted sequence data from exploratory phylogenetic trees. TREE2FASTA takes advantage of the interactive, rapid point-and-click color selection and/or annotations of tree leaves in the popular Java tree-viewer FigTree to segregate groups of FASTA sequences of interest to separate files. TREE2FASTA allows for both simple and nested segregation designs to facilitate the simultaneous preparation of multiple data sets that may overlap in sequence content.
Fine grained compositional analysis of Port Everglades Inlet microbiome using high throughput DNA sequencing.

PubMed

O'Connell, Lauren; Gao, Song; McCorquodale, Donald; Fleisher, Jay; Lopez, Jose V

2018-01-01

Similar to natural rivers, manmade inlets connect inland runoff to the ocean. Port Everglades Inlet (PEI) is a busy cargo and cruise ship port in South Florida, which can act as a source of pollution to surrounding beaches and offshore coral reefs. Understanding the composition and fluctuations of bacterioplankton communities ("microbiomes") in major port inlets is important due to potential impacts on surrounding environments. We hypothesize seasonal microbial fluctuations, which were profiled by high throughput 16S rRNA amplicon sequencing and analysis. Surface water samples were collected every week for one year. A total of four samples per month, two from each sampling location, were used for statistical analysis creating a high sampling frequency and finer sampling scale than previous inlet microbiome studies. We observed significant differences in community alpha diversity between months and seasons. Analysis of composition of microbiomes (ANCOM) tests were run in QIIME 2 at genus level taxonomic classification to determine which genera were differentially abundant between seasons and months. Beta diversity results yielded significant differences in PEI community composition in regard to month, season, water temperature, and salinity. Analysis of potentially pathogenic genera showed presence of Staphylococcus and Streptococcus . However, statistical analysis indicated that these organisms were not present in significantly high abundances throughout the year or between seasons. Significant differences in alpha diversity were observed when comparing microbial communities with respect to time. This observation stems from the high community evenness and low community richness in August. This indicates that only a few organisms dominated the community during this month. August had lower than average rainfall levels for a wet season, which may have contributed to less runoff, and fewer bacterial groups introduced into the port surface waters. Bacterioplankton beta diversity differed significantly by month, season, water temperature, and salinity. The 2013-2014 dry season (October-April), was warmer and wetter than historical averages. This may have driven significant differences in beta diversity. Increased nitrogen and phosphorous concentrations were observed in these dry season months, possibly creating favorable bacterial growth conditions. Potentially pathogenic genera were present in the PEI. However their relatively low, non-significant abundance levels highlight their relatively low risk for public health concerns. This study represents the first to sample a large port at this sampling scale and sequencing depth. These data can help establish the inlet microbial community baseline and supplement the vital monitoring of local marine and recreational environments, all the more poignant in context of local reef disease outbreaks and worldwide coral reef collapse in wake of a harsh 2014-16 El Niño event.
Sequence-Based Discovery Demonstrates That Fixed Light Chain Human Transgenic Rats Produce a Diverse Repertoire of Antigen-Specific Antibodies.

PubMed

Harris, Katherine E; Aldred, Shelley Force; Davison, Laura M; Ogana, Heather Anne N; Boudreau, Andrew; Brüggemann, Marianne; Osborn, Michael; Ma, Biao; Buelow, Benjamin; Clarke, Starlynn C; Dang, Kevin H; Iyer, Suhasini; Jorgensen, Brett; Pham, Duy T; Pratap, Payal P; Rangaswamy, Udaya S; Schellenberger, Ute; van Schooten, Wim C; Ugamraj, Harshad S; Vafa, Omid; Buelow, Roland; Trinklein, Nathan D

2018-01-01

We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1). This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs) isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.
Mitochondrial DNA diversity of the Amerindian populations living in the Andean Piedmont of Bolivia: Chimane, Moseten, Aymara and Quechua.

PubMed

Corella, Alfons; Bert, Francesc; Pérez-Pérez, Alejandro; Gené, Manel; Turbón, Daniel

2007-01-01

Chimane, Moseten Aymara and Quechua are Amerindian populations living in the Bolivian Piedmont, a characteristic ecoregion between the eastern slope of the Andean mountains and the Amazonian Llanos de Moxos. In both neighbouring areas, dense and complex societies have developed over the centuries. The Piedmont area is especially interesting from a human peopling perspective since there is no clear evidence regarding the genetic influence and peculiarities of these populations. This land has been used extensively as a territory of economic and cultural exchange between the Andes and Amazonia, however Chimane and Moseten populations have been sufficiently isolated from their neighbour groups to be recognized as distinct populations. Genetic information suggests that evolutionary processes, such as genetic drift, natural selection and genetic admixture have formed the history of the Piedmont populations. The objective of this study is to characterize the genetic diversity of the Piedmont populations, analysing the sequence variability of the HVR-I control region in the mitochondrial DNA (mtDNA). Haplogroup mtDNA data available from the whole of Central and South America were utilized to determine the relationship of the Piedmont populations with other Amerindian populations. Hair pulls were obtained in situ, and DNA from non-related individuals was extracted using a standard Chelex 100 method. A 401 bp DNA fragment of HVR-I region was amplified using standard procedures. Two independent 401 and 328 bp DNA fragments were sequenced separately for each sample. The sequence analyses included mismatch distribution and mean pairwise differences, median network analyses, AMOVA and principal component analyses. The genetic diversity of DNA sequences was measured and compared with other South Amerindian populations. The genetic diversity of 401 nucleotide mtDNA sequences, in the hypervariable Control Region, from positions 16 000-16 400, was characterized in a sample of 46 Amerindians living in the Piedmont area in the Beni Department of Bolivia. The results obtained indicate that the genetic diversity in the area is higher than that observed in other American groups living in much larger areas and despite the reduced size of the studied area the human groups analysed show high levels of inter-group variability. In addition, results show that Amerindian populations living in the Piedmont are genetically more related to those in the Andean than in the Amazonian populations.
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.

PubMed

Sakai, Ryo; Aerts, Jan

2014-01-01

The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Diversity of immunoglobulin lambda light chain gene usage over developmental stages in the horse.

PubMed

Tallmadge, Rebecca L; Tseng, Chia T; Felippe, M Julia B

2014-10-01

To further studies of neonatal immune responses to pathogens and vaccination, we investigated the dynamics of B lymphocyte development and immunoglobulin (Ig) gene diversity. Previously we demonstrated that equine fetal Ig VDJ sequences exhibit combinatorial and junctional diversity levels comparable to those of adult Ig VDJ sequences. Herein, RACE clones from fetal, neonatal, foal, and adult lymphoid tissue were assessed for Ig lambda light chain combinatorial, junctional, and sequence diversity. Remarkably, more lambda variable genes (IGLV) were used during fetal life than later stages and IGLV gene usage differed significantly with time, in contrast to the Ig heavy chain. Junctional diversity measured by CDR3L length was constant over time. Comparison of Ig lambda transcripts to germline revealed significant increases in nucleotide diversity over time, even during fetal life. These results suggest that the Ig lambda light chain provides an additional dimension of diversity to the equine Ig repertoire. Copyright © 2014 Elsevier Ltd. All rights reserved.
Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

PubMed

Yu, Zhongtang; Yu, Marie; Morrison, Mark

2006-04-01

Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
Diverse nucleotide compositions and sequence fluctuation in Rubisco protein genes

NASA Astrophysics Data System (ADS)

Holden, Todd; Dehipawala, S.; Cheung, E.; Bienaime, R.; Ye, J.; Tremberger, G., Jr.; Schneider, P.; Lieberman, D.; Cheung, T.

2011-10-01

The Rubisco protein-enzyme is arguably the most abundance protein on Earth. The biology dogma of transcription and translation necessitates the study of the Rubisco genes and Rubisco-like genes in various species. Stronger correlation of fractal dimension of the atomic number fluctuation along a DNA sequence with Shannon entropy has been observed in the studied Rubisco-like gene sequences, suggesting a more diverse evolutionary pressure and constraints in the Rubisco sequences. The strategy of using metal for structural stabilization appears to be an ancient mechanism, with data from the porphobilinogen deaminase gene in Capsaspora owczarzaki and Monosiga brevicollis. Using the chi-square distance probability, our analysis supports the conjecture that the more ancient Rubisco-like sequence in Microcystis aeruginosa would have experienced very different evolutionary pressure and bio-chemical constraint as compared to Bordetella bronchiseptica, the two microbes occupying either end of the correlation graph. Our exploratory study would indicate that high fractal dimension Rubisco sequence would support high carbon dioxide rate via the Michaelis- Menten coefficient; with implication for the control of the whooping cough pathogen Bordetella bronchiseptica, a microbe containing a high fractal dimension Rubisco-like sequence (2.07). Using the internal comparison of chi-square distance probability for 16S rRNA (~ E-22) versus radiation repair Rec-A gene (~ E-05) in high GC content Deinococcus radiodurans, our analysis supports the conjecture that high GC content microbes containing Rubisco-like sequence are likely to include an extra-terrestrial origin, relative to Deinococcus radiodurans. Similar photosynthesis process that could utilize host star radiation would not compete with radiation resistant process from the biology dogma perspective in environments such as Mars and exoplanets.
Phylogenetic analysis of a spontaneous cocoa bean fermentation metagenome reveals new insights into its bacterial and fungal community diversity.

PubMed

Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan

2012-01-01

This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques.

Bacterial diversity in typical Italian salami at different ripening stages as revealed by high-throughput sequencing of 16S rRNA amplicons.

PubMed

Połka, Justyna; Rebecchi, Annalisa; Pisacane, Vincenza; Morelli, Lorenzo; Puglisi, Edoardo

2015-04-01

The bacterial diversity involved in food fermentations is one of the most important factors shaping the final characteristics of traditional foods. Knowledge about this diversity can be greatly improved by the application of high-throughput sequencing technologies (HTS) coupled to the PCR amplification of the 16S rRNA subunit. Here we investigated the bacterial diversity in batches of Salame Piacentino PDO (Protected Designation of Origin), a dry fermented sausage that is typical of a regional area of Northern Italy. Salami samples from 6 different local factories were analysed at 0, 21, 49 and 63 days of ripening; raw meat at time 0 and casing samples at 21 days of ripening where also analysed, and the effect of starter addition was included in the experimental set-up. Culture-based microbiological analyses and PCR-DGGE were carried out in order to be compared with HTS results. A total of 722,196 high quality sequences were obtained after trimming, paired-reads assembly and quality screening of raw reads obtained by Illumina MiSeq sequencing of the two bacterial 16S hypervariable regions V3 and V4; manual curation of 16S database allowed a correct taxonomical classification at the species for 99.5% of these reads. Results confirmed the presence of main bacterial species involved in the fermentation of salami as assessed by PCR-DGGE, but with a greater extent of resolution and quantitative assessments that are not possible by the mere analyses of gel banding patterns. Thirty-two different Staphylococcus and 33 Lactobacillus species where identified in the salami from different producers, while the whole data set obtained accounted for 13 main families and 98 rare ones, 23 of which were present in at least 10% of the investigated samples, with casings being the major sources of the observed diversity. Multivariate analyses also showed that batches from 6 local producers tend to cluster altogether after 21 days of ripening, thus indicating that HTS has the potential for fine scale differentiation of local fermented foods. Copyright © 2014 Elsevier Ltd. All rights reserved.
Probing the Rare Biosphere of the North-West Mediterranean Sea: An Experiment with High Sequencing Effort.

PubMed

Crespo, Bibiana G; Wallhead, Philip J; Logares, Ramiro; Pedrós-Alió, Carlos

2016-01-01

High-throughput sequencing (HTS) techniques have suggested the existence of a wealth of species with very low relative abundance: the rare biosphere. We attempted to exhaustively map this rare biosphere in two water samples by performing an exceptionally deep pyrosequencing analysis (~500,000 final reads per sample). Species data were derived by a 97% identity criterion and various parametric distributions were fitted to the observed counts. Using the best-fitting Sichel distribution we estimate a total species richness of 1,568-1,669 (95% Credible Interval) and 5,027-5,196 for surface and deep water samples respectively, implying that 84-89% of the total richness in those two samples was sequenced, and we predict that a quadrupling of the present sequencing effort would suffice to observe 90% of the total richness in both samples. Comparing the HTS results with a culturing approach we found that most of the cultured taxa were not obtained by HTS, despite the high sequencing effort. Culturing therefore remains a useful tool for uncovering marine bacterial diversity, in addition to its other uses for studying the ecology of marine bacteria.
Employing genome-wide SNP discovery and genotyping strategy to extrapolate the natural allelic diversity and domestication patterns in chickpea

PubMed Central

Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

2015-01-01

The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920
Diversity of human immunodeficiency virus type 1 subtypes in Kagera and Kilimanjaro regions, Tanzania.

PubMed

Nyombi, Balthazar M; Kristiansen, Knut I; Bjune, Gunnar; Müller, Fredrik; Holm-Hansen, Carol

2008-06-01

A strategy to prevent the spread of HIV-1 worldwide is complicated by the high genetic diversity of the virus. To gain a better understanding of the HIV-1 genetic diversity in Tanzania, a molecular epidemiological investigation was conducted in Kagera and Kilimanjaro regions. While several studies have addressed HIV-1 subtypes in Tanzania, this is the first study to describe the virus subtypes circulating in Kagera. The Kagera region is the epicenter of the HIV-1 epidemic in Africa, and it was therefore of interest to compare the prevalence of HIV subtypes in this region and Kilimanjaro. Blood samples were obtained from 246 HIV-1-infected pregnant women attending antenatal clinics. Plasma HIV-1 RNA was extracted, amplified, and sequenced in the env C2V3 and/or pol regions from 209 samples. Based on the analysis of env C2V3 and pol sequences, 47.4% had concordant subtypes, 19.1% were discordant indicating recombination, and for 33.5% sequences were obtained for only one region. The distribution HIV-1 subtypes based on the phylogenetic analysis of paired env C2V3/ pol sequences in Kagera region was A/A (27.8%), C/C (29.6%), D/D (16.7%), and unique recombinant forms (25.9%), and in Kilimanjaro region was A/A (32.9%), C/C (25.9%), D/D (10.6%), CRF10_CD (1.2%), and unique recombinant forms (29.4%). The env C2V3 subsubtype A2 and env C2V3/pol CRF10_CD were also observed indicating that these recombinants are circulating in Tanzania. The high diversity of HIV-1 subtypes and the high prevalence of recombinants demonstrated in this study necessitate expanded and continuous monitoring of the epidemic in Tanzania. The trend may have implications for current national control strategies against the HIV-1 epidemic.
Coexistence and Within-Host Evolution of Diversified Lineages of Hypermutable Pseudomonas aeruginosa in Long-term Cystic Fibrosis Infections

PubMed Central

Feliziani, Sofía; Moyano, Alejandro J.; Di Rienzo, Julio A.; Krogh Johansen, Helle; Molin, Søren; Smania, Andrea M.

2014-01-01

The advent of high-throughput sequencing techniques has made it possible to follow the genomic evolution of pathogenic bacteria by comparing longitudinally collected bacteria sampled from human hosts. Such studies in the context of chronic airway infections by Pseudomonas aeruginosa in cystic fibrosis (CF) patients have indicated high bacterial population diversity. Such diversity may be driven by hypermutability resulting from DNA mismatch repair system (MRS) deficiency, a common trait evolved by P. aeruginosa strains in CF infections. No studies to date have utilized whole-genome sequencing to investigate within-host population diversity or long-term evolution of mutators in CF airways. We sequenced the genomes of 13 and 14 isolates of P. aeruginosa mutator populations from an Argentinian and a Danish CF patient, respectively. Our collection of isolates spanned 6 and 20 years of patient infection history, respectively. We sequenced 11 isolates from a single sample from each patient to allow in-depth analysis of population diversity. Each patient was infected by clonal populations of bacteria that were dominated by mutators. The in vivo mutation rate of the populations was ∼100 SNPs/year–∼40-fold higher than rates in normo-mutable populations. Comparison of the genomes of 11 isolates from the same sample showed extensive within-patient genomic diversification; the populations were composed of different sub-lineages that had coexisted for many years since the initial colonization of the patient. Analysis of the mutations identified genes that underwent convergent evolution across lineages and sub-lineages, suggesting that the genes were targeted by mutation to optimize pathogenic fitness. Parallel evolution was observed in reduction of overall catabolic capacity of the populations. These findings are useful for understanding the evolution of pathogen populations and identifying new targets for control of chronic infections. PMID:25330091
454 Pyrosequencing to Describe Microbial Eukaryotic Community Composition, Diversity and Relative Abundance: A Test for Marine Haptophytes

PubMed Central

Egge, Elianne; Bittner, Lucie; Andersen, Tom; Audic, Stéphane; de Vargas, Colomban; Edvardsen, Bente

2013-01-01

Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000–20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing. PMID:24069303
Low Diversity in the Mitogenome of Sperm Whales Revealed by Next-Generation Sequencing

PubMed Central

Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C. Scott

2013-01-01

Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity. PMID:23254394
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing.

PubMed

Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C Scott

2013-01-01

Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity.
Development of an oligonucleotide probe for Aureobasidium pullulans based on the small-subunit rRNA gene.

PubMed Central

Li, S; Cullen, D; Hjort, M; Spear, R; Andrews, J H

1996-01-01

Aureobasidium pullulans, a cosmopolitan yeast-like fungus, colonizes leaf surfaces and has potential as a biocontrol agent of pathogens. To assess the feasibility of rRNA as a target for A. pullulans-specific oligonucleotide probes, we compared the nucleotide sequences of the small-subunit rRNA (18S) genes of 12 geographically diverse A. pullulans strains. Extreme sequence conservation was observed. The consensus A. pullulans sequence was compared with other fungal sequences to identify potential probes. A 21-mer probe which hybridized to the 12 A. pullulans strains but not to 98 other fungi, including 82 isolates from the phylloplane, was identified. A 17-mer highly specific for Cladosporium herbarum was also identified. These probes have potential in monitoring and quantifying fungi in leaf surface and other microbial communities. PMID:8633850
High-throughput sequencing reveals unprecedented diversities of Aspergillus species in outdoor air.

PubMed

Lee, S; An, C; Xu, S; Lee, S; Yamamoto, N

2016-09-01

This study used the Illumina MiSeq to analyse compositions and diversities of Aspergillus species in outdoor air. The seasonal air samplings were performed at two locations in Seoul, South Korea. The results showed the relative abundances of all Aspergillus species combined ranging from 0·20 to 18% and from 0·19 to 21% based on the number of the internal transcribed spacer 1 (ITS1) and β-tubulin (BenA) gene sequences respectively. Aspergillus fumigatus was the most dominant species with the mean relative abundances of 1·2 and 5·5% based on the number of the ITS1 and BenA sequences respectively. A total of 29 Aspergillus species were detected and identified down to the species rank, among which nine species were known opportunistic pathogens. Remarkably, eight of the nine pathogenic species were detected by either one of the two markers, suggesting the need of using multiple markers and/or primer pairs when the assessments are made based on the high-throughput sequencing. Due to diversity of species within the genus Aspergillus, the high-throughput sequencing was useful to characterize their compositions and diversities in outdoor air, which are thought to be difficult to be accurately characterized by conventional culture and/or Sanger sequencing-based techniques. Aspergillus is a diverse genus of fungi with more than 300 species reported in literature. Aspergillus is important since some species are known allergens and opportunistic human pathogens. Traditionally, growth-dependent methods have been used to detect Aspergillus species in air. However, these methods are limited in the number of isolates that can be analysed for their identities, resulting in inaccurate characterizations of Aspergillus diversities. This study used the high-throughput sequencing to explore Aspergillus diversities in outdoor, which are thought to be difficult to be accurately characterized by traditional growth-dependent techniques. © 2016 The Society for Applied Microbiology.
Influenza A virus evolution and spatio-temporal dynamics in Eurasian wild birds: a phylogenetic and phylogeographical study of whole-genome sequence data

PubMed Central

Lewis, Nicola S.; Verhagen, Josanne H.; Javakhishvili, Zurab; Russell, Colin A.; Lexmond, Pascal; Westgeest, Kim B.; Bestebroer, Theo M.; Halpin, Rebecca A.; Lin, Xudong; Ransier, Amy; Fedorova, Nadia B.; Stockwell, Timothy B.; Latorre-Margalef, Neus; Olsen, Björn; Smith, Gavin; Bahl, Justin; Wentworth, David E.; Waldenström, Jonas; Fouchier, Ron A. M.

2015-01-01

Low pathogenic avian influenza A viruses (IAVs) have a natural host reservoir in wild waterbirds and the potential to spread to other host species. Here, we investigated the evolutionary, spatial and temporal dynamics of avian IAVs in Eurasian wild birds. We used whole-genome sequences collected as part of an intensive long-term Eurasian wild bird surveillance study, and combined this genetic data with temporal and spatial information to explore the virus evolutionary dynamics. Frequent reassortment and co-circulating lineages were observed for all eight genomic RNA segments over time. There was no apparent species-specific effect on the diversity of the avian IAVs. There was a spatial and temporal relationship between the Eurasian sequences and significant viral migration of avian IAVs from West Eurasia towards Central Eurasia. The observed viral migration patterns differed between segments. Furthermore, we discuss the challenges faced when analysing these surveillance and sequence data, and the caveats to be borne in mind when drawing conclusions from the apparent results of such analyses. PMID:25904147
Assessing Species Diversity Using Metavirome Data: Methods and Challenges.

PubMed

Herath, Damayanthi; Jayasundara, Duleepa; Ackland, David; Saeed, Isaam; Tang, Sen-Lin; Halgamuge, Saman

2017-01-01

Assessing biodiversity is an important step in the study of microbial ecology associated with a given environment. Multiple indices have been used to quantify species diversity, which is a key biodiversity measure. Measuring species diversity of viruses in different environments remains a challenge relative to measuring the diversity of other microbial communities. Metagenomics has played an important role in elucidating viral diversity by conducting metavirome studies; however, metavirome data are of high complexity requiring robust data preprocessing and analysis methods. In this review, existing bioinformatics methods for measuring species diversity using metavirome data are categorised broadly as either sequence similarity-dependent methods or sequence similarity-independent methods. The former includes a comparison of DNA fragments or assemblies generated in the experiment against reference databases for quantifying species diversity, whereas estimates from the latter are independent of the knowledge of existing sequence data. Current methods and tools are discussed in detail, including their applications and limitations. Drawbacks of the state-of-the-art method are demonstrated through results from a simulation. In addition, alternative approaches are proposed to overcome the challenges in estimating species diversity measures using metavirome data.
The ecology and diversity of microbial eukaryotes in geothermal springs.

PubMed

Oliverio, Angela M; Power, Jean F; Washburne, Alex; Cary, S Craig; Stott, Matthew B; Fierer, Noah

2018-04-16

Decades of research into the Bacteria and Archaea living in geothermal spring ecosystems have yielded great insight into the diversity of life and organismal adaptations to extreme environmental conditions. Surprisingly, while microbial eukaryotes (protists) are also ubiquitous in many environments, their diversity across geothermal springs has mostly been ignored. We used high-throughput sequencing to illuminate the diversity and structure of microbial eukaryotic communities found in 160 geothermal springs with broad ranges in temperature and pH across the Taupō Volcanic Zone in New Zealand. Protistan communities were moderately predictable in composition and varied most strongly across gradients in pH and temperature. Moreover, this variation mirrored patterns observed for bacterial and archaeal communities across the same spring samples, highlighting that there are similar ecological constraints across the tree of life. While extreme pH values were associated with declining protist diversity, high temperature springs harbored substantial amounts of protist diversity. Although protists are often overlooked in geothermal springs and other extreme environments, our results indicate that such environments can host distinct and diverse protistan communities.
Marine Fungi: Their Ecology and Molecular Diversity

NASA Astrophysics Data System (ADS)

Richards, Thomas A.; Jones, Meredith D. M.; Leonard, Guy; Bass, David

2012-01-01

Fungi appear to be rare in marine environments. There are relatively few marine isolates in culture, and fungal small subunit ribosomal DNA (SSU rDNA) sequences are rarely recovered in marine clone library experiments (i.e., culture-independent sequence surveys of eukaryotic microbial diversity from environmental DNA samples). To explore the diversity of marine fungi, we took a broad selection of SSU rDNA data sets and calculated a summary phylogeny. Bringing these data together identified a diverse collection of marine fungi, including sequences branching close to chytrids (flagellated fungi), filamentous hypha-forming fungi, and multicellular fungi. However, the majority of the sequences branched with ascomycete and basidiomycete yeasts. We discuss evidence for 36 novel marine lineages, the majority and most divergent of which branch with the chytrids. We then investigate what these data mean for the evolutionary history of the Fungi and specifically marine-terrestrial transitions. Finally, we discuss the roles of fungi in marine ecosystems.
A communal catalogue reveals Earth's multiscale microbial diversity.

PubMed

Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob

2017-11-23

Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
Nodulation-dependent communities of culturable bacterial endophytes from stems of field-grown soybeans.

PubMed

Okubo, Takashi; Ikeda, Seishi; Kaneko, Takakazu; Eda, Shima; Mitsui, Hisayuki; Sato, Shusei; Tabata, Satoshi; Minamisawa, Kiwamu

2009-01-01

Endophytic bacteria (247 isolates) were randomly isolated from surface-sterilized stems of non-nodulated (Nod(-)), wild-type nodulated (Nod(+)), and hypernodulated (Nod(++)) soybeans (Glycine max [L.] Merr) on three agar media (R2A, nutrient agar, and potato dextrose agar). Their diversity was compared on the basis of 16S rRNA gene sequences. The phylogenetic composition depended on the soybean nodulation phenotype, although diversity indexes were not correlated with nodulation phenotype. The most abundant phylum throughout soybean lines tested was Proteobacteria (58-79%). Gammaproteobacteria was the dominant class (21-72%) with a group of Pseudomonas sp. significantly abundant in Nod(+) soybeans. A high abundance of Alphaproteobacteria was observed in Nod(-) soybeans, which was explained by the increase in bacterial isolates of the families Rhizobiaceae and Sphingomonadaceae. A far greater abundance of Firmicutes was observed in Nod(-) and Nod(++) mutant soybeans than in Nod(+) soybeans. An impact of culture media on the diversity of isolated endophytic bacteria was also observed: The highest diversity indexes were obtained on the R2A medium, which enabled us to access Alphaproteobacteria and other phyla more frequently. The above results indicated that the extent of nodulation changes the phylogenetic composition of culturable bacterial endophytes in soybean stems.
Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.

2015-10-15

Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helixmore » bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).« less
Genetic Diversity of Picocyanobacteria in Tibetan Lakes: Assessing the Endemic and Universal Distributions

PubMed Central

Hu, Anyi; Liu, Xiaobo; Chen, Feng; Yao, Tandong; Jiao, Nianzhi

2014-01-01

The phylogenetic diversity of picocyanobacteria in seven alkaline lakes on the Tibetan Plateau was analyzed using the molecular marker 16S-23S rRNA internal transcribed spacer sequence. A total of 1,077 environmental sequences retrieved from the seven lakes were grouped into seven picocyanobacterial clusters, with two clusters newly described here. Each of the lakes was dominated by only one or two clusters, while different lakes could have disparate communities, suggesting low alpha diversity but high beta diversity of picocyanobacteria in these high-altitude freshwater and saline lakes. Several globally distributed clusters were found in these Tibetan lakes, such as subalpine cluster I and the Cyanobium gracile cluster. Although other clusters likely exhibit geographic restriction to the plateau temporally, reflecting endemicity, they can indeed be distributed widely on the plateau. Lakes with similar salinities may have similar genetic populations despite a large geographic distance. Canonical correspondence analysis identified salinity as the only environmental factor that may in part explain the diversity variations among lakes. Mantel tests suggested that the community similarities among lakes are independent of geographic distance. A portion of the picocyanobacterial clusters appear to be restricted to a narrow salinity range, while others are likely adapted to a broad range. A seasonal survey of Lake Namucuo across 3 years did not show season-related variations in diversity, and depth-related population partitioning was observed along a vertical profile of the lake. Our study emphasizes the high dispersive potential of picocyanobacteria and suggests that the regional distribution may result from adaptation to specified environments. PMID:25281375
Antibiotic resistance and population structure of cystic fibrosis Pseudomonas aeruginosa isolates from a Spanish multi-centre study.

PubMed

López-Causapé, Carla; de Dios-Caballero, Juan; Cobo, Marta; Escribano, Amparo; Asensio, Óscar; Oliver, Antonio; Del Campo, Rosa; Cantón, Rafael; Solé, Amparó; Cortell, Isidoro; Asensio, Oscar; García, Gloria; Martínez, María Teresa; Cols, María; Salcedo, Antonio; Vázquez, Carlos; Baranda, Félix; Girón, Rosa; Quintana, Esther; Delgado, Isabel; de Miguel, María Ángeles; García, Marta; Oliva, Concepción; Prados, María Concepción; Barrio, María Isabel; Pastor, María Dolores; Olveira, Casilda; de Gracia, Javier; Álvarez, Antonio; Escribano, Amparo; Castillo, Silvia; Figuerola, Joan; Togores, Bernat; Oliver, Antonio; López, Carla; de Dios Caballero, Juan; Tato, Marta; Máiz, Luis; Suárez, Lucrecia; Cantón, Rafael

2017-09-01

The first Spanish multi-centre study on the microbiology of cystic fibrosis (CF) was conducted from 2013 to 2014. The study involved 24 CF units from 17 hospitals, and recruited 341 patients. The aim of this study was to characterise Pseudomonas aeruginosa isolates, 79 of which were recovered from 75 (22%) patients. The study determined the population structure, antibiotic susceptibility profile and genetic background of the strains. Fifty-five percent of the isolates were multi-drug-resistant, and 16% were extensively-drug-resistant. Defective mutS and mutL genes were observed in mutator isolates (15.2%). Considerable genetic diversity was observed by pulsed-field gel electrophoresis (70 patterns) and multi-locus sequence typing (72 sequence types). International epidemic clones were not detected. Fifty-one new and 14 previously described array tube (AT) genotypes were detected by AT technology. This study found a genetically unrelated and highly diverse CF P. aeruginosa population in Spain, not represented by the epidemic clones widely distributed across Europe, with multiple combinations of virulence factors and high antimicrobial resistance rates (except for colistin). Copyright © 2017 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.
Novel Method for High-Throughput Full-Length IGHV-D-J Sequencing of the Immune Repertoire from Bulk B-Cells with Single-Cell Resolution.

PubMed

Vergani, Stefano; Korsunsky, Ilya; Mazzarello, Andrea Nicola; Ferrer, Gerardo; Chiorazzi, Nicholas; Bagnara, Davide

2017-01-01

Efficient and accurate high-throughput DNA sequencing of the adaptive immune receptor repertoire (AIRR) is necessary to study immune diversity in healthy subjects and disease-related conditions. The high complexity and diversity of the AIRR coupled with the limited amount of starting material, which can compromise identification of the full biological diversity makes such sequencing particularly challenging. AIRR sequencing protocols often fail to fully capture the sampled AIRR diversity, especially for samples containing restricted numbers of B lymphocytes. Here, we describe a library preparation method for immunoglobulin sequencing that results in an exhaustive full-length repertoire where virtually every sampled B-cell is sequenced. This maximizes the likelihood of identifying and quantifying the entire IGHV-D-J repertoire of a sample, including the detection of rearrangements present in only one cell in the starting population. The methodology establishes the importance of circumventing genetic material dilution in the preamplification phases and incorporates the use of certain described concepts: (1) balancing the starting material amount and depth of sequencing, (2) avoiding IGHV gene-specific amplification, and (3) using Unique Molecular Identifier. Together, this methodology is highly efficient, in particular for detecting rare rearrangements in the sampled population and when only a limited amount of starting material is available.

Genetic Diversity Analysis of Highly Incomplete SNP Genotype Data with Imputations: An Empirical Assessment

PubMed Central

Fu, Yong-Bi

2014-01-01

Genotyping by sequencing (GBS) recently has emerged as a promising genomic approach for assessing genetic diversity on a genome-wide scale. However, concerns are not lacking about the uniquely large unbalance in GBS genotype data. Although some genotype imputation has been proposed to infer missing observations, little is known about the reliability of a genetic diversity analysis of GBS data, with up to 90% of observations missing. Here we performed an empirical assessment of accuracy in genetic diversity analysis of highly incomplete single nucleotide polymorphism genotypes with imputations. Three large single-nucleotide polymorphism genotype data sets for corn, wheat, and rice were acquired, and missing data with up to 90% of missing observations were randomly generated and then imputed for missing genotypes with three map-independent imputation methods. Estimating heterozygosity and inbreeding coefficient from original, missing, and imputed data revealed variable patterns of bias from assessed levels of missingness and genotype imputation, but the estimation biases were smaller for missing data without genotype imputation. The estimates of genetic differentiation were rather robust up to 90% of missing observations but became substantially biased when missing genotypes were imputed. The estimates of topology accuracy for four representative samples of interested groups generally were reduced with increased levels of missing genotypes. Probabilistic principal component analysis based imputation performed better in terms of topology accuracy than those analyses of missing data without genotype imputation. These findings are not only significant for understanding the reliability of the genetic diversity analysis with respect to large missing data and genotype imputation but also are instructive for performing a proper genetic diversity analysis of highly incomplete GBS or other genotype data. PMID:24626289
Diverse molecular signatures for ribosomally ‘active’ Perkinsea in marine sediments

PubMed Central

2014-01-01

Background Perkinsea are a parasitic lineage within the eukaryotic superphylum Alveolata. Recent studies making use of environmental small sub-unit ribosomal RNA gene (SSU rDNA) sequencing methodologies have detected a significant diversity and abundance of Perkinsea-like phylotypes in freshwater environments. In contrast only a few Perkinsea environmental sequences have been retrieved from marine samples and only two groups of Perkinsea have been cultured and morphologically described and these are parasites of marine molluscs or marine protists. These two marine groups form separate and distantly related phylogenetic clusters, composed of closely related lineages on SSU rDNA trees. Here, we test the hypothesis that Perkinsea are a hitherto under-sampled group in marine environments. Using 454 diversity ‘tag’ sequencing we investigate the diversity and distribution of these protists in marine sediments and water column samples taken from the Deep Chlorophyll Maximum (DCM) and sub-surface using both DNA and RNA as the source template and sampling four European offshore locations. Results We detected the presence of 265 sequences branching with known Perkinsea, the majority of them recovered from marine sediments. Moreover, 27% of these sequences were sampled from RNA derived cDNA libraries. Phylogenetic analyses classify a large proportion of these sequences into 38 cluster groups (including 30 novel marine cluster groups), which share less than 97% sequence similarity suggesting this diversity encompasses a range of biologically and ecologically distinct organisms. Conclusions These results demonstrate that the Perkinsea lineage is considerably more diverse than previously detected in marine environments. This wide diversity of Perkinsea-like protists is largely retrieved in marine sediment with a significant proportion detected in RNA derived libraries suggesting this diversity represents ribosomally ‘active’ and intact cells. Given the phylogenetic range of hosts infected by known Perkinsea parasites, these data suggest that Perkinsea either play a significant but hitherto unrecognized role as parasites in marine sediments and/or members of this group are present in the marine sediment possibly as part of the ‘seed bank’ microbial community. PMID:24779375
Deep sequencing of the Trypanosoma cruzi GP63 surface proteases reveals diversity and diversifying selection among chronic and congenital Chagas disease patients.

PubMed

Llewellyn, Martin S; Messenger, Louisa A; Luquetti, Alejandro O; Garcia, Lineth; Torrico, Faustino; Tavares, Suelene B N; Cheaib, Bachar; Derome, Nicolas; Delepine, Marc; Baulard, Céline; Deleuze, Jean-Francois; Sauer, Sascha; Miles, Michael A

2015-04-01

Chagas disease results from infection with the diploid protozoan parasite Trypanosoma cruzi. T. cruzi is highly genetically diverse, and multiclonal infections in individual hosts are common, but little studied. In this study, we explore T. cruzi infection multiclonality in the context of age, sex and clinical profile among a cohort of chronic patients, as well as paired congenital cases from Cochabamba, Bolivia and Goias, Brazil using amplicon deep sequencing technology. A 450bp fragment of the trypomastigote TcGP63I surface protease gene was amplified and sequenced across 70 chronic and 22 congenital cases on the Illumina MiSeq platform. In addition, a second, mitochondrial target--ND5--was sequenced across the same cohort of cases. Several million reads were generated, and sequencing read depths were normalized within patient cohorts (Goias chronic, n = 43, Goias congenital n = 2, Bolivia chronic, n = 27; Bolivia congenital, n = 20), Among chronic cases, analyses of variance indicated no clear correlation between intra-host sequence diversity and age, sex or symptoms, while principal coordinate analyses showed no clustering by symptoms between patients. Between congenital pairs, we found evidence for the transmission of multiple sequence types from mother to infant, as well as widespread instances of novel genotypes in infants. Finally, non-synonymous to synonymous (dn:ds) nucleotide substitution ratios among sequences of TcGP63Ia and TcGP63Ib subfamilies within each cohort provided powerful evidence of strong diversifying selection at this locus. Our results shed light on the diversity of parasite DTUs within each patient, as well as the extent to which parasite strains pass between mother and foetus in congenital cases. Although we were unable to find any evidence that parasite diversity accumulates with age in our study cohorts, putative diversifying selection within members of the TcGP63I gene family suggests a link between genetic diversity within this gene family and survival in the mammalian host.
Development of Genomic Microsatellite Markers in Carthamus tinctorius L. (Safflower) Using Next Generation Sequencing and Assessment of Their Cross-Species Transferability and Utility for Diversity Analysis

PubMed Central

Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra

2015-01-01

Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743
Diversity and phylogenetic relationships among Bartonella strains from Thai bats.

PubMed

McKee, Clifton D; Kosoy, Michael Y; Bai, Ying; Osikowicz, Lynn M; Franka, Richard; Gilbert, Amy T; Boonmar, Sumalee; Rupprecht, Charles E; Peruski, Leonard F

2017-01-01

Bartonellae are phylogenetically diverse, intracellular bacteria commonly found in mammals. Previous studies have demonstrated that bats have a high prevalence and diversity of Bartonella infections globally. Isolates (n = 42) were obtained from five bat species in four provinces of Thailand and analyzed using sequences of the citrate synthase gene (gltA). Sequences clustered into seven distinct genogroups; four of these genogroups displayed similarity with Bartonella spp. sequences from other bats in Southeast Asia, Africa, and Eastern Europe. Thirty of the isolates representing these seven genogroups were further characterized by sequencing four additional loci (ftsZ, nuoG, rpoB, and ITS) to clarify their evolutionary relationships with other Bartonella species and to assess patterns of diversity among strains. Among the seven genogroups, there were differences in the number of sequence variants, ranging from 1-5, and the amount of nucleotide divergence, ranging from 0.035-3.9%. Overall, these seven genogroups meet the criteria for distinction as novel Bartonella species, with sequence divergence among genogroups ranging from 6.4-15.8%. Evidence of intra- and intercontinental phylogenetic relationships and instances of homologous recombination among Bartonella genogroups in related bat species were found in Thai bats.
On the use of high-throughput sequencing for the study of cyanobacterial diversity in Antarctic aquatic mats.

PubMed

Pessi, Igor Stelmach; Maalouf, Pedro De Carvalho; Laughinghouse, Haywood Dail; Baurain, Denis; Wilmotte, Annick

2016-06-01

The study of Antarctic cyanobacterial diversity has been mostly limited to morphological identification and traditional molecular techniques. High-throughput sequencing (HTS) allows a much better understanding of microbial distribution in the environment, but its application is hampered by several methodological and analytical challenges. In this work, we explored the use of HTS as a tool for the study of cyanobacterial diversity in Antarctic aquatic mats. Our results highlight the importance of using artificial communities to validate the parameters of the bioinformatics procedure used to analyze natural communities, since pipeline-dependent biases had a strong effect on the observed community structures. Analysis of microbial mats from five Antarctic lakes and an aquatic biofilm from the Sub-Antarctic showed that HTS is a valuable tool for the assessment of cyanobacterial diversity. The majority of the operational taxonomic units retrieved were related to filamentous taxa such as Leptolyngbya and Phormidium, which are common genera in Antarctic lacustrine microbial mats. However, other phylotypes related to different taxa such as Geitlerinema, Pseudanabaena, Synechococcus, Chamaesiphon, Calothrix, and Coleodesmium were also found. Results revealed a much higher diversity than what had been reported using traditional methods and also highlighted remarkable differences between the cyanobacterial communities of the studied lakes. The aquatic biofilm from the Sub-Antarctic had a distinct cyanobacterial community from the Antarctic lakes, which in turn displayed a salinity-dependent community structure at the phylotype level. © 2016 Phycological Society of America.
Characterization of Metabolically Active Bacterial Populations in Subseafloor Nankai Trough Sediments above, within, and below the Sulfate–Methane Transition Zone

PubMed Central

Mills, Heath J.; Reese, Brandi Kiel; Shepard, Alicia K.; Riedinger, Natascha; Dowd, Scot E.; Morono, Yuki; Inagaki, Fumio

2012-01-01

A remarkable number of microbial cells have been enumerated within subseafloor sediments, suggesting a biological impact on geochemical processes in the subseafloor habitat. However, the metabolically active fraction of these populations is largely uncharacterized. In this study, an RNA-based molecular approach was used to determine the diversity and community structure of metabolically active bacterial populations in the upper sedimentary formation of the Nankai Trough seismogenic zone. Samples used in this study were collected from the slope apron sediment overlying the accretionary prism at Site C0004 during the Integrated Ocean Drilling Program Expedition 316. The sediments represented microbial habitats above, within, and below the sulfate–methane transition zone (SMTZ), which was observed approximately 20 m below the seafloor (mbsf). Small subunit ribosomal RNA were extracted, quantified, amplified, and sequenced using high-throughput 454 pyrosequencing, indicating the occurrence of metabolically active bacterial populations to a depth of 57 mbsf. Transcript abundance and bacterial diversity decreased with increasing depth. The two communities below the SMTZ were similar at the phylum level, however only a 24% overlap was observed at the genus level. Active bacterial community composition was not confined to geochemically predicted redox stratification despite the deepest sample being more than 50 m below the oxic/anoxic interface. Genus-level classification suggested that the metabolically active subseafloor bacterial populations had similarities to previously cultured organisms. This allowed predictions of physiological potential, expanding understanding of the subseafloor microbial ecosystem. Unique community structures suggest very diverse active populations compared to previous DNA-based diversity estimates, providing more support for enhancing community characterizations using more advanced sequencing techniques. PMID:22485111
Gene transfer agent (GTA) genes reveal diverse and dynamic Roseobacter and Rhodobacter populations in the Chesapeake Bay.

PubMed

Zhao, Yanlin; Wang, Kui; Budinoff, Charles; Buchan, Alison; Lang, Andrew; Jiao, Nianzhi; Chen, Feng

2009-03-01

Within the bacterial class Alphaproteobacteria, the order Rhodobacterales contains the Roseobacter and Rhodobacter clades. Roseobacters are abundant and play important biogeochemical roles in marine environments. Roseobacter and Rhodobacter genomes contain a conserved gene transfer agent (GTA) gene cluster, and GTA-mediated gene transfer has been observed in these groups of bacteria. In this study, we investigated the genetic diversity of these two groups in Chesapeake Bay surface waters using a specific PCR primer set targeting the conserved Rhodobacterales GTA major capsid protein gene (g5). The g5 gene was successfully amplified from 26 Rhodobacterales isolates and the bay microbial communities using this primer set. Four g5 clone libraries were constructed from microbial assemblages representing different regions and seasons of the bay and yielded diverse sequences. In total, 12 distinct g5 clusters could be identified among 158 Chesapeake Bay clones, 11 fall within the Roseobacter clade, and one falls in the Rhodobacter clade. The vast majority of the clusters (10 out of 12) lack cultivated representatives. The composition of g5 sequences varied dramatically along the bay during the wintertime, and a distinct Roseobacter population composition between winter and summer was observed. The congruence between g5 and 16S rRNA gene phylogenies indicates that g5 may serve as a useful genetic marker to investigate diversity and abundance of Roseobacter and Rhodobacter in natural environments. The presence of the g5 gene in the natural populations of Roseobacter and Rhodobacter implies that genetic exchange through GTA transduction could be an important mechanism for maintaining the metabolic flexibility of these groups of bacteria.
Phase Diversity Applied to Sunspot Observations

NASA Astrophysics Data System (ADS)

Tritschler, A.; Schmidt, W.; Knolker, M.

We present preliminary results of a multi-colour phase diversity experiment carried out with the Multichannel Filter System of the Vacuum Tower Telescope at the Observatorio del Teide on Tenerife. We apply phase-diversity imaging to a time sequence of sunspot filtergrams taken in three continuum bands and correct the seeing influence for each image. A newly developed phase diversity device allowing for the projection of both the focused and the defocused image onto a single CCD chip was used in one of the wavelength channels. With the information about the wavefront obtained by the image reconstruction algorithm the restoration of the other two bands can be performed as well. The processed and restored data set will then be used to derive the temperature and proper motion of the umbral dots. Data analysis is still under way, and final results will be given in a forthcoming article.
Increasing Clinical Severity during a Dengue Virus Type 3 Cuban Epidemic: Deep Sequencing of Evolving Viral Populations

PubMed Central

Blanc, Hervé; Bordería, Antonio V.; Díaz, Gisell; Henningsson, Rasmus; Gonzalez, Daniel; Santana, Emidalys; Alvarez, Mayling; Castro, Osvaldo; Fontes, Magnus; Vignuzzi, Marco; Guzman, Maria G.

2016-01-01

ABSTRACT During the dengue virus type 3 (DENV-3) epidemic that occurred in Havana in 2001 to 2002, severe disease was associated with the infection sequence DENV-1 followed by DENV-3 (DENV-1/DENV-3), while the sequence DENV-2/DENV-3 was associated with mild/asymptomatic infections. To determine the role of the virus in the increasing severity demonstrated during the epidemic, serum samples collected at different time points were studied. A total of 22 full-length sequences were obtained using a deep-sequencing approach. Bayesian phylogenetic analysis of consensus sequences revealed that two DENV-3 lineages were circulating in Havana at that time, both grouped within genotype III. The predominant lineage is closely related to Peruvian and Ecuadorian strains, while the minor lineage is related to Venezuelan strains. According to consensus sequences, relatively few nonsynonymous mutations were observed; only one was fixed during the epidemic at position 4380 in the NS2B gene. Intrahost genetic analysis indicated that a significant minor population was selected and became predominant toward the end of the epidemic. In conclusion, greater variability was detected during the epidemic's progression in terms of significant minority variants, particularly in the nonstructural genes. An increasing trend of genetic diversity toward the end of the epidemic was observed only for synonymous variant allele rates, with higher variability in secondary cases. Remarkably, significant intrahost genetic variation was demonstrated within the same patient during the course of secondary infection with DENV-1/DENV-3, including changes in the structural proteins premembrane (PrM) and envelope (E). Therefore, the dynamic of evolving viral populations in the context of heterotypic antibodies could be related to the increasing clinical severity observed during the epidemic. IMPORTANCE Based on the evidence that DENV fitness is context dependent, our research has focused on the study of viral factors associated with intraepidemic increasing severity in a unique epidemiological setting. Here, we investigated the intrahost genetic diversity in acute human samples collected at different time points during the DENV-3 epidemic that occurred in Cuba in 2001 to 2002 using a deep-sequencing approach. We concluded that greater variability in significant minor populations occurred as the epidemic progressed, particularly in the nonstructural genes, with higher variability observed in secondary infection cases. Remarkably, for the first time significant intrahost genetic variation was demonstrated within the same patient during the course of secondary infection with DENV-1/DENV-3, including changes in structural proteins. These findings indicate that high-resolution approaches are needed to unravel molecular mechanisms involved in dengue pathogenesis. PMID:26889031
Contrasting Patterns of Genomic Diversity Reveal Accelerated Genetic Drift but Reduced Directional Selection on X-Chromosome in Wild and Domestic Sheep Species.

PubMed

Chen, Ze-Hui; Zhang, Min; Lv, Feng-Hua; Ren, Xue; Li, Wen-Rong; Liu, Ming-Jun; Nam, Kiwoong; Bruford, Michael W; Li, Meng-Hua

2018-04-01

Analyses of genomic diversity along the X chromosome and of its correlation with autosomal diversity can facilitate understanding of evolutionary forces in shaping sex-linked genomic architecture. Strong selective sweeps and accelerated genetic drift on the X-chromosome have been inferred in primates and other model species, but no such insight has yet been gained in domestic animals compared with their wild relatives. Here, we analyzed X-chromosome variability in a large ovine data set, including a BeadChip array for 943 ewes from the world's sheep populations and 110 whole genomes of wild and domestic sheep. Analyzing whole-genome sequences, we observed a substantially reduced X-to-autosome diversity ratio (∼0.6) compared with the value expected under a neutral model (0.75). In particular, one large X-linked segment (43.05-79.25 Mb) was found to show extremely low diversity, most likely due to a high density of coding genes, featuring highly conserved regions. In general, we observed higher nucleotide diversity on the autosomes, but a flat diversity gradient in X-linked segments, as a function of increasing distance from the nearest genes, leading to a decreased X: autosome (X/A) diversity ratio and contrasting to the positive correlation detected in primates and other model animals. Our evidence suggests that accelerated genetic drift but reduced directional selection on X chromosome, as well as sex-biased demographic events, explain low X-chromosome diversity in sheep species. The distinct patterns of X-linked and X/A diversity we observed between Middle Eastern and non-Middle Eastern sheep populations can be explained by multiple migrations, selection, and admixture during the domestic sheep's recent postdomestication demographic expansion, coupled with natural selection for adaptation to new environments. In addition, we identify important novel genes involved in abnormal behavioral phenotypes, metabolism, and immunity, under selection on the sheep X-chromosome.
Contrasting Patterns of Genomic Diversity Reveal Accelerated Genetic Drift but Reduced Directional Selection on X-Chromosome in Wild and Domestic Sheep Species

PubMed Central

Chen, Ze-Hui; Zhang, Min; Lv, Feng-Hua; Ren, Xue; Li, Wen-Rong; Liu, Ming-Jun; Nam, Kiwoong; Bruford, Michael W; Li, Meng-Hua

2018-01-01

Abstract Analyses of genomic diversity along the X chromosome and of its correlation with autosomal diversity can facilitate understanding of evolutionary forces in shaping sex-linked genomic architecture. Strong selective sweeps and accelerated genetic drift on the X-chromosome have been inferred in primates and other model species, but no such insight has yet been gained in domestic animals compared with their wild relatives. Here, we analyzed X-chromosome variability in a large ovine data set, including a BeadChip array for 943 ewes from the world’s sheep populations and 110 whole genomes of wild and domestic sheep. Analyzing whole-genome sequences, we observed a substantially reduced X-to-autosome diversity ratio (∼0.6) compared with the value expected under a neutral model (0.75). In particular, one large X-linked segment (43.05–79.25 Mb) was found to show extremely low diversity, most likely due to a high density of coding genes, featuring highly conserved regions. In general, we observed higher nucleotide diversity on the autosomes, but a flat diversity gradient in X-linked segments, as a function of increasing distance from the nearest genes, leading to a decreased X: autosome (X/A) diversity ratio and contrasting to the positive correlation detected in primates and other model animals. Our evidence suggests that accelerated genetic drift but reduced directional selection on X chromosome, as well as sex-biased demographic events, explain low X-chromosome diversity in sheep species. The distinct patterns of X-linked and X/A diversity we observed between Middle Eastern and non-Middle Eastern sheep populations can be explained by multiple migrations, selection, and admixture during the domestic sheep’s recent postdomestication demographic expansion, coupled with natural selection for adaptation to new environments. In addition, we identify important novel genes involved in abnormal behavioral phenotypes, metabolism, and immunity, under selection on the sheep X-chromosome. PMID:29790980
Identifying airborne fungi in Seoul, Korea using metagenomics.

PubMed

Oh, Seung-Yoon; Fong, Jonathan J; Park, Myung Soo; Chang, Limseok; Lim, Young Woon

2014-06-01

Fungal spores are widespread and common in the atmosphere. In this study, we use a metagenomic approach to study the fungal diversity in six total air samples collected from April to May 2012 in Seoul, Korea. This springtime period is important in Korea because of the peak in fungal spore concentration and Asian dust storms, although the year of this study (2012) was unique in that were no major Asian dust events. Clustering sequences for operational taxonomic unit (OTU) identification recovered 1,266 unique OTUs in the combined dataset, with between 223᾿96 OTUs present in individual samples. OTUs from three fungal phyla were identified. For Ascomycota, Davidiella (anamorph: Cladosporium) was the most common genus in all samples, often accounting for more than 50% of all sequences in a sample. Other common Ascomycota genera identified were Alternaria, Didymella, Khuskia, Geosmitha, Penicillium, and Aspergillus. While several Basidiomycota genera were observed, Chytridiomycota OTUs were only present in one sample. Consistency was observed within sampling days, but there was a large shift in species composition from Ascomycota dominant to Basidiomycota dominant in the middle of the sampling period. This marked change may have been caused by meteorological events. A potential set of 40 allergy-inducing genera were identified, accounting for a large proportion of the diversity present (22.5᾿7.2%). Our study identifies high fungal diversity and potentially high levels of fungal allergens in springtime air of Korea, and provides a good baseline for future comparisons with Asian dust storms.
Structuring of Bacterioplankton Diversity in a Large Tropical Bay

PubMed Central

Gregoracci, Gustavo B.; Nascimento, Juliana R.; Cabral, Anderson S.; Paranhos, Rodolfo; Valentin, Jean L.; Thompson, Cristiane C.; Thompson, Fabiano L.

2012-01-01

Structuring of bacterioplanktonic populations and factors that determine the structuring of specific niche partitions have been demonstrated only for a limited number of colder water environments. In order to better understand the physical chemical and biological parameters that may influence bacterioplankton diversity and abundance, we examined their productivity, abundance and diversity in the second largest Brazilian tropical bay (Guanabara Bay, GB), as well as seawater physical chemical and biological parameters of GB. The inner bay location with higher nutrient input favored higher microbial (including vibrio) growth. Metagenomic analysis revealed a predominance of Gammaproteobacteria in this location, while GB locations with lower nutrient concentration favored Alphaproteobacteria and Flavobacteria. According to the subsystems (SEED) functional analysis, GB has a distinctive metabolic signature, comprising a higher number of sequences in the metabolism of phosphorus and aromatic compounds and a lower number of sequences in the photosynthesis subsystem. The apparent phosphorus limitation appears to influence the GB metagenomic signature of the three locations. Phosphorus is also one of the main factors determining changes in the abundance of planktonic vibrios, suggesting that nutrient limitation can be observed at community (metagenomic) and population levels (total prokaryote and vibrio counts). PMID:22363639
High-Level Diversity of Tailed Phages, Eukaryote-Associated Viruses, and Virophage-Like Elements in the Metaviromes of Antarctic Soils

PubMed Central

Zablocki, Olivier; van Zyl, Lonnie; Adriaenssens, Evelien M.; Rubagotti, Enrico; Tuffin, Marla; Cary, Stephen Craig

2014-01-01

The metaviromes of two distinct Antarctic hyperarid desert soil communities have been characterized. Hypolithic communities, cyanobacterium-dominated assemblages situated on the ventral surfaces of quartz pebbles embedded in the desert pavement, showed higher virus diversity than surface soils, which correlated with previous bacterial community studies. Prokaryotic viruses (i.e., phages) represented the largest viral component (particularly Mycobacterium phages) in both habitats, with an identical hierarchical sequence abundance of families of tailed phages (Siphoviridae > Myoviridae > Podoviridae). No archaeal viruses were found. Unexpectedly, cyanophages were poorly represented in both metaviromes and were phylogenetically distant from currently characterized cyanophages. Putative phage genomes were assembled and showed a high level of unaffiliated genes, mostly from hypolithic viruses. Moreover, unusual gene arrangements in which eukaryotic and prokaryotic virus-derived genes were found within identical genome segments were observed. Phycodnaviridae and Mimiviridae viruses were the second-most-abundant taxa and more numerous within open soil. Novel virophage-like sequences (within the Sputnik clade) were identified. These findings highlight high-level virus diversity and novel species discovery potential within Antarctic hyperarid soils and may serve as a starting point for future studies targeting specific viral groups. PMID:25172856
Relative Abundance and Diversity of Bacterial Methanotrophs at the Oxic-Anoxic Interface of the Congo Deep-Sea Fan.

PubMed

Bessette, Sandrine; Moalic, Yann; Gautey, Sébastien; Lesongeur, Françoise; Godfroy, Anne; Toffin, Laurent

2017-01-01

Sitting at ∼5,000 m water depth on the Congo-Angola margin and ∼760 km offshore of the West African coast, the recent lobe complex of the Congo deep-sea fan receives large amounts of fluvial sediments (3-5% organic carbon). This organic-rich sedimentation area harbors habitats with chemosynthetic communities similar to those of cold seeps. In this study, we investigated relative abundance, diversity and distribution of aerobic methane-oxidizing bacteria (MOB) communities at the oxic-anoxic interface of sedimentary habitats by using fluorescence in situ hybridization and comparative sequence analysis of particulate mono-oxygenase ( pmoA ) genes. Our findings revealed that sedimentary habitats of the recent lobe complex hosted type I and type II MOB cells and comparisons of pmoA community compositions showed variations among the different organic-rich habitats. Furthermore, the pmoA lineages were taxonomically more diverse compared to methane seep environments and were related to those found at cold seeps. Surprisingly, MOB phylogenetic lineages typical of terrestrial environments were observed at such water depth. In contrast, MOB cells or pmoA sequences were not detected at the previous lobe complex that is disconnected from the Congo River inputs.
Diversity Profile of Microbes Associated with Anaerobic Sulfur Oxidation in an Upflow Anaerobic Sludge Blanket Reactor Treating Municipal Sewage

PubMed Central

Aida, Azrina A.; Kuroda, Kyohei; Yamamoto, Masamitsu; Nakamura, Akinobu; Hatamoto, Masashi; Yamaguchi, Takashi

2015-01-01

We herein analyzed the diversity of microbes involved in anaerobic sulfur oxidation in an upflow anaerobic sludge blanket (UASB) reactor used for treating municipal sewage under low-temperature conditions. Anaerobic sulfur oxidation occurred in the absence of oxygen, with nitrite and nitrate as electron acceptors; however, reactor performance parameters demonstrated that anaerobic conditions were maintained. In order to gain insights into the underlying basis of anaerobic sulfur oxidation, the microbial diversity that exists in the UASB sludge was analyzed comprehensively to determine their identities and contribution to sulfur oxidation. Sludge samples were collected from the UASB reactor over a period of 2 years and used for bacterial 16S rRNA gene-based terminal restriction fragment length polymorphism (T-RFLP) and next-generation sequencing analyses. T-RFLP and sequencing results both showed that microbial community patterns changed markedly from day 537 onwards. Bacteria belonging to the genus Desulforhabdus within the phylum Proteobacteria and uncultured bacteria within the phylum Fusobacteria were the main groups observed during the period of anaerobic sulfur oxidation. Their abundance correlated with temperature, suggesting that these bacterial groups played roles in anaerobic sulfur oxidation in UASB reactors. PMID:25817585
Distribution and diversity of Prochlorococcus ecotypes in the Red Sea.

PubMed

Shibl, Ahmed A; Thompson, Luke R; Ngugi, David K; Stingl, Ulrich

2014-07-01

Photosynthetic prokaryotes of the genus Prochlorococcus play a major role in global primary production in the world's oligotrophic oceans. A recent study on pelagic bacterioplankton communities in the northern and central Red Sea indicated that the predominant cyanobacterial 16S rRNA gene sequence types were from Prochlorococcus cells belonging to a high-light-adapted ecotype (HL II). In this study, we analyzed microdiversity of Prochlorococcus sp. at multiple depths within and below the euphotic zone in the northern, central, and southern regions of the Red Sea, as well as in surface waters in the same locations, but in a different season. Prochlorococcus dominated the communities in clone libraries of the amplified 16S-23S rRNA internal transcribed spacer (ITS) region. Almost no differences were found between samples from coastal or open-water sites, but a high diversity of Prochlorococcus ecotypes was detected at 100-meter depth in the water column. In addition, an unusual dominance of HL II-related sequences was observed in deeper waters. Our results indicate that the Red Sea harbors diverse Prochlorococcus lineages, but no novel ecotypes, despite its unusual physicochemical properties. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Helminth Colonization Is Associated with Increased Diversity of the Gut Microbiota

PubMed Central

Lee, Soo Ching; Tang, Mei San; Lim, Yvonne A. L.; Choy, Seow Huey; Kurtz, Zachary D.; Cox, Laura M.; Gundra, Uma Mahesh; Cho, Ilseung; Bonneau, Richard; Blaser, Martin J.; Chua, Kek Heng; Loke, P'ng

2014-01-01

Soil-transmitted helminths colonize more than 1.5 billion people worldwide, yet little is known about how they interact with bacterial communities in the gut microbiota. Differences in the gut microbiota between individuals living in developed and developing countries may be partly due to the presence of helminths, since they predominantly infect individuals from developing countries, such as the indigenous communities in Malaysia we examine in this work. We compared the composition and diversity of bacterial communities from the fecal microbiota of 51 people from two villages in Malaysia, of which 36 (70.6%) were infected by helminths. The 16S rRNA V4 region was sequenced at an average of nineteen thousand sequences per samples. Helminth-colonized individuals had greater species richness and number of observed OTUs with enrichment of Paraprevotellaceae, especially with Trichuris infection. We developed a new approach of combining centered log-ratio (clr) transformation for OTU relative abundances with sparse Partial Least Squares Discriminant Analysis (sPLS-DA) to enable more robust predictions of OTU interrelationships. These results suggest that helminths may have an impact on the diversity, bacterial community structure and function of the gut microbiota. PMID:24851867
Microbial colonization of basaltic glasses in hydrothermal organic-rich sediments at Guaymas Basin

PubMed Central

Callac, Nolwenn; Rommevaux-Jestin, Céline; Rouxel, Olivier; Lesongeur, Françoise; Liorzou, Céline; Bollinger, Claire; Ferrant, Antony; Godfroy, Anne

2013-01-01

Oceanic basalts host diverse microbial communities with various metabolisms involved in C, N, S, and Fe biogeochemical cycles which may contribute to mineral and glass alteration processes at, and below the seafloor. In order to study the microbial colonization on basaltic glasses and their potential biotic/abiotic weathering products, two colonization modules called AISICS (“Autonomous in situ Instrumented Colonization System”) were deployed in hydrothermal deep-sea sediments at the Guaymas Basin for 8 days and 22 days. Each AISICS module contained 18 colonizers (including sterile controls) filled with basaltic glasses of contrasting composition. Chemical analyses of ambient fluids sampled through the colonizers showed a greater contribution of hydrothermal fluids (maximum temperature 57.6°C) for the module deployed during the longer time period. For each colonizer, the phylogenetic diversity and metabolic function of bacterial and archaeal communities were explored using a molecular approach by cloning and sequencing. Results showed large microbial diversity in all colonizers. The bacterial distribution was primarily linked to the deployment duration, as well as the depth for the short deployment time module. Some 16s rRNA sequences formed a new cluster of Epsilonproteobacteria. Within the Archaea the retrieved diversity could not be linked to either duration, depth or substrata. However, mcrA gene sequences belonging to the ANME-1 mcrA-guaymas cluster were found sometimes associated with their putative sulfate-reducers syntrophs depending on the colonizers. Although no specific glass alteration texture was identified, nano-crystals of barite and pyrite were observed in close association with organic matter, suggesting a possible biological mediation. This study gives new insights into the colonization steps of volcanic rock substrates and the capability of microbial communities to exploit new environmental conditions. PMID:23986754

Bacterial diversity in the oxygen minimum zone of the eastern tropical South Pacific.

PubMed

Stevens, Heike; Ulloa, Osvaldo

2008-05-01

The structure and diversity of bacterial communities associated with the oxygen minimum zone (OMZ) of the eastern tropical South Pacific was studied through phylogenetic analysis. Clone libraries of 16S rRNA gene fragments were constructed using environmental DNA collected from the OMZ (60 m and 200 m), the sea surface (10 m), and the deep oxycline (450 m). At the class level, the majority of sequences affiliated to the gamma- (53.7%) and alpha-Proteobacteria (19.7%), and to the Bacteroidetes (11.2%). A vertical partitioning of the bacterial communities was observed, with main differences between the suboxic OMZ and the more oxygenated surface and deep oxycline waters. At the surface, the microbial community was predominantly characterized by SAR86, Loktanella and unclassified Flavobacteriaceae, whereas the deeper layer was dominated by Sulfitobacter and unclassified Alteromonadaceae. In the OMZ, major constituents affiliated to the marine SAR11 clade and to thiotrophic gamma-symbionts (25% of all sequences), a group not commonly found in pelagic waters. Sequences affiliating to the phylum Chloroflexi, to the AGG47 and SAR202 clades, to the delta-Proteobacteria, to the Acidobacteria, and to the 'anammox group' of the Planctomycetes were found exclusively in the OMZ. The bacterial richness in the OMZ was higher than in the oxic surface and deeper oxycline, as revealed by rarefaction analysis and the Chao1 richness estimator (surface: 45 +/- 8, deeper oxycline: 76 +/- 26; OMZ (60 m): 97 +/- 33, OMZ (200 m): 109 +/- 31). OMZ bacterial diversity indices (Fisher's: approximately 30 +/- 5, Shannon's: approximately 3.31, inverse Simpson's: approximately 20) were similar to those found in other pelagic marine environments. Thus, our results indicate a distinct and diverse bacterial community within the OMZ, with presumably novel and yet uncultivated bacterial lineages.
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

PubMed

Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

2014-07-01

Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Vampires in the oceans: predatory cercozoan amoebae in marine habitats.

PubMed

Berney, Cédric; Romac, Sarah; Mahé, Frédéric; Santini, Sébastien; Siano, Raffaele; Bass, David

2013-12-01

Vampire amoebae (vampyrellids) are predators of algae, fungi, protozoa and small metazoans known primarily from soils and in freshwater habitats. They are among the very few heterotrophic naked, filose and reticulose protists that have received some attention from a morphological and ecological point of view over the last few decades, because of the peculiar mode of feeding of known species. Yet, the true extent of their biodiversity remains largely unknown. Here we use a complementary approach of culturing and sequence database mining to address this issue, focusing our efforts on marine environments, where vampyrellids are very poorly known. We present 10 new vampyrellid isolates, 8 from marine or brackish sediments, and 2 from soil or freshwater sediment. Two of the former correspond to the genera Thalassomyxa Grell and Penardia Cash for which sequence data were previously unavailable. Small-subunit ribosomal DNA analysis confirms they are all related to previously sequenced vampyrellids. An exhaustive screening of the NCBI GenBank database and of 454 sequence data generated by the European BioMarKs consortium revealed hundreds of distinct environmental vampyrellid sequences. We show that vampyrellids are much more diverse than previously thought, especially in marine habitats. Our new isolates, which cover almost the full phylogenetic range of vampyrellid sequences revealed in this study, offer a rare opportunity to integrate data from environmental DNA surveys with phenotypic information. However, the very large genetic diversity we highlight within vampyrellids (especially in marine sediments and soils) contrasts with the paradoxically low morphological distinctiveness we observed across our isolates.
Tolerance of DNA Mismatches in Dmc1 Recombinase-mediated DNA Strand Exchange.

PubMed

Borgogno, María V; Monti, Mariela R; Zhao, Weixing; Sung, Patrick; Argaraña, Carlos E; Pezza, Roberto J

2016-03-04

Recombination between homologous chromosomes is required for the faithful meiotic segregation of chromosomes and leads to the generation of genetic diversity. The conserved meiosis-specific Dmc1 recombinase catalyzes homologous recombination triggered by DNA double strand breaks through the exchange of parental DNA sequences. Although providing an efficient rate of DNA strand exchange between polymorphic alleles, Dmc1 must also guard against recombination between divergent sequences. How DNA mismatches affect Dmc1-mediated DNA strand exchange is not understood. We have used fluorescence resonance energy transfer to study the mechanism of Dmc1-mediated strand exchange between DNA oligonucleotides with different degrees of heterology. The efficiency of strand exchange is highly sensitive to the location, type, and distribution of mismatches. Mismatches near the 3' end of the initiating DNA strand have a small effect, whereas most mismatches near the 5' end impede strand exchange dramatically. The Hop2-Mnd1 protein complex stimulates Dmc1-catalyzed strand exchange on homologous DNA or containing a single mismatch. We observed that Dmc1 can reject divergent DNA sequences while bypassing a few mismatches in the DNA sequence. Our findings have important implications in understanding meiotic recombination. First, Dmc1 acts as an initial barrier for heterologous recombination, with the mismatch repair system providing a second level of proofreading, to ensure that ectopic sequences are not recombined. Second, Dmc1 stepping over infrequent mismatches is likely critical for allowing recombination between the polymorphic sequences of homologous chromosomes, thus contributing to gene conversion and genetic diversity. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Tolerance of DNA Mismatches in Dmc1 Recombinase-mediated DNA Strand Exchange*

PubMed Central

Borgogno, María V.; Monti, Mariela R.; Zhao, Weixing; Sung, Patrick; Argaraña, Carlos E.; Pezza, Roberto J.

2016-01-01

Recombination between homologous chromosomes is required for the faithful meiotic segregation of chromosomes and leads to the generation of genetic diversity. The conserved meiosis-specific Dmc1 recombinase catalyzes homologous recombination triggered by DNA double strand breaks through the exchange of parental DNA sequences. Although providing an efficient rate of DNA strand exchange between polymorphic alleles, Dmc1 must also guard against recombination between divergent sequences. How DNA mismatches affect Dmc1-mediated DNA strand exchange is not understood. We have used fluorescence resonance energy transfer to study the mechanism of Dmc1-mediated strand exchange between DNA oligonucleotides with different degrees of heterology. The efficiency of strand exchange is highly sensitive to the location, type, and distribution of mismatches. Mismatches near the 3′ end of the initiating DNA strand have a small effect, whereas most mismatches near the 5′ end impede strand exchange dramatically. The Hop2-Mnd1 protein complex stimulates Dmc1-catalyzed strand exchange on homologous DNA or containing a single mismatch. We observed that Dmc1 can reject divergent DNA sequences while bypassing a few mismatches in the DNA sequence. Our findings have important implications in understanding meiotic recombination. First, Dmc1 acts as an initial barrier for heterologous recombination, with the mismatch repair system providing a second level of proofreading, to ensure that ectopic sequences are not recombined. Second, Dmc1 stepping over infrequent mismatches is likely critical for allowing recombination between the polymorphic sequences of homologous chromosomes, thus contributing to gene conversion and genetic diversity. PMID:26709229
Vampires in the oceans: predatory cercozoan amoebae in marine habitats

PubMed Central

Berney, Cédric; Romac, Sarah; Mahé, Frédéric; Santini, Sébastien; Siano, Raffaele; Bass, David

2013-01-01

Vampire amoebae (vampyrellids) are predators of algae, fungi, protozoa and small metazoans known primarily from soils and in freshwater habitats. They are among the very few heterotrophic naked, filose and reticulose protists that have received some attention from a morphological and ecological point of view over the last few decades, because of the peculiar mode of feeding of known species. Yet, the true extent of their biodiversity remains largely unknown. Here we use a complementary approach of culturing and sequence database mining to address this issue, focusing our efforts on marine environments, where vampyrellids are very poorly known. We present 10 new vampyrellid isolates, 8 from marine or brackish sediments, and 2 from soil or freshwater sediment. Two of the former correspond to the genera Thalassomyxa Grell and Penardia Cash for which sequence data were previously unavailable. Small-subunit ribosomal DNA analysis confirms they are all related to previously sequenced vampyrellids. An exhaustive screening of the NCBI GenBank database and of 454 sequence data generated by the European BioMarKs consortium revealed hundreds of distinct environmental vampyrellid sequences. We show that vampyrellids are much more diverse than previously thought, especially in marine habitats. Our new isolates, which cover almost the full phylogenetic range of vampyrellid sequences revealed in this study, offer a rare opportunity to integrate data from environmental DNA surveys with phenotypic information. However, the very large genetic diversity we highlight within vampyrellids (especially in marine sediments and soils) contrasts with the paradoxically low morphological distinctiveness we observed across our isolates. PMID:23864128
Evaluation of the genetic diversity of Plum pox virus in a single plum tree.

PubMed

Predajňa, Lukáš; Šubr, Zdeno; Candresse, Thierry; Glasa, Miroslav

2012-07-01

Genetic diversity of Plum pox virus (PPV) and its distribution within a single perennial woody host (plum, Prunus domestica) has been evaluated. A plum tree was triply infected by chip-budding with PPV-M, PPV-D and PPV-Rec isolates in 2003 and left to develop untreated under open field conditions. In September 2010 leaf and fruit samples were collected from different parts of the tree canopy. A 745-bp NIb-CP fragment of PPV genome, containing the hypervariable region encoding the CP N-terminal end was amplified by RT-PCR from each sample and directly sequenced to determine the dominant sequence. In parallel, the PCR products were cloned and a total of 105 individual clones were sequenced. Sequence analysis revealed that after 7 years of infection, only PPV-M was still detectable in the tree and that the two other isolates (PPV-Rec and PPV-D) had been displaced. Despite the fact that the analysis targeted a relatively short portion of the genome, a substantial amount of intra-isolate variability was observed for PPV-M. A total of 51 different haplotypes could be identified from the 105 individual sequences, two of which were largely dominant. However, no clear-cut structuration of the viral population by the tree architecture could be highlighted although the results obtained suggest the possibility of intra-leaf/fruit differentiation of the viral population. Comparison of the consensus sequence with the original source isolate showed no difference, suggesting within-plant stability of this original isolate under open field conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
Comparative Genomics Reveals the Diversity of Restriction-Modification Systems and DNA Methylation Sites in Listeria monocytogenes.

PubMed

Chen, Poyin; den Bakker, Henk C; Korlach, Jonas; Kong, Nguyet; Storey, Dylan B; Paxinos, Ellen E; Ashby, Meredith; Clark, Tyson; Luong, Khai; Wiedmann, Martin; Weimer, Bart C

2017-02-01

Listeria monocytogenes is a bacterial pathogen that is found in a wide variety of anthropogenic and natural environments. Genome sequencing technologies are rapidly becoming a powerful tool in facilitating our understanding of how genotype, classification phenotypes, and virulence phenotypes interact to predict the health risks of individual bacterial isolates. Currently, 57 closed L. monocytogenes genomes are publicly available, representing three of the four phylogenetic lineages, and they suggest that L. monocytogenes has high genomic synteny. This study contributes an additional 15 closed L. monocytogenes genomes that were used to determine the associations between the genome and methylome with host invasion magnitude. In contrast to previous findings, large chromosomal inversions and rearrangements were detected in five isolates at the chromosome terminus and within rRNA genes, including a previously undescribed inversion within rRNA-encoding regions. Each isolate's epigenome contained highly diverse methyltransferase recognition sites, even within the same serotype and methylation pattern. Eleven strains contained a single chromosomally encoded methyltransferase, one strain contained two methylation systems (one system on a plasmid), and three strains exhibited no methylation, despite the occurrence of methyltransferase genes. In three isolates a new, unknown DNA modification was observed in addition to diverse methylation patterns, accompanied by a novel methylation system. Neither chromosome rearrangement nor strain-specific patterns of epigenome modification observed within virulence genes were correlated with serotype designation, clonal complex, or in vitro infectivity. These data suggest that genome diversity is larger than previously considered in L. monocytogenes and that as more genomes are sequenced, additional structure and methylation novelty will be observed in this organism. Listeria monocytogenes is the causative agent of listeriosis, a disease which manifests as gastroenteritis, meningoencephalitis, and abortion. Among Salmonella, Escherichia coli, Campylobacter, and Listeria-causing the most prevalent foodborne illnesses-infection by L. monocytogenes carries the highest mortality rate. The ability of L. monocytogenes to regulate its response to various harsh environments enables its persistence and transmission. Small-scale comparisons of L. monocytogenes focusing solely on genome contents reveal a highly syntenic genome yet fail to address the observed diversity in phenotypic regulation. This study provides a large-scale comparison of 302 L. monocytogenes isolates, revealing the importance of the epigenome and restriction-modification systems as major determinants of L. monocytogenes phylogenetic grouping and subsequent phenotypic expression. Further examination of virulence genes of select outbreak strains reveals an unprecedented diversity in methylation statuses despite high degrees of genome conservation. Copyright © 2017 American Society for Microbiology.
The Hidden Diversity of Flagellated Protists in Soil.

PubMed

Venter, Paul Christiaan; Nitsche, Frank; Arndt, Hartmut

2018-07-01

Protists are among the most diverse and abundant eukaryotes in soil. However, gaps between described and sequenced protist morphospecies still present a pending problem when surveying environmental samples for known species using molecular methods. The number of sequences in the molecular PR 2 database (∼130,000) is limited compared to the species richness expected (>1 million protist species) - limiting the recovery rate. This is important, since high throughput sequencing (HTS) methods are used to find associative patterns between functional traits, taxa and environmental parameters. We performed HTS to survey soil flagellates in 150 grasslands of central Europe, and tested the recovery rate of ten previously isolated and cultivated cercomonad species, among locally found diversity. We recovered sequences for reference soil flagellate species, but also a great number of their phylogenetically evaluated genetic variants, among rare and dominant taxa with presumably own biogeography. This was recorded among dominant (cercozoans, Sandona), rare (apusozoans) and a large hidden diversity of predominantly aquatic protists in soil (choanoflagellates, bicosoecids) often forming novel clades associated with uncultured environmental sequences. Evaluating the reads, instead of the OTUs that individual reads are usually clustered into, we discovered that much of this hidden diversity may be lost due to clustering. Copyright © 2018 Elsevier GmbH. All rights reserved.
Genetic diversity and connectivity in the East African giant mud crab Scylla serrata: Implications for fisheries management.

PubMed

Rumisha, Cyrus; Huyghe, Filip; Rapanoel, Diary; Mascaux, Nemo; Kochzius, Marc

2017-01-01

The giant mud crab Scylla serrata provides an important source of income and food to coastal communities in East Africa. However, increasing demand and exploitation due to the growing coastal population, export trade, and tourism industry are threatening the sustainability of the wild stock of this species. Because effective management requires a clear understanding of the connectivity among populations, this study was conducted to assess the genetic diversity and connectivity in the East African mangrove crab S. serrata. A section of 535 base pairs of the cytochrome oxidase subunit I (COI) gene and eight microsatellite loci were analysed from 230 tissue samples of giant mud crabs collected from Kenya, Tanzania, Mozambique, Madagascar, and South Africa. Microsatellite genetic diversity (He) ranged between 0.56 and 0.6. The COI sequences showed 57 different haplotypes associated with low nucleotide diversity (current nucleotide diversity = 0.29%). In addition, the current nucleotide diversity was lower than the historical nucleotide diversity, indicating overexploitation or historical bottlenecks in the recent history of the studied population. Considering that the coastal population is growing rapidly, East African countries should promote sustainable fishing practices and sustainable use of mangrove resources to protect mud crabs and other marine fauna from the increasing pressure of exploitation. While microsatellite loci did not show significant genetic differentiation (p > 0.05), COI sequences revealed significant genetic divergence between sites on the East coast of Madagascar (ECM) and sites on the West coast of Madagascar, mainland East Africa, as well as the Seychelles. Since East African countries agreed to achieve the Convention on Biological Diversity (CBD) target to protect over 10% of their marine areas by 2020, the observed pattern of connectivity and the measured genetic diversity can serve to provide useful information for designing networks of marine protected areas.
Phylogenetic diversity of culturable fungi in the Heshang Cave, central China

PubMed Central

Man, Baiying; Wang, Hongmei; Xiang, Xing; Wang, Ruicheng; Yun, Yuan; Gong, Linfeng

2015-01-01

Caves are nutrient-limited and dark subterranean ecosystems. To date, attention has been focused on geological research of caves in China, whilst indigenous microbial diversity has been insufficiently characterized. Here, we report the fungal diversity in the pristine, oligotrophic, karst Heshang Cave, central China, using a culture-dependent method coupled with the analysis of the fungal rRNA-ITS gene sequences. A total of 194 isolates were obtained with six different media from 14 sampling sites of sediments, weathered rocks, and bat guanos. Phylogenetic analysis clustered the 194 sequenced isolates into 33 genera within 15 orders of three phyla, Ascomycota, Basidiomycota, and Zygomycota, indicating a high degree of fungal diversity in the Heshang Cave. Notably, 16 out of the 36 fungal genera were also frequently observed in solution caves around the world and 23 genera were previously found in carbonate cave, indicating potential similarities among fungal communities in cave ecosystems. However, 10 genera in this study were not reported previously in any solution caves, thus expanding our knowledge about fungal diversity in cave ecosystems. Moreover, culturable fungal diversity varied from one habitat to another within the cave, being the highest in sediments, followed by weathered rocks and bat guanos as indicated by α-diversity indexes. At the genus level, Penicillium accounted for 40, 54, and 52% in three habitats of sediments, weathered rocks, and bat guanos, respectively. Trichoderma, Paecilomyces, and Aspergillus accounted for 9, 22, and 37% in the above habitats, correspondingly. Despite of the dominance of Penicillium in all samples, β-diversity index indicated significant differences between each two fungal communities in the three habitats in view of both the composition and abundance. Our study is the first report on fungal communities in a natural pristine solution cave system in central China and sheds light on fungal diversity and functions in cave ecosystems. PMID:26539184
Flagellin diversity in Clostridium botulinum groups I and II: a new strategy for strain identification.

PubMed

Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M

2007-05-01

Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.
Capturing diversity of marine heterotrophic protists: one cell at a time

PubMed Central

Heywood, Jane L; Sieracki, Michael E; Bellows, Wendy; Poulton, Nicole J; Stepanauskas, Ramunas

2011-01-01

Recent applications of culture-independent, molecular methods have revealed unexpectedly high diversity in a variety of functional and phylogenetic groups of microorganisms in the ocean. However, none of the existing research tools are free from significant limitations, such as PCR and cloning biases, low phylogenetic resolution and others. Here, we employed novel, single-cell sequencing techniques to assess the composition of small (<10 μm diameter), heterotrophic protists from the Gulf of Maine. Single cells were isolated by flow cytometry, their genomes amplified, and 18S rRNA marker genes were amplified and sequenced. We compared the results to traditional environmental PCR cloning of sorted cells. The diversity of heterotrophic protists was significantly higher in the library of single amplified genomes (SAGs) than in environmental PCR clone libraries of the 18S rRNA gene, obtained from the same coastal sample. Libraries of SAGs, but not clones contained several recently discovered, uncultured groups, including picobiliphytes and novel marine stramenopiles. Clone, but not SAG, libraries contained several large clusters of identical and nearly identical sequences of Dinophyceae, Cercozoa and Stramenopiles. Similar results were obtained using two alternative primer sets, suggesting that PCR biases may not be the only explanation for the observed patterns. Instead, differences in the number of 18S rRNA gene copies among the various protist taxa probably had a significant role in determining the PCR clone composition. These results show that single-cell sequencing has the potential to more accurately assess protistan community composition than previously established methods. In addition, the creation of SAG libraries opens opportunities for the analysis of multiple genes or entire genomes of the uncultured protist groups. PMID:20962875
Diversity of Two-Domain Laccase-Like Multicopper Oxidase Genes in Streptomyces spp.: Identification of Genes Potentially Involved in Extracellular Activities and Lignocellulose Degradation during Composting of Agricultural Waste

PubMed Central

Lu, Lunhui; Zhang, Jiachao; Chen, Anwei; Chen, Ming; Jiang, Min; Yuan, Yujie; Wu, Haipeng; Lai, Mingyong; He, Yibin

2014-01-01

Traditional three-domain fungal and bacterial laccases have been extensively studied for their significance in various biotechnological applications. Growing molecular evidence points to a wide occurrence of more recently recognized two-domain laccase-like multicopper oxidase (LMCO) genes in Streptomyces spp. However, the current knowledge about their ecological role and distribution in natural or artificial ecosystems is insufficient. The aim of this study was to investigate the diversity and composition of Streptomyces two-domain LMCO genes in agricultural waste composting, which will contribute to the understanding of the ecological function of Streptomyces two-domain LMCOs with potential extracellular activity and ligninolytic capacity. A new specific PCR primer pair was designed to target the two conserved copper binding regions of Streptomyces two-domain LMCO genes. The obtained sequences mainly clustered with Streptomyces coelicolor, Streptomyces violaceusniger, and Streptomyces griseus. Gene libraries retrieved from six composting samples revealed high diversity and a rapid succession of Streptomyces two-domain LMCO genes during composting. The obtained sequence types cluster in 8 distinct clades, most of which are homologous with Streptomyces two-domain LMCO genes, but the sequences of clades III and VIII do not match with any reference sequence of known streptomycetes. Both lignocellulose degradation rates and phenol oxidase activity at pH 8.0 in the composting process were found to be positively associated with the abundance of Streptomyces two-domain LMCO genes. These observations provide important clues that Streptomyces two-domain LMCOs are potentially involved in bacterial extracellular phenol oxidase activities and lignocellulose breakdown during agricultural waste composting. PMID:24657870
Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life.

PubMed

Lopez, Philippe; Halary, Sébastien; Bapteste, Eric

2015-10-26

Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered.
Characterization of the bacterial biodiversity in Pico cheese (an artisanal Azorean food).

PubMed

Riquelme, Cristina; Câmara, Sandra; Dapkevicius, Maria de Lurdes N Enes; Vinuesa, Pablo; da Silva, Célia Costa Gomes; Malcata, F Xavier; Rego, Oldemiro A

2015-01-02

This work presents the first study on the bacterial communities in Pico cheese, a traditional cheese of the Azores (Portugal), made from raw cow's milk. Pyrosequencing of tagged amplicons of the V3-V4 regions of the 16S rDNA and Operational Taxonomic Unit-based (OTU-based) analysis were applied to obtain an overall idea of the microbiota in Pico cheese and to elucidate possible differences between cheese-makers (A, B and C) and maturation times. Pyrosequencing revealed a high bacterial diversity in Pico cheese. Four phyla (Firmicutes, Proteobacteria, Actinobacteria and Bacteroidetes) and 54 genera were identified. The predominant genus was Lactococcus (77% of the sequences). Sequences belonging to major cheese-borne pathogens were not found. Staphylococcus accounted for 0.5% of the sequences. Significant differences in bacterial community composition were observed between cheese-maker B and the other two units that participated in the study. However, OTU analysis identified a set of taxa (Lactococcus, Streptococcus, Acinetobacter, Enterococcus, Lactobacillus, Staphylococcus, Rothia, Pantoea and unclassified genera belonging to the Enterobacteriaceae family) that would represent the core components of artisanal Pico cheese microbiota. A diverse bacterial community was present at early maturation, with an increase in the number of phylotypes up to 2 weeks, followed by a decrease at the end of ripening. The most remarkable trend in abundance patterns throughout ripening was an increase in the number of sequences belonging to the Lactobacillus genus, with a concomitant decrease in Acinetobacter, and Stenotrophomonas. Microbial rank abundance curves showed that Pico cheese's bacterial communities are characterized by a few dominant taxa and many low-abundance, highly diverse taxa that integrate the so-called "rare biosphere". Copyright © 2014 Elsevier B.V. All rights reserved.
Archaea in the foregut of macropod marsupials: PCR and amplicon sequence-based observations.

PubMed

Klieve, A V; Ouwerkerk, D; Maguire, A J

2012-11-01

To investigate, using culture-independent techniques, the presence and diversity of methanogenic archaea in the foregut of kangaroos. DNA was extracted from forestomach contents of 42 kangaroos (three species), three sheep and three cattle. Four qualitative and quantitative PCR assays targeting the archaeal domain (16S rRNA gene) or the functional methanogenesis gene, mcrA, were used to determine the presence and population density of archaea in kangaroos and whether they were likely to be methanogens. All ruminal samples were positive for archaea, produced PCR product of expected size, contained high numbers of archaea and high numbers of cells with mcrA genes. Kangaroos were much more diverse and contradictory. Fourteen kangaroos had detectable archaea with numbers 10- to 1000-fold fewer than sheep and cattle. Many kangaroos that did not possess archaea were positive for the mcrA gene and had detectable numbers of cells with this gene and vice versa. DNA sequence analysis of kangaroos' archaeal 16S rRNA gene clones show that many methanogens were related to Methanosphaera stadmanae. Other sequences were related to non-methanogenic archaea (Thermoplasma sp.), and a number of kangaroos had mcrA gene sequences related to methane oxidising archaea (ANME). Discrepancies between qualitative and quantitative PCR assays for archaea and the mcrA gene suggest that the archaeal communities are very diverse and it is possible that novel species exist. Archaea (in general) were below detectable limits in many kangaroos, especially Red kangaroos; when present they are in lower numbers than in ruminants, and the archaea are not necessarily methanogenic. The determination of why this is the case in the kangaroo foregut could assist in reducing emissions from other ecosystems in the future. © 2012 The Authors Journal of Applied Microbiology © 2012 The Society for Applied Microbiology.
Genetic variation in eleven phase I drug metabolism genes in an ethnically diverse population.

PubMed

Solus, Joseph F; Arietta, Brenda J; Harris, James R; Sexton, David P; Steward, John Q; McMunn, Chara; Ihrie, Patrick; Mehall, Janelle M; Edwards, Todd L; Dawson, Elliott P

2004-10-01

The extent of genetic variation found in drug metabolism genes and its contribution to interindividual variation in response to medication remains incompletely understood. To better determine the identity and frequency of variation in 11 phase I drug metabolism genes, the exons and flanking intronic regions of the cytochrome P450 (CYP) isoenzyme genes CYP1A1, CYP1A2, CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4 and CYP3A5 were amplified from genomic DNA and sequenced. A total of 60 kb of bi-directional sequence was generated from each of 93 human DNAs, which included Caucasian, African-American and Asian samples. There were 388 different polymorphisms identified. These included 269 non-coding, 45 synonymous and 74 non-synonymous polymorphisms. Of these, 54% were novel and included 176 non-coding, 14 synonymous and 21 non-synonymous polymorphisms. Of the novel variants observed, 85 were represented by single occurrences of the minor allele in the sample set. Much of the variation observed was from low-frequency alleles. Comparatively, these genes are variation-rich. Calculations measuring genetic diversity revealed that while the values for the individual genes are widely variable, the overall nucleotide diversity of 7.7 x 10(-4) and polymorphism parameter of 11.5 x 10(-4) are higher than those previously reported for other gene sets. Several independent measurements indicate that these genes are under selective pressure, particularly for polymorphisms corresponding to non-synonymous amino acid changes. There is relatively little difference in measurements of diversity among the ethnic groups, but there are large differences among the genes and gene subfamilies themselves. Of the three CYP subfamilies involved in phase I drug metabolism (1, 2, and 3), subfamily 2 displays the highest levels of genetic diversity.
Hepatitis C virus quasispecies and pseudotype analysis from acute infection to chronicity in HIV-1 co-infected individuals.

PubMed

Ferns, R Bridget; Tarr, Alexander W; Hue, Stephane; Urbanowicz, Richard A; McClure, C Patrick; Gilson, Richard; Ball, Jonathan K; Nastouli, Eleni; Garson, Jeremy A; Pillay, Deenan

2016-05-01

HIV-1 infected patients who acquire HCV infection have higher rates of chronicity and liver disease progression than patients with HCV mono-infection. Understanding early events in this pathogenic process is important. We applied single genome sequencing of the E1 to NS3 regions and viral pseudotype neutralization assays to explore the consequences of viral quasispecies evolution from pre-seroconversion to chronicity in four co-infected individuals (mean follow up 566 days). We observed that one to three founder viruses were transmitted. Relatively low viral sequence diversity, possibly related to an impaired immune response, due to HIV infection was observed in three patients. However, the fourth patient, after an early purifying selection displayed increasing E2 sequence evolution, possibly related to being on suppressive antiretroviral therapy. Viral pseudotypes generated from HCV variants showed relative resistance to neutralization by autologous plasma but not to plasma collected from later time points, confirming ongoing virus escape from antibody neutralization. Copyright © 2016 Elsevier Inc. All rights reserved.
Modeling Host Genetic Regulation of Influenza Pathogenesis in the Collaborative Cross

PubMed Central

Ferris, Martin T.; Aylor, David L.; Bottomly, Daniel; Whitmore, Alan C.; Aicher, Lauri D.; Bell, Timothy A.; Bradel-Tretheway, Birgit; Bryan, Janine T.; Buus, Ryan J.; Gralinski, Lisa E.; Haagmans, Bart L.; McMillan, Leonard; Miller, Darla R.; Rosenzweig, Elizabeth; Valdar, William; Wang, Jeremy; Churchill, Gary A.; Threadgill, David W.; McWeeney, Shannon K.; Katze, Michael G.; Pardo-Manuel de Villena, Fernando; Baric, Ralph S.; Heise, Mark T.

2013-01-01

Genetic variation contributes to host responses and outcomes following infection by influenza A virus or other viral infections. Yet narrow windows of disease symptoms and confounding environmental factors have made it difficult to identify polymorphic genes that contribute to differential disease outcomes in human populations. Therefore, to control for these confounding environmental variables in a system that models the levels of genetic diversity found in outbred populations such as humans, we used incipient lines of the highly genetically diverse Collaborative Cross (CC) recombinant inbred (RI) panel (the pre-CC population) to study how genetic variation impacts influenza associated disease across a genetically diverse population. A wide range of variation in influenza disease related phenotypes including virus replication, virus-induced inflammation, and weight loss was observed. Many of the disease associated phenotypes were correlated, with viral replication and virus-induced inflammation being predictors of virus-induced weight loss. Despite these correlations, pre-CC mice with unique and novel disease phenotype combinations were observed. We also identified sets of transcripts (modules) that were correlated with aspects of disease. In order to identify how host genetic polymorphisms contribute to the observed variation in disease, we conducted quantitative trait loci (QTL) mapping. We identified several QTL contributing to specific aspects of the host response including virus-induced weight loss, titer, pulmonary edema, neutrophil recruitment to the airways, and transcriptional expression. Existing whole-genome sequence data was applied to identify high priority candidate genes within QTL regions. A key host response QTL was located at the site of the known anti-influenza Mx1 gene. We sequenced the coding regions of Mx1 in the eight CC founder strains, and identified a novel Mx1 allele that showed reduced ability to inhibit viral replication, while maintaining protection from weight loss. PMID:23468633

Centromeric enrichment of LINE-1 retrotransposons and its significance for the chromosome evolution of Phyllostomid bats.

PubMed

de Sotero-Caio, Cibele Gomes; Cabral-de-Mello, Diogo Cavalcanti; Calixto, Merilane da Silva; Valente, Guilherme Targino; Martins, Cesar; Loreto, Vilma; de Souza, Maria José; Santos, Neide

2017-10-01

Despite their ubiquitous incidence, little is known about the chromosomal distribution of long interspersed elements (LINEs) in mammalian genomes. Phyllostomid bats, characterized by lineages with distinct trends of chromosomal evolution coupled with remarkable ecological and taxonomic diversity, represent good models to understand how these repetitive sequences contribute to the evolution of genome architecture and its link to lineage diversification. To test the hypothesis that LINE-1 sequences were important modifiers of bat genome architecture, we characterized the distribution of LINE-1-derived sequences on genomes of 13 phyllostomid species within a phylogenetic framework. We found massive accumulation of LINE-1 elements in the centromeres of most species: a rare phenomenon on mammalian genomes. We hypothesize that expansion of these elements has occurred early in the radiation of phyllostomids and recurred episodically. LINE-1 expansions on centromeric heterochromatin probably spurred chromosomal change before the radiation of phyllostomids into the extant 11 subfamilies and contributed to the high degree of karyotypic variation observed among different lineages. Understanding centromere architecture in a variety of taxa promises to explain how lineage-specific changes on centromere structure can contribute to karyotypic diversity while not disrupting functional constraints for proper cell division.
When are pathogen genome sequences informative of transmission events?

PubMed Central

Ferguson, Neil; Jombart, Thibaut

2018-01-01

Recent years have seen the development of numerous methodologies for reconstructing transmission trees in infectious disease outbreaks from densely sampled whole genome sequence data. However, a fundamental and as of yet poorly addressed limitation of such approaches is the requirement for genetic diversity to arise on epidemiological timescales. Specifically, the position of infected individuals in a transmission tree can only be resolved by genetic data if mutations have accumulated between the sampled pathogen genomes. To quantify and compare the useful genetic diversity expected from genetic data in different pathogen outbreaks, we introduce here the concept of ‘transmission divergence’, defined as the number of mutations separating whole genome sequences sampled from transmission pairs. Using parameter values obtained by literature review, we simulate outbreak scenarios alongside sequence evolution using two models described in the literature to describe transmission divergence of ten major outbreak-causing pathogens. We find that while mean values vary significantly between the pathogens considered, their transmission divergence is generally very low, with many outbreaks characterised by large numbers of genetically identical transmission pairs. We describe the impact of transmission divergence on our ability to reconstruct outbreaks using two outbreak reconstruction tools, the R packages outbreaker and phybreak, and demonstrate that, in agreement with previous observations, genetic sequence data of rapidly evolving pathogens such as RNA viruses can provide valuable information on individual transmission events. Conversely, sequence data of pathogens with lower mean transmission divergence, including Streptococcus pneumoniae, Shigella sonnei and Clostridium difficile, provide little to no information about individual transmission events. Our results highlight the informational limitations of genetic sequence data in certain outbreak scenarios, and demonstrate the need to expand the toolkit of outbreak reconstruction tools to integrate other types of epidemiological data. PMID:29420641
Comparative genomics of citric-acid-producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

PubMed Central

Andersen, Mikael R.; Salazar, Margarita P.; Schaap, Peter J.; van de Vondervoort, Peter J.I.; Culley, David; Thykaer, Jette; Frisvad, Jens C.; Nielsen, Kristian F.; Albang, Richard; Albermann, Kaj; Berka, Randy M.; Braus, Gerhard H.; Braus-Stromeyer, Susanna A.; Corrochano, Luis M.; Dai, Ziyu; van Dijck, Piet W.M.; Hofmann, Gerald; Lasure, Linda L.; Magnuson, Jon K.; Menke, Hildegard; Meijer, Martin; Meijer, Susan L.; Nielsen, Jakob B.; Nielsen, Michael L.; van Ooyen, Albert J.J.; Pel, Herman J.; Poulsen, Lars; Samson, Rob A.; Stam, Hein; Tsang, Adrian; van den Brink, Johannes M.; Atkins, Alex; Aerts, Andrea; Shapiro, Harris; Pangilinan, Jasmyn; Salamov, Asaf; Lou, Yigong; Lindquist, Erika; Lucas, Susan; Grimwood, Jane; Grigoriev, Igor V.; Kubicek, Christian P.; Martinez, Diego; van Peij, Noël N.M.E.; Roubos, Johannes A.; Nielsen, Jens; Baker, Scott E.

2011-01-01

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole-genome sequencing of the acidogenic A. niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence, and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was used to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 Mb of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis supported up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases, and protein transporters in the protein producing CBS 513.88 strain. Our results and data sets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi. PMID:21543515
Global sequence variation in the histidine-rich proteins 2 and 3 of Plasmodium falciparum: implications for the performance of malaria rapid diagnostic tests

PubMed Central

2010-01-01

Background Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation. PMID:20470441
Molecular analysis of microbial diversity in corrosion samples from energy transmission towers.

PubMed

Oliveira, Valéria M; Lopes-Oliveira, Patrícia F; Passarini, Michel R Z; Menezes, Claudia B A; Oliveira, Walter R C; Rocha, Adriano J; Sette, Lara D

2011-04-01

Microbial diversity in corrosion samples from energy transmission towers was investigated using molecular methods. Ribosomal DNA fragments were used to assemble gene libraries. Sequence analysis indicated 10 bacterial genera within the phyla Proteobacteria, Firmicutes, Actinobacteria and Bacteroidetes. In the two libraries generated from corroded screw-derived samples, the genus Acinetobacter was the most abundant. Acinetobacter and Clostridium spp. dominated, with similar percentages, in the libraries derived from corrosion scrapings. Fungal clones were affiliated with 14 genera belonging to the phyla Ascomycota and Basidiomycota; of these, Capnobotryella and Fellomyces were the most abundant fungi observed. Several of the microorganisms had not previously been associated with biofilms and corrosion, reinforcing the need to use molecular techniques to achieve a more comprehensive assessment of microbial diversity in environmental samples.
Population genetic implications from sequence variation in four Y chromosome genes.

PubMed

Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

2000-06-20

Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.
Genetic Diversity and Reassortment of Hantaan Virus Tripartite RNA Genomes in Nature, the Republic of Korea

PubMed Central

Kim, Jeong-Ah; Kim, Won-keun; No, Jin Sun; Lee, Seung-Ho; Lee, Sook-Young; Kim, Ji Hye; Kho, Jeong Hoon; Lee, Daesang; Song, Dong Hyun; Gu, Se Hun; Jeong, Seong Tae; Park, Man-Seong; Kim, Heung-Chul; Klein, Terry A.; Song, Jin-Won

2016-01-01

Background Hantaan virus (HTNV), a negative sense tripartite RNA virus of the Family Bunyaviridae, is the most prevalent hantavirus in the Republic of Korea (ROK). It is the causative agent of Hemorrhagic Fever with Renal Syndrome (HFRS) in humans and maintained in the striped field mouse, Apodemus agrarius, the primary zoonotic host. Clinical HFRS cases have been reported commonly in HFRS-endemic areas of Gyeonggi province. Recently, the death of a member of the ROK military from Gangwon province due to HFRS prompted an investigation of the epidemiology and distribution of hantaviruses in Gangwon and Gyeonggi provinces that border the demilitarized zone separating North and South Korea. Methodology and Principal Findings To elucidate the geographic distribution and molecular diversity of HTNV, whole genome sequences of HTNV Large (L), Medium (M), and Small (S) segments were acquired from lung tissues of A. agrarius captured from 2003–2014. Consistent with the clinical incidence of HFRS established by the Korea Centers for Disease Control & Prevention (KCDC), the prevalence of HTNV in naturally infected mice in Gangwon province was lower than for Gyeonggi province. Whole genomic sequences of 34 HTNV strains were identified and a phylogenetic analysis showed geographic diversity of the virus in the limited areas. Reassortment analysis first suggested an occurrence of genetic exchange of HTNV genomes in nature, ROK. Conclusion/Significance This study is the first report to demonstrate the molecular prevalence of HTNV in Gangwon province. Whole genome sequencing of HTNV showed well-supported geographic lineages and the molecular diversity in the northern region of ROK due to a natural reassortment of HTNV genomes. These observations contribute to a better understanding of the genetic diversity and molecular evolution of hantaviruses. Also, the full-length of HTNV tripartite genomes will provide a database for phylogeographic analysis of spatial and temporal outbreaks of hantavirus infection. PMID:27315053
In-depth genome analyses of viruses from vaccine-derived rabies cases and corresponding live-attenuated oral rabies vaccines.

PubMed

Pfaff, Florian; Müller, Thomas; Freuling, Conrad M; Fehlner-Gardiner, Christine; Nadin-Davis, Susan; Robardet, Emmanuelle; Cliquet, Florence; Vuta, Vlad; Hostnik, Peter; Mettenleiter, Thomas C; Beer, Martin; Höper, Dirk

2018-02-10

Live-attenuated rabies virus strains such as those derived from the field isolate Street Alabama Dufferin (SAD) have been used extensively and very effectively as oral rabies vaccines for the control of fox rabies in both Europe and Canada. Although these vaccines are safe, some cases of vaccine-derived rabies have been detected during rabies surveillance accompanying these campaigns. In recent analysis it was shown that some commercial SAD vaccines consist of diverse viral populations, rather than clonal genotypes. For cases of vaccine-derived rabies, only consensus sequence data have been available to date and information concerning their population diversity was thus lacking. In our study, we used high-throughput sequencing to analyze 11 cases of vaccine-derived rabies, and compared their viral population diversity to the related oral rabies vaccines using pairwise Manhattan distances. This extensive deep sequencing analysis of vaccine-derived rabies cases observed during oral vaccination programs provided deeper insights into the effect of accidental in vivo replication of genetically diverse vaccine strains in the central nervous system of target and non-target species under field conditions. The viral population in vaccine-derived cases appeared to be clonal in contrast to their parental vaccines. The change from a state of high population diversity present in the vaccine batches to a clonal genotype in the affected animal may indicate the presence of a strong bottleneck during infection. In conclusion, it is very likely that these few cases are the consequence of host factors and not the result of the selection of a more virulent genotype. Furthermore, this type of vaccine-derived rabies leads to the selection of clonal genotypes and the selected variants were genetically very similar to potent SAD vaccines that have undergone a history of in vitro selection. Copyright © 2018. Published by Elsevier Ltd.
Elaeis oleifera Genomic-SSR Markers: Exploitation in Oil Palm Germplasm Diversity and Cross-Amplification in Arecaceae

PubMed Central

Zaki, Noorhariza Mohd; Singh, Rajinder; Rosli, Rozana; Ismail, Ismanizan

2012-01-01

Species-specific simple sequence repeat (SSR) markers are favored for genetic studies and marker-assisted selection (MAS) breeding for oil palm genetic improvement. This report characterizes 20 SSR markers from an Elaeis oleifera genomic library (gSSR). Characterization of the repeat type in 2000 sequences revealed a high percentage of di-nucleotides (63.6%), followed by tri-nucleotides (24.2%). Primer pairs were successfully designed for 394 of the E. oleifera gSSRs. Subsequent analysis showed the ability of the 20 selected E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The average Polymorphism Information Content (PIC) value for the SSRs was 0.402, with the tri-repeats showing the highest average PIC (0.626). Low values of observed heterozygosity (Ho) (0.164) and highly positive fixation indices (Fis) in the E. oleifera germplasm collection, compared to the E. guineensis, indicated an excess of homozygosity in E. oleifera. The transferability of the markers to closely related palms, Elaeis guineensis, Cocos nucifera and ornamental palms is also reported. Sequencing the amplicons of three selected E. oleifera gSSRs across both species and palm taxa revealed variations in the repeat-units. The study showed the potential of E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The markers are also a valuable genetic resource for studying E. oleifera and other genus in the Arecaceae family. PMID:22605966
Bacterial diversity characterization in petroleum samples from Brazilian reservoirs

PubMed Central

de Oliveira, Valéria Maia; Sette, Lara Durães; Simioni, Karen Christina Marques; dos Santos Neto, Eugênio Vaz

2008-01-01

This study aimed at evaluating potential differences among the bacterial communities from formation water and oil samples originated from biodegraded and non-biodegraded Brazilian petroleum reservoirs by using a PCR-DGGE based approach. Environmental DNA was isolated and used in PCR reactions with bacterial primers, followed by separation of 16S rDNA fragments in the DGGE. PCR products were also cloned and sequenced, aiming at the taxonomic affiliation of the community members. The fingerprints obtained allowed the direct comparison among the bacterial communities from oil samples presenting distinct degrees of biodegradation, as well as between the communities of formation water and oil sample from the non-biodegraded reservoir. Very similar DGGE band profiles were observed for all samples, and the diversity of the predominant bacterial phylotypes was shown to be low. Cloning and sequencing results revealed major differences between formation water and oil samples from the non-biodegraded reservoir. Bacillus sp. and Halanaerobium sp. were shown to be the predominant components of the bacterial community from the formation water sample, whereas the oil sample also included Alicyclobacillus acidoterrestris, Rhodococcus sp., Streptomyces sp. and Acidithiobacillus ferrooxidans. The PCR-DGGE technique, combined with cloning and sequencing of PCR products, revealed the presence of taxonomic groups not found previously in these samples when using cultivation-based methods and 16S rRNA gene library assembly, confirming the need of a polyphasic study in order to improve the knowledge of the extent of microbial diversity in such extreme environments. PMID:24031244
Identification, genetic localization, and allelic diversity of selectively amplified microsatellite polymorphic loci in lettuce and wild relatives (Lactuca spp.).

PubMed

Witsenboer, H; Michelmore, R W; Vogel, J

1997-12-01

Selectively amplified microsatellite polymorphic locus (SAMPL) analysis is a method of amplifying microsatellite loci using generic PCR primers. SAMPL analysis uses one AFLP primer in combination with a primer complementary to microsatellite sequences. SAMPL primers based on compound microsatellite sequences provided the clearest amplification patterns. We explored the potential of SAMPL analysis in lettuce to detect PCR-based codominant microsatellite markers. Fifty-eight SAMPLs were identified and placed on the genetic map. Seventeen were codominant. SAMPLs were dispersed with RFLP markers on 11 of the 12 main linkage groups in lettuce, indicating that they have a similar genomic distribution. Some but not all fragments amplified by SAMPL analysis were confirmed to contain microsatellite sequences by Southern hybridization. Forty-five cultivars of lettuce and five wild species of Lactuca were analyzed to determine the allelic diversity for codominant SAMPLs. From 3 to 11 putative alleles were found for each SAMPL; 2-6 alleles were found within Lactuca sativa and 1-3 alleles were found among the crisphead genotypes, the most genetically homogeneous plant type of L. sativa. This allelic diversity is greater than that found for RFLP markers. Numerous new alleles were observed in the wild species; however, there were frequent null alleles. Therefore, SAMPL analysis is more applicable to intraspecific than to interspecific comparisons. A phenetic analysis based on SAMPLs resulted in a dendrogram similar to those based on RFLP and AFLP markers.
Genetic analysis of the Hungarian draft horse population using partial mitochondrial DNA D-loop sequencing.

PubMed

Csizmár, Nikolett; Mihók, Sándor; Jávor, András; Kusza, Szilvia

2018-01-01

The Hungarian draft is a horse breed with a recent mixed ancestry created in the 1920s by crossing local mares with draught horses imported from France and Belgium. The interest in its conservation and characterization has increased over the last few years. The aim of this work is to contribute to the characterization of the endangered Hungarian heavy draft horse populations in order to obtain useful information to implement conservation strategies for these genetic stocks. To genetically characterize the breed and to set up the basis for a conservation program, in the present study a hypervariable region of the mitochrondial DNA (D-loop) was used to assess genetic diversity in Hungarian draft horses. Two hundred and eighty five sequences obtained in our laboratory and 419 downloaded sequences available from Genbank were analyzed. One hundred and sixty-four haplotypes and thirty-six polymorphic sites were observed. High haplotype and nucleotide diversity values ( H d = 0.954 ± 0.004; π = 0.028 ± 0.0004) were identified in Hungarian population, although they were higher within than among the different populations ( H d = 0.972 ± 0.002; π = 0.03097 ± 0.002). Fourteen of the previously observed seventeen haplogroups were detected. Our samples showed a large intra- and interbreed variation. There was no clear clustering on the median joining network figure. The overall information collected in this work led us to consider that the genetic scenario observed for Hungarian draft breed is more likely the result of contributions from 'ancestrally' different genetic backgrounds. This study could contribute to the development of a breeding plan for Hungarian draft horses and help to formulate a genetic conservation plan, avoiding inbreeding while.
Concentration and diversity of uncultured Legionella spp. in two unchlorinated drinking water supplies with different concentrations of natural organic matter.

PubMed

Wullings, Bart A; Bakker, Geo; van der Kooij, Dick

2011-01-01

Two unchlorinated drinking water supplies were investigated to assess the potential of water treatment and distribution systems to support the growth of Legionella spp. The treatment plant for supply A distributed treated groundwater with a low concentration (<0.5 ppm of C) of natural organic matter (NOM), and the treatment plant for supply B distributed treated groundwater with a high NOM concentration (8 ppm of C). In both supplies, the water temperature ranged from about 10°C after treatment to 18°C during distribution. The concentrations of Legionella spp. in distributed water, analyzed with quantitative PCR (Q-PCR), averaged 2.9 (± 1.9) × 10(2) cells liter(-1) in supply A and 2.5 (± 1.6) × 10(3) cells liter(-1) in supply B. No Legionella was observed with the culture method. A total of 346 clones (96 operational taxonomical units [OTUs] with ≥97% sequence similarity) were retrieved from water and biofilms of supply A and 251 (43 OTUs) from supply B. The estimation of the average value of total species richness (Chao1) in supply A (153) was clearly higher than that for supply B (58). In each supply, about 77% of the sequences showed <97% similarity to described species. Sequences related to L. pneumophila were only incidentally observed. The Legionella populations of the two supplies are divided into two distinct clusters based on distances in the phylogenetic tree as fractions of the branch length. Thus, a large variety of mostly yet-undescribed Legionella spp. proliferates in unchlorinated water supplies at temperatures below 18°C. The lowest concentration and greatest diversity were observed in the supply with the low NOM concentration.
Genetic Diversity among Clostridium botulinum Strains Harboring bont/A2 and bont/A3 Genes

PubMed Central

Raphael, Brian H.; Joseph, Lavin A.; Meno, Sarah R.; Fernández, Rafael A.; Maslanka, Susan E.

2012-01-01

Clostridium botulinum type A strains are known to be genetically diverse and widespread throughout the world. Genetic diversity studies have focused mainly on strains harboring one type A botulinum toxin gene, bont/A1, although all reported bont/A gene variants have been associated with botulism cases. Our study provides insight into the genetic diversity of C. botulinum type A strains, which contain bont/A2 (n = 42) and bont/A3 (n = 4) genes, isolated from diverse samples and geographic origins. Genetic diversity was assessed by using bont nucleotide sequencing, content analysis of the bont gene clusters, multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). Sequences of bont genes obtained in this study showed 99.9 to 100% identity with other bont/A2 or bont/A3 gene sequences available in public databases. The neurotoxin gene clusters of the subtype A2 and A3 strains analyzed in this study were similar in gene content. C. botulinum strains harboring bont/A2 and bont/A3 genes were divided into six and two MLST profiles, respectively. Four groups of strains shared a similarity of at least 95% by PFGE; the largest group included 21 out of 46 strains. The strains analyzed in this study showed relatively limited genetic diversity using either MLST or PFGE. PMID:23042179
Genetic diversity and population structure of Lactobacillus delbrueckii subspecies bulgaricus isolated from naturally fermented dairy foods

PubMed Central

Song, Yuqin; Sun, Zhihong; Guo, Chenyi; Wu, Yarong; Liu, Wenjun; Yu, Jie; Menghe, Bilige; Yang, Ruifu; Zhang, Heping

2016-01-01

Lactobacillus delbrueckii subsp. bulgaricus is one of the most widely used starter culture strains in industrial fermented dairy manufacture. It is also common in naturally fermented dairy foods made using traditional methods. The subsp. bulgaricus strains found in naturally fermented foods may be useful for improving current industrial starter cultures; however, little is known regarding its genetic diversity and population structure. Here, a collection of 298 L. delbrueckii strains from naturally fermented products in Mongolia, Russia, and West China was analyzed by multi-locus sequence typing based on eight conserved genes. The 251 confirmed subsp. bulgaricus strains produced 106 unique sequence types, the majority of which were assigned to five clonal complexes (CCs). The geographical distribution of CCs was uneven, with CC1 dominated by Mongolian and Russian isolates, and CC2–CC5 isolates exclusively from Xinjiang, China. Population structure analysis suggested six lineages, L1–L6, with various homologous recombination rates. Although L2–L5 were mainly restricted within specific regions, strains belonging to L1 and L6 were observed in diverse regions, suggesting historical transmission events. These results greatly enhance our knowledge of the population diversity of subsp. bulgaricus strains, and suggest that strains from CC1 and L4 may be useful as starter strains in industrial fermentation. PMID:26940047
Genetic diversity of rice tungro spherical virus in tungro-endemic provinces of the Philippines and Indonesia.

PubMed

Azzam, O; Yambao, M L; Muhsin, M; McNally, K L; Umadhay, K M

2000-01-01

The two adjacent genes of coat protein 1 and 2 of rice tungro spherical virus (RTSV) were amplified from total RNA extracts of serologically indistinguishable field isolates from the Philippines and Indonesia, using reverse transcriptase polymerase chain reaction (RT-PCR). Digestion with HindIII and BstYI restriction endonucleases differentiated the amplified DNA products into eight distinct coat protein genotypes. These genotypes were then used as indicators of virus diversity in the field. Inter- and intra-site diversities were determined over three cropping seasons. At each of the sites surveyed, one or two main genotypes prevailed together with other related minor or mixed genotypes that did not replace the main genotype over the sampling time. The cluster of genotypes found at the Philippines sites was significantly different from the one at the Indonesia sites, suggesting geographic isolation for virus populations. Phylogenetic studies based on the nucleotide sequences of 38 selected isolates confirm the spatial distribution of RTSV virus populations but show that gene flow may occur between populations. Under the present conditions, rice varieties do not seem to exert selective pressure on the virus populations. Based on the selective constraints in the coat protein amino acid sequences and the virus genetic composition per site, a negative selection model followed by random-sampling events due to vector transmissions is proposed to explain the inter-site diversity observed.
Population genetic diversity and genetic structure of Spodoptera exigua around the Bohai Gulf area of China based on mitochondrial DNA signatures.

PubMed

Zhou, L-H; Wang, X-Y; Lei, J-J

2016-09-30

The beet armyworm, Spodoptera exigua (Lepidoptera: Noctuidae), is an economically important pest that causes major losses in some main crop-producing areas of China. To control this pest effectively, it is necessary to investigate its population genetic diversity and genetic structure around the Bohai Gulf area of China. In this study, we used two mitochondrial genes, COI (578 bp) and Cytb (724 bp), to investigate its genetic diversity. We obtained 622 COI sequences and 462 Cytb sequences from 23 populations, and 28 and 73 haplotypes, respectively, were identified. Low to moderate levels of genetic diversity (COI: Hd = 0.267 ± 0.023, Pi = 0.00082 ± 0.00010; Cytb: Hd = 0.689 ± 0.018, Pi = 0.00255 ± 0.00029) for the total populations were observed. Phylogenetic and median-joining network analyses indicated no distinct geographical distribution pattern among the haplotypes. Overall, this study revealed that there was significant differentiation among the populations (COI: F ST = 0.158, P < 0.001; Cytb: F ST = 0.148, P < 0.001). F ST values for Shenyang, Baoding, and Funing were significantly different to those for most of the other populations. Finally, unimodal mismatch distribution analysis, combined with negative neutrality test results, showed a recent population expansion of the beet armyworm around the Bohai Gulf area of China.
Genomic analysis of bluetongue virus episystems in Australia and Indonesia.

PubMed

Firth, Cadhla; Blasdell, Kim R; Amos-Ritchie, Rachel; Sendow, Indrawati; Agnihotri, Kalpana; Boyle, David B; Daniels, Peter; Kirkland, Peter D; Walker, Peter J

2017-11-23

The distribution of bluetongue viruses (BTV) in Australia is represented by two distinct and interconnected epidemiological systems (episystems)-one distributed primarily in the north and one in the east. The northern episystem is characterised by substantially greater antigenic diversity than the eastern episystem; yet the forces that act to limit the diversity present in the east remain unclear. Previous work has indicated that the northern episystem is linked to that of island South East Asia and Melanesia, and that BTV present in Indonesia, Papua New Guinea and East Timor, may act as source populations for new serotypes and genotypes of BTV to enter Australia's north. In this study, the genomes of 49 bluetongue viruses from the eastern episystem and 13 from Indonesia were sequenced and analysed along with 27 previously published genome sequences from the northern Australian episystem. The results of this analysis confirm that the Australian BTV population has its origins in the South East Asian/Melanesian episystem, and that incursions into northern Australia occur with some regularity. In addition, the presence of limited genetic diversity in the eastern episystem relative to that found in the north supports the presence of substantial, but not complete, barriers to gene flow between the northern and eastern Australian episystems. Genetic bottlenecks between each successive episystem are evident, and appear to be responsible for the reduction in BTV genetic diversity observed in the north to south-east direction.
Cow teat skin, a potential source of diverse microbial populations for cheese production.

PubMed

Verdier-Metz, Isabelle; Gagne, Geneviève; Bornes, Stéphanie; Monsallier, Françoise; Veisseire, Philippe; Delbès-Paus, Céline; Montel, Marie-Christine

2012-01-01

The diversity of the microbial community on cow teat skin was evaluated using a culture-dependent method based on the use of different dairy-specific media, followed by the identification of isolates by 16S rRNA gene sequencing. This was combined with a direct molecular approach by cloning and 16S rRNA gene sequencing. This study highlighted the large diversity of the bacterial community that may be found on teat skin, where 79.8% of clones corresponded to various unidentified species as well as 66 identified species, mainly belonging to those commonly found in raw milk (Enterococcus, Pediococcus, Enterobacter, Pantoea, Aerococcus, and Staphylococcus). Several of them, such as nonstarter lactic acid bacteria (NSLAB), Staphylococcus, and Actinobacteria, may contribute to the development of the sensory characteristics of cheese during ripening. Therefore, teat skin could be an interesting source or vector of biodiversity for milk. Variations of microbial counts and diversity between the farms studied have been observed. Moreover, Staphylococcus auricularis, Staphylococcus devriesei, Staphylococcus arlettae, Streptococcus bovis, Streptococcus equinus, Clavibacter michiganensis, Coprococcus catus, or Arthrobacter gandavensis commensal bacteria of teat skin and teat canal, as well as human skin, are not common in milk, suggesting that there is a breakdown of microbial flow from animal to milk. It would then be interesting to thoroughly study this microbial flow from teat to milk.
Genetic variations in merozoite surface antigen genes of Babesia bovis detected in Vietnamese cattle and water buffaloes.

PubMed

Yokoyama, Naoaki; Sivakumar, Thillaiampalam; Tuvshintulga, Bumduuren; Hayashida, Kyoko; Igarashi, Ikuo; Inoue, Noboru; Long, Phung Thang; Lan, Dinh Thi Bich

2015-03-01

The genes that encode merozoite surface antigens (MSAs) in Babesia bovis are genetically diverse. In this study, we analyzed the genetic diversity of B. bovis MSA-1, MSA-2b, and MSA-2c genes in Vietnamese cattle and water buffaloes. Blood DNA samples from 258 cattle and 49 water buffaloes reared in the Thua Thien Hue province of Vietnam were screened with a B. bovis-specific diagnostic PCR assay. The B. bovis-positive DNA samples (23 cattle and 16 water buffaloes) were then subjected to PCR assays to amplify the MSA-1, MSA-2b, and MSA-2c genes. Sequencing analyses showed that the Vietnamese MSA-1 and MSA-2b sequences are genetically diverse, whereas MSA-2c is relatively conserved. The nucleotide identity values for these MSA gene sequences were similar in the cattle and water buffaloes. Consistent with the sequencing data, the Vietnamese MSA-1 and MSA-2b sequences were dispersed across several clades in the corresponding phylogenetic trees, whereas the MSA-2c sequences occurred in a single clade. Cattle- and water-buffalo-derived sequences also often clustered together on the phylogenetic trees. The Vietnamese MSA-1, MSA-2b, and MSA-2c sequences were then screened for recombination with automated methods. Of the seven recombination events detected, five and two were associated with the MSA-2b and MSA-2c recombinant sequences, respectively, whereas no MSA-1 recombinants were detected among the sequences analyzed. Recombination between the sequences derived from cattle and water buffaloes was very common, and the resultant recombinant sequences were found in both host animals. These data indicate that the genetic diversity of the MSA sequences does not differ between cattle and water buffaloes in Vietnam. They also suggest that recombination between the B. bovis MSA sequences in both cattle and water buffaloes might contribute to the genetic variation in these genes in Vietnam. Copyright © 2015 Elsevier B.V. All rights reserved.

Foliar fungi of Betula pendula: impact of tree species mixtures and assessment methods

PubMed Central

Nguyen, Diem; Boberg, Johanna; Cleary, Michelle; Bruelheide, Helge; Hönig, Lydia; Koricheva, Julia; Stenlid, Jan

2017-01-01

Foliar fungi of silver birch (Betula pendula) in an experimental Finnish forest were investigated across a gradient of tree species richness using molecular high-throughput sequencing and visual macroscopic assessment. We hypothesized that the molecular approach detects more fungal taxa than visual assessment, and that there is a relationship among the most common fungal taxa detected by both techniques. Furthermore, we hypothesized that the fungal community composition, diversity, and distribution patterns are affected by changes in tree diversity. Sequencing revealed greater diversity of fungi on birch leaves than the visual assessment method. One species showed a linear relationship between the methods. Species-specific variation in fungal community composition could be partially explained by tree diversity, though overall fungal diversity was not affected by tree diversity. Analysis of specific fungal taxa indicated tree diversity effects at the local neighbourhood scale, where the proportion of birch among neighbouring trees varied, but not at the plot scale. In conclusion, both methods may be used to determine tree diversity effects on the foliar fungal community. However, high-throughput sequencing provided higher resolution of the fungal community, while the visual macroscopic assessment detected functionally active fungal species. PMID:28150710
Classification of Cowpox Viruses into Several Distinct Clades and Identification of a Novel Lineage

PubMed Central

Franke, Annika; Pfaff, Florian; Jenckel, Maria; Hoffmann, Bernd; Höper, Dirk; Antwerpen, Markus; Meyer, Hermann; Beer, Martin; Hoffmann, Donata

2017-01-01

Cowpox virus (CPXV) was considered as uniform species within the genus Orthopoxvirus (OPV). Previous phylogenetic analysis indicated that CPXV is polyphyletic and isolates may cluster into different clades with two of these clades showing genetic similarities to either variola (VARV) or vaccinia viruses (VACV). Further analyses were initiated to assess both the genetic diversity and the evolutionary background of circulating CPXVs. Here we report the full-length sequences of 20 CPXV strains isolated from different animal species and humans in Germany. A phylogenetic analysis of altogether 83 full-length OPV genomes confirmed the polyphyletic character of the species CPXV and suggested at least four different clades. The German isolates from this study mainly clustered into two CPXV-like clades, and VARV- and VACV-like strains were not observed. A single strain, isolated from a cotton-top tamarin, clustered distantly from all other CPXVs and might represent a novel and unique evolutionary lineage. The classification of CPXV strains into clades roughly followed their geographic origin, with the highest clade diversity so far observed for Germany. Furthermore, we found evidence for recombination between OPV clades without significant disruption of the observed clustering. In conclusion, this analysis markedly expands the number of available CPXV full-length sequences and confirms the co-circulation of several CPXV clades in Germany, and provides the first data about a new evolutionary CPXV lineage. PMID:28604604
Intrastrain heterogeneity of the mgpB gene in Mycoplasma genitalium is extensive in vitro and in vivo and suggests that variation is generated via recombination with repetitive chromosomal sequences.

PubMed

Iverson-Cabral, Stefanie L; Astete, Sabina G; Cohen, Craig R; Rocha, Eduardo P C; Totten, Patricia A

2006-07-01

Mycoplasma genitalium is associated with reproductive tract disease in women and may persist in the lower genital tract for months, potentially increasing the risk of upper tract infection and transmission to uninfected partners. Despite its exceptionally small genome (580 kb), approximately 4% is composed of repeated elements known as MgPar sequences (MgPa repeats) based on their homology to the mgpB gene that encodes the immunodominant MgPa adhesin protein. The presence of these MgPar sequences, as well as mgpB variability between M. genitalium strains, suggests that mgpB and MgPar sequences recombine to produce variant MgPa proteins. To examine the extent and generation of diversity within single strains of the organism, we examined mgpB variation within M. genitalium strain G-37 and observed sequence heterogeneity that could be explained by recombination between the mgpB expression site and putative donor MgPar sequences. Similarly, we analyzed mgpB sequences from cervical specimens from a persistently infected woman (21 months) and identified 17 different mgpB variants within a single infecting M. genitalium strain, confirming that mgpB heterogeneity occurs over the course of a natural infection. These observations support the hypothesis that recombination occurs between the mgpB gene and MgPar sequences and that the resulting antigenically distinct MgPa variants may contribute to immune evasion and persistence of infection.
Screening and Characterization of RAPD Markers in Viscerotropic Leishmania Parasites

PubMed Central

Mkada–Driss, Imen; Talbi, Chiraz; Guerbouj, Souheila; Driss, Mehdi; Elamine, Elwaleed M.; Cupolillo, Elisa; Mukhtar, Moawia M.; Guizani, Ikram

2014-01-01

Visceral leishmaniasis (VL) is mainly due to the Leishmania donovani complex. VL is endemic in many countries worldwide including East Africa and the Mediterranean region where the epidemiology is complex. Taxonomy of these pathogens is under controversy but there is a correlation between their genetic diversity and geographical origin. With steady increase in genome knowledge, RAPD is still a useful approach to identify and characterize novel DNA markers. Our aim was to identify and characterize polymorphic DNA markers in VL Leishmania parasites in diverse geographic regions using RAPD in order to constitute a pool of PCR targets having the potential to differentiate among the VL parasites. 100 different oligonucleotide decamers having arbitrary DNA sequences were screened for reproducible amplification and a selection of 28 was used to amplify DNA from 12 L. donovani, L. archibaldi and L. infantum strains having diverse origins. A total of 155 bands were amplified of which 60.65% appeared polymorphic. 7 out of 28 primers provided monomorphic patterns. Phenetic analysis allowed clustering the parasites according to their geographical origin. Differentially amplified bands were selected, among them 22 RAPD products were successfully cloned and sequenced. Bioinformatic analysis allowed mapping of the markers and sequences and priming sites analysis. This study was complemented with Southern-blot to confirm assignment of markers to the kDNA. The bioinformatic analysis identified 16 nuclear and 3 minicircle markers. Analysis of these markers highlighted polymorphisms at RAPD priming sites with mainly 5′ end transversions, and presence of inter– and intra– taxonomic complex sequence and microsatellites variations; a bias in transitions over transversions and indels between the different sequences compared is observed, which is however less marked between L. infantum and L. donovani. The study delivers a pool of well-documented polymorphic DNA markers, to develop molecular diagnostics assays to characterize and differentiate VL causing agents. PMID:25313833
Stability of operational taxonomic units: an important but neglected property for analyzing microbial diversity.

PubMed

He, Yan; Caporaso, J Gregory; Jiang, Xiao-Tao; Sheng, Hua-Fang; Huse, Susan M; Rideout, Jai Ram; Edgar, Robert C; Kopylova, Evguenia; Walters, William A; Knight, Rob; Zhou, Hong-Wei

2015-01-01

The operational taxonomic unit (OTU) is widely used in microbial ecology. Reproducibility in microbial ecology research depends on the reliability of OTU-based 16S ribosomal subunit RNA (rRNA) analyses. Here, we report that many hierarchical and greedy clustering methods produce unstable OTUs, with membership that depends on the number of sequences clustered. If OTUs are regenerated with additional sequences or samples, sequences originally assigned to a given OTU can be split into different OTUs. Alternatively, sequences assigned to different OTUs can be merged into a single OTU. This OTU instability affects alpha-diversity analyses such as rarefaction curves, beta-diversity analyses such as distance-based ordination (for example, Principal Coordinate Analysis (PCoA)), and the identification of differentially represented OTUs. Our results show that the proportion of unstable OTUs varies for different clustering methods. We found that the closed-reference method is the only one that produces completely stable OTUs, with the caveat that sequences that do not match a pre-existing reference sequence collection are discarded. As a compromise to the factors listed above, we propose using an open-reference method to enhance OTU stability. This type of method clusters sequences against a database and includes unmatched sequences by clustering them via a relatively stable de novo clustering method. OTU stability is an important consideration when analyzing microbial diversity and is a feature that should be taken into account during the development of novel OTU clustering methods.
Population diversity of Diaphorina citri (Hemiptera: Liviidae) in China based on whole mitochondrial genome sequences.

PubMed

Wu, Fengnian; Jiang, Hongyan; Beattie, G Andrew C; Holford, Paul; Chen, Jianchi; Wallis, Christopher M; Zheng, Zheng; Deng, Xiaoling; Cen, Yijing

2018-04-24

Diaphorina citri (Asian citrus psyllid; ACP) transmits 'Candidatus Liberibacter asiaticus' associated with citrus Huanglongbing (HLB). ACP has been reported in 11 provinces/regions in China, yet its population diversity remains unclear. In this study, we evaluated ACP population diversity in China using representative whole mitochondrial genome (mitogenome) sequences. Additional mitogenome sequences outside China were also acquired and evaluated. The sizes of the 27 ACP mitogenome sequences ranged from 14 986 to 15 030 bp. Along with three previously published mitogenome sequences, the 30 sequences formed three major mitochondrial groups (MGs): MG1, present in southwestern China and occurring at elevations above 1000 m; MG2, present in southeastern China and Southeast Asia (Cambodia, Indonesia, Malaysia, and Vietnam) and occurring at elevations below 180 m; and MG3, present in the USA and Pakistan. Single nucleotide polymorphisms in five genes (cox2, atp8, nad3, nad1 and rrnL) contributed mostly in the ACP diversity. Among these genes, rrnL had the most variation. Mitogenome sequences analyses revealed two major phylogenetic groups of ACP present in China as well as a possible unique group present currently in Pakistan and the USA. The information could have significant implications for current ACP control and HLB management. © 2018 Society of Chemical Industry. © 2018 Society of Chemical Industry.
Microbial and functional diversity of a subterrestrial high pH groundwater associated to serpentinization.

PubMed

Tiago, Igor; Veríssimo, António

2013-06-01

Microbial and functional diversity were assessed, from a serpentinization-driven subterrestrial alkaline aquifer - Cabeço de Vide Aquifer (CVA) in Portugal. DGGE analyses revealed the presence of a stable microbial community. By 16S rRNA gene libraries and pyrosequencing analyses, a diverse bacterial composition was determined, contrasting with low archaeal diversity. Within Bacteria the majority of the populations were related to organisms or sequences affiliated to class Clostridia, but members of classes Acidobacteria, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Deinococci, Gammaproteobacteria and of the phyla Bacteroidetes, Chloroflexi and Nitrospira were also detected. Domain Archaea encompassed mainly sequences affiliated to Euryarchaeota. Only form I RuBisCO - cbbL was detected. Autotrophic carbon fixation via the rTCA, 3-HP and 3-HP/4H-B cycles could not be confirmed. The detected APS reductase alpha subunit - aprA sequences were phylogenetically related to sequences of sulfate-reducing bacteria belonging to Clostridia, and also to sequences of chemolithoautothrophic sulfur-oxidizing bacteria belonging to Betaproteobacteria. Sequences of methyl coenzyme M reductase - mcrA were phylogenetically affiliated to sequences belonging to Anaerobic Methanotroph group 1 (ANME-1). The populations found and the functional key markers detected in CVA suggest that metabolisms related to H2 , methane and/or sulfur may be the major driving forces in this environment. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
Phylogenetic and ecological analyses of soil and sporocarp DNA sequences reveal high diversity and strong habitat partitioning in the boreal ectomycorrhizal genus Russula (Russulales; Basidiomycota)

Treesearch

József Geml; Gary A. Laursen; Ian C. Herriott; Jack M. McFarland; Michael G. Booth; Niall Lennon; H. Chad Nusbaum; D. Lee Taylor

2010-01-01

Although critical for the functioning of ecosystems, fungi are poorly known in high-latitude regions. Here, we provide the first genetic diversity assessment of one of the most diverse and abundant ectomycorrhizal genera in Alaska: Russula. We analyzed internal transcribed spacer rDNA sequences from sporocarps and soil samples using phylogenetic...
Broad Surveys of DNA Viral Diversity Obtained through Viral Metagenomics of Mosquitoes

PubMed Central

Ng, Terry Fei Fan; Willner, Dana L.; Lim, Yan Wei; Schmieder, Robert; Chau, Betty; Nilsson, Christina; Anthony, Simon; Ruan, Yijun; Rohwer, Forest; Breitbart, Mya

2011-01-01

Viruses are the most abundant and diverse genetic entities on Earth; however, broad surveys of viral diversity are hindered by the lack of a universal assay for viruses and the inability to sample a sufficient number of individual hosts. This study utilized vector-enabled metagenomics (VEM) to provide a snapshot of the diversity of DNA viruses present in three mosquito samples from San Diego, California. The majority of the sequences were novel, suggesting that the viral community in mosquitoes, as well as the animal and plant hosts they feed on, is highly diverse and largely uncharacterized. Each mosquito sample contained a distinct viral community. The mosquito viromes contained sequences related to a broad range of animal, plant, insect and bacterial viruses. Animal viruses identified included anelloviruses, circoviruses, herpesviruses, poxviruses, and papillomaviruses, which mosquitoes may have obtained from vertebrate hosts during blood feeding. Notably, sequences related to human papillomaviruses were identified in one of the mosquito samples. Sequences similar to plant viruses were identified in all mosquito viromes, which were potentially acquired through feeding on plant nectar. Numerous bacteriophages and insect viruses were also detected, including a novel densovirus likely infecting Culex erythrothorax. Through sampling insect vectors, VEM enables broad survey of viral diversity and has significantly increased our knowledge of the DNA viruses present in mosquitoes. PMID:21674005
Elucidating the genomic architecture of Asian EGFR-mutant lung adenocarcinoma through multi-region exome sequencing.

PubMed

Nahar, Rahul; Zhai, Weiwei; Zhang, Tong; Takano, Angela; Khng, Alexis J; Lee, Yin Yeng; Liu, Xingliang; Lim, Chong Hee; Koh, Tina P T; Aung, Zaw Win; Lim, Tony Kiat Hon; Veeravalli, Lavanya; Yuan, Ju; Teo, Audrey S M; Chan, Cheryl X; Poh, Huay Mei; Chua, Ivan M L; Liew, Audrey Ann; Lau, Dawn Ping Xi; Kwang, Xue Lin; Toh, Chee Keong; Lim, Wan-Teck; Lim, Bing; Tam, Wai Leong; Tan, Eng-Huat; Hillmer, Axel M; Tan, Daniel S W

2018-01-15

EGFR-mutant lung adenocarcinomas (LUAD) display diverse clinical trajectories and are characterized by rapid but short-lived responses to EGFR tyrosine kinase inhibitors (TKIs). Through sequencing of 79 spatially distinct regions from 16 early stage tumors, we show that despite low mutation burdens, EGFR-mutant Asian LUADs unexpectedly exhibit a complex genomic landscape with frequent and early whole-genome doubling, aneuploidy, and high clonal diversity. Multiple truncal alterations, including TP53 mutations and loss of CDKN2A and RB1, converge on cell cycle dysregulation, with late sector-specific high-amplitude amplifications and deletions that potentially beget drug resistant clones. We highlight the association between genomic architecture and clinical phenotypes, such as co-occurring truncal drivers and primary TKI resistance. Through comparative analysis with published smoking-related LUAD, we postulate that the high intra-tumor heterogeneity observed in Asian EGFR-mutant LUAD may be contributed by an early dominant driver, genomic instability, and low background mutation rates.
Specifics of the methodological approach to the study of nanoparticle impact on human health in the production of non-metallic nanomaterials for construction purposes

NASA Astrophysics Data System (ADS)

Ayzenshtadt, A. M.; Frolova, M. A.; Makhova, T. A.; Danilov, V. E.; Gupta, Piyush K.; Verma, Rama S.

2018-01-01

Minerals samples of mixed-genesis rocks in a finely dispersed state were obtained and studied, namely sand deposit (Kholmogory district) and basalt (Myandukha deposit, Plesetsk district) in Arkhangelsk region. The paper provides the chemical composition data used to calculate the specific mass atomization energy of rocks. The energy parameters of the micro and nano systems of the rock samples - free surface energy and surface activity - were calculated. For toxicological evaluation of the materials obtained, next-generation sequencing (NGS) was used to perform metagenomic analysis which allowed determining the species diversity of microorganisms in the samples under study. It was shown that the sequencing method and metagenomic analysis are applicable and provide good reproducibility for the analysis of the toxicological properties of selected rock samples. The correlation of the surface activity of finely dispersed rock systems and the species diversity of cultivated microorganisms on the raw material was observed.
Microbial population Diversity of indigenous acidophilic bacteria for recovering the valuable resources

NASA Astrophysics Data System (ADS)

Kim, B.; Cho, K.; Lee, D.; Choi, N.; Park, C.

2011-12-01

A taxon- or group-specific PCR primer serves as a valuable tool for studying the bioleaching mechanisms of a particular group of microorganisms. Especially for an uncultured (or very difficult to isolate from their environments) group of microorganisms, the group-specific PCR primer is essential for the investigation of distribution patterns and the estimation of genetic diversity of the target microorganisms. This study investigated the Biodiversity through molecular biology method using the three different indigenous acidophilic bacteria collected from acid mine drainage in Go-seong and Yeon-hwa, Korea and acidic hot spring in Hatchnobaru, Japan. We performed the optical analysis (phase-contrast microscope and SEM), base sequencing. In the phase-contrast microscope(X 4,000) and SEM analysis, the rod-shaped bacteria with 1μm in length were observed. The results of base sequencing using EzTaxon server data revealed Acidithiobacillus ferrooxidans (Go-seong - 97.79%, Yeon-hwa - 97.90% and Hatchnobaru - 97.97%)
Nitrification and occurrence of salt-tolerant nitrifying bacteria in the Negev desert soils.

PubMed

Nejidat, Ali

2005-03-01

Ammonia oxidation potential, major ammonia oxidizers and occurrence of salt-tolerant nitrifying bacteria were studied in soil samples collected from diverse ecosystems along the northern Negev desert. Great diversity in ammonia oxidation potential was observed among the soil samples, and ammonia oxidizers were the rate-limiting step of nitrification. Denaturing gradient gel electrophoresis and partial 16S rRNA gene sequences indicate that members of the genus Nitrosospira are the major ammonia oxidizers in the natural desert soil samples. Upon enrichment with different salt concentrations, salt-tolerant nitrifying enrichments were established from several soil samples. In two enrichments, nitrification was not inhibited by 400 mM NaCl. Electrophoretic analysis and partial 16S rRNA gene sequences indicate that Nitrosomonas species were dominant in the 400 mM salt enrichment. The results point towards the potential of the desert ecosystem as a source of stress-tolerant nitrifying bacteria or other microorganisms with important properties.
Characterization of an endogenous retrovirus class in elephants and their relatives

PubMed Central

Greenwood, Alex D; Englbrecht, Claudia C; MacPhee, Ross DE

2004-01-01

Background Endogenous retrovirus-like elements (ERV-Ls, primed with tRNA leucine) are a diverse group of reiterated sequences related to foamy viruses and widely distributed among mammals. As shown in previous investigations, in many primates and rodents this class of elements has remained transpositionally active, as reflected by increased copy number and high sequence diversity within and among taxa. Results Here we examine whether proviral-like sequences may be suitable molecular probes for investigating the phylogeny of groups known to have high element diversity. As a test we characterized ERV-Ls occurring in a sample of extant members of superorder Uranotheria (Asian and African elephants, manatees, and hyraxes). The ERV-L complement in this group is even more diverse than previously suspected, and there is sequence evidence for active expansion, particularly in elephantids. Many of the elements characterized have protein coding potential suggestive of activity. Conclusions In general, the evidence supports the hypothesis that the complement had a single origin within basal Uranotheria. PMID:15476555
The spectrum of genomic signatures: from dinucleotides to chaos game representation.

PubMed

Wang, Yingwei; Hill, Kathleen; Singh, Shiva; Kari, Lila

2005-02-14

In the post genomic era, access to complete genome sequence data for numerous diverse species has opened multiple avenues for examining and comparing primary DNA sequence organization of entire genomes. Previously, the concept of a genomic signature was introduced with the observation of species-type specific Dinucleotide Relative Abundance Profiles (DRAPs); dinucleotides were identified as the subsequences with the greatest bias in representation in a majority of genomes. Herein, we demonstrate that DRAP is one particular genomic signature contained within a broader spectrum of signatures. Within this spectrum, an alternative genomic signature, Chaos Game Representation (CGR), provides a unique visualization of patterns in sequence organization. A genomic signature is associated with a particular integer order or subsequence length that represents a measure of the resolution or granularity in the analysis of primary DNA sequence organization. We quantitatively explore the organizational information provided by genomic signatures of different orders through different distance measures, including a novel Image Distance. The Image Distance and other existing distance measures are evaluated by comparing the phylogenetic trees they generate for 26 complete mitochondrial genomes from a diversity of species. The phylogenetic tree generated by the Image Distance is compatible with the known relatedness of species. Quantitative evaluation of the spectrum of genomic signatures may be used to ultimately gain insight into the determinants and biological relevance of the genome signatures.
Phylogeny of Banana Streak Virus reveals recent and repetitive endogenization in the genome of its banana host (Musa sp.).

PubMed

Gayral, Philippe; Iskra-Caruana, Marie-Line

2009-07-01

Banana streak virus (BSV) is a plant dsDNA pararetrovirus (family Caulimoviridae, genus badnavirus). Although integration is not an essential step in the BSV replication cycle, the nuclear genome of banana (Musa sp.) contains BSV endogenous pararetrovirus sequences (BSV EPRVs). Some BSV EPRVs are infectious by reconstituting a functional viral genome. Recent studies revealed a large molecular diversity of episomal BSV viruses (i.e., nonintegrated) while others focused on BSV EPRV sequences only. In this study, the evolutionary history of badnavirus integration in banana was inferred from phylogenetic relationships between BSV and BSV EPRVs. The relative evolution rates and selective pressures (d(N)/d(S) ratio) were also compared between endogenous and episomal viral sequences. At least 27 recent independent integration events occurred after the divergence of three banana species, indicating that viral integration is a recent and frequent phenomenon. Relaxation of selective pressure on badnaviral sequences that experienced neutral evolution after integration in the plant genome was recorded. Additionally, a significant decrease (35%) in the EPRV evolution rate was observed compared to BSV, reflecting the difference in the evolution rate between episomal dsDNA viruses and plant genome. The comparison of our results with the evolution rate of the Musa genome and other reverse-transcribing viruses suggests that EPRVs play an active role in episomal BSV diversity and evolution.
Simultaneous profiling of seed-associated bacteria and fungi reveals antagonistic interactions between microorganisms within a shared epiphytic microbiome on Triticum and Brassica seeds.

PubMed

Links, Matthew G; Demeke, Tigst; Gräfenhan, Tom; Hill, Janet E; Hemmingsen, Sean M; Dumonceaux, Tim J

2014-04-01

In order to address the hypothesis that seeds from ecologically and geographically diverse plants harbor characteristic epiphytic microbiota, we characterized the bacterial and fungal microbiota associated with Triticum and Brassica seed surfaces. The total microbial complement was determined by amplification and sequencing of a fragment of chaperonin 60 (cpn60). Specific microorganisms were quantified by qPCR. Bacteria and fungi corresponding to operational taxonomic units (OTU) that were identified in the sequencing study were isolated and their interactions examined. A total of 5477 OTU were observed from seed washes. Neither total epiphytic bacterial load nor community richness/evenness was significantly different between the seed types; 578 OTU were shared among all samples at a variety of abundances. Hierarchical clustering revealed that 203 were significantly different in abundance on Triticum seeds compared with Brassica. Microorganisms isolated from seeds showed 99-100% identity between the cpn60 sequences of the isolates and the OTU sequences from this shared microbiome. Bacterial strains identified as Pantoea agglomerans had antagonistic properties toward one of the fungal isolates (Alternaria sp.), providing a possible explanation for their reciprocal abundances on both Triticum and Brassica seeds. cpn60 enabled the simultaneous profiling of bacterial and fungal microbiota and revealed a core seed-associated microbiota shared between diverse plant genera. © 2014 AAFC. New Phytologist © 2014 New Phytologist Trust.
Simultaneous profiling of seed-associated bacteria and fungi reveals antagonistic interactions between microorganisms within a shared epiphytic microbiome on Triticum and Brassica seeds

PubMed Central

Links, Matthew G; Demeke, Tigst; Gräfenhan, Tom; Hill, Janet E; Hemmingsen, Sean M; Dumonceaux, Tim J

2014-01-01

In order to address the hypothesis that seeds from ecologically and geographically diverse plants harbor characteristic epiphytic microbiota, we characterized the bacterial and fungal microbiota associated with Triticum and Brassica seed surfaces. The total microbial complement was determined by amplification and sequencing of a fragment of chaperonin 60 (cpn60). Specific microorganisms were quantified by qPCR. Bacteria and fungi corresponding to operational taxonomic units (OTU) that were identified in the sequencing study were isolated and their interactions examined. A total of 5477 OTU were observed from seed washes. Neither total epiphytic bacterial load nor community richness/evenness was significantly different between the seed types; 578 OTU were shared among all samples at a variety of abundances. Hierarchical clustering revealed that 203 were significantly different in abundance on Triticum seeds compared with Brassica. Microorganisms isolated from seeds showed 99–100% identity between the cpn60 sequences of the isolates and the OTU sequences from this shared microbiome. Bacterial strains identified as Pantoea agglomerans had antagonistic properties toward one of the fungal isolates (Alternaria sp.), providing a possible explanation for their reciprocal abundances on both Triticum and Brassica seeds. cpn60 enabled the simultaneous profiling of bacterial and fungal microbiota and revealed a core seed-associated microbiota shared between diverse plant genera. PMID:24444052
Genetic diversity of the merozoite surface protein-3 gene in Plasmodium falciparum populations in Thailand.

PubMed

Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai

2016-10-21

An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (F st ) and neutrality tests (Tajima's D, Fu and Li D* and Fu and Li' F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
Digital data for Quick Response (QR) codes of thermophiles to identify and compare the bacterial species isolated from Unkeshwar hot springs (India).

PubMed

Rekadwad, Bhagwan N; Khobragade, Chandrahasya N

2016-03-01

16S rRNA sequences of morphologically and biochemically identified 21 thermophilic bacteria isolated from Unkeshwar hot springs (19°85'N and 78°25'E), Dist. Nanded (India) has been deposited in NCBI repository. The 16S rRNA gene sequences were used to generate QR codes for sequences (FASTA format and full Gene Bank information). Diversity among the isolates is compared with known isolates and evaluated using CGR, FCGR and PCA i.e. visual comparison and evaluation respectively. Considerable biodiversity was observed among the identified bacteria isolated from Unkeshwar hot springs. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/.

Intermediary metabolism in protists: a sequence-based view of facultative anaerobic metabolism in evolutionarily diverse eukaryotes.

PubMed

Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C

2010-12-01

Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.
Strain-Level Diversity of Secondary Metabolism in Streptomyces albus

PubMed Central

Seipke, Ryan F.

2015-01-01

Streptomyces spp. are robust producers of medicinally-, industrially- and agriculturally-important small molecules. Increased resistance to antibacterial agents and the lack of new antibiotics in the pipeline have led to a renaissance in natural product discovery. This endeavor has benefited from inexpensive high quality DNA sequencing technology, which has generated more than 140 genome sequences for taxonomic type strains and environmental Streptomyces spp. isolates. Many of the sequenced streptomycetes belong to the same species. For instance, Streptomyces albus has been isolated from diverse environmental niches and seven strains have been sequenced, consequently this species has been sequenced more than any other streptomycete, allowing valuable analyses of strain-level diversity in secondary metabolism. Bioinformatics analyses identified a total of 48 unique biosynthetic gene clusters harboured by Streptomyces albus strains. Eighteen of these gene clusters specify the core secondary metabolome of the species. Fourteen of the gene clusters are contained by one or more strain and are considered auxiliary, while 16 of the gene clusters encode the production of putative strain-specific secondary metabolites. Analysis of Streptomyces albus strains suggests that each strain of a Streptomyces species likely harbours at least one strain-specific biosynthetic gene cluster. Importantly, this implies that deep sequencing of a species will not exhaust gene cluster diversity and will continue to yield novelty. PMID:25635820
Intermediary Metabolism in Protists: a Sequence-based View of Facultative Anaerobic Metabolism in Evolutionarily Diverse Eukaryotes

PubMed Central

Ginger, Michael L.; Fritz-Laylin, Lillian K.; Fulton, Chandler; Cande, W. Zacheus; Dawson, Scott C.

2011-01-01

Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2–3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H2 in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. PMID:21036663
A not-so-big crisis: re-reading Silurian conodont diversity in a sequence-stratigraphic framework

NASA Astrophysics Data System (ADS)

Jarochowska, Emilia; Munnecke, Axel

2016-04-01

Conodonts are extensively used in Ordovician through Triassic biostratigraphy and fossil-based geochemistry. However, their distribution in rock successions is commonly taken at face value, without taking into account their diverse and poorly understood ecology. Multielement taxonomy, ontogenetic and environmental variability, difficulties in extraction, and relative rarity all contribute to the general lack of quantitative studies on conodont stratigraphic distribution and temporal turnover. With respect to Silurian conodonts, the concept of recurrent conodont extinction events - the so called Ireviken, Mulde and Lau events - has become a standard in the stratigraphic literature. The concept has been proposed based on qualitative observations of local extirpations of open-marine pelagic or nekto-benthic taxa and temporary dominance of shallow-water species in the Silurian succession of the Swedish island of Gotland. These changes coincided with positive carbon isotope excursions, abrupt facies shifts, "blooms" of benthic fauna, and changes in reef communities, which have all been combined into a general view of Silurian bio-geochemical events. This view posits a deterministic, reproducible pattern in Silurian conodont diversity, attributed to recurrent ecological or geochemical conditions. The growing body of sequence-stratigraphic interpretations across these events in Gotland and other sections worldwide indicate that in all cases the Silurian "events" are associated with rapid global regressions. This suggests that faunal changes such as the dominance of shallow-water, low-diversity conodont fauna and the increase of benthic invertebrate diversity and abundance represent predictable consequences of the variation in the completeness of the rock record and preservation potential of different environments. Our studies in Poland and Ukraine indicate that the magnitude of change in the taxonomic composition of conodont assemblages across the middle Silurian global regression and the hypothesized Mulde Event is proportional to the associated facies shift. Quantitative data on facies distribution of individual conodont species combined with sequence stratigraphic architecture provides a testable model for the impact of sea-level changes on perceived conodont diversity in a section or basin. This approach highlights the need for quantitative data on conodont distribution in their environmental context, their integration into conodont-based stratigraphy and geochemistry, and for the regular use of Occam's razor to interpretations of paleobiodiversity.
Diversity of Babesia bovis merozoite surface antigen genes in the Philippines.

PubMed

Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Ybanez, Adrian Patalinghug; Ybanez, Rochelle Haidee Daclan; Perez, Zandro Obligado; Guswanto, Azirwan; Igarashi, Ikuo; Yokoyama, Naoaki

2014-02-01

Babesia bovis is the causative agent of fatal babesiosis in cattle. In the present study, we investigated the genetic diversity of B. bovis among Philippine cattle, based on the genes that encode merozoite surface antigens (MSAs). Forty-one B. bovis-positive blood DNA samples from cattle were used to amplify the msa-1, msa-2b, and msa-2c genes. In phylogenetic analyses, the msa-1, msa-2b, and msa-2c gene sequences generated from Philippine B. bovis-positive DNA samples were found in six, three, and four different clades, respectively. All of the msa-1 and most of the msa-2b sequences were found in clades that were formed only by Philippine msa sequences in the respective phylograms. While all the msa-1 sequences from the Philippines showed similarity to those formed by Australian msa-1 sequences, the msa-2b sequences showed similarity to either Australian or Mexican msa-2b sequences. In contrast, msa-2c sequences from the Philippines were distributed across all the clades of the phylogram, although one clade was formed exclusively by Philippine msa-2c sequences. Similarities among the deduced amino acid sequences of MSA-1, MSA-2b, and MSA-2c from the Philippines were 62.2-100, 73.1-100, and 67.3-100%, respectively. The present findings demonstrate that B. bovis populations are genetically diverse in the Philippines. This information will provide a good foundation for the future design and implementation of improved immunological preventive methodologies against bovine babesiosis in the Philippines. The study has also generated a set of data that will be useful for futher understanding of the global genetic diversity of this important parasite. © 2013.
[Community composition and diversity of endophytic fungi from roots of Sinopodophyllum hexandrum in forest of Upper-north mountain of Qinghai province].

PubMed

Ning, Yi; Li, Yan-Ling; Zhou, Guo-Ying; Yang, Lu-Cun; Xu, Wen-Hua

2016-04-01

High throughput sequencing technology is also called Next Generation Sequencing (NGS), which can sequence hundreds and thousands sequences in different samples at the same time. In the present study, the culture-independent high throughput sequencing technology was applied to sequence the fungi metagenomic DNA of the fungal internal transcribed spacer 1(ITS 1) in the root of Sinopodophyllum hexandrum. Sequencing data suggested that after the quality control, 22 565 reads were remained. Cluster similarity analysis was done based on 97% sequence similarity, which obtained 517 OTUs for the three samples (LD1, LD2 and LD3). All the fungi which identified from all the reads of OTUs based on 0.8 classification thresholds using the software of RDP classifier were classified as 13 classes, 35 orders, 44 family, 55 genera. Among these genera, the genus of Tetracladium was the dominant genera in all samples(35.49%, 68.55% and 12.96%).The Shannon's diversity indices and the Simpson indices of the endophytic fungi in the samples ranged from 1.75-2.92, 0.11-0.32, respectively.This is the first time for applying high through put sequencing technol-ogyto analyze the community composition and diversity of endophytic fungi in the medicinal plant, and the results showed that there were hyper diver sity and high community composition complexity of endophytic fungi in the root of S. hexandrum. It is also proved that the high through put sequencing technology has great advantage for analyzing ecommunity composition and diversity of endophtye in the plant. Copyright© by the Chinese Pharmaceutical Association.
Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II

PubMed Central

Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter

2017-01-01

The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230
Bacterial diversity in permanently cold and alkaline ikaite columns from Greenland.

PubMed

Schmidt, Mariane; Priemé, Anders; Stougaard, Peter

2006-12-01

Bacterial diversity in alkaline (pH 10.4) and permanently cold (4 degrees C) ikaite tufa columns from the Ikka Fjord, SW Greenland, was investigated using growth characterization of cultured bacterial isolates with Terminal-restriction fragment length polymorphism (T-RFLP) and sequence analysis of bacterial 16S rRNA gene fragments. More than 200 bacterial isolates were characterized with respect to pH and temperature tolerance, and it was shown that the majority were cold-active alkaliphiles. T-RFLP analysis revealed distinct bacterial communities in different fractions of three ikaite columns, and, along with sequence analysis, it showed the presence of rich and diverse bacterial communities. Rarefaction analysis showed that the 109 sequenced clones in the 16S rRNA gene library represented between 25 and 65% of the predicted species richness in the three ikaite columns investigated. Phylogenetic analysis of the 16S rRNA gene sequences revealed many sequences with similarity to alkaliphilic or psychrophilic bacteria, and showed that 33% of the cloned sequences and 33% of the cultured bacteria showed less than 97% sequence identity to known sequences in databases, and may therefore represent yet unknown species.
Severe chronic osteomyelitis caused by Morganella morganii with high population diversity.

PubMed

Zhu, Jialiang; Li, Haifeng; Feng, Li; Yang, Min; Yang, Ronggong; Yang, Lin; Li, Li; Li, Ruoyan; Liu, Minshan; Hou, Shuxun; Ke, Yuehua; Li, Wenfeng; Bai, Fan

2016-09-01

A case of chronic osteomyelitis probably caused by Morganella morganii, occurring over a period of 30 years, is reported. The organism was identified through a combination of sample culture, direct sequencing, and 16S RNA gene amplicon sequencing. Further whole-genome sequencing and population structure analysis of the isolates from the patient showed the bacterial population to be highly diverse. This case provides a valuable example of a long-term infection caused by an opportunistic pathogen, M. morganii, with high diversity, which might evolve during replication within the host. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Molecular characterization and genetic diversity of ESBL-producing Escherichia coli colonizing the migratory Franklin's gulls (Leucophaeus pipixcan) in Antofagasta, North of Chile.

PubMed

Báez, John; Hernández-García, Marta; Guamparito, Constanza; Díaz, Sofía; Olave, Abdon; Guerrero, Katherine; Cantón, Rafael; Baquero, Fernando; Gahona, Joselyne; Valenzuela, Nicomedes; Del Campo, Rosa; Silva, Juan

2015-02-01

The role of wild animals, particularly migratory birds, in the dissemination of antibiotic-resistant bacteria between geographically distant ecosystems is usually underestimated. The aim of this work was to characterize the Escherichia coli population from Franklin's gull feces, focusing on the extended-spectrum β-lactamase (ESBL)-producing strains. In the summer of 2011, 124 fecal swabs from seagulls (1 of each) migrating from the United States and Canada to the coast of Antofagasta, north of Chile, were collected. Samples were seeded on MacConkey agar supplemented with 2 μg/ml of cefotaxime and a single colony from each plate was tested for ESBL production by the double-disk ESBL synergy test. Antibiotic susceptibility was determined by the disk diffusion method and blaESBL genes were amplified and sequenced. The genetic diversity of isolates was explored by pulsed-field gel electrophoresis (PFGE)-XbaI and multilocus sequence typing. A total of 91 E. coli isolates with high rates of antibiotic resistance were identified. Carbapenemase production was not detected, whereas 67 of the 91 (54%) isolates exhibited an ESBL phenotype due to the presence of CTX-M-15 (61.3%), CTX-M-2 (19.3%), CTX-M-22 (16.1%), and CTX-M-3 (1.6%) coding genes. High genetic diversity was observed, with 30 PFGE patterns and 23 sequence types (STs), including ST131 (18%), ST44 (15%), ST617 (9%), and ST10 (9%). Results presented here are complementary to those previously reported by Hernández et al. in the same gull species, but located in the Central Region of Chile. Differences observed between gulls from both areas lead us to hypothesize that gulls from the northern location retain, as gut carriers, those resistant bacteria acquired in the United States and/or Canada.
Application of novel polymorphic microsatellite loci identified in the Korean Pacific Abalone (Haliotis diversicolor supertexta (Haliotidae)) in the genetic characterization of wild and released populations.

PubMed

An, Hye Suck; Lee, Jang Wook; Hong, Seong Wan

2012-01-01

The small abalone, Haliotis diversicolor supertexta, of the family Haliotidae, is one of the most important species of marine shellfish in eastern Asia. Over the past few decades, this species has drastically declined in Korea. Thus, hatchery-bred seeds have been released into natural coastal areas to compensate for the reduced fishery resources. However, information on the genetic background of the small abalone is scarce. In this study, 20 polymorphic microsatellite DNA markers were identified using next-generation sequencing techniques and used to compare allelic variation between wild and released abalone populations in Korea. Using high-throughput genomic sequencing, a total of 1516 (2.26%; average length of 385 bp) reads containing simple sequence repeats were obtained from 86,011 raw reads. Among the 99 loci screened, 28 amplified successfully, and 20 were polymorphic. When comparing allelic variation between wild and released abalone populations, a total of 243 different alleles were observed, with 18.7 alleles per locus. High genetic diversity (mean heterozygosity = 0.81; mean allelic number = 15.5) was observed in both populations. A statistical analysis of the fixation index (F(ST)) and analysis of molecular variance (AMOVA) indicated limited genetic differences between the two populations (F(ST) = 0.002, p > 0.05). Although no significant reductions in the genetic diversity were found in the released population compared with the wild population (p > 0.05), the genetic diversity parameters revealed that the seeds released for stock abundance had a different genetic composition. These differences are likely a result of hatchery selection and inbreeding. Additionally, all the primer pair sets were effectively amplified in another congeneric species, H. diversicolor diversicolor, indicating that these primers are useful for both abalone species. These microsatellite loci may be valuable for future aquaculture and population genetic studies aimed at developing conservation and management plans for these two abalone species.
Genetic diversity of Plasmodium vivax and Plasmodium falciparum in Honduras

PubMed Central

2012-01-01

Background Understanding the population structure of Plasmodium species through genetic diversity studies can assist in the design of more effective malaria control strategies, particularly in vaccine development. Central America is an area where malaria is a public health problem, but little is known about the genetic diversity of the parasite’s circulating species. This study aimed to investigate the allelic frequency and molecular diversity of five surface antigens in field isolates from Honduras. Methods Five molecular markers were analysed to determine the genotypes of Plasmodium vivax and Plasmodium falciparum from endemic areas in Honduras. Genetic diversity of ama-1, msp-1 and csp was investigated for P. vivax, and msp-1 and msp-2 for P. falciparum. Allelic frequencies were calculated and sequence analysis performed. Results and conclusion A high genetic diversity was observed within Plasmodium isolates from Honduras. A different number of genotypes were elucidated: 41 (n = 77) for pvama-1; 23 (n = 84) for pvcsp; and 23 (n = 35) for pfmsp-1. Pvcsp sequences showed VK210 as the only subtype present in Honduran isolates. Pvmsp-1 (F2) was the most polymorphic marker for P. vivax isolates while pvama-1 was least variable. All three allelic families described for pfmsp-1 (n = 30) block 2 (K1, MAD20, and RO33), and both allelic families described for the central domain of pfmsp-2 (n = 11) (3D7 and FC27) were detected. However, K1 and 3D7 allelic families were predominant. All markers were randomly distributed across the country and no geographic correlation was found. To date, this is the most complete report on molecular characterization of P. vivax and P. falciparum field isolates in Honduras with regards to genetic diversity. These results indicate that P. vivax and P. falciparum parasite populations are highly diverse in Honduras despite the low level of transmission. PMID:23181845
Genetic diversity of Plasmodium vivax and Plasmodium falciparum in Honduras.

PubMed

Lopez, Ana Cecilia; Ortiz, Andres; Coello, Jorge; Sosa-Ochoa, Wilfredo; Torres, Rosa E Mejia; Banegas, Engels I; Jovel, Irina; Fontecha, Gustavo A

2012-11-26

Understanding the population structure of Plasmodium species through genetic diversity studies can assist in the design of more effective malaria control strategies, particularly in vaccine development. Central America is an area where malaria is a public health problem, but little is known about the genetic diversity of the parasite's circulating species. This study aimed to investigate the allelic frequency and molecular diversity of five surface antigens in field isolates from Honduras. Five molecular markers were analysed to determine the genotypes of Plasmodium vivax and Plasmodium falciparum from endemic areas in Honduras. Genetic diversity of ama-1, msp-1 and csp was investigated for P. vivax, and msp-1 and msp-2 for P. falciparum. Allelic frequencies were calculated and sequence analysis performed. A high genetic diversity was observed within Plasmodium isolates from Honduras. A different number of genotypes were elucidated: 41 (n = 77) for pvama-1; 23 (n = 84) for pvcsp; and 23 (n = 35) for pfmsp-1. Pvcsp sequences showed VK210 as the only subtype present in Honduran isolates. Pvmsp-1 (F2) was the most polymorphic marker for P. vivax isolates while pvama-1 was least variable. All three allelic families described for pfmsp-1 (n = 30) block 2 (K1, MAD20, and RO33), and both allelic families described for the central domain of pfmsp-2 (n = 11) (3D7 and FC27) were detected. However, K1 and 3D7 allelic families were predominant. All markers were randomly distributed across the country and no geographic correlation was found. To date, this is the most complete report on molecular characterization of P. vivax and P. falciparum field isolates in Honduras with regards to genetic diversity. These results indicate that P. vivax and P. falciparum parasite populations are highly diverse in Honduras despite the low level of transmission.
Pesticide Side Effects in an Agricultural Soil Ecosystem as Measured by amoA Expression Quantification and Bacterial Diversity Changes

PubMed Central

Feld, Louise; Hjelmsø, Mathis Hjort; Nielsen, Morten Schostag; Jacobsen, Anne Dorthe; Rønn, Regin; Ekelund, Flemming; Krogh, Paul Henning; Strobel, Bjarne Westergaard; Jacobsen, Carsten Suhr

2015-01-01

Background and Methods Assessing the effects of pesticide hazards on microbiological processes in the soil is currently based on analyses that provide limited insight into the ongoing processes. This study proposes a more comprehensive approach. The side effects of pesticides may appear as changes in the expression of specific microbial genes or as changes in diversity. To assess the impact of pesticides on gene expression, we focused on the amoA gene, which is involved in ammonia oxidation. We prepared soil microcosms and exposed them to dazomet, mancozeb or no pesticide. We hypothesized that the amount of amoA transcript decreases upon pesticide application, and to test this hypothesis, we used reverse-transcription qPCR. We also hypothesized that bacterial diversity is affected by pesticides. This hypothesis was investigated via 454 sequencing and diversity analysis of the 16S ribosomal RNA and RNA genes, representing the active and total soil bacterial communities, respectively. Results and Conclusion Treatment with dazomet reduced both the bacterial and archaeal amoA transcript numbers by more than two log units and produced long-term effects for more than 28 days. Mancozeb also inhibited the numbers of amoA transcripts, but only transiently. The bacterial and archaeal amoA transcripts were both sensitive bioindicators of pesticide side effects. Additionally, the numbers of bacterial amoA transcripts correlated with nitrate production in N-amended microcosms. Dazomet reduced the total bacterial numbers by one log unit, but the population size was restored after twelve days. The diversity of the active soil bacteria also seemed to be re-established after twelve days. However, the total bacterial diversity as reflected in the 16S ribosomal RNA gene sequences was largely dominated by Firmicutes and Proteobacteria at day twelve, likely reflecting a halt in the growth of early opportunists and the re-establishment of a more diverse population. We observed no effects of mancozeb on diversity. PMID:25938467
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution

PubMed Central

Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian

2015-01-01

Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. PMID:26286928
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution.

PubMed

Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian

2015-01-01

Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. © The Author(s) 2015. Published by Oxford University Press.
Palynological composition of a Lower Cretaceous South American tropical sequence: climatic implications and diversity comparisons with other latitudes.

PubMed

Mejia-Velasquez, Paula J; Dilcher, David L; Jaramillo, Carlos A; Fortini, Lucas B; Manchester, Steven R

2012-11-01

Reconstruction of floristic patterns during the early diversification of angiosperms is impeded by the scarce fossil record, especially in tropical latitudes. Here we collected quantitative palynological data from a stratigraphic sequence in tropical South America to provide floristic and climatic insights into such tropical environments during the Early Cretaceous. We reconstructed the floristic composition of an Aptian-Albian tropical sequence from central Colombia using quantitative palynology (rarefied species richness and abundance) and used it to infer its predominant climatic conditions. Additionally, we compared our results with available quantitative data from three other sequences encompassing 70 floristic assemblages to determine latitudinal diversity patterns. Abundance of humidity indicators was higher than that of aridity indicators (61% vs. 10%). Additionally, we found an angiosperm latitudinal diversity gradient (LDG) for the Aptian, but not for the Albian, and an inverted LDG of the overall diversity for the Albian. Angiosperm species turnover during the Albian, however, was higher in humid tropics. There were humid climates in northwestern South America during the Aptian-Albian interval contrary to the widespread aridity expected for the tropical belt. The Albian inverted overall LDG is produced by a faster increase in per-sample angiosperm and pteridophyte diversity in temperate latitudes. However, humid tropical sequences had higher rates of floristic turnover suggesting a higher degree of morphological variation than in temperate regions.
Palynological composition of a Lower Cretaceous South American tropical sequence: Climatic implications and diversity comparisons with other latitudes.

USGS Publications Warehouse

Mejia-Velasquez, Paula J.; Dilcher, David L.; Jaramillo, Carlos A.; Fortini, Lucas B.; Manchester, Steven R.

2012-01-01

Premise of the study: Reconstruction of floristic patterns during the early diversification of angiosperms is impeded by the scarce fossil record, especially in tropical latitudes. Here we collected quantitative palynological data from a stratigraphic sequence in tropical South America to provide floristic and climatic insights into such tropical environments during the Early Cretaceous. Methods: We reconstructed the floristic composition of an Aptian-Albian tropical sequence from central Colombia using quantitative palynology (rarefied species richness and abundance) and used it to infer its predominant climatic conditions. Additionally, we compared our results with available quantitative data from three other sequences encompassing 70 floristic assemblages to determine latitudinal diversity patterns. Key results: Abundance of humidity indicators was higher than that of aridity indicators (61% vs. 10%). Additionally, we found an angiosperm latitudinal diversity gradient (LDG) for the Aptian, but not for the Albian, and an inverted LDG of the overall diversity for the Albian. Angiosperm species turnover during the Albian, however, was higher in humid tropics. Conclusions: There were humid climates in northwestern South America during the Aptian-Albian interval contrary to the widespread aridity expected for the tropical belt. The Albian inverted overall LDG is produced by a faster increase in per-sample angiosperm and pteridophyte diversity in temperate latitudes. However, humid tropical sequences had higher rates of floristic turnover suggesting a higher degree of morphological variation than in temperate regions.
Robust k-mer frequency estimation using gapped k-mers

PubMed Central

Ghandi, Mahmoud; Mohammad-Noori, Morteza

2013-01-01

Oligomers of fixed length, k, commonly known as k-mers, are often used as fundamental elements in the description of DNA sequence features of diverse biological function, or as intermediate elements in the constuction of more complex descriptors of sequence features such as position weight matrices. k-mers are very useful as general sequence features because they constitute a complete and unbiased feature set, and do not require parameterization based on incomplete knowledge of biological mechanisms. However, a fundamental limitation in the use of k-mers as sequence features is that as k is increased, larger spatial correlations in DNA sequence elements can be described, but the frequency of observing any specific k-mer becomes very small, and rapidly approaches a sparse matrix of binary counts. Thus any statistical learning approach using k-mers will be susceptible to noisy estimation of k-mer frequencies once k becomes large. Because all molecular DNA interactions have limited spatial extent, gapped k-mers often carry the relevant biological signal. Here we use gapped k-mer counts to more robustly estimate the ungapped k-mer frequencies, by deriving an equation for the minimum norm estimate of k-mer frequencies given an observed set of gapped k-mer frequencies. We demonstrate that this approach provides a more accurate estimate of the k-mer frequencies in real biological sequences using a sample of CTCF binding sites in the human genome. PMID:23861010
Robust k-mer frequency estimation using gapped k-mers.

PubMed

Ghandi, Mahmoud; Mohammad-Noori, Morteza; Beer, Michael A

2014-08-01

Oligomers of fixed length, k, commonly known as k-mers, are often used as fundamental elements in the description of DNA sequence features of diverse biological function, or as intermediate elements in the constuction of more complex descriptors of sequence features such as position weight matrices. k-mers are very useful as general sequence features because they constitute a complete and unbiased feature set, and do not require parameterization based on incomplete knowledge of biological mechanisms. However, a fundamental limitation in the use of k-mers as sequence features is that as k is increased, larger spatial correlations in DNA sequence elements can be described, but the frequency of observing any specific k-mer becomes very small, and rapidly approaches a sparse matrix of binary counts. Thus any statistical learning approach using k-mers will be susceptible to noisy estimation of k-mer frequencies once k becomes large. Because all molecular DNA interactions have limited spatial extent, gapped k-mers often carry the relevant biological signal. Here we use gapped k-mer counts to more robustly estimate the ungapped k-mer frequencies, by deriving an equation for the minimum norm estimate of k-mer frequencies given an observed set of gapped k-mer frequencies. We demonstrate that this approach provides a more accurate estimate of the k-mer frequencies in real biological sequences using a sample of CTCF binding sites in the human genome.

spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.

PubMed

Dellicour, Simon; Mardulyn, Patrick

2014-05-01

SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, spads was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and r functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to multiple pie charts maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. © 2013 John Wiley & Sons Ltd.
Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

PubMed

Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

2018-06-01

In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs) with 25 of them being represented by a single strain indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study signifies that the multilocus sequence typing is a valuable tool to access the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor the evolutionary changes.
Genetic diversity assessment of anoxygenic photosynthetic bacteria by distance-based grouping analysis of pufM sequences.

PubMed

Zeng, Y H; Chen, X H; Jiao, N Z

2007-12-01

To assess how completely the diversity of anoxygenic phototrophic bacteria (APB) was sampled in natural environments. All nucleotide sequences of the APB marker gene pufM from cultures and environmental clones were retrieved from the GenBank database. A set of cutoff values (sequence distances 0.06, 0.15 and 0.48 for species, genus, and (sub)phylum levels, respectively) was established using a distance-based grouping program. Analysis of the environmental clones revealed that current efforts on APB isolation and sampling in natural environments are largely inadequate. Analysis of the average distance between each identified genus and an uncultured environmental pufM sequence indicated that the majority of cultured APB genera lack environmental representatives. The distance-based grouping method is fast and efficient for bulk functional gene sequences analysis. The results clearly show that we are at a relatively early stage in sampling the global richness of APB species. Periodical assessment will undoubtedly facilitate in-depth analysis of potential biogeographical distribution pattern of APB. This is the first attempt to assess the present understanding of APB diversity in natural environments. The method used is also useful for assessing the diversity of other functional genes.
Genetic diversity and geographic differentiation in the threatened species Dysosma pleiantha in China as revealed by ISSR analysis.

PubMed

Zong, Min; Liu, Hai-Long; Qiu, Ying-Xiong; Yang, Shu-Zhen; Zhao, Ming-Shui; Fu, Cheng-Xin

2008-04-01

Dysosma pleiantha, an important threatened medicinal plant species, is restricted in distribution to southeastern China. The species is capable of reproducing both sexually and asexually. In this study, inter-simple sequence repeat marker data were obtained and analyzed with respect to genetic variation and genetic structure. The extent of clonality, together with the clonal and sexual reproductive strategies, varied among sites, and the populations under harsh ecological conditions tended to have large clones with relatively low clonal diversity caused by vegetative reproduction. The ramets sharing the same genotype show a clumped distribution. Across all populations surveyed, average within-population diversity was remarkably low (e.g., 0.111 for Nei's gene diversity), with populations from the nature reserves maintaining relatively high amounts of genetic diversity. Among all populations, high genetic differentiation (AMOVA: Phi(ST) = 0.500; Nei's genetic diversity: G (ST) = 0.465, Bayesian analysis: Phi(B) = 0.436) was detected, together with an isolation-by-distance pattern. Low seedling recruitment due to inbreeding, restricted gene flow, and genetic drift are proposed as determinant factors responsible for the low genetic diversity and high genetic differentiation observed.
Deep Sequencing of the Trypanosoma cruzi GP63 Surface Proteases Reveals Diversity and Diversifying Selection among Chronic and Congenital Chagas Disease Patients

PubMed Central

Llewellyn, Martin S.; Messenger, Louisa A.; Luquetti, Alejandro O.; Garcia, Lineth; Torrico, Faustino; Tavares, Suelene B. N.; Cheaib, Bachar; Derome, Nicolas; Delepine, Marc; Baulard, Céline; Deleuze, Jean-Francois; Sauer, Sascha; Miles, Michael A.

2015-01-01

Background Chagas disease results from infection with the diploid protozoan parasite Trypanosoma cruzi. T. cruzi is highly genetically diverse, and multiclonal infections in individual hosts are common, but little studied. In this study, we explore T. cruzi infection multiclonality in the context of age, sex and clinical profile among a cohort of chronic patients, as well as paired congenital cases from Cochabamba, Bolivia and Goias, Brazil using amplicon deep sequencing technology. Methodology/ Principal Findings A 450bp fragment of the trypomastigote TcGP63I surface protease gene was amplified and sequenced across 70 chronic and 22 congenital cases on the Illumina MiSeq platform. In addition, a second, mitochondrial target—ND5—was sequenced across the same cohort of cases. Several million reads were generated, and sequencing read depths were normalized within patient cohorts (Goias chronic, n = 43, Goias congenital n = 2, Bolivia chronic, n = 27; Bolivia congenital, n = 20), Among chronic cases, analyses of variance indicated no clear correlation between intra-host sequence diversity and age, sex or symptoms, while principal coordinate analyses showed no clustering by symptoms between patients. Between congenital pairs, we found evidence for the transmission of multiple sequence types from mother to infant, as well as widespread instances of novel genotypes in infants. Finally, non-synonymous to synonymous (dn:ds) nucleotide substitution ratios among sequences of TcGP63Ia and TcGP63Ib subfamilies within each cohort provided powerful evidence of strong diversifying selection at this locus. Conclusions/Significance Our results shed light on the diversity of parasite DTUs within each patient, as well as the extent to which parasite strains pass between mother and foetus in congenital cases. Although we were unable to find any evidence that parasite diversity accumulates with age in our study cohorts, putative diversifying selection within members of the TcGP63I gene family suggests a link between genetic diversity within this gene family and survival in the mammalian host. PMID:25849488
Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing

PubMed Central

2013-01-01

Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing.

PubMed

Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D

2013-03-07

Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
The Intestinal Eukaryotic and Bacterial Biome of Spotted Hyenas: The Impact of Social Status and Age on Diversity and Composition.

PubMed

Heitlinger, Emanuel; Ferreira, Susana C M; Thierer, Dagmar; Hofer, Heribert; East, Marion L

2017-01-01

In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena ( Crocuta crocuta ), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes.
The Intestinal Eukaryotic and Bacterial Biome of Spotted Hyenas: The Impact of Social Status and Age on Diversity and Composition

PubMed Central

Heitlinger, Emanuel; Ferreira, Susana C. M.; Thierer, Dagmar; Hofer, Heribert; East, Marion L.

2017-01-01

In mammals, two factors likely to affect the diversity and composition of intestinal bacteria (bacterial microbiome) and eukaryotes (eukaryome) are social status and age. In species in which social status determines access to resources, socially dominant animals maintain better immune processes and health status than subordinates. As high species diversity is an index of ecosystem health, the intestinal biome of healthier, socially dominant animals should be more diverse than those of subordinates. Gradual colonization of the juvenile intestine after birth predicts lower intestinal biome diversity in juveniles than adults. We tested these predictions on the effect of: (1) age (juvenile/adult) and (2) social status (low/high) on bacterial microbiome and eukaryome diversity and composition in the spotted hyena (Crocuta crocuta), a highly social, female-dominated carnivore in which social status determines access to resources. We comprehensively screened feces from 35 individually known adult females and 7 juveniles in the Serengeti ecosystem for bacteria and eukaryotes, using a set of 48 different amplicons (4 for bacterial 16S, 44 for eukaryote 18S) in a multi-amplicon sequencing approach. We compared sequence abundances to classical coprological egg or oocyst counts. For all parasite taxa detected in more than six samples, the number of sequence reads significantly predicted the number of eggs or oocysts counted, underscoring the value of an amplicon sequencing approach for quantitative measurements of parasite load. In line with our predictions, our results revealed a significantly less diverse microbiome in juveniles than adults and a significantly higher diversity of eukaryotes in high-ranking than low-ranking animals. We propose that free-ranging wildlife can provide an intriguing model system to assess the adaptive value of intestinal biome diversity for both bacteria and eukaryotes. PMID:28670573
Measuring the diversity of the human microbiota with targeted next-generation sequencing.

PubMed

Finotello, Francesca; Mastrorilli, Eleonora; Di Camillo, Barbara

2016-12-26

The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Genome-wide genotyping-by-sequencing data provide a high-resolution view of wild Helianthus diversity, genetic structure, and interspecies gene flow.

PubMed

Baute, Gregory J; Owens, Gregory L; Bock, Dan G; Rieseberg, Loren H

2016-12-01

Wild sunflowers harbor considerable genetic diversity and are a major resource for improvement of the cultivated sunflower, Helianthus annuus. The Helianthus genus is also well known for its propensity for gene flow between taxa. We surveyed genomic diversity of 292 samples of wild Helianthus from 22 taxa that are cross-compatible with the cultivar using genotyping by sequencing. With these data, we derived a high-resolution phylogeny of the taxa, interrogated genome-wide levels of diversity, explored H. annuus population structure, and identified localized gene flow between H. annuus and its close relatives. Our phylogenomic analyses confirmed a number of previously established interspecific relationships and indicated for the first time that a newly described annual sunflower, H. winteri, is nested within H. annuus. Principal component analyses showed that H. annuus has geographic population structure with most notable subpopulations occurring in California and Texas. While gene flow was identified between H. annuus and H. bolanderi in California and between H. annuus and H. argophyllus in Texas, this genetic exchange does not appear to drive observed patterns of H. annuus population structure. Wild H. annuus remains an excellent resource for cultivated sunflower breeding effort because of its diversity and the ease with which it can be crossed with cultivated H. annuus. Cases of interspecific gene flow such as those documented here also indicate wild H. annuus can act as a bridge to capture alleles from other wild taxa; continued breeding efforts with it may therefore reap the largest rewards. © 2016 Botanical Society of America.
Multi-locus and long amplicon sequencing approach to study microbial diversity at species level using the MinION™ portable nanopore sequencer

PubMed Central

Sanz, Yolanda

2017-01-01

Abstract The miniaturized and portable DNA sequencer MinION™ has demonstrated great potential in different analyses such as genome-wide sequencing, pathogen outbreak detection and surveillance, human genome variability, and microbial diversity. In this study, we tested the ability of the MinION™ platform to perform long amplicon sequencing in order to design new approaches to study microbial diversity using a multi-locus approach. After compiling a robust database by parsing and extracting the rrn bacterial region from more than 67000 complete or draft bacterial genomes, we demonstrated that the data obtained during sequencing of the long amplicon in the MinION™ device using R9 and R9.4 chemistries were sufficient to study 2 mock microbial communities in a multiplex manner and to almost completely reconstruct the microbial diversity contained in the HM782D and D6305 mock communities. Although nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, we presented a novel approach consisting of multi-locus and long amplicon sequencing using the MinION™ MkIb DNA sequencer and R9 and R9.4 chemistries that help to overcome the main disadvantage of this portable sequencing platform. Furthermore, the nanopore sequencing library, constructed with the last releases of pore chemistry (R9.4) and sequencing kit (SQK-LSK108), permitted the retrieval of the higher level of 1D read accuracy sufficient to characterize the microbial species present in each mock community analysed. Improvements in nanopore chemistry, such as minimizing base-calling errors and new library protocols able to produce rapid 1D libraries, will provide more reliable information in the near future. Such data will be useful for more comprehensive and faster specific detection of microbial species and strains in complex ecosystems. PMID:28605506
Complete sequence and diversity of a maize-associated Polerovirus in East Africa

USDA-ARS?s Scientific Manuscript database

Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae,...
The complete genome sequences of 65 Campylobacter jejuni and C. coli strains

USDA-ARS?s Scientific Manuscript database

Campylobacter jejuni (Cj) and C. coli (Cc) are genetically highly diverse based on various molecular methods including MLST, microarray-based comparisons and the whole genome sequences of a few strains. Cj and Cc diversity is also exhibited by variable capsular polysaccharides (CPS) that are the maj...
Maize HapMap2 identifies extant variation from a genome in flux

USDA-ARS?s Scientific Manuscript database

The maize genome is the largest, most diverse and complex plant genome sequenced to date. Using high-throughput sequencing to access genetic variation and a population genetics model to score the polymorphisms, we characterize and unite the diversity of the world’s key breeding germplasm, wild rela...
Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

USDA-ARS?s Scientific Manuscript database

Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...
Genetic diversity and demographic instability in Riftia pachyptila tubeworms from eastern Pacific hydrothermal vents

PubMed Central

2011-01-01

Background Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galápagos Rift. Results Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. PMID:21489281
Genetic diversity and demographic instability in Riftia pachyptila tubeworms from eastern Pacific hydrothermal vents

USGS Publications Warehouse

Coykendall, D.K.; Johnson, S.B.; Karl, S.A.; Lutz, R.A.; Vrijenhoek, R.C.

2011-01-01

Background: Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, Riftia pachyptila, a widely distributed constituent of vents along the East Pacific Rise and Galpagos Rift. Results: Genetic differentiation (FST) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically. Conclusions: Compared to other vent species, DNA sequence diversity is extremely low in R. pachyptila. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events. ?? 2011 Coykendall et al; licensee BioMed Central Ltd.
MHC class I diversity in chimpanzees and bonobos.

PubMed

Maibach, Vincent; Hans, Jörg B; Hvilsom, Christina; Marques-Bonet, Tomas; Vigilant, Linda

2017-10-01

Major histocompatibility complex (MHC) class I genes are critically involved in the defense against intracellular pathogens. MHC diversity comparisons among samples of closely related taxa may reveal traces of past or ongoing selective processes. The bonobo and chimpanzee are the closest living evolutionary relatives of humans and last shared a common ancestor some 1 mya. However, little is known concerning MHC class I diversity in bonobos or in central chimpanzees, the most numerous and genetically diverse chimpanzee subspecies. Here, we used a long-read sequencing technology (PacBio) to sequence the classical MHC class I genes A, B, C, and A-like in 20 and 30 wild-born bonobos and chimpanzees, respectively, with a main focus on central chimpanzees to assess and compare diversity in those two species. We describe in total 21 and 42 novel coding region sequences for the two species, respectively. In addition, we found evidence for a reduced MHC class I diversity in bonobos as compared to central chimpanzees as well as to western chimpanzees and humans. The reduced bonobo MHC class I diversity may be the result of a selective process in their evolutionary past since their split from chimpanzees.
First Molecular Identification and Phylogeny of Moroccan Anopheles sergentii (Diptera: Culicidae) Based on Second Internal Transcribed Spencer (ITS2) and Cytochrome c Oxidase I (COI) Sequences.

PubMed

Benabdelkrim Filali, Oumama; Kabine, Mostafa; El Hamouchi, Adil; Lemrani, Meryem; Debboun, Mustapha; Sarih, M'hammed

2018-06-05

Anopheles sergentii known as the "oasis vector" or the "desert malaria vector" is considered the main vector of malaria in the southern parts of Morocco. Its presence in Morocco is confirmed for the first time through sequencing of mitochondrial DNA (mDNA) cytochrome c oxidase subunit I (COI) barcodes and nuclear ribosomal DNA (rDNA) second internal transcribed spacer (ITS2) sequences and direct comparison with specimens of A. sergentii of other countries. The DNA barcodes (n = 39) obtained from A. sergentii collected in 2015 and 2016 showed more diversity with 10 haplotypes, compared with 3 haplotypes obtained from ITS2 sequences (n = 59). Moreover, the comparison using the ITS2 sequences showed closer evolutionary relationship between the Moroccan and Egyptian strains than the Iranian strain. Nevertheless, genetic differences due to geographical segregation were also observed. This study provides the first report on the sequence of rDNA-ITS2 and mtDNA COI, which could be used to better understand the biodiversity of A. sergentii.

High-accuracy identification of incident HIV-1 infections using a sequence clustering based diversity measure.

PubMed

Xia, Xia-Yu; Ge, Meng; Hsi, Jenny H; He, Xiang; Ruan, Yu-Hua; Wang, Zhi-Xin; Shao, Yi-Ming; Pan, Xian-Ming

2014-01-01

Accurate estimates of HIV-1 incidence are essential for monitoring epidemic trends and evaluating intervention efforts. However, the long asymptomatic stage of HIV-1 infection makes it difficult to effectively distinguish incident infections from chronic ones. Current incidence assays based on serology or viral sequence diversity are both still lacking in accuracy. In the present work, a sequence clustering based diversity (SCBD) assay was devised by utilizing the fact that viral sequences derived from each transmitted/founder (T/F) strain tend to cluster together at early stage, and that only the intra-cluster diversity is correlated with the time since HIV-1 infection. The dot-matrix pairwise alignment was used to eliminate the disproportional impact of insertion/deletions (indels) and recombination events, and so was the proportion of clusterable sequences (Pc) as an index to identify late chronic infections with declined viral genetic diversity. Tested on a dataset containing 398 incident and 163 chronic infection cases collected from the Los Alamos HIV database (last modified 2/8/2012), our SCBD method achieved 99.5% sensitivity and 98.8% specificity, with an overall accuracy of 99.3%. Further analysis and evaluation also suggested its performance was not affected by host factors such as the viral subtypes and transmission routes. The SCBD method demonstrated the potential of sequencing based techniques to become useful for identifying incident infections. Its use may be most advantageous for settings with low to moderate incidence relative to available resources. The online service is available at http://www.bioinfo.tsinghua.edu.cn:8080/SCBD/index.jsp.
Genetic Diversity of Sheep Breeds from Albania, Greece, and Italy Assessed by Mitochondrial DNA and Nuclear Polymorphisms (SNPs)

PubMed Central

Pariset, Lorraine; Mariotti, Marco; Gargani, Maria; Joost, Stephane; Negrini, Riccardo; Perez, Trinidad; Bruford, Michael; Ajmone Marsan, Paolo; Valentini, Alessio

2011-01-01

We employed mtDNA and nuclear SNPs to investigate the genetic diversity of sheep breeds of three countries of the Mediterranean basin: Albania, Greece, and Italy. In total, 154 unique mtDNA haplotypes were detected by means of D-loop sequence analysis. The major nucleotide diversity was observed in Albania. We identified haplogroups, A, B, and C in Albanian and Greek samples, while Italian individuals clustered in groups A and B. In general, the data show a pattern reflecting old migrations that occurred in postneolithic and historical times. PCA analysis on SNP data differentiated breeds with good correspondence to geographical locations. This could reflect geographical isolation, selection operated by local sheep farmers, and different flock management and breed admixture that occurred in the last centuries. PMID:22125424
Decreased plant productivity resulting from plant group removal experiment constrains soil microbial functional diversity.

PubMed

Zhang, Ximei; Johnston, Eric R; Barberán, Albert; Ren, Yi; Lü, Xiaotao; Han, Xingguo

2017-10-01

Anthropogenic environmental changes are accelerating the rate of biodiversity loss on Earth. Plant diversity loss is predicted to reduce soil microbial diversity primarily due to the decreased variety of carbon/energy resources. However, this intuitive hypothesis is supported by sparse empirical evidence, and most underlying mechanisms remain underexplored or obscure altogether. We constructed four diversity gradients (0-3) in a five-year plant functional group removal experiment in a steppe ecosystem in Inner Mongolia, China, and quantified microbial taxonomic and functional diversity with shotgun metagenome sequencing. The treatments had little effect on microbial taxonomic diversity, but were found to decrease functional gene diversity. However, the observed decrease in functional gene diversity was more attributable to a loss in plant productivity, rather than to the loss of any individual plant functional group per se. Reduced productivity limited fresh plant resources supplied to microorganisms, and thus, intensified the pressure of ecological filtering, favoring genes responsible for energy production/conversion, material transport/metabolism and amino acid recycling, and accordingly disfavored many genes with other functions. Furthermore, microbial respiration was correlated with the variation in functional composition but not taxonomic composition. Overall, the amount of carbon/energy resources driving microbial gene diversity was identified to be the critical linkage between above- and belowground communities, contrary to the traditional framework of linking plant clade/taxonomic diversity to microbial taxonomic diversity. © 2017 John Wiley & Sons Ltd.
Using metabarcoding to reveal and quantify plant-pollinator interactions

PubMed Central

Pornon, André; Escaravage, Nathalie; Burrus, Monique; Holota, Hélène; Khimoun, Aurélie; Mariette, Jérome; Pellizzari, Charlène; Iribar, Amaia; Etienne, Roselyne; Taberlet, Pierre; Vidal, Marie; Winterton, Peter; Zinger, Lucie; Andalo, Christophe

2016-01-01

Given the ongoing decline of both pollinators and plants, it is crucial to implement effective methods to describe complex pollination networks across time and space in a comprehensive and high-throughput way. Here we tested if metabarcoding may circumvent the limits of conventional methodologies in detecting and quantifying plant-pollinator interactions. Metabarcoding experiments on pollen DNA mixtures described a positive relationship between the amounts of DNA from focal species and the number of trnL and ITS1 sequences yielded. The study of pollen loads of insects captured in plant communities revealed that as compared to the observation of visits, metabarcoding revealed 2.5 times more plant species involved in plant-pollinator interactions. We further observed a tight positive relationship between the pollen-carrying capacities of insect taxa and the number of trnL and ITS1 sequences. The number of visits received per plant species also positively correlated to the number of their ITS1 and trnL sequences in insect pollen loads. By revealing interactions hard to observe otherwise, metabarcoding significantly enlarges the spatiotemporal observation window of pollination interactions. By providing new qualitative and quantitative information, metabarcoding holds great promise for investigating diverse facets of interactions and will provide a new perception of pollination networks as a whole. PMID:27255732
[Current applications of high-throughput DNA sequencing technology in antibody drug research].

PubMed

Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong

2012-03-01

Since the publication of a high-throughput DNA sequencing technology based on PCR reaction was carried out in oil emulsions in 2005, high-throughput DNA sequencing platforms have been evolved to a robust technology in sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation of discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we present a review of current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach of discovery and development of antibody drugs.
Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer).

PubMed

Chelomina, Galina N; Rozhkovan, Konstantin V; Voronova, Anastasia N; Burundukova, Olga L; Muzarok, Tamara I; Zhuravlev, Yuri N

2016-04-01

Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, at artificial plant cultivation. The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more expressed at the positions 440-640 bp, and distributed in variable regions, expansion segments, and conservative elements of core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. This study identified the evidences of the intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.
Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer)

PubMed Central

Chelomina, Galina N.; Rozhkovan, Konstantin V.; Voronova, Anastasia N.; Burundukova, Olga L.; Muzarok, Tamara I.; Zhuravlev, Yuri N.

2015-01-01

Background Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, at artificial plant cultivation. Methods The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more expressed at the positions 440–640 bp, and distributed in variable regions, expansion segments, and conservative elements of core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion This study identified the evidences of the intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine. PMID:27158239
High-throughput sequencing of the chloroplast and mitochondrion of Chlamydomonas reinhardtii to generate improved de novo assemblies, analyze expression patterns and transcript speciation, and evaluate diversity among laboratory strains and wild isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gallaher, Sean D.; Fitz-Gibbon, Sorel T.; Strenkert, Daniela

Chlamydomonas reinhardtii is a unicellular chlorophyte alga that is widely studied as a reference organism for understanding photosynthesis, sensory and motile cilia, and for development of an algal-based platform for producing biofuels and bio-products. Its highly repetitive, ~205-kbp circular chloroplast genome and ~15.8-kbp linear mitochondrial genome were sequenced prior to the advent of high-throughput sequencing technologies. Here, high coverage shotgun sequencing was used to assemble both organellar genomes de novo. These new genomes correct dozens of errors in the prior genome sequences and annotations. Gen-ome sequencing coverage indicates that each cell contains on average 83 copies of the chloroplast genomemore » and 130 copies of the mitochondrial genome. Using protocols and analyses optimized for organellar tran-scripts, RNA-Seq was used to quantify their relative abundances across 12 different growth conditions. Forty-six percent of total cellular mRNA is attributable to high expression from a few dozen chloroplast genes. RNA-Seq data were used to guide gene annotation, to demonstrate polycistronic gene expression, and to quantify splicing of psaA and psbA introns. In contrast to a conclusion from a recent study, we found that chloroplast transcripts are not edited. Unexpectedly, cytosine-rich polynucleotide tails were observed at the 3’-end of all mitochondrial transcripts. A comparative genomics analysis of eight laboratory strains and 11 wild isolates of C. reinhardtii identified 2658 variants in the organellargenomes, which is 1/10th as much genetic diversity as is found in the nucleus.« less
Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach

NASA Astrophysics Data System (ADS)

Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan

2013-02-01

Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediments samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in cDNA dataset indicate the limits of our database and lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompt its use in future high-throughput sequencing-based environmental surveys.
Infection rate and genetic diversity of Giardia duodenalis in pet and stray dogs in Henan Province, China.

PubMed

Qi, Meng; Dong, Haiju; Wang, Rongjun; Li, Junqiang; Zhao, Jinfeng; Zhang, Longxian; Luo, Jianxun

2016-04-01

Giardia duodenalis is an important protozoan parasite that is known to be zoonotic. To assess the potential zoonotic transmission of giardiasis from dogs and to identify genetic diversity of G. duodenalis in dog populations, we examined the infection rate and genotypes of G. duodenalis in both pet dogs (from pet dog farms, pet shops, pet hospitals, pet markets) and stray dogs of different ages in Henan Province, China. A total of 940 fresh fecal specimens were collected from 2007 to 2013 in Henan Province. The overall infection rate of G. duodenalis was 14.3% (134/940) as determined by microscopy, with the highest infection rate (17.3%) observed in dogs from shelters. Young dogs were more likely to be infected with G. duodenalis than adult dogs, and G. duodenalis cysts were found more frequently in diarrheic dogs. All G. duodenalis-positive isolates were characterized at the triose phosphate isomerase (tpi), glutamate dehydrogenase (gdh), and β-giardin (bg) loci, and 37, 51, and 48 sequences were obtained, respectively. The dog-specific assemblages C and D were identified using multi-locus sequence analysis. Six novel sequences of the tpi locus, one novel sequence of the gdh locus and two novel sequences of the bg locus were detected among the G. duodenalis assemblage C isolates, while two novel sequences of the gdh locus were found among the G. duodenalis assemblage D isolates. Our data indicate that G. duodenalis is a common parasite and cause of diarrheal disease in dogs in Henan Province. However, there was no evidence for zoonotic G. duodenalis assemblages in the study population. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Cultivation of Hard-To-Culture Subsurface Mercury-Resistant Bacteria and Discovery of New merA Gene Sequences▿

PubMed Central

Rasmussen, L. D.; Zawadsky, C.; Binnerup, S. J.; Øregaard, G.; Sørensen, S. J.; Kroer, N.

2008-01-01

Mercury-resistant bacteria may be important players in mercury biogeochemistry. To assess the potential for mercury reduction by two subsurface microbial communities, resistant subpopulations and their merA genes were characterized by a combined molecular and cultivation-dependent approach. The cultivation method simulated natural conditions by using polycarbonate membranes as a growth support and a nonsterile soil slurry as a culture medium. Resistant bacteria were pregrown to microcolony-forming units (mCFU) before being plated on standard medium. Compared to direct plating, culturability was increased up to 2,800 times and numbers of mCFU were similar to the total number of mercury-resistant bacteria in the soils. Denaturing gradient gel electrophoresis analysis of DNA extracted from membranes suggested stimulation of growth of hard-to-culture bacteria during the preincubation. A total of 25 different 16S rRNA gene sequences were observed, including Alpha-, Beta-, and Gammaproteobacteria; Actinobacteria; Firmicutes; and Bacteroidetes. The diversity of isolates obtained by direct plating included eight different 16S rRNA gene sequences (Alpha- and Betaproteobacteria and Actinobacteria). Partial sequencing of merA of selected isolates led to the discovery of new merA sequences. With phylum-specific merA primers, PCR products were obtained for Alpha- and Betaproteobacteria and Actinobacteria but not for Bacteroidetes and Firmicutes. The similarity to known sequences ranged between 89 and 95%. One of the sequences did not result in a match in the BLAST search. The results illustrate the power of integrating advanced cultivation methodology with molecular techniques for the characterization of the diversity of mercury-resistant populations and assessing the potential for mercury reduction in contaminated environments. PMID:18441111
Construction of a scFv Library with Synthetic, Non-combinatorial CDR Diversity.

PubMed

Bai, Xuelian; Shim, Hyunbo

2017-01-01

Many large synthetic antibody libraries have been designed, constructed, and successfully generated high-quality antibodies suitable for various demanding applications. While synthetic antibody libraries have many advantages such as optimized framework sequences and a broader sequence landscape than natural antibodies, their sequence diversities typically are generated by random combinatorial synthetic processes which cause the incorporation of many undesired CDR sequences. Here, we describe the construction of a synthetic scFv library using oligonucleotide mixtures that contain predefined, non-combinatorially synthesized CDR sequences. Each CDR is first inserted to a master scFv framework sequence and the resulting single-CDR libraries are subjected to a round of proofread panning. The proofread CDR sequences are assembled to produce the final scFv library with six diversified CDRs.
Salinity affects compositional traits of epibacterial communities on the brown macroalga Fucus vesiculosus.

PubMed

Stratil, Stephanie B; Neulinger, Sven C; Knecht, Henrik; Friedrichs, Anette K; Wahl, Martin

2014-05-01

Epibiotic biofilms have the potential to control major aspects of the biology and ecology of their hosts. Their composition and function may thus be essential for the health of the host. We tested the influence of salinity on the composition of epibacterial communities associated with the brown macroalga Fucus vesiculosus. Algal individuals were incubated at three salinities (5, 19, and 25) for 14 days and nonliving reference substrata (stones) were included in the experiment. Subsequently, the composition of their surface-associated bacterial communities was analyzed by 454 pyrosequencing of 16S rRNA gene sequences. Redundancy analysis revealed that the composition of epiphytic and epilithic communities significantly differed and were both affected by salinity. We found that 5% of 2494 epiphytic operational taxonomic units at 97% sequence similarity were responsible for the observed shifts. Epibacterial α-diversity was significantly lower at salinity 5 but did not differ between substrata. Our results indicate that salinity is an important factor in structuring alga-associated epibacterial communities with respect to composition and/or diversity. Whether direct or indirect mechanisms (via altered biotic interactions) may have been responsible for the observed shifts is discussed. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Salmonella enterica Prophage Sequence Profiles Reflect Genome Diversity and Can Be Used for High Discrimination Subtyping.

PubMed

Mottawea, Walid; Duceppe, Marc-Olivier; Dupras, Andrée A; Usongo, Valentine; Jeukens, Julie; Freschi, Luca; Emond-Rheault, Jean-Guillaume; Hamel, Jeremie; Kukavica-Ibrulj, Irena; Boyle, Brian; Gill, Alexander; Burnett, Elton; Franz, Eelco; Arya, Gitanjali; Weadge, Joel T; Gruenheid, Samantha; Wiedmann, Martin; Huang, Hongsheng; Daigle, France; Moineau, Sylvain; Bekal, Sadjia; Levesque, Roger C; Goodridge, Lawrence D; Ogunremi, Dele

2018-01-01

Non-typhoidal Salmonella is a leading cause of foodborne illness worldwide. Prompt and accurate identification of the sources of Salmonella responsible for disease outbreaks is crucial to minimize infections and eliminate ongoing sources of contamination. Current subtyping tools including single nucleotide polymorphism (SNP) typing may be inadequate, in some instances, to provide the required discrimination among epidemiologically unrelated Salmonella strains. Prophage genes represent the majority of the accessory genes in bacteria genomes and have potential to be used as high discrimination markers in Salmonella . In this study, the prophage sequence diversity in different Salmonella serovars and genetically related strains was investigated. Using whole genome sequences of 1,760 isolates of S. enterica representing 151 Salmonella serovars and 66 closely related bacteria, prophage sequences were identified from assembled contigs using PHASTER. We detected 154 different prophages in S. enterica genomes. Prophage sequences were highly variable among S. enterica serovars with a median ± interquartile range (IQR) of 5 ± 3 prophage regions per genome. While some prophage sequences were highly conserved among the strains of specific serovars, few regions were lineage specific. Therefore, strains belonging to each serovar could be clustered separately based on their prophage content. Analysis of S . Enteritidis isolates from seven outbreaks generated distinct prophage profiles for each outbreak. Taken altogether, the diversity of the prophage sequences correlates with genome diversity. Prophage repertoires provide an additional marker for differentiating S. enterica subtypes during foodborne outbreaks.
Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field.

PubMed

Crépeau, Valentin; Cambon Bonavita, Marie-Anne; Lesongeur, Françoise; Randrianalivelo, Henintsoa; Sarradin, Pierre-Marie; Sarrazin, Jozée; Godfroy, Anne

2011-06-01

Diversity and function in microbial mats from the Lucky Strike hydrothermal vent field (Mid-Atlantic Ridge) were investigated using molecular approaches. DNA and RNA were extracted from mat samples overlaying hydrothermal deposits and Bathymodiolus azoricus mussel assemblages. We constructed and analyzed libraries of 16S rRNA gene sequences and sequences of functional genes involved in autotrophic carbon fixation [forms I and II RuBisCO (cbbL/M), ATP-citrate lyase B (aclB)]; methane oxidation [particulate methane monooxygenase (pmoA)] and sulfur oxidation [adenosine-5'-phosphosulfate reductase (aprA) and soxB]. To gain new insights into the relationships between mats and mussels, we also used new domain-specific 16S rRNA gene primers targeting Bathymodiolus sp. symbionts. All identified archaeal sequences were affiliated with a single group: the marine group 1 Thaumarchaeota. In contrast, analyses of bacterial sequences revealed much higher diversity, although two phyla Proteobacteria and Bacteroidetes were largely dominant. The 16S rRNA gene sequence library revealed that species affiliated to Beggiatoa Gammaproteobacteria were the dominant active population. Analyses of DNA and RNA functional gene libraries revealed a diverse and active chemolithoautotrophic population. Most of these sequences were affiliated with Gammaproteobacteria, including hydrothermal fauna symbionts, Thiotrichales and Methylococcales. PCR and reverse transcription-PCR using 16S rRNA gene primers targeted to Bathymodiolus sp. symbionts revealed sequences affiliated with both methanotrophic and thiotrophic endosymbionts. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
PCR Primers to Study the Diversity of Expressed Fungal Genes Encoding Lignocellulolytic Enzymes in Soils Using High-Throughput Sequencing

PubMed Central

Barbi, Florian; Bragalini, Claudia; Vallon, Laurent; Prudent, Elsa; Dubost, Audrey; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia

2014-01-01

Plant biomass degradation in soil is one of the key steps of carbon cycling in terrestrial ecosystems. Fungal saprotrophic communities play an essential role in this process by producing hydrolytic enzymes active on the main components of plant organic matter. Open questions in this field regard the diversity of the species involved, the major biochemical pathways implicated and how these are affected by external factors such as litter quality or climate changes. This can be tackled by environmental genomic approaches involving the systematic sequencing of key enzyme-coding gene families using soil-extracted RNA as material. Such an approach necessitates the design and evaluation of gene family-specific PCR primers producing sequence fragments compatible with high-throughput sequencing approaches. In the present study, we developed and evaluated PCR primers for the specific amplification of fungal CAZy Glycoside Hydrolase gene families GH5 (subfamily 5) and GH11 encoding endo-β-1,4-glucanases and endo-β-1,4-xylanases respectively as well as Basidiomycota class II peroxidases, corresponding to the CAZy Auxiliary Activity family 2 (AA2), active on lignin. These primers were experimentally validated using DNA extracted from a wide range of Ascomycota and Basidiomycota species including 27 with sequenced genomes. Along with the published primers for Glycoside Hydrolase GH7 encoding enzymes active on cellulose, the newly design primers were shown to be compatible with the Illumina MiSeq sequencing technology. Sequences obtained from RNA extracted from beech or spruce forest soils showed a high diversity and were uniformly distributed in gene trees featuring the global diversity of these gene families. This high-throughput sequencing approach using several degenerate primers constitutes a robust method, which allows the simultaneous characterization of the diversity of different fungal transcripts involved in plant organic matter degradation and may lead to the discovery of complex patterns in gene expression of soil fungal communities. PMID:25545363
Temporal and Spatial Diversity of Bacterial Communities in Coastal Waters of the South China Sea

PubMed Central

Du, Jikun; Xiao, Kai; Li, Li; Ding, Xian; Liu, Helu; Lu, Yongjun; Zhou, Shining

2013-01-01

Bacteria are recognized as important drivers of biogeochemical processes in all aquatic ecosystems. Temporal and geographical patterns in ocean bacterial communities have been observed in many studies, but the temporal and spatial patterns in the bacterial communities from the South China Sea remained unexplored. To determine the spatiotemporal patterns, we generated 16S rRNA datasets for 15 samples collected from the five regularly distributed sites of the South China Sea in three seasons (spring, summer, winter). A total of 491 representative sequences were analyzed by MOTHUR, yielding 282 operational taxonomic units (OTUs) grouped at 97% stringency. Significant temporal variations of bacterial diversity were observed. Richness and diversity indices indicated that summer samples were the most diverse. The main bacterial group in spring and summer samples was Alphaproteobacteria, followed by Cyanobacteria and Gammaproteobacteria, whereas Cyanobacteria dominated the winter samples. Spatial patterns in the samples were observed that samples collected from the coastal (D151, D221) waters and offshore (D157, D1512, D224) waters clustered separately, the coastal samples harbored more diverse bacterial communities. However, the temporal pattern of the coastal site D151 was contrary to that of the coastal site D221. The LIBSHUFF statistics revealed noticeable differences among the spring, summer and winter libraries collected at five sites. The UPGMA tree showed there were temporal and spatial heterogeneity of bacterial community composition in coastal waters of the South China Sea. The water salinity (P=0.001) contributed significantly to the bacteria-environment relationship. Our results revealed that bacterial community structures were influenced by environmental factors and community-level changes in 16S-based diversity were better explained by spatial patterns than by temporal patterns. PMID:23785512
Pantoea ananatis Genetic Diversity Analysis Reveals Limited Genomic Diversity as Well as Accessory Genes Correlated with Onion Pathogenicity.

PubMed

Stice, Shaun P; Stumpf, Spencer D; Gitaitis, Ron D; Kvitko, Brian H; Dutta, Bhabesh

2018-01-01

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study.
Taxonomic and functional characteristics of microbial communities and their correlation with physicochemical properties of four geothermal springs in Odisha, India

PubMed Central

Badhai, Jhasketan; Ghosh, Tarini S.; Das, Subrata K.

2015-01-01

This study describes microbial diversity in four tropical hot springs representing moderately thermophilic environments (temperature range: 40–58°C; pH: 7.2–7.4) with discrete geochemistry. Metagenome sequence data showed a dominance of Bacteria over Archaea; the most abundant phyla were Chloroflexi and Proteobacteria, although other phyla were also present, such as Acetothermia, Nitrospirae, Acidobacteria, Firmicutes, Deinococcus-Thermus, Bacteroidetes, Thermotogae, Euryarchaeota, Verrucomicrobia, Ignavibacteriae, Cyanobacteria, Actinobacteria, Planctomycetes, Spirochaetes, Armatimonadetes, Crenarchaeota, and Aquificae. The distribution of major genera and their statistical correlation analyses with the physicochemical parameters predicted that the temperature, aqueous concentrations of ions (such as sodium, chloride, sulfate, and bicarbonate), total hardness, dissolved solids and conductivity were the main environmental variables influencing microbial community composition and diversity. Despite the observed high taxonomic diversity, there were only little variations in the overall functional profiles of the microbial communities in the four springs. Genes involved in the metabolism of carbohydrates and carbon fixation were the most abundant functional class of genes present in these hot springs. The distribution of genes involved in carbon fixation predicted the presence of all the six known autotrophic pathways in the metagenomes. A high prevalence of genes involved in membrane transport, signal transduction, stress response, bacterial chemotaxis, and flagellar assembly were observed along with genes involved in the pathways of xenobiotic degradation and metabolism. The analysis of the metagenomic sequences affiliated to the candidate phylum Acetothermia from spring TB-3 provided new insight into the metabolism and physiology of yet-unknown members of this lineage of bacteria. PMID:26579081
Pantoea ananatis Genetic Diversity Analysis Reveals Limited Genomic Diversity as Well as Accessory Genes Correlated with Onion Pathogenicity

PubMed Central

Stice, Shaun P.; Stumpf, Spencer D.; Gitaitis, Ron D.; Kvitko, Brian H.; Dutta, Bhabesh

2018-01-01

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study. PMID:29491851

Taxonomic and functional characteristics of microbial communities and their correlation with physicochemical properties of four geothermal springs in Odisha, India.

PubMed

Badhai, Jhasketan; Ghosh, Tarini S; Das, Subrata K

2015-01-01

This study describes microbial diversity in four tropical hot springs representing moderately thermophilic environments (temperature range: 40-58°C; pH: 7.2-7.4) with discrete geochemistry. Metagenome sequence data showed a dominance of Bacteria over Archaea; the most abundant phyla were Chloroflexi and Proteobacteria, although other phyla were also present, such as Acetothermia, Nitrospirae, Acidobacteria, Firmicutes, Deinococcus-Thermus, Bacteroidetes, Thermotogae, Euryarchaeota, Verrucomicrobia, Ignavibacteriae, Cyanobacteria, Actinobacteria, Planctomycetes, Spirochaetes, Armatimonadetes, Crenarchaeota, and Aquificae. The distribution of major genera and their statistical correlation analyses with the physicochemical parameters predicted that the temperature, aqueous concentrations of ions (such as sodium, chloride, sulfate, and bicarbonate), total hardness, dissolved solids and conductivity were the main environmental variables influencing microbial community composition and diversity. Despite the observed high taxonomic diversity, there were only little variations in the overall functional profiles of the microbial communities in the four springs. Genes involved in the metabolism of carbohydrates and carbon fixation were the most abundant functional class of genes present in these hot springs. The distribution of genes involved in carbon fixation predicted the presence of all the six known autotrophic pathways in the metagenomes. A high prevalence of genes involved in membrane transport, signal transduction, stress response, bacterial chemotaxis, and flagellar assembly were observed along with genes involved in the pathways of xenobiotic degradation and metabolism. The analysis of the metagenomic sequences affiliated to the candidate phylum Acetothermia from spring TB-3 provided new insight into the metabolism and physiology of yet-unknown members of this lineage of bacteria.
Investigation of Microbial Diversity in Geothermal Hot Springs in Unkeshwar, India, Based on 16S rRNA Amplicon Metagenome Sequencing

PubMed Central

Mehetre, Gajanan T.; Paranjpe, Aditi; Dastager, Syed G.

2016-01-01

Microbial diversity in geothermal waters of the Unkeshwar hot springs in Maharashtra, India, was studied using 16S rRNA amplicon metagenomic sequencing. Taxonomic analysis revealed the presence of Bacteroidetes, Proteobacteria, Cyanobacteria, Actinobacteria, Archeae, and OD1 phyla. Metabolic function prediction analysis indicated a battery of biological information systems indicating rich and novel microbial diversity, with potential biotechnological applications in this niche. PMID:26950332
Genomic Diversity and Evolution of the Lyssaviruses

PubMed Central

Delmas, Olivier; Holmes, Edward C.; Talbi, Chiraz; Larrous, Florence; Dacheux, Laurent; Bouchier, Christiane; Bourhy, Hervé

2008-01-01

Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses. PMID:18446239
Novel chytrid lineages dominate fungal sequences in diverse marine and freshwater habitats

NASA Astrophysics Data System (ADS)

Comeau, André M.; Vincent, Warwick F.; Bernier, Louis; Lovejoy, Connie

2016-07-01

In aquatic environments, fungal communities remain little studied despite their taxonomic and functional diversity. To extend the ecological coverage of this group, we conducted an in-depth analysis of fungal sequences within our collection of 3.6 million V4 18S rRNA pyrosequences originating from 319 individual marine (including sea-ice) and freshwater samples from libraries generated within diverse projects studying Arctic and temperate biomes in the past decade. Among the ~1.7 million post-filtered reads of highest taxonomic and phylogenetic quality, 23,263 fungal sequences were identified. The overall mean proportion was 1.35%, but with large variability; for example, from 0.01 to 59% of total sequences for Arctic seawater samples. Almost all sample types were dominated by Chytridiomycota-like sequences, followed by moderate-to-minor contributions of Ascomycota, Cryptomycota and Basidiomycota. Species and/or strain richness was high, with many novel sequences and high niche separation. The affinity of the most common reads to phytoplankton parasites suggests that aquatic fungi deserve renewed attention for their role in algal succession and carbon cycling.
Heterosis: Many Genes, Many Mechanisms—End the Search for an Undiscovered Unifying Theory

DOE PAGES

Kaeppler, Shawn

2012-01-01

Heterosis is the increase in vigor that is observed in progenies of matings of diverse individuals from different species, isolated populations, or selected strains within species or populations. Heterosis has been of immense economic value in agriculture and has important implications regarding the fitness and fecundity of individuals in natural populations. Genetic models based on complementation of deleterious alleles, especially in the context of linkage and epistasis, are consistent with many observed manifestations of heterosis. The search for the genes and alleles that underlie heterosis, as well as for broader allele-independent, genomewide mechanisms, has encompassed many species and systems. Commonmore » themes across these studies indicate that sequence diversity is necessary but not sufficient to produce heterotic phenotypes, and that the molecular pathways that produce heterosis involve chromatin modification, transcriptional control, translation and protein processing, and interactions between and within developmental and biochemical pathways. Taken together, there are many and diverse molecular mechanisms that translate DNA into phenotype, and it is the combination of all these mechanisms across many genes that produce heterosis in complex traits.« less
Universal Influenza B Virus Genomic Amplification Facilitates Sequencing, Diagnostics, and Reverse Genetics

PubMed Central

Zhou, Bin; Lin, Xudong; Wang, Wei; Halpin, Rebecca A.; Bera, Jayati; Stockwell, Timothy B.; Barr, Ian G.

2014-01-01

Although human influenza B virus (IBV) is a significant human pathogen, its great genetic diversity has limited our ability to universally amplify the entire genome for subsequent sequencing or vaccine production. The generation of sequence data via next-generation approaches and the rapid cloning of viral genes are critical for basic research, diagnostics, antiviral drugs, and vaccines to combat IBV. To overcome the difficulty of amplifying the diverse and ever-changing IBV genome, we developed and optimized techniques that amplify the complete segmented negative-sense RNA genome from any IBV strain in a single tube/well (IBV genomic amplification [IBV-GA]). Amplicons for >1,000 diverse IBV genomes from different sample types (e.g., clinical specimens) were generated and sequenced using this robust technology. These approaches are sensitive, robust, and sequence independent (i.e., universally amplify past, present, and future IBVs), which facilitates next-generation sequencing and advanced genomic diagnostics. Importantly, special terminal sequences engineered into the optimized IBV-GA2 products also enable ligation-free cloning to rapidly generate reverse-genetics plasmids, which can be used for the rescue of recombinant viruses and/or the creation of vaccine seed stock. PMID:24501036
A model of human motor sequence learning explains facilitation and interference effects based on spike-timing dependent plasticity.

PubMed

Wang, Quan; Rothkopf, Constantin A; Triesch, Jochen

2017-08-01

The ability to learn sequential behaviors is a fundamental property of our brains. Yet a long stream of studies including recent experiments investigating motor sequence learning in adult human subjects have produced a number of puzzling and seemingly contradictory results. In particular, when subjects have to learn multiple action sequences, learning is sometimes impaired by proactive and retroactive interference effects. In other situations, however, learning is accelerated as reflected in facilitation and transfer effects. At present it is unclear what the underlying neural mechanism are that give rise to these diverse findings. Here we show that a recently developed recurrent neural network model readily reproduces this diverse set of findings. The self-organizing recurrent neural network (SORN) model is a network of recurrently connected threshold units that combines a simplified form of spike-timing dependent plasticity (STDP) with homeostatic plasticity mechanisms ensuring network stability, namely intrinsic plasticity (IP) and synaptic normalization (SN). When trained on sequence learning tasks modeled after recent experiments we find that it reproduces the full range of interference, facilitation, and transfer effects. We show how these effects are rooted in the network's changing internal representation of the different sequences across learning and how they depend on an interaction of training schedule and task similarity. Furthermore, since learning in the model is based on fundamental neuronal plasticity mechanisms, the model reveals how these plasticity mechanisms are ultimately responsible for the network's sequence learning abilities. In particular, we find that all three plasticity mechanisms are essential for the network to learn effective internal models of the different training sequences. This ability to form effective internal models is also the basis for the observed interference and facilitation effects. This suggests that STDP, IP, and SN may be the driving forces behind our ability to learn complex action sequences.
Analysis of host preference and geographical distribution of Anastrepha suspensa (Diptera: Tephritidae) using phylogenetic analyses of mitochondrial cytochrome oxidase I DNA sequence data.

PubMed

Boykin, L M; Shatters, R G; Hall, D G; Burns, R E; Franqui, R A

2006-10-01

Anastrepha suspensa (Loew) is an economically important pest, restricted to the Greater Antilles and southern Florida. It infests a wide variety of hosts and is of quarantine importance in citrus, a multi-million dollar industry in Florida. The observed recent increase in citrus infested with A. suspensa in Florida has raised questions regarding host-specificity of certain populations and genetic diversity of the pest throughout its geographical distribution. Cytochrome oxidase I (COI) DNA sequence data was used to characterize the genetic diversity of A. suspensa from Florida and Caribbean populations reared from different host plants. Maximum likelihood and Bayesian phylogenetic methods were used to analyse COI data. Sequence variation among mitochondrial COI genes from 107 A. suspensa samples collected throughout Florida and the Caribbean ranged between 0 and 10% and placed all A. suspensa as a monophyletic group that united all A. suspensa in a clade sister to a Central American group of the A. fraterculus paraphyletic species complex. The most likely tree of the COI locus indicated that COI sequence variation was too low to provide resolution at the subspecies level, therefore monophyletic groups based on host-plant use, geography (Florida, Jamaica, Cayman Islands, Puerto Rico or Dominican Republic) or population sampled are not supported. This result indicates that either no population segregation has occurred based on these biological or geographical distinctions and that this is a generalist, polyphagous invasive genotype. Alternatively, if populations are distinct, the segregation event was more recent than can be distinguished based on COI sequence variation.
A global view of structure–function relationships in the tautomerase superfamily

PubMed Central

Davidson, Rebecca; Baas, Bert-Jan; Akiva, Eyal; Holliday, Gemma L.; Polacco, Benjamin J.; LeVieux, Jake A.; Pullara, Collin R.; Zhang, Yan Jessie; Whitman, Christian P.

2018-01-01

The tautomerase superfamily (TSF) consists of more than 11,000 nonredundant sequences present throughout the biosphere. Characterized members have attracted much attention because of the unusual and key catalytic role of an N-terminal proline. These few characterized members catalyze a diverse range of chemical reactions, but the full scale of their chemical capabilities and biological functions remains unknown. To gain new insight into TSF structure–function relationships, we performed a global analysis of similarities across the entire superfamily and computed a sequence similarity network to guide classification into distinct subgroups. Our results indicate that TSF members are found in all domains of life, with most being present in bacteria. The eukaryotic members of the cis-3-chloroacrylic acid dehalogenase subgroup are limited to fungal species, whereas the macrophage migration inhibitory factor subgroup has wide eukaryotic representation (including mammals). Unexpectedly, we found that 346 TSF sequences lack Pro-1, of which 85% are present in the malonate semialdehyde decarboxylase subgroup. The computed network also enabled the identification of similarity paths, namely sequences that link functionally diverse subgroups and exhibit transitional structural features that may help explain reaction divergence. A structure-guided comparison of these linker proteins identified conserved transitions between them, and kinetic analysis paralleled these observations. Phylogenetic reconstruction of the linker set was consistent with these findings. Our results also suggest that contemporary TSF members may have evolved from a short 4-oxalocrotonate tautomerase–like ancestor followed by gene duplication and fusion. Our new linker-guided strategy can be used to enrich the discovery of sequence/structure/function transitions in other enzyme superfamilies. PMID:29184004
Evolution of meiotic recombination genes in maize and teosinte.

PubMed

Sidhu, Gaganpreet K; Warzecha, Tomasz; Pawlowski, Wojciech P

2017-01-25

Meiotic recombination is a major source of genetic variation in eukaryotes. The role of recombination in evolution is recognized but little is known about how evolutionary forces affect the recombination pathway itself. Although the recombination pathway is fundamentally conserved across different species, genetic variation in recombination components and outcomes has been observed. Theoretical predictions and empirical studies suggest that changes in the recombination pathway are likely to provide adaptive abilities to populations experiencing directional or strong selection pressures, such as those occurring during species domestication. We hypothesized that adaptive changes in recombination may be associated with adaptive evolution patterns of genes involved in meiotic recombination. To examine how maize evolution and domestication affected meiotic recombination genes, we studied patterns of sequence polymorphism and divergence in eleven genes controlling key steps in the meiotic recombination pathway in a diverse set of maize inbred lines and several accessions of teosinte, the wild ancestor of maize. We discovered that, even though the recombination genes generally exhibited high sequence conservation expected in a pathway controlling a key cellular process, they showed substantial levels and diverse patterns of sequence polymorphism. Among others, we found differences in sequence polymorphism patterns between tropical and temperate maize germplasms. Several recombination genes displayed patterns of polymorphism indicative of adaptive evolution. Despite their ancient origin and overall sequence conservation, meiotic recombination genes can exhibit extensive and complex patterns of molecular evolution. Changes in these genes could affect the functioning of the recombination pathway, and may have contributed to the successful domestication of maize and its expansion to new cultivation areas.
Contrasting patterns of selection between MHC I and II across populations of Humboldt and Magellanic penguins.

PubMed

Sallaberry-Pincheira, Nicole; González-Acuña, Daniel; Padilla, Pamela; Dantas, Gisele P M; Luna-Jorquera, Guillermo; Frere, Esteban; Valdés-Velásquez, Armando; Vianna, Juliana A

2016-10-01

The evolutionary and adaptive potential of populations or species facing an emerging infectious disease depends on their genetic diversity in genes, such as the major histocompatibility complex (MHC). In birds, MHC class I deals predominantly with intracellular infections (e.g., viruses) and MHC class II with extracellular infections (e.g., bacteria). Therefore, patterns of MHC I and II diversity may differ between species and across populations of species depending on the relative effect of local and global environmental selective pressures, genetic drift, and gene flow. We hypothesize that high gene flow among populations of Humboldt and Magellanic penguins limits local adaptation in MHC I and MHC II, and signatures of selection differ between markers, locations, and species. We evaluated the MHC I and II diversity using 454 next-generation sequencing of 100 Humboldt and 75 Magellanic penguins from seven different breeding colonies. Higher genetic diversity was observed in MHC I than MHC II for both species, explained by more than one MHC I loci identified. Large population sizes, high gene flow, and/or similar selection pressures maintain diversity but limit local adaptation in MHC I. A pattern of isolation by distance was observed for MHC II for Humboldt penguin suggesting local adaptation, mainly on the northernmost studied locality. Furthermore, trans-species alleles were found due to a recent speciation for the genus or convergent evolution. High MHC I and MHC II gene diversity described is extremely advantageous for the long-term survival of the species.
Contrasting epidemic histories reveal pathogen-mediated balancing selection on class II MHC diversity in a wild songbird.

PubMed

Hawley, Dana M; Fleischer, Robert C

2012-01-01

The extent to which pathogens maintain the extraordinary polymorphism at vertebrate Major Histocompatibility Complex (MHC) genes via balancing selection has intrigued evolutionary biologists for over half a century, but direct tests remain challenging. Here we examine whether a well-characterized epidemic of Mycoplasmal conjunctivitis resulted in balancing selection on class II MHC in a wild songbird host, the house finch (Carpodacus mexicanus). First, we confirmed the potential for pathogen-mediated balancing selection by experimentally demonstrating that house finches with intermediate to high multi-locus MHC diversity are more resistant to challenge with Mycoplasma gallisepticum. Second, we documented sequence and diversity-based signatures of pathogen-mediated balancing selection at class II MHC in exposed host populations that were absent in unexposed, control populations across an equivalent time period. Multi-locus MHC diversity significantly increased in exposed host populations following the epidemic despite initial compromised diversity levels from a recent introduction bottleneck in the exposed host range. We did not observe equivalent changes in allelic diversity or heterozygosity across eight neutral microsatellite loci, suggesting that the observations reflect selection rather than neutral demographic processes. Our results indicate that a virulent pathogen can exert sufficient balancing selection on class II MHC to rescue compromised levels of genetic variation for host resistance in a recently bottlenecked population. These results provide evidence for Haldane's long-standing hypothesis that pathogens directly contribute to the maintenance of the tremendous levels of genetic variation detected in natural populations of vertebrates.
Genetic diversity based on 28S rDNA sequences among populations of Culex quinquefasciatus collected at different locations in Tamil Nadu, India.

PubMed

Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S

2015-09-01

The basis of the present study was to distinguish the existence of any genetic variability among populations of Culex quinquefasciatus which would be a valuable tool in the management of mosquito control programmes. In the present study, population of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito population collected at different locations is considered an important issue towards their influence and nature of vector potential of these mosquitoes.
Exploring the environmental diversity of kinetoplastid flagellates in the high-throughput DNA sequencing era

PubMed Central

d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius

2015-01-01

The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872
Assessing the genetic diversity of Cu resistance in mine tailings through high-throughput recovery of full-length copA genes

PubMed Central

Li, Xiaofang; Zhu, Yong-Guan; Shaban, Babak; Bruxner, Timothy J. C.; Bond, Philip L.; Huang, Longbin

2015-01-01

Characterizing the genetic diversity of microbial copper (Cu) resistance at the community level remains challenging, mainly due to the polymorphism of the core functional gene copA. In this study, a local BLASTN method using a copA database built in this study was developed to recover full-length putative copA sequences from an assembled tailings metagenome; these sequences were then screened for potentially functioning CopA using conserved metal-binding motifs, inferred by evolutionary trace analysis of CopA sequences from known Cu resistant microorganisms. In total, 99 putative copA sequences were recovered from the tailings metagenome, out of which 70 were found with high potential to be functioning in Cu resistance. Phylogenetic analysis of selected copA sequences detected in the tailings metagenome showed that topology of the copA phylogeny is largely congruent with that of the 16S-based phylogeny of the tailings microbial community obtained in our previous study, indicating that the development of copA diversity in the tailings might be mainly through vertical descent with few lateral gene transfer events. The method established here can be used to explore copA (and potentially other metal resistance genes) diversity in any metagenome and has the potential to exhaust the full-length gene sequences for downstream analyses. PMID:26286020
Deciphering amphibian diversity through DNA barcoding: chances and challenges.

PubMed

Vences, Miguel; Thomas, Meike; Bonett, Ronald M; Vieites, David R

2005-10-29

Amphibians globally are in decline, yet there is still a tremendous amount of unrecognized diversity, calling for an acceleration of taxonomic exploration. This process will be greatly facilitated by a DNA barcoding system; however, the mitochondrial population structure of many amphibian species presents numerous challenges to such a standardized, single locus, approach. Here we analyse intra- and interspecific patterns of mitochondrial variation in two distantly related groups of amphibians, mantellid frogs and salamanders, to determine the promise of DNA barcoding with cytochrome oxidase subunit I (cox1) sequences in this taxon. High intraspecific cox1 divergences of 7-14% were observed (18% in one case) within the whole set of amphibian sequences analysed. These high values are not caused by particularly high substitution rates of this gene but by generally deep mitochondrial divergences within and among amphibian species. Despite these high divergences, cox1 sequences were able to correctly identify species including disparate geographic variants. The main problems with cox1 barcoding of amphibians are (i) the high variability of priming sites that hinder the application of universal primers to all species and (ii) the observed distinct overlap of intraspecific and interspecific divergence values, which implies difficulties in the definition of threshold values to identify candidate species. Common discordances between geographical signatures of mitochondrial and nuclear markers in amphibians indicate that a single-locus approach can be problematic when high accuracy of DNA barcoding is required. We suggest that a number of mitochondrial and nuclear genes may be used as DNA barcoding markers to complement cox1.
Microbial diversity in raw milk and traditional fermented dairy products (Hurood cheese and Jueke) from Inner Mongolia, China.

PubMed

Gao, M L; Hou, H M; Teng, X X; Zhu, Y L; Hao, H S; Zhang, G L

2017-03-08

Hurood cheese (HC) and Jueke (Jk) are 2 traditional fermented dairy products produced from raw milk (RM) in the Inner Mongolia region of China. They have a long history of production and consumption. The microbial compositions of RM, HC, and Jk vary greatly, and are influenced by their geographical origins and unique processing methods. In this study, 2 batches of RM, HC, and Jk samples were collected (April and August 2015) from the Zhenglan Banner, a region located in the southern part of Inner Mongolian belonging to the Xilingol league prefecture. The bacterial and fungal diversities of the samples were determined by 16S rRNA and 18S rRNA gene sequence analysis, respectively. A total of 112 bacterial and 30 fungal sequences were identified, with Firmicutes and Ascomycota being the predominant phyla for bacteria and fungi, respectively. Lactococcus and Lactobacillus were identified as the main bacterial genera, whereas Kluyveromyces was the predominant fungus identified in the 3 dairy products. Different bacterial and fungal compositions were observed in RM, HC, and Jk samples collected at different times. These results suggested that time of production may be an important factor influencing the microbial diversity present in RM, HC, and Jk.
Relative Abundance and Diversity of Bacterial Methanotrophs at the Oxic–Anoxic Interface of the Congo Deep-Sea Fan

PubMed Central

Bessette, Sandrine; Moalic, Yann; Gautey, Sébastien; Lesongeur, Françoise; Godfroy, Anne; Toffin, Laurent

2017-01-01

Sitting at ∼5,000 m water depth on the Congo-Angola margin and ∼760 km offshore of the West African coast, the recent lobe complex of the Congo deep-sea fan receives large amounts of fluvial sediments (3–5% organic carbon). This organic-rich sedimentation area harbors habitats with chemosynthetic communities similar to those of cold seeps. In this study, we investigated relative abundance, diversity and distribution of aerobic methane-oxidizing bacteria (MOB) communities at the oxic–anoxic interface of sedimentary habitats by using fluorescence in situ hybridization and comparative sequence analysis of particulate mono-oxygenase (pmoA) genes. Our findings revealed that sedimentary habitats of the recent lobe complex hosted type I and type II MOB cells and comparisons of pmoA community compositions showed variations among the different organic-rich habitats. Furthermore, the pmoA lineages were taxonomically more diverse compared to methane seep environments and were related to those found at cold seeps. Surprisingly, MOB phylogenetic lineages typical of terrestrial environments were observed at such water depth. In contrast, MOB cells or pmoA sequences were not detected at the previous lobe complex that is disconnected from the Congo River inputs. PMID:28487684
High-level diversity of tailed phages, eukaryote-associated viruses, and virophage-like elements in the metaviromes of antarctic soils.

PubMed

Zablocki, Olivier; van Zyl, Lonnie; Adriaenssens, Evelien M; Rubagotti, Enrico; Tuffin, Marla; Cary, Stephen Craig; Cowan, Don

2014-11-01

The metaviromes of two distinct Antarctic hyperarid desert soil communities have been characterized. Hypolithic communities, cyanobacterium-dominated assemblages situated on the ventral surfaces of quartz pebbles embedded in the desert pavement, showed higher virus diversity than surface soils, which correlated with previous bacterial community studies. Prokaryotic viruses (i.e., phages) represented the largest viral component (particularly Mycobacterium phages) in both habitats, with an identical hierarchical sequence abundance of families of tailed phages (Siphoviridae > Myoviridae > Podoviridae). No archaeal viruses were found. Unexpectedly, cyanophages were poorly represented in both metaviromes and were phylogenetically distant from currently characterized cyanophages. Putative phage genomes were assembled and showed a high level of unaffiliated genes, mostly from hypolithic viruses. Moreover, unusual gene arrangements in which eukaryotic and prokaryotic virus-derived genes were found within identical genome segments were observed. Phycodnaviridae and Mimiviridae viruses were the second-most-abundant taxa and more numerous within open soil. Novel virophage-like sequences (within the Sputnik clade) were identified. These findings highlight high-level virus diversity and novel species discovery potential within Antarctic hyperarid soils and may serve as a starting point for future studies targeting specific viral groups. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Widespread distribution and a new recombinant species of Brazilian virus associated with cotton blue disease

PubMed Central

Silva, TF; Corrêa, RL; Castilho, Y; Silvie, P; Bélot, J-L; Vaslin, MFS

2008-01-01

Background Cotton blue disease (CBD), an important global cotton crop pathology responsible for major economic losses, is prevalent in the major cotton-producing states of Brazil. Typical CBD symptoms include stunting due to internodal shortening, leaf rolling, intense green foliage, and yellowing veins. Atypical CBD symptoms, including reddish and withered leaves, were also observed in Brazilian cotton fields in 2007. Recently, a Polerovirus named Cotton leafroll dwarf virus (CLRDV) was shown to be associated with CBD. Results To understand the distribution and genetic diversity of CLRDV in Brazil, we analyzed 23 CBD-symptomatic plants from susceptible cotton varieties originating from five of the six most important cotton-growing states, from 2004–2007. Here, we report on CLRDV diversity in plants with typical or atypical CBD symptoms by comparing viral coat protein, RNA polymerase (RdRp), and intergenic region genomic sequences. Conclusion The virus had a widespread distribution with a low genetic diversity; however, three divergent isolates were associated with atypical CBD symptoms. These divergent isolates had a CLRDV-related coat protein but a distinct RdRp sequence, and probably arose from recombination events. Based on the taxonomic rules for the family Luteoviridae, we propose that these three isolates represent isolates of a new species in the genus Polerovirus. PMID:18937850

Some links on this page may take you to non-federal websites. Their policies may differ from this site.